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NOVEL GENES, COMPOSITIONS, KITS, AND METHODS FOR 
IDENTIFICATION, ASSESSMENT, PREVENTION, 
AND THERAPY OF BREAST CANCER 

5 RELATED* APPLICATIONS 

The present application claims priority to U.S. provisional application serial no. 
60/176,077, filed January 14, 2000, U.S. provisional application serial no. 60/189,167, 
filed March 14, 2000, U.S. provisional application serial no. 60/192,099, filed March 24, 
2000, U.S. provisional application serial no. 60/193,480, filed March 29, 2000, U.S. 
10 provisional application serial no. 60/205,230, filed May 15, 2000, U.S. provisional 

application serial no. 60/21 1,315, filed June 9, 2000, U.S. provisional application serial 
no. 60/220,534, filed July 25, 2000, all of which are expressly incorporated by reference. 

FIELD OF THE INVENTION 
1 5 The field of the invention is breast cancer, including diagnosis, characterization, 

management, and therapy of breast cancer. 

BACKGROUND OF THE INVENTION 
The increased number of cancer cases reported in the United States, and, indeed, 

20. around the world, is a major concern. Currently there are only a handful of treatments 
available for specific types of cancer, and these provide no absolute guarantee of 
success. In order to be most effective, these treatments require not only an early 
detection of the malignancy, but a reliable assessment of the severity of the malignancy. 
The incidence of breast cancer, a leading cause of death in women, has been 

25 gradually increasing in the United States over the last thirty years. In 1997, it was 
estimated that 181,000 new cases were reported in the U.S., and that 44,000 people 
would die of breast cancer (Parker et al, 1997, CA Cancer J. Clin. 47:5-27; Chu et al, 
1996, J. Nat. Cancer Inst. 88:1571-1579). While the pathogenesis of breast cancer is 
unclear, transformation of normal breast epithelium to a malignant phenotype may be 

30 the result of genetic factors, especially in women under 30 (Miki et al, 1 994, Science, 
266:66-71). The discovery and characterization of BRCA1 and BRCA2 has recently 
expanded our knowledge of genetic factors which can contribute to familial breast 
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cancer. Germ-line mutations within these two loci are associated with a 50 to 85% 

lifetime risk of breast and/or ovarian cancer (Casey 3 1997, Curr. Opin. Oncol 9:88-93; 

Marcus et al 9 1996, Cancer 77:697-709). However, it is likely that other, non-genetic 

factors also have a significant effect on the etiology of the disease. Regardless of its 
5 origin, breast cancer morbidity and mortality increases significantly if it is not detected 

early in its progression. Thus, considerable effort has focused on the early detection of 

cellular transformation and tumor formation in breast tissue. 

Currently, the principal manner of identifying breast cancer is through detection 

of the presence of dense tumorous tissue. This may be accomplished to varying degrees 
1 0 of effectiveness by direct examination of the outside of the breast, or through 

mammography or other X-ray imaging methods (Jatoi, 1999, Am. J. Surg. 177:518-524). 

The latter approach is not without considerable cost, however. Every time a 

mammogram is taken, the patient incurs a small risk of having a breast tumor induced 

by the ionizing properties of the radiation used during the test. In addition, the process is 
1 5 expensive and the subjective interpretations of a technician can lead to imprecision, e.g. , 

one study showed major clinical disagreements for .about one-third of a set of 

mammograms that were interpreted individually by a surveyed group of radiologists. 

Moreover, many women find that undergoing a mammogram is a painful experience. 

Accordingly, the National Cancer Institute has not recommended mammograms for 
20 women under fifty years of age, since this group is not as likely to develop breast 

cancers as aie older women. It is compelling to note, however, that while only about 

22% of breast cancers occur in women under fifty, data suggests that breast cancer is 

more aggressive in pre-menopausal women. 

It would therefore be beneficial to provide specific methods and reagents for the 
25 diagnosis , staging, prognosis, monitoring, and treatment of diseases associated with 

breast cancer, or to indicate a predisposition to such for preventative measures. 

SUMMARY OF THE INVENTION 
The invention relates to novel genes associated with breast cancer as well as 
30 methods of assessing whether a patient is afflicted with breast cancer. The methods of 
the present invention comprise the step of comparing the level of expression of a marker 
in a patient sample, wherein the marker is listed in Tables 1-6 and the normal level of 
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expression of the marker in a control, e.g., a sample from a patient without breast 
cancer. A significant difference between the level of expression of the marker in the 
patient sample and the normal level is an indication that the patient is afflicted with 
breast cancer. • Preferably, a protein corresponding to the marker is a secreted protein or 
5 is predicted to correspond to a secreted protein. Alternatively, the marker can 
correspond to a protein having an extracellular portion, to one which is normally 
expressed in breast tissue at a detectable level, or both. 

In one method, the marker(s) are preferably selected such that the positive 
predictive value of the method is at least about 10%. Also preferred are embodiments of 

1 0 the method wherein the marker is over- or under-expressed by at least two-fold in at 

least about 20% of stage 0 breast cancer patients, stage I breast cancer patients, stage IIA 
breast cancer patients, stage IIB breast cancer patients, stage IIIA breast cancer patients, 
stage IIIB breast cancer patients, stage IV breast cancer patients, grade I breast cancer 
patients, grade II breast cancer patients, grade III breast cancer patients, malignant breast 

1 5 cancer patients, ductal carcinoma breast cancer patients, and lobular carcinoma breast 
cancer patients. 

In one embodiment of the methods of the present invention, the patient sample is 
a breast tissue-associated body fluid. Such fluids include, for example, blood fluids, 
lymph and cystic fluids, as well as nipple aspirates. In another embodiment, the sample 
20 comprises cells obtained from the patient. In another embodiment, the patient sample is 
in vivo. 

In accordance with the methods of the present invention, the level of expression 
of the marker in a sample can be assessed, for example, by detecting the presence in the 
sample of: 

25 • a protein or a fragment of the protein corresponding to the marker (e.g. 

using a reagent, such as an antibody, an antibody derivative, or an antibody 
fragment, which binds specifically with the protein or a fragment of the 
protein) 

• a metabolite which is produced directly (i.e., catalyzed) or indirectly by a 
30 protein corresponding to the marker 

• a transcribed polynucleotide (e.g. an mRNA or a cDNA), or fragment 
thereof, having at least a portion with which the marker is substantially 
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homologous (e.g. by contacting a mixture of transcribed polynucleotides 
obtained from the sample with a substrate having one or more of the markers 
listed in Tables 1-6 fixed thereto at selected positions) 
• a transcribed polynucleotide or fragment thereof, wherein the 
5 polynucleotide anneals with the marker under stringent hybridization 

conditions. ' 

The methods of the present invention are particularly useful for patients with an 
identified breast mass or symptoms associated with breast cancer. The methods of the 
present invention can also be of particular use with patients having an enhanced risk of 
10 developing breast cancer (e.g. , patients having a familial history of breast cancer, 

patients identified as having a mutant oncogene, and patients at least about 50 years of 
age). The methods of the present invention may further be of particular use in 
monitoring the efficacy of treatment of a breast cancer patient (e.g. the efficacy of 
chemotherapy). 

15 The methods of the present invention may be performed using a plurality (e.g. 2, 

3, 5, or 10 or more) of markers. According to a method involving a plurality of markers, 
the level of expression in the sample of each of a plurality of markers independently 
selected from the markers listed in Tables 1-6 is compared with the normal level of 
expression of each of the plurality of markers in samples of the same type obtained from 

20 control humans not afflicted with breast cancer. A significantly enhanced level of 
expression of one or more of the markers listed in Tables 1-6 in the sample, relative to 
the corresponding normal levels, is an indication that the patient is afflicted with breast 
cancer. The markers of Tables 1-6 may a\so be used in combination with known breast 
cancer markers in the methods of the present invention. 

25 In a preferred method of assessing whether a patient is afflicted with breast cancer 

(e.g., new detection ("screening"), detection of recurrence, reflex testing), the method 
comprises comparing: 

a) the level of expression of a marker in a patient sample, wherein at least 
one marker is selected from the markers of Tables 1-6, and 

30 b) the normal level of expression of the marker in a control non-breast 

cancer sample. 
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A significant difference between the level of expression of the marker in the patient 
sample and the normal level is an indication that the patient is afflicted with breast 
cancer. 

The methods of the present invention further include a method of assessing the 
5 efficacy of a test compound for inhibiting breast cancer in a patient. This method 
comprises comparing: . 

a) expression of a marker in a first sample obtained from the patient and 
maintained in the presence of the test compound, wherein the marker is 
selected from the group consisting of the markers listed in Tables 1-6, 

10 and 

b) expression of the marker in a second sample obtained from the patient 
and maintained in the absence of the test compound. 

A significantly lower level of expression of the marker in the first sample, relative to the 
second sample, is an indication that the test compound is efficacious for inhibiting breast 
1 5 cancer in the patient. For example, the first and second samples can be portions of a 
single sample obtained from the patient or portions of pooled samples obtained from the 
patient. 

The invention further relates to a method of assessing the efficacy of a therapy for 
inhibiting breast cancer in a patient. This method comprises comparing: 
20 a) expression of a marker in a first sample obtained from the patient prior 

to providing at least a portion of the therapy to the patient, wherein the 
marker is selected from the group consisting of the markers listed in 
Tables 1-6, and 

b) expression of the marker in a second sample obtained from the patient 
25 following provision of the portion of the therapy. 

A significantly lower level of expression of the marker in the second sample, relative to 
the first sample, is an indication that the therapy is efficacious for inhibiting breast 
cancer in the patient. 

It will be appreciated that in these methods the "therapy" may be any therapy for 
30 treating breast cancer including, but not limited to, chemotherapy, radiation therapy and 
surgical removal of tissue, e.g., a breast tumor. Thus, the methods of the invention may 
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be used to evaluate a patient before, during and after therapy, for example, to evaluate 
the reduction in tumor burden. 

The present invention therefore further comprises a method for monitoring the 
progression of breast cancer in a patient, the method comprising: 
5 a) detecting in a patient sample at a first time point, the expression of a marker, 

wherein the marker is selected from the group consisting of the markers listed in Tables 
1-6; 

b) repeating step a) at a subsequent time point; and 

c) comparing the level of expression detected in steps a) and b), and therefrom 
10 monitoring the progression of breast cancer in the patient. 

The invention also includes a method of selecting a composition for inhibiting 
breast cancer in a patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
1 5 plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-6 in each of the 
aliquots; and 

d) selecting one of the test compositions which induces a lower level of 
expression of the marker in the aliquot containing that test composition, 

20 relative to other test compositions. 

In addition, the invention includes a method of inhibiting breast cancer in a 
patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the p'itient; 

b) separately maintaining aliquots of the sample in the presence of a 
25 plurality of test compositions; 

c) comparing expression of a marker listed in Tables i-6 in each of the 
aliquots; and 

d) administering to the patient at least one of the test compositions which 
induces a lower level of expression of the marker in the aliquot 

30 containing that test composition, relative to other test compositions. 
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The invention also includes a kit for assessing whether a patient is afflicted with 
breast cancer. This kit comprises reagents for assessing expression of a marker listed in 
Tables 1-6. 

In another aspect, the invention relates to a kit for assessing the suitability of each 
5 of a plurality of compounds for inhibiting breast cancer in a patient. The kit comprises a 
reagent for assessing expression of a marker listed in Tables 1-6, and may also comprise 
a plurality of compounds. 

In another aspect, the invention relates to a kit for assessing the presence of breast 
cancer cells. This kit comprises an antibody, wherein the antibody binds specifically 
10 with a protein corresponding to a marker listed in Tables 1-6. The kit may also 

comprise a plurality of antibodies, wherein the plurality binds specifically with a protein 
corresponding to a different marker which is also listed in Tables 1-6. 

The invention also includes a kit for assessing the presence of breast cancer cells, 
wherein the kit comprises a nucleic acid probe. The probe binds specifically with a 
1 5 transcribed polynucleotide corresponding to a marker listed in Tables 1 -6. The kit may 
also comprise a plurality of probes, wherein each of the probes binds specifically with a 
transcribed polynucleotide corresponding to a different marker listed in Tables 1-6. 

The invention further relates to a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether a patient is afflicted with breast 
20 cancer. The method comprises isolating a protein or protein fragment corresponding to 
a marker listed in Tables 1-6, immunizing a mammal using the isolated protein or 
protein fragment, isolating splenocytes from the immunized mammal, fusing the isolated 
splenocytes with an immortalized cell line to form hybridomas, and screening individual 
hybridomas for production of an antibody which specifically binds with the protein or 
25 protein fragment to isolate the hybridoma. The invention also includes an antibody 
produced by this method. 

The invention further includes a method of assessing the breast carcinogenic or 
irregular growth promoting potential of a test compound. This method comprises the 
steps of: 

30 a) maintaining separate aliquots of breast cells in the presence and 

absence of the test compound; and 

b) comparing expression of a marker in each of the aliquots. 
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The marker is selected from those listed in Tables 1-6. A significantly enhanced 
level of expression of the marker in the aliquot maintained in the presence of (ot 
exposed to) the test compound, relative to the aliquot maintained in the absence of the 
test compound, is an indication that the test compound possesses breast carcinogenic or 
5 irregular growth promoting potential. 

Additionally, the invention includes a kit for assessing the breast carcinogenic 
potential of a test compound. The kit comprises breast cells and a reagent for assessing 
expression of a marker in each of the aliquots. The marker is selected from those listed 
in Tables 1-6. 

1 0 The invention further includes a method of treating a patient afflicted with breast 

cancer, comprising providing to cells of the patient an antisense oligonucleotide 
complementary to a polynucleotide corresponding to a marker listed in Tables 1-6. 

The invention includes a method of inhibiting breast cancer in a patient at risk for 
developing breast cancer. This method comprises inhibiting expression or 

1 5 overexpression of a gene corresponding to a marker listed in Tables 1 -6. 

It will be appreciated that the methods and kits of the present invention may also 
include known cancer markers including known breast cancer markers. It will further be 
appreciated that the methods and kits may be used to identify cancers other than breast 
cancer: 

20 

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to newly discovered correlations between expression of 
certain markers and the cancerous state of breast cells. It has been discovered that the 
level of expression of individual markers and combinations of markers described herein 
25 correlates with the presence of breast cancer in a patient. Methods are provided for 
detecting the presence of breast cancer in a sample, the absence of breast cancer in a 
sample, the stage of breast cancer, and other characteristics of breast cancer that are 
relevant to prevention, diagnosis, characterization, and therapy of breast cancer in a 
patient. 

30 



WO 01/51628 PCT/USO 1/00798 

-9- 

Definitions 

As used herein, each of the following terms has the meaning associated with it in 
this section. 

The articles "a" and "an" are used herein to refer to one or to more than one (i.e. 
5 to at least one) of the grammatical object of the article. By way of example, "an 
element" means one element or more than one element. 

A "marker" is a naturally-occurring polymer corresponding to at least one of the 
novel nucleic acids listed in Tables 1-6. For example, markers include, without 
limitation, sense and anti-sense strands of genomic DNA (i.e. including any introns 
1 0 occurring therein), RNA generated by transcription of genomic DNA (/. e. prior to 
splicing), RNA generated by splicing of RNA transcribed from genomic DNA, and 
proteins generated by translation of spliced RNA (e.g. including proteins both before 
and after cleavage of normally cleaved regions such as transmembrane signal 
sequences). As used herein, "marker" may also include a cDNA made by reverse 
1 5 transcription of an RNA generated by transcription of genomic DNA (including spliced 
RNA). 

As used herein a "polynucleotide corresponds to" another (a first) polynucleotide 
if it is related to the first polynucleotide by any of the following relationships: 1) The 
second polynucleotide comprises the first polynucleotide and the second polynucleotide 

20 encodes a gene product. 2) The second polynucleotide is 5' or 3' to the first 
polynucleotide in cDNA, RNA, genomic DNA, or fragment of any of these 
polynucleotides. For example, a second polynucleotide may be fragment of a gene that 
includes the first and second polynucleotides. The first and second polynucleotides are 
related in that they are components of the gene coding for a gene product, such as a 

25 protein or antibody. However, it is not necessary that the second polynucleotide 
comprises or overlaps with the first polynucleotide to be encompassed within the 
definition of "corresponding to" as used herein. For example, the first polynucleotide 
may be a fragment of a 3' untranslated region of the second polynucleotide. The first 
and second polynucleotide may be fragments of a gene coding for a gene product. The 

30 second polynucleotide may be an exon of the gene while the first polynucleotide may be 
an intron of the gene. 3) The second polynucleotide is the complement of the first 
polynucleotide. 



WO 01/51628 PCT/US0 1/00798 

-10- 

The term "probe" refers to any molecule which is capable of selectively binding 
to a specifically intended target molecule, for example a marker of the invention. 
Probes can be either synthesized by one skilled in the art, or derived from appropriate 
biological preparations. For purposes of detection of the target molecule, probes may be 
5 specifically designed to be labeled, as described herein. Examples of molecules that can 
be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, 
and organic monomers. 

A "breast-associated" body fluid is a fluid which, when in the body of a patient, 
contacts or passes through breast cells or into which cells, nucleic acids or proteins shed 
10 from breast cells are capable of passing. Exemplary breast-associated body fluids 
include blood fluids, lymph, cystic fluid, urine and nipple aspirates. 

The "normal" level of expression of a marker is the level of expression of the 
marker in breast cells of a patient, e.g. a human, not afflicted with breast cancer. 

"Over-expression" and "under-expression" of a marker refer to expression of the 
1 5 marker of a patient at a greater or lesser level, respectively, than normal level of 
expression of the marker {e.g. at least two-fold greater or lesser level). 

. As used herein, the term "promoter/regulatory sequence" means a nucleic acid 
sequence which is required for expression of a gene product operably linked to the 
promoter/regulatory sequence. In some instances, this sequence may be the core 
20 promoter sequence and in other instances, this sequence may also include an enhancer 
sequence and other regulatory elements which are required for expression of the gene 
product. The promoter/regulatory sequence may, for example, be one which expresses 
the gene product in a tissue-specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when operably linked 
25 with a polynucleotide which encodes or specifies a gene product, causes the gene 

product to be produced in a living human cell under most or all physiological conditions 
of the cell. 

An "inducible" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
30 product to be produced in a living human cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 
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A "tissue-specific" promoter is a nucleotide sequence which, when operably 
linked with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only if the cell is a cell of the 
tissue type corresponding to the promoter. 
5 A "transcribed polynucleotide" is a polynucleotide (e.g. an RNA, a cDNA, or an 

analog of one of an RNA or cDNA) which is complementary to or homologous with all 
or a portion of a mature RNA made by transcription of a genomic DNA corresponding 
to a marker of the invention and normal post-transcriptional processing (e.g. splicing), if 
any, of the transcript. 

1 0 "Complementary" refers to the broad concept of sequence complementarity 

between regions of two nucleic acid strands or between two regions of the same nucleic 
acid strand. It is known that an adenine residue of a first nucleic acid region is capable 
of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic 
acid region which is antiparallel to the first region if the residue is thymine or uracil. 

1 5 Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of 
base pairing with a residue of a second nucleic acid strand which is antiparallel to the 
first strand if the residue is guanine. A first region of a nucleic acid is complementary to 
a second region of the same or a different nucleic acid if, when the two regions are 
arranged in an antiparallel fashion, at least one nucleotide residue of the first region is 

20 capable of base pairing with a residue of the second region. Preferably, the first region 
comprises a first portion and the second region comprises a second portion, whereby, 
when the first and second portions are arranged in an antiparallel fashion, at least about 
50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the 
nucleotide residues of the first portio n are capable of base pairing with nucleotide 

25 residues in the second portion. More preferably, all nucleotide residues of the first 
portion are capable of base pairing with nucleotide residues in the second portion. 

"Homologous" as used herein, refers to nucleotide sequence similarity between 
two regions of the same nucleic acid strand or between regions of two different nucleic 
acid strands. When a nucleotide residue position in both regions is occupied by the 

30 same nucleotide residue, then the regions are homologous at that position. A first region 
is homologous to a second region if at least one nucleotide residue position of each 
region is occupied by the same residue. Homology between two regions is expressed in 
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terms of the proportion of nucleotide residue positions of the two regions that are 
occupied by the same nucleotide residue. By way of example, a region having the 
nucleotide sequence S'-ATTGCC-S' and a region having the nucleotide sequence 5- 
TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 
5 and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 
the same nucleotide residue. 
1 0 A marker is "fixed" to a substrate if it is covalently or non-co valently associated 

with the substrate such the substrate can be rinsed with a fluid (e.g. standard saline 
citrate, pH 7.4) without a substantial fraction of the marker dissociating from the 
substrate. 

As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or 
1 5 DNA molecule having a nucleotide sequence that occurs in nature (e.g. encodes a 
natural protein). 

Expression of a marker in a patient is "significantly" higher or lower than the 
normal level of expression of a marker if the level of expression of the marker is greater 
or less, respectively, than the normal level by an amount greater than the standard error 

20 of the assay employed to assess expression, and preferably at least twice, and more 
preferably three, four, five or ten times that amount. Alternately, expression of the 
marker in the patient can be considered "significantly" higher or lower than the normal 
level of expression if the level of expression is at least about two, and preferably at least 
about three, four, or five times, higher or lower, respectively, than the normal level of 

25 expression of the marker. 

Breast cancer is "inhibited" if at least one symptom of the cancer is alleviated, 
terminated, slowed, or prevented. As used herein, breast cancer is also "inhibited" if 
recurrence or metastasis of the cancer is reduced, slowed, delayed, or prevented. 

A kit is any manufacture (e.g. a package or container) comprising at least one 

30 reagent, e.g. a probe, for specifically detecting a marker of the invention, the 

manufacture being promoted, distributed, or sold as a unit for performing the methods of 
the present invention. 
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Description 

The present invention is based, in part, on identification of novel markers which 
are expressed at a different level in breast cancer cells than they are in normal (i.e. non- 
' 5 cancerous) breast cells. The markers of the invention correspond to nucleic acid and 
polypeptide molecules which can be detected in one or both of normal and cancerous 
breast cells. The presence, absence, or level of expression of one or more of these 
markers in breast cells is herein correlated with the cancerous state of the tissue. The 
invention thus includes compositions, kits, and methods for assessing the cancerous state 
10 of breast cells (e.g. cells obtained from a human, cultured human cells, archived or 
preserved human cells and in vivo cells). 

The compositions, kits, and methods of the invention have the following uses, 
among others: 



15 



1) 

2) 
3) 
4) 
5) 



assessing whether a patient is afflicted with breast cancer; 

assessing the stage of breast cancer in a human patient; 

assessing the grade of breast cancer in a patient; 

assessing the benign or malignant nature of breast cancer in a patient; 

assessing the histological type of neoplasm (e.g. ductal, lobular, etc.) 



20 



6) 



associated with breast cancer in a patient; 
making an isolated hybridoma which produces an antibody useful for 



7) 
8) 



assessing whether a patient is afflicted with breast cancer; 
assessing the presence of breast cancer cells; 

assessing the efficacy of one or more test compounds for inhibiting breast 



25 



9) 



cancer in a patient; 
assessing the efficacy of a therapy for inhibiting breast cancer in a 



10) 
11) 



patient; 

monitoring the progression of breast cancer in a patient; 
selecting a composition or therapy for inhibiting breast cancer in a 



30 



12) 

13) 
14) 



patient; 

treating a patient afflicted with breast cancer; 

inhibiting breast cancer in a patient; 

assessing the carcinogenic potential of a test compound; and 
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15) inhibiting breast cancer in a patient at risk for developing breast 
cancer. 

The invention thus includes a method of assessing whether a patient is afflicted 
with breast cancer. This method comprises comparing the level of expression of a 
5 marker in a patient sample and the normal level of expression of the marker in a control, 
e.g., a non-breast cancer sample. A significant difference between the level of 
expression of the marker in the patient sample and the normal level is an indication that 
the patient is afflicted with breast cancer. The marker is selected from the group 
consisting of the markers listed in Tables 1-6. 

10 The polynucleotides set forth in Tables 1-6 represent previously unidentified 

nucleotide sequences. These nucleotide sequences were identified through subtracted 
library experiments described herein. In Tables 1 and 3, SEQ ID NOS 316-470, 793- 
890, 1255-1363, 2125-2454 and 3352-3626 are preferred and SEQ ID NOS 1-315, 676- 
792, 1056-1254, 1645-2124 and 2942-3351 are most preferred. In Tables 2 and 4, SEQ 

15 ID NOS: 1879-1959 are preferred and SEQ ID NOS: 1-1878 are most preferred. Also 
provided by this invention are polynucleotides that correspond to the polynucleotides of 
Tables 1-6. In one embodiment, these polynucleotides are obtained by identification of 
a larger fragment or full-length coding sequence of these polynucleotides. Gene 
delivery vehicles, host cells, compositions and databases (all describe herein) containing 

20 these polynucleotides are also provided by this invention. 

The invention also encompasses polynucleotides which differ from that of the 
polynucleotides described above, but which produce the same phenotypic effect, e.g. 
allelic variants, These altered, but pheno typically equivalent polynucleotides are 
referred to "equivalent nucleic acids." This invention also encompasses polynucleotides 

25 characterized by changes in non-coding regions that do not alter the polypeptide 

produced therefrom when compared to the polynucleotide herein. This invention further 
encompasses polynucleotides, which hybridize to the polynucleotides of the subject 
invention under conditions of moderate or high stringency. Alternatively, the 
polynucleotides are at least 85%, or at least 90%, or more preferably, greater or equal to 

30 95% identical as determined by a sequence alignment program when run under default 
parameters. 
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Any marker or combination of markers listed in Tables 1-6, as well as any known 
markers in combination with the markers set forth in Tables 1-6, may be used in the 
compositions, kits, and methods of the present invention. In general, it is preferable to 
use markers for which the difference between the level of expression of the marker in 
5 breast cancer cells and the level of expression of the same marker in normal breast cells 
is as great as possible. Although this difference can be as small as the limit of detection 
of the method for assessing expression of the marker, it is preferred that the difference 
be at least greater than the standard error of the assessment method, and preferably a 
difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-, 500-, 1000-fold 
10 or greater. 

It is recognized that certain markers correspond to proteins which are secreted 
from breast cells (i.e. one or both of normal and cancerous cells) to the extracellular 
space surrounding the cells. These markers are preferably used in certain embodiments 
of the compositions, kits, and methods of the invention, owing to the fact that the protein 

15 corresponding to each of these markers can be detected in an breast-associated body 
fluid sample, which may be more easily collected from a human patient than a tissue 
biopsy sample. In addition, preferred in vivo techniques for detection of a protein 
corresponding to a marker of the invention include introducing into a subject a labeled 
antibody directed against the protein. For example, the antibody can be labeled with a 

20 radioactive marker whose presence and location in a subject can be detected by standard 
imaging techniques. 

Although not every marker corresponding to a secreted protein is indicated as 
such herein, it is a simple matter for the skilled artisan to determine whether any 
particular marker corresponds to a secreted protein. In order to make this determination, 

25 the protein corresponding to a marker is expressed in a test cell (e.g. a cell of a breast 
cell line), extracellular fluid is collected, and the presence or absence of the protein in 
the extracellular fluid is assessed (e.g. using a labeled antibody which binds specifically 
with the protein). 

The following is an example of a method which can be used to detect secretion of 
30 a protein corresponding to a marker of the invention. About 8 x 10 5 293T cells are 
incubated at 37°C in wells containing growth medium (Dulbecco's modified Eagle's 
medium {DMEM} supplemented with 10% fetal bovine serum) under a 5% (v/v) CO , 
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95% air atmosphere to about 60-70% confluence. The cells are then transfected using a 
standard transfection mixture comprising 2 micrograms of DNA comprising an 
expression vector encoding the protein and 10 microliters of LipofectAMINE™ 
(GIBCO/BRL Catalog no. 18342-012) per well. The transfection mixture is maintained 
5 for about 5 hours, and then replaced with fresh growth medium and maintained in an air 
atmosphere. Each well is gently rinsed twice with DMEM which does not contain 
methionine or cysteine (DMEM-MC; ICN Catalog no. 16-424- 54). About 1 milliliter 
of DMEM-MC and about 50 microcuries of Trans- 35 S™ reagent (ICN Catalog no. 
51006) are added to each well. The wells are maintained under the 5% C0 7 atmosphere 
10 described above and incubated at 37°C for a selected period. Following incubation, 150 
microliters of conditioned medium is removed and centrifuged to remove floating cells 
and debris. The presence of the protein in the supernatant is an indication that the 
protein is secreted. 

Examples of breast-associated body fluids include blood fluids (e.g. whole blood, 

1 5 blood serum, blood having platelets removed therefrom, etc.) 5 lymph, ascitic fluid, cystic 
fluid, urine and nipple aspirates. In these embodiments, the level of expression of the 
marker can be assessed by assessing the amount (e.g. absolute amount or concentration) 
of the marker in a breast-associated body fluid obtained from a patient. The fluid can, of 
course, be subjected to a variety of well-known post-collection preparative and storage 

20 techniques (e.g. storage, freezing, ultrafiltration, concentration, evaporation, 
centrifugation, etc.) prior to assessing the amount of the marker in the fluid. 

Many breast-associated body fluids (i.e. usually excluding urine) can have breast 
cells therein, particularly when the breast cells are cancerous, and, more particularly, 
when the breast cancer is metastasizing. Thus, the compositions, kits, and methods of 

25 the invention can be used to detect expression of markers corresponding to proteins 
having at least one portion which is displayed on the surface of cells which express it. 
It is a simple matter for the skilled artisan to determine whether the protein 
corresponding to any particular marker comprises a cell-surface protein. For example, 
immunological methods may be used to detect such proteins on whole cells, or well 

30 known computer-based sequence analysis methods (e.g. the SIGNALP program; Nielsen 
et al, 1997, Protein Engineering 10:1-6) may be used to predict the presence of at least 
one extracellular domain (i.e. including both secreted proteins and proteins having at 
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least one cell-surface domain). Expression of a marker corresponding to a protein 
having at least one portion which is displayed on the surface of a cell which expresses it 
may be detected without necessarily lysing the cell (e.g. using a labeled antibody which 
binds specifically with a cell-surface domain of the protein). 
5 Expression of a marker of the invention may be assessed by any of a wide variety 

of well known methods for detecting expression of a transcribed molecule or protein. 
Non-limiting examples of such methods include immunological methods for detection 
of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification methods, 
protein function or activity assays, nucleic acid hybridization methods, nucleic acid 

1 0 reverse transcription methods, and nucleic acid amplification methods. 

In a preferred embodiment, expression of a marker is assessed using an antibody 
(e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme-labeled 
antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or with 
the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin} ), or an antibody 

15 fragment (e.g. a single-chain antibody, an isolated antibody hypervariable domain, etc.) 
which binds specifically with a protein or a fragment thereof, corresponding to the 
marker, such as the protein encoded by the open reading frame corresponding to the 
marker or such a protein which has undergone all or a portion of its normal post- 
translational modification. 

20 In another preferred embodiment, expression of a marker is assessed by preparing 

mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient sample, and by 
hybridizing the mRNA/cDNA with a reference polynucleotide which is a complement of 
a polynucleotide comprising the marker, and fragments thereof. cDNA can, optionally, 
be amplified using any of a variety of polymerase chain reaction methods prior to 

25 hybridization with the reference polynucleotide; preferably, it is not amplified. 

Expression of one or more markers can likewise be detected using quantitative PCR to 
assess the level of expression of the marker(s). Alternatively, any of the many known 
methods of detecting mutations or variants (e.g. single nucleotide polymorphisms, 
deletions, etc.) of a marker of the invention may be used to detect occurrence of a 

3 0 marker in a patient. 
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In a related embodiment, a mixture of transcribed polynucleotides obtained from 
the sample is contacted with a substrate having fixed thereto a polynucleotide 
complementary to or homologous with at least a portion (e.g. at least 7, 10, 15, 20, 25, 
30, 40, 50, 100, 500, or more nucleotide residues) of a marker of the invention. If 
5 polynucleotides complementary to or homologous with are differentially detectable on 
the substrate (e.g. detectable using different chromophores or fluorophores, or fixed to 
different selected positions), then the levels of expression of a plurality of markers can 
be assessed simultaneously using a single substrate (e.g. a "gene chip" microarray of 
polynucleotides fixed at selected positions). When a method of assessing marker 

10 expression is used which involves hybridization of one nucleic acid with another, it is 
preferred that the hybridization be performed under stringent hybridization conditions. 

Because the compositions, kits, and methods of the invention rely on detection of 
a difference in expression levels of one or more markers of the invention, it is preferable 
that the level of expression of the marker is significantly greater than the minimum 

1 5 detection limit of the method used to assess expression in at least one of normal breast 
cells and cancerous breast cells. 

It is understood that by routine screening of additional patient samples using one 
or more of the markers of the invention, it will be realized that certain of the markers are 
over- or under-expressed in cancers of various types, including specific breast cancers, 

20 as well as other cancers such as ovarian cancer, cervical cancer, etc. For example, it will 
be confirmed that some of the markers of the invention are over- or under-expressed in 
most (i.e. 50% or more) or substantially all (i.e. 80% or more) of breast cancer. 
Furthermore, it will be confirmed that certain of the markers of the invention are 
associated with breast cancer of various stages (i.e. stage 0, 1, II, II, and IV breast 

25 cancers, as well as subclassifications IIA, IIB, IIIA, and IIIB, using the F^GO Stage 
Grouping system for primary carcinoma of the breast; (see Breast, In: A merican Joint 
Committee on Cancer: AJCC Cancer Staging Manual. Lippincott-Raven Publishers, 5th 
ed., 1997, pp. 171-180), of various histologic subtypes (e.g. serous, mucinous, 
endometroid, and clear cell subtypes, as well as subclassifications and alternate 

30 classifications adenocarcinoma, papillary adenocarcinoma, papillary 

cystadenocarcinoma, surface papillary carcinoma, malignant adenofibroma, 
cystadenofibroma, adenocarcinoma, cystadenocarcinoma, adenoacanthoma, 
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endometrioid stromal sarcoma, mesodermal (Miillerian) mixed tumor, mesonephroid 
tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and 
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant 
breast tumors; Scully, Atlas of Tumor Pathology, 3d series, Washington DC), and 
5 various grades (i.e. grade I {well differentiated} , grade II {moderately well 

differentiated}, and grade III {poorly differentiated from surrounding normal tissue} )). 
In addition, as a greater number of patient samples are assessed for expression of the 
markers of the invention and the outcomes of the individual patients from whom the 
samples were obtained are correlated, it will also be confirmed that altered expression of 

1 0 certain of the markers of the invention are strongly correlated with malignant cancers 
and that altered expression of other markers of the invention are strongly correlated with 
benign tumors. The compositions, kits, and methods of the invention are thus useful for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of breast cancer in patients. In addition, these compositions, kits, and methods 

1 5 can be used to detect and differentiate lobular and ductal carcinoma breast cancers. 
When the compositions, kits, and methods of the invention are used for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of breast cancer in a patient, it is preferred that the marker or panel of markers of 
the invention is selected such that a positive result is obtained in at least about 20%, and 

20 preferably at least about 40%, 60%, or 80%, and more preferably in substantially all 
patients afflicted with an breast cancer of the corresponding stage, grade, histological 
type, or benign/malignant nature. Preferably, the marker or panel of markers of the 
invention is selected such that a PPV of greater than about 10% is obtained for the 
general population (more preferably coupled with an assay specificity greater than 

25 99.5%). 

When a plurality of markers of the invention are used in the compositions, kits, 
and methods of the invention, the level of expression of each marker in a patient sample 
can be compared with the normal level of expression of each of the plurality of markers 
in non-cancerous samples of the same type, either in a single reaction mixture (i.e. using 
30 reagents, such as different fluorescent probes, for each marker) or in individual reaction 
mixtures corresponding to one or more of the markers. In one embodiment, a 
significantly enhanced level of expression of more than one of the plurality of markers 
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in the sample, relative to the corresponding normal levels, is an indication that the 
patient is afflicted with breast cancer. In another embodiment, a significantly lower 
level of expression in the sample of each of the plurality of markers, relative to the 
corresponding normal levels, is an indication that the patient is afflicted with breast 
5 cancer. In yet another embodiment, a significantly enhanced level of expression of one 
or more markers and a significantly lower level of expression of one or more markers in 
a sample relative to the corresponding normal levels, is an indication that the patient is 
afflicted with breast cancer. When a plurality of markers is used, it is preferred that 2, 3, 
4, 5, 8, 10, 12, 15, 20, 30, or 50 or more individual markers be used, wherein fewer 

1 0 markers are preferred. 

In order to maximize the sensitivity of the compositions, kits, and methods of the 
invention (i.e. by interference attributable to cells of non-breast origin in a patient 
sample), it is preferable that the marker of the invention used therein be a marker which 
has a restricted tissue distribution, e.g., normally not expressed in a non-breast tissue. 

15 Only a small number of markers are known to be associated with breast cancers 

(e.g. BRCA1 and BRCA2). These markers are not, of course, included among the 
markers of the invention, although they may be used together with one or more markers 
of the invention in a panel of markers, for example. It is well known that certain types 
of genes, such as oncogenes, tumor suppressor genes, growth factor-like genes, 

20 protease-like genes, and protein kinase-like genes are often involved with development 
of cancers of various types. Thus, among the markers of the invention, use of those 
which correspond to proteins which resemble known proteins encoded by known 
oncogenes and tumor suppressor genes, and those which correspond to proteins which 
resemble growth factors, proteases, and protein kinases are preferred. 

25 Known oncogenes and tumor suppressor genes include, for example, abl, abr, 

akt2, ape, bcl2a, bcUfi, bcl3, bcr, brcal, brca2, cbl, ccndl, cdc42, cdk4, crk- II, 
csflr/Jhts, dbl, dec, dpc4lsmad4, e-cad, e2fllrbap, egfrlerbb-1, elkl, elk3, eph, erg, etsl, 
ets2,ferJgrlsrc2,flillergb2Jos,jpslfes,fral,fra2,fyn^ hck, hek, her2lerbb- 2/neu, 
her3lerbb-3, her4lerbb-4, hrasl, hst2, listfl, igfbp2, ink4a, ink4b, int21fgf3,junjunb, ' 

30 jund, kip2, kit, h*as2a, kras2b, Ick, lyn, mas, max, mcc, mdm2 ) met, mlhl, mmplO, mos, 
msh2, msh3, msh6, myb, myba, mybb, myc, mycll, myen, nfl, nf2, nme2, nras,p53, 
pdgfb,phb t pi?nl,pmsl,pms2,ptc,pten, rafl, rapla, rbl, rel, ret, rosl, ski, srcl, tall, 
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tgfbrl, tgft>3, tgfbr3, thral, thrb, tiaml, timp3, tjpl, tp53, trk, vav, vhl, vil2, wafl, wntl, 
wnt2, wtl, and yes 1 (Hesketh, 1997, In: The Oncogene and Tumour Suppressor Gene 
Facts Book, 2nd Ed., Academic Press; Fishel et al, 1994, Science 266:1403-1405). 
Known growth factors include platelet-derived growth factor alpha, platelet- 
5 derived growth factor beta (simian sarcoma viral {v-sis} oncogene homolog), 

thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte 
growth and development factor), erythropoietin, B cell growth factor, macrophage 
stimulating factor 1 (hepatocyte growth factor-like protein), hepatocyte growth factor 
(hepapoietin A), insulin-like growth factor 1 (somatomedia C), hepatoma-derived 

10 growth factor, amphiregulin (schwannoma-derived growth factor), bone morphogenetic 
proteins 1, 2, 3, 3 beta, and 4, bone morphogenetic protein 7 (osteogenic protein 1), bone 
morphogenetic protein 8 (osteogenic protein 2), connective tissue growth factor, 
connective tissue activation peptide 3, epidermal growth factor (EGF), teratocarcinoma- 
derived growth factor 1, endothelin, endothelin 2, endothelin 3, stromal cell-derived 

1 5 factor 1 , vascular endothelial growth factor (VEGF), VEGF-B, VEGF-C, placental 

growth factor (vascular endothelial growth factor-related protein), transforming growth 
factor alpha, transforming growth factor beta 1 and its precursors, transforming growth 
factor beta 2 and its precursors, fibroblast growth factor 1 (acidic), fibroblast growth 
factor 2 (basic), fibroblast growth factor 5 and its precursors, fibroblast growth factor 6 

20 and its precursors, fibroblast growth factor 7 (keratinocyte growth factor), fibroblast 
growth factor 8 (androgen-induced), fibroblast growth factor 9 (glia-activating factor), 
pleiotrophin (heparin binding growth factor 8, neurite growth-promoting factor 1), 
brain-derived neurotrophic factor, and recombinant glial growth factor 2. 

Known proteases include interleukin-1 beta convertase and its precursors, Mch6 

25 and its precursors, Mcli2 isoform alpha, Mch4, Cpp32 isoform alpha, Lice2 gamma 
cysteine protease, Ich-lS, Ich-IL, Ich-2 and its precursors, TY protease, matrix 
metalloproteinase 1 (interstitial collagenase), matrix metalloproteinase 2 (gelatinase A, 
72kD gelatinase, 72kD type IV collagenase), matrix metalloproteinase 7 (matrilysin), 
matrix metalloproteinase 8 (neutrophil collagenase), matrix metalloproteinase 12 

30 (macrophage elastase), matrix metalloproteinase 13 (collagenase 3), metallopeptidase 1, 
cysteine-rich metalloprotease (disintegrin) and its precursors, subtilisin-like protease Pc8 
and its precursors, chymotrypsin, snake venom-like protease, cathepsin 1, cathepsin D 
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(lysosomal aspartyl protease), stromelysin, aminopeptidase N, plasminogen, tissue 
plasminogen activator, plasminogen activator inhibitor type II, and urokinase-type 
plasminogen activator. 

Known protein kinases include DAP kinase, serine/threonine protein kinases 
5 NIK, PK428, Krs-2, S AK, and EMK, interferon-inducible double stranded RNA 
dependent protein kinase, FAST kinase, AIM1, IPLl-like midbody-associated protein 
kinase-1, NIMA-like protein kinase 1 (NLK1), the cyclin-dependent kinases (cdkl-10), 
checkpoint kinase Chkl, Nek3 protein kinase, BMK1 beta kinase, Clkl, Clk2, Clk3, 
extracellular signal-regulated kinases 1, 3, and 6, cdc28 protein kinase 1, cdc28 protein 

10 kinase 2, pLK, Mytl , c-Jun N-terminal kinase 2, Cam kinase 1 , the MAP kinases, 

insulin-stimulated protein kinase 1, beta-adrenergic receptor kinase 2, ribosomal protein 
S6 kinase, kinase suppressor of ras-1 (KSR1), putative serine/threonine protein kinase 
Prk, PkB kinase, cAMP-dependent protein kinase, cGMP-dependent protein kinase, type 
II cGMP-dependent protein kinase, protein kinases Dyrk2, Dyrk3, and Dyrk4, Rho- 

1 5 associated coiled-coil containing protein kinase pi 60ROCK, protein tyrosine kinase t- 
Rorl, Ste20-related kinases, cell adhesion kinase beta, protein kinase 3, stress-activated 
protein kinase 4, protein kinase Zpk, serine kinase hPAK65, dual specificity mitogen- 
activated protein kinases 1 and 2, casein kinase I gamma 2, p21 -activated protein kinase 
Pakl, lipid-activated protein kinase PRK2, focal adhesion kinase, dual-specificity 

20 tyrosine-phosphorylation regulated kinase, myosin light chain kinase, serine kinases 
SRPK2, TESK1, and VRK2, B lymphocyte serine/threonine protein kinase, stress- 
activated protein kinases JNK1 and JNK2, phosphorylase kinase, protein tyrosine kinase 
Tec, Jak2 kinase, protein kinase Ndr, MEK kinase 3, SHB adaptor protein (a Src 
homology 2 protein), agammaglobulinaemia protein-tyrosine kinase (Atk), protein 

25 kinase ATR, guanylate kinase 1, thrombopoeitin receptor and its precursors, DAG 
kinase epsilon, and kinases encoded by oncogenes or viral oncogenes such as v-fgr 
(Gardner-Rasheed), v-abl (Abelson murine leukemia viral oncogene homolog 1), v-arg 
(Abelson murine leukemia viral oncogene homolog, Abelson-related gene), v-fes and v- 
fps (feline sarcoma viral oncogene and Fuj inarm avian sarcoma viral oncogene 

30 homologs), proto-oncogene c-cot, oncogene pirn A, and oncogene masl . 
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It is recognized that the compositions, kits, and methods of the invention will be 
of particular utility to patients having an enhanced risk of developing breast cancer and 
their medical advisors. Patients recognized as having an enhanced risk of developing 
breast cancer include, for example, patients having a familial history of breast cancer, 
5 patients identified as having a mutant oncogene (i.e. at least one allele), and patients of 
advancing age (i.e. women older than about 50 or 60 years). 

The level of expression of a marker in normal (i.e. non-cancerous) human breast 
tissue can be assessed in a variety of ways. In one embodiment, this normal level of 
expression is assessed by assessing the level of expression of the marker in a portion of 

1 0 breast cells which appears to be non-cancerous and by comparing this normal level of 
expression with the level of expression in a portion of the breast cells which is suspected 
of being cancerous. For example, when mammography or other medical procedure, 
reveals the presence of a lump in a patient's breast, the normal level of expression of a 
marker may be assessed using the non-affected breast tissue, and this normal level of 

1 5 expression may be compared with the level of expression of the same marker in an 
affected portion (i.e. the lump) of the affected breast. Alternately, and particularly as 
further information becomes available as a result of routine performance of the methods 
described herein, population-average values for normal expression of the markers of the 
invention may be used. In other embodiments, the 'normal* level of expression of a 

20 marker may be determined by assessing expression of the marker in a patient sample 
obtained from a non-cancer-afflicted patient, from a patient sample obtained from a ' 
patient before the suspected onset of breast cancer in the patient, from archived patient 
samples, and the like. 

The invention includes compositions, kits, and methods for assessing the presence 

25 of breast cancer cells in a sample (e.g. an archived tissue sample or a sample obtained 
from a patient). These compositions, kits, and methods are substantially the same as 
those described above, except that, where necessary, the compositions, kits, and 
methods are adapted for use with samples other than patient samples. For example, 
when the sample to be used is a parafinized, archived human tissue sample, it can be 

30 necessary to adjust the ratio of compounds in the compositions of the invention, in the 
kits of the invention, or the methods used to assess levels of marker expression in the 
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sample. Such methods are well known in the art and within the skill of the ordinary 
artisan. 

The invention includes a kit for assessing the presence of breast cancer cells (e.g. 
in a sample such as a patient sample). The kit comprises a plurality of reagents, each of 
5 which is capable of binding specifically with a nucleic acid or polypeptide 

corresponding to a marker of the invention. Suitable reagents for binding with a 
polypeptide corresponding to a marker of the invention include antibodies, antibody 
derivatives, antibody fragments, and the like. Suitable reagents for binding with a 
nucleic acid (e.g. a genomic DNA, an mRNA, a spliced mRNA, a cDNA, or the like) 

10 include complementary nucleic acids. For example, the nucleic acid reagents may 
include oligonucleotides (labeled or non-labeled) fixed to a substrate, labeled 
oligonucleotides not bound with a substrate, pairs of PCR primers, molecular beacon 
probes, and the like. 

The kit of the invention may optionally comprise additional components useful 

1 5 for performing the methods of the invention. By way of example, the kit may comprise 
fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or for 
binding an antibody with a protein with which it specifically binds, one or more sample 
compartments, an instructional material which describes performance of a method of the 
invention, a sample of normal breast cells, a sample of breast cancer cells, and the like. 

20 The invention also includes a method of making an isolated hybridoma which 

produces an antibody useful for assessing whether patient is afflicted with breast cancer. 
In this method, a protein corresponding to a marker of the invention is isolated (e.g. by 
purification from a cell in which it is expressed or by transcription and translation of a 
nucleic acid encoding the protein in vivo or in vitro using known methods). A 

25 vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is immunized 
using the isolated protein or protein fragment. The vertebrate may optionally (and 
preferably) be immunized at least one additional time with the isolated protein or protein 
fragment, so that the vertebrate exhibits a robust immune response to the protein or 
protein fragment. Splenocytes are isolated from the immunized vertebrate and fused 

30 with an immortalized cell line to form hybridomas, using any of a variety of methods 
well known in the art. Hybridomas formed in this manner are then screened using 
standard methods to identify one or more hybridomas which produce an antibody which 
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specifically binds with the protein or protein fragment. The invention also includes 
hybridomas made by this method and antibodies made using such hybridomas. 

The invention also includes a method of assessing the efficacy of a test compound 
for inhibiting breast cancer cells. As described above, differences in the level of 
5 expression of the markers of the invention correlate with the cancerous state of breast 
cells. Although it is recognized that changes in the levels of expression of certain of the 
markers of the invention likely result from the cancerous state of breast cells, it is 
likewise recognized that changes in the levels of expression of other of the markers of 
the invention induce, maintain, and promote the cancerous state of those cells. Thus, 

10 compounds which inhibit breast cancer in a patient will cause the level of expression of 
one or more of the markers of the invention to change to a level nearer the normal level 
of expression for that marker {i.e. the level of expression for the marker in non- 
cancerous breast cells). 

This method thus comprises comparing expression of a marker in a first breast 

1 5 cell sample and maintained in the presence of the test compound and expression of the 
marker in a second breast cell sample and maintained in the absence of the test 
compound. A significant alteration in the level of expression of a marker listed in 
Tables 1-6, may be is an indication that the test compound inhibits breast cancer {e.g., 
decreases in expression in those markers that are over-expressed in breast cancer cells or 

20 more aggressive breast cancer cells and breast cancer cells from patients with poor 
clinical outcome or increases expression in those markers that are under-expressed in 
breast cancer cells or in more aggressive breast cancer cells or breast cancer cells from 
patients with poor clinical outcome. The breast cell samples may, for example, be 
aliquots of a single sample of normal breast cells obtained from a patient, pooled 

25 samples of normal breast cells obtained from a patient, cells of a normal breast cell line, 
aliquots of a single sample of breast cancer cells obtained from a patient, pooled samples 
of breast cancer cells obtained from a patient, cells of a breast cancer cell line, or the 
like. In one embodiment, the samples are breast cancer cells obtained from a patient and 
a plurality of compounds known to be effective for inhibiting various breast cancers are 

30 tested in order to identify the compound which is likely to best inhibit the breast cancer 
in the patient 
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This method may likewise be used to assess the efficacy of a therapy for 
inhibiting breast cancer in a patient. In this method, the level of expression of one or 
more markers of the invention in a pair of samples (one subjected to the therapy, the 
other not subjected to the therapy) is assessed. As with the method of assessing the 
5 efficacy of test compounds, if the therapy induces a significant alteration in the level of 
expression of a marker listed in Tables 1-6, or blocks induction of a marker listed in 
Tables 1-6, then the therapy may be efficacious for inhibiting breast cancer. As above, if 
samples from a selected patient are used in this method, then alternative therapies can be 
assessed in vitro in order to select a therapy most likely to be efficacious for inhibiting 

1 0 breast cancer in the patient. 

As described herein, breast cancer in patients is associated with levels of 
expression of one or more markers listed in Tables 1-6. While, as discussed above, 
some of these changes in expression level result from occurrence of the breast cancer, 
others of these changes induce, maintain, and promote the cancerous state of breast 

1 5 cancer cells. Thus, breast cancer characterized by an alteration in the level of expression 
of one or more markers listed in Tables 1-6 can be inhibited by hampering or increasing 
expression of those markers. 

Expression of a marker listed in Tables 1-6 can be inhibited in a number of ways, 
generally known in the art. For example, an antisense oligonucleotide can be provided 

20 to the breast cancer cells in order to inhibit transcription, translation, or both, of the 
marker(s). Alternately, a polynucleotide encoding an antibody, an antibody derivative, 
or an antibody fragment, and operably linked with an appropriate promoter/regulator 
region, can be provided to the cell in order to generate intracellular antibodies which 
will inhibit the fraction or activity of the protein corresponding to the marker (s). Using 

25 the methods described herein, a variety of molecules, particularly including molecules 
sufficiently small that they are able to cross the cell membrane, can be screened in order 
to identify molecules which inhibit expression of the marker(s). The compound so 
identified can be provided to the patient in order to inhibit expression of the marker(s) in 
the breast cancer cells of the patient. 

30 Expression of a marker listed within Tables 1-6 can be enhanced in number of 

ways generally known in the art. For example, a polynucleotide encoding the marker 
and operably linked with an appropriate promoter/regulator region can be provided to 
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breast cancer cells of the patient in order to induce enhanced expression of the protein 
(and mRNA) corresponding to the marker therein. Alternatively, if the protein is 
capable of crossing the cell membrane, inserting itself in the cell membrane, or is 
normally a secreted protein, then expression of the protein can be enhanced by providing 
5 the protein {e.g. directly or by way of the bloodstream or another breast-associated fluid) 
to breast cancer cells in the patient. 

As described above, the cancerous state of human breast cells is correlated with 
changes in the levels of expression of the markers of the invention. The invention thus 
includes a method for assessing the human breast cell carcinogenic potential of a test 

1 0 compound. This mediod comprises maintaining separate aliquots of human breast cells 
in the presence and absence of the test compound. Expression of a marker of the 
invention in each of the aliquots is compared. A significant alteration in the level of 
expression of a marker listed in Tables 1-6 in the aliquot maintained in the presence of 
the test compound (relative to the aliquot maintained in the absence of the test 

1 5 compound) may be an indication that the test compound possesses human breast cell 
carcinogenic potential. The relative carcinogenic potentials of various test compounds 
can be assessed by comparing the degree of enhancement or inhibition of the level of 
expression of the relevant markers, by comparing the number of markers for which the 
level of expression is enhanced or inhibited, or by comparing both. 

20 Various aspects of the invention are described in further detail in the following 

subsections. 

I Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to novel isolated nucleic acid molecules that 

25 correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention or a portion of such a 
polypeptide. Isolated nucleic acids of the invention also include nucleic acid molecules 
sufficient for use as hybridization probes to identify nucleic acid molecules that 
correspond to a marker of the invention, including nucleic acids which encode a 

30 polypeptide corresponding to a marker of the invention, and fragments of such nucleic 
acid molecules, e.g. , those suitable for use as PCR primers for the amplification or 
mutation of nucleic acid molecules. As used herein, the term "nucleic acid molecule" is 
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intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules 
(e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 

5 An "isolated" nucleic acid molecule is one which is separated from other nucleic 

acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein- 
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the 
5' and 3 r ends of the nucleic acid) in the genomic DNA of the organism from which the 

10 nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material, 

1 5 or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, e.g. , a nucleic acid encoding a 
protein corresponding to a marker listed in Tables 1-6, can be isolated using standard 
molecular biology techniques and the sequence information in the database records 

20 described herein. Using all or a portion of such nucleic acid sequences, nucleic acid 
molecules of the invention can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook et ah, ed., Molecular Cloning: A Laboratory 
Manual, 2nded., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

25 A process for identifying a larger fragment or the full-length coding sequence of a 

marker of the present invention is thus also provided. Any conventional recombinant 
DNA techniques applicable for isolating polynucleotides may also be employed. One 
such method involves the 5-RACE-PCR technique, in which the poly-A mRNA that 
contains the coding sequence of particular interest is first reverse transcribed with a 3'- 

30 primer comprising a sequence disclosed herein. The newly synthesized cDNA strand is 
then tagged with an anchor primer with a known sequence, which preferably contains a 
convenient cloning restriction site attached at the 5'end. The tagged cDNA is then 
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amplified with the 3'-primer (or a nested primer sharing sequence homology to the 
internal sequences of the coding region) and the 5 f -anchor primer. The amplification 
may be conducted under conditions of various levels of stringency to optimize the 
amplification specificity. 5'-RACE-PCR can be readily performed using commercial kits 
5 (available from, e.g., BRL Life Technologies Inc., Clontech) according to the 
manufacturer's instructions. 

Isolating the complete coding sequence of a gene can also be carried out in a 
hybridization assay using a suitable probe. The probe preferably comprises at least 10 
nucleotides, and more preferably exhibits sequence homology to the polynucleotides of 
1 0 the markers of the present invention. Other high throughput screens for cDNAs, such as 
those involving gene chip technology, can also be employed in obtaining the complete 
cDNA sequence. 

In addition, databases exist that reduce the complexity of ESTs by assembling 
contiguous EST sequences into tentative genes. For example, TIGR has assembled 

1 5 human ESTs into a datable called THC for tentative human consensus sequences. The 
THC database allows for a more definitive assignment compared to ESTs alone. 
Software programs exist (TIGR assembler and TIGEM EST assembly machine and 
contig assembly program (see Huang, X . , 1996, Genomes 33:21-23)) that allow for 
assembling ESTs into contiguous sequences from any organism. 

20 Alternatively, mRNA from a sample preparation is used to construct cDNA 

library in the ZAP Express vector following the procedure described in Velculescu et 
al, 1997, Science 270:484. The ZAP Express cDNA synthesis kit (Stratagene) is used 
accordingly to the manufacturer's protocol. Plates containing 250 to 2000 plaques are- 
hybridized as described in Rupert et al 9 1988, Mol Cell. Bio. 8:3104 to oligonucleotide 

25 probes with the same conditions previously described for standard probes except that the 
hybridization temperature is reduced to a room temperature. Washes are performed in 
6X standard-saline-citrate 0.1% SDS for 30 minutes at room temperature. The probes 
are labeled with 32 P-ATP trough use of T4 polynucleotide kinase. 

A partial cDNA (3' fragment) can be isolated by 3 ? directed PCR reaction. This 

30 procedure is a modification of the protocol described in Polyak et al y 1997, Nature 

389:300. Briefly, the procedure uses SAGE tags in PCR reaction such that the resultant 
PCR product contains the SAGE tag of interest as well as additional cDNA, the length 
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of which is defined by the position of the tag with respect to the 3' end of the cDNA. 
The cDNA product derived from such a transcript driven PCR reaction can be used for 
many applications. 

RNA from a source to express the cDNA corresponding to a given tag is first 
5 converted to double-stranded cDNA using any standard cDNA protocol. Similar 
conditions used to generate cDNA for SAGE library construction can be employed 
except that a modified oligo-dT primer is used to derive the first strand synthesis. For 
example, the oligonucleotide of composition 5*-B-TCC GGC GCG CCG TTT TCC 
CAG TCA CGA(30)- 3' 5 contains a poly-T stretch at the 3' end for hybridization and 

10 priming from poly- A tails, an Ml 3 priming site for use in subsequent PCR steps, a 5' 
Biotin label (B) for capture to strepavidin-coated magnetic beads, and an AscI restriction 
. endonuclease site for releasing the cDNA from the strepavidin-coated magnetic beads. 
Theoretically, any sufficiently-sized DNA region capable of hybridizing to a PCR 
primer can be used as well as any other 8 base pair recognizing endonuclease. 

1 5 cDNA constructed utilizing this or similar modified oligo-dT primer is then 

processed exactly as described in U.S. Patent No. 5,695,937 up until adapter ligation 
where only one adapter is ligated to the cDNA pool. After Adapter ligation, the cDNA 
is released from the streptavidin-coated magnetic beads and is then used as a template 
for cDNA amplification. 

20 Various PCR protocols can be employed using PCR priming sites within the 3 1 

modified oligo-dT primer and the SAGE tag. The SAGE tag-derived PCR primer 
employed can be of varying length dictated by 5' extension of the tag into the adaptor 
sequence. cDNA products are now available for a variety of applications. 

This technique can be further modified by: (1) altering the length and/or content 

25 of the modified oligo-dT primer; (2) ligating adaptors other than that previously 

employed within the SAGE protocol; (3) performing PCR from template retained on the 
streptavidin-coated magnetic beads; and (4) priming first strand cDNA synthesis with 
non-oligo-dT based primers. 

Gene trapper technology can also be used. The reagents and manufacturer's 

30 instructions for this technology are commercially available from Life Technologies, Inc., 
Gaithsburg, Maryland. Briefly, a complex population of single-stranded phagemid 
DNA containing directional cDNA inserts is enriched for the target sequence by 
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hybridization in solution to a biotinylated oligonucleotide probe complementary to the 
target sequence. The hybrids are captured on streptavidin-coated paramagnetic beads. 
A magnet retrieves the paramagnetic beads from the solution, leaving nonhybridized 
single-stranded DNAs behind. Subsequently, the captured single-stranded DNA target 
5 is released from the biotinylated oligonucleotide. After release, the cDNA clone is 
further enriched by using a nonbiotinylated target oligonucleotide to specifically prime 
conversion of the single-stranded DNA. Following transformation and plating, typically 
20% to 100% of the colonies represent the cDNA clone of interest. To identify the 
desired cDNA clone, the colonies may be screened by colony hybridization using the 

10 32 P-labeled oligonucleotide as described above for solution hybridization, or 
alternatively by DNA sequencing and alignment of all sequences obtained from 
numerous clones to determine a consensus sequence. 

A nucleic acid molecule of the invention can be amplified using cDNA, mRNA, 
or genomic DNA as a template and appropriate oligonucleotide primers according to 

1 5 standard PCR amplification techniques. The nucleic acid so amplified can be cloned 
into an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the 
invention can be prepared by standard synthetic techniques, e.g. , using an automated 
DNA synthesizer. 

20 In another preferred embodiment, an isolated nucleic acid molecule of the 

invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a nucleic acid corresponding to a marker 
of the invention or to the nucleotide sequence of a nucleic acid encoding a protein which 
corresponds to a marker of the invention. A nucleic acid molecule which is 

25 complementary to a given nucleotide sequence is one which is sufficiently 

complementary to the given nucleotide sequence that it can hybridize to the given 
nucleotide sequence thereby forming a stable duplex. 

Moreover, a nucleic acid molecule of the invention can comprise only a portion of 
a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 

30 marker of the invention or which encodes a polypeptide corresponding to a marker of 
the invention. Such nucleic acids can be used, for example, as a probe or primer. The 
probe/primer typically is used as one or more substantially purified oligonucleotides. 
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The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 7, preferably about 15, more preferably about 
25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive 
nucleotides of a nucleic acid of the invention. 
5 Probes based on the sequence of a nucleic acid molecule of the invention can be 

used to detect transcripts or genomic sequences corresponding to one or more markers 
of the invention. The probe comprises a label group attached thereto, e.g., a 
radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as part of a diagnostic test kit for identifying cells or tissues which mis- 

10 express the protein, such as by measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

The invention further encompasses nucleic acid molecules that differ, due to 
degeneracy of the genetic code, from the nucleotide sequence of nucleic acids encoding 

15 a protein which corresponds to a marker of the invention, and thus encode the same 
protein. 

In addition to the nucleotide sequences described herein, it will be appreciated by 
those skilled in the art that DNA sequence polymorphisms that lead to changes in the 
amino acid sequence can exist within a population (e.g. , the human population). Such 

20 genetic polymorphisms can exist among individuals within a population due to natural 
allelic variation. An allele is one of a group of genes which occur alternatively at a 
given genetic locus. In addition, it wall be appreciated that DNA polymorphisms that 
affect RNA expression levels can also exist that may affect the overall expression level 
of that gene (e.g., by affecting regulation or degradation). 

25 As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 

occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules comprising an open reading frame encoding a polypeptide corresponding to a 
marker of the invention. Such natural allelic variations can typically result in 1 -5% 

30 variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
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variety of individuals. Any and all such nucleotide variations and resulting amino acid 
polymorphisms or variations that are the result of natural allelic variation and that do not 
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the invention is at 
5 least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 
800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500, 
4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a 
nucleic acid corresponding to a marker of the invention or to a nucleic acid encoding a 
protein corresponding to a marker of the invention. As used herein, the term "hybridizes 
1 0 under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 75% (80%, 85%, preferably 90%) 
identical to each other typically remain hybridized to each other. Such stringent 
conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6 
of Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. (1989). A 
1 5 preferred, non-limiting example of stringent hybridization conditions for annealing two 
single-stranded DNA each of which is at least about 100 bases in length and/or for 
annealing a single-stranded DNA and a single-stranded RNA each of which is at least 
about 100 bases in length, are hybridization in 6X sodium chloride/sodium citrate (SSC) 
at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C. 
20 Further preferred hybridization conditions are taught in Lockhart, et al, Nature 

Biotechnology, Volume 14, 1996 August: 1675-1680; Breslauer, et al, Proc. Natl. Acad. 
Sci. USA, Volume 83, 1986 June: 3746-3750; Van Ness, et al, Nucleic Acids Research, 
Volume 19, No. 19, 1991 September: 5143-5151; McGraw, et al, BioTechniques, 
Volume 8, No. 6 1990: 674-678; andMilner, etal, Nature Biotechnology, Volume 15, 
1997 June: 537-541, all expressly incorporated by reference. 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of the 
invention that can exist in the population, the skilled artisan will further appreciate that 
sequence changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of 
the protein encoded thereby. For example, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" amino acid residues. A "non- 
essential" amino acid residue is a residue that can be altered from the wild-type 
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sequence without altering the biological activity, whereas an "essential" amino acid 
residue is required for biological activity. For example, amino acid residues that are not 
conserved or only semi-conserved among homologs of various species may be non- 
essential for activity and thus would be likely targets for alteration. Alternatively, amino 
5 acid residues that are conserved among the homologs of various species (e.g., murine 
and human) may be essential for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 

10 are not essential for activity. Such polypeptides differ in amino acid sequence from the 
naturally-occurring proteins which correspond to the markers of the invention, yet retain 
biological activity. In one embodiment, such a protein has an amino acid sequence that 
is at least about 40% identical, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the 
amino acid sequence of one of the proteins which correspond to the markers of the 

15 invention. 

An isolated nucleic acid molecule encoding a variant protein can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of nucleic acids of the invention, such that one or more amino acid 
residue substitutions, additions, or deletions are introduced into the encoded protein. 

20 Mutations can be introduced by standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. Families of amino acid residues having 

25 similar side chains have been defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic 
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, 
serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 

30 (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 

phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced 
randomly along all or part of the coding sequence, such as by saturation mutagenesis, 
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and the resultant mutants can be screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 

The present invention encompasses antisense nucleic acid molecules, i.e., 
5 molecules which are complementary to a sense nucleic acid of the invention, e.g. , 
complementary to the coding strand of a double-stranded cDNA molecule 
corresponding to a marker of the invention or complementary to an mRNA sequence 
corresponding to a marker of the invention. Accordingly, an antisense nucleic acid of 
the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the 

10 invention. The antisense nucleic acid can be complementary to an entire coding strand, 
or to only a portion thereof, e.g., all or part of the protein coding region (or open reading 
frame). An antisense nucleic acid molecule can also be antisense to all or part of a non- 
coding region of the coding strand of a nucleotide sequence encoding a polypeptide of 
the invention. The non-coding regions ("5* and 3* untranslated regions") are the 5' and 3' 

1 5 sequences which flank the coding region and are not translated into amino acids. 

An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 
40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense 

20 oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. Examples of modified nucleotides which can be 

25 used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine 9 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 

30 methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
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N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
5 Alternatively, the antisense nucleic acid can be produced biologically using an 
expression vector into which a nucleic acid has been sub-cloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

10 The antisense nucleic acid molecules of the invention are typically administered 

to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a polypeptide corresponding to a selected marker of the 
invention to thereby inhibit expression of the marker, e.g., by inhibiting transcription 
and/or translation. The hybridization can be by conventional nucleotide 

15 complementarity to form a stable duplex, or, for example, in the case of an antisense 
nucleic acid molecule which binds to DNA duplexes, through specific interactions in the 
major groove of the double helix. Examples of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site or 
infusion of the antisense nucleic acid into an breast-associated body fluid. Alternatively, 

20 antisense nucleic acid molecules can be modified to target selected cells and then 
administered systemically. For example, for systemic administration, antisense 
molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules 
to peptides or antibodies which bind to cell surface receptors or antigens. The antisense 

25 nucleic acid molecules can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the antisense molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a 
strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a-anomeric nucleic 

30 acid molecule. An a-anomeric nucleic acid molecule forms specific double-stranded 
hybrids with complementary RNA in which, contrary to the usual a-units, the strands 
run parallel to each other (Gaultier <tf a/., 1987 r , Nucleic Acids Res. 15:6625-6641). The 
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antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et 
al, 1987, Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue 
etal, 1987, FEBS Lett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
5 molecules with ribonuclease activity which are capable of cleaving a single-stranded 
nucleic acid, such as an mRNA 5 to which they have a complementary region. Thus, 
ribozymes {e.g., hammerhead ribozymes as described in Haselhoff and Gerlach, 1988, 
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity 

10 for a nucleic acid molecule encoding a polypeptide corresponding to a marker of the 
invention can be designed based upon the nucleotide sequence of a cDNA 
corresponding to the marker. For example, a derivative of a Tetrahymena L-19 IVS 
RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved (see Cech et al U.S. Patent No. 

15 4,987,071 ; and Cech et al U.S. Patent No. 5,1 16,742). Alternatively, an mRNA 

encoding a polypeptide of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules (see, e.g., Bartel and 
Szostak, 1993, Science 261:141 1-1418). 

The invention also encompasses nucleic acid molecules which form triple helical 

20 structures. For example, expression of a polypeptide of the invention can be inhibited 
by targeting nucleotide sequences complementary to the regulatory region of the gene 
encoding the polypeptide (e.g., the promoter and/or enhancer) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally Helene 
(1991) Anticancer Drug Bes. 6(6):569-84; Helene (1992) Ann. N.Y.Acad Sci. 660:27- 

25 36; and Maher (1992) Bioassays 14(12):807-1 5. 

In various embodiments, the nucleic, acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 

30 acids (see Hyrup et al, 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used 
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., 
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
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pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PNA oligomers can be 
performed using standard solid phase peptide synthesis protocols as described in Hyrup 
5 et al (1996), supra; Perry-O'Keefe et al (1996) Proc. Natl Acad. Set USA 93:14670- 
675. 

PNAs can be used in therapeutic and diagnostic applications. For example, PNAs 
can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 

10 PNAs can also be used, e.g. , in the analysis of single base pair mutations in a gene by, 
e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup (1996), supra; or as probes 
or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry-O'Keefe et 
al, 1996, Proc. Natl Acad. Sci. USA 93:14670-675). 

15 In another embodiment, PNAs can be modified, e.g. , to enhance their stability or 

cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation 
of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which can 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 

20 recognition enzymes, e.g., RNASE H and DNA polymerases, to interact with the DNA 
portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup, 
1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in 

25 Hyrup (1996), supra, and Finn et al. (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds 
such as 5 , -(4-methoxytrityl)amino-5 , -deoxy-thymidine phosphoramidite can be used as a 
link between the PNA and the 5' end of DNA (Mag et al, 1 989, Nucleic Acids Res. 

30 17:5973-88). PNA monomers are then coupled in a step-wise manner to produce a 
chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al, 1996, 
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be 
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synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al, 1975, 
Bioorganic Med. Chem. Lett 5:1119-111 24). 

In other embodiments, the oligonucleotide can include other appended groups 
such as peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad. 
Set USA 86:6553-6556; Lemaitre et al, 1987, Proc. Natl. Acad. Sci. USA 84:648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization-triggered cleavage agents (see, e.g., Krol etal, 1988, Bio/Techniques 
10 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To 
this end, the oligonucleotide can be conjugated to another molecule, e.g., a. peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The invention also includes molecular beacon nucleic acids having at least one 
1 5 region which is complementary to a nucleic acid of the invention, such that the 
molecular beacon is useful for quantitating the presence of the nucleic acid of the 
invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a 
pair of complementary regions and having a fluorophore and a fluorescent quencher 
associated therewith. The fluorophore and quencher are associated with different 
20 portions of the nucleic acid in such an orientation that when the complementary regions 
are annealed with one another, fluorescence of the fluorophore is quenched by the 
quencher. When the complementary regions of the nucleic acid are not annealed with 
one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular 
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 

25 

II. Isolated Proteins and Antibodies 

One aspect of the invention pertains to isolated proteins which correspond to 
individual markers of the invention, and biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to raise antibodies directed 
30 against a polypeptide corresponding to a marker of the invention. In one embodiment, 
the native polypeptide corresponding to a marker can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification 
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techniques. In another embodiment, polypeptides corresponding to a marker of the 
invention are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a polypeptide corresponding to a marker of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 
5 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of protein in which the 

10 protein is separated from cellular components of the cells from which it is isolated or 
recombinantly produced. Thus, protein that is substantially free of cellular material 
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry 
weight) of heterologous protein (also referred to herein as a "contaminating protein"). 
When the protein or biologically active portion thereof is recombinantly produced, it is 

1 5 also preferably substantially free of culture medium, i. e. , culture medium represents less 
than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein 
is produced by chemical synthesis, it is preferably substantially free of chemical 
precursors or other chemicals, i.e., it is separated from chemical precursors or other 
chemicals which are involved in the synthesis of the protein. Accordingly such 

20 preparations of the protein have less than about 30%, 20%, 1 0%, 5% (by dry weight) of 

chemical precursors or compounds other than the polypeptide of interest. 

o 

Biologically active portions of a polypeptide corresponding to a marker of the 
invention include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of the protein corresponding to the marker, 

25 which include fewer amino acids than the full length protein, and exhibit at least one 
activity of the corresponding full-length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 
biologically active portion of a protein of the invention can be a polypeptide which is, 
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other 

30 biologically active portions, in which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one or more of the functional 
activities of the native form of a polypeptide of the invention. 
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Preferred polypeptides have amino acid sequences encoded by the nucleic acid 
sequences described herein. Other useful proteins are substantially identical {e.g., at 
least about 40%, preferably 50%, 60%, 70%, 80%, 90%, 95%, or 99%) to one of these 
sequences and retain the functional activity of the protein of the corresponding 
5 naturally-occurring protein yet differ in amino acid sequence due to natural allelic 
variation or mutagenesis. 

To deteimine the percent identity of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid^or nucleic acid sequence for optimal 

10 alignment with a second amino or nucleic acid sequence). The ammo acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position. The percent identity between the two sequences 

15 is a function of the number of identical positions shared by the sequences (/. e. , % 

identity = # of identical positions/total # of positions (e.g., overlapping positions) xlOO). 
In one embodiment the two sequences are the same length. 

The determination of percent identity between two sequences can be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 

20 mathematical algorithm utilized for the comparison of two sequences is the algorithm of 
Karlin and Altschul (1990) Proc. Natl. Acad. Set. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an 
algorithm is incorporated into the NBLAST and XBLAST programs of Altschul, et al 
(1990) J. Mol Biol 215:403-410. BLAST nucleotide searches can be performed with 

25 the NBLAST program, score = 1 00, wordlength = 12 to obtain nucleotide sequences 
homologous to a nucleic acid molecules of the invention. BLAST protein searches can 
be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino 
acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 

30 Altschul et al (1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be 
used to perform an iterated search which detects distant relationships between 
molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 
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default parameters of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of 
Myers and Miller, (1988) CABIOS 4:1 1-17. Such an algorithm is incorporated into the 
5 ALIGN program (version 2.0) which is part of the GCG sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be 
used. Yet another useful algorithm for identifying regions of local sequence similarity 
and alignment is the FASTA algorithm as described in Pearson and Lipman (1988) 

10 Proc. Natl Acad Set USA 85:2444-2448. When using the FASTA algorithm for 

comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for 
example, be used with a £-tuple value of 2. 

The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 

1 5 identity, only exact matches are counted. 

The invention also provides chimeric or fusion proteins corresponding to a 
marker of the invention. As used herein, a "chimeric protein" or "fusion protein" 
comprises all or part (preferably a biologically active part) of a polypeptide 
corresponding to a marker of the invention operably linked to a heterologous 

20 polypeptide (/. e. , a polypeptide other than the polypeptide corresponding to the marker). 
Within the fusion protein, the term "operably linked" is intended to indicate that the 
polypeptide of the invention and the heterologous polypeptide are fused in-frame to each 
other. The heterologous polypeptide can be fused to the amino-terminus or the 
carboxyl-terminus of the polypeptide of the invention. 

25 One useful fusion protein is a GST fusion protein in which a polypeptide 

corresponding to a marker of the invention is fused to the carboxyl terminus of GST 
sequences. Such fusion proteins can facilitate the purification of a recombinant 
polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 

30 sequence at its amino terminus. For example, the native signal sequence of a 

polypeptide corresponding to a marker of the invention can be removed and replaced 
with a signal sequence from another protein. For example, the gp67 secretory sequence 
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of the baculovirus envelope protein can be used as a heterologous signal sequence 
(Ausubel et al 9 ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY, 
1992). Other examples of eukaryotic heterologous signal sequences include the 
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; 
5 La Jolla, California). In yet another example, useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et aL, supra) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

In yet another embodiment, the fusion protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide corresponding to a marker of the invention 

1 0 is fused to sequences derived from a member of the immunoglobulin protein family. 
The immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between a ligand (soluble or membrane- bound) and a protein on the surface of a cell 
(receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion 

1 5 protein can be used to affect the bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction can be useful therapeutically, 
both for treating proliferative and differentiative disorders and for modulating (e.g. 
promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of 
the invention can be used as immunogens to produce antibodies directed against a 

20 polypeptide of the invention in a subject, to purify ligands and in screening assays to 
identify molecules which inhibit the interaction of receptors with ligands. 

Chimeric and fusion proteins of the invention can be produced by standard 
recombinant DNA techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. 

25 Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and re-amplified to generate a chimeric 
gene sequence (see, e.g., Ausubel et al, supra). Moreover, many expression vectors are 
commercially available that already encode a fusion moiety (e.g., a GST polypeptide). 

30 A nucleic acid encoding a polypeptide of the invention can be cloned into such an 

expression vector such that the fusion moiety is linked in-frame to the polypeptide of the 
invention. 
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A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences are typically characterized by a 
core of hydrophobic amino acids which are generally cleaved from the mature protein 
during secretion in one or more cleavage events. Such signal peptides contain 
5 processing sites that allow cleavage of the signal sequence from the mature proteins as 
they pass through the secretory pathway. Thus, the invention pertains to the described 
polypeptides having a signal sequence, as well as to polypeptides from which the signal 
sequence has been proteolytically cleaved (i.e., the cleavage products). In one 
embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked 

10 in an expression vector to a protein of interest, such as a protein which is ordinarily not 
secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the -expression vector is transformed, 
and the signal sequence is subsequently or concurrently cleaved. The protein can then 
be readily purified from the extracellular medium by art recognized methods. 

15 Alternatively, the signal sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a GST domain. 

The present invention also pertains to variants of the polypeptides corresponding 
to individual markers of the invention. Such variants have an altered amino acid 
sequence which can function as either agonists (mimetics) or as antagonists. Variants 

20 can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist 
can retain substantially the same, or a subset, of the biological activities of the naturally 
occurring form of the protein. An antagonist of a protein can inhibit one or more of the 
activities of the naturally occurring form of the protein by, for example, competitively 
binding to a downstream or upstream member of a cellular signaling cascade which 

25 includes the protein of interest Thus, specific biological effects can be elicited by 
treatment wich a variant of limited function. Treatment of a subject with a variant 
having a subset of the biological activities of the naturally occurring form of the protein 
can have fewer side effects in a subject relative to treatment with the naturally occurring 
form of the protein. 

30 Variants of a protein of the invention which function as either agonists (mimetics) 

or as antagonists can be identified by screening combinatorial libraries of mutants, e.g., 
truncation mutants, of the protein of the invention for agonist or antagonist activity. In 
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one embodiment, a variegated library of variants is generated by combinatorial 
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A 
variegated library of variants can be produced by, for example, enzymatically ligating a 
mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of 
5 potential protein sequences is expressible as individual polypeptides, or alternatively, as 
a set of larger fusion proteins (e.g., for phage display). There are a variety of methods 
which can be used to produce libraries of potential variants of the polypeptides of the 
invention from a degenerate oligonucleotide sequence. Methods for synthesizing 
degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983, Tetrahedron 
10 39:3; Itakura et al, 1984, Annu. Rev. Biochem. 53:323; Itakura et aL, 1984, Science 
198:1056; Ike etal, 1983 Nucleic Acid Res. 11:477). 

In addition, libraries of fragments of the coding sequence of a polypeptide 
corresponding to a marker of the invention can be used to generate a variegated 
population of polypeptides for screening and subsequent selection of variants. For 
15 example, a library of coding sequence fragments can be generated by treating a double 
stranded PCR fragment of the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the double 
stranded DNA, renaturing the DNA to form double stranded DNA which can include 
sense/antisense pairs from different nicked products, removing single stranded portions 
20 from reformed duplexes by treatment with S 1 nuclease, and ligating the resulting 

fragment library into an expression vector. By this method, an expression library can be 
derived which encodes amino terminal and internal fragments of various sizes of the 
protein of interest. 

Several techniques are known in the art for screening gene products of 
25 combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. The most widely used 
techniques, which are amenable to high through-put analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
30 combinatorial genes under conditions in which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product was detected. Recursive 
ensemble mutagenesis (REM), a technique which enhances the frequency of functional 
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mutants in the libraries, can be used in combination with the screening assays to identify 
variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl. Acad. Set 
USA 59:781 1-7815; Delgrave et al, 1993, Protein Engineering 6(3):327- 331). 

An isolated polypeptide corresponding to a marker of the invention, or a fragment 

5 thereof, can be used as an immunogen to generate antibodies using standard techniques 
for polyclonal and monoclonal antibody preparation. The full-length polypeptide or 
protein can be used or, alternatively, the invention provides antigenic peptide fragments 
for use as immunogens. The antigenic peptide of a protein of the invention comprises at 
least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues of the amino acid 

1 0 sequence of one of the polypeptides of the invention, and encompasses an epitope of the 
protein such that an antibody raised against the peptide forms a specific immune 
complex with a marker of the invention to which the protein corresponds. Preferred 
epitopes encompassed by the antigenic peptide are regions that are located on the 
surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence analysis, 

1 5 hydrophilicity sequence analysis, or similar analyses can be used to identify hydrophilic 
regions. 

An immunogen typically is used to prepare antibodies by immunizing a suitable 
(i.e. immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or 
vertebrate. An appropriate immunogenic preparation can contain, for example, 

20 recombinantly-expressed or chemically-synthesized polypeptide. The preparation can 
further include an adjuvant, such as Freund's complete or incomplete adjuvant, or a 
similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The terms "antibody" and "antibody substance" 

25 as used interchangeably herein refer to immunoglobulin molecules and immunologically 
active portions of immunoglobulin molecules, I e. , molecules that contain an antigen 
binding site which specifically binds an antigen, such as a polypeptide of the invention, 
e.g., an epitope of a polypeptide of the invention. A molecule which specifically binds 
to a given polypeptide of the invention is a molecule which binds the polypeptide, but 

30 does not substantially bind other molecules in a sample, e.g. , a biological sample, which 
naturally contains the polypeptide. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab') 2 fragments which can be generated 
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by treating the antibody with an enzyme such as pepsin. The invention provides 
polyclonal and monoclonal antibodies. The term "monoclonal antibody" or 
"monoclonal antibody composition" , as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
5 immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable, subject with a polypeptide of the invention as an immunogen. Preferred 
polyclonal antibody compositions are ones that have been selected for antibodies 
directed against a polypeptide or polypeptides of the invention. Particularly preferred 

10 polyclonal antibody preparations are ones that contain only antibodies directed against a 
polypeptide or polypeptides of the invention. Particularly preferred immunogen 
compositions are those that contain no other human proteins such as, for example, 
immunogen compositions made using a non-human host cell for recombinant expression 
of a polypeptide of the invention. In such a maimer, the only human epitope or epitopes 

15 recognized by the resulting antibody compositions raised against this immunogen will 
be present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in the immunized subject can be monitored over time by 
standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) 
using immobilized polypeptide. If desired, the antibody molecules can be harvested or 

20 isolated from the subject (e.g., from the blood or serum of the subject) and further 

purified by well-known techniques, such as protein A chromatography to obtain the IgG 
fraction. Alternatively, antibodies specific for a protein or polypeptide of the invention 
can be selected or (e.g., partially purified) or purified by, e.g., affinity chromatography. 
For example, a recombinantly expressed and purified (or partially purified) protein of 

25 the invention is produced as described herein, and covalently or non-covalently coupled 
to a solid support such as, for example, a chromatography column. The column can then 
be used to affinity purify antibodies specific for the proteins of the invention from a 
sample containing antibodies directed against a large number of different epitopes, 
thereby generating a substantially purified antibody composition, i.e., one that is 

30 substantially free of contaminating antibodies. By a substantially purified antibody 
composition is meant, in this context, that the antibody sample contains at most only 
30% (by dry weight) of contaminating antibodies directed against epitopes other than 
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those of the desired protein or polypeptide of the invention, and preferably at most 20%, 
yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the 
sample is contaminating antibodies. A purified antibody composition means that at least 
99% of the antibodies in the composition are directed against the desired protein or 
5 polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, such as the hybridoma technique 
originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B 

10 cell hybridoma technique (see Kozbor et al, 1983, Immunol Today 4:72), the EBV- 
hybridoma technique (see Cole et al, pp. 77-96 In Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing 
hybridomas is well known (see generally Current Protocols in Immunology, Coligan et 
al ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a 

1 5 monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of interest, e.g., using a standard 
ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal antibody directed against a polypeptide of the invention can be identified 
20 and isolated by screening a recombinant combinatorial immunoglobulin libiary {e.g., an 

antibody phage display library) with the polypeptide of interest. Kits for generating and 

screening phage display libraries are commercially available {e.g., the Pharmacia 
. Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the Stravagene 

SurfZAP Phage Display Kit, Catalog No. 2406 1 2). Additionally, examples of methods 
25 and reagents particularly amenable for use in generating and screening antibody display 

library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. 

WO 92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 

92/20791 ; PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; 

PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 
30 Publication No. WO 90/02809; Fuchs et al (1991) Bio/Technology 9:1370-1372; Hay et 

al (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275- 

1281; Griffiths etal (1993) EMBO J. 12:725-734. 
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Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the 
invention. A chimeric antibody is a molecule in which different portions are derived 
5 from different animal species, such as those having a variable region derived from a 
murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et al., 
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are 
incorporated herein by reference in their entirety.) Humanized antibodies are antibody 
molecules from non-human species having one or more complementarily determining 

10 regions (CDRs) from the non-human species and a framework region from a human 
immunoglobulin molecule. (See, e.g., Queen, U.S. Patent No. 5,585,089, which is 
incorporated herein by reference in its entirety.) Such chimeric and humanized 
monoclonal antibodies can be produced by recombinant DNA techniques known in the 
art, for example using methods described in PCT Publication No. WO 87/02671; 

1 5 European Patent Application 1 84, 1 87; European Patent Application 1 7 1 ,496; European 
Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 
4,816,567; European Patent Application 125,023; Better et al (1988) Science 240:1041- 
1043; Liu etal. (1987) Proc. Natl Acad. Sci. USA 84:3439-3443; Liu et a!. (1987) J. 
Immunol 139:3521-3526; Sun eta!. (1987) Proa Natl Acad Sci. USA 84:214-218; 

20 Nishimura et al (1987) Cancer Res. 47:999-1005; Wood et al (1985) Nature 314:446- 
449; and Shaw et al (1988) J. Natl Cancer Inst 80:1553-1559); Morrison (1985) 
Science 229:1202-1207; Oi et al (1986) Bio/Techniques 4:214; U.S. Patent 5,225,539; 
Jones et al (1986) Nature 321.552-525; Verhoeyan et al (1988) Science 239:1534; and 
Beidlere/a/. (1988) J. Immunol 141:4053-4060. 

25 Antibodies of the invention may be used as therapeutic agents in treating cancers. 

In a preferred embodiment, completely human antibodies of the invention are used for 
therapeutic treatment of human cancer patients, particularly those having breast cancer. 
Such antibodies can be produced, for example, using transgenic mice which are 
incapable of expressing endogenous immunoglobulin heavy and light chains genes, but 

30 which can express human heavy and light chain genes. The transgenic mice are 
immunized in the normal fashion with a selected antigen, e.g., all or a portion of a 
polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed 
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against the antigen can be obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the transgenic mice rearrange during B 
cell differentiation, and subsequently undergo class switching and somatic mutation. 
Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA 
5 and IgE antibodies. For an overview of this technology for producing human antibodies, 
see Lonberg and Huszar (1995) Int. Rev. Immunol 13:65-93). For a detailed discussion 
of this technology for producing human antibodies and human monoclonal antibodies 
and protocols for producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 

10 5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, CA), can be 
engaged to provide human antibodies directed against a selected antigen using 
technology similar to that described above. 

Completely human antibodies which recognize a selected epitope can be 
generated using a technique referred to as "guided selection." In this approach a 

15 selected non-human monoclonal antibody, e.g. , a murine antibody, is used to guide the 
selection of a completely human antibody recognizing the same epitope (Jespers et al } 
1994, Bio/technology 12:899-903). 

An antibody directed against a polypeptide corresponding to a marker of the 
invention (e.g., a monoclonal antibody) can be used to isolate the polypeptide by 

20 standard techniques, such as affinity chromatography or immunoprecipitation. 

Moreover, such an antibody can be used to detect the marker (e.g., in a cellular lysate or 
cell supernatant) in order to evaluate the level and pattern of expression of the marker. 
The antibodies can also be used diagnostically to monitor protein levels in tissues or 
body fluids (e.g. in an ovary-associated body fluid) as part of a clinical testing . 

25 procedure, e.g. , to, for example, determine the efficacy of a given treatment regimen. 
Detection can be facilitated by coupling the antibody to a detectable substance. 
Examples of detectable substances include various enzymes, prosthetic groups, 
fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 

30 phosphatase, (3-gaIactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
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rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent 
materials include luciferase, luciferin, and aequorin, and examples of suitable 
radioactive material include * 25 I, 13 *I, 35 S or 3 H. 
5 Further, an antibody (or fragment thereof) can be conjugated to a 

therapeutic moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A 
cytotoxin or cytotoxic agent includes any agent that is detrimental to cells. Examples 
include taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, 
etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, 

1 0 dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 

1-dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and 
puromycin and analogs or homologs thereof. Therapeutic agents include, but are not 
limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, 
cytarabine, 5-fluorouraciI decarbazine), alkylating agents (e.g., mechlorethamine, 

1 5 thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine (CCNU), 
cyclothosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and 
cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin 
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly 
actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic 

20 agents (e.g., vincristine and vinblastine). 

The conjugates of the invention can be used for modifying a given biological 
response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing.a desired biological activity. Such proteins may include, for example, a toxin 

25 such as abrin, ricin A, pseudomonas exotoxin, or diphtheria, toxin; a protein such as 
tumor necrosis factor, .alpha.-interferon, .beta. -interferon, nerve growth factor, platelet 
derived growth factor, tissue plasminogen activator; or, biological response modifiers 
such as, for example, lymphokines, interleukin-l ("IL-1"), interleukin-2 ("IL-2"), 
interleukin-6 ("IL-6"), granulocyte macrophase colony stimulating factor ("GM-CSF"), 

30 granulocyte colony stimulating factor ("G-CSF"), or other growth factors. 
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Techniques for conjugating such therapeutic moiety to antibodies are well 
known, see, e.g., Amon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs 
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. 
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al, "Antibodies For Drug 
5 Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical 
Applications, Pinchera et al (eds.), pp. 475-506 (1985); "Analysis, Results, And Future 
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in 

10 Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.) 5 pp. 
303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

15 Accordingly, in one aspect, the invention provides substantially purified antibodies or 
fragments thereof, and non-human antibodies or fragments thereof, which antibodies or 
fragments specifically bind to a polypeptide comprising an amino acid sequence selected 
from the group consisting of the amino acid sequences of the present invention, an 
amino acid sequence encoded by the cDNA of the present invention, a fragment of at 

20 least 1 5 amino acid residues of an amino acid sequence of the present invention, an 
amino acid sequence which is at least 95% identical to the amino acid sequence of the 
present invention (wherein the percent identity is determined using the ALIGN program 
of the GCG software package with a PAM120 weight residue table, a gap length penalty 
of 12, and a gap penalty of 4) and an amino acid sequence which is encoded by a nucleic 

25 acid molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid 
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. In 
various embodiments, the substantially purified antibodies of the invention, or fragments 
thereof, can be human, non-human, chimeric and/or humanized antibodies. 

30 In another aspect, the invention provides non-human antibodies or fragments 

thereof, which antibodies or fragments specifically bind to a polypeptide comprising an 
amino acid sequence selected from the group consisting of: the amino acid sequence of 
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the present invention, an amino acid sequence encoded by the cDNA of the present 
invention, a fragment of at least 1 5 amino acid residues of the amino acid sequence of 
the present invention, an amino acid sequence which is at least 95% identical to the 
amino acid sequence of the present invention (wherein the percent identity is determined 
5 using the ALIGN program of the GCG software package with a PAM120 weight residue 
table, a gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence 
which is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule consisting of the nucleic acid molecules of the present invention, or a 
complement thereof, under conditions of hybridization of 6X SSC at 45°C and washing 
1 0 in 0.2 X SSC, 0. 1 % SDS at 65°C. Such non-human antibodies can be goat, mouse, 

sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies 
of the invention can be chimeric and/or humanized antibodies. In addition, the non- 
human antibodies of the invention can be polyclonal antibodies or monoclonal 
antibodies. 

15 In still a further aspect, the invention provides monoclonal antibodies or 

fragments thereof, which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of the amino acid 
sequences of the present invention, an amino acid sequence encoded by the cDNA of the 
present invention, a fragment of at least 15 amino acid residues of an amino acid 

20 sequence of the present invention, an amino acid sequence which is at least 95% 
identical to an amino acid sequence of the present invention (wherein the percent 
identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4) and an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to a 

25 nucleic acid molecule consisting of the nucleic acid molecules of the present invention, 
or a complement thereof, under conditions of hybridization of 6X SSC at 45°C and 
washing in 0.2 X SSC, 0.1% SDS at 65°C. The monoclonal antibodies can be human, 
humanized, chimeric and/or non-human antibodies. 

The substantially purified antibodies or fragments thereof may specifically bind 

30 to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 
cytoplasmic domain or cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
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thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof, of the invention specifically bind to a secreted sequence 
or an extracellular domain of the amino acid sequences of the present invention. 

Any of the antibodies of the invention can be conjugated to a therapeutic moiety 
5 or to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

The invention also provides a kit containing an antibody of the invention 

10 conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and 
a pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical 
composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutically acceptable carrier. 

1 5 Still another aspect of the invention is a method of making an antibody that 

specifically recognizes a polypeptide of the present invention, the method comprising 
immunizing a mammal with a polypeptide. The polypeptide used as an immungen 
comprises an amino acid sequence selected from the group consisting of the amino acid 
sequence of the present invention, an amino acid sequence encoded by the cDNA of the 

20 nucleic acid molecules of the present invention, a fragment of at least 1 5 amino acid 
residues of the amino acid sequence of the present invention, an amino acid sequence 
which is at least 95% identical to the amino acid sequence of the present invention 
(wherein the percent identity is determined using the ALIGN program of the GCG 
software package with a PAM120 weight residue table, a gap length penalty of 12, and a 

25 gap penalty of 4) and an amino acid sequence which is encoded by a nucleic acid 
molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid 
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. 

After immunization, a sample is collected from the mammal that contains an 

30 antibody that specifically recognizes the polypeptide. Preferably, the polypeptide is 

recombinantly produced using a non-human host cell. Optionally, the antibodies can be 
further purified from the sample using techniques well known to those of skill in the art. 
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The method can further comprise producing a monoclonal antibody- producing cell from 
the cells of the mammal Optionally, antibodies are collected from the 
antibody-producing cell. 

5 HI. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding a polypeptide corresponding to a marker of the 
invention (or a portion of such a polypeptide). As used herein, the term "vector" refers 
to a nucleic acid molecule capable of transporting another nucleic acid to which it has 

10 been linked. One type of vector is a "plasmid", which refers to a circular double 

stranded DNA loop into which additional DNA segments can be ligated. Another type 
of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which 
they are introduced (e.g., bacterial vectors having a bacterial origin of replication and 

1 5 episomal mammalian vectors). Other vectors (e.g. , non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors, namely 
expression vectors, are capable of directing the expression of genes to which they are 
operably linked. In general, expression vectors of utility in recombinant DNA 

20 techniques are often in the form of plasmids (vectors). However, the invention is 

intended to include such other forms of expression vectors, such as viral vectors (e.g., 
replication defective retroviruses, adenoviruses and adeno-associated viruses), which 
serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 

25 the invention in a form suitable for expression of the nucleic acid in a host cell. This 
means that the recombinant expression vectors include one or more regulatory 
• sequences, selected on the basis of the host cells to be used for expression, which is 
operably linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 

30 interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
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sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol. 185, 
Academic Press, San Diego, CA (1991). Regulatory sequences include those which 
5 direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 
host cell to be transformed, the level of expression of protein desired, and the like. The 
10 expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein. 

The recombinant expression vectors of the invention can be designed for 
expression of a polypeptide corresponding to a marker of the invention in prokaryotic 
15 (e.g., E. coli) or eukaryotic cells (e.g., insect cells {using baculovirus expression 

vectors}, yeast cells or mammalian cells). Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed 
and translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

20 Expression of proteins in prokaryotes is most often carried out in E. coli with 

vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non- fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 

25 2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein 
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 

30 and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson, 1988, Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and 
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pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et aL, 1988, Gene 69:301-315) and pET lid (Studier et aL, p. 60-89, In Gene 
5 Expression Technology: Methods in Enzymology vol.185, Academic Press, San Diego, 
CA, 1991). Target gene expression from the pTrc vector relies on host RNA 
polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression 
from the pET 1 Id vector relies on transcription from a T7 gnlO-lac fusion promoter 
mediated by a co-expressed viral RNA polymerase (T7 gnl). This viral polymerase is 

10 supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage 
harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express 
the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, p, 1 19-128, In Gene Expression Technology: Methods 

1 5 in Enzymology vol. 1 85, Academic Press, San Diego, CA, 1 990. Another strategy is to 
alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 
vector so that the individual codons for each amino acid are those preferentially utilized 
in E. coli (Wada et al y 1992, Nucleic Acids Res. 20:21 1 1-2118). Such alteration of 
nucleic acid sequences of the invention can be carried out by standard DNA synthesis 

20 techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari et 
aL, 1987, EMBOJ, 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933- 
943), pJRY88 (Schultz et al, 1987, Gene 54:1 13-123), pYES2 (Invitrogen Corporation, 

25 San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells {e.g., Sf 
9 cells) include the pAc series (Smith et al, 1983,Mo/. CellBiol 3:2156-2165) and the 
pVL series (Lucklow and Summers, 1989, Virology 170:31-39). 

30 In yet another embodiment, a nucleic acid of the invention is expressed in 

mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC 
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(Kaufinan et aL, 1987, EMBO J. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
5 prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook et aL, supra. 

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type (e.g. , • 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue- 

10 specific promoters include the albumin promoter (liver-specific; Pinkert et aL, 1 987, 
Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton, 1988, Adv. 
Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore, 1989, EMBO J. 8:729-733) and immunoglobulins (Banerji et ah, 1983, Cell 
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters 

15 (e.g., the neurofilament promoter; Byrne and Ruddle, 1989, Proc. Natl. Acad. ScL USA 
86:5473-5477), pancreas-specific promoters (Edlund etal., 1985, Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
4,873,316 and European Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example the murine hox promoters 

20 (Kessel and Gruss, 1990, Science 249:374-379) and the a-fetoprotein promoter (Camper 
and Tilghman, 1989, Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operably linked to a regulatory sequence in a 

25 manner which allows for expression (by transcription of the DNA molecule) of an RNA 
molecule which is antisense to the mRN A encoding a polypeptide of the invention. 
Regulatory sequences operably linked to a nucleic acid cloned in the antisense 
orientation can be chosen which direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or 

30 regulatory sequences can be chosen which direct constitutive, tissue-specific or cell type 
specific expression of antisense RNA. The antisense expression vector can be in the 
form of a recombinant plasmid, phagemid, or attenuated virus in which antisense nucleic 
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acids are produced under the control of a high efficiency regulatory region, the activity 
of which can be determined by the cell type into which the vector is introduced. For a 
discussion of the regulation of gene expression using antisense genes see Weintraub et 
al, 1986, Trends in Genetics, Vol. 1(1). 
5 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
1 0 to either mutation or environmental influences, such progeny may not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic {e.g., E. coli) or eukaryotic cell {e.g., insect 
cells, yeast or mammalian cells). 

1 5 Vector DNA can be introduced into prokaryotic or eukaryotic cells via 

conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 

20 lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells ;an be found in Sambrook, et al {supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is blown that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 

25 integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 
hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable 

30 marker gene will survive, while the other cells die). 
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A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce a polypeptide corresponding to a marker of the 
invention. Accordingly, the invention further provides methods for producing a 
polypeptide corresponding to a marker of the invention using the host cells of the 
5 invention. In one embodiment, the method comprises culturing the host cell of 

invention (into which a recombinant expression vector encoding a polypeptide of the 
invention has been introduced) in a suitable medium such that the marker is produced. 
In another embodiment, the method further comprises isolating the marker polypeptide 
from the medium or the host cell. 

1 0 The host cells of the invention can also be used to produce nonhuman transgenic 

animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences encoding a polypeptide 
corresponding to a marker of the invention have been introduced. Such host cells can 
then be used to create non-human transgenic animals in which exogenous sequences 

15 encoding a marker protein of the invention have been introduced into their genome or 
homologous recombinant animals in which endogenous gene(s) encoding a polypeptide 
corresponding to a marker of the invention sequences have been altered. Such animals 
are useful for studying the function and/or activity of the polypeptide corresponding to 
the marker and for identifying and/or evaluating modulators of polypeptide activity. As 

20 used herein, a "transgenic animal" is a non-human animal, preferably a mamma!, more 
preferably a rodent such as a rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of transgenic animals include no n- human 
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal 

25 develops and which remains in the genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous recombinant animal" is a non- 
human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
gene has been altered by homologous recombination between the endogenous gene and 

30 an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 
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A transgenic animal of the invention can be created by introducing a nucleic acid 
encoding a polypeptide corresponding to a marker of the invention into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing 
the oocyte to develop in a pseudopregnant female foster animal. Intronic sequences and 
5 polyadenylation signals can also be included in the transgene to increase the efficiency 
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to 
particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the 

10 art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified 
based upon the presence of the transgene in its genome and/or expression of mRNA 

1 5 encoding the transgene in tissues or cells of the animals. A transgenic founder animal 
can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying the transgene can further be bred to other transgenic animals 
carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared which 

20 contains at least a portion of a gene encoding a polypeptide corresponding to a marker of 
the invention into which a deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the gene. In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the endogenous gene is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 

25 "knock out" vector). Alternatively, the vector can be designed such that, upon 

homologous recombination, the endogenous gene is mutated or otherwise altered but 
still encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous protein). In the homologous 
recombination vector, the altered portion of the gene is flanked at its 5 f and 3' ends by 

30 additional nucleic acid of the gene to allow for homologous recombination to occur 
between the exogenous gene carried by the vector and an endogenous gene in an 
embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
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length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5' and 3 ! ends) are included in the vector 
(see, e.g., Thomas and Capecchi, 1987, Cell 51:503 for a description of homologous 
recombination vectors). The vector is introduced into an embryonic stem cell line (e.g. , 
5 by electroporation) and cells in which the introduced gene has homologously 
recombined with the endogenous gene are selected (see, e.g., Li et al, 1992, Cell 
69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see, e.g., Bradley, Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, Ed., IRL, Oxford, 1987, pp. 

10 113-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 

1 5 vectors and homologous recombinant animals are described further in Bradley (1991) 
Current Opinion in Bio/Technology 2:823-829 and in PCT Publication NOS. WO 
90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 

20 example of such a system is the cre/loxP recombinase system of bacteriophage P 1 . For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. 
Natl. Acad. Set USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al, 1991, Science 
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the 

25 transgene, animals containing transgenes encoding both the Cre recombinase and a 

selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

30 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al (1997) Nature 385:810- 
813 and PCT Publication NOS. WO 97/07668 and WO 97/07669. 
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IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein 
as "active compounds") corresponding to a marker of the invention can be incorporated 
5 into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, 
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 

10 and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof 
in the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

15 The invention includes methods for preparing pharmaceutical compositions for 

modulating the expression or activity of a polypeptide or nucleic acid corresponding to a 
marker of the invention. Such methods comprise formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention. Such compositions can 

20 further include additional active agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating a phaimaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention and one or more additional 
active compounds. 

25 The invention also provides methods (also referred to herein as "screening 

assays") for identifying modulators, i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, peptoids, small molecules or other drugs) which (a) bind to 
the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect on the 
activity of the marker or, more specifically, (c) have a modulatory effect on the 

30 interactions of the marker with one or more of its natural substrates (e.g., peptide, 
protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the 
expression of the marker. Such assays typically comprise a reaction between the marker 
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and one or more assay components. The other components may be either the test 
compound itself, or a combination of test compound and a natural binding partner of the 
marker. 

The test compounds of the present invention may be obtained from any available 
5 source, including systematic libraries of natural and/or synthetic compounds. Test 
compounds may also be obtained by any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; peptoid libraries 
(libraries of molecules having the functionalities of peptides, but with a novel, non- 
peptide backbone which are resistant to enzymatic degradation but which nevertheless 

10 remain bioactive; see, e.g., Zuckermann et al, 1994, J. Med. Chem. 37:2678-85); 
spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one-compound' library method; and 
synthetic library methods using affinity chromatography selection. The biological 
library and peptoid library approaches are limited to peptide libraries, while the other 

1 5 four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, 1997, Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl. Acad. ScL U.S.A. 90:6909; Erb et 
al (1994) Proc. Natl Acad Sci. USA 91:11422; Zuckermann et al. (1994). J. Med. 

20 Chem. 37:2678; Cho etal. (1993) Science 261:1303; Can-ell et al. (1994) Angew. Chem. 
Int. Ed Engl 33:2059; Carell etal (1994) Angew. Chem. Int. Ed. Engl 33:2061; and in 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten, 1992, 
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 

25 1993, Nature 364:555-556), bacteria and/or spores, (Ladner, USP 5,223,409), plasmids 
(Cull et al 1 992, Proc Natl Acad Sci USA 89: 1 865-1 869) or on phage (Scott and Smith, 
1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al, 1990, 
Proc. Natl. Acad. Sci. 87:6378-6382; Felici, 1991,7. Mol Biol 222:301-310; Ladner, 
supra.). 

30 In one embodiment, the invention provides assays for screening candidate or test 

compounds which are substrates of a marker or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
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compounds which bind to a marker or biologically active portion thereof. Determining 
the ability of the test compound to directly bind to a marker can be accomplished, for 
example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to the marker can be determined by detecting the labeled 
5 marker compound in a complex. For example, compounds (e.g., marker substrates) can 
be labeled with 125 1, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope 
detected by direct counting of radioemission or by scintillation counting. Alternatively, 
assay components can be enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 

10 determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening candidate or 
test compounds which modulate the activity of a marker or a biologically active portion 
thereof. In all likelihood, the marker can, in vivo, interact with one or more molecules, 
such as but not limited to, peptides, proteins, hormones, cofactors and nucleic acids. For 

15 the purposes of this discussion, such cellular and extracellular molecules are referred to 
herein as "binding partners" or marker "substrate". 

One necessary embodiment of the invention in order to facilitate such screening 
is the use of the marker to identify its natural in vivo binding partners. There are many 
ways to accomplish this which are known to one skilled in the art. One example is the 

20 use of the marker protein as "bait protein" in a two-hybrid assay or three-hybrid assay 
(see, e.g., U.S. Patent No. 5,283,317; Zervos et al, 1993, Cell 72:223-232; Madura et ah 
1993, J. Biol. Chem. 268:12046-12054; Bartel .,1993, Biotechniques 14:920-924; 
Iwabuchi et al, 1993 Oncogene 8:1693-1696; Brent WO94/10300) in order to identify 
other proteins which bind to or interact with the marker (binding partners) and, 

25 therefore, are possibly involved in the natural function of the marker. Such marker 
binding partners are also likely to be involved in the propagation of signals by the 
marker or downstream elements of a marker-mediated signaling pathway. Alternatively, 
such marker binding partners may also be found to be inhibitors of the marker. 

The two-hybrid system is based on the modular nature of most transcription 

30 factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that encodes a 
marker protein fused to a gene encoding the DNA binding domain of a known 



WO 01/51628 PCT/US01/00798 

- 66 - 

transcription factor {e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a marker- 
5 dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 
the transcription factor. Expression of the reporter gene can be readily detected and cell 
colonies containing the functional transcription factor can be isolated and used to obtain 

1 0 the cloned gene which encodes the protein which interacts with the marker protein. 

In a further embodiment, assays may be devised through the use of the invention 
for the purpose of identifying compounds which modulate (e.g., affect either positively 
or negatively) interactions between a marker and its substrates and/or binding partners. 
Such compounds can include, but are not limited to, molecules such as antibodies, 

15 peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. Such 
compounds may also be obtained from any available source, including systematic 
libraries of natural and/or synthetic compounds. The preferred assay components for use 
in this embodiment is an breast cancer marker identified herein, the known binding 
partner and/or substrate of same, and the test compound. Test compounds can be 

20 supplied from any source. 

The basic principle of the assay systems used to identify compounds that interfere 
with the interaction between the marker and its binding partner involves preparing a 
reaction mixture containing the marker and its binding partner under conditions and for 
a time sufficient to allow the two products to interact and bind, thus forming a complex. 

25 In order to test an agent for inhibitory activity, the reaction mixture is prepared in the 
presence and absence of the test compound. The test compound can be initially included 
in the reaction mixture, or can be added at a time subsequent to the addition of the 
marker and its binding partner. Control reaction mixtures are incubated without the test 
compound or with a placebo. The formation of any complexes between the marker and 

30 its binding partner is then detected. The formation of a complex in the control reaction, 
but less or no such formation in the reaction mixture containing the test compound, 
indicates that the compound interferes with the interaction of the marker and its binding 
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partner. Conversely, the formation of more complex in the presence of compound than 
in the control reaction indicates that the compound may enhance interaction of the 
marker and its binding partner. 

The assay for compounds that interfere with the interaction of the marker with its 
5 binding partner may be conducted in a heterogeneous or homogeneous format. 

Heterogeneous assays involve anchoring either the marker or its binding partner onto a 
solid phase and detecting complexes anchored to the solid phase at the end of the 
reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In 
either approach, the order of addition of reactants can be varied to obtain different 

10 information about the compounds being tested. For example, test compounds that 
interfere with the interaction between the markers and the binding partners (e.g., by 
competition) can be identified by conducting the reaction in the presence of the test 
substance, i.e., by adding the test substance to the reaction mixture prior to or 
simultaneously with the marker and its interactive binding partner. Alternatively, test 

1 5 compounds that disrupt preformed complexes, e.g., compounds with higher binding 

constants that displace one of the components from the complex, can be tested by adding 
the test compound to the reaction mixture after complexes have been formed. The 
various formats are briefly described below. 

In a heterogeneous assay system, either the marker or its binding partner is 

20 anchored onto a solid surface or matrix, while the other corresponding non-anchored 

component may be labeled, either directly or indirectly. In practice, microtitre plates are 
often utilized for this approach. The anchored species can be immobilized by a number 
of methods, either non- oovalent or covalent, that are typically well known to one who 
practices the art. Non-oovalent attachment can often be accomplished simply by coating 

25 the solid surface with a solution of the marker or its binding partner and drying. 

Alternatively, an immobilized antibody specific for the assay component to be anchored 
can be used for this purpose. Such surfaces can often be prepared in advance and stored. 

In related embodiments, a fusion protein can be provided which adds a domain 
that allows one or both of the assay components to be anchored to a matrix. For 

30 example, glutathione-S-transferase/marker fusion proteins or glutathione-S- 

transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then 
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combined with the test compound or the test compound and either the non-adsorbed 
marker or its binding partner, and the mixture incubated under conditions conducive to 
complex formation (e.g., physiological conditions). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound assay components, the 
5 immobilized complex assessed either directly or indirectly, for example, as described 
above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
marker binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either a marker or a marker binding 

10 partner can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
marker protein or target molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, 
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). In certain embodiments, the protein-immobilized surfaces can be prepared in 

1 5 advance and stored. 

In order to conduct the assay, the corresponding partner of the immobilized assay 
component is exposed to the coated surface with or without the test compound. After 
the reaction is complete, unreacted assay components are removed (e.g., by washing) 
and any complexes formed will remain immobilized on the solid surface. The detection 

20 of complexes anchored on the solid surface can be accomplished in a number of ways. 
Where the non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the non- 
immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g., using a labeled antibody specific for the 

25 initially non-immobilized species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of 
addition of reaction components, test compounds which modulate (inhibit or enhance) 
complex formation or which disrupt preformed complexes can be detected. 

In an alternate embodiment of the invention, a homogeneous assay may be used. 

30 This is typically a reaction, analogous to those mentioned above, which is conducted in 
a liquid phase in the presence or absence of the test compound. The formed complexes 
are then separated from unreacted components, and the amount of complex formed is 
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determined. As mentioned for heterogeneous assay systems, the order of addition of 
reactants to the liquid phase can yield information about which test compounds 
modulate (inhibit or enhance) complex formation and which disrupt preformed 
complexes. 

5 In such a homogeneous assay, the reaction products may be separated from 

unreacted assay components by any of a number of standard techniques, including but 
not limited to: differential centrifrigation, chromatography, electrophoresis and 
immunoprecipitation. In differential centrifugation, complexes of molecules may be 
separated from uncomplexed molecules through a series of centrifugal steps, due to the 

1 0 different sedimentation equilibria of complexes based on their different sizes and 
densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993 
Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate 
complexed molecules from uncomplexed ones. For example, gel filtration 
chromatography separates molecules based on size, and through the utilization of an 

1 5 appropriate gel filtration resin in a column format, for example, the relatively larger 
complex may be separated from the relatively smaller uncomplexed components. 
Similarly, the relatively different charge properties of the complex as compared to the 
uncomplexed molecules may be exploited to differentially separate the complex from 
the remaining individual reactants, for example through the use of ion-exchange 

20 chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, 199S, JMol. Recognit 1 1:141-148; Hage and 
Tweed, 1997, J. Chromatogr. B. Biomed. Sci. AppL, 699:499-525). Gel electrophoresis 
may also be employed to separate complexed molecules from unbound species (see, 
e.g., Ausubel et al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, 

25 New York. 1999). In this technique, protein or nucleic acid complexes are separated 

based on size or charge, for example. In order to maintain the binding interaction during 
the electrophoretic process, non-denaturing gels in the absence of reducing agent are 
typically preferred, but conditions appropriate to the particular interactants will be well 
known to one skilled in the art. Immunoprecipitation is another common technique 

30 utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et 
al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York. 
1999). In this technique, all proteins binding to an antibody specific to one of the 
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binding molecules are precipitated from solution by conjugating the antibody to a 
polymer bead that may be readily collected by centrifugation. The bound assay 
components are released from the beads (through a specific proteolysis event or other 
technique well known in the art which will not disturb the protein-protein interaction in 
5 the complex), and a second immunoprecipitation step is performed, this time utilizing 
antibodies specific for the correspondingly different interacting assay component. In this 
manner, only formed complexes should remain attached to the beads. Variations in 
complex formation in both the presence and the absence of a test compound can be 
compared, thus offering information about the ability of the compound to modulate 

1 0 interactions between the marker and its binding partner. 

Also within the scope of the present invention are methods for direct detection of 
interactions between the marker and its natural binding partner and/or a test compound 
in a homogeneous or heterogeneous assay system without further sample manipulation. 
For example, the technique of fluorescence energy transfer may be utilized (see, e.g., 

1 5 Lakowicz et al } U.S. Patent No. 5,63 1 ,1 69; Stavrianopoulos et al, U.S. Patent No. 

4,868,103). Generally, this technique involves the addition of a fluorophore label on a 
first 'donor' molecule (e.g., marker or test compound) such that its emitted fluorescent 
energy will be absorbed by a fluorescent label on a second, 'acceptor' molecule (e.g., 
marker or test compound), which in turn is able to fluoresce due to the absorbed energy. 

20 Alternately, the 'donor' protein molecule may simply utilize the natural fluorescent 
energy of tryptophan residues. Labels are chosen that emit different wavelengths of 
light, such that the 'acceptor' molecule label may be differentiated from that of the 
'donor'. Since the efficiency of energy transfer between the labels is related to the 
distance separating the molecules, spatial relationships between the molecules can be 

25 assessed. In a situation in which binding occurs between the molecules, the fluorescent 
emission of the 'acceptor' molecule label in the assay should be maximal. An FET 
binding event can be conveniently measured through standard fluorometric detection 
means well known in the art (e.g., using a fluorimeter). A test substance which either 
enhances or hinders participation of one of the species in the preformed complex will 

30 result in the generation of a signal variant to that of background. In this way, test 

substances that modulate interactions between a marker and its binding partner can be 
identified in controlled assays. 
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In another embodiment, modulators of marker expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
mRNA or protein, corresponding to a marker in the cell, is determined. The level of 
expression of mRNA or protein in the presence of the candidate compound is compared 
5 to the level of expression of mRNA or protein in the absence of the candidate 

compound. The candidate compound can then be identified as a modulator of marker 
expression based on this comparison. For example, when expression, of marker mRNA 
or protein is greater (statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 

10 marker mRNA or protein expression. Conversely, when expression of marker mRNA 
or protein is less (statistically significantly less) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as an inhibitor of 
marker mRNA or protein expression. The level of marker mRNA or protein expression 
in the cells can be determined by methods described herein for detecting marker mRNA 

15 or protein. 

In another aspect, the invention pertains to a combination of two or more of the 
assays described herein. For example, a modulating agent can be identified using a cell- 
based or a cell free assay, and the ability of the agent to modulate the activity of a 
marker protein can be further confirmed in vivo, e.g., in a whole animal model for 

20 cellular transformation and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., an marker modulating agent, an antisense 

25 marker nucleic acid molecule, an marker-specific antibody, or an marker-binding 
partner) can be used in an animal model to determine the efficacy, toxicity, or side 
effects of treatment with such an agent. Alternatively, an agent identified as described 
herein can be used in an animal model to determine the mechanism of action of such an 
agent. Furthermore, this invention pertains to uses of novel agents identified by the 

30 above-described screening assays for treatments as described herein. 
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It is understood that appropriate doses of small molecule agents and protein or 
polypeptide agents depends upon a number of factors within the knowledge of the 
ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents will 
vary, for example, depending upon the identity, size, and condition of the subject or 
5 sample being treated, further depending upon the route by which the composition is to 
be administered, if applicable, and the effect which the practitioner desires the agent to 
have upon the nucleic acid or .polypeptide of the invention. Exemplary doses of a small 
molecule include milligram or microgram amounts per kilogram of subject or sample 
weight (e.g. about 1 microgram per kilogram to about 500 milligrams per kilogram, 

1 0 about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a 
protein or polypeptide include gram, milligram or microgram amounts per kilogram of 
subject or sample weight (e.g. about 1 microgram per kilogram to about 5 grams per 
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or 

15 about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore 
understood that appropriate doses of one of these agents depend upon the potency of the 
agent with respect to the expression or activity to be modulated. Such appropriate doses 
can be determined using the assays described herein. When one or more of these agents 
is to be administered to an animal (e.g. a human) in order to modulate expression or 

20 activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or 
researcher can, for example, prescribe a relatively low dose at first, subsequently 
increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific agent employed, the age, 

25 body weight, general health, gender, and diet of the subject, the time of administration, 
the route of administration, the rate of excretion, any drug combination, and th? degree 
of expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 

30 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
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components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine-tetraacetic 
5 acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic. 
Pharmaceutical compositions suitable for injectable use include sterile aqueous 

10 solutions (where water soluble) or dispersions and sterile powders for the 

extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 
water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 

1 5 syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 

20 be maintained, for example, by the use of a coating such as lecithin, by the maintenance 
of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 

25 for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 

30 (e.g., a polypeptide or antibody) in the required amount in an appropriate solvent with 
one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound 
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into a sterile vehicle which contains a basic dispersion medium, and then incorporating 
the required other ingredients from those enumerated above. In the case of sterile 
powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum drying and freeze-drying which yields a powder of the active 
5 ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 

10 used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or adjuvant materials can be 
included as part of the composition. The tablets, pills, capsules, troches, and the like can 

15 contain any of the following ingredients, or compounds of a similar nature: a binder 

such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch 
or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 

20 peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 

25 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 

30 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art 
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The compounds can also be prepared in the form of suppositories {e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
5 protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 

1 0 The materials can also be obtained commercially from Alza Corporation and Nova 

Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal 
antibodies incorporated therein or thereon) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S! Patent No. 4,522,81 1 . 

15 It is especially advantageous to formulate oral or parenteral compositions in 

dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 

20 pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 

25 (generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other - 
antibodies. Accordingly, lower dosages and less frequent administration is often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 

30 enhance uptake and tissue penetration (e.g. , into the breast epithelium). A method for 
lipidation of antibodies is described by Cruikshank et al (1997) 1 Acquired Immune 
Deficiency Syndromes and Human Retrovirology 14:193. 
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The nucleic acid molecules corresponding to a marker of the invention can be 
inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be 
delivered to a subject by 3 for example, intravenous injection, local administration (U.S. 
Patent 5,328,470), or by stereotactic injection (see, e.g., Chen et al 9 1994, Proc. Natl. 
5 Acad. Set USA 91 :3054-3057). The pharmaceutical preparation of the gene therapy 
vector can include the gene therapy vector in an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced intact from recombinant cells, 
e.g. retroviral vectors, the pharmaceutical preparation can include one or more cells 
10 which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Electronic Apparatus Readable Media and Arrays 

1 5 Electronic apparatus readable media comprising a breast cancer marker of the 

present invention is also provided. As used herein, "electronic apparatus readable 
media" refers to any suitable medium for storing, holding or containing data or 
information that can be read and accessed directly by an electronic apparatus. Such 
media can include, but are not limited to: magnetic storage media, such as floppy discs, 

20 hard disc storage medium, and magnetic tape; optical storage media such as compact 
disc; electronic storage media such as RAM, ROM, EPROM, EEPROM and the like; 
general hard disks and hybrids of these categories such as magnetic/optical storage 
media. The medium is adapted or configured for having recorded thereon a marker of 
the present invention. 

25 As used herein, the term "electronic apparatus" is intended to include any suitable 

computing or processing apparatus or other device configured or adapted for storing 
data or information. Examples of electronic apparatus suitable for use with the present 
invention include stand-alone computing apparatus; networks, including a local area 
network (LAN), a wide area network (WAN) Internet, Intranet, and Extranet; electronic 

30 appliances such as a personal digital assistants (PDAs), cellular phone, pager and the 
like; and local and distributed processing systems. 
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As used herein, "recorded" refers to aprocess for storing or encoding information 
on the electronic apparatus readable medium. Those skilled in the art can readily adopt 
any of the presently known methods for recording information on known media to 
generate manufactures comprising the markers of the present invention. 
5 A variety of software programs and formats can be used to store the marker 

information of the present invention on the electronic apparatus readable medium. For 
example, the nucleic acid sequence corresponding to the markers can be represented in a 
word processing text file, formatted in commercially-available software such as 
WordPerfect and MicroSoft Word, or represented in the form of an ASCII file, stored in 

10 a database application, such as DB2, Sybase, Oracle, or the like, as well as in other 

forms. Any number of dataprocessor structuring formats (e.g., text file or database) may 
be employed in order to obtain or create a medium having recorded thereon the markers 
of the present invention. 

By providing the markers of the invention in readable form, one can routinely 

15 access the marker sequence information for a variety of purposes. For example, one 

skilled in the art can use the nucleotide or amino acid sequences of the present invention 
in readable form to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to 
identify fragments or regions of the sequences of the invention which match a particular 

20 target sequence or target motif. 

The present invention therefore provides a medium for holding instructions for 
performing a method for determining, whether a subject has breast cancer or a pre- 
disposition to breast cancer, wherein the method comprises the steps of determining the 
presence or absence of a breast can cer marker and based on the presence or absence of 

25 the breast cancer marker, determining whether the subject has breast cancer or a pre- 
disposition to breast cancer and/or recommending a particular treatment for the breast 
cancer or pre- breast cancer condition. 

The present invention further provides in an electronic system and/or in a 
network, a method for determining whether a subject has breast cancer or a pre- 

30 disposition to breast cancer associated with a breast cancer marker wherein the method 
comprises the steps of determining the presence or absence of the breast cancer marker, 
and based on the presence or absence of the breast cancer marker, determining whether 
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the subject has breast cancer or a pre-disposition to breast cancer, and/or recommending 
a particular treatment for the breast cancer or pre- breast cancer condition. The method 
may further comprise the step of receiving phenotypic information associated with the 
subject and/or acquiring from a network phenotypic information associated with the 
5 subject. 

The present invention also provides in a network, a method for determining 
whether a subject has breast cancer or a pre-disposition to breast cancer associated with 
a breast cancer marker, said method comprising the steps of receiving information 
associated with the breast cancer marker receiving phenotypic information associated 

1 0 with the subject, acquiring information from the network corresponding to the breast 
cancer marker and/or breast cancer, and based on one or more of the phenotypic 
information, the breast cancer marker, and the acquired information, determining 
whether the subject has breast cancer or a pre-disposition to breast cancer. The method 
may further comprise the step of recommending a particular treatment for the breast 

1 5 cancer or pre- breast cancer condition. 

The present invention also provides a business method for determining whether a 
subject has breast cancer or a pre-disposition to breast cancer, said method comprising 
the steps of receiving information associated with the breast cancer marker, receiving 
phenotypic information associated with the subject, acquiring information from the 

20 network corresponding to the breast cancer marker and/or breast cancer, and based on 
one or more of the phenotypic information, the breast cancer marker, and the acquired 
information, determining whether the subject has breast cancer or a pre-disposition to 
breast cancer. The method may further comprise the step of recommending a particular 
treatment for the breast cancer or pre- breast cancer condition. 

25 The invention also includes an array comprising a breast cancer marker of the 

present invention. The array can be used to assay expression of one or more genes in 
the array. In one embodiment, the array can be used to assay gene expression in a tissue 
to ascertain tissue specificity of genes in the array. In this manner, up to about 7600 
genes can be simultaneously assayed for expression. This allows a profile to be 

30 developed showing a battery of genes specifically expressed in one or more tissues. 



WO 01/51628 PCT/US01/00798 

-79- 

In addition to such qualitative determination, the invention allows the quantitation 
of gene expression. Thus, not only tissue specificity, but also the level of expression of 
a battery of genes in the tissue is ascertainable. Thus, genes can be grouped on the basis 
of their tissue expression per se and level of expression in that tissue. This is useful, for 

5 example, in ascertaining the relationship of gene expression between or among tissues. 
Thus, one tissue can be perturbed and the effect on gene expression in a second tissue 
can be determined. In this context, the effect of one cell type on another cell type in 
response to a biological stimulus can be determined. Such a determination is useful, for 
example, to know the effect of cell-cell interaction at the level of gene expression. If an 

1 0 agent is administered therapeutically to treat one cell type but has an undesirable effect 
on another cell type, the invention provides an assay to determine the molecular basis of 
the undesirable effect and thus provides the opportunity to co-administer a counteracting 
agent or otherwise treat the undesired effect. Similarly, even within a single cell type, 
undesirable biological effects can be determined at the molecular level. Thus, the 

1 5 effects of an agent on expression of other than the target gene can be ascertained and 
counteracted. 

In another embodiment, the array can be used to monitor the time course of 
expression of one or more genes in the array. This can occur in various biological 
contexts, as disclosed herein, for example development of breast cancer, progression of 
20 breast cancer, and processes, such a cellular transformation associated with breast 
cancer. 

The array is also useful for ascertaining the effect of the expression of a gene on 
the expression of other genes in the same cell or in different cells , This provides, for 
example, for a selection of alternate molecular targets for therapeutic intervention if the 
25 ultimate or downstream target cannot be regulated. 

The array is also useful for ascertaining differential expression patterns of one or 
more genes in normal and abnormal cells. This provides a battery of genes that could 
serve as a molecular target for diagnosis or therapeutic intervention. 

30 VI. Predictive Medicine 

The present invention pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trails 
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are used for prognostic (predictive) purposes to thereby treat an individual 
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic 
assays for determining the level of expression of polypeptides or nucleic acids 
corresponding to one or more markers of the invention, in order to determine whether 
5 an individual is at risk of developing breast cancer. Such assays can be used for 

prognostic or predictive purposes to thereby prophylactically treat an individual prior to 
the onset of the cancer. 

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g. , drugs or other compounds administered either to inhibit breast cancer or to treat or 
10 prevent any other disorder {i.e. in order to understand any breast carcinogenic effects 
that such treatment may have} ) on the expression or activity of a marker of the 
invention in clinical trials. These and other agents are described in further detail in the 
following sections. 

15 A. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid corresponding to a marker of the invention in a biological sample involves 
obtaining a biological sample (e.g. a breast-associated body fluid) from a test subject 
and contacting the biological sample with a compound or an agent capable of detecting 

20 the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection 
methods of the invention can thus be used to detect mRNA, protein, cDNA, or genomic 
DNA, for example, in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a polypeptide corresponding to a 

25 marker of the invention include enzyme linked immunosorbent assays (ELISAs), 

Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for 
detection of genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of a polypeptide corresponding to a marker of the invention 
include introducing into a subject a labeled antibody directed against the polypeptide. 

30 For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 
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A general principle of such diagnostic and prognostic assays involves preparing 
a sample or reaction mixture that may contain a marker, and a probe, under appropriate 
conditions and for a time sufficient to allow the marker and probe to interact and bind, 
thus forming a complex that can be removed and/or detected in the reaction mixture. 
5 These assays can be conducted in a variety of ways. 

For example, one method to conduct such an assay would involve anchoring the 
marker or probe onto a solid phase support, also referred to as a substrate, and detecting 
target marker/probe complexes anchored on the solid phase at the end of the reaction. 
In one embodiment of such a method, a sample from a subject, which is to be assayed 

10 for presence and/or concentration of marker, can be anchored onto a carrier or solid 
phase support. In another embodiment, the reverse situation is possible, in which the 
probe can be anchored to a solid phase and a sample from a subject can be allowed to 
react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to a solid 

15 phase. These include, without limitation, marker or probe molecules which are 

immobilized through conjugation of biotin and streptavidin. Such biotinylated assay 
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In 

20 certain embodiments, the surfaces with immobilized assay components can be prepared 
in advance and stored. 

Other suitable carriers or solid phase supports for such assays include any 
material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 

25 nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the non- 
immobilized component is added to the solid phase upon which the second component 
is anchored. After the reaction is complete, uncomplexed components may be removed 

30 (e.g. , by washing) under conditions such that any complexes formed will remain 

immobilized upon the solid phase. The detection of marker/probe complexes anchored 
to the solid phase can be accomplished in a number of methods outlined herein. 
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In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
directly or indirectly, with detectable labels discussed herein and which are well-known 
to one skilled in the art. 
5 It is also possible to directly detect marker/probe complex formation without 

further manipulation or labeling of either component (marker or probe), for example by 
utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et 
al 9 U.S. Patent No. 5,631,169; Stavrianopoulos, et a/., U.S. Patent No. 4,868,103). A 
fluorophore label on the first, 'donor' molecule is selected such that, upon excitation 

1 0 with incident light of appropriate wavelength, its emitted fluorescent energy will be 

absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is able to 
fluoresce due to the absorbed energy. Alternately, the 'donor' protein molecule may 
simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen 
that emit different wavelengths of light, such that the 'acceptor' molecule label may be 

1 5 differentiated from that of the 'donor'. Since the efficiency of energy transfer between 
the labels is related to the distance separating the molecules, spatial relationships 
between the molecules can be assessed. In a situation in which binding occurs between 
the molecules, the fluorescent emission of the 'acceptor 5 molecule label in the assay 
should be maximal. An FET binding event can be conveniently measured through 

20 standard fluorometric detection means well known in the art (e.g. , using a fluorimeter). 

In another embodiment, determination of the ability of a probe to recognize a 
marker can be accomplished without labeling either assay component (probe or marker) 
by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA) 
(see, Sjolander, S. and Urbaniczky, C, 1991, Anal. Chem. 63:2338-2345 and 

25 Szabo et al, 1995, Curr. Opin. Struct. Biol. 5:699-705). Af; used herein, "BIA" or 

"surface plasmon resonance" is a technology for studying biospecific interactions in real 
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the 
binding surface (indicative of a binding event) result in alterations of the refractive index 
oflight near the surface (the optical phenomenon of surface plasmon resonance (SPR)), 

30 resulting in a detectable signal which can be used as an indication of real-time reactions 
between biological molecules. 
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Alternatively, in another embodiment, analogous diagnostic and prognostic 
assays can be conducted with marker and probe as solutes in a liquid phase. In such an 
assay, the complexed marker and probe are separated from uncomplexed components by 
any of a number of standard techniques, including but not limited to: differential 
5 centrifugation, chromatography, electrophoresis and immunoprecipitation. In 
differential centrifugation, marker/probe complexes may be separated from 
uncomplexed assay components through a series of centrifugal steps, due to the different 
sedimentation equilibria of complexes based on their different sizes and densities (see, 
for example, Rivas, G., and Minton, A.P., 1 993, Trends Biochem Sci 1 8(8):284-7). 

10 Standard chromatographic techniques may also be utilized to separate complexed 
molecules from uncomplexed ones. For example, gel filtration chromatography 
separates molecules based on size, and through the utilization of an appropriate gel 
filtration resin in a column format, for example, the relatively larger complex may be 
separated from the relatively smaller uncomplexed components. Similarly, the 

15 relatively different charge properties of the marker/probe complex as compared to the 
uncomplexed components may be exploited to differentiate the complex from 
uncomplexed components, for example through the utilization of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. Mol Recognit. Winter 1 1(1- 

20 6):141-8; Hage, D.S., and Tweed, S.A. JChromatogr B Biomed Sci Appl 1997 Oct 
10;699(l-2):499-525). Gel electrophoresis may also be employed to separate 
complexed assay components from unbound components (see, e.g., Ausubel et al, ed., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). 
In this technique, protein or nucleic acid complexes are separated based on size or 

25 charge, for example. In order to maintain the binding interaction during the 

electrophoretic process, non-denaturing gel matrix materials and conditions in the 
absence of reducing agent are typically preferred. Appropriate conditions to the 
particular assay and components thereof will be well known to one skilled in the art. 
In a particular embodiment, the level of mRNA corresponding to the marker can 

30 be determined both by in situ and by in vitro formats in a biological sample using 

methods known in the art. The term "biological sample" is intended to include tissues, 
cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, 
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cells and fluids present within a subject. Many expression detection methods use 
isolated RNA. For in vitro methods, any RNA isolation technique that does not select 
against the isolation of mRNA can be utilized for the purification of RNA from breast 
cells (see, e.g., Ausubel et al 9 ed., Current Protocols in Molecular Biology, John Wiley 
5 & Sons, New York 1987-1999). Additionally, large numbers of tissue samples can 
readily be processed using techniques well known to those of skill in the art, such as, for 
example, the single-step RNA isolation process of Chomczynski (1 989, U.S. Patent No. 
4,843,155). 

Hie isolated mRNA can be used in hybridization or amplification assays that 

10 include, but are not limited to, Southern or Northern analyses, polymerase chain reaction 
analyses and probe arrays. One preferred diagnostic method for the detection of mRNA 
levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that 
can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe 
can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide 

15 of at least 7, 1 5, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 

specifically hybridize under stringent conditions to a mRNA or genomic DNA encoding 
a marker of the present invention. Other suitable probes for use in the diagnostic assays 
of the invention are described herein. Hybridization of an mRNA with the probe 
indicates that the marker in question is being expressed. 

20 In one format, the mRNA is immobilized on a solid surface and contacted with a 

probe, for example by running the isolated mRNA on an agarose gel and transferring the 
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the 
probe(s) are immobilized on a solid surface and the mRNA is contacted with the 
probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily 

25 adapt known mRNA detection methods for use in detecting the level of mRNA encoded 
by the markers of the present invention. 

An alternative method for determining the level of mRNA corresponding to a 
marker of the present invention in a sample involves the process of nucleic acid 
amplification, e.g., by rtPCR (the experimental embodiment set forth in Mullis, 1987, 

30 U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 1991, Proc. Natl. Acad. Sci. 
USA, 88:189-193), self sustained sequence replication (Guatelli et al, 1990, Proc. Natl. 
Acad Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et aL, 1989, 
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Proc. Natl Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi etal, 1988, 
Bio/Technology 6:1 197), rolling circle replication (Lizardi et al, U.S. Patent No. 
5,854,033) or any other nucleic acid amplification method, followed by the detection of 
the amplified molecules using techniques well known to those of skill in the art. These 
5 detection schemes are especially useful for the detection of nucleic acid molecules if 
such molecules are present in very low numbers. As used herein, amplification primers 
are defined as being a pair of nucleic acid molecules that can anneal to 5 1 or 3' regions of 
a gene (plus and minus strands, respectively, or vice- versa) and contain a short region in 
between. In general, amplification primers are from about 10 to 30 nucleotides in length 

10 and flank a region from about 50 to 200 nucleotides in length. Under appropriate 
conditions and with appropriate reagents, such primers permit the amplification of a 
nucleic acid molecule comprising the nucleotide sequence flanked by the primers. 

For in situ methods, mRNA does not need to be isolated from the breast cells 
prior to detection. In such methods, a cell or tissue sample is.prepared/processed using 

1 5 known histological methods. The sample is then immobilized on a support, typically a 
glass slide, and then contacted with a probe that can hybridize to mRNA that encodes 
the marker. 

As an alternative to making determinations based on the absolute expression level 
of the marker, determinations may be based on the normalized expression level of the 

20 marker. Expression levels are normalized by correcting the absolute expression level of 
a marker by comparing its expression to the expression of a gene that is not a marker, 
e.g., a housekeeping gene that is constitutively expressed. Suitable genes for 
normalization include housekeeping genes such as the actin gene, or epithelial cell- 
specific genes. This normalization allows the comparison of the expression level in one 

25 sample, e.g. , a patient sample, to another sample, e.g. , a non-breast cancer sample, or 
between samples from different sources. 

Alternatively, the expression level can be provided as a relative expression level. 
To determine a relative expression level of a marker, the level of expression of the 
marker is determined for 10 or more samples of normal versus cancer cell isolates, 

30 preferably 50 or more samples, prior to the determination of the expression level for the 
sample in question. The mean expression level of each of the genes assayed in the 
larger number of samples is determined and this is used as a baseline expression level 
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for the marker. The expression level of the marker determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained for 
that marker. This provides a relative expression level. 

Preferably, the samples used in the baseline determination will be from breast 
5 cancer or from non-breast cancer cells of breast tissue. The choice of the cell source is 
dependent on the use of the relative expression level. Using expression found in noimal 
tissues as a mean expression score aids in validating whether the marker assayed is 
breast specific (versus normal cells). In addition, as more data is accumulated, the mean 
expression value can be revised, providing improved relative expression values based on 

1 0 accumulated data. Expression data from breast cells provides a means for grading the 
severity of the breast cancer state. 

In another embodiment of the present invention, a polypeptide corresponding to a 
marker is detected. A preferred agent for detecting a polypeptide of the invention is an 
antibody capable of binding to a polypeptide corresponding to a marker of the invention, 

1 5 preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab') 2 ) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe 

20 or antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 
detected with fluorescently labeled streptavidin. 

Proteins from breast cells can be isolated using techniques that are well known to 

25 those of skill in the art. The protein isolation methods employed can, for example, be 
such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York). 

A variety of formats can be employed to determine whether a sample contains a 
30 protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can 
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readily adapt known protein/antibody detection methods for use in determining whether 
breast cells express a marker of the present invention. 

In one format, antibodies, or antibody fragments, can be used in methods such as 
Western blots or immunofluorescence techniques to detect the expressed proteins. In 
5 such uses, it is generally preferable to immobilize either the antibody or proteins on a 
solid support. Suitable solid phase supports or carriers include any support capable of 
binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. 

1 0 One skilled in the art will know many other suitable carriers for binding antibody 

or antigen, and will be able to adapt such support for use with the present invention. For 
example, protein isolated from breast cells can be run on a polyacrylamide gel 
electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The 
support can then be washed with suitable buffers followed by treatment with the 

1 5 • detectably labeled antibody. The solid phase support can then be washed with the buffer 
a second time to remove unbound antibody. The amount of bound label on the solid 
support can then be detected by conventional means. 

The invention also encompasses kits for detecting the presence of a polypeptide 
or nucleic acid corresponding to a marker of the invention in a biological sample (e.g. an 

20 breast-associated body fluid). Such kits can be used to determine if a subject is 

suffering Ixom or is at increased risk of developing breast cancer. For example, the kit 
can comprise a labeled compound or agent capable of detecting a polypeptide or an 
mRNA encoding a polypeptide corresponding to a marker of the invention in a 
biological sample and means for determining the amount of the polypeptide or mRNA in 

25 the sarrple (e.g., an antibody which binds the polypeptide or an oligonucleotide probe 
which binds to DNA or mRNA encoding the polypeptide). Kits can also include 
instructions for interpreting the results obtained using the kit. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide corresponding to a 

30 marker of the invention; and, optionally, (2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 
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For oligonucleotide-based kits, the kit can comprise, for example: (1) an 
oligonucleotide, e.g. , a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide corresponding to a marker of the invention or (2) 
a pair of primers useful for amplifying a nucleic acid molecule corresponding to a 
5 marker of the invention. The kit can also comprise, e.g. 9 a buffering agent, a 

preservative, or a protein stabilizing agent. The kit can further comprise components 
necessary for detecting the detectable label (e.g., an enzyme or a substrate). The kit can 
also contain a control sample or a series of control samples which can be assayed and 
compared to the test sample. Each component of the kit can be enclosed within an 
10 individual container and all of the various containers can be within a single package, 
along with instructions for interpreting the results of the assays performed using the kit. 

B. Pharmacogenomics 

Agents or modulators which have a stimulatory or inhibitory effect on expression 

15 of a marker of the invention can be administered to individuals to treat (prophylactically 
or therapeutically) breast cancer in the patient. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype 
and that individual's response to a foreign compound or drug) of the individual may be 
considered. Differences in metabolism of therapeutics can lead to severe toxicity or 

20 therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

25 Accordingly, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

Pharmacogenomics deals with clinically significant variations in the response to 
drugs due to altered drug disposition and abnormal action in affected persons. See, e.g., 

30 Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body are referred to as "altered drug action." Genetic 
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conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the 
5 main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes {e.g., N-acetyltransf erase 2 (NAT 

10 2) and cytochrome P450 enzymes CYP2D6 and CYP2C 1 9) has provided an explanation 
as to why some patients do' not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
. metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 

1 5 different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C 1 9 quite frequently 
experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic 

20 response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 
formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 
Thus, the level of expression of a marker of the invention in an individual can be 

25 determined to thereby select appropriate agent(s) for therapeutic or prophylactic 

treatment of the individual. In addition, pharmacogenetic studies can be used to apply 
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 
identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure 

30 and thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
modulator of expression of a marker of the invention. 
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This invention also provides a process for preparing a database comprising at 
least one of the markers set forth in Tables 1-6. For example, the polynucleotide 
sequences are stored in a digital storage medium such that a data processing system for 
standardized representation of the genes that identify a breast cancer cell is compiled. 
5 The data processing system is useful to analyze gene expression between two cells by 
first selecting a cell suspected of being of a neoplastic phenotype or genotype and then 
isolating polynucleotides from the cell. The isolated polynucleotides are sequenced. 
The sequences from the sample are compared with the sequence(s) present in the 
database using homology search techniques. Greater than 90%, more preferably greater 

10 than 95% and more preferably, greater than or equal to 97% sequence identity between 
the test sequence and the polynucleotides of the present invention is a positive indication 
that the polynucleotide has been isolated from a breast cancer cell as defined above. 

In an alternative embodiment, the polynucleotides of this invention are sequenced 
and the information regarding sequence and in some embodiments, relative expression, 

15 is stored in any functionally relevant program, e.g., in Compare Report using the SAGE 
software (available though Dr. Ken Kinzler at John Hopkins University). The Compare 
Report provides a tabulation of the polynucleotide sequences and their abundance for 
the samples normalized to a defined number of polynucleotides per library (say 25,000). 
This is then imported into MS-ACCESS either directly or via copying the data into an 

20 Excel spreadsheet first and then from there into MS-ACCESS for additional 

manipulations. Other programs such as SYBASE or Oracle that permit the comparison 
of polynucleotide numbers could be used as alternatives to MS-ACCESS. 
Enhancements to the software can be designed to incorporate these additional functions. 
The5,e functions consist in standard Boolean, algebraic, and text search operations, 

25 applied in various combinations to reduce a large input set of polynucleotides to a 
manageable subset of a polynucleotide of specifically defined interest. 

One skilled in the art may create groups containing one or more project(s) by 
combining the counts of specific polynucleotides within a group (e.g., GroupNormal = 
Normall + Normal2, GroupTumorl + TumorCellLine). Additional characteristic values 

30 are also calculated for each tag in the group (e.g., average count, minimum count, 
maximum count). One skilled in the art may calculate individual tag count ratios 
between groups, for example the ratio of the average GroupNormal count to the average 
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GroupTumor count for each polynucleotide. A statistical measure of the significance of 
observed differences in tag counts between groups may be calculated. 

C. Monitoring Clinical Trials 

5 Monitoring the influence of agents (e.g., drug compounds) on the level of 

expression of a marker of the invention can be applied not only in basic drug screening, 
but also in clinical trials. For example, the effectiveness of an agent to affect marker 
expression can be monitored in clinical trials of subjects receiving .treatment for breast 
cancer. In a preferred embodiment, the present invention provides a method for 

10 monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression of one 
or more selected markers of the invention in the pre-administration sample; (iii) 

15 obtaining one or more post-administration samples from the subject; (iv) detecting the 
level of expression of the marker(s) in the post-administration samples; (v) comparing 
the level of expression of the marker(s) in the pre-administration sample with the level 
of expression of the marker(s) in the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 

20 increased administration of the agent can be desirable to increase expression of the 
marker(s) to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent can be desirable to decrease 
expression of the marker(s) to lower levels than detected, i.e., to decrease the. 
effectiveness of the agent. 

25 

D. Surrogate Markers 

The markers of the invention may serve as surrogate markers for one or more 
disorders or disease states or for conditions leading up to disease states, and in 
particular, breast cancer. As used herein, a "surrogate marker" is an objective 
30 biochemical marker which correlates with the absence or presence of a disease or 
disorder, or with the progression of a disease or disorder (e.g., with the presence or 
absence of a tumor). The presence or quantity of such markers is independent of the 
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disease. Therefore, these markers may serve to indicate whether a particular course of 
treatment is effective in lessening a disease state or disorder. Surrogate markers are of 
particular use when the presence or extent of a disease state or disorder is difficult to 
assess through standard methodologies (e.g., early stage tumors), or when an assessment 
5 of disease progression is desired before a potentially dangerous clinical endpoint is 
reached (e.g., an assessment of cardiovascular disease may be made using cholesterol 
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV 
RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes 
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate 

10 markers in the art include: Koomen et ah (2000) J. Mass. Spectrom. 35: 258-264; and 
James (1994) AIDS Treatment News Archive 209. 

The markers of the invention are also useful as pharmacodynamic markers. As 
used herein, a "pharmacodynamic marker" is an objective biochemical marker which 
correlates specifically with drug effects. The presence or quantity of a 

1 5 pharmacodynamic marker is not related to the disease state or disorder for which the 
drug is being administered; therefore, the presence or quantity of the marker is 
indicative of the presence or activity of the drug in a subject. For example, a 
pharmacodynamic marker may be indicative of the concentration of the drug in a 
biological tissue, in that the marker is either expressed or transcribed or not expressed or 

20 transcribed in that tissue in relationship to the level of the drug. In this fashion, the 
distribution or uptake of the drug may be monitored by the pharmacodynamic marker. 
Similarly, the presence or quantity of the pharmacodynamic marker may be related to 
the presence or quantity of the metabolic product of a drug, such that the presence or 
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo. 

25 Pharmacodynamic markers are of particular use in increasing the sensitivity of detection 
of drug effects, particularly when the drug is administered in low doses. Since even a 
small amount of a drug may be sufficient to activate multiple rounds of marker 
transcription or expression, the amplified marker may be in a quantity which is more 
readily detectable than the drug itself. Also, the marker may be more easily detected 

30 due to the nature of the marker itself; for example, using the methods described herein, 
antibodies may be employed in an immune-based detection system for a protein marker, 
or marker-specific radiolabeled probes may be used to detect a mRNA marker. 
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Furthermore, the use of a pharmacodynamic marker may offer mechanism-based 
prediction of risk due to drug treatment beyond the range of possible direct 
observations. Examples of the use of pharmacodynamic markers in the art include: 
Matsuda et al US 6,033,862; Hattis et al (1991) Env. Health Perspect. 90: 229-238; 
5 Schentag (1999) Am J. HealthSyst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999) 
Am, J. Health-Syst. Pharm, 56 Suppl. 3: S16-S20. 

The markers of the invention are also useful as pharmacogenomic markers. As 
used herein, a "pharmacogenomic marker" is an objective biochemical marker which 
correlates with a specific clinical drug response or susceptibility in a subject (see, e.g., 

10 McLeod et al (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of 
the pharmacogenomic marker is related to the predicted response of the subject to a 
specific drug or class of drugs prior to administration of the drug. By assessing the 
presence or quantity of one or more pharmacogenomic markers in a subject, a drug 
therapy which is most appropriate for the subject, or which is predicted to have a greater 

15 degree of success, may be selected. For example, based on the presence or quantity of 
RNA or protein for specific tumor markers in a subject, a drug or course of treatment 
may be selected that is optimized for the treatment of the specific tumor likely to be 
present in the subject. Similarly, the presence or absence of a specific sequence 
mutation in marker DNA may correlate with drug response. The use of 

20 pharmacogenomic markers therefore permits the application of the most appropriate 
treatment for each subject without having to administer the therapy. 

VII Experimental Protocol 

25 A. Subtracted Libraries and Transcript Profiling 

Subtracted libraries are generated using a PCR based method that allows the 
isolation of clones expressed at higher levels in one population of mRNA (tester) 
compared to another population (driver). Both tester and driver mRNA populations are 
converted into cDNA by reverse transcription, and then PCR amplified using the 

30 SMART PCR kit from Clontech. Tester and driver cDNAs are then hybridized using 
the PCR-Select cDNA subtraction kit from Clontech. This technique results in both 
subtraction and normalization, which is an equalization of copy number of low- 
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abundance and high-abundance sequences. After generation of the subtractive libraries, 
a group of 96 or more clones from each library is tested to confirm differential 
expression by reverse Southern hybridization. 

5 B. Proteomics 

Proteins that are secreted by normal and transformed cells in culture are analyzed 
to identify those proteins that are likely to be secreted by cancerous cells into body 
fluids. Supernatants are isolated and MWT-CO filters are used to simplify the mixture 
of proteins. The proteins are then digested with trypsin. The tryptic peptides are loaded 

10 onto a microcapillary HPLC column where they are separated, and eluted directly into 
an ion trap mass spectrometer, through a custom-made electrospray ionization source. 
Throughout the gradient, sequence data is acquired through fragmentation of the four 
most intense ions (peptides) that elute off the column, while dynamically excluding 
those that have already been fragmented. In this way, approximately 2000 scans worth 

1 5 of sequence data are obtained, corresponding to approximately 50 to 200 different 
proteins in the sample. These data are searched against databases using correlation 
analysis tools, such as MS-Tag, to identify the proteins in the supernatants. 

In addition, protein profiling experiments are undertaken to assess whether the 
proteins associated with the expression of individual markers of the invention are 

20 secreted. Transcriptional profiling experiments are performed on fractions of RNA that 
are obtained from either (a) endoplasmic reticulum-associated (ER-associated) 
ribosomes, or (b) free ribosomes. Eukaryotic RNA which is isolated from ER-associated 
ribosomes tends to encode secreted and membrane bound protein? rather than 
intracellular proteins. Accordingly, markers of the invention which exhibit significantly 

25 enhanced expression in fractions of RNA from ER-associated ribosomes (in comparison 
with RNA from free ribosomes) are predicted to be associated with secreted proteins. 

Vin. Summary Of The Data Provided In The Tables 

Table 1 shows 4068 novel nucleotide sequences identified through subtracted 
30 library experiments. The sequences of Table 1 were reinterpreted and those sequences 
are set forth in Tables 3 and 5. These sequences were determined to be novel through 
various BLAST searches of the available databases. 
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The library source for SEQ ID NOS: 1-675 was breast cancer cell cultures 
(ascites and pleural fluid cultures) versus normal (i.e., non-cancerous) human epithelial 
mammary cell lines (HMEC). SEQ ID NOS: 1-470 are preferred and SEQ ID NOS: 1- 
3 1 5 are most preferred. 
5 The library source for SEQ ID NOS: 676-1644 was cancer tissue samples 

(clinical invasive lobular carcinomas (ILC)) versus normal breast tissue samples. SEQ 
ID NOS: 676-890 and 1056-1363 are preferred and SEQ ID NOS: 676-792 and 1056- 
1254 are most preferred. 

The library source for SEQ ID NOS: 1645-2941 was cancer tissue samples 

10 (clinical invasive ductal carcinomas (IDC)) versus normal breast tissue samples. SEQ 
ID NOS: 1645-2454 are preferred and SEQ ID NOS: 1645-2124 are most preferred. 

The library source for SEQ ID NOS: 2942-4068 was cancer tissue samples 
(clinical ductal carcinomas in situ (DCIS)) versus normal breast tissue samples. SEQ ID 
NOS: 2942-3626 are preferred and SEQ ID NOS: 2942-3351 are most preferred. 

15 Table 2 shows 4843 novel nucleotide sequences identified through subtracted 

library experiments. The sequences of Table 2 were reinterpreted and those sequences 
are set forth in Tables 4 and 5. These sequences were determined to be novel through 
various BLAST searches of the available databases. 

The tester source for SEQ ID NOS: 1-64, 1960-1976 and 3038-3080 was 

20 aggressive breast tumor cell lines and the driver source was indolent breast tumor cell 
lines (detects markers upregulated in more aggressive tumors). 

The tester source for SEQ ID NOS: 65-72, 1879, 1977-2004 and 3081-3127 was 
indolent breast tumor cell lines and the driver source was aggressive breast tumor cell 
lines (detects markers upregulated in more indolent tumors). 

25 The tester source for SEQ ID NOS: 73-629, 1880-1894, 2005-2296 and 3128- 

3471 was poor clinical outcome breast tumors and the driver source was good clinical 
outcome breast tumors (detects markers upregulated in more aggressive tumors). "Poor 
clinical outcome" is defined as the patient suffering disease recurrence following 
surgery within a period of less than five years. "Good clinical outcome" is defined as 

30 the patient remaining disease free for at least five years or more following surgery. 
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The tester source for SEQ ID NOS: 630-862, 1895-1900, 2297-2385 and 3472- 
3602 was good clinical outcome breast tumors and the driver source was poor clinical 
outcome breast tumors (detects markers upregulated in more indolent tumors). 

The tester source for SEQ ID NOS: 863-1262, 1901-1910, 2386-2567 and 3602- 
5 3988 was breast tumor lymph node metastasis and the driver source was indolent 
(colloid and tubular) breast tumor samples (detects markers upregulated in more 
aggressive tumors). 

The tester source for SEQ ID NOS: 1263-1392, 191 1-1916, 2568-2735 and 3989- 
43 19 was indolent (colloid and tubular) breast tumor samples and the driver source was 
1 0 breast tumor lymph node metastasis (detects markers upregulated in more indolent 
tumors). 

The tester source for SEQ ID NOS: 1393-1638, 1917-1943, 2736-2940 and 4320- 
4604 was T1N1 breast tumors (tumors 2.0 cm or less in greatest dimension with regional 
lymph node metastasis) and the driver source was T1N0 breast tumors (tumors 2.0 cm or 
1 5 less in greatest dimension with no regional lymph node metastasis), good clinical 
outcome (detects markers upregulated in more aggressive tumors. 

The tester source for SEQ ID NOS: 1639-1878, 1944-1959, 2941-3037 and 4605- 
4843 was T1N0 breast tumors with good clinical outcome and the driver source was 
T1N1 breast tumors (detects markers upregulated in more indolent tumors). 
20 Table 6 shows novel nucleotide sequences shown to be associated with breast 

cancer. 

The contents of all references, patents, published patent applications, and 
database records cited throughout this application are hereby incorporated by reference. 

25 Other Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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What is claimed is; 

Claims 

5 

1 . An isolated nucleic acid molecule selected from the group consisting of: 
a) a nucleic acid molecule comprising a nucleotide sequence which is at 
least 90% homologous to a nucleotide sequence of Tables 1-6, or a complement 
thereof; 

10 b) a nucleic acid molecule comprising a fragment of a nucleic acid 

comprising a nucleotide sequence of Tables 1-6, or a complement thereof; and 

c) a nucleic acid molecule comprising a nucleotide sequence of 
Tables 1-6, or a complement thereof. 

15 2. A vector which contains a nucleic acid molecule of claim 1 . 

3. A host cell which contains a nucleic acid molecule of claim 1 . 

4. An isolated polypeptide which is encoded by a nucleic acid molecule 
20 comprising a nucleotide sequence which is at least 90% homologous to a nucleic 

acid comprising a nucleotide sequence of Tables 1-6. 

5. An antibody which selectively binds to a polypeptide of claim 4. 

25 6. A method for producing a polypeptide comprising culturing the host cell 

of claim 3 under conditions in which the nucleic acid molecule is expressed. 



WO 01/51628 PCT/US01/00798 

-98- 

7. A method for detecting the presence of a polypeptide of claim 4 in a 
sample comprising: 

a) contacting the sample with a compound which selectively binds to the 
polypeptide; and 

5 b) determining whether the compound binds to the polypeptide in the 

sample to thereby detect the presence of a polypeptide of claim 4 in the sample. 

8. A kit comprising a compound which selectively binds to the polypeptide 
of claim 4. 

9. A method for detecting the presence of a nucleic acid molecule of claim 1 
in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to a nucleic 
acid molecule in the sample to thereby detect the presence of a nucleic acid molecule of 
claim 1 in the sample. 

10. The method of claim 9, wherein the sample comprises mRNA molecules 
20 and is contacted with a nucleic acid probe. 

11. The method of claim 9, wherein the sample is isolated from breast tissue. 

12. The method of claim 9, wherein the sample is a tumor sample. 

25 

13. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of claim 1 . 
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14. A method of assessing whether a patient is afflicted with breast cancer, 
the method comprising comparing: 

a) the level of expression of a marker in a patient sample, wherein the 
marker is selected from the group consisting of the markers listed in Tables 1-6, and 
5 b) the normal level of expression of the marker in a control non-breast 

cancer sample, 

wherein a significant difference between the level of expression of the 
marker in the patient sample and the normal level is an indication that the patient is 
afflicted with breast cancer. 

10 

15. The method of claim 14, wherein the marker corresponds to a secreted 

protein. 

16. The method of claim 14, wherein the marker corresponds to a transcribed 
1 5 polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

17. The method of claim 14, wherein the sample comprises cells obtained 
from the patient. 

20 18. The method of claim 17, wherein the sample is a breast tissue. 

19. The method of claim 1 7, wherein the cells are in a fluid selected from the 
group consisting of blood fluid, lymph, ascitic fluid, cystic fluid, urine, a breast exudate 
and a nipple aspirate. 

25 

2C. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a protein 
corresponding to the marker. 



30 21. The method of claim 15, wherein the presence of the protein is detected 

using a reagent which specifically binds with the protein. 
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22. The method of claim 2 1 , wherein the reagent is selected from the group 
consisting of an antibody, an antibody derivative, and an antibody fragment. 



23. The method of claim 14, wherein the level of expression of the marker in 
5 the sample is assessed by detecting the presence in the sample of a transcribed 

polynucleotide or portion thereof, wherein the transcribed polynucleotide comprises the 
marker. 



24. The method of claim 23, wherein the transcribed polynucleotide is an 
10 mRNA. 

25. The method of claim 23, wherein the transcribed polynucleotide is a 

cDNA. 



1 5 26. The method of claim 23, wherein the step of detecting further comprises 

amplifying the transcribed polynucleotide. 

27. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a transcribed 
20 polynucleotide which anneals with the marker or anneals with a portion of a 

polynucleotide wherein the polynucleotide comprises the marker, under stringent 
hybridization conditions. 



28. The method of claim 14, wherein the level of expression of the marker in 
25 the sample differs from the normal level of expression of the marker in a patient not 
afflicted with breast cancer by a factor of at least about 2. 



30 



29. The method of claim 14, wherein the level of expression of the marker in 
the sample differs from the normal level of expression of the marker in a patient not 
afflicted with breast cancer by a factor of at least about 5. 
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30. The method of claim 14, comprising comparing: 

a) the level of expression in the sample of each of a plurality of markers 
independently selected from the markers listed in Tables 1-6, and 

b) the normal level of expression of each of the plurality of markers in 

5 samples of the same type obtained from control humans not afflicted with breast cancer, 
wherein the level of expression of more than one of the markers is 
significantly altered, relative to the corresponding normal levels of expression of the 
markers, is an indication that the patient is afflicted with breast cancer. 



10 31. The method of claim 30, wherein the level of expression of each of the 

markers is significantly altered, relative to the corresponding normal levels of 
expression of the markers, is an indication that the patient is afflicted with breast cancer. 

32. The method of claim 30, wherein the plurality comprises at least three of 
15 the markers. 

33. The method of claim 30, wherein the plurality comprises at least five of 
the markers. 



20 34. A method for monitoring the progression of breast cancer in a patient, the 

method comprising: 

a) detecting in a patient sample at a first point in time, the expression of a 
marker, wherein the marker is selected from the group consisting cf the markers listed in 
Tables 1-6; 

25 b) repeating step a) at a subsequent point in time; raid 

c) comparing the level of expression detected in rteps a) and b), and 
therefrom monitoring the progression of breast cancer in the patient. 



35. The method of claim 34, wherein the marker corresponds to a secreted 
30 protein. 
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36. The method of claim 34, wherein marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

37. The method of claim 34, wherein the sample comprises cells obtained 
5 from the patient. 

38. The method of claim 34, wherein the patient sample is a breast tissue. 

39. The method of claim 34, wherein between the first point in time and the 
1 0 subsequent point in time, the patient has undergone surgery to remove a tumor. 

40. A method of assessing the efficacy of a test compound for inhibiting 
breast cancer in a patient, the method comprising comparing: 

a) expression of a marker in a first sample obtained from the patient and 
1 5 exposed to the test compound, wherein the marker is selected from the group consisting 

of the markers listed in Tables 1-6, and 

b) expression of the marker in a second sample obtained from the 
patient, wherein the sample is not exposed to the test compound, 

wherein a significantly lower level of expression of the marker in the first 
20 sample, relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting breast cancer in the patient. 

41 . The method of claim 40, wherein the first and second samples are 
portions of a single sample obtained from the patient. 

25 



42. The method of claim 40, wherein the first and second samples are 
portions of pooled samples obtained from the patient. 
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43. A method of assessing the efficacy of a therapy for inhibiting breast 
cancer in a patient, the method comprising comparing: 

a) expression of a marker in the first sample obtained from the patient 
prior to providing at least a portion of the therapy to the patient, wherein the marker is 

5 selected from the group consisting of the markers listed in Tables 1-6, and 

b) expression of the marker in a second sample obtained from the patient 
following provision of the portion of the therapy, 

wherein a significantly lower level of expression of the marker in the 
second sample, relative to the first sample, is an indication that the therapy is efficacious 
1 0 for inhibiting breast cancer in the patient. 



44. A method of selecting a composition for inhibiting breast cancer in a 
patient, the method comprising: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately exposing aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein the 
marker is selected from the group consisting of the markers listed in Tables 1-6; and 

d) selecting one of the test compositions which induces a lower level of 
expression of the marker in the aliquot containing that test composition, relative to other 
test compositions. 

45 . A method of inhibiting breast cancer in a patient, the method comprising: 
a) obtaining a sample comprising cancer cells from the patient; 

25 b) separately maintaining aliquots of the sample in the presence of a 

plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein the 
marker is selected from the group consisting of the markers listed in Tables 1-6; and 

d) administering to the patient at least one of the test compositions which 
30 induces a lower level of expression of the marker in the aliquot containing that test 

composition, relative to other test compositions. 



15 
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46. A kit for assessing whether a patient is afflicted with breast cancer, the 
kit comprising reagents for assessing expression of a marker selected from the group 
consisting of the markers listed in Tables 1-6. 



47. A kit for assessing the presence of breast cancer cells, the kit comprising 
a nucleic acid probe wherein the probe specifically binds with a transcribed 
polynucleotide corresponding to a marker selected from the group consisting of the 
markers listed in Tables 1-6. 



10 48 . A kit for assessing the suitability of each of a plurality of compounds for 

inhibiting breast cancer in a patient, the kit comprising: 

a) the plurality of compounds; and 

b) a reagent for assessing expression of a marker selected from the group 
consisting of the markers listed in Tables 1-6. 

15 

49. A method of making an isolated hybridoma which produces an antibody 
useful for assessing whether a patient is afflicted with breast cancer, the method 
comprising: 

isolating a protein or protein fragment corresponding to a marker selected 
20 from the group consisting of the markers listed in Tables 1-6; 

immunizing a mammal using the isolated protein or protein fragment; 
isolating splenocytes from the immunized mammal; 
fusing the isolated splenocytes with an immortalized cell line to form 
hybridomas; and 

25 screening individual hybridomas for production of an antibody which 

specifically binds with the protein or protein fragment to isolate the hybridoma. 



50. An antibody produced by a hybridoma made by the method of claim 49. 
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51. A kit for assessing the presence of human breast cancer cells, the kit 
comprising an antibody, wherein the antibody specifically binds with a protein 
corresponding to a marker selected from the group consisting of the markers listed in 
Tables 1-6. 

5 

52. A method of assessing the breast cell carcinogenic potential of a test 
compound, the method comprising: 

a) maintaining separate aliquots of breast cells in the presence and 
absence of the test compound; and 
10 b) comparing expression of a marker in each of the aliquots, wherein the 

marker is selected from the group consisting of the markers listed in Tables 1-6, 

wherein a significantly enhanced level of expression of the marker in the 
aliquot maintained in the presence of the test compound, relative to the aliquot 
maintained in the absence of the test compound, is an indication that the test compound 
1 5 possesses human breast cell carcinogenic potential. 

53. A kit for assessing the breast cell carcinogenic potential of a test 
compound, the kit comprising breast cells and a reagent for assessing expression of a 
marker, wherein the marker is selected from the group consisting of the markers listed in 

20 Tables 1-6. 

54. A method of treating a patient afflicted with breast cancer, the method 
comprising providing to cells of the patient an antisense oligonucleotide complementary 
to a polynucleotide corresponding to a marker selected from the markers listed in Tables 

25 1-6. 

55. A method of inhibiting breast cancer in a patient at risk for developing 
breast cancer, the method comprising inhibiting expression of a gene corresponding to a 
marker selected from the markers listed in Tables 1-6. 
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Sequence 1 

CCGCGGTGGCGGCCGCC^^ 

ATCG7CTGCATCTTCGTGTGAACCAGTGAGTGATTTTCCAGCATCTTTCCGAAAATCTAC 
CTACTGGATGAAGATGAGAAGAATCAAGCCAGCTGCTACTTCTNATGTCGAAGGGTCAGG 
TGGAGTATCANCCAAGGGGAAAAGGAAACCCAGGCAGGAAGAAGATGAAGACTATNGAGA 
A JII CCTNAGAANM GCATMGCmAra 

TCoCAAATCCCAGGCCCGTCGTATTCGGMGGMCCACCAGTTTATGCAGCAGGCAGTTT 

GGAGGAGCAATGGTACCTN 

Sequence 2 

T AGG pJ^ M ^ GGA ^^ 

^TGTTTACTTTTCATGTGGATTCGGAACATCCCMTGTAGTGCTAMTTCATCCTATG 
TRGTGTCGMTGCAATTTTCTTACCAAAAGGTATGATGCACTTTCTGAGCATAATCTGA 
^WCACCCAGGAGMGAGMTTTTAAGTTGACTATGGTGAAACGTAATAACCAGACAA 
TCTTTGAACAAACMTAAATGATCTGACTTTTGATGGT 

CAGAGCMGCAGMTCTACAGAAGTTTCTTCTTCGGGAATATCTATCAGTAAAACTCCTA 

TCATGAAAATGATGAAAAATAAAGTGGAAAATAAACGGATTGCAGTTCATCATAACTCAG 

TTGGGGACGTTCCTGAAGAGAAAGAGAATGAAATCAAACCAGACCGTGAAGAAATTGTAG 

AAAATCCAAGTTCTTCAGCTTCTGAATCTAATACAAGTACCTGCCC 

Sequence 3 

AGGTACCTGCCCATCCACTGCCTTTTCCATGTATCCTGGAACTGAGCATAGACCTCTTCC 

CAGGCAGAGCTGACAGCAAGTAAAGGAGATCATAATCAGGGGACCAAACAACTTTGTCTA 

AAGTGTGAATGTCACCTAAGGAGAAGCTGTGAGATCAGAAGGGTGGGGCAGAGGAGCAGA 

CACCATGAGGGAGAGTCCTTGGGGGTACCTGCCCG 

Sequence 4 

CCGGGCAGGTGCTGATNTTNCNCGGTNAAGCCCNCGGNNGGACCCATNCCTGCGGNCACC 

TTGACCGAANCCTGTGAGANCTCCTGAAATANAGGANCCAGCNNTTNCTCTGNTANACAN 

ANACNCTGGNGCCNTTGNNGCTGGNCTGNNTGAANGCNCACNTGAACTNCGGATTCACAG 

NAANTNNTNGGNTNATCNACNGTGGANGAAGCCANANNCCANNNTATNAGNNTNNTCAAN 

CNNNTTNANGNAGATGTATGTATGGAGAGAAACTGAGGCCTCCNNGCCAACAGCCAGCAC 

TAACTTGGCAAGCATGTTTGAGAGCCACCTGGNGAAGTGGAGCCTTCAGCCCCANTTTAA 

AGCCTTCANATGAGACTGCAAGTTCCTGGGCCACCATTCTGGGACTGCAACTTTCACAAA 

GGAGCCTCCTAAANCCA AGAAG NCCCATGCCAGATGGGAATTCTTTGGGCCCCCCAAGAA 

AATTTGTGCC C ACTTAC AT TTTTT G GGG AAAAT TTTATTN NTGC AAGC N C ATTAAG ATTT 

ACCAATTACCACTTCACTTATTTTCN I I i I I I i I I NTGGAATGCCCATACTTAAAAAAAA 

AAAAAAAAAAA 

Sequence 5 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAAAGGATGAAAAAGGTGGTGT 
CA I!fI!3]^ 

CATTTTTTTGGTAGTAAGCAGTTCATTGAGTATCAGGTCCTCAMGGAATGAGTTGGCCC 

GGCTAGGGTGGGCCCTNTTGACCTAACTTCAGAGGGGGCCTTGGCTCAGTAGGTGTGAAT 

CAGGGAAGCCACATTGTCCTCAGGGTGCTGTATGAAGCTGGGTGTGGGCGGATTCCTCCC 

ACACCTTCACACTGGCCTGCCTCCAACTCATACAAATCTCGGAGCGGTCGGTACCT 
Sequence 6 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCACTGGAGCACTTGG 

AAAACAGAAGAAATCAAAGGGCAGTCAGGACTCTCCCAGCAGGCAGGGACCAGGTCAGAG 

TCAGGATGAAGACAAAGCACCCTGAGCAAGAAGCAATGGAAGAGATGCTTGTGACCCGAG 

CTGATGGGGAAGGTTGAGGGCAACGCANAGACCCCACTGCCGACTTGCCAAGAAAACGTG 

CAGGAMTGAGGGAAGGCTCCATTTTCCTGAGAAAAGGGATGGAGGAACATAAGCAGCAT 

CAAGCTCCAGCTACTGGCTCTTTGAGAATGTAGTCGGGACTTGCCCCAGTCACCCTTGCC 
CTGAG AACTCA 

Sequence 7 

GGCG^TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAAAGGATG/VWVGGTGGTGTC 
A I G III^ 

An m n iGGTAGTAAGCAGTTCATTGAGTATCAGGTCCTCAAAGGAATGAGTTGGCCCG 

GCTAGGGTGGGCCCTCTTGACCTAACTTCAGAGGGGGCCTTGGCTCAGTAGGTGTGAATC 

AGGGAAGCCACATTGTCCTCAGGGTGCTGTATGAAGCTGGGTGTGGGCGGATTCCTCCA 
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Sequence 8 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTATTTGTCACTTAAAGGT 
TCTTTCTGTAAACTGCTTCAGATTCTTTTACT^ 

AAGAAC GTTAATATTCCTCTTTATAATCAATCTTTC C C AGTTAG CCTTA AAAATGT ATTC 

C CTACTTTTG CTTC AGG AG ATC ATTATTTGCAAATGCAAAG ATTTTTTAC TTAGACTTTT 

GAAATCACTCTTAGTAACTTTAACATTG Mil i'AGG TATG AAATTGAAGGCTGTGAAGTG 

ATTTGGGCCATTAAAGATAAAGCTATAGGGAATACTTTTCTTCGATGCAGGAGCAGCTTG 

AATTCTTTGACTTCAAAGCTCATTGCTGGAAAAATCMGAGGGCTTAAATTTGCCCCTTA 

AAAGGAACCNGGATTTTCCCACCTGGANGGAAGGG 

Sequence 9 

GATGCAGAGTCGGAAGGGAGCAGACTGCCCTGCTGGTGAGGGAGGTGATTACCCAGGGCG 

AGTGTGCAGCCACTCAATGGCAGGGATTCTTTTCTATTTCCAGCCGGGGTGGAAGAGAAC 

CCCTTTGTCCCTCAGACAAGTGGCCCCAGTTGAGCCCAGCGCTGAAACCGCCAGTCATCC 

GCCTCTGCAGGAAGTGGCCGCCCTGACACATAGCCCAGGGCTGGATTCTTGTGGAAGAGT 

CAGTGGAGTGAGATGGTGATTGAAAAAGGAAGAAGGCCCCTCCTCAGATCCAGGGCTTTA 

ACCCCAGCGCACTCCCCGATGTCTCACCCTTTGAGCTCCATGACTGCATTCCCACTTCAA 

GGTGCAGGCACGGACGCTAACGGCCCANTGACTTGGCCAAGGCCACCCAGTAATGCAGCA 

GCTGAATTGGGGGCATTTAC 

Sequence 10 

CCGCGGTGGCGGCCGAGGTACATTTTTGAAAAGCACAATAACTTGTATTAATTGC ACTT A 
ACACAATGAACCTTTAG7TTCCMCCAGTTTTCATTCT 

TATAAAAACTGCTTTCAAAAGGCATAGAGACACCACACATGGTC CACAGTA AATTCAAAT 
AGAGAGGTGCMTAGTTGCAGTGGTAAACACACAAAAAAATACATTTTTTTGGACTAAA^ 
ATCTG GTCACGGATAAAAGCATGTGCCTTTTCATTCTTCTCTG GGATGTTACAAC AGCAA 
CACGCTCTAAAACAATTAAGTTACATGCAT AATGCTA AAAGAATGTGAGCAATCCTNTAA 
CCAGCTTTTAAGCCCATCTGCTTGATTTCN I I I I I I I 
Sequence 1 1 

CCGGGCAAGGTACACACCAGGGATTGGGGGCCCTGCAGCTTCTACGCCCAGGGACATTTG 

CTTCGGGGACTGGAGTCCTTGCTGTGGCGTGAGGCTGTGTCTGGCGTCTGGGAGGAGGAC 

TGTGTGGGGTCTGGGTTCCCCAGCCCTAATGACCCAGCTGGTTGAAGGCAGCAGATGAAA 

GGAGACAAATGACCACGCAAACCCTGGGTGGCCTCATGAACAGGTGGCGGACAAGGGTGA 

TCCCTGTTGAGCAAGGTCCTGCACTGCGGTGGGTCCAAGGACTAAGCTGCCAGC 

Sequence 12 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGC AGGT ACAAAACTTGGGA 

TCAAATGGAATCTTGA1TCACTMCCAATTTAAGATCTGACTTCTAATTTTAGGAACTTT 

GGGTTATGAACGCTTCCATTTTATACCTGTGTCTAGTTAGTTTCTGCCTATCTATCCAAG 

AAGCTTTTATCAAGGGTCCACCATGTGCCAGCCACTGAAGTAGATATAAATACAAGGATG 

TGTAAGGTATGGATGATGGTATACGAACTGTCATCTTACTGGATTTGTCCGCTCTGTTAA 

AGATACGGTTCCGAAAACTTTTTAMGCCCTAGAGAGGGCTTTAAG 

Sequence 13 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACTTGGGATC 

AAATGGMTCTTGATTCACTAACCMTTTAAGATCTGACTTCTAATTTTAGGAACTTTGG 

GTTATGMCGCTTCCATTTTATACCTGTGTCTAGTTAGTTTCTGCCTATCTATCCAAGAA 

GCTTTTATCAAGGGTCCACCATGTGCCAGCCACTGAAGTAGATATAAATACAAGGATGTG 

TAAGGTATGGATGATGGTATACGAACTGTCATCTTACTGGATTTGTCCGCTCTGTTAAAG 

ATACCGGTTCCGAAAACTTTTTAAAGCCCTAGAGAGGGCTTTAAGG 

Sequence 14 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATATGCCTGCTG 
AAGTGCTCTGACTTTAGGATGAGAAACTCTAACATAGGCCGGAAGACAAATAAACCATAA 
ACTGTAACAATG ACTAAAC AG AC ACTTG G CCC ACTGTGGTG G ATTTGTATAAC ATCTCTT 
CGCCMTTTATGAGCTGTTTTTATTTCCTGTTTAGTTCTCTTAGCCATGAGAGGTGGACT 

CTTTGACCTGCCCG 
Sequence 15 

CGGCCGCCCGGGCNNGTACCAAAAGGNTNAANACCCANAGGGNAAACCCACCGCGGGNNG 
AGCAACAAAGNAGGCACACANGGGGAAAGACCCCANANACGGGGGGAAANGCGGGAAAGC 
CCCCCGAGGAAGCACACACCNCANACAACACCAAAAAANACACACGGGCGAGAAGNCGAN 
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GCCCAGAANAGNGAGGGGAAAAAGANGCAGACGGAGCCCAACCCCCANGGAACAAAAAGA 

GGACCCANACCGGGGAAAGACCNAACCAGNNCAAAGAGGGGGGGAAAAGGGNCAGGCAGC 

CNAGCAACCANAACCGCGNGCAGAGAAAGCACACCNGGAGACAAGCCCNACNAANGGNGA 

AGAAAGGGGGAAAANGCCGCCAAGGAGAAAGCCAGGGCNGAAACNCACGAGAAAAACCAC 

ACCAGGGGGNNAAGACCNANGCNANACAACGAAAACCAAAGGGAAAGGGNNCGACCCAAA 

NGGGCCGGCCCGGGNAAACCANCAAGGGAAGGCCACCCCNAGGGNGAAAACCNAAAAAAG 

NGGAGGGAAAGGGGGGAAACCC 

Sequence 16 

AGGTGNGMTCMCGCAGGTCAAMTGAAATTTACACTGAAGGCTTCCAAACCAAAGGGA 

AGGACAGGATGTGTCATCAMTATGTTTNGTCACCTTGTATTATACAAAANGCTATTTTC 

TAANGAGTCAGAGAAANTNTGTGAANCTTATTGTGCGGCCCCCTTGTAATNAAATGTTAA 

CTCCCTTGTATTTAATnTC AAC ACTAC ATTAAG AATTAAGTG GTTT N AGTTTG N G G AC A 

TAAGCANCACTTATTAATATCAAGGTGTTTAAGAACTCAGGAG 

Sequence 17 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTNTGCTTCCAGTTTATATTTCACA 

TCCTGTTCAGCTTNATTAATATGCCATGGGTCACAAAACTCAGTGCAATAAAATGTGTAT 

GAAAGAMCACCC TTCAGA AAAGATGAGAC TCTTT CAAGTGTAAATACTCTAAACTAATA 

TAAGTCAAAATATA I III! I GTGCCCAGNGATTTTTAAAAATTACCCAGTCAACCATTTC 

CTCAATMTTC AAATACTC AAGTGTCC ATTTATATTTTTGGAATAAG CG AG AGTG ATCGT 

AGTACCTGCCCG 

Sequence 18 

CNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATTGTGTGCAAAATCACGAG 

GGGGGTGCAAGATCCTGATTTTTNAGGAGTTCAAGCGACAATGGCAGNCCAATACGGCAG 

TATGAGCTTNAACCCCAGCACACCAGGGGCCAGTTATGGGCCTGGAAGGCAAGAGCCCAG 

AAATTCCCAATTGAGAATTGTGTTAGTGGGTAAAACCGGAGCAGGAAAAAGTGCAACAGG 

AAACAGCATCCTTGGCCGGAAAGTGTTTCATTCTGGCACTGCAGCAAAATCCATTACCAA 

GAAGTGTGAGAAACGCAGCAGNTNATGGAAGGAAACAGAACTTGT 

Sequence 19 

CCGCGGT GGCGG CCGAGGTACCTCACGCGCATAAATTTGCTGCTCCTA T i ' I I I I I I TCTG 
TTTATGTG I i i I I ATGG ATCT AAGTTAAATCTTTTGG C AATATATAAAAATGTAAATAGT 
AAAC TTTATTT ATTAAG AATGTC ATCTTTTTTAATTT ATATTTAC AC AATTGTTC ATC T A 
ATTTAi Hit ICTATACAGTT TTAA ATACTCAGACATATTTTGCTGTTCATGATATTTTT 
ATCCTGTTCTCATGGATTTGTTTTCCCATACTGTTTTCTCTGATCTCAATTACAGGTTGG 
ATCTCACAAATMTAATGTCAGAGACAGAAATATTTTGCCACTGTTGATTACTATACTTT 
AAAGTTCTATATTATGAAAATATATAATAGCTTGTACCTGCCCG 
Sequence 20 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTCTTTCC 

GTGTGTGTATGCTGGTGAACAGTCAGATTTATTTATATTTTTrTGCMGCATTGAATAAT 

CTMGTTTTAMTATTATTrATCCCCATCCGTTCGTATTTATATTAM 

Sequence 21 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGA 

ATAAGGTTCACAGCTGAATTGGCTTGGTGTCCTAATTCCCTATTCCAGTATTCTCAGAAG 

GATCCCATCTATGATATATGCAGAAATCACAGCCACATTTGAATGGTTCAATCTTGATTC 

ATACTG AA CTCC C ITAAGCCC AG CAGTTTCTC ATTCTTAAAAGGTAG GTTATTAG AGCCC 

CAAAGCAATTTTCATAACAAATCTACCTTTGTATATTTAAGAATATGCATTAAGGCTAGG 

AGTGGTGGCTC/.CGCCAGTAATCCTAGCACTTTGGGAGGCCGAGGTGGGCAGATCACAAG 

GTCAGATTGAG/vCCATNCTGGCCAACACGGTGAAACCCCATCTCTACTAAAAATACAAAA 

ATTTGCCCGGGGCGGTGGCTCACGCCTATAATCCCAGCACTTTTGGGAGG 

Sequence 22 

CCGCGGTGGCGGCCGAGGTACTCTATGTCCCTCCCAGTTGCAACAATGTTGTTTCCAGCC 

TGGTTCAGCAGCTTGCTGGTGGTTTACTAGGCAGACATGAGCCAGAAACAAAATCAGCAA 

CCCTGGAGTGGTCATTTAGGATCAGTTTTGTCTCCTTTTACAMGCAATGAGTGCTTTAT 

AAAAGCATTTCTCCTCAGGAAACTGGCTCCTCAAAAGTCGTTCTCCAAGCCAGCAGCTCA 

CAGTAGATGGATGAGTTTACCTGTCCTAATAATACTTTTTATTATAATACTTTTATTACT 

GATACTTTATTATAATACTTTrATTACTGATACTGATACCTAGTAATACTTTTATTAGAT 
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GAAATGTATCCCCAGGCCTCCACATCAGCCTTCTGGGGCCCTGACAGGTGTTTACCAGTA 
GCTNCACAACCTTCACAGGCTTGTTCTCTCATCCCTGTGAGGGAAGGTTCAAAAGGCAAC 
T 

Sequence 23 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTCGCTCAAATTAAGI Mill I AAAAGGTCTGT 
AATTTGAMGGAAAACAATTTTTCACTAAAAATATCCTTATTACATGAAAG 
AAAAGACAGAGAAAAAAGCTTTTTAATTAACAAATAAACAGCATCTCTCAGAGACACTTG 
GGAATGTTTGTTTTTAATCAGTGGGTCCTAAAAATCACTGCTTCCACCTGCAAACGAACA 
CGAATCCACTGTGGATGGCGTCCTCTGACCCACCCTGAAGGGTTCTGTTTCTCTCCACAC 
ACTTCCTTTTCTGCAACTTTCAGCAAAGCAGGTTTGGMGAAA GAGGA TTACAMGA^ 
. ACGACTGGCTCCCTAGTGAATAACTTAACACGGTAAAACCCGGTTTTCCATTTACATT 
Sequence 24 

CCGCGGTGGCGGCCGAGGTACCTACTATGTGTCACGCCATGGGGGATAC.AAAGATCTATA 

AGGCACAAGACCCTCAGTCTTGTAGTCGCCTGACAGCCAGCCAGCTACAACATAATGTGG 

AAAGGACAATGGTGGAAAATGCACTCAGGTCTTCCTAATGCACAGAGTATGCTCAGGCTG 

TGACATCAGGAAGAAAACAGATATTTACCTTAACACGGACTTGGAGGACCTTCAAAAAAC 

AGTGATGGGAGGAAATCCAGTTTTAAAAGTCTTGATTT AAAAA AAAGAAAACACT^ 

TGGATAAAGATAGGCTGCAGGAAATGTAACCTATGAAATTTTCTCAAATTAGCTTTCAAA 

CACACACAAAAAATTGCATTTGGTTGAGGAGCAGAATGTAACTTATATTAAAGAATAAAC 

TACTATTTAGTATCTGAGTGAAGTACCTGCCCG 

Sequence 25 

CGAATTGTTGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAAGTTCACCCACCC 

TGCACTTTNGCCCTTAGATCAATCCTAAGTAGCCATTGCCAGTAGGCCAAGTTTAATCAG 

AGGACAGTGCCTACCAGTAAATACTGAATAGTTACAATAGTTATGTCCATCCAACCAGTA 

G CAG ATG AAC AGCTAATACATC ATG ATG CTATG CTCTCCTAAC AGG GTCCCCTC AG ATCC 

TCAGTGAGCACATAAAGAAAGGGAGGTCATATCCCTTACATCTCTACCAGGTATTAACAC 

CTAACTACTCTCTAGCCAGAGGCAATTCCCTTTATTTCCTTACTCTCGTCGTCTTCTCTT 

TAGCCCAATCTCCTGACAATAGTTAAAACAAAAAGACCCCCAAAATATCTCTTGCTAAAA 

CAG 

Sequence 26 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAG 

CTAAGAAATGTAACAGTATCAACCCTCCCAGTTGCTTAATTATACCCATAGGTAATACAA 

AAAGCTCTGAAGACCCAAAGATGACATTACTAATGATGTGATTTCAGGAGCCACAGAAGA 

ACCTTACCAGCTTCCCTCAAATCAGTCCTTATCCTCTTTCTATCTTCACTCCCATCATCA 

TCTATTTTCACACTATCCAGCTAAGCAAAGATTCCTGGAGGCTGACTTGTATCTTCAGAC 

TCACAGAGTGAATTCAGCTNTTCTGAATCAAGACCCACCCAGTN 

Sequence 27 

ACGCTACTATAGGGCGAATTGNNGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGl linn 

1 1 I 1 1 I I I 1 1 I 1 1 I I 1 GGAGAAACAGGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACT 

C7TGGOCTCAAACAATCCTNCTGCCTCGGCCTGCCAAAGTGGTAGGATTATAGTCATGAG 

CCACT3CGTCCAGCTAGAAATAGCAATTTTTCTMCACATATATTTCATTGGTTTATGTA 

TGAATTTAGAATATCAAAGTGGCATGAACAGGTTCAAGATCCAACAAAAGGGAATCTTGA 

1TCTGCCTCACCTAATAAAAGCAGAAGTGCGTGGGCNCNCTAAAA 

Sequer.ce 28 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGAAGAGGTTCGTGGA 

AAAGGAATATGCTTAATAACTTCGAAAAAAAATACCCATAAATGGACAAGGGTGTTGGGC 

CCCCTTTCTGGGTGGATTCAGGGGTAGTCAGATATTCCAGGTTAAACTGAACCTGATGCA 

TC A ATCTC CTTCTTGG AGGTGTCTC AGTGTG CC AG AC ACTG AATCC ACCTTAG ACATCTA 

TTGCCATTCAGCCAGCAACAAACACACATTCACACTCCTCTGTGCAAACCTTTCTTTGCT 

CArCTTGGTGAGAAAACACATCCAAGAGTACCTGCCCG 

Sequence 29 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGAAGGCACTGAAT 

AAATTCACAAAGGCCGATTGGTTCACCCATTCTTTTAGAGACAACAGACACGCAATTCTG 

ACGAGGACTCCTGTTACTAAAAGACACAGCCTCTGATACAAGAGAGATATCCCTTTGACT 

GAAGCATTACCAGGGTCCCCAGGGCCCCCTCCCACTGGGGCGGTAACACTACGGGTCTCC 

CCACCATATATTCCATGTCAAAGTATCTACACAAATACAGAGGAAATTAAGCAAGTAAAT 
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A ?? G TATGTMnG ™TCATTTGTATTTCTTTMGGCATAmATAAATATTTTAAAGT 

TGGATCTGAATAAGGCAGGTTTACTGGNGGGAGCATATTAACATAAAACAGCTTATATGA 

TTTCAGGTGGGTACCTGCC 

Sequence 30 

T^II^P^C^^^CCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAGGGAC 

GGCAAAAGTTGAATAAG TTAC AAGAACAG G CTTTTATTC ATTTGC TATTTGGTTTTTATG 
^CrreTTTACT7G£ITrGAmGTTTCTCAAAACTCACTCAGTGGGATTACCTGTCTG 

Sequence 31 

NACGCGC/VT^TTTTGCCGCTCCTAI I I I I I I I I CTGTTTANGTGTTTTTATGGATCTAA 

ATCn n I I AATTTATATTTACACAATTGTTCATCTAATTTATTTTTTCTATACAGTT^ 

AAATACTCAGACATATTTTGCTGTTCATGATATTTTTATCCTGTTCTCATGGATTTGTTT 

TCCCATACTGNTTTCTCTGATCTCAATTACAGGGTGGGATCTCACAAATAATAATGGCAG 
AGACAGAAATATT 

Sequence 32 

CTATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATTGCATC 

ATGAAGTTTGGGCATATCMCATTTTGACTCATTCGGGAAATCATACGCATTAAAAGTTG 

AAGCTTTCTACGATTTGGTGGGGGAAGTAACAAACAACATANCTGTANAGCATCNATGGC 

AACCCTCTCTAAATGAGGTTGCAGCAAGCTTTGTGTGCCAGTCAACATAGAAGAAGCTGG 

MGTAAAGAATTTTCTGAAAGTTCTATTGTACCTTGGCCGCCCGGGCAGGTACGGAGGTG 

MCTGAAAGAGACTGCTTCCATTGGCCTAGTCCAGATGAGTGTATATATTCCCATCAGAA 

CCCACCACTGGTCATCTTTAAGGCATNCACAAMTGTCCCTCACAAATTCACTTAATCTC 

Sequence 33 

TCGAGGTACCTAAGTCAAAAGGCACTGNTTTGGAGATGGCACACTCA7TTTCATGCGTGT 
AA^TNTTAAATCATCCACTTTGCAGGCAGTGGCTTTGATAACTCACTGCAGTGTTCAAG 
GGGTTTATAAAACTGGTTATAAGCTTCAAACCCATGTTTAGAAAAATTGACACTCATAGA 
AAAAAATGCTTTCTCTG GGCTAATTAAAATTAAAAAAATTGAAATGTAAACC G AGAAGTA 
TG GCTC AC AAAG CTATAGAAAAG ATC CTTC ATATCATCCCTGG CCCTAGC ACCGTG AGTA 

GATGTTCACCCTGATAAGGCCAGGCGGAGGTGGCCCATGTGAAATTCTTTGGCTTTGAGC 
TAATTG 

Sequence 34 

CCGGGCAGGTACTAACATGATGATAGGTTTTCAAAATATCTTTGTAGTGGATGCTGCATA 

ATTACATTCACTTCTCTTAGACTGTAAAAGACTTTCTTC5ACTTGTTTTAACAGTAGAGAT 
AGCAGTACCT 

Sequence 35 

TNCTATAG^CGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGTGCCCCTTTTCA 
GACAGTTTTTGATTCGCTCTAGAC I I I II I I I I FTAATAG G G AAAAAATTTG ATAATTTT 
Cm n I 1 CTA^TGCACTTAAGACTAAAACACAGGTTTGGATTAATTTTATTTG C TTCCT 
TTTTCCGCTTTTCTTCCCGCAGAGCCTGATGGGAGMTGTCCAGGGCAGGGAAACCACAT 
TTTTTGTAGGTGATAACTCAATGAAMTTGGTGCTTATTTTTTACACTTCTCT^ 

CTCTCTTGTGGGGCTATCTATCTGTTTTAAGGTC7CCTTGAAGGCGCACTGGGG 
Sequence 36 

CCGGGCAGGi I I m i m II M I M I I I I I NTCAGGGNTTGAACACNTTTTTAATTATTAG 
AAATCC ATACATG CATAT AATTAAC CTCTAMTTTATTTTTAAAAAATAAATTATG GN C A 
TGAAAGAACAATAGNGCTTAAANAATTTNAAATTTNCATANACATGTAATTTTTATTCCC 
TAAAAATGAAATAAGATGAAACCTTTATGGGTANATTAATGGGATNTTTTA 
Sequence 37 

AAGGGCGAATGGACT CCAC CGCGGTGGCGGCCGAGG I U I I l III l I I I I i I M I CTTTT 
TTTGGAGAGAACTCCATTTTATTATGGAAAGNTAAAAAACAAACAAANCAAAACAGGCAA 
TTGATAAAGGCGGCACAATNGNNAAGGAGAGGTTGAGGTGTCTCCTTAGCCACCCGACAC 
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CTGCCCGGGCGGCCGCTCTANAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnn nnnTC 
AAGCTTANTCCGATACCGTCTGACCTCGAGGGGGGGCCCCGGTACCCAACl i M i iGNT 
Sequence 38 

AGGTACAGAAACCTTCAGAGAGGATAAATAGCTTGCCCTGTAGAAGCAGGACTGAAACCC 

TTGTCCGCCTGACTCCCCCAGCTACTCTGCCCACTGTAGCCCCCTNNCTTACTGTCCTGG 

CACACCCCTCACCATCCTNTNTACCTTAAATATC AAAGAG GGCAAGAGNGAAAGGGCTTT 

AAAGATMGTTATTTTTTTAAGGAAC CTTAATATTATTTTTAAG NAAGTAACC CAAATTA 

GTGACGTGAAATTACAANAAACCTGNCCGGGCCGGCCNGCTCTTAGAACCTAGTTGGAAT 

CCCCCCCGGGCTGCAGGGAMTTTNTGANTTTTCA AAGCT TTATTCNATACCCGTNCGAC 

CCTTNAGGGGGGGGGNGCCCCGGGTACCCAAACNTTTTTGGTT 

Sequence 39 

TAGGGCGAATTGGAGCTTCCCGCGGTGGCGGCCGCCCGGGCATGGTCCGGGCAGGTACAC 

ATGCATGCATACACACCCATACAAACATCTGTGTGAGGGCAGTTCTGGAGATGAGCAGAG 

AGAGACCGGAATAAACTCAATCTTTTCTTTCCCAAGCTCCTAGCCAACACTATCCTTGGG 

AGAAAGAAATTTGCAGAAACTGCTNANACCAAGTGTGGAGATGTCAAGCTAGTTCACACT 

CTGAGGCTCAGAATATGTAGGACATGCACAATTGTGCAGTCCTTTGGGATTGGAAGTGAA 

ACAGTCTGNGATCCCCTACCTTCTAGGGAACTAGGACCTAGGAAGAGGTAAAGATTNTCA 

GGTATGCAAAGCCGCCCCA 

Sequence 40 

CCGCGGTGGCGGCCGAGGTACGCGGAACACCCATGGCAGTG GGATA TATCCCGGAAACCC 

ACAGGATGAGAGAAAAGCTTGGCGCAGATGTGATAGAGGTGGCTTTTGGGCAGATGATGA 

TTATTCTCGCTGTCAGTATGCAAATGATGTCACTAGAGTTCTTTATATGTTTAATCAGAT 

GCCCCTCAATCTTACCAATGCCGTGGCAACAGCTCGACAGTTACTGGCTTACACTGTGGA 

AGCAGCCAACTTTTCTGACAAMTGGATGTTATATTTGTGGCAGAAATGATTGAAAAATT 

TGGMGATTTACCAAGGAGGAAAAATCAAAAGAGCTAGGTGACGTGATGGTTGACATTGC 

AAGTAACATCATGTTGGCTGATGAACGT 

Sequence 41 

AGGTGGAAAGCTGGTGCTGCTGCTGCCTGATTCCCGCCGACAGACCTTGGGACCGGGGCC 
AACACTGGCAGCTGGAGATGGCGGACACCGAGATCCGTGCACGAGACTAGGTTTGA 
Sequence 42 

CCGCGGTGGCGGCCGAGGTACCCTTTGCCTCCTTTTGTGGGGAGATATCTGGAAAATAGC 

TGAACAAACTCTACATGTGGCAAAACAAAGTGACAGTTGTTATAAAATGTCCACTAAAAT 

TACCTGCTAGCATTACTACGTATMCTGAAGTTTTTGCATATCCGCATAAGAAGAAATCC 

ACATTACAAGGAAAAGTACACCTGCCCG 

Sequence 43 

CAACTTCGGCGCCACCCAATCGCTCACCTGGAACGGCTTGCGGATCAACCGTTGCTGCGA 

TGCCAGATNCACTTTGTCCTGCAACTTGGCGAGGAACACCGGGTCGAGGGTCGACGGGAA 

AATCTCGCTGAGTTTCTGGTCTTTCAAATCATTGGTCAGGATCACCGGTGGATAACTCGC 

CAGCCCCGACACCAACTGGATGTAGGCCTGCTTGTTGTTGTCATCGGTCAGCCATTGCAC 

CGCTTGCTGCTGCGCCTTGAGCAACGTG3CCACGGCTTCAGGATGCTCATCGACAAACTT 

GCCGCTGCCCACCAGCACACTCTGGATACT 

Sequence 44 

CCGCGGTGGCGGCCGAGGTGTCACTTCTACGCATAAACTAGATATAGCTTTTGGTGTTTG 

AGTGTTCATCAGGGTGGGACCCCATTCGAGTCCAATTTTCC TAAG TTTCTTTGAGGGTTC 

CATGGGAGCAAATATCTAAATAATGGCCTGGTAGGTCTGGATTTTCAAAGATTGTTGGCA 

GTTTCCTCCTCCCAACAGTTTTACCTCGGGATGGCTGGTTAGTGCATGTCACATGACATC 

CACATGCACATGTATTCTGTTGGCCAc.CACGTTCTCCAGACTCTAGATGTTTAGATGAGG 

TTGAGCTATGATATGTGCTTGTGTGTATGTCTATGTGTATATATTATATATACATTAGAC 

ACACATATACATTATTTCTGTATATAGATGTCTGTGTATACATATGTATGTGTGAGTGTA 

TGTATACACACACACACACACACAC?4CACTTTTGCAAGAGTGATGG 

Sequence 45 

AGAAGACTAAAGACAGTTATCTTATAATAACGAAATATAGTATAAATAGCACCTTATCAA 
G AATTCTG C AGGGGTTTTAAC ACTTAC AATAATAG G AAATAG CC AT TAAAAAGTTGCTCT 
AACTTTAGATTTCTAACTTTAGTGTTCTTTAACAAAGGCCATATTT^ 
CAAAAAATTATATCTGGCTTTATCTATTAGTAAACACAAAGGGTCCATATTTTATTCTGA 
AAAAATATTTATTATATTCATTCATAAATGTTCTAACTAATTTAACTAAAAAAATCTTCT 
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AGTATTTTCTGATGCCACAAGCTTACTAAGAAAATTACTTCTAAAAATTGGTNATATAAA 

TCATCAATGGATTTACCCTACTTTAAAAAAGAGGGGTATCTGGNTTTCTCTTACATTTAA 

TA ACCT GAAAATGGAGGTCTATAAAAATATTTTTTAAAAAATACAGNGACNCCTGNTGGA 

GGTTTTGGTAGGGCCCCTTGGTTTTTTNAAN 

Sequence 46 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCACTAGAAT 

AATGCTGTGTCCTCCTGTTACACATACTCATGGTCTCTCCTACTCTCCCTTCATCACACC 

TGTGACATCTTGCAATTTATATTTGATTGTGTCATTATCTAATGTATTCCTCCTGGACTC 

TAAGCTCCAAATGAACAMTGCTTTGTCTACTTTGCCATAACTGNGTCCCCAGTGCTTGG 

CATGGGACCTGGTGCAGAGTAAATACATCATAAATATTTTGTGAATGAATGAATGAATGA 

ATGACCATGATTAATAAAAGGATATAGCTGCTCAGTCTGGTGCTGATAATGGTGGTAGTG 

ATCAAAAGGGTTNAACCACAGGTCATTAAATGAAAACAGCTAACTTAGTGCTTACAGGAA 

CCATACACTCCCTGTGTATAAACATGCCGGTGTGTGTGTCCTNGGT 

Sequence 47 

GCCCGCGGCCGCCCGGGCAGGTNCAAAAGAACAGCATAAAANCCANAAANTTNGAAGCNA 

ACAAGGANNNGCAANGGGCCGGGAAAAGTAAANAAACAGCGGCCCGCGCACAANCGGNNA 

NACAAGACAAGGAANNNMCAACCCAANCCCCACACGGAGAAGGAGAGCCNAAAAAAGGA 

AAAGNCGCCAGNCAAANGGAAGCCCCGAGGNNAGGAANGAAACCAACANGAGACCACCGC 

AAAAGAGAAAACCNANGNGGGCAANGCAAACCGNGAAAAGANGGGNCCCGNCCAAAAAAA 

ANAANNNNNCCCCTGGCNGNAGGGANGAAAAANATGGGCATATAGCCCAGAGCCACNAAG 

AAGAAAAAACANCCCACCCCCAANAAANCNAAGGGAGGCCCCAACNGAAGGCCAAAGGGG 

GNCCCAAGGGAAAAAAAANGGANGAAAGAGCCCCCCNGGGANCCAGGGAANGGGNAAAAA 

AGGGGGGAGGAGGGNGGAAAAAAACCAGGGANACCCNAAAAACCGGCNAAAGGGCCCGNC 

ACCCCAAGCNNAGGGAACCCNACCGGGAANGGGGGGGNG 

Sequence 48 

CCGCGGTGGCGGCCGAGGTACAAGAGAACAAATTAAMTTGAAAAATTGAT1TCACTTAG 
AAAAACTTCTAGGAACAGGGTGAACCACTGATTTTAATTTGCCTMTTATCTTATGACAA 
GTATCAAATTMGATGACACTTAAAGATCCTTAGCATTAACTTAATGATGGAGAAGAGTG 
C TCAAC AGACAGTTCCCAGTAAGGTAATGAGATGCCATTTTCAGAGACATTCTAAGAAGA 
TATTTTGATTCATTAAAAC ATTAAATAAAAAG C CC TCCTC AG ATTG G AACCCCC AAATCG 

ATGG AGCCACATTAATAATACTTTTCATGCCTCACTTTGACATGACAGGCATTTNGATTT 

TTTTAAAGATCTTTAATACTTT 

Sequence 49 

CGCNGGCGGCCGCCCGGGCAGGTNCGAGAAAAGAGCTAGGGTAGGCAACTTAAACTTACA 

CAGTGCCAGTCTCAGGAGGTCAGTAGCTCACAGAACTCAACAGATAAACTGGATTAAAAC 

TTAAAAGTCTTCTTTCTATTTG 

CTGCCTATATATAMTTTTTTATTAATTTTC 

AGGATGAGMGAAAAAATACGCTGACACTGCTAAATCGGGTATATGTTTTTGCAATAAAG 
AACACTGGTCAATATACAACTGAGGAAAAACTGAAACAGATGTGAGTCCTANAACCACAA 
GAGTTTGAATTTGCCCAGAAATGCTATTTTAAACACTCTATATGTTGGCTGCTGTTTT^ 

GGGGAATAATGCATTCTTGGCATCCTTAAAAGGTTTCAATATGTTACAAAGGTTATCCCG 

GAAAGAGAAAAAGC 

Sequence 50 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCTGTCTTCTGGACCAG 

TC AGTCTATGAGCTCTTCTTTG AAACAGTCTTAG AC C C AG AATG GTC ATCTC ACCCCC AG 

AAGACATGGTTTCCAAGGGTGGGAGGTGTTAGTTATGGGTGCCGAGCCCCACTTCCTGGG 

CGACAAGGCAAAGGTCMTTTCTCCCTGCTTCAAGAATGAAGAGGGAACTGGTGCTCCTT 

GAACCAAAGATTCTTGGAAGGGTCATTGTCACCAAAGTGGCTGGTGTGGGCAGGAGCACC 

AAGTGCAGACAAACCCACCGGGGCCTGGCGGGGCAGCCCTGTCATGGTTTCTGTCTTTGT 

TCCTG TTAGTGAAACAGAAGACCCCCG ACCCCC GCC ACCTAGC AG C ATGGAAC AC CTGCT 

GCCCAGATACAGACAAGGCTCTGCTTTGTCTCCTTGTGCTGGTTTTTGGCAGAAACGTTA 

AAGGGGCTGAAGGCCTGTGGGACAGAA 

Sequence 51 

GCTCCCCGCGGTGGCGGCCGAGGTACTCATNATAGTTACATCCCTATATAAAAAGTATGT 
TTACATTTAAAAAATTAGTAGATAACTTC^ 

TTGAGTCAACTTTTGTTTTGGAACAAATTAAGTAAGGGAGCTGCCCAATCCTGTCTGATA 
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TTTCTTGAGGCTGCCCTCTATCATTTTATCTTTCCCATGGGCAGAGATGTTGTAAGTGGG 

ATTCTTAATATCACCATTCTTGGGACTGGTATACATAAGGCAGCCGTGAAACTGGAAAGT 

CATTTTGATGACTGATGTG ATACATCCAGAG GTAAAATG C ATTTAAACATATTAAAG GAT 

TTGCCAAAGATCAATTTTCTTGCTGACATAAAAATCACA CAAA CCAG TCCCCC CCAAACC 

ACAACTGNCTCTCAAATAGCTTAAAAAAATTGGAAAACATTTTAAGGA TTTTT CMGGTT 

TCTAGATTTTNAAAAGGATGGTCAGCTTTTAGAGGGNATGGTNAAAMTTTT 

Sequence 52 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGAATTTACTTGAC 
CATTCCCCTTATTTTTCATCTAGAGGAATCTCGGATTCAGCCCTTTCATTGCTAAGACAC 
CTTTTCACTGAGGTTCTTACCAGCTCAGCCAAATCTCCACTCTGCTATAGCAGAAGCAAT 
AATGTTTGCTTTAAAAAGATTTCTTGACCNATGCCTTTTNTTANAMGGGGGGTAGA 
TTTTGAACTTNCTATCATCANANCAGNCCCCNAAANGGGGG GGTTGG GGG GGGGA l tut 
TTTNTGGGGNNGNNN I MINI I NNCCNCCCCCCCCCCCNN1 Mil l AAGGl \ I I iGNGG 
GGGGGGGGNAAAGGGGNA I rTTTTTTTTTTTl I NNNCNCCNNGGNAAAAAAGNNNNNNAN 
TTTN nTTTTTl NNTTNGGGGGGGGGGGGGGGGGGGGG 
Sequence 53 

AAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCCATGTGGAATCTGTG 

AGTGTCCTCTTAAGTAGCGTGGGCTAGCCAATCTGCCGTTCATGGTGTATTGTAAACTCC 

GAATTCCATATGTAATAGGATGCAAGTCTAAGCGTTTCAT GTGG ACATAAATGTATCTAA 

ATAAAACTTTCCCTAGCACTGTGGCTGACCTCACCGTTACTTTTATACTTTAGTATGAAA 

CTGATGAGAACTTTGGTAGTGAGTA I HUM I ATATATATACATATATATGTACCTGCC 

CGGGCGGCCGCTCTAGAACTAG 

Sequence 54 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAC GCGC ATAAATTTGC 
TGCTCCTA 1 1I11H I CTGTTTATGTGTTTTTATGGATCTAAGT TAAATCT TTTGGCAAT 
ATATAAAAATGTAAATAGTAAACTTTATTTATTAAGAA TGTC ATC I I I 11 I AATTTATAT 
TTACACAATTGTTCATCTAATTTATTTTTTCTAT^ 

TGCTGTTCATGATATTTTTATCCTGTTCTCATGGATTTGTTTTCCCATACTG TTTTC TCT 
GATCTCAATTACAGGTTGGATCTCACAAATAATAATGTCAGAGACAGAAATATTTTGCCA 
CTGTTGATTACTATACTTTAAAGTTCTATATTATGAAAATATATAATAGCTTGTACCTGC 
CCG 

Sequence 55 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACTATATATACATGTGCATATATATGTAAAT 

TTATATTTATATATAATATCTTTATATAGATAGATAT CTGC AGACAGGTTATTGATTATA 

GAGACCCAAGAAAGCAACTCAATAATTGTTCAAAGTTTTCCTCACTGACTGCTGGTGTGT 

AGTTTMGAAGCCCCCATTTGTTTGCACTCAGAATGCCTTATCTTCTTTT MTGG AACAC 

TTGATGTGGAATTTTAAGTCTGAGAAGTGAGGTGCCTTCTGAAGAAGAGATTTTAGAGAC 

TCCCTTCCTCTATAAGTTGGAAATGACCAAGAAGTCTTTAAGTAGACGACAGTTAGCTGA 

CTTTGACATTGTAGGGACGTMTCAGCTTTTAACCAAATTATAGAACTGGTAAAGGGTAA 

AGATCAAATTTGCCAAAAAAA 

Sequence 56 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGAAGAAAAGAGCTAGGGT 

AGGCAACTAAAACTTACACAGTGCCAGTCTCAGGAAGTCAGTAGCTCACAGiXA CTCA ACA 

GATAMCTGGATTAAAACTTAAAAGTCTTC1TT CTATTTG AGC CCATM TGACTATTTTG 

MCATGGCTCTTTTGCTGCTGCCTATATATAMTTTTTTATTMTm 

AG ATCTTG AATAC N CTCC AGG ATG AG AAG AAAAAATACC CTG AC ACTG CT T AATC G G GT A 

TATGTTTTTGCCATAAAAACCCTGGNNCATATACCACTGGGGGA AAACTN GAACCCATNT 

GGGGTCTTTNAACNCCAAAGGTTTTAM 
TTTTGTGGGGGCGGCNGGTTTTTTTTGGG 
Sequence 57 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGG CCGA GGTACCCC3TTCTGCCTGAGCA 

TTTTTTCCTAAAGGGAAGAATCAATAGTTrCTGACTGTTTTAACAGCTGAAAGCTCCAAC 

TGGAGGCAGAAGATGGGATGGCTTTTCACACACGTGCGTGCAAGTTTAGCCACCTCCAAA 

GGCCTTGTTCTTAAAGCA 

Sequence 58 

CCGCGGTGGCGGCCGAGGTACTGGGAAAATTTATAGAAATCATCTAGTCTTACCCTTCAT 
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CTTACATATAAGAAAAATGGTCTTTTCTTCTAATCACATTTACAAAATATGATATAAACC 
TTG ACCATG AATGTATG AG CCTAATTAG AG AAACAG AAAATC AG C ATGTC AGTTTTCCTT 
CATTCAAAAAAACATAGTCTTTNTAAGCAGACATTNTGGNGGTAANANGCCTAGTTTAAA 
GCCCCCCGTGGNGGCT CNAAAA ANTTTTGGAATGGACTTCCCGNCNCNNCCNTTTTAAAG 
GANGNCCCCCCCNCCCATTTTTCCTGNCCCCNCNCNGNGNNTTNNNNGAAAGGGGTTCCA 
CNCCCGGGGGGGGGGGGNGNTTrrNNCCCCCCCCCANGNTNTTTTTCCAAGGGGGNTTNT 
TTTCCCCN M l it NTAAATAAAANAAAAAN NCCCCCCCCCAA 
Sequence 59 

CTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCGTAAAACAAGGATCAT 

CGATGTTGTCTACAATGCATCTAATAACGAGCTGGTTCGTACTTTTAGTAACAGTTCAGA 

ATTCATCTTTATCTCCTACCTGCCTCATCGGTGGAAGTTTAMGTCATGATTrTTTTTAG 

ACATTGATACTTGTGTCTATAGACAAATAMCTCATATTAGATGACAATTGATTTTTTAA 

AAGTCCAGGTAGAGAAAGGAGCAATCA7TTTGAACTAAAATCTTTCTATGTTTTTTGATT 

ACTATTCAACTTGCTATTTTTTAGCAAAAAGCCGAAGTTTCAATTAGTGTTCATCTCAAA 

TCTTATTGCTTTCAACCCGGGGTACCTGCCCGGGCGGCCGCTCTAGAACTAGGTG 
Sequence 60 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGATAGAGA 

TAATAGA AGACA AAGAGATTAAAAGGAAATAAAAATGCATGATTAAAAACTAAGAATAAA 

AAACCTATTTTTATGTTTCCTAAAGGAMTTGTTTATTCTACAGACTCAGTAGGTAGACA 

CAAACATAAAGATTTCCCTAGAAGACATAGAGTGGGATTTGATAACACTGTCTGTTATTT 
TCTGTACCT 

Sequence 61 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATTATACCTTTTTAT 

TATTGTTATMTTATTATGGGTATTTCTMTTMTATGATGTTGAAACCTGTTTGGCACC 

TTCTGGAAGCTACCAAAAAMTGACACTCCATTGAAGTGCTTAAAAGCTGTTCTCATAAG 

AATTCTACTGGCCTATTGCCNAAAAAAAAAAAAAAAAAAAAAAAAG 

Sequence 62 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTGAAAAGCACAATAAC 

TTGTATTNTTTGCA CTNMC ACAATGMCCTTTANTNTCCMCCAGTTTTCATTCTCTGC 

ANACCCGGGCTTTNTTTTTATAAAAACTGCTTTCAAAAGGCATAGAGACACCACACATGG 

TC CACAGTA AATTCAAATAGAGAGGNGCAATAGTTGCAGTGGTAAACACACAAAAAAATA 

CATTTTTTTGGACTAAAAATCTGGTCACGGATAAAAGCATGTGCCTTTTCATTCTTCTCT 

GGGATGTTACAACAGCAACACGCTCTAAAACAATTAAGTTACATGCCTAATGCTAAAAGA 

AATGTGAGCAATCCTATAACCAGCT 

Sequence 63 

ACTATAGGGCGAATTGNTGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCACA 

AACTCTGAAATTGGGTTCCATATTGGCAAGGCTGCCACAGTTGTTAAGAATAATCCTCTA 

TGTTTCTTCCTCACAAAACCATATCTCATTTATATCCAGACCATTACTTCACTATAATTA 

CAAGGACAAATTATTAGCMGAAATAAGMTAGTATTAGMGAATTGATCCTATTTTGAA 

CCCCTCTCCAGTATCTTCACACTCTTGTCAACTCTCCAGGCCTCTCTCTTGCCCTGAGTT 

ATCAGCCTGTGTGGTGTTAACTACCTTAGAAGGTACCTCGGCCGCTCTAGAACTAGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGNCGACTCGAGGGGGG 

GGCCCGGTACCCCAGCTTTTTGTTCCCTTTTAGG 

Sequence 64 

CCGCGGTGGCGGCCGCATCCTGAAGACCTCCTTTTTGGGCACCGGACGCCAGGTGTTCTA 

CAAGTATGGGAAACTCTCCAAGTTATCAGAGATTGTCTACGACAGTACCGCCGTCACCTT 

CGGGTATGACGAGACCACTGGTGTCTTGAAGATGGTCAACCTCCAAAGTGGGGGCTTCTC 

CTGCACCATCAGGTACCGGAAGATTGGCCCCCTGGTGGACAAGCAGATCTACAGGTTCTC 

CGAGGAAGGCATGGTCAAT 

Sequence 65 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGAGGTGTGGTGCCTATAAGCCAGGG 

TCGGAAACTTTCCAGACAAACCTAGATAGCTCTACTAGGAGGGAGTCAACGACCTATTGT 

GTCAATTAGATCCCAGCCTGGACAAGGGCATGACTATTGCTGTTTGGGGACACGTGGCTC 

TGTGTTGAAGGCAATGACTGTGTGGCTGTTNGCCATGTGGCCTGTTCTCCCCTAAGCCCT 

GAACCAAAGGTTNCAGGCCCAGATGAAGAAGGCCCCCTGGGAATTGCATGCCATANGGGG 

TTGATGCTGGACATTTACCCAGCCNTCCATTAAATGAAAACC ATi lllllll l AAAACTT 
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ACCTGNTTTTNAAAAAAA I I I M I I l" NACCCANCAAACCCTAAAAAACCTGCAAAANGGG 
GGTTGGGTTTTAN NTTTTNAAAAATTNTGGGGAAAAATTAAAAANTTTCCCAAAGAAACC 
CCCTTTTNGAACNTNNTNCTNNNNANNCNNNAAATTTTAANANNANTNTTAAAMNTTTT 
T 

Sequence 66 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGTGGAGTCGCAGCTGCAGCGCACAGAGACTGG 

AGCGACGGAGACCGTCACGCCCTCAGAGGCCCCGGTGCTGGCAGCTGAGCCCGAGGCTGA 

CAAGGGCACTGTGTTGGCACTCACTGAAAATAACTTCGATGACACCATTGCAGAAGGAAT 

AACCTTCATCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTG 

GGAGGAACTCTCTAAAAAGGAATTCCCTGGTCTGGCGGGGGTCAAGATCGCCCGAAGTAG 

ACTGCACTGCTGAACGGMTATCTGCAGCAAGTATTTCGGTACCTCGGCCGCTCTAGAAC 

TAGGTG 

Sequence 67 

CTCCCCGCGGTGGCGGCCGAGG H ( 1 1 I GTTCAATACATTTTAGATTAAGGA TTGACA AG 

TAAAGATACTGCTATGGAATGATACATTGTATTTTCTGCATTGTGTGAAATAAGTTTTTA 

TTGAMGTCAAGTGACATTTCAAAAGAAGTTCTATAACAATTATGTTTCATGCTTAAAGT 

AAAAATTCCCAGAGTTTAGTTTAGAAAATGTMTCTTTTAAATTTCAGACTGATATATTC 

CCAAGTATTTTCAT 

Sequence 68 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTGACTCAAATTTTCTCTAATCATTCATACT 

CGCACATTCACTCAAGCATACATACTAGACATAAAAATTGTGCAAGATCAAGCATCAGCT 

TTCTATTTTAAAAGCCTTAATTTAAAAAAACTTAGATAGGCTCACTGCAG^ 

GGAATATTCAGTACCTCTGAAAGGTGGCATTCTCACAGATGGGTTGAATGAAGCTTTCAT 

CTGGACATTCTACCACCACAAAAGCAGCACCAGGATCTGGAGGGAGTACCTCGGCCGCTC 

TAGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTC 

GACCTTCGAGGGGGGGGCCCCGGGTACCCAGCTTTTTG 

Sequence 69 

CCGGGCAGGTCCAGGGGTAAAAAAAACCCTTGTGAATCCTGCACACATCCCTTCCTGTGT 
TGGTGTTTCCTTTAGACTGGCCTAGCCTGGGCATCACTGTAAAGTATGTGCAGTTGGTCT 
CTGTCCTGTACCT 
Sequence 70 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAATACCAGTGCTTGTTATACTAGTTACTA 

AAAGAAGAAGAAACTCAAAATTCCTATCTGCGTGCTAATTTGAAAAGAACAACGTAGATA 

GATTTGTTGGCACATATATATGGCATATTCACATATGGCATATATACATATGGGGAGAAA 

ACATGAACCAAAGGCCAATTCAGTTATGGGAGCTCATCTCCTTCCATCTCTCCTAATCAA 

GAGCAAAGGGAACAGCAAGGCCTAACAGCAGGGTTGGGGAAGGCAAAAGGACTGGGCACT 

GAACTAAGTGAAAGGGGCGTCTGGTCTTATTTCAGAGGAAGAGGCTGGAATTGGGCTTT 

Sequence 71 

CTCCCCGCGGTGGCGGCCGAGGTGTGCTATAGACGCACAAACGACCGCGAGCCACAAATC 

AAGCACACATATCAAAAAACAAATGAGCTCTTATTTTGTAAACTCATTTTGCGGTCGCTA 

TCCAAATGGCCCGGACTACCAGTTGCATAATTATGGAGATCATAGTTCCGTGAGCGAGCA 

ATTCAGGGACTCGGCGAGCATGCACTCCGGCAGGTACCTGCCCG 

Sequence 72 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACCCCTGCT 

GCTGGCTGGGTTCTCAGCCCCTGACCATGAAATGGTTCCACATCGATCTACAGAAAAAAC 

AGCCACTTTGGCCCGTGGGCCCTGACGCGTAATATCATTATGGAGTCAGCAGCGCATCCC 

GTCCCATTATCGCCTGGCCAAGCTTTTCCAAGGCCAGCCTGGCTGCAAAT GGCTT TCTCC 

TGTCTAGGGGGGCCACAGCCACTGTTTGCCCTCCACTACCGGAGGAGAAATTTTATCTTT 

AATTGACACTGGCTGCCTGCCACCCCCTTGCCTTCGGCTGGCCCAGGCTCCAAGCCCGGA 

TCTTTTCCTTACACTTCAGGAACAGGCTTCCCCTGGCTTCTTCCCTTCTTCTCGGCAATT 

CCCGTCCTCACCTTGGTTGTTCTACACTGGCCAAGACAGTCCNGGGGGGNGGGTCTTTGG 

CCTTT 

Sequence 73 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCTCT 

CCTTCCCTGGCTCTGAACATCATTCAGAACAGGAACATGGGGAAGCCAGCCCAGCCTGCC 

CTGAGCCGGGAGGAGCTGGAAGCACTCTTCCTCCCCTATGACCTGAAGCGGCTGGAGATG 
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TATTCACGGAATATGGTGGACTATCACCTCATCATGGACATGATCCCGGCCATCTCTCGC 

ATCTATTTCCTGAACCAGCTGGGGGACCTGGCCCTGTCTGCGGCTCAGTCGGCTCTTCTC 

TTGGGGATTGGCCTGCAGCA TAAG TCTGTGGACCAGCTGGAAAAGGAGATTGAGCTGCCC 

TCGGGCCAGTTGATGGGGACTTTTCAACCCGGATCATCCGCAAAGTTGTGAAGCTATTTA 

ATGAAGTTCAGGAAAAGGCCATTGAGGAGCAGAATGGTGGCAGCGAAGGATTGTTGGTCA 

TGGGAGCCCACGATGAAAGACCCTCAGTGACGACCTAANATGAAGCAGCAAAG 

Sequence 74 

GGGCNCTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAT 

AGATCCTGATCTACTGGCAGAGCTCAGCGAAGAACAGAAACAGATCCTGTTCTTCAAGAT 

GAGAGAGGAACAGATCCGACGATGGAAAGAAAGAGAAGCAGCTATGGAAAGAAAGGAGTC 

CC TG CC AG TG AAACC CAG AC C AAAG AAAG AG AATG GG C AAATC GGTTC ATTG G AAACTTG 

GAGCTGATAAG 

Sequence 75 

TTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGCCCGGTACTTTGNTTCTNTT 
GGGNAGCNTAANACAGNAACCCTCAAGGAGCTAGAGAACCGGATGGGAGACATGAGCAGT 
AAT^CTCACTTGTTCCCCAGAGTNTCTATTTGGANNTG^ 

A7TTTCCTATTTTCTTTCCTCCATGTAATTTTCACTATGGCCCAACTAATATAAACACCT 
GGAAATTACAAGGAAAAAAAATTCTTCCTCTAATMCTNTCCAMTTTGNGGMTATTTA 
TTTGTAANANCAGTTATCAAGGTATGCTTATATAGCAT 
Sequence 76 

GNGGGCGGCCGCCCGGCCAGGTNCAGCGTGGTCAAGGTAACAAGAAGAAAAAAATGTGAG 

TGGCATCCTGGGATGAGCAGGGGGACAGACCTGGACAGACACGTTGTCATTTGCTGCTGT 

GGGTAGGAAAATGGGCGT AAAGG AGGAGAAACAGATACAAAATCTCCAACTCAGTATTAA 

GGTATTCTCATGCCTAGAATT7TTGGTANAAACAAGAATACCTTTCTNTGGCAAATAACT 

AACCCNTGGNGGGACCW^ 

TAAAAAGGCC TG I MM 1 GGGANAAANAAANAAAACCTTTTTTNNTTAANNAACNTTTTN 
AAAAAGN NGAi mm I GGGGGGGGTTTNCCNTTTTTNACCCNTTNTTNAAAAAAAAGGNA 
GGGGGG 1 I I i I AAAAAANTTGGGGGGGCCCCCCCCCCGGGNGGGGGGGGGAANTTNNAAA 
NCNAANTTTTTTNCCCCCCCCC 
Sequence 77 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTCTCTTTGA 

AATCTCATAGGATAGTCACACTTATAAACATTCCCAATATTCGGATTCTAGAAGAAATGC 

AATTCATTAAMTTrrCCTGG CACT GAGAGTTAATCTTTAGCAGATTGCATGAAAATACT 

GMTTCCTGGTAAGGAGATATTTTGTTTTAAAAATAATGTGTTTTGATACGMTCAGTGT 

ATTAACTGATAACTAAAAAGTACCT 

Sequence 78 

CCGGGCAGGTACGTTGMCGTTTATTACAACTAATTGGCGATGTGATAAGACAGTGCTCA 
CGTGGCCTGAATGTTGGTCACAATCACAACAAAGCTTAATCCAGCCCAGCATATATAAGT 
GAAAATATAAACCATGAAGACATGTTTAGATATGTATAAGTACCT 
Sequence 79 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGGTTCTCTTGG 

TCTTATTCCCATCCTGGCCMTGCTTAAATACTATTTGTTGAAAATAATTCT7TGAGACA 

GAmCAGCTACCTCCCTTCCAGGTTCGATTTMCTTGGTTGTMTTGTCMTTTGTTGT 

TATAGGTCTTACCTGTGTG AAAG AAAGAAAAAGAAAGAAAGAAAGAAAGAGAAAGGAAAT 

TATAAGGTCAAGTTAACAGTTTTGAGGTTTTGTGTTTTTTTCTGGAACTACT^ 

GAAAATAAAAAAAAATGGTGACAAAGCTGTACCT 

Sequence 80 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG 1 M 1 1 M M 1 M M 1 GAAA 
AGGAAGGAAAATATTTTATTAAAAGTTTTTTAAAGTTTATGM 

AAACTACTTGATACAAAAACATTCAGATATTAGATGTCAAAATAAATCAGCACATTTGAA 

AATACTATAAATTATATTTCAAACAAATATATACACATTATGTTAACCTTCAACAGGATC 

CATTATCACTACTTAGMCACTGATATGTTTATCTTTTAAGTATGTAAAAATTACATAGC 

TGTTAACTTTGTATGGCMTTCACCTATMCACATTTAAGAMGCATTACAAAATTCATT 

ATATAATAATTC TC AAAAGTTTTCTC AC ATT 

Sequence 81 

GGTACTTTAACTTrTGAAGGTGGTTTCTGGTTCCCCCAATCGGTGAGCCCAAAGACCTCA 
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AATAACATTTTATTCACACAGACTTCTTAGAGATGAAAAGTTTCTAGAGAGCAGAGCCTT 

TAGGGAACTAGGGGATGATGCATTCACTGAATAAATCACAGTATTATACTCAAGAGCAAA 

TACGTTTTCTTTCTTATTGGTGTCATCTTTCCTTGTGAAATACGGCAACTGATGAAGAAG 

TCTCCTATTGAGAATAACCAGACGAAATCACAAGGCTAGACAAGCAGCTACTAACACCAT 

CCCCTTCCCCCTGCCCATGGTAAACCCTGCTGGCAAATGTATTTCTCCCTCCTAAACCTG 

GGAGGACTCACGTTTCTGCACCCAGAGCCCCAGGGGGAAGTAAACAATGGGGAGTGAGCC 

GATGAGGTGGGAT 

Sequence 82 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTCTTATAAAAGATTCTTTCTCCCAGAATT 

ATATCTCCTCAGAGCAACAGCAAGGTTCTCAGGATCGAAGCCTACTCTAGCCTGAAGGGC 

TAGGAAGATTAGGATAAGGATAAGGATAATAATC C AAAAGTCTC G AC AATTCC AGTAGTC 

TCTGGATGGCTCCMCATCATAGAAATTTAACACTGTTCCACTTGTTTACAAAATCTAAC 

ACTGGCTTAGACATTCTGGACTTTCAGTGAGGGTTCCAGCATCTGATGTCCCTCAACTCC 

TTTCCAGGGTGAGAGGCCCACTTACAGGAAACTTAACTTCTCACCATGTGGACCCTGAGG 

GGTTTTTCTCTTGCAGAAGGGCTAAAAAAGTGAGGAAGCTATGGAAAAATGGTGGGTCAC 

TTTTAAGGCAGACTGTTTGGGTTGTGTGGGAATTTTTCCTAACCTAAGTTCTACCAT 

GGGATTATTC 

Sequence 83 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGTGCGTA 

GGTGGAGCGGTTGAAGCAAGGTGGGTGTGAAGCCCCGAGAGTGGTGCTTTGGTCGGACTG 

GCACACTCCGGGCAGCTGGGAGAGTGACAGGGAGGCTGAGAAGGGGCCCGCTGGATCCCG 

CAGCTGGCGCGCACATAGTGTCTGCTGCCCTGCTTAGGACAGTTCTCTTTGCATCCGTGA 

GGAGCCAAGACAAGTCACAACTGCAAGTGAGGGGGTTGCCAAAGAGGTTGATCGATAGGA 

CCTGGGAGGAATCCAGGGTCCAAGGAGGGAAGGAACTCAAGTTGCAGTTCTGGAACAGAA 

G G ACCTATAG ATGTGGAGTATCTTG AAAG ATGTGTGTGG CG AC AG AAGC AAGTGCTGGGG 

GAGTCTTGACAGTCCAGCTGTTGCAGGTGGGGGGTCATCTTGAAAAATGTCCCCCTCAGG 

GGTCGTCAAGCCGAGGCATNTTCACCTNGGNCCGCTTTAAAACTAAGTNGGATCCCCCCG 

Sequence 84 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTG 
G AAAC AATAG CTAATACCGG ATATAGTTATTTATCGC ATGATGAG TAATAGAAAG G AGCT 
TCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCT 
ACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 85 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGG CCGCC CGGGCAGGTACTCTTTGA 

GGACATTTTTGTCAGATTMCTATMCAGTGTAGTGTAGTTTTTAAAATTGCAGTTGAAA 

AGTTTAGCTGTCTTGGAAGTCAAATTTATCCMTTGTTCAGACTTCTGTTACTACTTAAT 

ATGAAGCCACCATGCTGGCTTGGACAGAATTMTTTCATTCATGTTATGGAGAATTCTAT 

ATTACAAATCTGGTCCCCTATAATATGAACAGTGAGCAGTCAGAAATATACAAAGGGTTA 

AATAGGGTAAAGACTTTGGCCAAGAAAGGAAAGGCCTTAGTTCTACCATAGAGTATCTTC 

TCTAATTAAAATGACTGGGAAATATATGGAAGCAGAAACCAGCACAAAGCACTACCCATC 

TAGAAATAATTTTTCAGTTAAAAMCAACTCTCAAMCCAGCACTCATTTCTCTAAGATA 

GGGTTATAAGTATTTTACGATTTCTTGGTATATTAAAAT 

Sequence 86 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGCCAGCTCCCACC 

ACTCACGGCCTTCATCCTGCCTTCGGGAGGCAAGATCAGCTCGGCGCTGCATTTCTGCCG 

GGCCGTGTGCCGCCGGGCCGAGAGACGTGTGGTGCCTCTTGTCCAGATGGGAGAGACCGA 

TGCGAACGTGGCCAAGTTCTTAAACAGACTCAGTGACTATCTCTTCACGCTAGCCAGATA 

TGCAGCCATGAAGGAGGGGAATCAAGAGAAAATATACAAGAAAAATGACCCAT 

Sequence 87 _ 
CCGCGGTGGCGGCCGAGGTACAGAAGAGGTTCGTGGAAAAGGAATATGCTTAATAACTTC 

GAAAAAAATACCCATAAATGGACAAGGGTGTTGGGCCCCCTTTCTGGGTGGATTCAGGGG 

TAGTCAGATATTCCAGGTTAAACTGAACCTGATGCATCAATCTCCTTCTTGGAGGTGTCT 

CAGTGTGCCAGACACTGAATCCACCTTAGACATCTATTGCCATTCAGCCAGCAACAAACA 

CACATTCACACTCCTCTGTGCAAACCTTTCTTTGCTCATCTTGGTGAGAAAACACATCCA 

AGAGTACCTGCCCGGGCGGCCGCTCTAGAACTAG 
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Sequence 88 

£II AGGGG ^ 

^^I 7717 GGTG ^AGAmTTTTCTTTGTGTCTTTnTTnTTTTTr T 

rTTTTTTTTTCTTTTI (A CAATAGG CCAGTAGMTTCTTATGAGAAACAAGCTTTTAAGC 

ACTTCMTGGAGTGTCATTTTTTTGGTAGCTTCCAGAAGGTGCCAMCAGGm^ 

CATATTAATTAGAAATACCCATAATMTTATAACAATAATAAAAAGGTATAATATGGTAC 
CT 

Sequence 89 
?™™ GGGCGM ^ 

A^ATCTAGCTGCTGACATTGMGAAGAGCTTGTTTGTAGACTGAAMTTTGCGATGGGT 

TTTCACTGCAACTAGATGAATCAGCTGATGTTTCAGGACTTGCTGNGCTGCTTGTGTTTG 

TTCGTTATAGGTTTMTAAGTCTATTGAGGMGACCTACTCCTGTGTGAATCTTTGCAAA 

CTAATGCTACCGGTGAAGAAATATTCAACTGTATCAACAGTTTTATGCAGAAACATGAAA 

TTCAATGGGAAAMTGTGTTTGATGTTTGTAGTGATGCTTCTAGGGCAGTGGATGGGAAA 

ATTGCCGAAAGCTGTCACCTTAATAAAATATGTGGCTCCCGAAAGCACCAGTAGTCACTG 

CCTATTATACAGACATGCCTGGGCAGTTAAAATAATGCCTACATCTCTNAAAAATGTGCT 

AGACCAGGCTTTCCTGTACCTGCCCGGGCGGGCGCTTAAACNAGGGGGGATCCCCCNGGC 

Sequence 90 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGAACAATG 

ACAGGGGAAGGGTATTGGACACGGCAGCGTCCTCCTTATTGAAAACACATTATGTCAGTT 

GGGAATTTTAAATMG CTTTTA GCAAACCTMCACT 

ACCATTACCATAATACATTTTTCATCTCATGGCTACMTGGAATTNTTGAAAAGGAAAAA 
AAAATCCTATCTACATATAAAAACCTGCATGAATGAATCACTACATATGCTTATAATGAG 
GAAG^TTATGGGTCCTGAGTGTAATTrTTTATCCTTTCTTAAA^ 

CATTTTTGATAACACTCTGATGATCCTTCCCTTACATTTGAAATGTTATGTACCCTNGGC 
CGCTCTAGAACTAG 

Sequence 91 

CCGCGGTGGCGGCCGAGGTGGCTATTGAACTTCCTTTTCTTGTTTGMGTTAGCTTCAAA 
TTTGTTCCTATCCAGAATATTTACAGGTAATTTAGCATAGGAGCAAATTACCTGTAAATA 
TTCTGGATAGGMCTACTTGAAATAGTMTTTGTTAAAAGATATGACAAAATGAAAATGC 
TTAAACTAC AGAAATTT AAAAA TGC C ATAACAATCTTGC G AG AC TAACTTTAAAATATAC 

TTTAMTGATTATTATGATTTTGGTGGTAACGATCCCCCACACACAACCACTATGAAGAA 

ATAATGCCGCATTTTTCCCCCATTG 

Sequence 92 

ACACTAC7TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACA 

TGGGTGTTTTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTG 

CCATGTAGGCTAAGAAAGAAATGCTAACCTGTGGAMGTTGGTTTTGTAAAATTCCATGG 

ATCTTGCTGGAGAAGCATCCAAGGAACTTCATGCTTGATTTGACCACTGACAGCCTCCAC 

CTTG^ACTATTCTMGGAGCAAATACCTTAGCTCCCTTGAGCTGGTTTTCTCTGATGG 

CACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCTCAGGGCATGCT 

TATTAGCAGCTGGGTTGGTATTGGGAGTTGGCAGACAGGGGATGTTCAACTTAATGAAGA 

AATACAGCTAAGGCCTTGCCAGCAACACTTGCGTAAGTTACTGGCTTGAGTGAGGGCTTA 

AAAAATTNAAAGGGTACTGGGTTTTATCCTCTATCCCTTTTTCCCT 

Sequence 93 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGATGGACGATACTGGCT 

ACAGCGAGAAAGCAACCTTTGCCTCAGTGACATATCCTCGGGGCTATCCCTACAACGGGA 

CAAACTATGTGAATGTCACCATGCACCTCCGAAGTCCCATCACCAGGGCAGCTTACCTCT 

TCATAGGGCCATCTATAGATGTTCAGAGCTTCACTGTCCACGGAGACTCTCAGCAACTGG 

ATGTGTTCATAGCCACCAGCAAACATGCCTACGCCACATACCTGTGGACAGGTGAGGCCA 

CAGGACAGTCTGCCTTTGCACAGGTCATTGCTGGATCGTCACAAAATTCTGTTTGACCGG 
AAT 

Sequence 94 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGCTGAAATTGAGA 

CTCTTCTCCCTTTTGAGCAGAGGTCATTTTCTCTTGACATTGAATTCCAGTATTTGTGCA 

GAAAAGAGGTGTCAATGAGGACTCCATGGATACTTTCAGTTGATCAGTATTGGCTCTGCT 
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GCAGGGTGATTTTGGCTTCACACATAAATGTTGCAGTGGCATTTGTCCAGGGGAAAGCCC 
GCTTGGAGGCTGTAGGATGACCCTGATGGAGGCTGTCACTCAAACAGCTCCAGGTAGTCA 
GGCAGCTGGTCCTGCTGACCACAGGGATAGAGAAGGAGCTCCTTCCCCGGGGAAGTGATG 
GGTT 

Sequence 95 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGAAACCCACCTCACC 

CCGGCTCACATCTAAAGGGGCGGGGCCGTGGTCTGGTTCTGACTTTGTGTTTTTGTGCCC 

TCCTGGGGACCAGAATCTCCTTTCGGAATGAATGTTCATGGAAGAGGCTCCTCTGAGGGC 

AAGAGACCTGTTTTAGTGCTGCATTCGACATGGAAAAGTCCTTTTAACCTGTGCTTGCAT 

CCTCCTTTCCTCCTCCTCCTCACAATCCATCTCTTCTTAAGTTGACAGTGACTATGTCAG 

TCTMTCTCTTGTTTGCCAGGGTTCCTAAATTAATTCACTTAACCATGATGCAAATGTTT 

TTCA7TTTGTGAAGACCCTCCAGACTCTGGGAGAGGCTTGGTGTGGGCAAGGACAAGCAG 

GATAGTNGAGTGAGAAGGAGTTGGAAGGGNTTTA 

Sequence 96 

CCGGCCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATA 

CCGGNTATAGTNATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTNACTTAA 

AAATGGGGGTGCGGAACATTANTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGG 

TTAGCCGGGCCGAG 

Sequence 97 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TMCTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCT7TCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 98 

TATNCTGGAACTGAGCATAGACCTNTTNCCAGGCAGAGCTGACAGCAAGTNAAGGAGATC 

ATAATCATGGGGACCAAACAACTTTGNCTAAAGTGTGAATGTNACCTAAGGAGAAGCTGT 

GAGATCAGAAGGGNGGGGCAGAGGAGCAGACACCATGAGGGAGAGTCCTTGGGGGTACCT 

GCCCGGN 

Sequence 99 

GGGGAACACCCCCCNGNGGCGGCCGANGCCMNANCTTTTTGNGGGGGAGGNAAACCCCA 

CCCCCCCAGACAAAACNGNCACGNACCNCGGNNTTTNTAGAACNAGGGGNGCCCCCCCCC 

ANAAANAANNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnTTANAAAATTTTTTTGAGGGGGGGGGGCNCCCCCGNAAAAAACANGGNCANAGAAG 

NNCCCCCGNGNGAAAGGGGGANCCGNNCACAANNNCACACAACACACGAGCCGGGAGCAN 

AAA 

Sequence 100 

GGNAACCCCCCCCNGNGGCGGCCGCCCGGGCAGGNTCNCTTAGGGGGGGGGNAAACCACC 

CCCCCCCCNGNGCGAAAGNCNACCNNNGATGTTTTCAGNTNCANGGGGGGAANCNCCCNG 

NAAAACGGGNGNGCAGGCNGNAANNNNCNNNCNNNANAAANNCNGCAAANCNAAAGAACG 

ATTTTTCC 

Sequence 101 

ACCGCGGNGGCGGCCGCCCGGGCAGGTACTTTTGTNGGNTGGGGAAAAAACNCTTTAACA 

AGCTTAGATTTATTATNGCGCCTTTTATGGAAGATGAGGAAGAAATAGGAAAGCAGTAAA 

ATGAACAATGAGAATCTGTTACCCAAACTGATATAAACTCCCTCAGAAAGAAAGATACTA 

CTCCCTAATCTNTNGAAGAGGGNAGNGGCAATAATGATCAAATGCCGNAAAGGGGANTGN 

CACTACAGCTAATAC 

Sequence 102 

AGGTACAAGGGATTTTAACCNTTNNNCANAAANCCCANNNGTGAATGCTTNGCCATACTT 

GCTTTAAGGTGTTATGGTNGGCACAGTTTACTGGCTTCGCCTGTTAAATTTACAAATGTC 

CTGTTTGATACTACTTGTTAGMC ACTATTTTTn* AAATAC AGAAA AAG CTN CCTATAAT 

GGCAACTTTCAGANGAAATTTAAAAAATNCACAGGAGGTATTTTATTTACCCMTGG 

GCCAGGGTTCCCTGGC 

Sequence 103 

CCGGGCAGGTACAGTTGGATTGACTACAAAAAAAMTTAGGC CTTG TCCATTTATCCAGA 

GGTTTCCTTCAAAACTTTTAAAAAATTTTATCACCTAAAATGGATTTTA 

TTrAGATGTAGCAATTMGTAAATCTTAAAGGAGGTGTAGAATTCCTTTAAGAAGTTATT 
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GTCTGCTTCCCATTGCAGCAATGTCTGCAATACTACTTTTTTTAATGAGCCCAGAAACAA 
TTCTCTGCAACTGTAAAAATTATGCCAGMTGCAAGTTTTTGGTTAGTGCCATAGTTTCG 
TTTTGTTATTGAAAATGG GATAAGTCATG ATTTATTCC CTCC CTTCTATGG AA 
Sequence 104 

AGGTACTTTTACCTTGAATCTACTAGAACTCTCTGCTATTCAAAACAAAGAGCTCATACT 
TGTTGGAGTAGGGAAAAAATTAGAAATTTGACCAAAAGATAGATTCAATCAACTACAAAG 
TCAATTCC AAG ATGCTGATAAC ATCG AAACTCTTGAAAAAG AAC TATTAG AATCTG G AGT 
TGAACTTTAATGGATCATATACCTGTTTTGCTTGATCAAGTGATTGATCAGCTTAATATT 
AAAGAAGATG GTATCTATTTAG ATCTTACTTTAG G AC G TGGTG G TC ATTCG AG TC AAATT 

TTAAAAAAACTTACTAGTGGCAAGCTTATAGTCTTTGATAAAGACAAA 
Sequence 105 

MCCCCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTGCCATGTCAGTNATGACTCC 

AATTTTCTGTGTGCAAGAGCAATCACACGGMGCCCTGTTTAGTGMGTC^ 

> I I IGAAAATCGACAGGAA CTGT TTCAGGTTTACAGAGACCGGCAATGGCCTCGGGCGCT 

CCTTTNATGTAGGCNTCCATTTTCCTATCCCCCAGCACCCTGGCAACCACACTCATACGT 

TGCAAAGCAGAAGAAAATGGGAACTGGCGAACAATTCCTATCTCATAAGTAGCTGGAAGT 

TCAAACAGCTCCATTTCTTGGTTTCCTGCAGGGGTAGATTCAGGAAGCAGTTGTTTGGGA 

GGACGAACCACTGTGGGCA 

Sequence 106 

CCGGGCAGGTAAAAAACAATTAGTAGATACTAAACTACTATTTATTCTTGCCGGTATGGG 

CGGAGCAACAGGGACTGGAGCTTCACATATTTTTGCAAAAGTAGCCAAAACTCTGAAATC 

TTTAACTATAGCTATTGCTATCCMCCTTTTGATTTTGMGATAGTA 

AGCTTCTGAAGGAATTAAAAAACTACAAGAAAATTCAGACGCGCTAATTGTAGTTTCCAA 

TTCTAAAA TCGCG GAGCTATACMCGGCATMGTATTTCAGATTCTTTTACAAAAGCTAA 

TCAAATTA Mill GATATTATTTAAACTATTATTGATTTAATTAGTAAACAAGCT 

Sequence 107 

NNNAANCTCNCCCNCCAAACCNNNCCCGGGNNNNNNNNGGGGGGGNTTTTNTTGGGGGGA 
NGGGGNNNNNNCCCNG NNNN NNNNGGCCGCCCGGGCAGGTACATTCNCAAGGGTTGGAAC 
CCAAGCCCCACCCTGGGTTTTCTTMGTTCATTATTCCCCCACCCCAGNAATTCCCTTGG 

GAGTTCCTTGGCCAGAAGCCATCAGAGACAGCAGGCGAAAAGCAGGGCTTAACTGAATNC 

CATATTGGGG 

Sequence 108 

AGGTACATCCCGAAAGACAGCAAAAAGAAGAAGCACCGAGCTGAAGATTACTCAGCAGGG 

CACGGACCCGCTTGTTCTCGCCGTCCAGAGCAAGGAACAGGCCGAGCAGTGGCTGAAGGT 

GATCAAAGAAGCCTACAGTGGTTGTAGTGGCCCCGTGGATTCAGAGTGTCCTCCTCCACC 

AAGCTCCCCGGTGCACAAGGCAGAACTGGAGAAGAAACTGTCTTCAGAGAGACCCAGCTC 

AGATGGGGAGGGTGTTGTGGAAAATGGAATTACCACATGTAATGGAAAGGAGCAAGTGAA 

GAGGAAGAAAAGTTCCAAATCAGAGGCCAAGGG 

Sequence 109 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 110 

AGGTACCGACCGCTCCGAGATCTGTATGAGTTGGAGGCAGGCCAGTGTGAAGGTGTGGGA 

GGAATCCGCCCACACGCAGCTTCATACAGCACCCTGAGGACAATGTGGCTTCCCTGATTC 

ACACCTACTGAGCC/^AGGCCCCCTCTGAAGTTAGGTCAAGAGGGCCCACCCTAGCCGGGC 

CAACTCATTCCTTTGAGGACCTGATACTCMTGAACTGCTTACTACCAAAAAAATGCAAA 

GGAT TCAG CAAAAA3TGCAATTTCCATTTGAGMGTAAGATTCTCCCCAAAACATGACAC 

CACCTTTTCATCCTITCAGTACCTGCCCGGGCGGCCGCTCTAGAA < 

Sequence 111 

C C GG G C AG GTACTAC C ATTTAGG AAACTG CTATAAC AC AT AATTTC ATG AAGTAAC ACCT 
AATACGGTGTAGTTCCCTGGTCATATTTTATACAATTCAACCATATAAAAGGGTGTCACT 
GTAATTTCAGTAGTGTGG GTTTACAAATAATCTG CTGGTTAGCTTATTACCTTG AGG TTT 
TGAAAAACTAGAATTATATTGAGGCATTTCATAAACATATCTCTTGCACCCTCTTCATGG 
TG G AGTTAAGG ATAACTTGC AGGTG GTTGGC C AAG GCCC AATATAG ATG ATTATAAC ATT 
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TAGAATTGGCAATTAGAAGTTGATAATCCATATAGGACCATAGG 
Sequence 112 

AGGTACCCACCTCACATTCTTTMCACTTAAGGTTTTTCTGGGTAGTAAGTGCAATACAT 
TCTTATTATAAAACAATATGGACAGTTCAGTATGTATAAATGAGGATAAAATCAAAATCA 
C CCACAATCTCAC CACTTTGTGATAATMCCA7TAACTTC AATATTGATG AATTTCCTTG 
MTGTTTTATCTACAATATTTCTTTC ATATAGTTG ATATC ACTC TGTATG C ACAAACTTG 
TATCCTTTTTCTAAATCTTMTATCATMTATTAGCATTTTCTAA'I GTTAGTAGATGTAC 
CTGCCCG 
Sequence 113 

AGGTACTATGAGTTGCCACCATGTCATGCAACATAACATATTGCTTTCTAATGGAATATA 

GTGAGCTTACGATTGATACTATGGACATAACCACAATAGCTAGAGCATAGTAATAGTATT 

CATCAGTGCTCCACAGTATAACACTGAACAGCTGGAAAATGTAAAATGGGTTGAGAACCT 

CTTTAATTAGAAGCTTAAAAACAGAAGGCACTTTTACAGCAATTTCAT TTACT CCATAAA 

GCAGTTTTCTGTAGGCATGCATCCCCTTTGTCAGTCCTGCACTATGCTTTTCATAAATTG 

ACGTACCTGCCCG 

Sequence 114 

AGGTACTGCACTGGTCATTGATAAAGATGTATTCAAAGGCTCACTTAAAGACTTCGAATA 

CAAGCTGATGGGAATCTGATTACTATTTTCACTGCTTGGAGCTGATCCAAATTCAGAGAG 

AGAAGGTTCTGACCCAAATTCCAGAGCCCCAAACTGCACATTTAATCCTGTGACATCTGC 

TGAACCAGGCATTTCCACTGCAGAAGCTGGGATCTTAGAAGCTGGGGGTATCCGCCGCTT 

AGCAAGTTTGATGTGTTTGGGCTGTGGCTGGTGGACAGACACAGAGATATTTTCAATGGT 

CGTGCTGGGAAGCTGCAAAAGCTTGTTCACAGTGGAGGGACTGTC 

Sequence 115 

CCGGGCAGGTACCTGGTTCTACAGATGGTTCTCATGCACAAAATTTCAGAACCACATTGT 
AGAAMGTAAAGCAGTATGACATGCTTTGGAAACTGCAGATAATTTAGTGCAACTGTATT 
ACAGGTTACAGATAATAAGAGATGAATCTGGAAAAGAAAAGAATGTTATGGTACCT 
Sequence 116 

AGG I HUM I CACTNNCGTATTGTTTATGGAAGAAGAGATTAGAGGACAATACAAGTAG 

CCACAGCTATGATGCATGGAATACTACAGAATATGGTGAAATGCTATGTAAGGGCTAGAA 

ACAATTCATTAGGTGTACCTGCCCGGGCGGCCGCCCGGNCAGGTACATGCAAATCGCCTG 

TGGTAGCCATAGTCACAGGATGTGTCTTCAAGACAGAAACTTGCTTTGTGGCCCTCAGCC 

ACTCTCCTNTGGGTGTTGNCATCAAGCANGTCATAGAGNCTAAACTCATCCATACTGTGG 

NAATG 

Sequence 117 

CN GGGC AGGTACCTAACCTACCTTTAAG ACTGGG ATAACTATTG G AAAC AATAG CTAATA 

CCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAA 

AAATGGGGGTGCGGAACATTAGTTAGTTGGTNGGGTAANGGCCTACCAAGACNATGATGT 

TTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 118 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACTTCCTTATTTCACAA 

ATTTAAGT^TGGTTTATATATTTTATTGACATGGTTACTCAATGTCCACAT CATTCCA TC 

TG CATCG7 CTTCCTAC AAAC AGTTTTTCTTCTACTATTCGGTTATTTCTCCTTTTTTTGT 

TTCCTATTT C AGAATCAAATTTATTTTACTTG C AAAGTCAGTGG AATATGGTTTG GAACC 

AGTAGGGOCTCTAACTTAAGCCCAGAACCTGTCAAAGAGAAGTGCAGTATCATTGCTAAG 

ACTTGAACAGTTTATCTCTCAGAATCTTCAGTTCCTTTGAATTTCTCAGCTCTTAGTGTA 

ATCTGTTrTATGTGTTTGTTGNAGACTTCCATTATGGATAGATTNCCAAAATAANT TGGG 

TAATCAACTGGTATTTTAG CATTC TGGGAACTAACATGTTTAAAAATCTAAGGCTCTTTT 

TCATGCCAAGGAGACTGATTTTTGGCTT 

Sequence 119 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGTA ATATC ATGAAGCTGC 
TGTCTGCACTTGTCAATATCCTGTAAGCCAGTAACAATAAAATTCAGCTTTTGGTTGAGC 
CCGGCCTGGCTGCTGGGCTGGAAGTCACTGACGATGATGCCGAGCTGCCGAATGTTCTCC 
ACGAACTTCTCCAGGTGCTCCTCTAGGTGGTCAAACTTCTCCGCCACCTGCCCG . 
Sequence 120 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAAGAAATTGCCAAGGA 
CAAAGTTTTAAAAGACTTTTATGTTCATACAGTAATGACT^ 
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^TAG AC AATATGG CTCCTAGTCCTG GTC ATATATTGAG AGTTTACGGTGGTGTTTTG C C 
T™GTCTGTTGCTTO^ 

A^TTCAGGTATACTCTGAAACTMTGATTGATAAAGCAAGTTTAGGTCCAATAGAAGA 

CTTTAGAGAACTGATTAAGTACCTGCCCG 

Sequence 121 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATGGGTCATTTTTCTTGTATATTTTCTCT 
^TTCCCCTCCTTCATGGCTGCATATCTGGCTAGCGTGAAGAGATAGTCACTGAGTCTG 
£™ G ^^ 

oIa^^ CCGGCGGCACACGGCCCGGCAG ^ TGC ^^ 

GAAGGCAGGATGAAGGCCGTGAGTGGTGGGAGCTGGCTGGTGTACCT 
Sequence 122 

G AAATAAAAATTAG C AAAC AATTATTC TAG GG ATATTTTC AG ATTTTAC TTCATTTCTTG 

'^^15^ G ^ZSS£^ A ^ G GA ATTG C ATTTCTTGTGCC AAG AAACTAATAG AACTTATTTC 

AG IJT ACCl rrTT1 1 AAAATGTGAATTTAGTTATTATAGTTTCAATTTTATGGCCTTACA 

GATGGCTTTTATTTTGmGCAGCNTGACACTGCAGTTCCTTTCATGCAAMTACCCATA 

^TGTTrrGATGGAAAMTTCATTGCCCCTTAATGGGAAAACCTCTCTAGTrrrT^ 

ATTATAAACTANTTCCCTACTGNTACCCTGCCCCCGGGGCCGGGCNCGCTTTTCGACCCA 

MCCATTNGTNGGGTNGMGCNATTTTCCCAACGGGNGCCGCCMTO 

GGGTNGCCTTGGTGCCTTNCCGMGTCNTNCTTGGNAMTMTTTTTTTGGATANGGGAA 

AGCCGGACCANAGGGAAAAAAATTT 

Sequence 123 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAAGTT 

CA^CCACCCTGCACTTTGGCCCTTAGATCAATCCTAAGTAGCCATTGCCAGTAGGCCAAG 

TTTAATCAGAGGACAGTGCCTACCAGTAAATACTGAATAGTTACAATAGTTATGTCCATC 

CAACCAGTAGCAGATGAACAGCTAATACATCATGATGCTATGCTCTCCTAACAGGGTCCC 

CTCAGATCCTCAGTGAGCACATAAAGAAAGGGAGGTCATATCCCTTACATCTCTACCAGG 

TATTAACACCTMCTACTCTCTAGCCAGAGGCAATTCCCTTTATTTCCTTACTCTCGTCG 

TCTTCTCTTTAGCCCAATCTCCTGACAATAGTTAAMCAAAAAGACCCCCAAAATATCTC 

TTGCTAAAACAGAAGTAGTCCCTAAACTCTCTCATC7TAGACTACTGTCAGGTACCTCGG 

CCGCTCTAGAACTAGGTGGATCCCCCGG 

Sequence 124 

CCGGGCAGGT ACAAA G CACT GGAATTGGGGAAATAGCAGGGTGTTTCCCCCACAATTAGA 

AGC^TCTTGCTTTTCATTTTCCTmACTGATTAGCAC 

GAATTTTTCAAAAACAGCATTTAGGGTCCACATTTATTTTAATTCTGATCl^ 

TAATTG GGTG AAACTTATG GTG AAAAAATATGC ATAGTTACTTTTG ACATAG ATTTGTTT 

AAGCATGAAAGCTAGGAATTGATTAAAACCAATACATAATATTTAGTTTTGTGTTACTTA 

GTTTCTTTGTAATAGTGTGTAGAATCATGTGAATTACTG 

Sequence 125 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTAGTTTAGTTTATTTCTAG 

TTCAAAAATAATCTGTAATTGCTGTMGAAATGTCAACCACTTACCTAGGATGTTTGACA 

ATTGGGATGAAGTCTACATATACTMGNAATGGCMGACAATTATTTTATTGCTCAAAAG 

AAAGTCAAAAAAATTCCATATTCCCTTTGGGGAAAATTGG CAGGATTTCAAGTATGAC CT 

TTAAG AA TC AG G AAAAGACTAACTTATGCTTTAG GATTA ^ AACAATCAAATAATTAAATT 

AGTTCAATTTTCTMCATAGTCTCTATCTTCAGTTAAAGT3CATCATTGCATGTTATACA 

TTACTAAAATTACACAGTGCATAATTGTTACCATGTGACTATTTAATTCAGGGTCAACTG 

TCTAAAGGTCTCAGGTGTCACATTAAGGGTTATAATCTAGATGAATTGGAACAAGAAAGA 
AAAGATTTCT 

Sequence 126 

AGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTCGNCCGCCTGACCTGGGGCAAGT 

GCTTTCAACTCTCTGAACTTCGGTTCCTCATCTGCCAGTTGGCAGACGCTCAGCAAATCT 

TCCTAGACTCACGGGGCGAATTGTACCTGCCCGGGCGGCGTTAGAACTAGTGGATCCCCG 
. N 

Sequence 127 

CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTCCCAACAGCATGTCCT 

TCCTGGTTCTCTACCCCCACAGCACTTCTTAGAGCAGAGGCAGAGCCCAGAAGCTGTGTG 

GGTCACAGGCAAGAGCTGAAGTAAGACCTGCAAGAGGCGGCAGGGAGCTAACTGTAGCAC 
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GAGGACAAAAATGAACACGGTAATACTGAGGTAAATGAACACTCAATTCATGTGGAGGCT 

GTAAACGTCCTGATGTCACCTCTGCCTCAAGAGCAGAAAATGTGACTGGAGTGGTTACAG 

GAGGGGCCTGCCAGACCCCTGTGGGAATACTACATCTGGGACACCTCAATCAAGGAGGCA 

AGAGAGAATTTCTGGCCACTGGCAAATGAAGCATACTGGCTTGCAGGGACCTTCTGATTT 

CAAGTACCTGCCCGGGCGGCCGCTCTAAAACTAGGT 

Sequence 128 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTTGCCGGAC 

TTTCCCTTAAAGAAGGAGAGGATCAGAAAGAGATAAAGATTGAGCCAGCTCAGGCTGTGG 

ATGAAGTGGAACCTCTACCTGAAGACTATTATACAAGACCAGTAAATTTAACAGAGGTAA 

CAACCCTTCAGCAGCGTCTGTTACAGCCTGAC7TCCAGCCAGTCTGTGCTTCACAGCTCT 

ATCCTCGCCACAAACATCTTCTGATCAAACGGTCCCTGCGCTGCCGTAAATGTGAACATA 

ATTTGAGCAAGCCAGAATTTAACCCAACGTCAATCAAATTCAAAATCCAGCTGGTCGCTG 

TCAATTATATTCCAGAAGTGAGAATCATGTCAATTCCC 

Sequence 129 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATAGATCCTGATCTACTGGCAGAGCTCAGCGA 

AGAACAGAAACAGATCCTGTTCTTCAAGATGAGAGAGGAACAGATCCGACGATGGAAAGA 

AAGAGAAGCAGCTATGGAAAGAAAGGAGTCCCTGCCAGTGAAACCCAGACCAAAGAAAGA 

GAATGGCAAATCGGTTCATTGGAAACTTGGAGCTGATAAGGAAGTCTGGGTATGGGTGAT 

GGGCGAACACCATCTAGATAAACCCTATGATGTGCTCTGTAATGAAATTATTGCTGAGAG 

GGCTCGGCTGAAAGCAGAACAGGAGGCAGAAGAGCCCAGAAAAACTCACTCTGAAGAATT 

CACCAATAGCTTGAAAACAAAATCACAGTACCT 

Sequence 130 

CCGCGGTGGCGGCCGAGGTACATCATATGCCTGCTGAAGTGCTCTGACTTTAGGATGAGA 
AACTCTAACATAGGCCGGAAGACAAATAAACCATAAACTGTAACAATGACTAA ACAGA CA 
CTTGGCCCACTGTGGTGGATTTGTATAACATCTCTTCGCCAATTTATGAGCTG \ I ll l AT 
TTCCTGTTTAGTTCTCTTAGCCATGAGAGGTGGACTCTTTGACCTGCCCG 
Sequence 131 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGTTTGTT 

TCCTAATATTTATTAACCACCTTATACCAAATGTCTTGCAAAGAAATGTTATTAAAACCT 

TGAATTTTTACAAATGTAAAAAACAAAMGTGTATTAATGTATTTGTTCAGG 

CATACCGAAGGGCTTTTGTATATGAATTCTGTGGTGGGGAGACCCATTTGTAATCTATAT 

GGCAGTTCCATCTGGGTTTTAAGTTTAGATTTCACCGTGTCTTANCTGCTTCATTCTATT 

GGTTTATTGGAAC ATGTAATAAATAGG AGTAGTG ATG TATTA AAAC AC AN GN NTTC ATTA 

atgntttatatcttcactaamttctatagttatgaaacttttatcmtcaaggtg™ 
atttcagtcagaagtgaaaa 

Sequence 132 

CTTAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTACCAAAATA 
GTTCAAAAGAGTTCTGTATTATCCTW 

GGGAAGCTACAAGACATAATCCTTCAGTGCTTAACTGGAAAGAACAGTCAATCCTAAATA 
TTAAAG AATCTTATG CTGTTG AAAGTAAATG A W\ATTTC G AGTTATTTGTG G C ATATTT 
GAGCAACCAATATTTATTGCATTTAATCAACTGACTTrAAAAAATTACAATCTTGTTACT 
TTCATTAATTTAGCATGTGAAATTTAATTAGAf'XATACTGGGTACCTCGGCCGCTCTAGAA 
CTAG 

Sequence 133 

ACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGAAATATTTTGTA 

ATGTGTTGATTTTAAAAAAAATCTGTAAATA/\AGTTT^ 

ACGCTGAAATATGTAGATATTTTGCTATTTA>TTTAAAGGAGTATT TTAAG AGATATTG^ 

CTATCTGAAATTGACCAGTAATCAAAGTTCCMTCATCTGAATGCT TTTCC TTGA GGTAG 

AATGTGAGTCTCAGAAATGACTGCATTACCTGCCCTTTTTTGC ACCTTTTC TGTCTTTTT 

ATTTTGCAGAACAACAACAACAAC AAAAT TGTGCCTTAGCTGTA I Mill (GTCTAGGGG 

GAGTTTGTTTCTGTCTGACCAAA.^GCA/ i .CATTTTTTGCAGAAMCAGGNGGATGGTATTA 

AATACTGGTATCATACCAAAAACCTGCCANGGTGGTATATAGGATGCTTTCTGTCATACT 

GGTGGI I I I I CAGATGCAGAATTTT 

Sequence 134 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGCTTATTCACATTTTTAC TAAATC CAACA 

CAACTTTCACAAATGGCAAAATGATTGCCTCTTCAAAGCAATGCAGCCTAGTTTTTGGTG 
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TCGATGGGCTCCTCGATACACGAGGCTTCATAACATGCGCTCTTTGAAGAACCATTTCTG 

ATGATCCGAGTTGGTGCAGTCTCGTAAGAGTGGAACGAAACTGTCACTCGAGTCCTTCCT 

5 G ?f GCCTGGACACAmc ^ 

TGAACTTCTGATTCTCTGGGGGCA 

Sequence 135 

?I[ AGG ^ GAA ^ GGAGCT ^ 

ATGMCTTGGMGAATAAAGCCTTCCAGTTGGATGAMGCACATCCAGCCTTTTGTGAGT 

? A ~ A ? C III GT ^^ 

!^? G ^^ 

TTAGAGCTGATATACAGATGTGAAACGATCGAACAACATGATTTCTCATTCTAGTGCTCC 
TTAGAAAGGAGTTCTGATAAGCCCCAAAGCAGACCTGGGTTGGAATCGTGGTTATTATTC 
AAGTACCTGCCCGGGCGGCCGCTCTAGAAACTGGTG . 
Sequence 136 

SII A ^ GCGMTOGAG ^ 

CCAGTTGATGACTGTCCATCTCTAGCCCCTCTAACCATGCTATGCACCCAGTGAGTGCAG 

MTGATTy^AGTAATTGATGTTGACmACMTTCTATGACATGTAATACAGGATGCCG 

AGGCACTTTTAAAATTCCACCTMCATGTTCTGGCACTGGAAMTCTTACTTGAGATTTT 

AATACTGGTTAATATAGGAAAAATTATGCTTMGCACCACTMCTTAAAAGTGTAATTTC 

ATTTTCTTGAATTTAGGTAAMGCAAMTTCTCAAAATTCTATTATGGCACATGGTTTCA 
CAGTTTCAACCT 

Sequence 137 

CCGCGGTGGCGGCCGAGGTACATACACATACTCTAGTTATGAGCACCTAGGGTCTTTCCC 

ATCACTCTTGCAAAAGTGTGTGTGNGTGTGTGTGTGTATACATACACTCACACATACATA 

TGTATACACAGACATCTATATNCANANATAATGTATATGTGNGTCTAATGTATATATAAT 

ATATACACATAGACATACACACAATGCACATATCATAGCTCAACCTNATCTAAACATCTA 

GAGTNTGGAGAACGTGCTGGCCAACAGAATACATGTGCATGTGGATGTCATGTGACATGC 

ACTAACCAACCATCCNGAGGNNAAACTGTTGGGAGGAGGAAACTGCCCACAATNTTTGAA 

AATCCAGGCCTACCAGGCCATTATTT 

Sequence 138 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGNGCCCNTTTTCAGA 
CAGTT NTNGATT CGCTCTAGAC I rTTTTTTTTTTI I AATAGGGAGGGAAAAAATTTGATA 
ATTT TCi n il I C TAC AT G CACTTMG ACTAAAAC AC AGGTTTGG ATTMTTTTATTTGC 
"^"^ ^ iililCCGC TTTTCTTCCCG C AG AG CCTG ATGGG AG AATGTC C AG GG C AGG G AAAC 
CACAl mi l GTAG GTGATMCTC AATG AAMTTGGTGCTTATTTTTTAC ACTTCTCTCT 
TGTGGCTCTCTTGNGGTGPTATCTGTTTTAAGGTCTCCTTGAAGGCGCACTGGGGTCCCT 
GGCCATGCCTCGTTCTCCCTGCTTTCTTTA 
Sequence 139 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTGCTATAGACGCACAAAC 

GACCGCGAGCCACAAATCMGCACACATATCAAAAMCAMTGAGCTCTTATTTTGTA^ 

CTCATTTTGCGGTCGCTATCCAAATGGCCCGGACTACCAGTTGCATAATTATGGAGATCA 

JAGTTCCGTGAGCGAGCAATTCAGGGACTCGGCGAGCATGCACTCCGGCAGGTACCTGCC 
CG 

Sequence 140 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGATAGAACTAAGGAAATAGTGGTTTTGAG 

TGAAGGGAAAGGAAACCCAGAAACATTTTACGTTGCTTTTACTTCTGTAGTGTAGATTGC 

CCCGGCCCC TCTCT GAGCCCTGTAGCATCTGTGATAGCTTNTGTCCCTTCATCGGTTCAT 

GTCACAGGGATTTTCTTTCCCAGGAAGCGGACACGGAGAGTCAGCCCTAATAAATGAGCA 
CATGCCCTGGC 

Sequence 141 

GACTACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATTGTTCTCA 
AG AG GG CCATC AG AAG G AAGCC AAAG AGTTC AC AGCCTC AG C AC C AAC AACTC AAC ATG G 
TCATCATGTTTTCTATATGGTTTTTCCAGCTAGCAGTACCTGCCCG 
Sequence 142 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGAGAAGCATGGGGCAGTAGAAAGAGCATG 
TGGGCTTTAGAGTTCAAACCAAGTCAGGCTCAAACATAGTTCTGTGATAAGCCCTGAGCA 
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AGTTACCCGGGTCTTCCATTTCCCCCTTCTGGAGAAGTCCTTTGGAGGATGAGTCCTTCT 

GGAGGATGAGTCCTTCTGGAGGATGAAGTCCTTCTGGAGGATGAGTTCGTTGTAAGAATA 

AAATGAGAATGTAAGACACCTAGAGGATGCCCGAGTAAAAAATGACAGTTGCTAGTAGTA 

GTAATTTGTAGGGCTCATTATCTAGAATAATTTTGTTTGACGTTACTAATTAAMTGAA^ 

TCTTAAAGAAAAGCAGTGTATTTAGACTCTTGTAGTTAAGAAAAATTACACCACAGAGCC 

CI 1 I MAC! 1 I I f AAATTCA I I I I ! ACATTTTAAATTCATTGCATGTATTCATTATG 

Sequence 143 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATATTAAGTGGT 

TCTATAAAAGCTATTCACMGTTCTACTGTGATGGACATCCTCATCATAGACAAA CCTC C 

CCTGTTTTATCCTCAATTTCTAGnACAGAAATTTGGTGATGCTTATTTTT 

ATGTCAAAATAAGNTAAAACTTCCCTCCTGTTCACCTCTTGGGTCTCTATCCTGTGTAAC 

CTNTGGTGTAGTATTTGCCCATAGGCAACCAGAGCCACTTCCTCTGAACCCAACATCTNC 

TGGGGACCTTCGCAGCANGAGGAAAGCACTGAGACAATAGCTTGCTAAGCAGGGGCCCAG 

NGGTGTCTCAAAGAAACCATGGNTGTNCTCGCCACTTCCCAGGGNGGGTGANGNGAGCTC 

GGGAACATAACGATGGTTTTG 

Sequence 144 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGATTATG 
CTTACCTC AG GTTTCTn"AGTGTGCTTGAATGCCCTTCTTTC C ATATAAC ACTTATCTTC 
TTCTTAATTCGGCAATGGAATATCTTTTAAGTTTTAAAAAAACTGGAATAAT^ 
TCTTTTTTGCCGTTTATATTTAGGGGTTTTTG 

TGCTGAATTAAATATTTATGAGTGGTGCATTTTTAAGTATAGTGAACAAGACACCATATT 

AAGTACCTCGGCCGCTCTAGAACTAGN 

Sequence 145 

CCGCGGTGGCGGCCGAGGTACCTGCCGGAGTGCATGCTCGCCGAGTCCCTGAATTGCTCG 

CTCACGGAACTATGATCTCCATAATTATGCAACTG GTAGTC CGGGCCATTTGGATAGCGA 

CCGCAAAATGAGTTTACAAAATMGAGCTCATTTGrrmTGATATGTGTGCTTGATTTG 

TGGCTCGCGGTCGTTTGTGCGTCTATAGCACCCTTGCACAATTTATGATGAATTATGGAA 

ATGACTGGGACATGTACCTGCCCG 

Sequence 146 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACCTCATCAATGT 

CTCTAAAACTAGGTCTTGAATCTAAAATGGAATTATACATAGAACTATTACATGAAGCAG 

TTCTTTCACTACAATCATTTTTAGCCCCCCAGTGACCAATCTCCTCAGGTAAACAATGTC 

TAATGACTG ATGATTGTTTACTGTATG C CAATTAC TGCTAAATCCCTC ATATAGATTATC 

TCATCTATGAAGACACAAGGATTATTAATAACCCCACTTCAGAGATAAACAGACACAGGT 

TAACACATCTCGCTAAGACCTNTTGTMGTGAAAAACCAAGTTCAGAGCCCAAAAAGTT 

Sequence 147 

CCGCGGTGGCGGCCGAGGTCTGTATCTCATGTTTGCTGATGGCAGTTTGTGGGAGAGGGA 

ACCATTTCAGCAATGATTTCAGTTAGC AAATAG CTTTGCAG CTTTAAC AG AACTCTCTCT 

GCTTGCTTTTTTATTTCCTGGGTTTTCTGGCCAGGCGCGGTGGCTCACGCCTGTAATCCC 

AGCACTTTGGGAGTCTGAGGTGGACAGATCACTTGAAACCAGGAAGCCAGGAGTTCGAGA 

CCAGCCTGGCCAACATGGTGAAACCCTGTCTCCACTTAAAAAAAAAAAAAAAAA/^TTAGC 

CTGTAATTCC AACTACTCG AGAGGCTGAGGC AC AG G AATCGCTTG AACTTG AGA AGC AG A ' 

GGTTGCAGTGAGCCAAGATCGCACCACTGTGCTCCAGCCTGGGCAACAGAGCAvAGACTCC 

ATCTCAAAAAATAAAAATAAATAAAAAATAAAAAAATA 

Sequence 148 

CCGCGGTGGCGGCCGAGGTACACTAAGTTTGCAACATTTATTGAGATCTAAGTCTGTCTT 

GCCTTCATTTCTCTTTTTATCTCCCCCTTGCCCTCATTCTTGAACAGCTGGAGGAATACA 

TTTTATTCTGTCCATGAAGCATACACTATGAAATTCAAGTGCTTAAAAATACTT'CTATGA 

CTCTCTGCTATCCCACTGTATAGATCCACAGGGAGCAAACACTTAGAAATGATAGAGAAC 

TGAAGGAGATCAATGGTTTAACAGTTATCCATGCCAAGTCCCATTGTCAGAAATATTCTT 

ATTACTCAGTCAAACACTCTTTGAGCTTCCCTTCCTAAAGGTAACCATT CCAGT GAATAG 

ATGTGCCCTTTTATAAGGAAACTTCTGATGTTTATTAAAA 

AGGGAACTTAATTTGGGAATTTGGTGNGTTGNAATGGCATTTAATT 

Sequence 149 

CCGCGGTGGCGGCCGAGGTACAGAAGTTAGAATTTTTGACTCCAGGCAGCAGTTTGCTCA 
GTGATCTTGAACAAGTTATCCAATTGCCTCTACATTTGCATCAGTTTCTCTAGCTGCAAA 
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^?^A T ^ TACTATATA CCTACCTCACAGTGGGAGGGCAGGAGATTTTGAGGCCCTGA 

V A H CA S AGGCCCATCGGCC1 ^ AGAAAGACAAGACAG GGAGTGCTTGm 
AAGGAA ™ AAG J^ 

GAA I ATTTTA1 1 1 n I IGGCAMTTTGATCTTACCCTTTACCAGTTCTATAATTTGGTTA 

Sequence 150 

G ™ GG ^ G ^^ 

^C^TCGAGmAMTCCJATGTATCTTTTGTTTCCAAACCATGCTGCAGAAACTA 
rT^J ?/^$IS?Ti?Z A ^ A ^ A ^ A ^^ G ^ AGG ^ G ^CTG AAGTTTTGAAAG AGCTTC CC AAAA 
CpTMGGTAGATCCCTAGTTTCATCATAMTGTAACTAAMGMTTGGCCTAAATGATT 
TCAAAGATCTCCACCAGCTCCTGG7TAAAAACTGCAAAGCTATGATGTTGTGGGCTTGTA 
^^ G ^ 

AGTTGGTGTGCACCTGTAATCCCAGCTACTTGGGAAGCTGAGGGGAAAGGGTCAC7TGG 
Sequence 151 

GG ?^^ 

™SIS? GCl mm 'ACTTTGACAACTGAGACAAAATGACAAATTGTCAGTGTTCAGAG 

a ™cagac^ 
£2^ GGC J^ 

rrm igggcccttamgcnagggggtttagggnannataacncccccttntnnnngann 

GG ^™ GAAAAACCC ^^ 

AAAAAAAAAAA 
Sequence 1 52 

ccgcggtggcggccgcccgggcaggtacattattttgatggacgaaaacggatgatcttg 
agcactatttcatggatgggaggaaaamtccatttrtggggattgcttacatcgctgtt 

GGATCCATCTCCTTCCTTCTGGGAGTTGTACCT 
Sequence 153 

CCGGGCAGGTACCMTTAGMTGTCTTCAGTTATTAGTAATAGAGCATCCTAAATCAACT 
GGCTTAAAMTMGmAGTCTCTCACAAAAGAAACAGTCCA^ 

GGCTGCTTGGTTrGGTGGCTCAAAGACATCATCCAGGTCTCAGGGTCTTTCAGTATTTCT 

GCTCAGGCCATGATTAAACTATAATCCA7TGCTGAAAGATGGTTACCAGGCACCACATCC 
AGACAAGGTACCT 

Sequence 154 

2? G £ GGTGGCGGCCG C^^ 

CACTATTGAAACTTCGCTTmGCTA^ 

AG^GATTTTAGTTCAAMTC^^ 

TGTCATCTMTATGAGTTTATTTGTCTATAGACACMGTATCMTGTCTAAAAAAAATCA 

TGACTTTAMCTTCCACCGATGAGGCAGGTAGGAGATAAAGATGMTTCTGAACTGTTAC 

TAAAAGTACGAACCAGCTCGTTATTAGATGCATTGTAGACAACATCGATGATCCTTGTTT 
TAACGAGTACCT 

Sequence 155 

TTCACATTTTTACTA MTCCM CACMCNTTCACANATGGNAAMTGATTGCCTCTTCAA 
AG ?^ TGCAGCCTAGTr ^ 

AGAGTCCTGGCTCCACAGTCTCCTTCGATGGGCTCCTTTGATACACGANGCTTTATAACA 
TGCCCl mi IGAAAAAACCATTTNTGATGATCCCNGTTNGGGCCCCNCTNTTAAAAAAN 
GGGAAAAAAMNTTGNGGNTTNNATTNNTTTT^ 

GNNGGGNNGG GAAAAAAA AAAAACCCCCCNCNCCCNGNANAAAAAAA I rTTTTTTTTTTT 
TTNGGGGGGGG nTTTTi NAAAAAAAAAAAAAAAAAAAAGNNNANNTNC' i rfTTTTTTT C 
CNNNNAAAAAAAAAANNNNNNNNNNNNNGGGGGGGG 
Sequence 156 

CACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTCGCGATGCCCCC 
GAGTGGCCTCTGCCAGGCTGTCCCGGGCCTCTCGGCTTCCCGGGGACCCAGTGGTGTAGG 
CACGGACCATGTTGTAGGCACCATCCCGGAGACGCCGCTGGACACAGTATGCCTCATACA 
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GCTCATCGATCTTGCTGGCATGGAACTCCAGTCGTCGCAGGAAGCGTTCAATGGACTTGA 

CTTGCTTGTCCAGATCATACAGGAAGCCCAAGCGGGAATTCCTCTTGGACTCCCTTATCT 

GCCCCTGGAGTTTCTCTTGCTCCTGCTGGTGCACTTCCAAGTAGGCCGTCAGGCCCCGCT 

TCAGCGCCCGTGTACCTGCCCG 

Sequence 157 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGCGGAGAAGTTTGACCACCTAGAGGAGCACCTG 

GAGAAGTTCGTGGAGAACATTCGGCAGCTCGGCATCATCGTCAGTGACTTCCAGCCCAGC 

AGCCAGGCCGGGCTCAACCAAAAGCTGAATTTTATTGTTACTGGCTTACAGGATATTGAC 

AAGTGCAGACAGCAGCTTCATGATATTACTGTACCT 

Sequence 158 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATNNAAATNATACGT 
TTTAATAAAATACATCTTTAGATCAAAGCTGAAAGAAGACATCAGTAGTAGATCAGAGNA 
TTCCATTT 
Sequence 159 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGGGTCTGAAAG 

TCGATGAAGGACGCGATTACCTGCGATAAGCTTCGTGGAGTTGGA AATAA ACTATGATAC 

GGAGATTTCCGAATGGGGTAACCTAACTGAGCAMCCTCAGTTGCATTTTGATGAATCCA 

TAGTCAAATTAGCGAGACACNGTTGCGAATTGAAACATNTTAGTAGCACCGGGAAAAAAA 

AAAAA 

Sequence 160 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATCATTTGCACACAGCAGATCAATAGGTGTC 

AGTCACCAGCTTAAGTTACACTTGTCAATATTCAAACTTGAATAAAATAAACACACATCA 

CMCAGCGACACTTTGCACTATCAACAATGAAGCTTGCCCTCAACAATTATGACGTTACT 

GGTTTTAGTAACATAAAAATACATTGCTGGTTGACAGGAAGGATAAAAATGACATCAAGA 

ATCTCAAAAAGTTATGTTGAGGTGCCGGTCACCACATTTAGGCACTCAGGAATTAAAAAT 

CA 

Sequence 161 _ 

CACTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACATACATTTCTTAG 

GGACACAGNATACTGACCACATNACCACCCTNTTCTTCCAGTGCTGNGNGGACCATCTGG 

CTGCCTTTTTTTCTCCAAAAGATGCAATATTCAGACTGACTGACCCCCTGCCTTATTTCA 

CCAAAGACACGATGCATAGTCACCCCGGCCTTGTT TCTCC AATGGCCGCGATACACTAGT 

GATCATGTTCAGCCCTGCTTNCACCTGCATAGAATCTTTTCTTCTCAGACAGGGACAGTG 

CGGCCTCMCATCTCCTGGAGTCTAGMGCTGTTTCCTTTCCCCTCCTTCCTCCTCTTGC 

TCTAGCCTTAATACTGGCCTTTTCCCTCCCTGCCCCAAGTG 

Sequence 162 

CTTAGGGCAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACATCGTCTTCGTCTTCCTCC 

TGGGCGCCATCTGCACTCGAGAGGCCCTTCGCTATGACTTCCTGATAGCGGAAAAGGTAT 

CCCTCATGGGCTTCCTGTTCCTTTTGGGCTTATACATCTCGTCCCTGGCTTCCTGCATGG 

GAGGACTTTATGGAGCTCCCCGCATCCTGCAGTGCATTGCCCAGGAGAAAGTGATCCCTG 

CACTTGCCTGTCTGGGACAAGGGAAGGGGCCAAACAAAACACCCGTGGCTGCCATCTGCC 

TGACCAGCTTGGTGACCATGGCCTTTGTTTTTGTGGGTCAAGTGAAACGTTCTGGCCCCC 

ATCGTCACCATCAACTTCATGCTGACAATACGTTGCAGTGGACTACTCTTACTTCTCCCT 

GTCCATGTGTTCCTGCAGCCTGACCCCGGTGCCTGAGCCGGTGCTCAGGGAGGGCG 

Sequence 163 

CCGCGGTGGCGGCCGAGGTACCACAGTCTTGCACATAAGTGCAGATTTGGCTCAAGTAAA 

GAGAATTTCCTCAACACTAACTTCACTGGGATAATCAGCAGCGTAACTACCCTAAAAGCA 

TATCACTAGCCAAAGAGGGAAATATCTGTTCTTCTTACTGTGCCTATATTAAGACTAGTA 

CCTGCCCG 
Sequence 164 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCTACAAGCTAGAGTGG 

TAAATTTCAGGAAAGCAGAAGTTAAAGGCAAAATTGTAAATCAGTCGAGATCGGGTGCCT 

TCAGGGTGGTATGGCTGTATACCAAAATTGTAAATCACTACATGAAGCTTATATATTGGT 

TTGGCCTGAAAGGTGAAGTGGGGTAGGCAGGGGGCGGGCTTACAGGTTATGGNGGATTCA 

AAGACTCCCTGATTTGTGATTGGNTMGGAAGCAAAGCTTTGTCTAAAAACTTGGGGTCC 

GCANAAAGGAACATTAAGGTCTGGCCA 

Sequence 165 
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TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAATATGGA 
AGAGCAGTTTGTAATATGAATACATTTTCTCTAGACGAGATCACAGTTTTATTTTGTAAA 
TA7TACATTTAAGTATATATATACACACATATATGTACCT 
Sequence 166 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACCTCAGGG 
"^ATTTACG AAGC CAAAGGACTTTG CTATATC AAGTAGTTC ATTTCTTATCTAAG ACC AA 

CTATAGGTATGATGCTACTGTATTCAGGCAATGCCGACTGGATTGGAACATGCTAATTTA 
AGGTGAGTTGGTACCT 

Sequence 167 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTTTTGCTCA 

AAATATATGAAAGGAMGATAATCAGTTCTTAAAGTTTGGTATGCTAAATAAGCACCCAG 

GGAACTTACCTGCTCCACCTCACCCCAGAAATTGATTATTCAGGTCTGGAATATGGCTCA 

AGGATTCTTAATTTTCMCCMCACMTATAGMTCCTGTCAGTTATCCTTGGATCATTT 

GCTAAGTGATCTTGCAAAATCAAAACTCCTTTGAGACTATTTCATTTTCAGAMTATT^ 

AAAACTTGTTCAGAAACATCTTAAAACTTGTTAGCTTATAGGAAAAAGTNCNTNGGCCGT 

TTNANACNNAGNNGGANTCCCCCGGCCTGNAGGATTTCANTTNAGNTTNATGANTNCGGC 

CACCTNGNGGGGGGGGCC 

Sequence 168 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACACCCATCAA 
GCCACCAGCCTGGCCAAACGGGCATCTTTCTGTCCAGAGCTCATCAATAGGAACTGGGCA 
ACCAGGTGGGGAGCCGCCAGGGGAGCTTGGAGAGGGCAGAGTAGGAGCCTCCATGAACCA 
AGTCAGGCTGGAAGGGGTCAGCCCTCGTGGTCCGTCAGCGTGCGGCGGAGTCCCTGTGCC 

TAAGTAGCAGAGCGGTAGTCATTGAAACAGGCCGGGATGCCAGCAAGAGGGAGGCGAGCA 

TGCAGTGAGCGATAGGTACCT 

Sequence 169 

CCGCGGTGGCGGCCGAGGTACCTTGAAGCAATATACTCAATGAGCTCTAAATCTCACATT 

CACTAGTGATCTGCAAGTGAAGCTGMTAAATATATTCTCTTTGTTCAGTCATGGAAACT 

CAACATTAGAAATGACATGTGGGAATAGATCACCTTCTCTCTAGACTAATATCCCATTTC 

TGACAGTGGTCTCCATGGCCCATTGGGGCTTATAAAGTAGCTGCCTTCTGTGACATTCTG 

CTTGAAGACAAAGGCTTTGCCCAAACATAACTCCTTTTAAAAAAGCACMCTTGTTT 

CACACAGAGGTTAAGTGACTTACACAAATCCACCCAGGATTGGTGATTGGCACCGGTCTA 

ATCTACTGAA AAATA TMGGCCAGGCMTTATATAGATATCTTTTAACAAGTTAGTAAAA 

TAGTGGGGACTTTTCTGGCTAAAAATTGGTAATTAAATGGGNATGGTTACCTCTGGTTAT 
AACTAAACAAACAAA 

Sequence 170 

TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGAACCATGCTCGTTATTA 

GATGCATTGTAGACAACATCGATGATCCTTGTTTTACGAGTACCTGCCCG 
Sequence 171 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTAAATAATAACAGAAAC 

CAAGAGTTTGTTTCTAGTAGTAGAAGTATAAGTGGAGAGAGTGCTACATCAGAGAGTGAA 

TTTACCTTAGGGGGTGACGACAGTGGTGTGTCAATGAACCCAGCTAGGAGTGCACTTGCA 

CTGTTGGCCATGGCCCAATCTGGGGATGCAGTCAGTGTCAAGATTGAAGAAGAAAACCAA 

GATTTAATGCATmAACCTTCAAAAGAAAAGAGCTAAAGGAAAAGGGCAAGTTAAAG^ 

GAAGACAACAGTAATCAGAAACAGCTGAAAAGACCTGCCCAAGGCAAACGCCAGAATCCA 

AGGGGAACAGATATTTACTTACCGTATACTCCTCCTTCCTCAGAAAGCTGCCATGATGGT 

TATCAGCATCAAGAAAAAATGAGACA 

Sequence 172 

CCGCGGTGGCGGCCGAGGTACCAACTCACCTTAAATTAGCATGTTCCAATCCAGTCGGCA 
TTGCCTG AATAC AGTAG CATC ATAC CTATAGTTGGTCTTAG ATAAG AAATG AACTACTTG 

ATATAGCAMGTCCTTTGGCTTCGTAAATAACCCTGAGGTTTTGTACCTGCCCG 
Sequence 173 

CCGCGGTGGCGGCCGAGGTACCAACTCACCTTAAATTAGCATGTTCCAATCCAGTCGGCA 

TTGCCTG AATAC AGTAG CATC AT ACCT AT AGTTGGTCTTAG ATAAG AAATG AACTACTTG 

ATATAGCAAAGTCCTTTGGCTTCGTAAATAACCCTGAGGTTTTGTACCTGCCCG 
Sequence 174 

CCGGGCAGGGTACCAGTGCCCCTTTTCAGACAGTTTTTGATTCGCTCTAGACTTTTTT^ 
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MM 1 A ATAGGGAGGGAAAAAATTTGATAATTTTCTTTTTTC TACAT GCACTTAAGACTA 
AAACACAGGTTTGGATTAATTTTATTTGCTTCCI 1 1 1 I CCG CTTTTCTTCCCGCAGAGCC 
TGATGGGAGAATGTCCAGGGCAGGGAAACCACATTTTTTGTAGGTGATAACTC AATGA AA 
ATTGGTGCTTA I MM I ACACTTCTCTCTTGNGGCTCTCTTNGTGGTGCTATCTGTTTTA 
AAGGGCCTCTTGAAGGCGCACTTGGGGTCCCTGGCCATGCCTNGTTCTCCCTGCTTTCTT 
TAATCCTGGTATTGCCTCCACAAGGTCTGTTGCCAAGGACTCTTAAAGATCAATGGCACG 
TCACTTTTCCTTTCC 
Sequence 175 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTGG 

ATGTGTTTTCTCACCAAGATGAGCAAAGAAAGGTTTGCACAGAGGAGTGTGAATGTGTGT 

TTGTTGCTGGCTGAATGGCAATAGATGTCTAAGGTGGATTCAGTGTCTGGCACACTGAGA 

CACCTCCAAGAAGGAGATTGATGCATCAGGTTCAGTTTAACCTGGAATAT CTGACTAC CC 

CTGAATCCACCCAGAAAGGGGGCCCAACACCCTTGTCCATTTATGGGGTAI M I M MCG 

AAGTTATTAAGCATATTCCTTTTCCACGAACCTCTTCTGTACCTCGGCCGCTCTAGAACT 

AGTGGA 

Sequence 176 

CTATAGGGCGAATTTAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACGCGTCGCAA 

TTCAGGATGGCTCTCTTGGATCTTAGCTTGCAACTCGGCCTCCAAGCTTGTCACAGACAT 

GGAACTTTTCTGAGGAAACAGGGCACAACACAGGGGTGTAGCAAACACCAAACAGAAGCC 

AACTAACCCAACTTGAATGGGTGCACTCATCCATGGGAACCTCTTCAAAAAGGCTTTCTT 

TTCCAAAGTGTTCATAATGAATGGAGGGATGGCCATGCCAGGGGCTGCCATGAGAATCCT 

GGACACGACAACTTGCGTGATGGCTTGTTTCGCAAGCGTTCGCCGACTCCCCCAAGCGGT 

TCCCATTCTCATTCCCGTGACGGGGAAT 

Sequence 177 

ACTTAGGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGNGGTTACAAGCTATTATATATTT 

TCATAATATAGAACTTTAAAGTATAGTAATCAACAGTGGCAAAATATTTCTGTCTCTGAC 

ATTATTATTTGTGAGATCCAACCTGTAATTGAGATCAGAGAAAACAGTATGGGAAAACAA 

ATCCATGAGMCAGGATAAAAATATCATGAACAGCAAAATATGTCTGAGTATTTAAAACT 

GTATAGAAAAAATAAATTAGATGAACAATTGTGTAAATATAAATTAAAAAAGATGACATT 

CTTAATAAATAAAGTTTACTATTrACATTTTTATATATTGCCAAAAGATTT 

Sequence 178 

CCGCGGTGGCGGCCGNGGTACACGAAGGTTAGAATTTTTGACTCCAGGCAGCAGTTTGCT 

CAGTGATCTTGAACAAGTTATCCAATTGCCTCTACATTTGCATCAGTTTCTCTAGCTGCA 

AAATGGGGATAATACTATATACCTACCTCACAGNGGGAGGGCAGGAGATTTTGAGGCCCT 

GAGGTTTTAGGTGGGCTGTGAGGGCCAACGCTTGACACAAAGTCCAT 

Sequence 179 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGGTACTTCCTGAGGAGTGAAGCTGTTT 

TCTGCTACCCTTTCCTGTCTGGTGGCTCANGAAGTTCAGTGTCTCGGTGTCTGATGCAGT 

GAATACCTGGGATTACAGTCTGGCATGCTCATTCCTTAGAACACAGGTTTATTCTCAGGA 

TGATTTATAGCAAGTGATTCTGGTTTCCAGTCTGGTTGAAAAAAGTGAATATTGAATGTC 

CGGGCCCAGTTTTTAAANNACTCTTTTTTTNTTGGGAAAAMGCGG GGNAA AAACCGGCC 

CTTGGGGCCCCCTTNNNNTANGGGGNNGGGGGNCCCCCCCCC CCTTTTN AAAAAANNANG 

GGNCCCCCCCCCGGGNNNGGGNGGANMNTTNAAA ANAAAAGNT TTTTTCCCCCCCCCCC 

CCGGGGGGGGGGGGGNCCCCCCCCCCCCNCN I I I M I M M M I AAAANANNNGGGGNGC 

CCCCCCCCCCCCCCCANA 

Sequence 180 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGTTTATAGACT 
CAAAACATGATGAAAATTCATTTCATTGTCTTCACTGAATAGACTTAACCTTGACAGTAA 
GCTTAAAGCATTAGTGTCATTTAATTACATGTCTGAGGAATTTATGCTMCTAGATTTTG 
GGCTTTTGCAGCGAATTTTAGTCGGGGCCTCATTCCTGAGCCCAAGTGACCCTTTCACCT 
CAGCTTCCCAAGTAGCTGGGATTACAGGTGCACAC CAACTGTGC TTTGCAGTTrTGTTTG 
TGTGTGTGTGTGTGTGTGTGTGTGTGTATTTTTTGGI M M M I I ACAAGCCCACAACAT 
CATAGCTTTTGGCAG \'\ '\ \ I AACCAGGAGCTGGTGGGAGGATCTTTGAA ATCA TTTAGGC 
CAATTCTTTTAGNTACATTTATGATGAAACTANGGGATCTACCTTNAAGGTTTTGGG^ 
Sequence 181 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCATGG 
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A^TATTAMGATCTTTA^^ 

^t^aatgtggctccatcgatttg^ 

TGTTTTAATGAATCAAMTATCTTCTTAGAATGTCTCTGAAAATGGCATCTCATTACCTT 

A . 

Sequence 182 

^^^^^ GGG ^^^^ GGAG ^"^CCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAA 
AG II^ CCACCCTGCA C^GGCCOT 

S^J^ MTCAGAGGACAGTGCCTAC CAGTAMTACTGAATAGmC^ 
GA I GCAACCAGTAGCA ^ 

TCCCCTCAGATCCTCAGTGAGCACATAAAGAAAGGGAGGTCATATCCCTTACATTCTCTA 

CCAGGGATTMCAC CTAAC TACTTCTCTMGCCAGAGGCAATTCCC7TTATTTCCTTACT 

CTCGGTCGNNC7TNTTTTTAACCCAAATCTTCTGACCAATAGGGTAAAAACAAAA 
Sequence 183 

I G ^ A £ : ^ AGGGCGAin ' GGAG CTCCCCGCGGTGGCGGCCGAGGTACACACACACCGGCA 

TGTTTATACACAGGGAGTGTATGGTTCCTGTAAGCACTMGTTAGCTGTrTTCATTTAAT 

G ACCTGTGGTTTAACCCTTTTGATC ACTAC C AC C ATTATC AGC AC C AG ACTGAG C AG C TA 

TATCCTTTTATTAATCATGGTCATTCATTCATTCATTCACAAMTATTTATGATGTATTT 

ACTCTGCGCCAGGTCCCATGCCAAGCACTGGGGACACAGTTATGGCAAAGTAGACAAAGC 

ATTTGTTCATTTGGAGCTTAGAGTCCAGGAGGAATACATTAGATAATGACACAATCAAAT 

ATTAAATTGC AAGG ATGTCACACGTG NTGATGAAGGGTAGAGTAGGAGAGAC C ATG ANTA 

TTGTGGTAACAGGAGGACCCAGCATTATTCTAGTGCTGTACCTGGCCCGGGCGGCCCGCT 

TAAAACTAGGNNGGGATCCCCNG 

Sequence 184 

CACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCT 

AC CTTTAAG ACTGGG ATAACTATTG GAAACAATAGCTAATACC GG ATATAGTTATTTATC 

GCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACAT 

TAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCT 
GTACCT 

Sequence 185 

CCGCGGTGGCGGCCGANGGNTCNNNTNCNTNNTTNNTTATTTNTGGGANGNTGNNTNTTT 

TTATTCCTAGATNTGGAGGAANCATATTTNTNTNTTNTTATAAAACTGATTGTAATAATT 

CTTTCCTATCAAAATCTCCATAGGTCAAAATATTGNNGGMTACTAAAATTTGCAACCTT 

GTTTACTrTAAMGGTTGCCACTTTCAGTGCAGAATACTACCGGCATCTTGTTACTGCAA 

TAGTTGGAAATAAMTGTGAAAATTAGCAGAAAAAAAAAAAAAAAAAAAAAAAAAACCCT 

Sequence 186 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCTGATAAAGTTCT 

TG^TCCTGTATGTTCTTCMGACCTTTATCAGAGTTTATGTGAATGCGGCTATTTTCT 

GTTTTAGGACAMCAGGACTAAAMGTTCAAGGGMTATGCTTGGTmGTCCACTTTCA 

TATGCTTCTGTAAAACCATCAAGCTGAATCGAATTCTCTTCAGGTATTCTTATCTCCCCG 

TAACTTGCCTTTGGTTCAGTCTCCATGTTTACATTCCCTTTATTTACAGGACTCCGCCTT 

TTATCTTTCTTTTG AGCTA AAAMGCAACCTGGCTGGGAGGTMTTATACTGAGAAATTC 

TGTATCAGTTGATATTTTAAGGTCTGAGACCTTCCTAACTGCTGCTTCAGACCTTGACTT 
ATCTACCTGCGT 

Sequence 187 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCT 

GTTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAA 

GAAATGCTAACCTGTGGAAAGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGAGAAGCA 

TCCAAGGAACTTCATGCTTGATTTGACCACGGACAGCCTCCACCTTGAGCACTATTCTAA 

GGAGCAAATACCTTAGCTCCCTTGAGCTGGTTTTCTCTGATGGCACTTTTGAGCTNCTAA 

GCTGCCAGCCTTCCCTTCTTTTCTGGGNGCTCAGGGCATGCTTATTANCAGCTGGGTTGG 

TATTGGAGTTGGCAGACAGGGATGTTCAACTTAATGAAAAAATACAGNTTAAGGGNCNTT 

GCCACAACACCCTGCCGTAAGTTACTTGGCTTGNGGGGNGGGGCATNTTAAAGTTAAAGG 

GNACNGGGNTTTAATNCCCTATCCCTTTTTTC 

Sequence 188 

CCGCGGTGGCGGCCGAGGTACATTTCTAAATAAATATCTTTGATCTATAACCCTTGAAAA 
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TATTACAGTTTATGGTGACATGATAGAGACAAGTCAAACAGTATTATAAGTGAAAATGAA 

ACATCAATTTAAAATGAACAAGTGTAAAAACACGCGGAACTGAAATTATAGAAAATCAAA 

ACACTAAATACACAGTAATTAAATATCTAGCACTGGAATCACCAAATGCATGCCGCAGTA 

ACACACTTTCAACTGATATAAAAAGCCCTTTAAGATTCACATAATCTAAATAAATAAGCA 

GAAACACAAAGGGGAGGTACCTGCCCG 

Sequence 189 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGAGGGAACTGTTGCA 

AAGCCATTTCATCGAGAAGGGGACAGAAGGAGAAATACACACATGTATACACAAACAGAA 

TGGTTGAGAAAACGTTTTAATAAAATGTGAGGGTTGTATGTGTGCGTGTATATATTTACA 

CTTAACCTCTAAAATTCTCTTCTACAGTATCTCTGTTATGAATATGA TGGAA AAGCAACA 

TTTTGGTGGTGAGACTATTGTTAAAATAAATTTGAGAMGACGAAMTTTTGTGAGTCTT 

GATAATT 

Sequence 190 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGT l l I I i GT TCAA TACATTTTA 

GATTACGGATTGACAAAGTAMGATACTGCTATGGMTGATACATTGTATTTTCTGCATT 

GTGTGAAATAGTTTTTATTG AAAG N C AAGTGACATTTC AAAAG AAG NTCTATAAC AATTA 

TGTTTCATGCTTAAAGTAAAAATTCCCAGAGTTTAGTTT AGAA AATGTAATCTT^ 

TTCAGACTGATATATTCCCAGTATTTTCATAATGCCATGTTTTGATAAAGTACCTGCCCG 

GGCGGCCGCTCTAGAACTAG 

Sequence 191 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCTTATAAAAGATTCT 

TTCTCCCAGAATTATATCTCCTCAGAGCAACAGCAAGGTTCTCAGGATCGAAGCCTACTC 

TAGCCTGAAGGGCTAGGAAGATTAGGATAAGGATAAGGATAATAATCCAAAAGTCTCGAC 

AATTCCAGTAGTCTCTGGATGGCTCCAACATCATAGAAATTTAACACTGTTCCACTTGTT 

TACAAAATCTAACACTGGCTTAGACATTCTGGACTTTCAGTGAGGGTTCCAGCATCTGAT 

GTCCCTCMCTCCTTTCCAGGGTGAGAGGCCCACTTACAGGAAACTTAACTTCTCACCAT 

GTGGACCCATGGAGGGGTTTTTCTCTTGCAGAAGGGCTAAAAAAGTGA GGAGG CTATGGA 

AAAAATGGTGGGTCACTTTTAAGGCAGACTGTTNGGGTTGGGTGGGAATTTTT 

Sequence 192 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGATTCACTAGGGCATCCTGCGAGCCTCACTAG 

CCTTCTGGTTCATGCCTTTGACAAGCATTTTTGTGCCCCCTCTGCTTACTGTGACAGTCG 

ATGATGAATCTTGCGTTGCCATTTTCTGCTGTGGGTAACTGCGTGCAGNGTCTTGCCTTG 

CTTTCTCTTNTTACTGTCCCACAGCTTGGTTTCATGTTACAAACAGAAAAGCTCGAGGCT 

CCCACCCCGCCACATCCCAACTTCATTTCCCCCTCA CTGTA GCCCATTT CCACC CCACCA 

CAAAGTTGCCACAGGTTTTCTTTGTATAGAATATGTTATTTTGAAGCTCTATTT^ 

TATTTATTTTAGAAAGTCTACTATTGNAAGAGTTCTTCTGGTTTGGGAAGAAAAA 

AGGTAAAAACTGGAA 

Sequence 193 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCATAAGCTTGGATTTTAA 

ACACTGATAGTATCTCATGAGTAATGTGTGTTTTGGGAGAGGGAGGGATGCTGATTGATA 

TTTCACATTGTATGAAATACCATGTTTGA AACTCA TAGCAATAATGCTATGCTGTTGTGA 

TCCCTCTCAAGTTCTGCATTTAAAATATATTTTTTCTTTATAGGAATTGATGTATACCAT 

GAAAGTCATTGTCAGTTGTAGTAGGCTCTGATGTTGAATGAGATATCATGGTTTAAGCAT 

TNCCATTTTACTGGCCTAGGGGAANAANACC ACTTT TTCTTGGCTTNCATTTGGGANGGA 

TCCCCCAGGGGGNGTNTTTGGGGTGGTCCCTTTTTNTNGGGGGAAGCCAACCATTTCCAC 

TAGGCCTC I I I I I I I I I NAANCCTTTNAAAAATGGGGAA 

Sequence 194 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGTGGACCGTGAAANGCTGTAG 
TTTGCAACCCTGCGAGTATGTCTGGATCACAGGAGAATGGTCAGAGTGCTCAGTGACCTG 
TGGAAAAGGCTACAAACAAAGGCTTGTCTCGTGCAGCGAGATTTACACCGGGAAGGAGAA 
TTATGAATACAGCTACCAAACCACCATCAACTGCCCAGGCACGCAGCCCCCCAGTGTTCA 
CCCCTGTTACCTGAGGGACTGCCCTGTCTCGGCCACCTGGAGAGTTGGCAACTGGGGGAG 
' CTGCTCAGTGTCTTGTGGTGTTGGAGTGATGCAAGAGATCTGTGCAATGTTTAACCAATG 
AGGACCAACCCAGCCACTTATGCCACACTGATCTGAAGCCAGAAG 
Sequence 195 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTGGGGTCATAATT 



Page 26 of 635 



WO 01/51628 



PCT/USO 1/00798 



Table 1 



ATCGAGTCTCTTGATATTATCACAATTACTGTGCCCCCTGCACTTCCTGCTGCAATGACT 

GCTGGTATTGTGTATGCTCAGAGAAGACTGAAAAAAATCGGTATTTTCTGTATCAGTCCT 

CAAAGAATAAATATTTGTGGACAGCTCMTCTTGTTTGCTTTGACAAGACTGGAACTCTA 

ACTGAAG ATG GTTTAGATCTTTGGGGG ATTCAACGAGTG G AAAATGCAC GATTTCTTTCA 

CCAGAAGAAAATGTGTGCAATGAAGATGTTGGTAAAATC 

Sequence 196 

TACTTAGGGCGA ATTGGA GCTCCCCGCGGTGGCGGCCGAGGTGTGCTGCACAGATTGATA 

CATTAGCCTTTGCTTTTTCTCTTTCCGGATMCCTTGTAACATATTGAAACCTTTT 

ATGCCAAGMTGCATTATTCCACAAAAAAACAGCAGACCAACATATAGAGTGTTTAAAAT 

AGCA7TTCTGGGCAMTTCAMCTCTTGTGGTTCTAGGACTCACATCTGTTTCAGTTTTT 

CCTCAGTTG TATATT GACCAGTGTTCTTTATTGCAAAAACATATACCCGATTTAAGCAGT 

GTCAGCGTAi I I i I I CTTCTCATCCTGGGAGCGTATTCAAGATCTTCCCAATACAAGGAA 

AATTAATAAAAAATTTATATATAGGCAGCAGCAAAAGAGCCATGTTCAAAATAGTCATTA 

TGGGCTCAAATAGAMGAAGCTTTTTAAGTTTTAAATCCAGTTAATCTGGT 

Sequence 197 

TCCTATAG GGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTGTTCTACTGT 

TTTAAAAAGTTTTCCGCAGAACAG TGCA TTTATGGCAATGCTATGTTTAATGAGTTAGGG 

ACATCAAATA TATAG TAGTTCCTTATTTTCAGTTGTGAAAATGAAATGGCTAAAGCAGAA 

GAGACGTCTATTTT AGTC TTTTAAAAATGTGTGTGGGTGGTCTTTTTTCCTCAGMG 

AAAGCACATGTATATTTTGTTATTTCTCCTTGCTATATTCCTGAGACTATACTAAAAACT 

TTAAGAAAAGGMCAAGAAAAAGGTAAATTCATGTGTTCCCCACTGCTGTGTCTAGAACC 

MGATCACATTATA TCAT TGTTAAAATTGTGTTATCTAGGAAAGGTGCAATATAGGGAAA 

ACACTCTAAGAATCTTTTAAAAACCTAAGGGTTTCCCTTATTTTGTCAGAATATGTNGGG 

AAGTGGCCATCCCATAAGGAATCTTTTTAACC 

Sequence 198 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTG 

GAAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCT 

TCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCT 

ACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 199 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGATGAACA 

CCAACATGTTCCAGAAGACCCAAGAGAAAAATCACAAGATGAAGTCTTGAGAGATGACCC 

TCCAAAAAAAGAACATCTACGGGATACAAAGTCTACATTTGCTGGCAGTCCAGAGCGTGA 

•GTCCATTCACATCCTGAGTGTTGATGAGAAGAACAAGTTGGGAGCCMGATTATCAAAGC 

AGAGATGATGGGGMTATGGMTTAGCTGMCMCTTAMGTTCMCTTGAAAAGGCAAA 

TAAATTCAAAGAAACTATAACACAGATACCAAAAAAATCTGGGGTAGAGAATGAAGACCA 

GC 

Sequence 200 

CCGCGGTGGCGGCCGAGGACCTGTGTTNAAAAGATGAAAGAAATGGNTGCTGAACCCAAA 

ACATGTTTAGGAGGAAAACCAGAAGAACAGAGCAGTGGCACTGGACCTCCTGTCTGTCCG 

TGAGGCTGCAGAACACAAGGCAGCTCCCAATGCCGCTGGACTCTCTGACCTTGGAGAGNG 

TAAACATCATATATGATTCTGTGTTCCTGAGATCTGGGTTTAACGATCTGGGCTTTGTCA 

TGGTGTT 

Sequence 201 

CACTACTATAGGGCGNAT'rGGAGCTCCCCGCGGTGGGCGGCCGAGGTACAACCTCTCGGC 
CCGG CTAAAC ATCATCGTCTTGGTAGGCCATTACCCTACCAACTAACTAATGTTCCGCAC 
CC CC ATTTTTAAGTG AAG CTGTG AAGCTCCTTTCTATTACTC ATC ATGCGAT AAAT AACT 
ATATCCGGTATTAGCTAT TGTTTCC AATAGTT ATC CC AGTCTTAAAGGTAGGTTAGGTAC 
CTGCCCG 
Sequence 202 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAATCCAAAAG 

TCATAAAAAGCAAAA GCTATC TTTTTTTCACTCTGGCACCCATCTGTTCTTCCCTGGAGT 

CAAACACTATTACCAATTTTTAGGTATACTTCCAAAGATACTTACTGCATTTACAAGCAC 

AGACTTATATTGATTCTAAAAGAATMGAGACATTTTCAGCATGTTGTTTTGTTCAACAC 

CACAGTATATCTTAAAGATGGTCCCCCATCAATACATATAGAGATCTCTCTTTTTTATGG 

CTTCATAGTATTATCATGTGCTGATATGCCAAAAACTAATCCATTAGACCCTTTTGGGTA 
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TATATTTACTATTTTGCAGTCTTCTGCTCTTAGGAATAAAGTGAAATTCAATCACCTGTA 

TAGAAAGTATTTCACACATGTTTGAATATATATGTAGAATAAATTCCCAA 

Sequence 203 

CCTTAGGGCGATTGGAGCTCCCCGCGGTGGCG GCCG CCCGGGCAGGTAC AAAG CTATAAA 

GGAACGTTTTTAGAGAAAGCACTGAAGACACACATTTTGCTGACCTAAAAGATTTTAA^ 

TGAATTAGAATAATTTACATCATATAAAGAGGTATTTAGTCT7TAAGTGGAGAAAGTTGC 

TAGTCACATGTAAGAAAMCAAGTATTATGGGCCTTCCTAAGACAAATGGAATAAATTCC 

ATC ACTTTTGG CTTTTTTATAAAC C AGACTTCTAACAAAT AAC C A TATTAATTTTCC AGT 

AATCAMCATATCCTAAAAGAGGAAAAAGAGAATGTCTACTTTTAAATCACATCCAATGT 

GAATTTGCGTGAATGCAAGGCATAAAATACCT 

Sequence 204 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGCAG 

GATGGCAGGCACAGAGCCGCCAGCTCTTTCCCCGCCCACTCTCGGTGAAGGAGTGCAGCT 

TGCCCGCCAAGTAGAACGTCGTGAAGCCAAGGCCCGAAAAGGCAAAGGAGGAATGGATGC 

TGGGGAAGCTTTTGCGGCCCTCGGACACCAGATCGGGGTCACCTGTGCAATGCATTTCCG 

AGTTCATCACTCCATCTTCCCACTATTAATTTAATAGTGTTTGTGCAGACTCCATTCAAA 

GCAAGAGCCAAGGACACCGCTAAGAAGGCTTCCTTAATTTCAGTCTTGTCTGTTCGCCGG 

ATAATTTTCACCACACAAATAACAGCCAGGGGTGTGAGGAAAGAAATTGCAAACATGAGG 

CNGGGTAGGGTATG 

Sequence 205 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATCTC 

CTGGCTTCCTTCTACATGGGTCACTTAGTTAAGAGGGAGGCCAAGGGAGTTCCGATTTCA 

GGCAGTGTGTGGCAGGGTTACTGTCCTAGCAACCTGGCTACTCCTCACTGTGAACGTTTC 

TCATAGGTGTCATATGGCAGGATGAAAAACATATTTGCCTCCCAGTGAAAGATGGCACAG 

GCTTTTGCCCAGCCAGGTTGGCAAGAGMCAGAACTCTTAACCCCTTGCTCGACAGGTTT 

GAGTTCAAGGGGTTGGATGCTCCAAGCAGAGGGCCAAACCCTGATTTATGAAGCATGCTA 

GGTCAACAGCCAGTCAGACCT 

Sequence 206 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACAGTGCCCCCTG 
CTTGTGGTGCAGGGGGGTGCCATTGCCGTGGTGCTTGGCATTGAGGGGCTGCAGGTGATG 
CATTGCACGGGACACCAGGTCACTCAATTCGGCGATGCTGCCTACGCGCGTGGCCAGCAA 
AGACTGCTCGATGGTCCTCAGCTCCGACTGGCTGAACATGGGCAGTTCCCCTGGACCT 

Sequence 207 

AGGGCTAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGCTAAAGAAGG 
CAGGAGAGAAAGTTTTTGAGGATGGTGTGTGTAATCGGTCACTATTTACTGTG TGAGT TC 
CACTGGAGACAAGCACACTTTCTCCAATCCCAGAGTCTACACCTGACTGGGAATTTTTCA 
TTCTGGATTCTGATTGAGGGAAAATACTGATATCCAATGGGCTATAAGGAGGACCTTTCA 
GTTTCTGTAACAGC ATCTGAGTTCTATGCATGTCCTG GAAAGGC AC CTGCC C ACTG G CTA 
ATTCACATGCTGTAATCCCAACACTGTAAATATCTGACTTCACATTATACCCACGTAAAT 
CCTGTCTCAGT^.GTTCTGGACTCAGCCACGGCTGCACTGATGTGCTGAACTGTGG 
Sequence 208 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGCTAAAGAAGGCAGGAGAGAAAGTTTTTG 

AGGATGGTGTGTGTAATCGGTCACTATTTACTGTG TGAGT TCCACTGGAGACAAGCACAC 

TTTCTCCAATCC.CAGAGTCTACACCTGACTGGGAATTTTTCATTCTGGATTCTGATTGAG 

GGAAAATACTGATATCCAATGGGCTATAAGGAGGACCTTTCAGTTTCTGTAACAGCATCT 

GAGTTCTATGCATGTCCTGGAAAGGCACCTGCCCACTGGCTAATTCACATGCTGTAATCC 

C AACACTGTA AATATCTG ACTTCAC ATTATAC C C 

Sequence 209 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGM I I I ACTAAATTATCT 
ATGTCAAA/AAATTCTGTGCCTGGCGTGGAATTTCACTCCATCAAGTGTTACMTGATTT 
TTTCATTTI CATTACAAGCAGGAGAATGAATGTAGGACAAGTGTTAGGAAACATGGCAAT 
AAATTAGAATATAATTTACAAAAGCAAAAAAATTAACAGTGTACCTGCCCG 
Sequence 210 

CTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTCTCCCAGTCTACA 

TGTCCTGGCTGTTAATCATCTTGGCCCCTTGAGGCACATCACAGTTTGAAGGACCTGTTT 

AAGTTGAAATAGACTTTGCTTATTTATTGGGATTCTAAAAAATTCTGAGTGAGTTTGCAG 
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T A X GA J^ G ^11 A ^ 

^^TlS^J G0MGTGGTGCMTCTC ^^ACJGCAACCTCTGCCrCCTGGGrTCA 

CCCGGCTAC 1 1 I I I GGATTTTGGTTTTTAAGAAAAACCG 
Sequence 21 1 

^^^^^^ 

^J AG f£ C ™ G ^ 

^^MZ^JI GNmN ^ AAAAAACAAAAAAA 
GAC ™ CN I CTACAGA ™ AGA TCW^ 

SeqS™M212 ^ ATNATGGA ^ ACATAGGAGAAGACNCAmACAATCA '^CATTGGG 

AGG J A £J GGA J G ^^ 

^^^CTAGTmGCGCCCATTTACTGGCTTGACAGGGATGCCTGAGGACTTCTGG 

^TJ^PJ GA £ AGTGGCTGAACCACCAG ^ GCATGACA GTGGCTGTGCCTG 

GGAGGCTTCTTGTAGCCAAAGGATCCCGAAGTGGAGGGGCGAGCAATGCCCGAGGGGGGC 

Sequence 213 

GCG ^ GG J GGC ? GCCGAGGTAC ^GAGACCTCATGCATATAAMCCAGCm 

I GG ^ ACTATATGAAGAAA ™ T CACTGGGGGCAAATCACCAAGTAGGGAGCACAATACAC 

AGGGTGTGGATGTTAATGTCATTCCCTAGCCTTCTCATTCCCTTCTCTTGGTCCTTTATG 

?~ A 7 GG ^^ AG ^ C ^ A ^ A ^ AAA TT^ GTA ATAATAACTGAGAACCTGACTCCCAGCA 

AGGGAGTAGTTCAAGAAGTTGAGGGGAGTTTAAATCTGAATGAGTAAATAAAGCAATTAT 

ATC ATTAG CTTAAAATTTTATC ATC AATTAAAAATAAATTTAAAAA C AAATAC TTAAAAA 

AAAAAAAAACC 

Sequence 214 

CCG £ GG I GGCGGCCGCCCGGG CAGGTACAAGTCTCATAAACTATCTCAGCACTGCATCCG 
T A I™ 5TGGGCACCTGTGCCAAAAGCAC ^ 

ATGGCCAATAGATGTCTTCTCAAGTGTTTTTGATCACCAGGTCAGTGCTATTGGGAGTGA 
TATAATATGGGTGGTTCCTAMGTTATTTTTGGTCTC I 1 1 I II 1 1 1 1 1 1 I II I A AGTGCT 

A T G ^T AGA "^ AA< ^ AA ^ACCTGCCTTGGATGCAGGAATTAGGTGGGGCTGCACACGTGCAT 

GG J A T^9™ CCCTATG C™GGTT^^ 

ATGACTTGATTTAC 

Sequence 215 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTAAAGTGAAAAGGGATGT 

GCANAAAAATAAAAAAAAACMCAAAhJAMGCTMCCTTCTATTAGAAAAGGGGACAGGG 

GAATGAGTAAACTTCTTTTATTGCGGACAAATGTGCACATAGCCGCTAGTAAAACTAGCC 

TCAAACAGGATGCTCATAGCTTAATAATAAAAGCTGTGCAAAG 

Sequence 216 

ACGAC I C H ATAGGGCGAA ^ GGAGCTCCCCGCGGTGGCGG CCGCCCGGGCAGGTACATA 

CAGATCTTTTACTGCC TTGGTG TTAMTTTATTCCTMGTAAGTATTTTTTTTATGCTAT 

CATAAATGGCATCTTAATTTTTCAGTTAGGTCATTATTT^ 

GCTAATGCTrr TTGTATGT TGATmGTATTCTGCAACTTTACTGAATTCACTT^AriTTr. 

I A ^ C ^ CAGArm ^^ GTGTG TGTATCTTTGGGGrrm 

G Iin^ TAGAAATG ^ ATAATAGTGGATGGAATAA ^ 

^?°^ ATCTC ^ CTATATATAT AQATAGATATGAATQAG1AT6TGTATAAATCATAAA 
GCAAA^GCTTTGAGAAACCAAACTCATCTTTGCTTTCTGTACATACTGTGGCGTTTCTGG 

Sequence 217 

C™? GG 9? M ^ 

ATAAATAACATAAATGAGAAGTT7TCAAGTATTTCAATG 

GTAATTCTCACAATTGGTTAGACAATCA I I I il I I t I CTTTTCTTTTCCTCTCATTTAGT 

CTCAGAGCCTAAAAAGAAATGCCTCCAGTCTCTGTTAGCCCCGATGTTTGGAATGAAATT 

AACAAGATTCTACCTTAAAAGAGAAMCTTATGTGGGC7TTTCAAATTGTGAAAATTTGT 

TCCCTCTTATAAAATATAATC-rrTCCCCTGCTGCTGGTTTATAAATATGGGCATATAGCC 

CAGAGCCATTATAAAGAAAAAACAACCAACCACAATAAACTAAGGAGCCTCATCTGATGG 
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CTAAGGGTTCTCAGGA AAAAA TGAGMGAGCCCTTTGATCAGGAAAGGAAAAAAGGATAG 

AGGATAAAACAGTAACCTTTTA 

Sequence 218 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTGAGACTC 

AAGACTCTCACTGATTGGAGAGCTTGTGGAAAACAAAACACACCATGCCAATAAATGAGA 

TGAAAACTTGAGTTTGCC ! I l I I AACTATTTATGTTCTAAGTTAAGCTTTGATAACATTC 

AAATGTCAAATTCTCTCATTCTTATAAAAAGTTGAATTAATTGCCTGTATTTATTTTAGC 

AATTATTCAATGTATTTCCAGTATAGGATGTATAGTATAATTAAI Mil I GTAAATAAAA 

TATTTTACCT 

Sequence 219 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTANTCTTGTTTCTATGAA 

GGAAAAGTTTGGCTACTAACAGTAGCATTGTGATGGCCAGTATATCCAGTCCATGGATAA 

AGAAAATGCATCTGCATCTCCTGCCCCTCTTCCTTCTAAGCAAAAGGAAATAAACATCCT 

GTGCCAAAGGTATTGGTCATTTAGAATGTCGGNAGCCATCCATCAGTGCTTTTAGCTATT 

ATGAGTGTAGGAAACTGAGCCATCCGTGGGTCAGGATGCAATTATTTA 

Sequence 220 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGATCTGT 

ATCACTCCGGAGCAACGACCTGTCTCTCTGAATGATCTCTGGCATGGTAAAAGAGTTAAC 

TGTATTTCAAAAGGTGAAACAAAAC I Mill I GCTTTGTTATGCATGTTTCAAATTGATC 

AATGGTTCAGGCCTGGAGTATGCATCAGAATTATCTGTGAAGCTTAGTGAAAGTGCTGAT 

GCCAGAGCCCACCCAAAGCTTACTGCTTCANAACCACTGGGGACCTGGGCAGTTCTGATG 

TGCACGACTAGTCAAGAATCCTTTAAAGGCTGATAAAGCAATCTTGAAATCACTGTCAAT 

GCAAAAGTGGGATGTTCCTTATTAATAAGAACTCAAAAAATAAAGTACCTCGNCCCGCTC 

TAGAAACTAGGTGGATCCCCCCGGGCTGCANGAATTCCGATATCAAGCTTATCGATACCC 

GGCCGACCTTTGAGGGGGGGCCCGG 

Sequence 221 

CCGCGGTGGCGGCCGAGGTACGCGTTTCTTGTAAACACGAGGCACCCCAAGATAAGAAGA 

CAGATAGAGCAAGGGATGGACATGGTCATCTCCTCAGTGATTGGAGAAAGTTACCGGCTT 

CAGTTTGATTTTCAAGAGGCAGTGAAGAATTTCTTCCCCCCAGGAAATGAAGTGGTTAAT 

GGAGAAAATTTAAGCTTTGCATATGMTTCAAAGCTGATGCATTATTTGATTTCTTCTAT 

TGGTTTGGGCTCAGTAATTCCGTTGTAAAAGTAAATGGAAAAGTTCTGAATTTGTCAAGT 

ACCTGCCCG 

Sequence 222 

AGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATACTACATTTGAACAGG GCAA AA 
TGAACTMCTGCCATGTAGGCTAAGAAAGAMTGCTAACCTGTGGAAAGTTGGTTTTGTA 
AAATTCCATGGATCTTGCTGGAGAAGCATCCAAGGAACTTCATGCTTGATTTGACCACTG 
ACAGCCTCCACCTTGAGCACTATTCTAAGGAGCAAATACCTTAGCTCCCTTGAGCTGGTT 
TTCTCTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCT 
CAGGGCATGCTTATTAGCAGCTGGGTTGGTATGGAGTTTGGCAGACAGGATGTTCAACTT 
AATG AAG AAATAC AAGCTAAGGCCTTGCC AG C AA( ^ AC CTG CCGTAAAGTTAC 
Sequence 223 

CCGCGGTGGCGGCCGAGGTACAGTCACCATCATACTTGTCTCCATTTGGAAAGATAAAGC 
TGATCTTATACACTTCTGAAGTTGGTCGAGAGGTGTTAGAGAGATCTTCCAAACTCTGCG 
ACTCCCCCTGGGCGGGCCAGCTCTGCGGCCACCTTGCCCG 
Sequence 224 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTACTC 

CATCTACCGAGGCAGCGCATGGCATGACTGAACGGCTTGGAACAAACACAGAAATTAGCA 

CCACAAACATTCAGGAACCAAATATAATCTGCTATGGTCACACCACAGACAATGCAGGAA 

GAGGC M I I I ATTGCTTGTGTGTGTTTTCAAAT CAAGTAGGCACAACCCAGTCCTCATAA 

CAGCCCTCGGGAGTGGACGCACCGGGGCGGGAGAGCGCAGGAGGGAGTGTGCTTGGCTAT 

CCTGCCCAGTGGAAAGGAAAGTGACGTGCArTGATCTTAGAGTCCTTGGCAACAGACTGT 

GGAGGCAATAACAGGATAAAGAAAGCAGGGAGAA 

Sequence 225 

CCCGCGGTGGCGGCCGCCCGGGCAGGTAAAAAAAAAAAAGGAGACACAAGACTTACTGCA 

AAMTATTTTTCCAAGGATTTAGGAAAGAAAAATTGCCTTGTATTCTCAAGTCAGGTM^ 

TCAAAGCAAAAAAGTGATCCAAATGTAGAGTATGAGTTTGCACTCCAAAAATTTGACATT 
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ACTGTAAATTATCTCATGGAATTTTTGCTAAAA7TCAGAGATACGGGAAGTTCACAATCT 

ACCTTATTG 

Sequence 226 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCCATT 

CATTCCAGTGAGAAGCACTAGAACTTTAAAATTTCAGAATTGGATGAAAATAACAAAGAT 

CACCCAGTTGAATCATATTTTTTTGTTCAGAAGCAAAAACTGAGGCTCAGAGACAAACTG 

TCCTGTCCAAGATTATACAGAAAATGGTGGGMGAGTTGAGAAGTTTCTCAATAATGCCA 

TGACTGTTGAATCATAGGAACAAAAGCCAAGAGATGGATCCTTACAATAGTTTAACAATA 

TAAACCACATGGMCATTTCAAGAAAGAGTTAMCAATGATCTATGTTTAAAAGCTATCT 

CTACCTGGCATAGTATCTCACATTCTATAATCCTAGCACTTTTGG 

Sequence 227 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGGCCCGCAGA 

GCTGGCCCGCCCAGGGGGAGTCGCAGAGTTTGGAAGATCTCTCTAACACCTCTCGGCCAA 

CTTCAGAAGTGTATAAGATCAGCTTTATCTTTCCAAATGGAGACMGTATGTTTTTCTCC 

TCTGTTTAGATGGTGACTGTACCT 

Sequence 228 

AGCTCC CCGC GGTGGCGGCCGCCCGG.GCAGGTACAAAATAACATCATAAATAACATAAAT 

GAGMGTTTTCAA GTATTTC MTGTGCCTTGAAAAGTTTCAAACAGTAATTCTCACAATT 

GGTTAGACAATCA I 1111111 I CTTTTCTTTTCCTCTCATTTAGTCTCAGAGCCTAAAAA 

GAAATGCCTCCA GTCTN TGTTAG CCCC GATNTTNGGATGGAATTTACCAGGTTTNTCCTT 

TAAMNAAAAAANTTrTGGGGGGTTTTNA AAMN GGGAAAAMTTCCCCCCN 

AAAAMNAAT TCCCCCCCCNGGGGGNTGTTTTTAAANAAGGGGGGGNNNNCCCCCCCCCC 

ITTTTTTn I ANAAAAAAAANCCCCCCCCCCCCCNAANNNNNNNGNNNNCCCCCCNTTGG 

GGGGGGGGGGGGGGNNMNAAAAAAAAAAAAAAAAANCCCCCCNTTTCTCGNGGGGGGGG 

GNGGGNGGGGGGGGGGGGNNNAAANAAAANN 1 rTTTTTTTTTTTTTTTTTTTl NCNCCCC 

CCNCNANNNNNNNGGGGGGGGGGGGGGGGGGG 

Sequence 229 

CTNCTTAGGGCGAATTGGAGCTC CCCG CGGTGGCGGCCGCCCGGGCAGGTACAAAATAAC 

ATCATAAATAACATAAATGAGAAGTTTTCAAGTATTTCAATGTGCCTTGAAAAGTTTCAA 

ACAGTAATTCTCACAATTGGTTAGACAATCA I I 1 I I I i I I CTTTTCTTTTCCTCTCATTT 

AGTCTCAGAGCCTAAAAAGAAATGCCTCCAGTCTCTGTTAGCCCCGATGTTTGGAATGAA 

ATTMCAAGATTCTACCTTAAMGAGAAMCTTATGTGGGCTTTTCAMTTGTGAAM^ 

TGTTCCCTCTTATAAAATATAATCTTTCCCCTGCTGCTGGTTTATAAATATGGGCATATA 

GCCCANAGCCATTATAAAGAAAAAACAACCCACCACAATAAACTAAGGAGCCTCATCTGA 

TGGCTAAGGGTTCTCAGGAAAAAATGAGAAGAGCCCCTTGATCAGGAAANGGAAAAAANG 

GTTGGA 

Sequence 230 

TGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCTG 

TTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAAG 

AAATGCTAACCTGTGGAMGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGAGAAGCAT 

CCAAGGMCTTCATGCTTGATTTGACCACTGACAGCCTCCACCTTGAGCACTATTCTAAG 

GAGCAAATACCTTAGCTCCCTTTGAGCTGGTTTTCTCTGATGGCACI 1 I I IGAGCTCCTA 

AGCTGNCAGCCTTCCCTTT 

Sequence 231 

CCGCGGTGGCGGCCGCCCGGGCATGGTACTAGAAATTGCACTGTCCTAGAAATTCCTCCA 

TCCCATGAAGCCATTTCCATCATGAAACCTTTGACACACTTGAACACAAGTTCCGCTCTC 

CTTAAGCACCAGGGTATAGTCATTTNCTCTGACATATAATGTACTTAAGCNCTGCAGAAT 

NGCTCNAGTAATATNCCNATANGTNCNTTTCCAATTTCAATTA 

Sequence 232 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGAGCT 

GACCAAACGGGAGGTCGAATACATTGCCCTGTCAAGGGACACCACTGAAACTGATCTCAA 

ACAGCGACGAGAGATCCGTGCAGGCACAGCCTTTTACATTGATCAGTGTGCAGTTCGTGC 

AGCCA CAGA AGGCAGAACTCTCATTTTGGAAGGTTTGGAAMGGCAGAGAGGAATGTTTT 

GCCTGTTTTGAACAACTTGCTGGAAAACAGAGAGATGCAGCTTGAAGATGGACGCTTCCT 

GATGTCTGCTGAGCGTTACGACAAACTTCTCCCGAGATCATACCAAAAAAAGAGTTGGAT 

TCTTGGAAAATTGTCCCGAGGTTAGTGAAAATTTCCGAGTGATTGCCTTGGGGCTTGCCA 
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GTGCCCAAGGGT 
Sequence 233 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGATAGAGATAATA 

G AAGACA AAGAGATTAAAAGGAAATAAAAATGCATGATTAAAAACTAAGAATAAAAAACC 

TATTTTTATGTTTCCTAAAGGAAATTGTTTATTCTACAGACTCAGTAGGTAGACACAAAC 

ATAAAGATTTCCCTAGAAGACATAGAGTGGGATTTGATAACACTGTCTGTTATTTTNTGT 

ACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGTTCCCTT 

TANGGNGGGGTTAATTTGCCCCCNTGGGGGGAAACATTGGGANTMGCTTTTTTCCCCCG 

GGGGGAAAAATNGTTTTTCCCCGTTCAAAAATTTTC^ 

GGGNAANAAAAAAAGGNAAAANANCCCCNGGGGGGCCCCCAAAANGGGGGGGGG 
Sequence 234 

CTACTATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTGTTTTGTTTT 
AAATTAAGAGATTCCCAAATGCC I 1 1 1 1 CCCCCTCATCTTGAAATGAGATGAGTTTTTAT 
GTGTMGCAATATTTATTTAACTATTCTATAAMTTATTGAGTGCCTACTGAGGCCTTTA 
AGC ACCG CTAACATTCCTTTCCATCATTCTTTTAAATG AC ATAAAATAATTGTG C AATG T 
TCCTGATGATGTACCTGCCCG 
Sequence 235 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTAATGTTTGCAAC 
ATTTATTGAGATCTAAGTCTGTCTTGCCTTCATTTCTC 1 I I I 1 ATCTCCCCCTTGCCCTC 
ATTCTTG AAC AGCTGGAG GAATACATTTTATTCTGTCC ATG AAG C ATAC ACTATGAAATT 
C AAGTGCTTAAAAATACTTCTATGACTCTNTG CTATCC C ACTGTATAG ATCC ACAGG GAG 
CAAACACTTAGAAATGATAGAGAACTGAAGGAGATCAATGGTTTAACAGTTATCCATGCC 
MGTCCCATTGNCAGAMTATTCTTATTACTCAGGCAAACACTCTTTGAGCTTCCCTTNC 
TAAAGGTAACCATTCCAGTGAATAGATGTGCCCT 
Sequence 236 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG I I I I I AATC ATTCATATTG 
ACTTAATCAGGTTGACAAAATCTCAGGGTTGGTCTGGGATCATTCACAAATCAGAATTAC 
TCAGAGAAATTTTCACAGCAGTTGCAGCAGCACAAACTCTAACGCTGATTTAAATTAATA 
ACTGTTATTAATGAAC AAACTTGAAAAAGGMTTTCATATAGAACATTAC ATTTCTC C CT 
GACACTACCGTATCATAAAAATTAAAMTGATATAAACATGTTTTAAAGTCATAGTGGTA 
GAACCAGCCACACTACAGTGCGGCATTGTTAGGAAGTTCGCTCTCAAGTTGACTCAGCTT 
AGTAAAGCAAATTGTTCTCTGAGATTTCAGCTTTGGTTAGTCCTCTTTGGCTGA 
Sequence 237 

CCGCCCGGGCAGGTACNTTGTTGTCACAATTTAAAACTTCCTCTTCTACATACATGTTCC 

AGAGAGCATCTTCCAAACCGGATACATTTTTGACTGCTACTGTTAGATCTAAGAAGTCTT 

CCTGCCTCTCGCTMCGTTCTTACATTCTTTACAAACAATCTGGTTAACAATGGTT CCAT 

GGTACCTTGGGCCGCTTCTAGAACTAGTGGNATCCCCCGGCCNGNNGNATT TTNTTTTCA 

ANNTTTTTNGNNTCCCCGGCCCCCTTTGGGGGGGGGCCCCCCCCNCCCCCN I I ITTTTTC 

CCNTTTTTGGGGGGGGNTTANNNNNNNCCCCNGGNGGGNTNNTNTNGGGGNTCANNNTAT 

TTTTTCTNNNGGGGGGAAAAAAAATATTTT 

Sequence 238 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTTGAGGACATTTT 

TGTCAGATTAACTATMCAGTGTAGTGTAGTTTTTAAAATTGCAGTTGAAAAGTTTAGCT 

GTCTTGGAAGTCAAATTTATCCAATTGTTCAGACTTCTGTTACTACTTAATATGAAGCCA 

CCATGCTGGCTTGGACAGAATTAATTTCATTCATGTTATGGAGAATTCTATATTACAAAT 

CTG GTCCCCTATAATATGAAC AGTGAGC AGTC AG AAATATACAAAG GGTTAAATAGG GTA 

AAGACTTTGGCCAAGAAAGGAAAGGCCTTAGTTCTACCATAGAGTATCTTCTCTAATTAA 

AATGACTGGGGAAATATATGGAAGCAGAAACCAGCACAAAGCACTACCCATCTAGAAATA 

ATCTTTTCAGGTTAAAAAACAAC 

Sequence 239 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTAGCTG 

GAAAAACCATATAGAAAACATGATGACCATGTTGAGTTGTTGGTGCTGAGGCTGTGAACT 

CTTTGGCTTCCTTCTGATGGCCCTCTTGAGAACAATCTGTACCT 

Sequence 240 

NGCNAATTGGAGCTCCCCGCGGNGGCGGCCGAGGNGTTGGGTTTGATTTGCTTTTGTTTT 
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TAATGCAGCTCTTTTMTATAAAGATTCTTGATACAGTGAAATCTCTTATTTCAAGTGTA 
AGTTATTCTTCACCCACCCCTTCCCCTGCCATTGTATTTCCCATCTGTTNNANGGNGTTN 
NANCANTTNNCATTGNNTCGNATNCAGNAGGTNCCTGCCCG 
Sequence 241 

AGCTCCCCGCGGTGGCGGCCGAGGTACTTATTGAGTGCTTGTATGCACCAGGCTCTGAAC 

TAAACGCGCTCCATATTACCCCCATTTAAATTCTTACCATTATTCCTGTCTTACAGAAAG 

AAAAAAGGCATGGTGAGTCACACAGTTCACAAATGAGGGCTAGAACCAAACCCAGGCACT 

GTGACCTGCTACACAGGAGAGGCAGATATGMGATTTCTTCAAGGGTGCTTATTACAGCA 

TCATTTATAATAGCAAATAAAGTACCTGCCCGGGCGGCCGCTCTAGAACTA 

Sequence 242 

CCGCGGTGGCGGCCGAGGTACTCCTCCAGATCCTGGTGCTGCTTTTGTGGTGGTAGAATG 

TCCAGATGAAAGCTTCATTCAACCCATCTGTGAGAATGCCACCTTTCAGAGGTACCTGCC 
CG 

Sequence 243 

CCGCGGTGGCGGCCGAGGTACTTTTTAAAAAATCTATAAATTTAATGCACTGTCCAAGTG 
AAATGTCCTAGTTGTCATTGTGATTAAGGGGCCAACTTTCCAGGCAGCTAGCAGAGATAC 
TATTCTCTTCCTCTCCCAGCAAATTTGTATTCCTTCGCCCACGCATTCCTGCTATACTAG 

ATGGCAGCCAGTGATGGAACTATAAAGATGTCTGTGGTCATATGTTGAATGTGGCAGCTT 

GAAGATGTACCTGCCCG 

Sequence 244 

CCGCG GTGG CGGCCGAGGTACACGAAAMGGATTGGAATTGTGTTTAGTTTTTGGTGTTT 
AGATTATTTTCTTTGTGTCTTTCTTC I I 1 1 I C 1 1 I I I I H I i I I I CTTTTTTACAATAGG 
CCAGTAGAATTCTTATGAGAACAGGCTTTTTMGCACTTCAAT^ 

GTAGCTTCCAGMGGTGCCAMCAGGTTTCAACATCATATTAATTAGAAATACCCATAAT 

AATTATAACAACAATAAAAAGGTATAATATGGTACCTGCCCG 

Sequence 245 

GCTNCTATAGGGCGAATT GGAGC TCCCCGCGGTGGCGGCCGAGGTACTTTTTGCTTTTCT 

TT TGAAA GTATG AAMTG ATTTTTACAATGTAAAGAGAGTCTGAGAAATGCAATTGCTAT 

TCT TTTTA TATACTTT TTCAGA TTTTTTATATGCT^ 

CAATTTTATATACTGAT TTTTT CAATCAATGTATATCATGAGTTTTTAAAATAA 
AATCTTC ATAACTAATATTTTAATG GGG AGTGACTGCTTAATTG GC ATG AGGTTTC CTTT 
TGAGATGAGGAAAATGTTCTGAAAATATGGTGATGGTTGCACACCATTGTGGAATGGTAC 
CTGCCCGGGCGGCCCGAGGTACTTGCCTGGGAGAAG 
Sequence 246 

GCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGGTCCCCAGCTTACTCCAGGGATGA 

TGTCAGAAAATGAGGTCCTAAACATGCAGCT7TCGGATGGAGGACAAGGAGATGTCCCTG 

TTGATGAAAACAAACTCCATGGTAAACCTGATAAMCCTTGCGCTTTTCCCTCTGCAGTG 

ATAATCTGGAAGGAATATCTGAAGGTCCTTCAAATCGCTCCAATTCAGTGTCCTCCCTAG 

ACCTAGAAGGAGAGTCTGTGTCAGAACTTGGAGCAGGACCTTCTGGCAGTAATGGAGTTG 

AAGCTCTACAGCTGTTAGAACATGAGCAAGCTACAACACAGGATAACCTTGATGATAAGC 

TAAGGGAAGTTTGAAATl'CGTGACATGATGGGGATTAACAGATGATAGGGGACATATCAA 

AAACAGTGAGTGAGACTTGGAGTCCTTCGGCCCCTTTAGAACTAGGTGGATCCCCCCGGC 

TGCAAGGAATTCCATTCMGCTTATCGATCCGTCCACCTCGAGGGGGGGGCCC 

Sequence 247 

ATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTNC ) iTTTTTTTTTTTTTTTTTn 
TTTTCTTATCAAAAATA'iTTTATTTNCAAAAAATTAATTATNCTATAC 
AAATNCATTGAATAA1~"GCTAAAATAAATNCAGGCAATTAATTCAACTTTTTATAAGAAT 
G AG AGAATTN GAC AT fTG AATGTTATC AAAGCTTAACTTAN ANC ATAAATAGTTAAAAN G 
GCAAACTCAAGTTTTCATCTCATTTATTGGCATGGNGNGTTTTGTTTTCCACAAGCTNTC 

CAATCAGNGAGAGTCTTGAGTCTCANATGTACCTCGGCCGNTTTAGAACTAGGGGATCCC 
CCGG 

Sequence 248 

TTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGCACATCTCA 
GAGAGATGGAGGCAAGCTGGGTTCCTGATGATTTCCAGCTAATACAGGTCCTTCCTCCTG 
CATGCAGGCGCCCCGTAACGCTTTATAACAGACGCCTCTAGACTTCTGTGGGGGTAAAGT 
GAAGGACCCAAAGCGACACAAGTAGTGTCTGTTCACACTTCCACTTTCAAAGCTAACTAC 
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TAGCTGTTCAAATATACTCCATACAGCTTTCAGCAAATCAAAGTGTTTACCTCTCCCACA 

CCAAGGGAAGAAAAGATGCAGACTGCCTTTAAAGCACCTGTCAGCAAGGCGAGGGGTTTT 

AAAAGGATCAAGCCTTGAGAATCAAAGCAGCAGCAGAAGTGTCATTCTTNCAGTGCTCTC 

CTTTCCGGTTC 

Sequence 249 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCCATTGGTCTTACAGA 

CAAGCATCAGGAACCAGAGGGCCTGGTGGGGCTGGGAGGAAGCTCGGCAGTGACAGCTGA 

GGTGCTCATGTCCTTCCATCCCACTGCCCAGTGGATAATGAGCTCATTAGTCAGACGAGG 

ACCAGCCCAGAATAGCCAGGAGTAAGCATGTCACATTACAGAGCTGTAGCCAGCTTCTGG 

GTGGAAATAGCACTATCTGGTACCTGCCCG 

Sequence 250 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCTTTTTGCACACTGTCA 

GCAAAACACGTCCCTTTGAGTATCTCCGGCTCACCAGCCTTGGAGTTATTGGGGCCCTGG 

TGAAAAC AGATG AAC AAG AAG TAATC AACTTTTTATTAAC AAC AG AAATTATCCC TTTAT 

GTTTGCGAATTATGGAATCTGGAAGTGAACTTTCTAAAACAGTTGCCACATTCATCCTCC 

AGAAGATCTTGTTAGATGACACTGGTTTGGCTTATATATGTCAAGACGTATGGAGCGTTT 

CTCCCATGTTGCCATGATCTTGGGTAAGATGGTCCTGCAGCTATCCAAAGAGCCT 

Sequence 251 

TACCCGGGC GAAG TNANTNNGGGGAAAANTNCCCCCCCGGGCCGNCCGGGCANTTTTACN 

GGCACANGGTTTTAAGCCNCNGAGGGGTTTTTGAAANCCCCCCANGANACCNNANGGNGC 

AGGGGGTTTTN NTTTAAAANAAG N NGGAACCGCCGGANG ANG AGGG ANGGN AAAAACACC 

CNGGGGAAACCNAANAANCCCNAAAAGGGGNCAAGGNGGNAAAACCCCCAAGCNAACCCN 

GGGGGGNCCANGGAAANNCNGGGACNAAGACCCAAACCAAAANGGGNNGGANGGCCGCCN 

GGGGGACAANNNCGGGCCNACNACCAAGGGGAACAANCAANNGGCCGGNAANNGGCCCNC 

AAGGGCCNGGNAACAAAAANNNNNCNAACCCCGGGGAANNAAAGNAAAGGGGGCAAAAAA 

GGCAAANAAAACCNCGCCCAAAANGGGGAAAGAAAAAANNCAAAGGGNGGCCCCGGGGGG 

GGGGNNNCCNNGGAAAAAGNAGGGGGNAAAAAAAAAAAAAAANAAGGANCNNNAAACCCN 

NNCNAAGGNAAGGGGGAANNNNNAAAAGGGGNANNNNNNNGGGAAAAAAAAAAGGGGGGA 

NAACCNGGGAAAAAAGC 

Sequence 252 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAATAACATCATAAATAACATAAATGAGAAG 
TTTTCAAGTATTTCAATGTGCCTTGAAAAGmCAAACAGTAATTCTCACMTTGGTTAG 

acaatca i rrrrm icttttcttttcctctcatttagtctcagagcctaaaaagaaatg 

CCTCCAGTCTCTGTTAGCCCCGATGTTTGGAATGAAATTAACAAGATTCTACCTTAAAAG 
AGAAMCTTATGTGGGCTTTTCAAATTGTGAAA^ 

TTTCCCCTGCTGCTG GTTTATAAATATG GGC ATATAG CCC AG AG CC ATTATAAAG AAAAA 
ACAACCAACCACAATAAACTAAGGGAGCCTCATCTGATGGCTTAAGGTTCTCAGGAAAAA 
ATGAGAAGAGCCCCTTGATCAGGGAAAGGG 
Sequence 253 

CCGCGGTGGCGGCCGAGGTTGGAAGAGAAGTTACCCCGATGACTTGGTTTGGAAGGGGTT 

AAGGCACCAGTCATCCTCTTCTAAAGTGATTTATGATGATGTGTGGAGTTTAAAAACTTT 

ACCCCACCCCAAAGAACAGCCCTCTCACTCCTCACTGAGTCCACTCTGAACGTGCTAAAA 

TGGGAAGGAGGCGGTGTTTTGCTGATCTGTTAAATTCTTAGTGMGTTTCCTTGATTTCC 

AGTGGCTGCTGTTGTTTGAGTTTGGTTTGGAGCAAAACTGAGGTAAGTCCTAACATTTCT 

GGGACTGAATCCAGGCAAGAGAAAGAAGAAAAAGANGAAGAAAAAGAGGAGGAAAAAGGT 

AGGGAGAAATAAAGGGAGGAGAGAAGCACAGTGAAAGAAAAAAAAAGTCC 

Sequence 254 

GGCCGGTACCTTTAACTCAATTTAATATAACAAGAAATCGTAAAATACTTATAACCTATC 

TTAGAGAAATGAGTGCTGGTTTTGAGAGTTG I Mil I AACTGAAAG ATTATTTCTAGATG 

GGTAGGGGGCTTTGTGCTGGTTTCTGCTTCCATATATTTCCCAGTCATTTTAATTAGAGA 

AGATACTCTATGGTAGAACTAAGGCCTTTCCTTTCTTGGCCAMGTCTTTACCCTATTTA 

ACCCTTTGTATATTTCTGACTGCTCACTGTTCATATTATAGGGGACCAGATTTGTAATAT 

AAGAATTCTCCATAACATGAATGAAATTAATTCTGTCCAAGCCAGCATGGTGGCTTCTAT 

T AAGGT AGTAACAG GAAGG TCTGMCAATTGGATAAATTTGGACTTTCAAGACAGCTNAA 

CTTTTTCACTGGCAATTTT 

Sequence 255 
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ACTNCTATAGGGC GAATT GGAGCTCCCCGCGGTGGGGGCCGAGGTACATTGTGGTAGGTC 

CAGGAAATATGACATTTTCCCCCTTGATGTGTTATTGTTGTTGTTGGGTGGGGTGGGCAT 

TTTGTTTATTTGTTTGGTGGCAATCAGTGGTAGTAGGGAGTGGGAGGGCTTATATTGGTT 

TTTC C AGCTATTAAGGGG AC ATATTGTGTC GTTGTG CTTTTC ACGTTATAAAATGTTTAT 

ATTTACCAGTACCTGCCCG 

Sequence 256 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCATAAGCTTGGAT 

TTTAAACACTGATAGTATCTCATGAGTAATGTGTGTTTTGGGAGAGGGAGGGATGCTGAT 

TGATATTTCACATTGTATGAAATACCATGTTTGAAACTCATAGCAATAATGCTATGCTGT 

TGTGATCCCTCTCAAGTTCTGCATTTAAAATATA I I I 1 I I CTTTATAGGAATTGATGTAT 

ACCATGAAGTCATTGTCAGTTGTAGTAGCTCTGATGTTGAATGAGATATCATGTTTTAGC 

ATTCCAT1TTACTGACTAGGGTAGAAGMCACTTTTCTTGGCTACATTTGGAGGATACCC 

AGGGAGTCTTGGGGTGTTCCTTATCTGGGGAAGCAAAACATTTCACTAGTCTC i I I I i I I 

TCATC 

Sequence 257 

CCGCGGTGGCGGCCGCCCGGGCAGGTACA I 11 I I 1 AAAGTTCCTCAAAAAATTCTGCTTC 

TAGGCAAATGTAATAGATATTGTGCGTTGCTCATG I I I I I CCTATCAGAACCATGATTTT 

AAAAAAACTATCCTTTCTGTCCCCATATATCCCTCATGTGGCCTCCACTTCTACTCTCAG 

CTCCAGTGGACCTGACTGCCCTAAAGGTTATTCTAGACCCTTTGCCAGTGACTGGCTCAG 

GGATGGACAGGTCTMGTTACTTTAGAGCAATGAGACCTAAGGATATGGAAAGTGAATTC 

TGGGAAAGCTTTCTCATACTAGGACAGGGCCCCTAGAATCCAGCCTGACTCTTCCGCTGC 

ATGTAAATGAAGATGCACAAGGCTNCAACTGCTACTGGCAATCATCCCATGACCACGAGG 

GTAGCCAGCCTTAGGATGATGCCCCATGTTTGTGGGTTGAAAANGGGGGGGG 

Sequence 258 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTAGCTGGAAAAACCATATAGAAAACATGA 
TGACCATGTTGAGTTGTTGGTGCTGAGGCTGTGAACTCTTTGGCTTCCTTCTGATGGCCC 
TCTTGAGAACAATCTGTACCT 
Sequence 259 

NNGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACAAAAGCCTGTGAACTGGCTTA 
AACC AGTTTAC ATGCTG G ATTC CG AC C C AGATAATAATGGATTC ATAAATGAG GATTTTA 
TTGTTTGGATGCGTACC 
Sequence 260 

TAGGGCGAATTGGNAGCTCCCCGCGGTGGCGGCCGAGGTGNCTGAAAGTCGATGAAGGAC 

GCGATTACCTGCGATAAGCTTCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGA 

ATGGGGTAACCTMCTGAGCAAACCTNAGNTGCATTTTGATGAATCCATAGTCAAATTAN 

CGAGACACCGTTGCGMTTGAAACATCTTAGTAGCATCAGGAAAAAAAAAAAAACCNNAA 

AANNANAAAGGCTTGACCT 

Sequence 261 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTATAGGCCATATTCCTT 
TACGTGGGACATTMTGAAMGCTGTCTTTTCCAAACTGAATGCGCCTATTAATAAAAAA 
ATACACCTGACCATCTAATGCTGAAGTCATCCGCCAAGGTGGTACCTGCCCG 
Sequence 262 

CCGCGGTGGCGGCCGAGGTACAGATTATGGAAAATATGAAGGACTAACAAAGAATTACAT 

GGATTATTTATCCCGACTATATGAAAGAGAAATCAAAGATTrCTTTGAAGTTGCAAAGAT 

CAAGATGACTGGCACAACTAAAGAAAGCAAGAAGTTTGGTCTTCATGGAAGTTCGGGGAA 

ATTAACTGG ATCTACTTCTAGTCTAAATAAGCTCAGTGTTC AG AGTTC AG GG AATC GC AG 

ATCTCAGTCATCTTCCCTGTTGGATATGGGAAACATGTCVGCCTCTGATCTCGATGTTGC 

TGACAGGACCAAATTTGATAAGATCTTTGAACAGGTAC C TGCCCGG 

Sequence 263 

TACTATAGGGCNATTGGAACCTCCCCGCGGGGGCGGCCGCCCGGNCAGGG I I II I J M I I 

rTTTTTm I ATACAAAAAGCAACAAGAGTTTA^TTCTC I I I I I ACATGGCC AC AGGCTC 

TCTTCAGTCAGGGGAACTTCAGCTGGTGCCTCTCTCTTGCAGCTATGAGGCGACAGTGTG 

GTGACATGCCTCATACAGACTGTCCCAGTAAGCCAGGACAAGTCACCATTAAAATCTTGC 

ATGAACAGCCCTGGGCACGTGGGAATGTTAAGAAAGAGCCACCGCCTCCTTAGTCAGCTT 

AACCACAGCTCCAAACGCAGTTTGTCCCAGCTGGCAAACGCCTCAAACACCAATCATGCC 

GTCGTGCTCCTATTCTGGGGTTTTATAAA 
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Sequence 264 

TAGGGCGATTGGAGCNTCCCCCGCGGGGGCGGCCGCCCGGNCNGGTNCAGATTTNTATTT 
ATCCATAGATAGGGTATCTATAC ATACACATCTCAAGTGCATCTATTC C C ACTCTC ATTA 
ATCCATCATGTTCCTAAA I I I i I GTAATCTTACTGTAAAAAAAAGTGCACTGAACTTCAA 
AACAAAACAAAAAACAACAACAACAAAAAACAAGTCCAA ACTG ATATATCCTATATTCTG 
TTAAMTTCAAAAGTGMCGAAAGCAmAACTGGCCAGTTrTGATTGCAAATGCTGT^ 
AGATATAGAATGAAGTCCTGTGAGGCCTTCCTATCTCCAAGTCTATGTATTTTCTGGAGA 
C C AAACC AG ATACC AG ATAATC AC AAAG AAAG CTTTTTTAATAAG G C TTAA 
Sequence 265 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAAGGTACACGGTGGACC 
TGGAGTCAGGGCTACACTACCTCCTGCGGGTGGAGCTGGCAGCCCACAAGTCCCTGGCCG 
GAGCAGAGCTGMGACGCTCAAGGACTTTGTGACTGTCTTGGCCAAGCTGTTCCCTGGA 
Sequence 266 

GCGGTGGCGGCCGAGGTACTTTGGGGGAGAAAAACATGATTCCATTTACGGGGAAAAAAG 

CCATTGACACTCAGTAAGCAACACTGCCATCTAGTGGAATGGTGACACACCACCAAGAAT 

TTCAAGACCCGATAGGAAATGTGAGTGGATTTGGTTTCAATTTTCACC ACAAA ACAGCAC 

TTTTMTAAGCTGGTTTTCAGAGAACTTCAGA 1 HIM I GAGAAACTACTTTTTATCTTT 

AAAATGCATAAATGTATGTGTTTTCTCTGTTTTGGGGGGGGTGGTTAAGAATGAAGTTGT 

AM I I ICTTATTAGAGCATAAAGTTCTCTACTTGAAAGGTAGAAAACACTTTGG GGCA AC 

CTGCCTCCTTGAACATCCATCACTTCTGCTTGACAGTCTAACAGCACCCCTGATTTTAAC 

AGACAGTTTATAAAGGAAGCAAAACTC 

Sequence 267 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGAG 

GAC A I I 1 I IGTCAGATTAACTATAACAGTGTAGTGTAGTTTTTAAMTTGCAGTTGAAAA 

GTTTAGCTGTCTTGGAAGTCAAATTTATCCAATTGTTCAGACTTCTGTTACTACTTAATA 

TGMGCCACCATGCTGGCTTGGACAGAATTAATTTCATTCATGTTATGGAGAATTCTATA 

TTACAAATCTGGTCCCCTATAATATGAACAGTGAGCAGTCAGAAATATACAAAGGGTTAA 

ATAGGGTAAAGACTTTGGCCAAGAAAGGAAAGGCCTTAGTTCTACCATAGAGTATCTTCT 

CTAATTAAAATGACTGGGAAATATATGGAAGCAGAAACCAGCACAAAGCACTACCCATCT 

AGAAATAATCTTTCAGTTAAAAAACAAC 

Sequence 268 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTCATGGTCAAGCTCTAAGA 

CACAACGAGCTCTGTTATTCAAGAAATCAATTCCAGTGGATTTCCAGTTCCAATTCCTGA 

GAACTAGGGTAAGGGGGAGAGCTAATGGTTGCTTCCTAAGGCCTTCTGGGTTTATTAGTT 

CCATTTCAGGACATGACAAGAAAATGTACCTGCCCG 

Sequence 269 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCTA 

AGGTAGTTAACACCACACAGGCTGATAACTCAGGGCAAGAGAGAGGCCTGGAGAGTTGAC 

AAGAGTGTGAAGATACTGGAGAGGGGTTCAAAATAGGATCAATTCTTCTAATACTATTCT 

TATTTCTTGCTAATAATTTGTCCTTGTAATTATAGTGAAGTAATGGTCTGGATATAAATG 

AGATATGGTTTTGTGAGGGAAGAAACATAGAGGATTATTCTTAACAACTGTGGCAGCCTT 

GCCAATATGGAACC 

Sequence 270 

CCGGGGTGGCGGCCGCCCGGGCAGGTGCGNTCTCAGCTCATTGCAAGNTNTGCCTTTNTA 

AACATGCTATTAAATGTGGCANTGAAGGANTGTTTTTANTGTNATCTTGCTATTAAGTGG 

TAATGAATGTTCCCAGGATGAGGATGTTACCCAAAGCAAAAATCAAGAGTAGCCAAAGAA 

TCAACATGAAATATATTAACTACTTCCTCTGACCATACTAAAGAATTCAGAATACACAGT 

GACCAATGTGCCTNAATATCTTATTGTTCAACTTGACATTTTCTAGGACTGTACCT 

Sequence 271 

TTGGAGCTCCCCGCGGTGGCGGCCGNGGTACATCCACAGGAGGAATCGGACGAGAGGATG 

TGACATTCGGTCCAGGAGAGAAAGAGCAGTTTCTGTTAAAGATGTAACAAATGGATTTCC 

AAAGTCTACATGACATTCACTTTTCAMCTTCCCACCAGTTGMTTTCTTTTTTT C 

AGAAACAGGTGATGTCTTGGAAAACAGCTCCTTATGTCTCTCTGTGCATCTCCATTTTCC 

TAGTCTCTGGAGTCTCAAAAAGAGTGGCAAAGCACTTTACAGTAGTAACTGAGGAATCAG 

AGTCTCTGCTTCAGCGATATCTAGTTTGTACCTGCCCG 

Sequence 272 
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CCGCGGTGGCGGCCGAGGTACTTGTCAAATGAAAGAACAGGGATTGCCAGACCTTCAAGG 

CAATGGGAAAGGAGCAAATCTGCAAAGGTAGGATCTCTTTGGAAGGCAGGTATTGGCCAC 

CAAGTCAAACTCCTTGAGTCTTATATTCTGATTGGGATGATCTCACATGGATGTTCATCT 

CTTATATGTGAATGCTCATTTGTGAAAAATAGTAAGAGCCAGCTAGGATATTTGGATTCA 

GTCAGGCACCATCAGAATAGTGCAGTGAAAGGCCAAACTGGCCACAAGACAGAGGAATGT 

TTTCAGTTTTCTGGTTTTCCTCTGGTCCATGATAAAGCTCGGAGTAACTCTTCTATCAAG 

ATGGGGCTATACCTTCNCATGACAGAGGCTGGCAATTGAGCTACCCAGCAGAACGTGTGC 

TCTCAAAAGGGGAAGTCAAGGGAAC 

Sequence 273 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAGGCAAGGAGAACTGTGCTATATTGTTGGTGCT 

TGCTGTTTTGCTGGACATCTTCAATTATTG I 1 \ I 1 ACCTGAAATCAGTAAGACTTTGACA 

G G ATATC ACCTG AATTATTAATG AATG C CC AGG AAGTAATTTTCTTCTC ATTCTTC TAAA 

ACTACTGCCTTTCAAAGTGCACACACACGCGTCCACATACACTGCATTCGTTGCTCCAGT 

ATAAATTACATGCATGAGCACCTTTCTGGCTTTTAAGCCAATATAATGGGCTGCAAAATG 

AAGACACCAGAGTGTATGCATACAAATCTCACTGTATTAAAGATGCAGGTTTTCTAATTG 

TACCTCGGCCGCTCTAGAACTAGGTG 

Sequence 274 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG I 1 I I ! I 1! I 1 ATC AAAAAAT 

ATTTTATTACCAAAAAATTAATTATCCTATACATCCTATACTGGGAAATCCATTGGAATA 

ATTGGCTAAAATAAATACAGG GCMTTAAATTCAACCTTTTTATAAG N AATGGAGAGN AA 

TTTGACATTTGNAATGTTATCAAAGCTTTAACTTAGAACATAAATAGTTAAAAAGGGCAA 

ACTCAAGTTTTCATCTCATTTATTGGCATGGTGTGTTTTTGTTTTCCACAAGCTCTCC^ 

TCAGTGAGGAGTCTTGGAGTTTCAAGAATGTACCCTGCCCGGGGCCGGCCGCTCTAGAAC 

CTAGGTGGGATCCCCCCGGGGCTGCAGGGAAT 

Sequence 275 

GCGGGCCCGAGGGTACACACCGAAMGGTGGTGGTATTTTTACCTTCCCAAAGGTGGTGG 
ATAGGGTTTTATTGGGCCCTTTTTCNTAAAAMGGAAMTANAAAAACCAAAA^ 
AAAAAAT AATTGGAAAAGGTTTTAAAAATACTTTCNTTTTCACC AGG AA^ 
TTATTTCCAA I 1111 1 AAAAAANGGGTTAGGAGGAGGAGGANGAAAACCATTTTT AATTT 
CCTTTTCTMTCAATTGGAAACCATCCAGGTTCAAAACCCAANG CCTTTCN ACAATTTTT 
TGGGGGGCCAGGGGNA 1 mTTTl 1 AAAGGTTANTTAAAAA 1 1 I I i 1 1 1 1 1 AAAAGGAAA 
ACCTTTAA I rTTTTTAATTTTI I AAAAAAAAAANGGGGAAAAGGAAACCAGGGGTTTTTA 
ATTTTTTTG G G C C AA GG AAAAAAAN G GTAAAAAAAATAAG G AA G G AMTT AATG G C C C AA 
AAAAI 1 I i IGGTCCI I I J 1 AAMGGAATTNGGGGCCATTAACCAAAAAAAATTTTAAAAA 
AAAAACCAACCAAGGTTACCCCTTGGCCCCCGGGGGCCGGGCCCCGCT TCTTA AGGAAAC 
CTAAGGGTGGGGGATTCCCCCCCCGGGGGGCCTTGGCCAAGGGGAAMTTTTCCGGANTA 
TTCCAAAGGGCCTTTAATCCGGATTACCCCGGTCCGGACCCCTTCGGAGGGGGGGGGGGG 
CCCCCGGGTTACCCCCAAGCC I (Mil I GGTTTCCC 
Sequence 276 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTATAT 

ATACATGTGCATATATATGTAMTTTATATTTATATATAATATCTTTATATAGATAGATA 

TCTGCAGACAGGTTATTG ATTATAGAG ACCCAAGAAAGCAACTC AATAATTGTTCAAAGT 

TTTCCTCACTGACTGCTGGTGTGTAGTTTAAGAAGCCCCCATTTGTTTGCACTCAGAATG 

C CTT ATCTTCTTTTAATGG AAC ACTTG ATGTGGAATTTTAAGTCTG AG AAGTGAG GTGCC 

TTCTGAAGAAGAGATTTTAGAGACTCCCTTCCTCTATAAGTTGGAAATGACCAGAAGTCT 

TAAGTAGACCACAGTTAGCTGACTTTGACATTGTAGGACGTAATCAGCTTTTAACCAAAT 

TATAGAACTGGTAAAGGGTAAGATCAAATTTGCCAAAAAA 

Sequence 277 

CCGCGGTGGCGGCCGAGGTACAGCACTGGGCTTTATAAAGACTGCACTCAGAACCACACT 

GCACAGTCCAGTTrTTTAAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAGTGAGT 

TTTGAGAAACAAATCAAACGAAGTAAACAAGAAACATAAAAACCAAATAGCAAA TGAATA 

AAAGCCTGTTCTTGTAACTTATTCAACTTTTGCCAAATTCCTACCAATCACTTGCTTT^ 

AAAAGAAATGTATAATAGCCAAAAGAGAAATTATGTCCCTGTTGTACCTGCCCGGGCGGC 

CGCTCTAGAACTAGGTG 

Sequence 278 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGG 1 I I 11 II I I 1 U I AAGTTT AAAATTTT A 
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ATATTrATAAATTTAAAAAATTTTACACGTGCTGAGTGGTAGCAGTGCTAACAl I 11 IGT 

GACCATTAAACCACAAACTCACTGCATAGAGCTTCTTTCTTTGTACCT 

Sequence 279 

AGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATACCGGAT 

ATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAAAATGG 

GGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTTTAGCC 

GGGCCGAGAGGCTGTACCTGCCCG 

Sequence 280 

AGGTACAGTGCCAGACCATGACTGTCAATCGTCAGATGAAGCGCTACAACGTTCCGTTTC 

TAACTTTTATTAACAAATTGGACCGAATGGGCTCCAACCCAGCCAGGGCCCTGCAGCAAA 

TGAGGTCTAAACTAAATCATAATGCAGCGTTTATGCAGATACCCATGGGTTTGGAGGGTA 

ATTTTAAAGGTATTATAGATCTTATTGAGGMCGAGCCATCTATTTTGATGGAGACTTTG 

GTCAGATTGTTCGATATGGTGAGATTCCAGCTGAATTAAGGGCGGCGGCCACTGACCACC 

GGCAGGAGCTAATTGAATGTGTTGCCAATTCAGATGAACAGCTTGGTGAG 

Sequence 281 

CCGGGCAGGTACTTGTTTATAGACTCAAAACATGATGAAAATTCATTTCATTGTCTTCAC 

TGAATAGACTTAACCTTGACAGTAAGCTTAAAGCATTAGTGTCATTTAATTACATGTCTG 

AGGAATTTATGCTAACTAGATTTTGGGCTTTTGCAGCGAATTTTAGTCGGGGCCTCATTC 

CTGAGCCCAAGTGACCCTTTCACCTCAGCTTCCCAAGTAGCTGGGA7T ACAGGTGCACA C 

CMCTGTGCTTTGCAGTmGmGTGTGTGTGTGTGTGTGTGTGTGTATmGTTTT^ 

TTACAAGCCCACAACATCATAGCTTTGCAGTT 

Sequence 282 

AGGTACAGCACTGGGCTTTATAAAGACTGCACTCAGAACCACACTGCACAGTCCAGTTTT 

TTAAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAGTGAGTTTTGAGAAACAAATC 

AAACGAAGTAAACAAGAAACATAAAAACCAAATAGCAAA TGAATA AAAGCCTGTTCTTGT 

AACTTATTCAACTTTTGCCAAATTCCTACCAATCACTTGCTTTTTAAAAGAMTGT^ 

CAGCCAAAAGAGAAATTATGTCCCTGTTGTACCTGCCCG 

Sequence 283 

ATTGGAGCTCCCCGTGGTGGCGGCCGCCCGGNCAGGTACGGATACAATTCCGCTGAGTTA 

AGATTCCAAATTCTAACCTCTCCATCACACGCCCCAGAAAGGACAGTAGCCAGCTTCTCT 

GGATGCTTTGCCAAGCAATTGACTCCATCACGGNGACCATCCAGCGAAGCAAGGAANGGT 

TTTGCAAATACTCGNTCCAGTTTGGTAGCAtTTAAAGCTCTTATATATTCTCGNGGGACC 

TCAAAAGGATGTAAAACCT 

Sequence 284 

AGCTCNCCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATA 

CTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAAGAAATGCTAAC 

CTGTGGAAAGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGAGAAGCATCCAAGGAACT 

TCATGCTTGATTTGACCACTGACAGCCTCCACGTTGAGCACTATTCTAAGGAGCAAATAC 

CTTAGCTCCCTTGAGCTGGTTTTCTCTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCT 

TCCCTTCTTTTCCTGGGTGCTCAGGGCATGCTTATTAGCAGCTGGG 

Sequence 285 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACATGGGTGTT 

TTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAATGAACTAACTGCCATGTAG 

GCTAAGAAAGAAATGCT/^ACCTGTGGAMGTTGGTTTTGTAAAATTCCATGGATCTTGCT 

GGAGAAGCATCCAAGG/^ACTTCATGCTTGATTTGACCA CTGAC AGCCTCCACCT TGAG CA 

CTATTCTAAGG AGCAAATACCTTAGCTCCCTTG AG CTGGTTTTCTCTG ATG GC ACTTTTG 

AGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCTCAGGGCATGCTTATTAGCA 

GCTGGGTTGGTATG 

Sequence 286 

CCGCGGTGGCGGCCGAGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATACTACAT 

TTGAACAGGGCAAAATGAACTAACTGCCATGTAGGCTAAGAAAGAAATGCTAACCTGTGG 

AAAGTTGGTTTTGTAAAATTCCATGGATCTTGCTGGANAAGCATCCAAGGAACTTCATGC 

TTGATTTGACCACTGACAGCCTNCACCTTGAGCACTATTCTAAGGAGCAAATACCTTAAC 

TCCCTTGAGCTGGTTTTNTNTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTT 

CTTTTCCTGGGTGCTCAGGGCATGCTTATTAGCAGCTGGG 

Sequence 287 
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AGGTACTTACATGGGTGTTTTGATCTCTGTTCTTTCATACTACATTTGAACAGGGCAAAA 

TGMCTMCTGCCATGTAGGCTAAGAAAGAMTGCTMCCTGTGGAAAGTTGGTTTTGTA 

AAATTCCATGGATCTTGCTGGAGAAGCATCCAAGGAACTTCATGCTTGATTTGACCACTG 

ACAGCCTCCACCT TGAGC ACTATTCTAAGGAGCAAATACCTTAGCTCCCTTGAGCTGGTT 

TTCTCTGATGGCACTTTTGAGCTCCTAAGCTGCCAGCCTTCCCTTCTTTTCCTGGGTGCT 

CAGGGCATGCTTATTAGCAGCTG 

Sequence 288 

AGGTACAGTAAACATTATTCAGTGAGAGATCAGMGGAAACAAATGGCATCTTTTCAGAA 

CGCTGTCAACTGTTCCCACAACCCAAGTCTG Mil 1 CC AAGTTGC ACAAGTGCTTGG AAT 

AACTCTGAAACAATTCTCTTCAGAGTTTAAAGGCTTCAGAGTATAGGTGATGCTTCCTAA 

AACAAGAAGCCTGTATTAACATCACAAATTGGAACTCA7TCCAAAGCCTCTTCTTAGAGA 

AAAAAAAATAAAGAAATAAAAAATAAAAAATGTAAAGAAGGTCATGAAAAATTCACATGC 

AGCTAATTATCGTAAAAAATATCAAACAC 

Sequence 289 

AGGTACAGCATTGATGAACACTTTCAGCCGAAGCAGATTGTCAAGTCTCTTATCCCTTCG 

TGGMCAAACTGGTTTTCTTTGAAGTATCTCCTGTGTCCTTTCACCAGGTGTCTGAAGTG 

CTGTCTGAAGAAAAGTCACGTTTGTCTATAAGTGGCTGGTTTCATGGTC CATC ATTG ACT 

CGGCCTCCCAACTACTTTGAACCCCCCAtACCTNGGAGCCCTCACATCCCACAAGATCAT 

GAGATTTTGTATGATTGGATCAACCCTACTTATCTGGACATGGATTACCAAGTTCAAATT 

CAAGAAGAGTTTGAAGAAAGTTCTGA 

Sequence 290 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTGTGGTAGGTCCAGGAAATATGACATTTTC 
CCCCTTGATGTGTTATTGTTGTTGTTGGGTGGGGTGGGCATTTTGTTTATTrGTTTGGTG 
GCAATCAGTGGTAGTAGGGAGTGGGAGGGCTTATATTGGTTTTTCCAGCTATTAAGGGGA 
CATATTGTGTCGTTGTGCTTTrCACGTTATAAAATGTTTATATTTACCAGTACCT 
Sequence 291 

GAATACGACT CACTATANG CGNNAATCTGANAGCTCCACCGNGGTGGGCGAGCCCGCCCG 
G G GC AG G G G TTTTTTTTTTTTTTTTTTG G TAAATTTTC AAT AA G TTTAATTTTTAAATAA 
ACCCAAGGGTTTTACAATTAAACCAGGTCAACGGTGGAATGGAAACC I I I I I I I I CTTTT 
AAATGGTCMGCNTNAAAACCTTCAAAAAACCACCMGTTm^ 

AAACCC AAAAACC AG GCTTC NTTTC AACGGTTTTCCC AAG AAAAG CCTGGCC CTTC AAC A 

GGCTTAGGCCACCAAGAATCAACAAGGGAAGAATTTAACTGGTCNTTGGTCCCATTAACC 

CCAACCCAAGAACCACCAGGGAAACCTTGGAAACCACCCCCAACACCACCCCAAGTTTTT 

TTCAAAAAAGGGAAGGGGGGAAACCTTTACCAAAATGGAAAATGGGCTTGGGCCTTGGCC 

CCAAGGGGGCCAAGGCCCAATTGGAAGTTGGGTAATTCNTGGGGGGGACCTTCAAAAGGC 

CTTGGGGGAAGG I Mill! ICCCAAGGGGGGGGGAGGNAAAAGGGCCCTTGGGGGGAAAG 

GCCTTTTGGTGGGGCCAAANGGGGGAAAAGNTTTGGGGGNAAA 

Sequence 292 

CCCGCGGTGGCGGCCGAGGNACNGANTNATTTTCNGNATTACTGAACNCCTTTGAACTAT 
TTTGAACTNTGAATCTCTTCTCTGAATAACTGCCGAAACTCAAAGATAATAATNAGTTGA 
TGAAGGTTATCAATTAATAAAATAACNCAGACCANTCTTACCAAACTCTAAATACATTNT 
AAAAAATTTTAACTGGC AATG ATAAAAAGAATGTTG AGTACCTG CC CG 
Sequence 293 

CCGGGCAGGTACAATAAAGTTCACCCACCCTGCACTTTGGCCCTTAGATCAATCCTAAGT 

AGCCATTGCCAGTAGGCCAAGTTTAATCAGAGGACAGTGCCTACCAGTAAATACTGAATA 

GTTACAATAGTTATGTCCATCCAACCAGTAGCAGATGAACAGCTAATACATCATGATGCT 

ATGCTCTCCTAACAGGGTCCCCTCAGATCCTCAGTGAGCACATAAAGAAAGGGAGGTCAT 

ATCCCTTACATCTCTACCAGGTATTAACACCTAACTACTCTCTAGCCAGAGGCAATTCCC 

TTTATTTCCTTACTCTCGTCGTCTTCTCT 

Sequence 294 

CCGGGCAGGTACATAACATTTCAAATATAAGTGGAAGGATCATCAGTAGTGTTATCAAAA 

TGCATAATACAGAAACTTTTTAAGAAAGGATAAAAAATTACACTCAGGACCCATAACTCT 

TCCTCATTATAAGCATATGTAGTGATTCATTCATGCAGGTTTTTATATGTAGATAGGATT 

TTTTTTTCCTTTTC AAG AATTCCATTG TAGCC ATG AG ATG AAAAATGTATTATGG TAATG 

GTATAGCTTTCTTCTATTTTGCTTTTAGTGTTAGGTTTGCTAAAAGC™ 

CCAACTGACATAATGTGTTTTCAATAAGGAGGACGCT 
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Sequence 295 . 

AGGTACTGATTTMCACTCTGTATTACTGAACTTCmGAACTATTTTGAACmGMTC 

TCTTCTCTGAATAACTGCCGAAACTCAAAGATAATAATTAG TTGA TGAAGGTTATCAATT 

AATAAAATAACACAGACCAGTCTTACCAAACTCTAAATACATTTTAAAAAATAGGAACTG 

GCAATGATAAAAAGAATG TTG AGTACCTG CC C G 

Sequence 296 

CCGCGGTGGCGGCCGAGGTACAGCACTGGGCTTTATAAAGACTGCACTCAGAACCACACT 
GCACAGTCCAG I I I I I I AAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAGTGAGT 
TTTGAGAAACAAATCAAACGAAGTAAACAAGAAACATAAAAACCAAATAGCAAATGAATA 
AAAG C C TG TTC TTGTAAC TTATTC AAC TTTTG C C AAATTC C TAC C AATC AC TTG C I I I I I 
AAAAGAAATGTATAACAGCCAAAAGAGAAATTATGTCCCTGTTGTACCTGCCCG 
Sequence 297 

CNCCCCGCGGTGGCG GCCGAG GNACAGNACTGTTTTTTNNAAAGACTGCACCCCAGAACC 
ACACTGCACAGTCCAG I Mil I AAAAAGCTGCTACATGACAGACAGGTAATCCCACTGAG 
ACGAGTTTTGAGAAACAAATCAAACCGAAGTGNACNNGAANCATAAAAACCAANTAGCAA 
ATGAATAAAAGCCTGTTN TTG N AAC1TATTC AAC TTTTG CCAAANTCCTACCAATCACTT 
GCTTTTTAAA 
Sequence 298 

CCACCGCGGTGGCGGCCGAGGTACTATGTCAGNTTTGTATGTAACTAGCTGTCAGGTCTT 

TCCCCGTTGCCTCAGCCTTTCTACACAGACTGGCCTTCAACTTCCCCTGAGTCCAGAAGT 

AGACTCTTTCAGCAACTCTATTCAGGAATCTGCAGCAGGAAAACTGCTTCCTCTATTAAC 

ATCTATGACTGAAGCACAGATGTGTCTAATAGAAATCACCCTTCACCCAAAAGCTGGGTG 

CAGAAAGGGAAGCCCTTAGCTGACTATAGGAGGTGCCTCTTGTGGCTCCACGTGCTTCTT 

ACACACCACCCCCCAGCTTGAGCGATGCCTCAGCCAGCTCACCCTCATCCACACAATCGC 

TAGAAA 

Sequence 299 

CCGGGCAGGTACCATATCTCCTGGCTTCCTTCTACATGGGTCACTTAGTTAAGAGGGAGG 

CCAAGGGAGTTCCGATTTCAGGCAGTGTGTGGCAGGGTTACTGTCCTAGCAACCTGGCTA 

CTCCTCACTGTGAACGTTTCTCATAGGTGTCATATGGCAGGATGAAAAACATATTTGCCT 

CCCAGTGAAAGATGGCACAGGCTTTTGCCCAGCCAGGTTGGCAAGAGAACAGAACTCTTA 

ACCCCTTGCTCGACAGGTTTGAGTTCAAGGGGTTGGATGCTCCAAGCAGAGGGCCAAACC 

CTGATTTATGAAGCATGCTAGGTCAACAGCCAGTCAGACCA 

Sequence 300 

CCGGGCAGGTACCTAAGGGGTTACTTGTTTAATGGGATGGCATTGAC Mil fGAAAATCA 

AGTGGACTGAGTCATTGATAAAACATTTCTAAGAGTGGGGCTAGAGAACATACTTTACAT 

CTGACATCCTTTGGCCTAACAACATCTATTATTATAGTGCTCAGCAGTGTGGGCATTGAA 

GAGGCGCAGAATGCTTTGAAAGAAACTAATCAGAATCTTGGAACATCATGATCATGCCAT 

TCTTMGTAAATCAACTATTTTCAACACTGAAGAAAAATGAAACATTATTTAGAAAACAA 

TGAGATTACAAGTTCCAAACTCAGCCA 

Sequence 301 

CCGGGCAGGTACCCTCTACCTCCTTCCCTGTCGGAAAAGTCACCACTTTAGTCCCTGGCC 

ACCAGCACACCCCAGGAGGGTGAGTGGCCTGAGGTAGTTACCGGCACTTAAAACTCCCTT 

GCTACCGATCTGGAACTCAAGCCCCAAGACATCCCCTTAGATGATCTGAATACGCATTCA 

GGGACAGATCTAGGCAGTTTCTAAACAACACTTAGACTGGGGTCTAACGTTGACAAATCC 

TTCTAGAATTTGCCTCTTTGGGACTGAAGTCTAAGGGGCTGAGACCAAGAAGGGAGAGCA 

CAAGACTAAC7TTGGTCTCTTGACCTTTT 

Sequence 302 

AGGTACCTTCTAAGGTAGTTAACACCACACAGGCTGATAACTCAGGGCAAGAGAGAGGCC 

TGGAGAGTTGACAAGAGTGTGAAGATACTGGAGAGGGGTTCAAAATAGGATCAATTCTTC 

TMTACTATTCTTATTTCTTGCTAATAATTTGTCCTTGTAATTATAGTGAAGTAATGGTC 

TGGATATAAATGAGATATGGTTTrGTGAGGAAGAAACATAGAGGATTATTCTTAACAACT 

GTGGCAGCCTTGCCAATATGGAACCCAATTTCAGAGTTTGTGAAATGGTACCTGCCCCGG 

GCGGNCCGCCCGGGCAGGTACAGTGACCTAAAGT 

Sequence 303 

AGGTACTCAAATGCATTGATTTTTCCAGCTGGCTTTTCAGAGCAGTGCTAGTGAGAGGCA 
CCAGTGTAACTTGAAGCAGGAATAGTGATGGCTCAGGTCCCCACAGGTAGCTCCTCCATA 
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CCTGTGGGACTCAAAGGCTAAGGGCACTAACTCATAGGGCTCAAAGGCTGGATGAAGAGA 
ACAATCAATGTGTTAAATGTCCTTCAGACCTAGGTCAGAGGACAAATTTTGACTAATTCT 
TTTGACTAATTCTTTTTGCAACTTCTCTCAACAGGTACCTGCCCG 
Sequence 304 

CCGGGCAGGTACCTATTCACCATTCCAACGTGAAGAAGCTCTGCAGTAGGAAAAATAATT 

AACACACTTATAGTCTACTGCCTATGTAAGGATCAGCTCCGGCTAAGAGGCCAAAGATGG 

GTGACATCGTTATGCTCTGCCTTTATT I I ( i CTTTCTTACCCACTTAGCTTCCTAATTGG 

AGGAAGGAGGCGTGGTAAAGGTATATGAAGACTATGGTTTAATTAGACCAGAAAACACTG 

TCATAATCTCTGGGGTCATCAGAATGTCCAGTTTTGTCTTTGGGCCAAGATAAGGGCAGT 

GGGATTTATGATGTGTTGTTTATAGTCTGAAACTACTC 

Sequence 305 

AGGTACGCATGTCGCAGGGTTMGTATGATGCAGAGGTTAAAGTCTGTTTGMCAAAAAC 

AAA TGCC CGGGGAA^TTTCATAGCTATAAAGTTAATAACTAAATTTTGTTCACTAAGAGG 

ACCTTTTCTATGGATTTCCTTCATCTCTCAGTCACACTGCAAAGTTATCTGAAGTGCACT 

TCCCAAGTGTAGTTAGAGAATAGGAAGGAATGTAAAA I I I I I I I I I I AATTATTTGTTTT 

GTTTGTTTGTTTGTTTTTTTAAAAACAGCC^ 

TGTAMCTCATAAAAGCCTTTTATAGAAAGCAAATTC 

Sequence 306 

CCGGGCAGGACTAGGGTGGACTAATGTCTCACTTCAAAAAAAATATGAGTGTAGTATACT 

TGGCCTAAAATTGTTACTCAGTGTTACTAAAGTGTAATCATTTTCCACATCTCCAAATAT 

ATTCCAAGTCTCTGAAATACTAGTGCATTGGGAGGTAGAACATTTTCGAGCTTCATGTCT 

GTAAATGGTTTCTCTTTACTGTCATTCAGTAAAATCAGAATAAATGATTGATGTAATACT 

TAAAAAAAAAAAAAAAAAAAAAAAACCT 

Sequence 307 

CCGGGCAGGTACATTCAAATCAGGACACCTGGGTATATGGACAAAGATATCCTACAATTA 

AAATGGAATAGAAGTTAAAAAAAAAATTCAGTCCCCTACACCACAGGCCTCAAACATGCT 

TGCTGATGGATTCTGGGAAGGTTTTGGATGAAGGACGGAGATATGGGAGGAAAGTGAGAA 

AACAGTGGATTCCCTTTGAAAAGTATGCTAGCAGACAAGCATTTGGTTTTATGAAAGAGG 

CACTCTTATAGAGAAAGAAGCTAGTATGTGGTGTATAAAAAGCCCCTAAATCATCACCAG 

AATGTCTATCCATGATTGTACCTCGGCCGCT 

Sequence 308 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGMGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGGTACCTGCCCGGGCGGNCGNCCGGNCAGGNCATAATACATGATTGA 

ATACATGANCG NATNTAACATG N NTTTTTTTTCTG N NG N GG NCAAATAAACATNCTNAAA 

GNNGCANCTGCNNATCAGNNACCCTTAGAAAA 

Sequence 309 

ANCCCCCCGCGGNGGCGGCCGAGGTACGCATGTCNTTTGGNNNAGTATGAANCACNGGTT 

AAAGTCTGTTTGAACAAAAACAAATGCCCGGGGTAATTTCATAGCTATAAAGTTAACAAC 

TAAATTTTGTTCACTMGAGGACCTTTTCTATGGATTTCCTTCATCTCTCAGTCACACTG 

C AAACTT ATCTG AAGTGC ACTTCCCAAGTGTAG NTAG AGAATAG G AAGG AATGTAAAATT 

1 1 I I 1 1 1 I AATTATTTGGTTTGGTNGGTNGGTTGG 1 WW I AAAAAC AGCCCTTATTAAC 

TCTTCCTTCCACTGATTCTACTGGAAACTCATAAAAGCCTTTT 

Sequence 310 

CCGGGCAGGACTAGGGTGGACTAATGTCTCACTTCAAAAAAAATATGAGTGTAGTATACT 

TGGCCTAAAATTGTTACTCAGTGTTACTAMGTGTAATCATTTTCCACATCTCCAAATAT 

ATTCCAAGTCTCTGAMTACTAGTGCATTGGGAGGTAGMCATTTTCGAGCTTCATGTCT 

GTAAATGGTTTCTCTTTACTGTCATTCAGTAAAATCAGAATAAATGATTGATGTAATACT 

TAAAAAAAAAAAAAAAAAAAAAAAACCT 

Sequence 311 

CCGGGCAGGTACATTCAAATCAGGACACCTGGGTATATGGACAAAGATATCCTACAATTA 

AAATGGAATAGAAGTTAAAAAAAAAATTCAGTCCCCTACACCACAGGCCTCAAACATGCT 

TGCTGATGGATTCTGGGAAGGTTTTGGATGAAGGACGGAGATATGGGAGGAAAGTGAGAA 

AACAGTGGATTCCCTTTGAAAAGTATGCTAGCAGACAAGCATTTGGTTTTATGAAAGAGG 

CACTCTTATAGAGAAAGAAGCTAGTATGTGGTGTATAAAAAGCCCCTAAATCATCACCAG 
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AATGTCTATCCATGATT 
Sequence 312 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATT ACCCTACCAAC 

TAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTAGTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGGTACCTGCCCGGGCGGNCGNCCGGNCNGGNCATAATACATGATTGA 

ATA 

Sequence 313 

AGGTACAAACAGTAGMCAATACTGACAAATGCAAACTTAGTCACATGTGCTTTAATAAC 

TGACAATACATTCAAGCAGGTTTTTCTAATTCAAGTGTATAGCACATATTCAAATAATAG 

GAAACAAATCTCATGCAAAGTAMTAAATCTTGATGTNTAAAAACTGATTAGAACTAGAA 

AAGAAGTGGACATGTTTTATTATCTTTGCATTACGTTCTAACAAAAGCAGATTATCAGGG 

GCTCTTACTCACTTGCCATTCCTGACATGAGCACTATAAGTG.AATACTATGAGTTCTACA 

AACAGAACAT1TTTCCACATGAATTTGACTTGCAA 

Sequence 314 

CCGGGCAGGTACGCCGTTTCTTGTAAACACGAGGCACCCCAAGATAAGAAGACAGATAGA 
GCAAGGGATGGACATGGTCATCTCCTCAGTGATTGGAGAAAGTTACCGGCTTCAGTTTGA 
TTTTCAAGAGGCAGTGAAGAATTTCTTCCCCCCAGGAAATGAAGTGGTTAATGGAGAAAA 
TTTAAG CTTTGC ATATG AATTCAAAGCTG ATGC ATTATTTGATTTCTTCTATTGGTTTG G 
GCTCAGTAATTCCGTTGTAAAAGTAMTGGAAAAGTTCTGAATTTGTCAAGTACCT 
Sequence 315 

AGGTACATTTTGTATTMCTGMCTCAGCTAAACAGTAAAACCTGTTTTACTTAATCCCT 

GTCTTGACTACCAGACTATCATGATGTTTGTTGGAACTGTAATTCCTGCTCTCCATTTCT 

CTCTGTCCCCCAAGTTGAAAATATAACCCAAATCTTTAGAA I I I I I GCCTCTCATTCCTT 

GCAACTCCAGTGGGCTAGAATCTTGATCCAGTTTTCTCCAGACTAGATTTCAAGCTACTC 

TGCATGATGTAGAACATGTAACCATGTAACCTCATCAGCTACCTTTTCCCTTTTGACTTT 

TCCTGTTGCACACAGTTCAGTCTGACTATC 

Sequence 316 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAATGCGATCACT 

GTGGGAAGGCCTTCAGCATAGGCTCCAACCTGAATGTGCACAGGCGGATCCACACCGGGG 

AGAAGCCCTACGAATGCCTTGTCTGCGGGAAAGCCTTCAGCGACCACTCATCCCTCAGGA 

GCCACGTGAAAACTCACCGGGGAGAGAAGCTCTTTGTGTCATCCGTGTGGAAAAGGCTCC 

AGTGAGCGCGCCTGCTTTAGAGACACAGGATGATTCAGACCGGAAACAGACCTCGTGGGT 

GTAAGAGGAAGCCTCTGTGAGCTCGCACCTTACTGGGTGCAAAAGAATCCACGGAACTTG 

GGAGAAGTCCAGTTCCTGTAAAAACTGGGAAGACGAGGCGTTCTCATCCCATAGGAGGTT 

TGTGAGAACTCACGCCGGGGGTGAAAATGTACCT 

Sequence 31 7 

GGAGCTCCCCGCGGNGGCGGCCGAGGTN 1 1 II 1 I 1 I I I AGTTTAGTTTATTTCTAGTTCA 

AAAATAATCTGTAATTGCTGTAAGAAATGTCAACCACTTACCTAGGATGTTTGACAATTG 

GGATGAAGTCTACATATACTMGTMTGGCNAGACMTTATTTTATTGCTCAAAAGAAAG 

TCAAAAAAATTCCATATTCCCTTTGGGGAAAATTGGCAGGATTTCAAGTATGACCTTTAA 

G AATC AGG AAAAG ACTAACTTATG CTTTAG G ATTAAAAC AATC AAATAATTAAATTAGTT 

C 

Sequence 318 

TCTGGCGAACATACATGGCATATAGAATATCNCCTTCTTCCACATGCAGCCTGAACTTCT 

TCACCTTCTCAAATAAGGCCTTAGCCTGCTCACCTTCCTTTTTATCCAGAGCCCCTGCAG 

TGGATTTATCAACTACAAACTTCTCAGGAATGGACTTTAAAGACTGAGAGTTGACCAGGC 

TGCCTTCTGGACCAGATTGGATGGTGTTGGACCTGTTCATGTTGTTCTTCCTGTTGTCCT 

TTTCTTCTGAGTCCTCCCCAGACACTTCAGATAAAGC CCCG TGTGGTCCAAGGGAGAGTC 

AAAACACTTCCCCAGMTGGGAGATGAAAATGTTCTATTTTGGGAGCTGGGACCAGGGGA 

ATTTGAACCAAAAAGTTACTGGCAGAGCATAAANACCCAGGGTNTNGATGAAGGCCAAGG 

TAAGGGNAATCTTGGCTTACCAGTGGAGGGCTCGCTTATACACATTTTGNTNANAACGGN 

NNCCTTCCCCGGGCCGGCCCCCCGGCAG 

Sequence 319 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAACAT 
CTCTTAGTAATTGAGAAAATTGAAAGAAAAGAAAAAAGAGAAAGGGAGAAAGAGAAACAG 
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CGAAAGGGATAATGAAGGCGAGAAAGAAGAAGAGAAAGGAAGAGGAAGAAAAGTAAAAAG 

GAGGAGGGGGMGGMGAAGGAAGAAAGGTGAAAAGAAAGAATGGTAAACTTTTTAACAA 

CATAATTTATCCTTCTAGAATATGAATGTTGGTCTATTTGATGATGTCCCACAAATTCAT 

TAGTCTCTGCTCATTGTTTATTTTTTATTCTTTCTGTTTCTCAGAGACAGTATTTT 

TTTCT TCTCTT CA AGTTC ATGGCTTCCTCTGTGTGTGCAAATATACTCTTAAATCCCTCT 

GGTGAi I \ 1 lAA Al 1 I I 1 ATCATTGNAGTTTTCCACTCCAGAATTTGGTATCCTCTTTGC 

TGATATTCCTACTTTTTAATATTTTTTC 

Sequence 320 

NCCGCGGTGGCGGCCGAGGTACCGCTTATGCCAGGCCGATCCCGACCTGGACTCAGACAA 

GGATGAAACAGGCGCCTATTTAATCGACAGAGACCCCACCTACTTTGGGCCTGTGCTGAA 

CTACCTGAGACACGGCAAGCTGGTGATTAACAAAGACCTCGCGGAGGAAGGAGTGTTGGA 

GGAAGCAGMTTTTACAATATCACCTCATTAATAAAACTTGTAAAGG 

Sequence 321 

GGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGI I i I I ATTTCTAGTTTAGTTTATTTC 

TAGTTCAAAAATAATCTGTAATTGCTGTAAGAAATGTCAACCACTTACCTAGGATGTTTG 

ACAATTGGGATGMGTCTACATATACTAAGTAATGGCAAGACAATTATTTTATTGCTCAA 

AAGAAAGTCAAAAAAATTCCATATTCCCTTTGGGGAAAATTGGCAGGATTTCAAGTATGA 

CCTTTMGAATCAGGAAAAGACTAACTTATGCTTTAGGATTAAMCAATCAMTMTTM 

ATTAGTTCAATTTTCTAACATAGTCTCTATCTTCAGTTAAAGTGCATCATTGCATGTTAT 

ACATTACTAAMTTCCACAGTGCATAATTGTTACCCATGTGACTATTTAATTCAGGGTCA 

ACTGTCTAAAGGCTC AAGGTGTCAC ATTAAGGGTATAATCTAGAATGAATTG G AC N A 

Sequence 322 

AGGTACATACTCAGACATGTCTAATACAATTCCAGTGTTTTATAAACTGGCAGAATTGAT 

GTGTTTTATAACAATTCAGTATCCAAAAGCCTGAGGAGACTTGGACCATTTTCCTAAGGG 

ATGAAGCGGGAAACGTTTATTAACCAGAGCCAGATTGAGACCTTAATAATAAAAAGACAT 

GGGTCCAACCCATAGGTCACTCCAAATGGCATCAACACTAAACCATAAATAAGCGTTAAC 

AAAGTGCAAAGTTCTAAATTTTCGCATAGCTACTTCAAGTCCTCTCI till I AAACATTA 

TTCTTATGTGGCTCACCCGTCACTAATGCTCAGTGTGCCTCTTGGGTGGCTCAGCAGTGG 

GAAGTCCACTTTCAGCCTAACTACTCCTAACAATGAAGCTGTTCTCCAGAGAGCAAGAAT 

AGCCAGCCACATGCCCCCTTCTCTGCTTTTTCATCCCCCTCCATCTGATTCAATGGCGGT 

GTCCCTGGCTTGCAGGCTTCAGTATGCTTCA 

Sequence 323 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGAATCATCATGTAAGG 

AGTTTTTAAAACATTGTTGCCAGGGCCCCTTTCTAGACCAAGTTAGTCAGAATGTTGGAC 

AATGAGGCCCATGCATGGGTA I I I I I ACAAAGCTCTCTGGGAGATTCTAATGCTTAACCA 

AATTGA GAAG CACTGAATAAGAATATCCTG G GCCGGGCGCACTGGCTCATGCCTGTAATC 

CCAGCATTTTGGAAGGCCGAGGCGGGTGGATCACTTGGGGTCAGGAGTTCGAGACCAGCC 

TGGCCAACGTGGTGAAACACCGTCTCTACTGAAAATACAAAAAATTAGGTGTGGTGGTGC 

GTGCCTGTGTTCCCAGCCACTCGGGAGGCTGAGGCAGGAGAATCGCTGGAACCTGGGATG 

TGGAGGTTGCAGTGAGCCAAGATTGCACCACTGTACTCCAGCCCG 

Sequence 324 

C CGCGGTGGCGGCCGAGGTACATTGAATCATCATGTAAGGAG I I I I I AAAACATTGTTGC 

CAGGGCCCCTTTCTAGACCAAGTTAGTCAGAATGTTGGACAATGAGGCCCATGCATGGGT 

ATTTTTACAAAGCTCTCTGGGAGATTCTMTGCTTAACCAAATTGAGAAGCACTGAATAA 

GAATATCCTGGGCCGGGCGCACTGGCTCATGCCTGTAATCCCAGCATTTTGGAAGGCCGA 

GGCGGGTGGATCACTTGGGGTCAGGAGTTCGAGACCAGCCTGGCCAACGTGGTGAAACAC 

CGTCTCTACTGAAAATACAAAAAATTAGGTGTGGTGGTGCGTGCCTGTGTTCCCAGCCAC 

TCGGGAGGCTGAGGCAGGAGAATCGCTGGAACCTGGGATGTGGGAGGTTGCAGTGAGCCA 

AGATTGCACCACTGTACTCCAGCCCG 

Sequence 325 

CCGCGGTGGCGGCCGAGGTACTGTCCTTACMCAGTTTGGACTGAATATTCTTGTTTCCA 
TTGACAAACATTTCTTTCTGAACTAGCAACAGCACTAATCAATAGCTACACTTAAGCCAT 
TTAAGACTMGCTTTAGATGTGTGTCTAATTATCTGGCAAAAATAAAAGAACCAAAGAGT 
TTC ATGTTAGCAGC AAATTAACTAG ATTTAAGGGTCTTG ATTTTTTTCATAAG C CTTTG G 
AAGCCTGCATATTAAACACTGGTGAGTTTTAAATTCTCAATGGAAATGGTGATGGCATTT 
ACTGAACCAAGCCAGCTCATATTACAACG I i ! I I I I M I I i i I I I I CGTGGTTCTGAGTG 
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CAGTCTTTATAAAGCCCAGTGCTGTACCTGCCCGGGCGGCCGCTCTAGAACTAnnnnnnn ' 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCCACCTCGAGGGGGGG 
CCCGGNCCCAACTTTTGGTNCC 
Sequence 326 

CCGGGCAGGTACCGCCATGATGAGAACATCTTGGAGTCCGAGCCCATTGTCTATCGACGG 

ATCATAGTGCCCCACAACGTGAGCAAGGAGTTCTGCACGGACTCTGGCCTGGTTGTCCCA 

AGTATTTCCTATGAGCTGCATAAAAAGCTGTTGTCCGTGGCTGANAAGCATGGGCTGACC 

CTTGAGCGGAGACTGGAGATGACAGGTGTGTGTGCCAGTCAGATGGCACTGACCCTCCTN 

GGAGGACCTAACAGGTTGAATCCCAAAAATGTTCACCAGAGGCCTACAGTGGCTCTACTG 

TGTGGACCTNATGTGAAGGGGGCTCAGGGTATCAGCTGTGGAAGGCACCTAGCCAACCAT 

GATGTCCAGGTCATCCTTTTCCTGCCCAATTTTGTCAAGATGTTGGAATNTATCACCAAT 

GAGCTGTCGNTCTTCAGCAAGACCCAAGGCCAACAAGTGTCTAGCCTCAAAGATCTGCCC 

ACTAGCCCTGTGGACCTGGTCATCAACTGCCTGGATTGCCCTGAGAACGTCTTCCTGCGC 

GAT 

Sequence 327 

CCGCGGTGGCGGCCGAGGTACACATGCATGCATACACACCCATA CAAAC ATCTGTGTGAG 

GGCAGTTCTGGAGATGAGCAGAGAGAGACCGGAATAAACTCAATCTTTTCTTTCCCAAGC 

TCCTAGCCAACACTATCCTTGGGAGAAAGAAATTTGCAGAAACTGCTAAGACCAAGTGTG 

GAGATGTCAAGCTAGTTCACACTCTGAGGCTCAGAATATGTAGGACATGCACAATTGTGC 

AGTCCTTTGGGATTGGAAGTGAAACAGTCTGTGATCCCCTACCTTCTAGGGAACTAGGAC 

NTAGGAAGAGGTAAAGATTATCAGGTATGCAAAGCGCCCCAATTNTTCTGCTGCCATGGG 

GGATTTTACCCCA 

Sequence 328 

AGGTACATATTTGTCCTTCTAGTTTGTTACTATCCTTCCCTGAAAGAGCGGAGCTGTTTA 

TAGGMGCACAACATTTGAGTCCCATTCACTGCTGTTTGTATTACATTTTCACAAAGCCT 

GCTTTGAAAGCTGGCAAACACTGCAAGTTACATGTTACCATATTACACATGGGAGGACCC 

ATACACATTGTTA I I I i I AAATTTAACATTGAACAAAAAAAAGGAGGAATCACTTAACTT 

GGAAACAAAACACCAAAGTTAGTCAGTGGAATGGTCAAGAGCAGGACAAGCCTGCCAGGG 

CCACTGAACTCCAGTCACAAGTGGTCAAAAGTATGACCGAGAACAACTGGATGCAGGAAC 

ATCAGCACAAAATACAGATGGGAATATTCTCGGGAGTGGTACCTGCCCG 

Sequence 329 

CCGGGCAGGTGTGAAAGTTTAACGTTTGCGATAAACTGCCGTAATTTTGATACATCTGTG 
ATTTAGGTCATTMTTTAGATAMCTAGCTCATTATTTCCATCTTTGGAAAAGGAAAAM 
AAAAACTTCTTTAGGCATTTGCCTAAGTTTCTTTAATTAGACTTGTAGGCACTCT^ 
TAAATACCTCAGTTCTTCTTTTCTTTO 

ATGTATTATGCTTGAAATTTTAA 1 1 1 I 1 I 1 I I I 1GCACTGTAACTATAATACCTCTTAAT 

TTACCTTTTTAAAAGCTGTGGGTCAGTCTTGCACTCCCATCAACATACCAGTAGAGGTTT 

GCTGCAATTTGCCCCGTTAATTATGCTTGAAGTTTAAGAAAGCTGAGCAGAGGTGTCTCA 

TATTTCCCAGCACATGATTCTGAACTTGATGCTTCATGGAATGCTGCATTTATATGTAAG 

TGACATTTGAATACTGTCCTTCCTGCTTTATCTGCATCATCCACCCACAGAGAAATGCCT 

CTGTGCGAGTGCACCGACAGAAAACTGTCAGCTCTGCTTTCTAAGGAACCCTGAGTGAGG 

GGGGTATTAAGCTTCTCCAGTG I I I I I I G TTG G CTC C AATCTT AAAC TT AA 

Sequence 330 

AATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATGATGCCAGTGTTCCGGGCTGTGA 

TGGGTGGTGAATCAATGTCCAGGCGGCACATGTGCTCCAGGAATGTGTCAGCCATGGCTG 

CGTGCAGCTGCTGGGTCTGAATGAAGGCAGTCCCGGCTTCACTATGGGGCTTCGACATGG 

CTGCTGAGGTCCTCCGGCGCTGACCGACTCGA CAATT AGAAAACCTGCATNTTTAATACA 

GTG AG ATTTGTATGC ATACACTCTG GTGTCTTC ATTTTGC AG CCC ATTATATTG GCTTAA 

AAGCCAGAAAGGTGCTCATGCATGTAATTTATACTGGAGCAACGAATGCAGTGTATGTGG 

ACGCGTGTGTGTGCACTTTGAAAGGCAGTAGTTTTAGAAGAATGAGAAGAAAATTACTTC 

CTGGGCATTCATTAATAATTCAGGTGATATCCTGTACCTGCCGGGCGGCCGCTCTA 

Sequence 331 

ANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGACTCTTATTTNCTGCAGC 
TNTTGAGAATACTAAAGATACCTGAGTNATTTTAAATAAATTTCAGGTTNCTTTACCAAA 
CTACCCTTAACCGGATTCTCCTTGAAATGACATCACCTGACA CCCA TGGCATGCTGNATG 
CCCACAGNTAGATTACTTTTTAGTGCGCCCACCCNCTTCTGCTTTTATTAGGNGAGGCAN 
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MTCMGATTCCCTTTTGNTGNATCTTGNACCTGTTCATGCCACTTTGATATTCTAAATT 

CATACATAAACCAAT 

Sequence 332 

TATACGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGTGI Mil IAGATACA 

GAGACTTGGGGAAATTGCTTTTCCTCTTGAACCACAGTTCTACCCCTGGGATGTTTTGAG 

GGTCTTTGCAAGAATCATTAATACAAAGAA I (Mill I AACATTCC AATGC ATTGCTAAA 

ATATTATTGTGGAAATGAATATTTTGTAACTATTACACCAAATAAATATA I 1 1 1 I ACCTG 

CC CGGGCGGC CG C CCGG GCAG GTACATAATACATGATTGAATACATG ATCGTATTTAAC A 

TG I 1 I I I I I I i I CTGCAGTGGACAAATAAACATCCTCAAAGTAGCAACTGCAAATCAGTT 

ACCCT TAGAA AAGCAAGACCAAACACTGTAGTTACACTATTAAGCAGTGACCAAAAAGGG 

CTAATATTTTCTMGMTAGmAMTTACAGACATTTGGTATATTTACCTTATGTGA^ 

TACATCACTATTTAATTACATTAAATTTTAACATCTGTTGNGTGGAGGTTGTATAGGTTC 

ATGCCAAAGCCTGGGGGTTG 

Sequence 333 

GCGAATTGGAGCTCCACCGCGGGTGGCGGCCGAGGTACTCTTTAAAAAAAAAAAAGTTTT 
AGCATGTTACATATTCCTAATMGTGAAATTTTATGTCTGNNCTAAMTTTACTTCACTT 
CCTCGNGTAAGGAATGCCTTGCTTAAACTCAATATATAAAAATCAGAATTTTAAAAAAAT 
TAAAAACCAATTAGAAAATGGAAAGTTTATT^ 

TTCTAAAATTTTTTMGTTATTCTTTTACN 1 1 1 I 1 CACACAATTTTTATGGTTACCCTGG 

GCTTTGCCCAATGGGTGGGGGGCCCTTTTGGAMGGTTTCCATTATTCNCCCCTGGACCA 

AGGNAAAAGGGGGAAGGMTCCTTGCTTCTGGAAGGTGGTTTGCCCTTAAAAGGACCATT 

CTACCAAAAACCAAAAACCCCTGGGGNGGTNTTTNAATTCCCAGGTGTGGAAAAAAAAAA 

ATGGAATAAATTTCTNGGTGAAAGGCAATCNTNGGGGTCTTTNrn I I 1 1 I ACCTAAAGC 

CAAGGCCTCTTAGGNGGTNTTGGAGGCCCTAATCCGTTTNGGGGAAACCTGGGNGGGAAA 

AAATTMTTTTCTTCTTAAAGTMGCCTTCTTTTAAAAMTNGGGGANTCCCCTTGNCTA 

ACAGGAGGANGMCTTAMNGCTMTNGGGCTTCTTTTCCTTCAGGGGGGCCTTTAGGCT 

CTAAAAAAATGGGAAAGTTTCTTTTNTTNGGACTGG 

Sequence 334 

GGCCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGCTTGCCCCGGAC 

CACANGCCTCGTAACGGTAACCCCTGCTTTCCAGGGGCCTGGCACCCCCAI I I I I AAGNG 

AAGCTGTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGC 

TATTGTTTCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACTCAGCAGGAAAAGAA 

AAAGCAGTTCCTGGTTGTTATTTCACCTCGGCCGCCCGGGCAGGTACTTGTATCTGGGCC 

AGCTTGATAGGGAGAGAAATCCTAAGGAATGTAGAGATGATGACCCTACCAAATTTGGAA 

CTAGACTCCAATGAGTTTAGTTAAMCAAACACTATGGTGAAGAACTTTCTTCATTAGTA 

TCTTGTCACTTGGGGGAGTGAGGGGGGACAGAATTTGCTCTTAAAAACCCAGGGG7TGCT 

TTCATGTGTTATTACC N ATGGTCTTTGTAGGCCTTCCATTTNG G ATTG AGTGGGTN AAA 

Sequence 335 

ATAGGGCGAATTGGAGCTCCiCCGCGGTGGCGGCCGCCCGGGCAAGGTACGGGGGCGCAGG 

TCCAGGTGGCGGGGGATATGCTGCCCAACTCCACCGAGCGGGCCATCACNATCGCTGGCG 

TGCCGCAGTCTGTCACCGAGTGTGTCAAGCAGATTTGCCTGGTCATGCTGGAGACGCTCT 

CCCAGTCTCCGCAAGGGAGAGTCATGACCATTCCGTACGGCATGATGAGTTCTGAGCTGC 

GGAGGAACCCTCATTTCCTCAAAAGTMTTTATTTTTACAGCTTCTGGTTTCACATGAAA 

TTGTTTGCGCTACTGAGACTGTTACTACAAACTTT7TAAGACATGAAAAGGCGTAATGAA 

AACCATCCCGTCCCCATTCCTCCTCCTCTCTGAGGGACTGGAGGGAAGCCGTGCTTCTGA 

GGAACAACTCTAATTAAGTACCTCGGCCGAGGTACCGGATTCTCTTCTTTAACCCTCCCC 

TTTCGTGGTTTCCCCCAATGGTTAAAATGTTTGGATGGT 

Sequence 336 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGCTTGCCCC 

GGACCACAGCCTCGTMCGGTAACCCCTGCTTTCCAGGGGCCTGGCACNGNCATTTTTAA 

GTG AAG CTGTG AAGCTC CTTTCTATTACTC ATC ATGCN ATAAATAACTATATCCGG N ATT 

AGCTATTG NTTCC AATAGTTATCCCAG TCTTAAAGGTAGGTTAGGTAC TC AGC AGG AAAA 

GAAAAAGCAGTTCCTGGTTGTTATTTCACCTCGGCCGCCCGGGCAGGTACTTGTATCTGG 

GCCAGCTTGATAGGGAGAGAAATCCTAAGGAATGTAGAGATGATGACCCTACCAAATTTG 

GAACTAGACTCCAATGAGTTTAGTTAAAACAAACACTATGGTGAAGAACTTTCTTCATTA 



Page 45 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



GTATCTTGTCACTTGGGGGGAATTGAGGGGGGACAGAATTTGCTCTTAAAAACCCANGGG 
TTTGCTTCCATGTNTTATTACCATNTTNTTTTGTAGGCCTCCATTTGGGATTGANNGGTT 
AAAAAA 
Sequence 337 

CCCNCCCCCCCATNANTANTTNNATANTATTTNNTNGTNGNNGGGGGTTGCACATTTTAN 

GGCGAATGGAGCTCCACCGNGGTGGCGGCCGACCACTGACTTTGTGACTTAGGCGGCTGT 

GTTGCCTATGTAGAGAACACGCTTNACCCCCACTCCCCGTACCTGCCCGGGCGGCCGGGG 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTGANNNCNANCNTATTNGNTTCCGT 

CNCCCTCGAGGGGGGGCCCGGGGCCCNCCNCTTTGTTCCCTTTAGCNNCGGTTAATTGCN 

GCGCTTGGCGTAATTCATGGTCATAGCTGTTTTCCTGTGTGAAMTTGGTTATCnnnnnn 

nnnnnnnnnnnnnnnnnnnCGAGCCAGGGAGCATAATAGTGTAAAAGCNCTGGNnnnnnn 

nnnnnnnnnnnnnnnnnnnnnATTAAATTTGCNGTNGCGCTTCACNTGGNNCGCCTTTTT 

CAAGTCGGGG AAAACCTTNTTCGTGCCAGCTGCCATTAATTGAATTTCGGCCAACCTCGC 

CNGGGGAAGAAGGCCGGTTTTTGNCGGTATTGGGGCCGCCTTNTTTCCCG 

Sequence 338 

AATTGGAGCTCCCCGCGGTGGCGGGCGNGGTACTTTGGTCCAGATAACACTGGTGATATC 

ATGACCCTGAAATTCTTGACTGGACTTCAGAATTTCATAAGNGGCATGGNTACTGTTCAC 

TTGGTCACTGCAGATGGGAGTTTTGATTGCCAAGGAAACCCANGCTGAACAAGAAGCTTT 

AGT TTCT TCTTTGCATTACTGTGAAGTTGNCACTGCTCTGACCACTCTTGGAAACGGAGG 

CTCTTTTGTTCTAAAGATGTTTACTATGTTTGAACATTGTTCCATAAACTTGATGTACCT 

GCCCG 

Sequence 339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCTTCATCCA 

CCAACCTCCAGAGAATMCCTGGCTGCCCTCACTTCTCCAGCTTTTCTCACAAGGCACTT 

GTTACAGGGCTGCTGATGGGAGAGGGAGGAAGGCCTGGGAGGGGGATCTGGTGGTGGCGC 

CCAATCTACAGGGTCCGGAATCAGGGTGGAAGTTGTCAGGGACTAGCGGCTGCCATAGTA 

CCTCGGCCGCCCGGGCAGGACGTGTAGTGGGCCGGAATGGAGAATCCAGTGAACTGGACC 

TACAAGGCATCCGAATCGACTCAGATATTAGCGGCACCCTCAAGTTTGCGTGTGAGAGCA 

TTGTGGAGGAATACGAGGATGAACTCATTGAATTCTTTTCCCGAGAGGCTGACAATGTTA 

AAGACAAACTTTGCAGTAAGCGAACAGATCTTTGTGACCATGCCCTGCACATATCGCATG 

ATGAGCTATGAACCACTGGAGCAGCCCACACTGGCTTGATGGATCACCCCCAGGANGGGA 

AAATGGTGGCAATGCCCTTTATATATTATG I I I I I I ACTGGAATTAACTGGAAAA 

Sequence 340 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGACCGCTCCGAGATCTGTATGAG 

TTGGAGGCAGGCCAGTGTGAAGGTGTGGGAGGAATCCGCCCACACNCAGCTTCATACAGC 

ACCCTGAGGACAATGTGGCTTCCCTGATTCACACCTACTGAGCCAAGGCCCCCTCTGAAG 

TTAGGTCAAGAGGGCCCACCCTAGCCGGGCCAACTCATTCCTTTGAGGACCTGNTACTCA 

ATGMCTGCTTACTACCAAAAAAATGCTNAGGATTCAGCAAAAAGTGCAATTTCCAT7TG 

AGAAGTAAGATTCTNCCCAAMCATGACACCACCTTTTCATCCTTTCAGTACCTNGGCCG 

AGGTGGACTAACTGCAACGGAGAGACTCAAGATGATTCCCI I I I 1 ACCCATGTTTTCTCT 

ACTATTGCTGCTTATTGGTAACCCTATAAACGCCAACA 

Sequence 341 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCACTCAGAT 
ACTA AATAGTAGTTTATTCTTTMTATMGTTACATTCTGCTCCTCAACCAAATGCAATT 
TTTTGTGTGTGTTTGAAAGCTAATTTGAGAAAATTTCATAGGTTACATTTCCTGCAGCCT 
ATCTTTATCCACAGAAAGTGTTTTC I I I M I ! AAATCAAGACTTTTAAAACTGGATTTCC 
TCCCATCACTGI till IGAAGGTCCTCCAAGTCCGTGTTAAGGTAAATATCTGTTTTCTT 
CCTGATGTCACAGCCTGAGCATACTCTGTGCATTAGGAAGACCTGAGTGCATTTCCCACC 
ATTGTCCTTTCCACATTATGTTGTAGCTGGCTGGCTGTCAGGCGACTACAAGACTGAGGG 
TCTTGTGCCTTATAGATCTTTGTATCCCCCATGGCTGACACATAGTAGGTACCT 
Sequence 342 

NATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCATGTTACATATTCTTCT 
TTAAAATTTTGTAATAAACATTGACAGTGTTTGGTAGGCACAGGGAAACAGGATAACGTA 
GAGTCATTACAGAAGAAAAAAACTTATTGCTAACATTGCAGTATTCCTTTTATCAGAATT 
AGGTGAGTATTGATTGTAAAAGCTCTATCAACTCTTGCTCTTATTTGATGACTTTGAGAC 
Mill I ACTCTTGCTATAAAAAGAAGGCTACTTTCTTTCCCTAATATATTTCTACCAATG 
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CGAATAATTCAGGAAACAATGAGAGAAAAGTMTTCACACTTAATGTGTTGTTACTTAAG 
AGATTTGACGGATAAAAATAAAATCAAATTAAATCATTTGAAAAGGCANGGCTTAGACGC 
CCTTAAAAACACCGNTGGTGCCTACCATAATTTGAGCACTTTTTCATGAAAAAAAAAAAA 
AAC 

Sequence 343 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNNGGGAACNANAGNANAACTAAACGAT 

GATGAATGCNNAANNTNCANNAAACGGNGGANNGGNTGNANNGATGNGGNNCCCACTGTN 

NNCANCNNNATTACATGACATNAAATCAGANACNANGGAGGCCCTGCTACAAAGAAGCAC 

AAGATGTGGTGTTGCTTAAAAAGTCCTGTATGTGAGGAACTCTTTCATTTTCTTGGGGAT 

TGGCAGTGCTAGGACTTGGTMGTTGTTAGGAAACTTCGAAGGGCTTTCCGGCATAAGTG 

CTTCAAGTGAGGACAGGACCCTAGGAGCTGTCCAAGAACTGGA 

Sequence 344 

AGGTACCAGACTCTTATTTCCTGCAGCTTTTGAGAATTCTAAAGATACCTGAGTCATTTT 

AAATAMTTTCAGCTTTCTTTACCAMCTACCCTTAACCGGATTCTCCTTGAAATGACAT 

CACCTGACACCCATGGCATGCTGCATGCCCACAGCTAGATTACTTTTTAGTGCGCCCACG 

CACTTCTGCTTTTATTTAGGTGGAGGGCAGNMTCANAGATTCCCTCTTTGTTGGGATCN 

TTGMCCCTNGATTCATGCCACTTTTTGGATAATTTCTAAAAATNCCATTACCATAAAAC 

CCAAATTG AAAATTATATTG NTGTTTANG AAAAAAATTTG G CTTATNTT 

Sequence 345 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCCGGCCTGCANAAG 

CCCTCGTGGCACCCGCCCCACTGNGTGCTGGGCCCTGTCTGGGGCACGCTCTACTCAGCC 

ATGGGGTACTATATNTTTAGCTTGCAATGACATGACTCCAGAGCAAATGGCTACAAATGT 

GAACTGTTCC AGCCCTG AGCG AC AC AC AAG AAGTTATG ATTACATG G AAGG AGGG G ATAT 

AAGAGTGAGAAGACTCTTCTGTCGAACACAGTGGTACCTGCCCGGGCGGTCGNCCGGGCA 

GGTCTCCAGAGCCTTCTCTCTCCTGNGCAAAATGGNAACTTTTAAGGAA 

Sequence 346 

ATGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCAACTTCCATTCCTCGCCC 

TGCCCCCGGAGCCGAGTCCTGTATCAGCCCTTTATCCTCACACGCTTTTCTACAATGGCA 

TTCMTAMGTGCACGTGTTTCTGGAGAAAAAAAAAAAAAAAAAAAAAAAAAA 

CCCGGGCGGCCGCCCGGGCAGGTACAAGTATACATAAAAATGGCATAAATGGCATAATTG 

AA CCAAT TACTGGATT CAAC TATAT TAAG ACTATTTCCTTAAATCCTACTTCAGACTAAA 

TTATTTTACCTACATTCTTTTCCATATTTTGGAACTC 

CACATTAAAATAATTTAAAATTACATGTATCCCTTCTCAATAAGTTTAATCAGCTAACCC 

TAAGCTAGAGGTCAAAATCTACTTCCTCTAATATCAAAACGAAAATT 

Sequence 347 

GGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACCGTTTCCTCAGCGGCG 

GACTGCTGCAGTAAGAATGTCTTTTCCACCTCATTTGAATCGCCCTCCCATGGGAATCCC 

AGCACTCCCACCAGGGATCCCACCCCCGCAGTTTCCAGGATTTCCTCCACCTGTACCTCG 

GCCGCCCGGGCAGGTACATTTMTCAGTAAATCAGTTTCACATCATGTATTGTGATGTTT 

CAATGTGAGACACAAAAA CAATGG CTTGAMCTTGTGTATCATATGTGATTTTGAAATGA 

ACACCTTGAATA GCAC T AATT TTTATTTGTGGNATTTTTCTATMCAAAACAAGTAGCTC 

TAGGAAAAGAGGTTTTATTTTGTAAACGATCATTTGTGACCTCAGACACTCTCTGGCTAA 

T 

Sequence 348 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACMCCAGCTCCTCTTTTGCGT 

CCACACGGATCTGAGAAAGTGCACTGTAGGCATAAGCAGCTATCCTGGAGACCTGCTGCA 

AATCAGAGAACGGGATGGGCTCACTGGCCAGCACTTGGTGGGGGCTGGCTGGTAAGAGAC 

GGCAGCGGTGGCAGCTTCTTCCAATGGGTCAGGCTGCTGCTCAGCACAGCCAAGCGGGTG 

CTGTACTCATG ACGGA AGTTGCCGCTCGCCCACTTGTGCAGCAGCGTACATCATTTAAAT 

AACATAAATGACTTTTACACAGCTTGACCTAGGAAAAAATAAAATCCATCATAGCCACAG 

CTAAAAAGCATGTTAAGATTCACAATAAGMTTTGTrTCTCTTATTATAAAGAGAAGAGC 

AATC ATATAACCTCCTG G GGGTGGGGG G AG ACCTC ATAAATATTTTATATTG ATTG AC AA 

AACAGCATGCT 

Sequence 349 

GGGCNAATTGGAGCT CCCC GCGGTGGCGGCCGAGGTACTGATTTAACACTCTGTATTACT 
GAACTTCTTTGAACTATTTTGMCTTTGAATCTCTTCTCTGAATAACTGCCGAAACTCAA 
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AGATAATAATTAGTTGATGAAGGTTATCAATTAATAAAATAACACAGACCAGTCTTACCA 

AACTCTAAATACATTTTAAAAAATAGAAACTGGCAATGATAAAAAGAATGTTGAGtACCT 

GCCCGGGCGGCCGCCCGGGCAGGTACTGAAGATTCTTCTTTCGGTGGAAGACAATGGATT 

TCGCCTTCACTTTCCTGTCCTTAATATCCACTTTGTTGCCACACAACACAATGGGGATGT 

TTTCACACACTCGTACCCATAGC 

Sequence 350 

CCGGGCAGGTACAACTT AAATA GCATNCTAGGGTAAAGAGTAACATATTCCCCAAGAAAC 

AGAACTAAAATATTTCCTATTTTATGAGAAGAGTGAGTNAGAACAAACAGGGATACCTNN 

CTCACCAGCNNCACTTAATAAAACATTTNATTTCCATACNAAATCCACAAGCCTTTTCGG 

TCAGACTTTAAANGMTGTATNATTCGAAAAMCAATNAGTTTATCTTCAATTTNAAAAA 

ATNTCANTTGGAAAGACNTAANTAAACCNAGTGGAATTCTTTTGGCTTNAACAAATTATT 

GGTTACAAATTTNCCCAAAAAGGGGTTTNAAATTAAACCTTTANTNAAGNGTTATAAAA 

TANACNCNGTTAAAAATTAANTTGGGNAAAATAACNTATTNTTTTAGNTTCCCTTGCGGG 

GCCCGGTTTCTNAANGAAACTTAAGNTTGGGAATCNCCCCNCCGGGGNCTNGCTAAGGGG 

AAATTTNCAATTATTTAAAGCCTTTAATTCGANTTCCCCGNCNCGAACCCTTNGAGGGGG 

GGGGGGGCC 

Sequence 351 

TTAGGGCGAATTGGAGCTCCCCGCGGT(3GCGGCCGAGGTACATGCAGGGAATATTACAAT 

TGCAATCATTATTTTAACACCATCTCCTCCCCCTCTTCCCTTCCCAAAAGATATAAAGAC 

ATAAACATCATCTGCTGCCAAATAAATCCTGAAATTCGAGGGGCGATGCTGGGAAGGTAT 

GAAATTAGAGGGAGGCATGTTGCTTTTGCACTTTCAGATCAATGGCTGAATTCCAGACCC 

GGGGAAGCCTTTCAGATCCGCATGTAATTTGTTTAAACAATTGCTGTTCATCTGCCACAG 

CAGCACTTGTAATGCCGCAGGGAGGAACTTGCAGCATTTCTTGCTCACTTCAAGAAATCA 

ACTATAACAAACCTNTTNCAGAAGACTACCGACATNCATATTTCTGCAATAGTTAGATAA 

AAGTTAAAGAACTGGTCCTTTTTGTGAATATTCACAGTCCACTGCATATAAATCGGT7TA 

TGA 

Sequence 352 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCACGCTAAGTGATGCTCTCTTG 

TAATTGAAAGAGAATGGGTTTCCCTTCAAACCATATGCTATTACTAGTCTCTGAGTGCCT 

TCAAAGCACATACTCTTCCTTTAAAATCCAAGGATCCTTGCCTTCAAAAGACACAAATGC 

AGTAGTTTTCATTTCTTACATTCACTTAAAAATGGATTATGATTCCAGCAGGCAAATCTG 

CATATTAAATATAGACAACCCAACCACAGTATCTGAACGTTAAAAGAAATCTATCACGAC 

AGACATTTTACCTAGTTATAAATAACGTAACAATAGATCACACTCAGATTTCTTAAACAG 

CTCACTCAGGAGTTTCCTCTTTTAAAAATTAAATAACTTATGGTACAACAGGCAGCAAGC 

CATGTATTTACCATGGCGAGGGTCACATTTCACCATCTGGTTGGCTGGCTCAAA 

Sequence 353 

CCGCGGTGGCGGCCGAGGTTTTGC I I I I I I I GAAAAAGGTAAGTTGCTGATTAAGTCTAA 

TTGGAATTGATAATTCCATAGTCTTAGATTAAAATGAGGATATTTTCTCCTAGAT7TTCT 

CATGTTATGCCATGCATTTATATATCTAACCATTAATTTCACACTAAGGATGCTTCACCA 

TATAATAAAAGGAGCAAGATGGMGCACTTTGAATTTTCTTTCATTGAGMTAACTGTTT 

TATGTAAGAATCTGTATTTATAACACCAGATATTAAGATAGGCTTCCATTTTTTAATGCA 

AGCCACTTACTTAATCTTGTATTCTTTTTCAGGACTCAAATAACTAGCTTTGAACATAAT 

ATTAAAACACTACTTATAGAATAGATTTATTAATGTTAATACCTAGTGAATATCCATGTG 

GCATCCTGGTTATGTTATCGGTTCAGCGTTAATCCTATAGAAAAGTGGTTTGGAGG 

Sequence 354 

AGGTACTTGTTAGTTGTTGAGAAACATTTTAGGCAGTAAATAAAATAGTAAATATTATGT 

GTCCTATAATTTGACCTAAGCTGTGTGCCTTTGGAATCAGCTCTAAGCCAGGGCCCCCCG 

GGAGTCATGTTAAGTAACACGATCATAAAGTGACATACATCAAGGTTATCTTATATAGTC 

ATAACAGTATTATTGGGTCTTTCAAAAAC AG CTG GC AG ATTG AG CTTTTAATGGCTATAT 

TGATACAGTAGATAGAGCACCTCTAAGGAGTCCTGCCCGGGCGGC 

Sequence 355 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGAGACTATGATTTATATTGAT 
TGCACTTGCCTGCCATGATTTAGATAAGA I I I I I I I GCATGGTTTTTATTCTTTCCTAAC 
GGATCCTGTTTTATAATACTTCCAAGCCTGTCCATGGATATATCAAATGTCTTCACTTGT 
ATATTTTCATGGCTAGGTATTTCTAATGTTTATTCTTCCCTGTGTACTGCAGGGGATGCT 
CTGCCGGTCCAGGGACAACAGCCTTTCTCTTGCTACTGCTCTGATTCTTGTCTCGTTCCG 
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TTTTTTCCTTCTCACCATCTTTCTC 

TGTCATCATCTTTCAAAGCGTCGAACTTGTTGTTCATCCTCTCGCCGCCGCTGTCGCCCC 

CGCGTACCTGCCCGGGCGGCCG 

Sequence 356 

GCGAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCATGTTACATATTCTTCTTTAAA 

ATTTTGTAATAAACATTGACAGTGTTTGGTAGGCACAGGGAAACAGGATAACGTAGAGTC 

ATTACAGAAGAAAAAMCTTATTGCTAACATTGCAGTATTCCTTTTATCAGAATTAGGTG 

AGTATTGATTGTAAAAGCTCTATCMCTCTTGCTCTTATTTGATGACTTTGAGACTTTTT 

TACTCTTGCTATAAAMGMGGCTACTTTCTTTCCCTAATATATTTCTACCAATGCGAAT 

AATTCAGGAAACAATGAGAGAAMGTAATTCACACTTAATGTGTTGTTACTTAAGAGATT 

TGACGGATAAAATAAAATCAAATTAAATCATTGAAAAGGCAGGCTTAGACCCCCTTAAAA 

AC 

Sequence 357 

CCGCGGTGGCGGCCGAGGTACCTCTTTGCCTTAAATTGCTTTTTAGTTCTAAGATTGTAG 

AATGATCCTTTCAAATTGTAATCTTTTCTAACAGAGATATTTTAATATACTTGCTTT 

AAAAAACAAA AAAA CTACTGTCAGTATTAATACTGAGCCAGACTGGCATCTACAGATTTC 

AGATCTATCATTTTATTGATTCTTAAGCTXGTATTAAAAACTAGGCAATATCATCATGGA 

TACATAGGAGAAGACACATTTACAATCATTCATTGGGCCTTTTATCTGTCTATCCATCCA 

TCATCATTTGAAGGCCTAATATATGCCAAGTACCTGCCCG 

Sequence 358 

CCGCGGTGGCGGCCGAGGTACTTCTTTTCCAGAGCAGGTGGCACAAAACGACCCCCCAGG 

TAATGGTAGCGACCGGTAAACTGGGTTGCAGATTTTTTGGGGGCTGTGAGGGAGATGAGC 

AAGTCTGGCTGGATCCCTCCAGCATTTCCCTTCTCCACGTCCCATCCTGAGGGAATGTCG 

ATGCTGGCAATGGGCACAGTGAGTCCCTTCAGGACACTCAGGATGCTGTGGAACGGTTCC 

CGAACATCGCCCTTGAAGCTGAAGCCAAAGATGGCATCCACCACCAGCTCATACAGTTCA 

TCAATCGTCATGGGCTCTGCGGGCATTTCCCCAAGGAAAGGGATGTCCATTTTCTGACAC 

TGGGTCACCAATGCAGTGAAGAGGGGCTTGTTAGGCCTTTTGGGGTAATAGATGGTTGGC 

TCGTAGCCAMGAGTTTGAGGTGTCGAGCACAGACCAGACCATCTTCTTCATTATTCCCC 

G 

Sequence 359 

CCGCGGTGGCGGCCGAGGTTTTCATGCATGCAGACTTTTATTTAACCTCATTGAAATTAA 
MTTAGCAATTTAAMTACAAMTATAG 

CTTTAAGATAAAAACAAATAGGGTTGGTCACCCTTATATGTCAA CCCA AAAAGTATCAAA 
AATACCTTGACTGCATGAAAGMCAGCTTCTTTAGGNCTATGCATTTTGACTTGGGCTTC 
TGCCAGATGATACCATCCACTTGGGCAGACAGGGCTTTCCTTTAACCCT 
Sequence 360 

CCGGGCAGGTACCAGCGCGTCCCATACGTGGAGCCGCCCGAATACGAGTTCTTTTGGGGC 

TCCCGGGCCAGCCGCGAAATCACCAAGATGCAAATCATGGAGTTCCTGGCCAGGGTCTTT 

AAGAAAGACCCCCAGGCCTGGCCCTCCCGATACAGAGAAGCTCTGGAGGAGGCCAGAAGC 

TCTGCGGGAGGCTAATCCCACTGCCCACTACCCTCGCAGCAAGTTGTCTTCTGAGGACTA 

GCCAAAGTrCTGGGAGGNCMGATTGAATGGGTTTTCNTGACCCCTCAACNCANGGGCTT 

GTTGGGAAAGGGGNAGGGGTGGCTGGCGTTCATTMTAGGTATTTCANGGGATnTAACA 

GGCTGGCAANTMTTTNCACNGTThn'AAACCTTTTTTAMGTATTTCCAAGGTNACCCTT 

CTTT NCCG CNTCTTANGAAACCTAAGGTGGGGAATACCCCCNCGNGGGCTTTGCTAAGGG 

GAMTTTTCCGANNAATTCAMGNCTTTNATTTTGANTACCCGGCTNCGTANCCCTTCCC 

AGGGGGGGGGGGGGGCCCCCGGGTTAACCCCAAGCi I I 1 1 INGTTT 

Sequence 361 

CCGGGCAGGTGTGTGTATTCACGCAGTTACTCGCTTCCAI I ! I I ATGACCTTTCAACTAT 

AGGTMTMCTCTTAGAGAAATTAATTTAATATTAGMTTTNTATTATGAATCATGTGAA 

AGCATGACATTCGTTCACAATAGCACTATTTTAAATAAATTATAAGCTTTAAGGTCACTT 

ATACTACCMTMCTTGTTAAATCAGGATTTGGCTTCATACACTNGAATTTTCAGNTATT 

TTATCTCAAGTAGNATATAGACACCTAACCTTTGATAGTGGATACCGTTAGGAGGGGTTC 

CTATTCTTCCCATTGTACCTCGGGCTCGCTCTAGGAACTAGGTGTNATCCNCCCGGGGCT 

GCAGGGAATTCCGATATCAAGCCTTTATCGATACCCGCCGACCTCNGAGGGGGGGGGGCC 

CGGTACCCAAGC I I I i I GTTTCCCTTTrAGTGGAGGGGTTTAANTTGCGCGCTTTNGCGT 

AATCATGGNTCATAAGCTNTTTTCCCTGGTGGTG 
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Sequence 362 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTCCTTCAGGGCATC 
ACCCACCTTGCGGAAGTCTGCCAGCTGCAGCCAGCATGTCTTCAGGCTACATGAGCCGGA 
CACCCCATGGCACTTGCAGGCCACATNAGCCAGGTTGTACAGAAATTTGNATATATGATG 
GTTCTTAGAACTTGTTTTAA I I I I 1 GTG GNCCTTCTGTTTATTATAATAGGCGTCCACCA 
ATGATTATCCA TATGTG TTCTTAAI 1 1 I IAACTGCTGGAAGTGTTAAAACACACACACAC 
ACACACACACAI I I II II II I GAGAACTCCAAAGTCCTGAAAATTTTGGTGGACAATGAT 
TTTTAAAAMCTMCTTTGTATAACCTMTATTTGNATTCTCTCATCTAT 
Sequence 363 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGCATATATTAGGCCT 
TCAAATGATGATGGATGGATAGACAGATAAAAGGCCCAATGAATGATTGTAAATGTGTCT 
TCTCCTATGTATCCATGATGATA7TG C CTAGTTTTTAATACAAG CTTAAGAATC AATAAA 
ATGATAGATCTGAAATCTGTAGATGCCAGTCTGGCTCAGTATTAATACTGACAGTAGTTT 
TTTTG I MM I AAGAAAGCAAGTATATTAAAATATCTCTGTTAGAAAAGATTACAATTTG 
AAAGGGATCATTCTACAATCTTAGAACTAAAAAGCAATTTAAGGCAAAGAGGGTACCTGC 
CCGGGCGGCCGCTCTAGAACTAGGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnCGTCNACCTCCAGGGGGGGCCCCGGNCCCMCTTTTTGGTTCCTT 
Sequence 364 

CCGCGGTGGCGGCCGCCCGGNCAGGTACAATGTTGTTATGGTAGAGAAACACACATGCCT 

T AAAATTT AAAAAG C AG G G C CC AAAG CTT ATT AGTTTAAATT AG G GT ATGTTTC AAG TTT 

GTATTAATTTGTAATAGCTCTGTTTAGAAAAAATCAAAGACCATGATTTATGAAACTAAT 

GTGACATAATlTCCAGTGACTTGTTGATGTGAAATCAGACACGGCACCrrCAGTTTTGTA 

CCTNGGCCGCCCGGGCAGGTACAAAAAGGGATCAGAAGGGCAAGGGGCATGCTAACGTCA 

TCGGGGGGCCAGTCTCCAAGTCGCAGNCNCGTGCCCCCCGGAGGCCGGTGGCTATCTGAA 

CTCTATGCATTCGC 

Sequence 365 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCAGAATCTG 

GTAGACTGCTCTGGGCCTCAAGGCAATGAAGGCTGCAATGGTGGCCTAATGGATTATGCT 

TTCCAGTATGTTCAGGATAATGGAGGCCTGGACTCTGAGGAATCTTATCCATATGAGGCA 

ACAGAAGAATCCTGTAAGTACCTCGGCCGAGGTACTGGTCTGTGGCCTGGGGGTGGGGAC 

CCCTGTTTTAAGCAGTTTAAGAATTATTC I I I I 1 CAGAATATCTACCTCTACTGGGATAA 

GATCACGTTTCTTTACTGTGAGCACACAGGTAGGTGCTCAATAAATGGCTTGTTGCAAGA 

ATGAATGACCGGCCGCTCTAGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnCGTCGACCTTCGAGGGGGGGGCCCCGGTACCCACTTTTGGTTC 

Sequence 366 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTGTCACCTGGAAGTGGTTCTTCTTCTGGCA 

CACAAAGGCGTCGTCGCCCACCGAAAAGTTGAAGCCCTTGTCCGCATCCACGCGGTAGGT 

GAGCATGGGCAGCTCCTTGTAGTTAGCATCGTACCTATAMGTAAAGCTAAAATGATTTT 

ATCTGTGAATTCAGATTTTAA^AAGTCTTCACTCTCTGAAGATGATCATTTGCCCTTAAG 

GACAAAAATG AACTGAAGTTTCACATGAGCTATTTCCATTC CAGAATATCTG GG ATTCTA 

CTTTAAGCACTACATAAACTGACTTTATCCTCAGACTAGCTGAATGArnTGTGCTGTTT 

CAGGATGTTTGCACTGAAGA'W\ACAGAAAGCTTATCTGAAATTTATAAMCTTTTTGTT 

TTGCTAC ATAG AAAAC AG AA(3 3TA 

Sequence 367 

CTATAGGGTTTATGGGANCTNCCCGCGGTGGCGGCCGAGGTTCATTATTTTCCNTGTCTN 

CCAGGANTTGTGGCGCAAG AAGAGGTAGC CTAATATAAN CC AAAAGTTTACTTAGATCTT 

TCCGTCTCTGTTCCAAATC/ \TGACGGACCCAAGTAAGAAGTG CATTCAATATTGTCTCCT. 

CATTAGGAATGTTCATGTCATCACTAGCCAAGAGCTTTGCAATTTCGCTGGCTGGTAATA 

ATACAAATTCCTGGTTTCTGATTACTTCCATGAAATGCTCCATAGTATAATTGTGAGCCA 

CTTTATGCAAATCTGTACCTGCCCGGGCGGCCGAGGTACTGTATTAGAACACTGGGTGTG 

TCATACCGTTATCTGTG r ;AGMTATATTTCCTTATTC AGAATrTCTAAAMTTTAAGTTC 

TGTAAGGGCTAATATATTCTCTTCCTATGGNTTTAGATGTTTGATGTCTTCTTAGTAATG 

GCATAATGTCATGATTTACT 

Sequence 368 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGAGTTTCCCTATTT 
TGGTGTTCAGCTTGAAAAAGGACTTGTCAGAATCAACTGTGTCATCAAAATTTAAGTAAT 
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GTGCATTGAAAATMGGTTGA TCATG GGMTATGCAGMTTTCC 
CAA^AAAATTGTAA TTTAGM TTmAATCTTAGGTTTCTTGATTAATTTATM ■ 
CAATTATTGTCAGTC I Mil I GTATGTTTTTTAAAAACATAGTCCAGAGCATGGGCAGAA 
1TGACACCTCTCTTTTMGTGAMTTTGGATTGCTCACAAAGCACTAGGAAATGTCATGG 
GGTTCAMTATATATCCTACACMCTGGGCMTACATTTTTGTTTGATTTTTAGGTCTGT 
G TATACATTAACAGTTCATGTAATTAATACCTGATCATTTG GGATAATG AAAGGTGAAAG 
TTAGTTGTAG ATGAAGTAAAGTTATAAAAG GAG ATTAAAAATG C 
Sequence 369 

G ™I^ GGCGM ^ G ^ 

TMCTTGTATTAATTGCACTTAACACAATGAACCTTTAGTTTCCMCCAGTTTrCATTCT 

CTGCAGACCCGGGCmCTTTTTATAAAAACTGCTTTCAAAGGGCATAGAGACACCACAC 

ATGGTCCACA^AAATTCAAATAGAGAGGTGCAATAGTTGCAGTGGTAAACACACAAAAA 

aataca i rrm jggactaaaaatctggtcacggataaaagcatgtgccttttcattctt 

CTCTGGGATGTTACAACAGCAACACGCTCTAAAACAATTAAGTTACATGCATAATGCCAA 
AAGAATGTGAGCAATCCTATAACCAGCTTTAAGCCATCTGCTTGATTTC 1 rTTTTTTTTl 

TGTAAGGAACATATATAGGAAMGAGMTTNCCCTTTTGTTCCATTACATATAGAAACCT 

TTTGAAGCTTTCAAANAGCTTTGGGTTT 

Sequence 370 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCGCCATCTGGG 

ACACCTTGTTTCCCANCTGGGAGGCTTCAGCCCGGGCAGCATTGGCCAGCGGCTCAATGA 

GATGGGAGATCTCTTGGACTGCAGTGAGCATCTGAGTGTGCAAGGCCTNTTGAGAGATTC 

CCTCACGGGGAGCAAGCTGNTGGCTTGACTGCAGCGAGNGAAGCCNTGNTNTAGTTNCCA 

TTNNAAAAAATNTTTAAMTCNTCCAAANGGATTTTTTAAAAM 

NNN NTTNAAAA AAAAAAAGGGGNTGTTTTTNATTTTTTTNAAAA^ 

CCCi hum I NTAAAAAAA MNNCCCCNT TCTTNNNAAAAAAAANAATTTTATTTNNNN 

TTTNTATAAAAAAAAAMTNTTTnTTTm 

TNCCCCCNNNC ANAATAAAN I I Mi M I I 1 t Mi HI I I 1 A TNATAAAAAAAAAAAAAAN 

AAAAAAT 

Sequence 371 

GATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACTCTGGCGTGCAACCTGGAAT 
TTTTCGGAGGTATTTGTAGGCAAAATACCAAGGGATTTA 

CTTTTTGAGAAGGCCGGACCCATTTGGGATCTACGTCTTATGATGGATCCACTGTCCGGT 
CAGMTANAAGGNNATGCATTANTCNCCTTTTNNGGAAAAGAAACTGCNNAAGAAAACCN 
NGAAAAAANGGG^GANAAMNTNNTAAAANTTTCCCCCTTGG 

ggggncn i rrm l NGGGGAAAAAAAAAAAAAANNTNTTTTNNGGGGGNCCCCNCCCCCC 
CNAMNTTTNANNTCNCNTNAAAM^ 

CCCCNNNGGGGGGNGGGGGGGGGGGGNNTNN1 1 I I I H I II 1 Ml 1 1 I I I NCCCCTCCNN 

NNAAAAAAAAAAAAAAAAAAAA 

Sequence 372 

CCGCGGTGGCGGCCGAGGTACCTGAGGAACATAGATTCTCTGCATCTTTCTCAAGGGGAA 
CCCTCTCCAGCTTCCCTGGTGTGACCCTTCACATGCCAGATTGGGTAGGATCACTTTGAA 
CTGCCTGAAGTTCAGGAAGGTCATCAAGCTC 
Sequence 373 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACATTTCCATGGGCCCT 

GTTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTT 

TATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGGT 

GAMGCTGTAAAACACCTTTTATGGAAGAAAAGAAATAAMTGTAGTTGTCAAGTCTAAA 

AAATAGTACCAACGGGAATCATAATG AATAC ATGCN ATGAATTTAAAATG N N AAAATGAA 

TCTA AAAAG TAAAAAGGGCTCTGNGGTGTAATTTTTCTTAACTACAANAGTCTAAATACA 

CTGCTTTTCTTTAAGAG7TCATTTTMTTAGNAACCGTCAAACAAAATTAT 

Sequence 374 

ACTACTCAGGGCGAATTNGAGCTCACCGCGGTGGCGGCCGCCCGGGCCGCGTGAAGAGGA 

AGAATGCCNNGAATTCCAGGGTGGGGCTGGGGCTGGAGACGACGAGGAGGAGGATTAAGT 

CCACCTGTCCCTCCTGGGCTGCTGGATTGTCTCGTTTTCCTGCCAAATAAACAGGATCAG 

CGCTTTACACCATGTTGTTACATGTAAACAAACTTCAATTTGAAGTGCAGCTATTATGTG 

GTATCCATGTGTATCGACCATGTGCCATATATCAATTATGGTCACTAGAAAGTCTCTTTA 
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TGATACTTTTTATTGTACCTCGGCCGCTCTAGAACTAGT 
Sequence 375 

CCGCGGTGGCGGCCGAGGACAAATGTGGTGTGTCTTCCAACTTTCATTGAAAATGCCATA 

TCTATACCATATTTTATTCGAGTCACTGATGATGTAATGATATATTTTTTCATTATTATA 

GTAGAATATTTTTATGGCMGATATTTGTGGTCTTGATCATACCTATTAAAATAATGCCA 

AACACCAAATATGAATTTTATGATGTACCTGCCCG 

Sequence 376 

ATACACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACGA 

GTTCAGCCTGACCCGTGAGACAAAGAAGCACGTGCTCTTTGGCACTGTGGGGGTGCCTGA 

GCACACATACCGCTCCCGAACCAACTTCACCAGCAAATACAACATGAAGGTCCTCTACTT 

ATCCGCCTTCACTAGCAAGGACGAGGGCACCTACACGTGTGCACTCCACCACTCTGGCCA 

TTCCCCACCCATCTCCTCCCAGAACGTCACAGTGCTCAGAGACAAACTGGTCAAGTGTGA 

GGGCATCAGCCTGCTGGCTCAGAACACCTCGTGGCTGCTGCTGCTCCTGCTC'iCCCTCTC 

CCTCCTCCAGGCCACGGATTTCATGTCCCTGTGACTGGTGGGGCCCATGGGAGGGAGACA 

GGAAGCCTTCAAGTTTCCMGTGCAAGAAGATCCTACTTNTTTTGAGTCAAGCTGACCCC 

CTCCCCCCAATCCCTCAAACCTTGAGGGAGAAGTGGG 

Sequence 377 

C N GGTTCGCTGTGTGCCTAATACATG CATGTTGAAC GGGATGTAG CAATACATTC AGTAG 

CGAATGGGTGAGTAACACGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACA 

ATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAG 

CTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAG 

ACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCCTCGGCCGCCCGGGCAGGTACAAGAG 

TGATGGCAATGTGACTGGAACAGAAATAGTTTCTACCAGGCACACAAAAGCTTCTGTAAG 

CCCCGTANTTCCGTCCTGCAAAGGGCCTTNAGTGGGAACCAGGTCTGCAGACCCCAGTGG 

GCANAAAGACCGGGTGGAAGCAGG 

Sequence 378 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAGGAGAAA 

AACTATGTCTAAGGAGGGAAGCACATAATAGAATTCTATTATTACAATAGAATTCTATAA 

ATACAAAATTGGATAAACTTTATCAGAAGCAGGTGTTTGTCACCATTTTATATGTGAAAC 

TCAGGAAGTTCTTGA I Mil I AAGAGCTGTATTCCTTAATCTGGTTACAGGCTATAAAGG 

AGATAATCATTTACATGATCATATTCTCAAACAGATGGTCACCAAATGGAATCAAAGGAC 

TGATTTGATGTAGCCGGTAGTATGATAA I i I I I GTAGGTTAAATGGAAAAATACNGGATA 

GGATCCNAGAAAAATMNGTAATTTTNCCAGGTAGGCCNGGTTTAATAAAATTACCAGGA 

CCTAGGAATAAGCCAATTTTA 

Sequence 379 

CCGCGGTGGCGGCCGAGGTCCCCC I I I II M I I I I M I M II i I I I I I I CTTGACAGCAG 

CATCTGTTTATTGACAATTCCAGGTCATTCCTAAC ACGC CG N ANCAG GGCTNTGTACCTG 

CCCGGGCGGCCGCCCGGGCAGGTACTTCATGAAGCANACCATTGGGAATTCCTGNGGCAC 

AATCGGACTTATTCACGCAGNGGCCAATAATCAANACAAACTGGGATTTGAGGATGGATC 

AGTTCTGAMCAGTTTNTTTNTGAAACAAAAAAAAATGTCCCCTGAAMCAGMCCAAA^ 

TGCTTTTGAAAAAAAANNGGGGCCTTACCAGGCAAGCCCAATNAATGCCCCGGGGCACAA 

AGNAAGGCCCAATTGTTNGGGGTAAAAANAAAAAAGGGGGGAW\TTCCCCATTTTTTNT 

TTNTTNTTTNAAANANCCCGGGGGGGGGGNCCCCCNTTTTAN WAAANTTGGGGGGGGAA 

AAAATTNN I I MUM I GNAANAAAAAANCCNCGGGCCCCNANAAAAAAAAAAAACCCCC 

CCTTTTGANGGANCCNCCCNNNNTAANANAAAAAAAAAA 

Sequence 380 

ATTGGAGCTCACCCGCGGGGNCGGCCGCCCGGGCAGGACCTCTTTGCCTTAAATTGCTTT 

TTMGTTCTAAGATTGTAGAATGATCCTTTCAAATTGTAATCTrTTCTAACAGAGATATT 

TTAATATACTTGCTTTCTTAAAAAACAAAAAAACTACTGTCAGTATTAATACTGAGCCAG 

ACTGGGCATCTACAGATTTCAGATCTATCATTTTATTGATTCTTAAGCTTGTATTAAAAA 

CTAGGCAATATCATCATGGATACATAGGAGAAGACACATTTACAATCATTCATTGGGCCT 

TTTATCTGTCTATCCATCCATCATCATTTGAAGGCCTAATATATGCCCAAGGACCTCGGC 

CGCTCTAGAACTAGGNGGGATCCCCCGGGCTGCAN 

Sequence 381 

TAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGNNCTAAGCCCCAGGACCNATTGG 
TAGACGACNTANNANCNNAGGCGCATNNCACTGAAACANGTCAGNGTATATGNTGGCACG 
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TATTAAANTTAAG ATG AAN G N NG AAGC AAAAAG ATTTAC AAG AATTAGC NGTAAC AAN AT 

TGATGCTNAAGAGACATNATTGTACCTGCNCN 

Sequence 382 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCGGCAT 

TGTCTTAGGGGTTAATMGGTTAACCTATGGTTACAAAACAAAGGGTAACTTGAAGTATT 

GAGT>^TGCTTCTGTMTATGGGGGAG 

AAMTTTTGACTTAAATTTMCACTC^^ 

^GGTTTAGAAGCAMCCAGCAAAGGAAAAAAGCAMCCCTAGCAAACCTTTCACAM 

GTTAMGAGACACTGCTCCATTTTAAGCAGTGCGGGTCATTTTGAGmAATGAATCCCC 

ATCAAATGGGTGGTATAGTAGATAGCTATGGTGTGGTTTGTAAGGGTTTGTAGCTTGCAA 

GGACGTGCACTAATAATATTTTATATGATCTTTCCTGGTTGGCAGGACCCATGGATGAA 
Sequence 383 

CCGCGGTGGCGGCCGAGGTACACGGATATCACCTGAATTATTAATGAATGCCCAGGAAGT 

AATTTTCTTCTCATTCTTCTAAAACTACTGCCTTTCAAAGTGCACACACACGCGTCCACA 

TACACTGCATTCGTTGCTCCAGTATAAATTACATGCATGAGCACCTTTCTGGCTTTTAAG 

CC AATATAATGG G CTG C AAAATGAAGACACC AGAGTGTATG C ATAC AAATCTC ACTGTAT 

TAMGATGCAGGTTTTCTAATGTACCTGCCCGGGGCGGCCGCCCGGGCAGGTACCCCACC 

TCCCTAGCCCTCTGTGCTTCAAGTCTTCCTTTGACCCCCTTCGCTCTCCCACAGCAGGGC 

TCTCAGCTGCCAGCCAACCAAGCAGGATGGTGCAACAACCAGACCTAAAGGAAGGGACCT 
GGGGAAGAAGAGAAA 

Sequence 384 

CCGCGGTGGCGGCCGAGGTACACATGCATGCATACACACCCATACAAACATCTGTGTGAG 

GGCAGTTCTGGAGATGAGCAGAGAGAGACCGGMTAMCTCMTCrrTTCTTTCCCAAGC 

TCCTAGCCAACACTATCCTTGGGAGAAAGAAAATTTGCAGAAACTGCTAAGACCAAGTGT 

GGAGATGTCAAGCTAGTTCACACTCTGAGGCTCAGAATATGTAGGACATGCACAATTGTG 

CAGTCCTTTGGGATTGGGAAGTGAAACAGTCTGTGATCCCCTACCTTCTAGGGAACTAGG 

ACCTAGG AAGAG GTAAAGATTATCAGGTATGCAAAGCGCCCCAATTCTTCTGCTGCCATG 

GGGGGGATTTTACCCCAACTCCAGGGGTTCGAGGCCAATCTGAGAATGGCTTAA 
Sequence 385 

AGGTACCAGACTCTTATTTCCTGCAGCTTTTGAGMTTCTAMGATACCTGAGTCATTTT 

AAATAAATTTCAGCTTTCTTTACCAAACTACCCTTMCCGGATTCTCCTTGAAATGACAT 

CACCTGACACCCATGGCATGCTGCATGCCCACAGCTAGATTACTTTTTAGTGCGCCCACG 

CACTTCTGCTTTTATTAGGTGAGGCAGAATCMGATTCCCTTTTGTTGGATCTTGAACCT 

GTTCATGCCACTTTGATATTCTAAATTCATACATAAACCAATGAAATATATGTGTTAGAA 

AAATTGCTATTTCTAGCTGGACGCAGTGGCTCATGACTATAATCCTACCA 

Sequence 386 

CCGGGCAGGTACACTGCGTTCCAGGCTGGTGTATGGGGTTGATCCTTATCGGTCACAGTT 

AGATTAGCTACTATGATGTCTACCCTGTTCTCAGGMCTTCACCATAAAACGTCATGGCA 

GTAAACTCTG GAGG ATTGTCATTGACATCTGTCACTGTGATGACGGCCGCCCGGGCAGGT 

ACATGGTTGATTTTG TGCTGTTGTT TGGACTGTAACATCCATGTTGTCAATACGTATACC 

TTGTAAGTGGATAACTTTTCTTTTTCCCAGGCCAGAGAATTCAMTTGTTAAAACATTGG 

CATrrGMGAGGAGAACAAAATGTAGCATGATGTATTTAAAGTAAGGCCTTTAGTAA 
Sequence 387 

CCGGGCAGGTACAGGAGTTTCCCTATTTTGGTGTTCAGCTTGAAAAAGGACTTGTCAGAA 

TCAACTGTGTCATCAAAA TTTAA GTAATGTGCATTGAAAATAAGGTTGATCATGGGAATA 

TGCAGAATTTCCAATGTATTTTTAAATACAAATAAMTTGTAATTTAGAATTT^ 

TAGGTTTCTTGATTAATTTATMGAGATCAATTATTGTCAGTCTTTTTTGTATGTTT^ 

AAAAACATAGTCCAGAGCATGGGCAGAATTGACACCTCTCT7TTAAGTGAAATTTGGATT 

GCTCACAAAGCACTAGGAAATGTCATGGGGTTCAAATATATATCCTACA 

Sequence 388 

A GGTACCC AGTATTTCTAATTAA^ 

AAlmi lAAA GTCAG TTGATTAAATGCAATAAATATTGGGTTGCTCAAATATGCCACAA 
ATAACTCGAAA 1 1 1 i I CATTTACTTTCAACAGCATAAGATTCTTTAATATTTAGGATTGA 
CTGNTTCTTTCCAGTTAAGCACTGAAGGATTATGTCTTGTAGCTTCCCCAAGAGAAGGGG 
AAGGAAAAAAAAGCACTATGTTAAGGATAATACAGAACTCTTTTGAACTATTTTGGGTAG 
TACCCTGCCCGGGCGGCCGNCCGGNCAGGTACCAGTGGAGG 
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Sequence 389 

CCGGGCAGGTACTCTTGGATGTGTTTTCTCACCAAGATGAGCAAAGAAAGGTTTGCACAG 

AGGAGTGTGAATGTGTGTTTGTTGCTGGCTGAATGGCAATAGATGTCTAAGGTGGATTCA 

GTGTCTGGCACACTGAGACACCTCCAAGAAGGAGATTGATGCATCANGTTCAGTTTAACC 

TGGAATA TCTGA CTACCCCTGAATCCACCCAGAAAGGGGGCCCAACACCCTJGTCCATTT 

ATGGGTA I MUM 1CGAAGTTATTAAGCATATTCCTTTTCCACGAACCTCTTCTGTACC 

T 

Sequence 390 

CCGCGGTGGCGGCCGAGGTGTTTGAGTTTGGTTTGGAGCAAAACTGAGGTAGTCCTAACA 

TTTCTG GG AC TGAATCCAGGCAAG AG AAAGAAGAAAAAGAAGAAGAAAAAGAGG AG GAAA 

AAGGTAGGGAGAAATAAAGGGAGGAGAGAAGCACAGTGAAAAAAAAAAAMGTCCCTTTT 

CGACATCACATTCCTTGTGNTTTCCCTCAGCCTGGAAAACATATTTAATCCCCAGTGCTT 

TTTACCGCCCCGGAAACAAAGAGACTAAGCCAGACTTATGGGGGAAAGGGNAGATAAGAA 

GGATCCTGGAACTTTNAAG 

Sequence 391 

TGCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTTTATCACCTGAACAGTCTT 
CCGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGGTGAAAGCTGTAAAACACCT 
TTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAGTCTAAAAAATAGTAGCAACGGGAA 
TCATMTGAATACATGCMTGAATTTAAAATGTAAAAATGAATTTAAAMGTAAAAAGG^ 
CTCTGTGGTGTAA I I I I I CTTAACTACAAGAGTCTAAATACACTGCTTTTCTT 
Sequence 392 

CCGGGCAGGTACCGCATCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTG 

GTGACCCTCAACCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCA 

GAGTATCTGCAGGTGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCAT 

GCTACGTCACCAAGCAGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTT 

ATGATCTATCTGAGCTCGGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTC 

AGCGTGCCTGTGGTTAGGAATGCAGCACTGTTTTGGTGGAACCTGCACAG 

Sequence 393 

AGGACAAATTCAGTCCCAATACTCAATACGTATTATAGATGACTATGAGTGCAAACCTTA 

GGATGTGATTTTCTGAATAATTGTTCTTTGTAGGATrTGGTTACATTATTTAAAATGAAA 

AAGATCTAGTTrTAGTGTGAGCTCAGTAATGTTAATTGGTTMGTTCATTGTGAATCTTG 

AGTTTTAGATAAGNAGTTA I Mill I CAATATCACTTCTG I I l ! I AGTG ATATTATATCA 

AGAAACAACGTATTCAAGAGCCATGGCTGACAGTGCCAGATATACTTAGGGATAAACATC 

AAAATGCAATTATAGTTGCTATAACGTTAGATACTCGGAATCAAAATTT 

Sequence 394 

ANACCCACTCACTNNNAGGGCGAATTGGNAGCTCCACCGCGGTGGCCGGCCGAGGTACAG 
TGATAGGTATCTTTCTTTGGAGl mTTTTl I GTGCATATGTGTATAGTTTTATGGGTTC 
TGAGTTGGTGACCAGTAAGTTGCATGTAGTGCTGGCACTTACTTAATAACTATTCATGAT 
ATTGTTAATMCTTGTTATAGGATTGTATTCCCAATTACAGTCTCTAAGATTGTAATTGA 
TATTATCTGAGAGGTAGTGTGACAACTTTCTTTTGTTGTTACATTAAGCCGAAAACATAA 
TACTAATAGACAACTAACAGTTTGCTTATCAGGCACATCAACTAAGGCACCTCCCCCCAT 
G CTAAGTTTCTC CTGG ATATATGG AAG 
Sequence 395 

AGGTACATAATAAACMGTTTCAACCAGCAAGAAATTACTAATATTGACTGTGGAGTTTT 

GGCTGTTTTMTAGTTCTAACTCATTATTCCGTAATTCAACACAGCACTACCAACACAGC 

TGGCAATGACAAGACTGGGAGTATCAAACTAGGATTATTAGGCACAATCCAGGTGGCCTC 

TGCAGCTGTGTCTCTCTTTCCTCTTCTGTTCCTATAAGGGCAGGGCCTCCTTCAGGAACA 

GCCACCAGTGAGCTTCCTCCTCCCTTCTGGTCAGTTGGATTTGTCACTGTTCAGCATCTT 

TTCGATGATTTACCTGCCCG 

Sequence 396 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTCCATGGGCCCTGT 

TCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTTTA 

TCACCTGAATAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGGTGA 

AAGCTGTAAAACACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAGTCTAAAAA 

ATAGTAGCAACGGGAATCATAATGAATACATGCAATGAATTTAAAATGTAAAAATGAATT 

TAAAAAGTAAAAAGGG CTCTGTGGTGTAA I I ! I I CTTAACTACAAGAGTCTAAATACACT 
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G CTTTTCTTTAAG AGTTC ATTTTAA 
Sequence 397 

TTGTCTGAGCTACTGGMTGMGTTCACAGGTCTTGMGACCMTATTATTTGTCAATAT 

GTGGGGATAACCTGTAGCTGCATTCATGAGGTAGCAAATAGCAGTTTTGGCCTGTGGGGT 

GAACAGCTAAACAGTGTTCTGGCATCAGATAACACGGCACAAAGGGATTGTGTCTAGCTC 

TTTTGGAGAAAATTTATGTCCAGTCTCAGGATCTGTGGACAATAACGGAAAACTTTGAAT 

A7TCCTG ACMGTTGCC ATATTAGGAGTCAGTTCTTTACTTCTCTGAGATTCCCAGGTCA 

TCCATGAT n I I I I I I I I AATTTGAGACAAAGCCTCACT 

Sequence 398 

NACACCCACCGCGGNGGCGGCCGCCCGGGCAGGTATGGNTCCGNGGATGCAAAACCCCTG 
GGCCGCAAGAAGGGAGCCAAGCTGACTCCTGANGAAGAAGAGANNNGAAACAAAANCCGA 
NCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGCNNGNACCCCGGCCGCCCGGGCAG 
GNACCNGACAGNAGTTTAAGANGAGAGAGGNNAGGGNCCACNCGGNNNAAGCAAGAGANA 

GGNCGGGGGGCNNNNGGNNNAAACNANGGCCAGGAGANNGGGCNAAAGAGAAGACGACGA 

GAGNAAGGAAANAAAG 

Sequence 399 

CTCACCGNGGTGGCGGCCGCCCGGGCAGGTACACGGACCGCACGGAGAAGCTGAGGCCTG 

AGATGGAGGGGCCCGGCAGCTTCACCATCTTCGCCCCTAGCAACGAGGCCTGGGCCTCCT 

TGCCAGCTGTGAGATGACCTCCGTCTGCCCGGGGGACTCTTATGGGGAACTGCCTTACTT 

CCCCGAGGGGTGGGCATGATGAATGGGAGCCTGCAGTCATTTCCTACTGTTTCAGGAAGC 

TTTCTCCTTAACCCCTTAGAAMGGCTGTGGMCTTGAGCTAAAATATGTCTTACCAGGT 

TGCGTCTAATGCCCCCCGTTCCCTACTGGGCAGAAAGACTTGGGTGCTTCCTGAGGAGGG 
ATCCTTGGCA 

Sequence 400 

AGGTACTGCTTAATTCTTTTTTCTCCCTCCCTCT 

TCCTAATAGCAAGTTAGGACATGTCTGTTGGCTCTCGAGATTCATGGGACAGCAATGCAG 

CAAATCTAGCCATAGTATTTGCTCTCTCTAGCCCTGCCCTTTTTCCTGTCCAGTGAATAT 

CAAAACAGGTAGAAAACATGGCCTGAAGGATTGTCTCTGCCACCACCTCCATATGCATTT 

TACCAGTAGTCCTGTCATACAGGTTGAATTAGTTTTATGTAGAACAAGTCATGAACACTT 

T AGTGTG G AAAAAT AGTATTATATAAAGCTTAATATTAA 

Sequence 401 

AGGTAC ATTAT CAGANACAGTGGTTGACCTCTTTTTTCCTTTACTCCCTTTTCATCTGAG 

AGAGCCTTTTAGAGATCCGGMTCATTTGCTGTCTGCAATTACTATAGGCTTTGGCTCAC 

MTTCTGGGGAAMTGCCAATTGAAGGMCCCTGCCTATACATTTTATTTTCTrm 

CGAGACAGACAACCTCAAAATMGGTCCAMTATTNGGTTCC7TNAAATGGTGTCAAAAA 

GAATAG TATTATATGAGGAGGATAGTTATCACAGMTAAGAACTAAAATCCCATTTTTiT 

TTTTTTAGGAAAAAAAGACCTTCNATGATGCAGGTGTNTGTGTATAAGGAACTA 

Sequence 402 

AGGTCTCAGCGTGGCTACAAGTAACTGTGGTGTGGAAGCAGAGTAGAGAGAAAACTTGTT 

CCTCATTAGAGAGAGAGCCACACTTCTCACTGCTCACAATGAGAGGCCAAAGATTACCCT 

TGGACATCCAGATTTTCTATTGTGCCAGACCTGACGAAGAGCCTTTTGTGAAGATCATCA 

CTGTTGAAGAGGCAAAGCGCAGGAAGAGCACATGCAGCTACTATGAAGACGAGGACGAAG 

AGGTGCTGCCTGTCCTACGGCCCCACAGCGCGCTCCTGGGGAATATGCACATCGAGCAGC 

TGGCCCGACGCCTTCCTGCAAGGGTGCAAGGG 

Sequence 403 

AGGTAC ATTTCC ATG GG CCCTG TTCCC ATTG ATGTATACTGCTTC CTTACTAAC AGTG AG 

GGATGACTTTCATCAGTCTTTTATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACT 

ATAAGCTGATGCAGCTGTGGTGAAAGCTGTAAAACACCTTTTATGGAAGAAAAGAAATAA 

AATGTAGTTGTCAAGTCTAAAAAATAGTAGCAACGGGAATCATAATGAATACATGCAATG 

AATTT AAAATGTAAAAATG AATTTAAAAAGT AAAAAG G G CTCTGTG GTGT AATTTTTCTT 

AACTACAAGAGTCTAAATACACTGCTTTTCTTTAAGAGTTCATTTT 

Sequence 404 

AGGTACTCTTGGATGTGTTTTCTCACCAAGATGAGCAAAGAAAGGTTTGCACAGAGGAGT 
GTGAATGTGTGTTTGTTGCTGGCTGAATGGCAATAGATGTCTAAGGTGGATTCAGTGTCT 
GGCACACTGAGACACCTCCAAGAAGGAGATTGATGCATCAGGTTCAGTTTAACCTGGAAT 
ATCTGACTACCCCTGAATCCACCCAGAAAGGGGGCCCAACACCCTTGTCCATTTATGGGT 
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A TTTTTT1 r CGAAGTTATTAAGCATATTCCTTTTCCACGAACCTCTTCTGTACCTGCCCG 

Sequence 405 

AGGTACCTCCTCATGACATGGGGTTTAAATAAACTACAAGTTCTATTACCTTTTTTATTT 

TCCAGGGAAAAAGAAACTTGGGAAGGCTATTTTACACMTTAATTACACATTGACACCAC 

AGTTCTGTTTGG ATAAATTGG ATAC TGTATCCTG GAGTTTAG AC AG G GAAAAGTGATCCC 

AATGTTTTACTTTCAGGTGTAAGATTTAAAACAAGAATTTGGACCAAAATGTTCATCACT 

GCAGTTACTTTCTGCTTTCCATTTTTMGCATGTTATCTTCATGTAGTCCACCTGAACAT 

CAACAGTTCAGCCTCTACAGAGTTTCTGTAATA 

Sequence 406 

AGGTACGTTCAGATGTAGCCATGACTGGAGAAATTACACTGAGAGGTCTTGTTCTTCCAG 

TGGGTGGAATTAAAGACAAAGTGCTGGCGGCACACAGAGCGGGACTGAAGCAAGTCATTA 

TTCCTCGGAGAAATGAAAAAGACCTTGAGGGAATCCCAGGCAACGTACCTGCCCGGGCGG 

CCGAGGTACTACTAATTCTACMTGCCTTTCTCTTTAGTCAGTAT TAAA AATCTTTTTTA 

AAGTATTGGTATGAAAACAAA I I I I J GTTGCCCTGATAATGGGAATTTTAAAACTACCCA 

C AGT7TAAGAG AAAAC ATAACTTGGTAAAAAAG G TAGCC AATAAAAC C AC A 

Sequence 407 

AGGTACTCAGGGGAAACTGGAAGCCGCTGGCTCTTTCAATTCTGATGATGATGCAGAGAG 

CTGCCCAATCTGTCTCAACGCATTCAGAGACCAGGCCGTGGGGACGCCGGAGAACTGTGC 

CCATTACTTCTGCCTGGACTGCATTGTCGAATGGTCCAAGAATGCCAATTCCTGTCCAGT 

TGATCGAACTCTATTTAAGTGCATTTGTATTCGAGCTCAATTTGGTGGTAAAATCTTAAG 

AAAGATCCCAGTGGAGAACACCAAAGCGAGCGAGGAGGAGGAGGACCCGACCTTCTGTGA 

GGTGTGCGGCAGGAGCGACCGTGAGGACAGGCTTTTGCTCTGC 

Sequence 408 

AGGTACCAGTTCTAAATGGGCCAAGATCCTGGGGACGCCAAAGTTATTGGCTGGATGGCT 

GATGAACTGAGGCCAACTAGAGACTGGTAGCACAACTGGGGACCACTTAGGGATGGAGAG 

TGAACACATCCTGGGCCTGGATAGGACGGGGAGATGGTGAGTGATGGTCAGACGTGGCCC 

TTGGCTCTCACTGTTGTTATCCCTCTCACGTTCACATAGGGGCACACGCAGGGCCAGCTT 

CATGGGTGTGCAACCTGTGCAGCCACTCAGAAGGGCCCCATGCTGGGTTTAATGCTTGGC 

TGTCACCATCATGAAATTAATAATGGTTGAACAGGGGCCCTGCGTT 

Sequence 409 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAATAAAAATTAG 

CAAACAATTATTCTAGGGATATTTTCAGATTTTACTTCATTTCTTGAAATGCGT GTGCCA 

TATGCAATTGCATTTCTTGTGCCAAGAAACTAATAGAACTTATTTCACTTTA CC 1 1 1 M I 

TAAAATGTG AATTTAAGTTATTATAG TTC AATTTT ATG G C CTT AC AG ATG G CTTTT ATTT 

TGTTTGCAGCTGACACTGCAGTTCCTTTCATGCAAAATACCATAMCTGTTTGATGAAAA 

TCATGCCCCTAATGGAAACTCTCTAGTTTTTCCATATAACTATCCTACTGTACCTGCCCG 

GGCGGCCGCTTCGACCAACATGTGGTGAGCATTCCACGGGCGCATGAAGTCTGGGTGCTG 

TGCTCGAGTCTCTGAATATTTTGATAGGAAAGCGACAAGAAAATTCAA 

Sequence 410 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAATAAAAATTAGCAAA 

CAATTATTCTAGGGATATmCAGATTTTACTTCAmCTTGAAATGCGT GTGCCAT ATG 

CAATTGCATTTCTTGTGCCAAGAAACTAATAGAACTTATTTCACTTT ACCl I I I I I I AAA 

ATGTGAATTTAGTTATTATAGTTCAATTTTATGGCCTTACAGATGGCTTTTATTTT 

GCAGCTGACACTGCAGTTCCTTTCATGCAAAATACCATAAACTGTTTGATGAAAATCATG 

CCCCTAATGGAAACTCTCTAGTTTTTCCATATAACTATCCTACTGTACCTGCCCGGGCGG 

CCGCTTCGACCAACATGTGGTGAGCATTCCACGGGCGCATGAAAGTCTGGGTGCTGTGCT 

CGAGTCTCTGAATATTTTGATAGGMGCGACAAGAAAATTCAAACTGCTCTTTGCTGACT 

ACTGGNAAGTGAAAAAGATGCTTCMGGTTTANCCATTCAAAGGAAACCATTAGGCCTTT 

TCAAAAAAC 

Sequence 411 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGTTTCCCTATTTTGGTGTTCAGCTTGAA 
AAAGGACTTGTCAGAATCAACTGTGTCATCAAAA TTTAA GTAATGTGCATTGAAAATAAG 
GTTGATCATGGGAATATGCAGAATTTCC AATGTAT 1 1 I I AAATACAAATAAAATTGTAAT 
TTAGAATTTTTAATCTTAGGTTTCTTGATTAATTTATAAGAGATCAATTATTGTCAG TC 
TTiXTGTATGTTTTTTAAAAACATAGTCCAGAGCATGGGCAGAATTGACACCTCTCTTT^ 
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AAGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATGGGGTTCAAATATATATC 
CTACACAACTGGGCAATACATTTTTGTTTGATTTTTAGGTCTGTGTATACATTAACAGTT 
C ATGTAATTAATACCTG ATCATTTG GG ATAAT 
Sequence 412 

CCGCGGTGGCGGCCGCCCGGGCAGGTTTGGGTCTGAAAGTCGATGAAGGACGCAGATTAC 

CTGCGATAAGCTTCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGAATGGGGTA 

ACCTAACTGAGCAAACCTCAGTTGCATTTTGATGAATCCATANTCAAATTAGCGAGACAC 

GTTGCGAATTGAMCATCTTANTAGCMCANAAAAAAAAA/^^ 

TTGNAACCTN 

Sequence 413 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTACACTCCAAGGTAACAGAATAATAATACAA 

TCTTCTATTTATTGCTCCCTATCAATGTTCTAAAAACCTTACATTTATTTAATCCTCATA 

AAAATCCTGTGAGGTAGGTAGTATTATCATCCCTAATTAAAAGATGAGGAAATTGAGACA 

CAGAGAGATTAAAATTGCTCAAGGTTACAAAAACAGTAGGTGTTAGAGCAAGAATTTAAA 

CCCAAGCAAGTCTGACTCTAGAGCCCAGGATTCATCACAATATTCATTTCCTATGAGCAA 

ATCTCAAGGTTTTTGGATATCTATAGTAMCCATTTATACTTCTAGTCAAGTAACACATA 

TGAAA ATTC ATGTCCACACCATGGGTGAAAGAGGTTGTTCAAAGAATAAAATGAATGCCT 

TGAAATTTTGGCAGATGGATACTACCATGGATAATA 

Sequence 414 

GCGGNGGCGGCCGAGGTACTATGAAAAGTTGATTGTAGCACAACTGTTCAATNAGTAAAA 

GGTCTTCGGCAMTTCCCTTTAGAGTATACTTTCTATAAACTACATGTTCCACAAAAAGG 

TCAATTATATATACAATTGATTTGTTTTACTTAATCTTATTTGCTCAGATCTTGCAMT^ 

C MTG AGMTATTAAGCCT GAGGC TAGTTCTCAGTGTATAGGTTTAACAAATTAAGGCTC 

ATTTTCCCAAATCAAAATAGTTTTTAGTTTTC 

ACMGAAGAAATGCTTAAGGAAGAATTTCAAAAGAAATCTGAGCNGTTAAATAAAGAGAT 

TAATCAACTGAAA 

Sequence 415 

CCGCGGTGGCGGCCGAGGTACTTGGCATATATTAGGCCTTCAAATGATGATGGATGGATA 

GACAGATAA AAGGCC CAATGAATGATTGTAAATGTGTCTTCTCCTATGTATCCATGATGA 

TATTGCCTAG l I I l I AATACAAGCTTAAGAATCAATAAAATGATAGATCTGAAATCTGTA 

GATGCCAGTCTGGCTCAGTATTAATACTGACAGTAGTTTTTTr^ 

GTATATTAAAATATCTCTGTTAGAAMGATTACMTTTGAAAGGATCATTCTACAATCTT 

AGAACTAAAAAGCAATTTAAGGCAAAGAGGTACCTGCCCG 

Sequence 416 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGAGCTTAAATAA 
TATCAAAATG CAAATATAGATTGGGTGC ACTGTTAAGCTGAA7TGCAAATTATG GCAACA 
CACACTGGACTGGGGTATACCGTTGCTTTGATATCACCATTTGTTTGTTTATGTCATGCA 
GACCACMTAGTCAATCTTTTGTTTTTC I I I I I IGTACCTGCCCG 
Sequence 417 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTGCCATGGCCA 
GTAAAAATGAAAGTGGATAGATACGGAMTACACACAAAGTGAATTTTGCTAAAACAAAT 
CAATGGCACTT MGAC ATTTCTATTCTAAAAGGCTTATTCTTAAGGCTTTTCTATTCTAA 
AAATTTGCAATGTTTTMTTGAATATTANATT^ 

ATGAAATATCAGGAAATAACTCACTGCTCAGTGGAAACTCTCATGGCTGNTGTTTTTTGT 
TG' rGTTGTTTCAG G ATTGG AGGTTTTAAAGC AGG AAAC AG C AGTTGTTG AAAACGTCCCC 
ATTTTGGGACTTTATCAGATTCCAGCTGAGGGTGGAGGCCGGATTGTACCTNGGCCGCTC 
TAGAACTA 
Sequence 418 

ACTATAG GGCGAA TTG NAGC TCCCCGCGGTGGCGGCCGAGGTACTTTATTATATTGGGTG 
TCTTGTC Mi ni GG GCTTTTAGTTAGCTMTGAATGAATACATTAGACACTTTTGGGTT 
TTAGTTGGGATTTTACATAGCTTGCATTTTMTTCTTTGGTTCTTTGCTGTT^ 
CCC AC AG C ATTATTTTAATAAA 
Sequence 419 

CCGGGCA GGTTTGT GCTATTCATTATTAMCACTAAAACTTTGGCGGNTCTTGCATAACA 
TTGTCAGA T \ WW \ AGTGTATTTCTGTG AAGACA l M I I I I I CTTGTCATTCCTTTTGT 
AGTANTTGCTGCTTNGGATAAAAAGTTTGATGTGTGATTTTTTTATTAAACAANATAGTA 
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AACCCTTCAATTATAAGTTAGTNCTTTGGTGGAAGTAAGGATGGTTTGTTAGGACNTTTA 
TTAGGTTCTTTAAATTTCATTTGGCCACCAAACCGTGGACTGGTTTGTAAGTCTTAAACC 
ACCCCAAAATTAAGTGGTCGTTNGGCCAAATTAAC 1 1 1 1 1 I I I AAAAATGGGCTTGNAAN 
AACCACCCTTAATAAAATTNGTTNCATTCAG N AAAATTATTCTTTGTTCANCNTG N CTTC 
CTGGTTTGCNCAAAATACCNTCTANNNAAATTAGGGAAACTTTTAANNACCGTTTATGNT 
TCCTTGAAGGTTCCCCCNTGGGAGGAANCCACCATTGGCCTTAANAANGTTCCCNAATTG 
GAAAAAAAAG 
Sequence 420 

CCGCGGTGGCGGCCGAGGTACTTTTGGTTA I I M I CTGTCATCAAACAAAAACAGGTATC 

AGTG C ATTATTAAATG AATATTTAAATTAG AC ATTAC C AGTAATTTC ATG TCTACTTTTT 

AAAATCAGCAATGAAACAATAATTTGAAATTTCTAAATTCATAGGGTAGAATCACCTGTA 

AAAGCTTGTTTGATTTCTTAAAGTTATTAAACTTGTACCTGCCCGGGCGGCCGCCCGGGC 

AGGTACACAACTGGAAAGACTGCTGTAATAACACAGCCTTGTTA'I I I I I AAGTCCTATTT 

TGATATTAATTTCTGATTAGTTAGTAAATAACACCTGGATTCTATGGAGGACCTCGGTCT 

TCATCCAAGTGGCCTGAGTATTTCACTGGCAGGTTGTGAAI 11 I I CTTTTCCTCTTTGGG 

GATCCAAATGATGATGTGCAATTTCATGTTTTAACTTGGGAAACTGAAAGTGTTNCCATA 

TAGCTTCAAA 

Sequence 421 

CCGCGGTGGCGGCCGAGG rTTTTTTTTTTTrTTTTTTTTl GCAAAGA I I I I 1 1 I I ATTTC 

TACTGGGGAGGGAGGAGGATAMTAGAACTGTTTTCCAATTTGCTCTCCACTGTATACAC 

ACATACCCACACACATACATAAACATACATNCACCAAAAATACCCCAAACAAAAMCAAA 

AAMCCAGGAATCAAAAACCAAAACACCCTCAAACTGCACCAATACTTCATATTTTGACC 

AAAAAAATATCCTGGGAGGAAGTGCAAAATGCAAAATCAAATGACCCGAAAAATGCTGAA 

CAMCAGCATTATTAATATACAAAATATTTATATTACTGAACTAGTAACAGGTGATGTTC 

TCCATGTCTGTAAAACTTTGGAACCACATAGCTGATTTGTTAAATCTAGTCCATGCCAGC 

TTNCAAAAACCAAACT 

Sequence 422 

TATTGGGCGAATTGGAGCTCNCCNNGGTGGCGGCCGCCCGGGCAGGTACNATATACGAAG 

ACTCTGAGCTGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGTTGTAAAGCTCATTAAGG 

CCAACTTTTACTTTCAATATGTGATTCTGCAGAATTAATTTAAGGAGGCGCTGATCCATG 

CTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCAAAGGTGTNTTCTCCATT 

TGGCCTTAAANCCAAATNCCCAAANCTTNNTTGGGCCCAAAAANNCCCGGAAAGNGGGGT 

TTG l lll l ITTCCCGCCCCNTTNAANNAI I I I I I 'I 'I GGGAAAAANNCGGGAATTTGGCCC 

NTNI Mill I AAAAGGGGGGCCCCCCC I I J M I M I NAAAAAAAANANTTTTTNTNTNCC 

CCN NNTTTN NGNNCCNTTNTTTGNNNNGGGGGGGNNCNCCCCNCNAAAAAAAANNAAAAN 

NTNNTTTTTNCCCCCCNNNNANGAGGGGGGGNNCCCCCCTNTTTNANAAAAAAAAAAAAN 

NNNNNNNN I rTTTTTTTTTTTTTTTNC 

Sequence 423 

TTAGGGCGAATTGGAGCTCCC CGCGGTGGCGGCCGCCCGGGCAGGTACTG ATTTC AGTAA 

GTCTCATAGGTTAAAAAAAAAGTCACCAAATAGTGTGAAATATATTACTTAACTGTCCGT 

AAGCAGTATATTAGTATTATCTTGTTCAGGAAAAGGTTGAATAATATATGCCTTGTATAA 

TATTGAAAATTGAAAAGTACCTCGGCCGCCCGGGCAGGTACCATCTTGGCGGATGACTTC 

AGCATTAGATGGTCAGGTGTA I I I l i I I ATTAATAGGCGCATTCAGTTTGGAAAAGGCAG 

CTTTTCATTAATGTCCCACGTAAAGGAATATGGCCTATACAGTACCTCGGCCGCTCTAGA 

ACTAG 

Sequence 424 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGTAACATCCAAGAGCC 
CATTCTACAGTGGGTGGTTTrGGTCTTTTTATAACTTTTTCTCAMGTCACTGATGTTTG 
TTCCTGTTAAATGTATATGCA 
Sequence 425 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGGATCTAAAAACAAAC 

AACAACTCAAAAAAAAAAAAAAAAAGAAAGAAAAAAGAAAAAGA^ 

AGGTGTTACCAGGAAGTTAAGCATGCCAAAAGGTGTTCCGTGCGAATGAAAACCTAAAGC 

CAAGGTACTCTCCACTCTTCCAGTCAGAGTGGCACATCTTGAGGTCACGGCAGGTGCGGG 

CGGGGTTCTTGCGGCTGCCCTCTGGGCTCCGAATGTTCTCG 

Sequence 426 
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TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTAAAAATGTAATCTA 
GTTG GCAAA GGTGTGCGCTAAAACACGGAACCGAACATGCATTGATTTGGATMCTnTG 
AGGGTTTTTGNCAAATAGCATGTGAAGAGTTACN I i 1 1 1 CTTAAAAG ATTGGTGGTCCCA 
ATGTCAGAGTTCTTGGAACAGATAACTGAATGATAGAI M I I I ! 1 i I AAAG ATAAAACTT 
TAC AAC CTG C AC ATTTGTTATGCATACTAAATGG N G TGTTAAAATTAGG G TTTCTTTGC C 
TNTCTACACTACACTAATCTGCCTAAAGGNGGTTGTTTCATATTTATAATGCTAATTATC 
ATACCTACCTACTTTAAATTTTAGGTAGAAAATTATCTGATTTAAATACAAACATNTTTT 
TCTCACATTGAGTAATATGCATAATGTAAGTTCAAATGTATTTCAT 
Sequence 427 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGCA 
TATATTAGGCCTTCAAATGATGATGGATGGATAGACAGATAAAAGGCCCAATGAATGATT 
GTAAATGTGTCTTCTCCTATGTATCCATGATGATATTGCCTAGTTTTTAATACAAGCTTA 
AGAATCAATAAAATGATAGATCTGAAATCTGTAGATGCCAGTCTGGCTCAGACCT 
Sequence 428 

CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCATACAGGGCAGATGT 
CAGAGATTATGATCACAGGGTGCTGCTCAGATTTCCCCAAAGAGTGAAAAACCAAGGGAC 
ATCAGATTTCTTACCCAGCCGACCAANATATTCCTGGGAATGGCACAGTTGTCNTCAACN 
TTCCCACAGNGTGGNTGNTGTTTCNCCCNTTTATCCCTNTTTGCCCCNCACCCCAAAAGA 
ANANTNGGTTNTNTGGCCCCNAAAAANNAGN I I I M I i I I 1 I I I AAAAAAANAAAACCCT 
GGGMCTNTTNGGCTTCCCNCAANGGCMTTTTNTNNTTNTTNCCCTTCCCCGGGGGGNG 
GNCCCNTTTTTNAAAAACANAGGGGNNNCCCCCCCCCNGNGNCGGGGANNGAATTTTNNG 
AT7TAAAAAN 1 MINI INNNNCCCCCCCCTNNAACCNCCNTNGGGGGGGGGGGGCCC 
Sequence 429 

TAACACTACTTAGGGCAA7TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCG 

TGTTCAACCGCAACGAGGATGCCTGCCGCTATGGCAGTGCCATCGGGGTGCTGGCCTTCC 

TGGCCTCGGCTTTCTTCTTGGTGGTCGACGCGTATTTCCCCCAGATCAGCAACGCCACTG 

ACCGCAAGTACTTCCTTTGGCAGACGGCTATTATGTTTTACTTTAGTGCCGTATTGTGGA 

TGCCAGAGAACTTGCATTTGCCAGGGAATAATCTCCTGAGATGCTATATTATGAAGTAGA 

ATGGTGTAGTCTAATTCAATAMGTCATCATTTCTTTGCACTTGCTTGGACAGTTTGAAT 

TTTTACCATTTTATACCATGTGTCTTTCAGTAGGAATTTTGCCATATTATATAACCACAG 

GCAACCCAAGCTAAAATGTAGAACGGAGCCGTCATTTCTGGGAGATACATTTCAAA 

Sequence 430 

ATCACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTCACAGCTG 

ATCCAGMGTCAATATTCTCCTCACTATATTCAGACTTCAAGAAAGCTTTGAAAGCTGCC 

AGCCCACATTCATGACTAATCAGGTTrTCCAGTGATTCAGCCCATTTCTTGACTTCCTCT 

TGGCTCACTCTCTGGCAAATMCCACTTTGTCCTTNTGGTTGGGGGAAAGAATTGGGGTC 

ACAGGAATCAAAATTTTGCAGCAGGGAAACCTAGCCCGTGGTTTTCATATTNTTTGGACT 

CCTAAAGAAAANAAACCCGGAAAAAGCCTTGGNAGGCCNTTTNGCAAATTNTTNTNTAAA 

ACNGGGTTGGANATTCCCCAGGGGGGANNAAAAAANATTNGCGGGAAGNGGGNGGNNCTT 

TTTTTTTTTNTNTTAAAANAANACCCGGGGGGGGGGGNANAAAANNNAAANAANTNCCNN 

CCCCCCGGGGGGGGGGGNNNTTTTfcAA I Mil I I CCCCCCCCCCCCCCNGGGGGGGGGGG 

GGGGGGCCCCCCCCl 11 I M M I N I i II 1 I II 

Sequence 431 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGACCTCCTTACTTACCCCCCTTCT 

CCTTCGGCTGGAGCTCGGCGAGCGAGAGGCGGCGCTGGCGTTGGAGAGCGTGCTTGAGAA 

TGAAATGTTTCCCATCATAGTGGAT^CTTAAGCACGTTCTCCACGTATGCGGCGTGCTAG 

CTGGATGTCTTTNGGCATAATTGTrACACGTTTGGCATGGATAGCACACAGGNT GGGT GT 

NTTAAAAANGGCNAACCAMTGGGCCNTTTTNGN 

CTTTGNNANNNCCCAAA I lllll! fNAAAGGCCCGNGAAAATTTTTTNCCNCAAACCCCT 

TGGANGGGANATTTNGGGAAMNAAAAMTTTAAGGGGGC I I I I II I I NAAACCCTTTAN 

NTTNTNGGGGGGGCCCCAAAATTTCNTTGNCGGGGGGGGGCNTTTTAAAAAAAAAANNGG 

GGCCCCCCCGGNGGGGGGGGGGNANTTNNANNAAAAATTTTTCCCCCCCCCCCCCCCCGG 

GGGGGGGGGC 

Sequence 432 

GGGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCGATGTTGATCTTCANGTTTATCTCCC 
CGCGACTTGTCCACGTAGAGCTCAGGATGCACCTCCGTGGTGAGGTAATACTGCACCTGC 
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CCGGGCGGCCGAGGTACATGCCTAGACCTGGGCTCCGGCCAGCGCCCAACAGCGTGGATG 

TCGATGACTTCATCAATACGAGAATACAGGAGGCAGACAATGACCCCACG 

Sequence 433 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTATTGCCATTTCTCTGCTATATAGT 

AATATTTCTTGACACCMTGGTAATGTGTTrGTTTCTTTCTCAGCACCAGATGTGTCCAT 

ATG TATTTAAACAG TAG G AGC ATTGTTTCTAAATTC ATCTATG CTATGAC ATATATAAAC 

CCATTATTATTATG7TTCACTAAAGTTCCAGCACTTTAGAAMGTTTCAGTTCAAAGTCA 

TTTTGGCTCATTCA7TTAGATAATACTGACCATTTTGCACTACAATTTCAAAAGGAACAT 

GAGAMTTTGGATTTCTTTGAAAGAGTCAMTATGTAATTACAGAATTGAAACACTGNGT 

TAATTCCCAAAGGGGGTGGTAATAATTAACATTAA 

Sequence 434 

CGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGTTTCCCTATTTTG 
GTGTTCAGCTTGAAAMGGACTTGTCAGAATCAACTGTGTCATCAAAATTTAAGT/ATGT 
GCATTGAAAATAAGGTTGATCATGGGAATATGCAGAATTTCCAATGTATTTTTAAA7ACA 
AATAAAATTGTAATTTAGAA I I 1 I I AATCTTAGGTTTCTTGATTAATTTATAAGAGATCA 
ATTATTGTCAGTC I Mil IGTATG I I I I I I AAAAACATAGTCCAGAGCATGGGCAGAATT 
GACACCTCTCTTTTAAGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATGGGG 
TTCAAATATATATCCTACACAACTGGGCAATACATTTTTGTTTGATTTTTAGGTCTGGGT 
ATACATTAACAGTTCATGGAATTAATACCCGGATCATTTGGGATAATGNAAGTGNAGTTA 
GTTGTAGATG AAGTAAAAG NTATAAAAGGAG ATTAAAAATGCG GTAACCTTTTA 
Sequence 435 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGC CGCCCG GGCAGGTACCTTATTGATA 

GTGGAATTATATA I I 1 I TACTCTATGTTTCTCTACATG I Mill \ CTTTCCGTTGCTGAA 

AAATATTTGAAACTTGTGGTCTCTGAAGCTCGGTGGCACCTGGAATTTACTGTATTCATT 

GTCGGGCACTGTCCACTGTGGCCTTTCTTAGCATTTTTACCTGCAGAAAMCTTTG TATG 

GTACCTCGGCCGAGGTACACAACTGGAAAGACTGCTGTAATAACACAGCCTTGTTATTTT 

TAAGTCCTATTTTGATATTAATTTCTGATTAGTTAGTAAATAACACCTGGATTC TATGG A 

GGACCTCGGTCTTCATCCAAGTGGCCTGAGTATTTCACTGGCAGGTTGTGAATTTTTCTT 

TTCCTCTTTGGGGATCCAAATGATGATGTGCAATTTCATGTTTTAACTTGGGAAACTGAA 

AGGGTTCCCATATAGCTTCAAAAACAAAAACAAAATGTGTTATCCGACGGGATACT 

Sequence 436 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGTTTCCCTATTTTGGTGTTCAGCTTGAA 
AAAGGACTTGTCAGAATCAACTGTGTCATCAAAA TTTAA GTAATGTGCATTGAAAATAAG 
GTTGATCATGGGAATATGCAGAATTTCCAATGTATTTTTAAATACAAATAAAATTGTAAT 
TTAG AATTTTTAATC TTAGGTTTCTTG ATT AATTT AT AAG AG ATC AATTATTG N C AG TCT 
TTTTTGTATG I I I I I I AAAAACATAGTCCAGAGCATGGGCAGMTTGACACCTNTCTTTT 
MGTGAAATTTGGATTGCTCACAAAGCACTAGGAAATGTCATTGGGGTTCAAATATAT^^ 
CCTACACAACTGGGCAATAA I Mill I GTTTGATTTTTAGGGCCTGNGTATACATTTAAC 
AGTTTCATGTAATTAATACCTTGATCAT 
Sequence 437 

CCGCGGTGGCGGCCGAGGTACTTAAGCACTGCAGATGCTCCAGTAATATGCCCATAAGTT 

OCTTTCCAATTTCAATTACTGGGAAAATATACATATGGACAATAGATGGATGCCACAATA 

AAAGGCTGGCAGCCTAACCCTCACATGAATTTTTCCCTACCTCTATTTAGGGTGACAGTG 

GAGGGCCTCTGGTTTGCTTTGAGAAGGACAAATACATTTTACAAGGAGTCACTTCTTGGG 

GTCTTGGCTGTGCACGCCCCAATAAGCCTGGTGTCTATGTTCGTGTTTCAAGGTTTGTTA 

CTTGG ATTG AGGG AGTTG ATG AG AAATAATTAATTG G ACGGG AG AC AG AGTG AC G C ACTG 

ACTCACCTAGAGGGCTGGAACCGTGGGTAGGGGATTTAGCATGCTGGAAATAACTNGCAG 

TAATCAAACGAAGACACTTGTCCCCAGCTCCAGCTACGCCCAACTCGG 

Sequence 438 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGAAACATTTTCT 
TTTATTGTTACTTGC I 1 I U AAACTTTGTTTAGCCACTTAAAATCTGCTTATGGCACAAT 
TTGCCTCAAAATCCATTCCMGTTGTATATTTGTTTTCCAATAAAAAAATT^ 
AAAAAAAAAAG AAAC AAG AAAAAAAAAAAG AAAAAAAAAAN N AAAAN AN AAAAAN AAAN C 
ANAAAAAAAAAAAAAAAAAANTANANAAAAAAAAAAGGAAAAAA 

AAAAAAANAAGAAGAAGAAATGGTTTTNGCNNNGGNCCCNNGGNGGNCTrTCTTTTNAAA 
NANATNTTTTTNAAANGGNTGNNTAAAAANANANNAACAATGTNNNNNAAAAAATCTTTT 
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TTNTAAACCNNGGGG CNGTTNNTTTT TN^ 

AAAAAAATTATNGGGG 1 MINIMI 1 NAAAAACCCNCGGGGGCCCNGGG 1 I H I I I I I I 

AAAAAAGGGGGGGGGGGCA 

Sequence 439 

CCGCGGTGGCG GCCG AGGTACCACGACTCTTATTTCCTGCAGCTnTGAGAATTCTAAAG 

ATACCTGAGTCATTTTAAATAAAmCAGCTTTCTTTACCAAACTACCCTTAACCGGATT 

CTCC TTGAAATGACATCACCTGACACCCATGGCATGCTGCATGCCCACAGCTAGATTACT 

TTTTAGTGCGCCCACGCACTTCTGCTTTTATTAGGTGAGGCAGAATCAAGATTCCCTTTT 

GTTGGATCTTGAACCTGTTCATGCCACTTTGATATTCTAAATTCATACATAAACCAATGA 

AATATATGTGTTAGAAAAATTGCTATTTCTAGCTGGACGCAGTGGCTCATGACTATAATC 

CTACCACTTTGGCAGGCCGAGGCAGGAGGATTGGTTGAGGCCCAAGAGTTTGAGACCAGC 

CTGGGCAACATAAGTGAGACCCTGGTTTCTACAA 

Sequence 440 

CCGCGGTGGCGGCCGAGGTACTTGGCATATATTANGGCCTTCAAATGATGATGGATGGAT 

AGACAGATAAAAGGCCCAATGAATGATTGTAAATGTGTCTTCTCCTATGTATCCATGATG 

ATATTGCCTAGTTTTTMTACAAGCTTAAGAATCMTAAAATGATAGATCTGAAATCTGT 

AGATGCCAGTCTGGCTCAGTATTAATACTGACAGTAGI M M 1 I G TTTTTTAAG AAAGCA 

AGTATATTAAAATATCTCTGTTAGAAAAGATTACAATTTGAAAGGATCATTCTACAATCT 

TAGAA CTAAAAA GCAA TTTAAGGCAAAGAGGTACCTGCCCGGGCGGCCGAGGTCCCCCTT 

i n 1 1NTN7TT NATTTTN C CTNTTTN TCTTTTTCTTNTTTACTGCCCNCN 

GTTGGANCC CTTTTTTTANTTTTTTNCAACACCCCCCGNCTTTATMNTM 

NM M I M I n IAAAAAA 

Sequence 441 

AGGTACCAGGTTTTATTTTTCACCTTATTCTCTACTTTAAACAMTCATAACTTT 

TAAGCCTCTGCTATAAATTCTCCTGGCTCTCCTGGGCTTNCATATnTGGGGGCTTGGGG 

TGTCAAAAGTGAGATGAAGTTCTTTAGCTCCAGGTTTTGGGGTAAACCANAGGTAGGTAA 

CATTGTTGGGCATTTATTTTGCAANTTAANCAANTACTTTCCTTGNNACTNGGNTGCCGG 

GTGGCTCACGTNTTGTAATNCCCAGCACCTNTTGGGGGAGGCNTTGAAGACCAGGTNGGG 

ATNCAACNNTNNAGGGGTCGGGGTAGGTTTCTNAGGAACCATGGCGCTGNGCCNCAAACN 

ATTNAGGCCAAAANACCCCCTTGTNCNTTTTNACTTAAANAANANTNACCAAAAAAAATT 

TAANTATTGGGGGNTGGN I MM t GGGGCAC 

Sequence 442 

CTA CTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCTTTGCCTTAAAT 

TG CTTTT TAGTTCTAAGATTGTAGAATGATCCTTTCAMTTGTAATCTTTTCTAACAGAG 

ATATTTTMTATACT7G CTTTCTTAAAAAAC AAAAAAAC TAC TG TC AG TATTAATACTG A 

GCCAGACTGGCATCTACAGATTTCAGATCTATCATTTTATTGATTCTTAAGCTTGTATTA 

AAAACTAGGCAATATCATCATGGATACATAGGAGAAGACACATTTACAATCATTCATTGG 

GCCTTTTATCTGTCTATCCATCCATCATCATTTGAAGGCCTAATATATGCCAAGGTACCT 

GCCCGGGCGGCCGCTCTAGAACTAGGTG 

Sequence 443 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGNACTGACAGGAGA 

TTGGCTTTTACACTAAAAAAGACAGAAAATGGACTAAATAGGAACATAATTTCAGAGTTT 

ATGAGTCAGTTTACAATATGCCATATTAGCGATCTCATGATGATTCAGACCACAGTCTGG 

GTTAAAGATGTCTTGAC CTCTGA GGCTAGCATAACATATATGAGAATACAACTTGCCTAT 

TTCCAGAAAATCTGTATATTmATAGAATAACTTTATITACAATTrCTATCA 

ACTCACTAGGCATTATTTGT 

Sequence 444 

ACACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCAGA 

AGTTTGCTGTCTCTAGGATTCATATAGCACCCACAGAGCTCCAAGTAACCAAATTCCCCC 

AAAGACAGGAGGTGTGGCTGAGGAGGAGTGACCATATTGAGTGTAGCTTTGAACGCCTCC 

GTTACTCTTTAGGAGGCGACCGCCCCAGTCAAACTACCCACCACGCACTGTCTCCTTCCC 

AGATAAGGGGAACGGGTTAGAAAATCAATTTAGCAAGGGTGGTATTTCAAGGTTGACTCC 

ACTAGAACTAAGCGTCCCAGCTTCAAAGTCTTCCACCTATCCTACACATGCTAAACCAAT 

Sequence 445 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCC i MM I AAAAACAC 
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TGTAAAAGTAACCACAAATATGTGAGGACTTACTATTTTAAATGGAATGGAATGAGCTCC 
ATAGATTAGTTTTGAATATAAAGTATATAAAAGTGCATCAGTGGTTTATATAGGCTTTAA 
AAACATGTTATCTTACAGTC CTTTAAAGC AGC CATAG AGTTTGTATCATTTTTCAAGC C A 
ATTTCAGTCAGGGATTTGMTTGTTTGATTATGGATGATAAATGTGTCATATCTTATTAA 
TATGTCTC ATG TCTCGTTCCTTCTTAATATG ATTTAGCTG G AATTC ATTTTCTTTTCGTT 
TC ATGTTTAATTTCATAAAAC GTTTAAC AATTGGC ATATATACTTG G CATTCCTGTC C AC 
CAAGGGATTGTAATCCAAGCCTGGGAAAATCTTAAATTTCTTTTTACTTAAATCTGGNAA 
TTTTGTCNTCATTCTGCCACCTTTTTTTTTTTTT 
Sequence 446 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTCCT 

AAAAGAAGATTACTGCTGCCAACTTMGTCATCTCCGTTAACGAAATTGC ATTCT TGTGG 

CAGAGTTAAAACAACAAGAGAAATTCAGTGTTTGCTGGTTCTGMTGTCATTTTTCCTCC 

CTGGTGTGGTTTTACATTTTCAGCTTCTTTCCCTTTTCTTCTCTCCCACCCTCAAAATTC 

TGCCTTAGCATTTGTGTGCTTAATTAAATCCACTCTGTGCTTTATTGTTGGAGAATGTGG 

ACAATACAAAGATTTGGGGTGGGGTCATACAGTGTATACAAAACACACACACTATGTGTT 

TGGACAAATTCGCCAAGCGTGAGAATCATCMGTAGTGAGTTTAAAAGTTTGAAAATCAA 

GACCCAACATTTTGGGGTGTTTAAAATATCTCCCGCCTTGAAATGGNTCCTGTTTAAGTG 

TTAGATNGGAGANCCCTTGATTCAAAAACAAAAACCGA 

Sequence 447 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAGAGAGAAATTCAAACAAAATATTGCTGT 

TCTTCAGTTTTGTTTGTGGAATTTGAAATTACTCAMTTTAAAATAAATTACTGGACTGT 

GGAAATMCATAGAATTGMGTmMTTAMTACCACTCAMCGAAMGAACAGTAGTT 

TTTGTAGTTTT ATATTG G ATACTG AG GC ATTAGGG AGGC ATG AAAGG AAG AGG AATG AGG 

ATTGAGACATGTGAAGACATTGTGCATTATATCAATGTGCATTCCTGTAGTTCATTAACA 

AGGTACCT 

Sequence 448 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAC 
CAAAAI Mil I ATTTGACTAAATTCTATTCATGCATGAGCATGACAGTCTCCCGTTTATT 
TGACTTTGGCAATAAAAAAACAAGCAGCCTTGTACTTCATTGTGAGGTGGCTACAACTCT 
ATAATATGCACAGTGATTTTAAAATAGGC I I I I I GCATGCCTTGCATGAAAGGTGCTACA 
TACAAACCTGTTTTGTGAACTCTTTGGTAACCACCAATTTAAAAATTTGGATGAAAGCAT 
TTCCACATGGACAGATCTGAAGCACATTATTGGAGCTCTGAGCCAAAGCTATTACCCTGT 
ATATTGATTCTTCAGTTTCCTTGAGGGGTTAGGTGTTGATTTAGAATACAGCCAGATAAT 
TTAAAGCATGTCAGGCCCCGGTTAGGAAAATGAAAATG 

Sequence 449 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAATAGGAATGGGTrriTTACCTGTTTAAA 

GTCACTTTGTGTTTATAACAAMTTACTTTTAGCTGAGGAACAMGGTGACA AAGAT TT 

TGTTGGTGGCTGAGAGTCAMGCAGGCCAATCCACACCATTACCTGAMTATT7TTCAGG 

CTAATTTATAACTTTATGGATTTCOTCGATACAAGTTTATTAGTTTATTCTCCATATAC 

AGTTTATTCTCCCAGAATAGCAGCAAATAAAACTTGAATTGGATGTACCTCGGCCGAGGT 

ACTACGTGCCAGCTCTAGTTTTC/GCCTTGGGAGGTTTTATTC 

Sequence 450 

CACTACTTAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGCATATAAAA 
ATGGTCTAAGATGCAATTTTCCTCCATTCC I Mil I GCTTTTAAAATACTGAGACAGCAT 
TTTAATTCAATATTCTAGGTTCAA^TGATACATTAAAAAAAAATCATACCAACCTTTAA 
TCATTCTAC ATCC A I WW I AAAGTTAGCTAACAAGATGATGTTTCACTAAAATAAAATA 
TCCAATCATCAGATTAAAGTGT/\AAGTTTGTGTGAACAGGGAMTTAGATCATTTCTCTA 
AGTTTTAATTCCTATGTTTCTGAATGTTTCTTGMT^^ 

TTTCAAAACACGGCATCTCTCITTCACCATTCCACAGAGAGAGAAAAGACTAGAAAATAC 

TTTAAAAAAAATAAATATTTTA\A 

Sequence 451 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTACGTGCCA 

GCTCTAGTTTTCAGCCTTGGGAGGTTTTATTCTGACTTCCTCTGATTTTGGCATGTGGAG 

ACACTCCTATAAGGAGAGTTCAAGCCTGTGGGAGTAGAAAAATCTCATTCCCAGAGTCAG 

AGGAGAAGAGACATGTACCCCACCTCTGAAGATGCTGGGGGAGGCAGCTGGGATGGGAGC 

CAGCCCCATGCCTGTCTGTGACCCCACAGTGGGTGAGAGCCCGTCACAGTCCTGGGGTGT 



Page 62 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



GGCTGCTCTGGAAGAATTAGGAGGCAGCCATAATAAGAGTCTTCAGAGAGATGATGGGAG 
GGGCCAGTGAGGACAGGAACAGGAGAGTNNGATGTCCTATAATAAAGGGGGC 
Sequence 452 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGGAGATGCTGCC 
TTGAGACAGCTGAGGTCACCGCGGCGCGCACAGGCCCCGAACGCTCAGGAGTGTGGTTGT 
TAGGAGAG CACACAGG TGTTCATACAGTGGCATTTGGGACACAATCGTTGGAACCTGAAG 
AATCTGAAG I II 1 I I I I ACCACCATCTTTTTCTACTCTGTAAGGAAGTAGATCTTTATGG 
GGAAAAGAGAATTTGGGGTGTTCTGCAAGCCAGTCAAAGTGGCACAGCAAATCATATAAA 
TCGAATTAAATGGACAACACCGTTAGATGTGTATGTAAAAATTTTCTGTTTCATATTTTT 
CCTTTCACTTTCGGTTTAAAACATGCTATATGTACCTCGGCCGCTCTAGAACTAG 
Sequence 453 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGCATATATTA 

GGCCTTCAAATGATGATGGATGGATAGACAGATAAAAGGCCCAATGAATGATTGTAAATG 

TGTCTTCTCCTATGTATCCATGATGATATTGCCTAGTTTTTAATACAAGCTTAAGAATCA 

AT AAAATGATAGATCT GAAATCTGTAGATGCCAGTCTGGCTCAGTATTAATACTGACAGT 

AGTTTTTTTGTTTn TAAGAAAGCAAGTATATTAAAATATCTCTGTTAGAAAAGATTAC A 

ATTTGAAAGGATCATTCTACAATCTTAGAACTAAAAAGCMTTTAAGGCAAAGAGGTACC 

TGCCCGGGCGGCCGCTCTAGAACTAG 

Sequence 454 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCTGACTTTGTGACTTAGGCGGCTG 

TGTTGCCTATGTAGAGAACACGCTTCACCCCCACTCCCCGTACCTGCCCGGGCGGCCGCC 

CGGGCAGGTACAAGCTATTATATATTTTCATAATATAGAACTTTAAAGTATAGTAATCAA 

CAGTGGCAAAATATTTCTGTCTCTGACATTATTATTTGTGAGATCCAACCTGTAATTGAG 

ATCAGAGAAAACAGTATGGGAAAACAAATCCATGAGAACAGGATAAAAATATCATGAACA 

GCAAMTATGTCTGAGTATTTAAMCTGTATAGAAAAAATAAATTAGATGMCAATTGTG 

TAAATATAAATTAAAAMGATGACATTCTTAATAAATAAAGTTTACTAm 

TATATTGCCAAAAGATTTAACTTAGATNCATAAAAACCACATAAACAGAA 

Sequence 455 

ACACTNCTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTTTT 

I I I I I M I I III I M 11 I I GGGCGAATACAGATAAATTTATTAGTTAAATACTG ATTTTC 

CAGCCATTTCACCTTAAGACAATGTTAACAGGTTTGNGGGTTAGGGAGGGTATACGAGGG 

GGCCTTTGGMGAAAACAATGTAAATGATGATTAAAACAGAATCTTGGTTCAAAGGTATT 

CTNTGCTATAGCCAGTAGGATTTTGGAGTGAGGGGTCTGGGCCGTGTGGGGAGGCGTAAT 

AATGCCACAGTCAGCTNCAGCTTTTGNTGAGAAAGAGGAAAGGAGTCTCTTGAGCTCAGC 

ATCAAGGGGCAGAACAGCAATGTGCCANAGGAAGAACGCGGGCAGATCCCGGAGCACTTA 

AAAGTCNGAGGCTTNCTTCAGGCCCCCCTGNCGGGTTTAANCAAGTCGTTTTTNTTTCAN 

GAATTCTGGTGGGGTGGGGT 

Sequence 456 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGATGAAGGAACTGACAGGAAA 

TTAACAGCCTCGTGCAGGCAAGGACAAAGACAAGAAAAGGCCCATTTGGGCCTCATATCT 

CTCTGGATTGGAACCCGCAGGAAGCAGAAGGGGCTGTTTAATCCTGGCTCCAGAGCAACC 

GTGAAGGCTGGCTTCCTCCCTTGMGAAGAACAAGGTGAAGGAAAGAAATCACTGTGTTT 

TCAGCTCAGCGGCCCTGTGACATTCCTTCGTGTTGTCATTTGTTGAGTGACCAATCAGAT 

GGGTGGAGTGTGTTACAGAAATTGGCAGCAAGTATCCAATGGGTGAAAAAGAAGCTAACT 

GGGGG 

Sequence 457 

GAGGTNCTGCATGTTACATATTCTTCTTTAAAATTTTGTMTAMCATTGACAGTGTTTG 

GTGGGCACAGGGAAACAGGATAACGTAGAGTCATTACAGMGAAAAAAACTTATTGCTAA 

CATTGCAGTATTCCTTTTATCAGAATTAGGTGAGTATTGATTGTAAAAGCTCTATCAACT 

CTTGCTCTTATTTGATGACTTTGAGAC I I ! I I [ ACTCTTGCTATAAAAAGAAGGCTACTT 

TCTTTCCCTAATATATTTCTACCAATGCGAATAATTCAGGAAACAATGAGAGAAAAGTAA 

TTCACACTTAATGTGTTGTTACTTAAGAGATTTGACGGATAAAATAAAATCAAATTAAAT 

CATTGAAAAGGCAGGCTTAGACCCCNTTAAAAACACCGTTAGTTGCCTACCATAATTTGA 

GCACTTTTTCATGGAAAAAAAAAAAAC 

Sequence 458 

CTCNCCGCGGTGGCGGGCCGAGGTNCATTTCCATGGGCCCTGTTCCCATTGATGTATACT 
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GCTTCCTTACTAACAGTGAGGGATGACTTTCATCAGTCTTTTATCACCTGAACAGTCTTC 

CGGCCATAATGATAGTAACTATAAGCTGATGCAGCTGTGNNGAAAGCTGTAAAACACCTT 

TTATGGAAGAAAAGANATAAAATGTANTTGTCAAGTCTAAAAAATAGTAGCAACGGGAAT 

CATAATG AATACATGC AATG AATTTAAAATGN AAAAATG AATTTAAAAAGTAAAAAG G G C 

TCTGTGGTGTAATTTTTCTTAACTACAA 

Sequence 459 

CGCGGTGGCGGCCGAGGTACAAGCTCCTAAAAGAAGATTACTGCTGCCAACTTAAGTCAT 

CTCCATTAACGAMTTGCATTCTTGTGGCAGAGTTAAAACAACAAGAGAAATTCAGTGTT 

TGCTGGTTCTGAATGTCATTTTTCCTCCCTGGTC 

CTTTTCTTCTCTCCCACCCTCAAAATTCTGCCTTAGCATTTGTGTGCTTAATTAAATCCA 

CTCTGTGCTTTATTGTTGGAGAATGTGGACAATACAAAGATTTGGGGTGGGGTCATACAG 

TGTATACAAAACACACACACTNTGTGTTTGGACAAATTCGCCTAGCGTGAGAATCATCAG 

TAGTGAGTTTAAAAGTTTGAAAATCAGACCCAACATTTTGGGGTGNTTAAAATATCTCCC 

GCCTTGAA 

Sequence 460 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGACGAAGAATG 

AAAAGGTTGTATTAAAGAACTATCAGGACATGGTTGTGGAGGGTTGTGGGTGTCGCTAGT 

ACAGCAAAAAAAAAAAACGTCAAGCCAAACACAAACA GCGGAAACGC CTTAA GTCCA GCT 

GTAAGAGACACCCTTTGTACAAGC 1 i I 1 I I I I II II I I I I III II I I I I I NGAAGTTTTT 

AGTTTATTACCTGCCCG 

Sequence 461 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGTTATGTCGGGAACA 

CGTGCTGCTAACTCACTGGTGAGTTCAATGGCAACGCTTCATTCGGGAGGCTGTTCTGCT 

TTACGCATCTGAGAACTACATAGGAGAGCAAGTGTCTGCACCTCCTAACTGCAGAAGCTA 

CCGTCTTCTCAAAGACGAAGGTCTTTGCAAAGTTCAGTGCTCGGTGTTCTCGGCACAACA 

ATGCNNTGTAGTTCANAAGGTATTTTGGCAACTC TTAAT CTGAACAA GAATG GGGGGGGC 

GCTTTTGAAAAATMGGCTTTAAGMGGCTTGTCATTT^ 

GTGAGTCTGAACTCTTACATTTAGAACAAACAAAACCTTAAAATTNCTGATTGGTTCAAA 
AAATGGTTTTATGGAAAAATTAATCTGTAACAAAAAGTTGGCATTTGAGTGCGAAGGCTC 
CCCCGTTGNl I I I I I l I I IGN1 I I I I 

Sequence 462 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGC I I I I i I I M I I 

rTTTTTTTTTTTTI CCTGTTGCTACTAAGATGTTTCAATTCGCANCGTGTCTCGCTAATT 

TGACTATGGATTCATCAAAATGCAACTGAGGTTTGCTCAGTTAGGTTACCCCATTCGGAA 

ATCNCCGTATCATAGTTTATTTCCAACTCCN CG AAGCTTATCGCAG GTAATCG N GTCCTT 

NATCGACTTTNAACCTGCCCGGGCGGCCGCCCGGGCAGGTCCAAAAACCAACNTGACACA 

CAGGAAAAAATAAAAGTGCAATTTTAATATAGGGAATGNGATNCATGTATAATTCCTCAT 

AACAAAATGGTCAAAACCTTTAAAAGATCCNCAATAGATTTCTGAAAATCTTA GCAAT GC 

TGTATTTNTTTTGNGGACTAAATGGTGAATTTATATTCAAATT^ 

AAACTTTCAGAGCACANATCATTGGTAATTTT 

Sequence 463 

CCGGGCAGGTACACGTGTTCAGTGTGTTGCACATCACAGCCCAGGTTGAACTCAGCCAGC 

AGGATCTTGGCAGCTTCTTCCTGCTTTGCAGGAAGGACAGAGGTTGAGGTAGGAACTTCA 

TAAGCAATTTGGAG AG AGGCTCCTCC CATATCCAGTATCCCTACTG* rCCTTCTCCGTCCT 

GCTGCCAATTCCTGGGTAGCCTCAGCATCTGATTCATCCTCGTGGTCGAATCTTCCCAAA 

ACAAAGTTGATTCCAATCCATGCATAAACCCCTTCCTGCTTCCCAGAGATCACTTCTGCT 

TG AG ACTGTG AAAAG AG G AAGTC AAA 

Sequence 464 

AGGTACATTTCTTGTTTAGGAGGGTTTTCCTATCTACCTTTCTACTGAAGTAGTTTCTGG 

AACTTTCCTGGTGGATCAGAGTTACGTAATGCAGTCTG/.GCCTrCAGACTGCTAGTTAGA 

ATTGTTTTAGGTGTTCAGAAAGGGCAAAATAGGCTGATGTGGCCTGTCAGAGTGATGTGT 

TCTCAAAAAAGTTCACTTGCACATCTGTGGGCCGCTTTTGTCCTCAGACCCTTAGTGGAC 

AGACTCCACAAACCCTCTGATGAGACGATTGATGTGGCCAGGGTCCAGTTAGCATCAGTA 

GAAGGATGTCACTAGGAAAGGCCCAGGTATCTGGTAAGT 

Sequence 465 

CCGCGGTGGCGGCCGAGGTACCTCAAACTCAGAGTTTCTTCCCTTCTTTGATTTTCTGGA 
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GGACCTGCAGCTGGCCTTCCTGAGACAGGCTCCATTCCTGTTCCA7TTGCCTTCCCGGCA 

GCCTTCCCTTTAGTGGGTATAGGTTTTGACGTTCTGAGTTACTTTGTATCAAAGAGCTAA 

TTAAAAATGGTCCTTCAAAMCATAAAGAAAAACAGCTTGAAAAATGTACCTGCCCGGGC 

GGCCGCCCGGGCAGGTACCAAAAACCAACATGACACACAGGAAAAAATAAAAGTGCAATT 

TTAATATAGTGAATGTGATACATGTATAATTCCTCATAACAAAATGGGCAAAACCTT 

Sequence 466 

CCGGGCAGGTACCTC AGCT ATCMCATTTCTGAGCTACCATTCAATGTTCCTCTGTGTCA 
TGGAGTGAAATTCTTGTTTTGTGGGTATTAGGAGTGTGGGAATGTGATAACCTAAACAAC 
CTTTGCTCTG AAATTCC ATTTTTCCCTCTTTCCCTGAGTTGTATTGACCTACAGAGTTAA 
TTTCCTTTGTATTTTTTTAAGAAM^ 

CCGGGCAGGTACTCCAAGATGAGCTTGACGCGGCTGTGCAGCATCTTGATGGCGCTGTGC 

TGTGCTATCAGGTGT 

Sequence 467 

CCGGGCAGGTACTTGGCATATATTAGGCCTTCAAATGATGATGGATGGATAGACAGATAA 
AAGGCCCAATGAATGATTGTAAATGTGTCTTCTCCTATGTATCCATGATGATATTGCCTA 
GTTTTTAATACAAGCTTAAGAATCAATAAAATGATAGATCTGAAATCTGTAGATGCCAGT 
CTGGCTCAGTATTMTACTGACAGTAGTT77TTTC 

AATATCTCTGTTAGAAAAGATTACAATTTGAAAGGATCATTCTACAATCTTAGAACTAAA 
AAGC AATTTAAG GC AAAGAGG 
Sequence 468 

CCGGG'CAGGTACTAGGTTAGMCATTGCTTAATCCTTTTAAAAAANATGCATTNACTGTA 

AACACAGAATACTGAAATTGGNGGA I MM 1 AACTATNTCTGACATAATTTTATTC ATCA 

ATTACATTACACATTCATTTANCCCAGATTTCAAATAGGGGGGGAAGAAAGAAACTGTAT 

TTCAGAGTAAAATCTCCTAAAGGAAATANAAACACAGAGTTGTAAATNCACATGCTTGCA 

AAAACATTAGTCGTGAAATCCCTAGCAACAAGTCACTGGATTTTTCTCTGTCAGCACGCG 

TGTCAGCTGCCAAA 

Sequence 469 

AGGTACAAATGTGGTGTGTCTTCCAACTTTCATTGAAAATGCCATATCTATACCATATTT 
TATTCG AGTCACTGATGATGTAATGATAT A I Mil I CATTATTATAGTAG AAT A 1 M M A 
T GGCA AGATATTTGTGGTCTTGATCATACTTATTAAAATAATGCCAAACACCAAATATGA 
ATTTTATGATGTACCAAGAAAGTCGGGATCGTCGGTAAATACGGGACCCGCTATGGGGCC 
TCCCTCCGGAAAATGGTGAAGAAAATTGAAATCAGCCAGCACGCCAAGTACCTGCCCG 
Sequence 470 

AGGTTTGAGCTCCATAGAGACAGCGCCGGGGCAAGTGAGAGCCGGACGGGCACTGGGCGA 

CTCTGTGCCTCGCTGAGGAAAAATAACTAAACATGGGCAAAGGAGATCCTAAGAAGCCGA 

GAGGCAAAATGTCATCATATGCATTTTTTGTGCAAACTTGTCGGGAGGAGCATAAGAAGA 

AGC AC CC AG ATGCTTCAGTCAACTTCTC AG AGTTTTCTAAG AAGTGCTC AGAGAG GTGGA 

AGACCATGTCTGCTAAAGAGAAAGGAAAATTTGAAGATATGGCAAAAGCGGACAAGGCCC 

GTTATGAAAGAGAAATGAAAACCTATATCC 

Sequence 471 

ACCGCNGTGGCGGCCGANGTACAGGCTGTGATNCGTGTGGCGATCGATCTTCTTAGATTC 

ACGGTATCTTCTGAGCAGCCGGCGCAAAATCCTCATTCTCCTNATCCATGTGACCTTNTC 

TGGCATTCGGGCATTGGCTGTNCGAATCAAANCACTTACATGAGGGGGCAAAGTCAGAGA 

CAGNTGAGGAGCTGAAGAAGGTGGCTCANGAGCTGGAGGAGAAGCTANNCATTCTCAACA 

ATAANTATAAGATTCTGCNGGCGGNCCAAGAACTGTGACCACAGGGCANGGCATCCACCA 

CCANAGATATGCCTGGCNGGGGCCAGGACAAAATGCAAACTTTNM MM I CTGAGACAG 

AGTCTTGCTCTGTCGCCA 

Sequence 472 

AGGTACAAGCTTACCTTTTAGGGTAGAAAAAGAMGATCATTTGAAAAATGTATCTAAAA 

TAATCCAGAGAACATAATGTTTGTCTTGGTCTGATAATGATAAGAAGTCAAGGATTGGCA 

GAGAAAATACTAAACGCCAAGAGTTGAGCCTGTGGGTCTCTCCATMGAGTTTTAAAACT 

CTTGCCAGTTACCACTTTATCCAATTrGCTATCATTTTCGTATTATCAGCTATCGCCCTG 

TAAAATATTCAAAACTAGCTATTTNTAAAGTAAACATTTTATCTGTTACTTTTM 

TAGGTGTCTTTGTCATCCTTCTACTATAAATTGTTCTTTGCCAACCTGTACCTGCCCG 

Sequence 473 

CCGGNCAGGGCTGGTTTGGGGCACAAGGAAGCCTTAGGGTATGGGGAAAGGCTGTTATTA 
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CCTAGAGTTTACTCCCAGGCCAGGGGGCTGCCATCTTCTTCACAGACATCCCTGAAAGGA 

AGCC CCT7T GGGGCAGGG AGGTGAG GACTTCATCTCMCATCGGCTGGTGGTTGGTAGGG 

GAGCTTTTTCTTTTCTTTCC I I I I I I I I GTTTTTGTTTTTG I i I i I GTTTTTGGTAACAT 

GTTAGGAGTCAATGTTGCAAAGAGTAGTTTACATCTTCACTTTCTGAAGACACTTGAATT 

TAGGACCGATGTATCTGTGACAAGCATGCCAGAAGTGGCAGGGGCCATCAGGGCTAACCA 

CTTCACACCTACCATCGTCCCATGGGGATCCAAGACCTGAGATAAAGCAACAGCCTGCCC 

AGATCCCTCTGTTCATCCTATCCCTTCCAAGGTTGGTCCATGCCAACATAACCTCTGGGC 

ATCAGACATCAGCAGGTCTGTGTGCCTCAGCCCTGTTAAGGGGCAGGTTTCTCTTTAGCC 

CTCTTCCTGCACTTGGGAGCAAANGCACTACCAGTNGAAAAAGGGCCATTCAGCCGTGCC 

CCCAACCTGGGACCCCTGGGGCTCAAATAGAGGTGCTGAGCCCCTGTGTNAAAAGTTGGT 

AAATGGTT 

Sequence 474 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTTTCAA 

ATATAC AAG G ATC ATTTACC AAGATAAACC ATAATTTTGG GCC ATAAAAC ATTTAGG ATT 

GAAAATACAGAGCAAATTCTCTAACCACAATAAATTTAAAAAATCGATAGCCAAAAGATA 

ACTGAAAACTCCCCCAAATGTrTGAAMTTMCACATTTCTAGAAAAGTCATAGATCAAA 

ATTAAATTACAAGGGAAATCTGACACTATTrTGAACTACGCGTTCATGAAAACACATCAA 

AACTTGTGAMTGCAGCTAAMCAGTGCTTGGAGAGAGAAAAGTTCTAAAATTACTAATA 

TAAACTTCTACCTTAAAAAAGCTAGMGAAAAGCAMCTCAAAGTAGAAGGAAAGAAATA 

AAGATAACTGCAGAAATCAATGCGATAGAAAATAAGCAAATAGAGAAAATTAGTAAAACC 

AAAAGTTGCTCCTTTAAATGATCAATAAA 

Sequence 475 

CCGCGGTGGCGGCCGCCCGGNCAGGGCTGGTTTGGGGCACAAGGAAGCCTTAGGGTATGG 
GGAMGGCTGTTATTACCTAGAGTTTACTCCCAGGCCAGGGGGCTGCCATCTTCTTCACA 
GACATCCCTGAAAGGAAGCCCCTTTGGGGCAGGGAGGTGAGGACTTCATCTCAACATCGG 
CTG GTGG TTGGTAGGGGAGCTTmCTTTTCTTTCCl MUM ( GTTTTTGTTTTTG1TT 
TTGTTTTTGGTAACATGTTAGGAGTCAATGTTGCAAAGAGTAGTTTACATCTTCACTTTC 
TG AAG AC ACTTGAATTTAGGACCGATGTATCTGTGACAAG C ATGCCATAAGTGG N AGGG G 
CCATCAGGGCTAACCACTTCACACCTACCATCGTCCCATGGGGATCCAAGACCTGAGATA 
AAGCAACAGCCTGCCCAGATCCCTCTGTTCATCCTATCCCTTCAAGGTTGGGTCCATG 
Sequence 476 

AGGTACTGTTCTATACTATTCAGGTATCTTrrTATTTCTGATAGTTTTATATTATAATAG 
AAAGCCAGCCACTGCTTAGCTATCATAGTCACCATTTTCTCACTGTTAACATTAGGAAAA 
TC AAGGCTACTATGCTTCAGGATTGTCTG GTTAAATAGTATG GGAAAAAAACTG AAGAGT 
TTCACATAATTACACAGGTGAAATAATTAAAGCTTAMCTGMTn"GTATTTCATTTTAT 
TGTCAGATGGTGGTGTTCACCAGCCTGTATCTTGTCTGAGACTGCATTCGTATCTGAGCA 
GGTTTTCTATGCCTACTGATGTCAGTATGTTTATACTAACCTTCATGCI IN! ICCCAGA 
ATCCCTCATCTGCCAGAAAACTTGAAMGTTTA7TGCTTGTAGAG7TGTACCTGCCCG 
Sequence 477 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGCMCAACTTAAGTGTTT 

TATCCTTACGGTCAGCTTTGGTTAAAACAGTAACATAACAAAATTAATAGAATCATGCAT 

CGCAGGTGATAAATGATTCAAAATGACAGGAAATCAATTTAATTAAGAAATAGCAACACT 

TAAAAAATGCATAGGGTTGAGCACCCGTATTATTGAGGGTGGATAATTTAAATGTATAGA 

AGAAA TCAGTTCAGAGTTTAATGATTGAATAACATTCTCTTTAGCAACTATGAGTGTGAA 

ATTTrGGTTAATGATACCGAAAATATTAGGTTATAGGMGTTACAGTATTACATAAGGTG 

AGCAAAATAATCTAATCCAATTAATATAACGCCATCTGGGGGCT 

Sequence 478 

CCGGGCAGGTACATTCCAGGCAAAAACAGGGATCTCAAGGNGGTCAAGAAGAACATTCTG 

GATAGCAAGCCCACTGCAAACAAGAAGTGCGACCTGATCAGCATCCCCAAGAAAACCACA 

GACACGGCCAGTGTGCAAMTGAAGCCMGTTGGATGAGATTTTAAAAGAGATCAAATCT 

ATAAAAGACACAATCTGCAATCAAGATGAGCGTATTTCCAAGTTAGAACAGCAGATGGCA 

AAGATAGCAGCCTGAAGGTCCCACCCCCACCCCTACAGAAAAAATGGGAGCAAGAACTTG 

TGCTTGGGAGCTGGTTATTGGTGTGGTCCTAGGGAGGGCGGAAAGGGAGGCACTGCCATT 

TGGAGACATTCCATTTCAGATTTGTCAACCAGCGATAGGCCACATTCCAGTAAGAACTCA 

ATT TGTCTC CCAA ATTT GCAGAAACAAAACGTGATTTAAA AGCTG AGCI I 1 i \ ATCAGAA 

AGCI INI I GATGTTTTAAGTGTTATGTGACTTGTTGAAC I \ 1 1 I AAAAAGTGCTNCTTT 
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GTACCTNGGCCGTCTAGAACTAGTGGATCCCC 
Sequence 479 

I^T^I^f^^^^^^^^^^^^^^^^^^CGGCCGAGGTACCAGAAACCCACCTCACCC 

^CT^CATCTAMGGGGCGGGGCCGTGGTCTGGTTCTGACm 

^I??™ C ^ MTCTCCmCGGMTGMTG ^CATGGMGAGG 

^ A ^J G JJ^ AGTGCTGCA ^CGACATGGAAAAGTCCTmM 

CJ CC T™CTrcTNCTCTACMTC^^ 

AATCTCTTGTTTGCCAGGGTTCCTAAA7TAATTCACTTAACCATGATGCAAA 
Sequence 480 

AGG I A f AGG 3J GAAC ^^ 

A ^nj^JI CCAGCATOCGTC A<^GTGGAAAATTTTAC^^ 

?J? G .^ G ^^ AGTGCAATCAAAACmG ^ CCATGCCCAA GAmrr/\AAM 

TAAAATTACCTTCAAGCTATGTTGTATAAGGTGTATATAAAACAAATGAATTTTGTGTTT 
A ? AC J3 GGGTCTACTCCCAAGATA CCTCATTATATATATACAAAMTCCCCAAATCCAAA 
AACAmGAAACACTTCTGGTCCCACACATTTTGGATAAGGGATATTTGACCTATATCTA 
ATTATATAATAAGAAAAATGTATTTTCT 1111111111111 I GAGATGGAGTTTCACTCT 

TGTTGTCGCCCAGGCTGCAATGGCACAATCTTGGCTTACTACAACCTCCGCCTCCCTAGT 

I GA ^ GGGA ^ CTCCTGCCTCAGCC TCCCAGGTAGCTGGGATTACAGGCATGTGCCACCAT 
GCCTGGCTAAT 

Sequence 481 

CCG^CAGGTGGCAGCTCTGG 

T!^ G I GCTGGATG ^^ 
A ™£ ATCA ^ GG ^ 

AGTTCTACACCAAGGCCGAGAGCCTCATTGGAGTCTATCCTGAGCAAGGGGACTGTGTGA 

TCAGCAAAGTCCTGACCTTCGACCTCACCAMTTTCCTGTTAATCGAAATTTAACCCGAA 

AGG HJ? GA ^ AGG ^ AA ^ G ^ AGG ^^AAMCAGGCTTGGTGTCATCTACCTGGGACAGAC 

AGTTTTACTTCACGCAGGGTGGAAATTTAATGAGTCAGGCCCGTGGGGATGTAGCAGGAG 

?^J^ CATGGACATAGACAAC TGTTCAGTGATGGCTGTGGACTGTGAAGACAGACGAT 

ATTGTTTTCAGATCACCTCTTTCGATGGAAAAAAATCTTCMTTTTGCAAGCAGAGAGTA 

AAAAAGATCATGAAGAGTGGATCTGTACCT 

Sequence 482 

CCGGGCAGGTACAGAAGAGAGATGACGCCTCCGGCTGGGGAGGGAGGGGCTGGCGCCTCT 

GAGAGAGCCCATTTATTATCAGGGTCACGGCCAAGACCGTGGGCTTGGTTTCTAGGTAGG 

TGGACTTTGAGGGGCAMGACAACCCACTTTTCCTGCCCCCTACCCCCGGCTCTTAGAGA 

CCATTTAAATCCATATCCAAAGTATATCCAGAGCCACCTGGGACCATCAGGCCTCTGGCT 

AGTCTTTCTGGGCCAAACCTGTCCTCCACCCTGAGGACCCTCAGGGAACCAGGAAGGTGA 

GGGGCTGGCTCCGCCCCACCTGTAGCACAAGAGCTGGCCCAGGAGGAGGCTGAAGATGTG 

TTTCTAGTTCCTGCATGCTGCTTTGTTGTGTGACCTCGGACATGTCACCTCCAGCCTTCT 

TCACCCACCTCACCTATCAACTAGGGCTAGTAATTGCCTGTGGTGGGTGGGCTGGGGGGC 

TCTGATGCCAAAATGCAGGTAGAAGACACACTGTGAGAGGCAAACGTCTTCACTCATCCT 

GCAGGGAAAGGCCAAGTGAGCATCTGGGACCCATTCGAGCTGAATGACCCCCAC 
Sequence 483 

CCGGGCAGGTACGCAAGCCTCTTCTCTCCCTTGATGTGGTAGCTACAGGCAGTTACATTC 
CTTrGCTGCTTGTGAGAAGCTTACATTTTC 

CAAAGTAMCATTACAAGAATATGAACTTGYTATTGGGGGAAAGGGGAAGTGAAGCAATC 
TGTAGAAMTATTTAACTGAAATTACAMCATMGGACAAGCCTTTAAAGCAAATTATAn 

AGGCATTTTTGTCCTCTCTCTGGTCCCTGCAGTTAATTAACAGGGCCACAGATAATCTGA 

CATCTTTAGATAAAACAACTACACACCTATATTACTTAAGTTAAAGCCAGTAAGTTCAAA 

AAAAGCAATTACMCCATTTGACATGTTTGTATTTTTCATCTCTAACTCCTGATTTCCTA 

AATTACATTCAGTCATACAAACATGGCTGAACAACAGCAAATGGGATCTGACTCTGGGAT 

GmACTTCTTGGGATTAAMTTCCCAMCACTTTGGTGTCTCTTTCTTGTGAATCTATA 

CAACATTTCCMTAAATGTTTAATAATGATTTTTTAAAAGATATTGCCAAA^ 

GATTTAAGATGAAATAAAAAC 

Sequence 484 

CCGGGCAGGTGGCCTGTTCACTTTCTCTTACTCACTGTCTATTCACTTGTCCTGTTCACT 
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CGTCTGGAAGATCTCAGCCAGCACCATGACTGACAATGAGCTGTCTGCCTTGGTAGTGGA 

TAATGGGTCAGGGATGTGCAAGGCAGGCTTTGGTGGTGACGATGCCCCCCGGGCTGTGTT 

CCCCTCCATGATAGGGCGTCCTCGACACCAGGGCGTTATGGTAGGCATGGGCCAGAAGGA 

CTGCTACGTGGGAGATGAGGCTCAGAGCAAGAGAGGCGTCCTGACCCTGAAGTATCCTAT 

CGAGCATGGAGTGGTCACCAACTGGGACGATATGGAAAAGATCTGGTACCT 

Sequence 485 

NCGAGGTGGACAANAAACACGNATGGCTAGGANAAACTATCAATGCTGGCAGCCAGNTTG 

AATANAATGTGGAAGGAGTGACTTNCAAGGAAANGGCTACCCAACTNGCCTNCATGCGCC 

TGCTGGCCAACTATGCCTCTCANAACATCACCTACCACTTGCAAGAACAGCATTGCATAC 

ATGGATGAGGAGACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTCTAATGATGTT 

GAACTTGTTTGCTGAGGGGCAACAGCAGGTTNCACCTTACACTGTTTCTTTGTAAGATGG 

CCTGCTCTAAAAAAGACAAAATGAATNGGGGAAAAGACAACTCATTTGAATACAAANACA 

AAAATAAGCCCATCACCGCNCTGCCCTTNCTTGANATTGGCACCCTTTTGGACATCGGGG 

GGTGCCTGACNAAGGAAATTCTTTTGCNGGACATNNGGCCCANTNCCTNGTTNCAAAANA 

AAANTGAACCTCAAATCTAAATTAAAAAAAAGGAAAAGAAAAATTT^ 

TCCTCTTTTNGCCNAGNGNCCTTNGTTTCNTCCCTTTT 

Sequence 486 

AGGTNNTGACACACAANTACANACATAGTCCCACTTAGGGTGNTGATATTNCTTTAAAAA 
AAAAAAC AANGGGCTC AC AAACAATAAGAC ATC AC AAATAAAAC ATC NAG AACC AC AN TT 
GCCAANAAATGCCAAGACCAATATGCTACTGAAACAACTGGTAGAAATGTCACANACAAN 
N TATTC AN N ATGTATTG NAATAATTACTTG N AGTTAANG AN NG CG AAAGGTAAAACC AAA 
CNGGGANAAGAANTAACTCCTTGTAAACTGGGCTCTTGGGNTNNAAAGAGGATCTTTTNA 
C AANNTTANTTTTNTAAACC ATTACCCTTTGTTTATAAAAAAAAANGTAN CCAAAN CTAC 
AATGGTNNACCCCGGANGGGGA 
Sequence 487 

TACGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGAGACAGGCAAGCTCGGATTCA 

ACTACCTTAGATAATATTTCTGAAAACCTACCAGCCNGAGGGTAGGGCACAAAGATGGAT 

GTAATGCACTTTGGGAGGCCAAGGCGGGAGGATTGCTTGAGCCCAGGAGTTCAAAGACCA 

G CCTGG GC AAC ATACC AAGACCC CCGTCTCTTT AAAAATAT ATAT ATTTTAAAT ATACTT 

AAATATATATTTCTAATATCTTTAAATATATATATATATTTTAAAGACCAATTTATGGGA 

GAATTGCACACAGATGTGAAATGAATGTAATCTAATAGAAGCCTAATCAGCCCACCATGT 

TCTNCACTGGAAAAATCCTCTTTCTTTGGGGG I 1 i \ \ CTTTCTTTC \ III I I GATTTTGC 

ACTGGGACGGTGGACGTCAGCCCATGTACCTGCCCCGGGCCGGNCGCTCTAGACCTAGGG 

nnnnnnnnnnnnnnnnnnnnnnnnnGATATCAAGCTTATCGATACCCGTCGACCTCGANG 

GGGGGGGC 

Sequence 488 

GANTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAANGCAAGAAAGCTGGGG 

GGCTCCATGTTCACTGCCAACCCATGGATCTGTATATCAGGAGAATTGGGTGAGACACAG 

ATCATGCAGATTCCCAGGAATGTGCTAGAGATGACCTTNGAGTGCCAGAACTTGGGGAAG 

CTTACTACTGTCCAGATTGGCCATGATAACTCTGGGCTGTATGCCAAATGGCTGGTGGAG 

TATGTGATGGTCAGGAATGAGATCACAGGACATACCTACAAGTTCCCGTGTGGCCGGTGG 

TTAGGGAAGGGCATGGATGATGGAAGCCTGGA 

Sequence 489 

CCGGGCAGGTACAAGGTAGTG GAATTATNTCTTATATTGC I I I 1 I CCAAAAAGTAAAAAC 

CAAAAATGTGGGCNCGGGTGTGGTGGCTCATGCCTGTAATCCCAAGCACTTTGGGAGGCT 

GANGCAGGCGAATCACANNGCCAGGAAGTTCAAAGACCAGCCTGCCCAACACAGTGAAAC 

CCCGCNCTNTACTAAAAATACAAAATTAGCCAGATGTGGTGGCGCACACCTGTAATCCCA 

GCTACTCAGGGAAGCCTGAGGCAGGAAAATCGCTTGAACCCTGGGNAGGTGGGAAGTTGG 

C AAGTGG AAGCCCGAGAATCG CACCACTTGCACTTCC AAG CCTNG G GTTGG AC AAAG AAG 

CCGAAAACTTCCANCTTNAAAAAAAAAAAAAAAAAACCCAAAAGTGGAAMCACTTAAM 

AAATTC C C CTAATAG ATAATATTTC C C AA G G AAAATTATTTTTAAATT 

Sequence 490 

CCGGGCAGGTACCAACATTTATTAATCCTCACAACACCCCTGTGAGGTAGGTCAGTATGG 
NCTTTAGAGTCGNGAACTGAGGCAGAGGTCAAGCAAACCTGCCCTGGGCCACAGAGCAGC 
ACGATGAAGGGCCTANACCTGGATCCAGAAGCTAGGGGCTCTTCGGNCCAGCATTCATCC 
ACTGGTGGACATNACATGGGGCTTATTTTTACCAAGCCGAAGGTTACCGTGAAGGGACAA 
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AACCGCACTCANCCAGCAACCGGGAAACTCAAACAGTTCAAACAGCACCTGGGGGGAACA 

TGTTCAGTGTAAAAGGAGACGAAAACCGCTGACCAGCTCATGAATGGAGGGCAAGGACAA 

CCATGGCGGCTTGAGGGAAAAGTGTGGGGAATCAATCACCGACCTGGGGGGATTTAGGAA 

CCCAAAGAAACCGGGGGCGGCTTCAGGGGAGGGGGTTTCMGGGAAAAATGGTTAAAACA 

AAACCTNAGGGAACCCCCATCCCAAGGGGGGGGGACCAGGGCCCCAAAATGGCCTTACCG 

GNTCTTNCAAAATGGGTAGAAAAAAATTTTGAAAAAAGGAATAGGNGGGNAAAA 

Sequence 491 

TCAGATCCAAAGCCAGCACGGCACCGGGTTTCTTGCCTAAGGTCCATGGTGACCACTCCT 

TGGCTATTGCCMTGTTCACTCAAGGCCCAAGCACTCTTTATTCAGCTGGTGGCAAATCC 

AGCCAGGCTTGTGTTTTTCCCTTCAGTATGACGAGCTCCCCCTAAGCCCAAGACAAGTCC 

TAAAATGfCATNTGGGAGCCAGGACCTGGAATCAGGATCCTTAGGAATTTACTTGGTGCT 

GTATTCTACTATGGCTGAGCTGGCAATCAMGTGGCAAAGATTAMGCCCTTTATTTTCT 

TTGNTCTCCTTTCCTC 

Sequence 492 

AACGCGGNGNGGGACTTGNGGGCCAAGCAGGAGGGAAAGCCTGTCTGGAAGTTACNTGTG 

GACATGGNGNCCAGGACGCTGGTATCCTGCATAGATTTCAGGTACCTCGGCCGCTCTAGA 

ACTAAGTGGATCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGATACCCGTCGA 

CCTCGAGGGGGGGGCCCGGNACCnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTAATTGCC 

GCGCTTGGCGTMTTCATGGGNCAATMGCTTGTTNTCCTGTNGTGAAAAATTGTTATCG 

CGCTTCACAATTTCCACACAAACAATACGAAGCNCGGGNAAGCATAAAAGGTGGTANAAA 

GCCTGGGG 

Sequence 493 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTGTGGGGTGCAG 

TGAATGAAACTAAAGAGTTGTTGCACCTGAAAAAGACACCAGTGATTGGGTATTGGGTAG 

TCGATTGGAGACTTTGAI ITTTTTTTTTn I CACATGGGAAAGTCTTAGATNCCAAGTAG 

ATGGAATTNGCMTCCTTTGCAGTTTGTTCTCATTACAGATMTCMCATTTNTTTTGTA 

TAATTCAATACATGATCAAGTTAATTGGTAAA I I I I I ATTGAAATTTAAAACTTGCATTT 

TTATATATGTGTATGTGTACCT 

Sequence 494 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTGAATGATATTTNGA 
CCACATCCATGCAGGGTGCTNACACTGNGACATNNGGAACTGATATCAGGTCAGCTCTCT 
GCCTG N CTCANTATGCATCACTATATCTCATCAATATG AATCATATTGATAN C NGAAGGN 
G NTG AAACAAATG AAG C N GGC NTTTATATATTAACTGTGTGTNN AGTGCGTATAG AACAT 
GGGNTCACATTCACTAATATTATGTGTATTCATACACATGCACAAAACCTT 
Sequence 495 

GGTACTTTACCCTTCMCGGGACANGAAACTGCNCCAGAAGTCAACTTCACATTTGAAGG 
AGAAAC GGGAAAGGNTCC AN G ATTGAAGAAGACAAC AC ATTTTATC AAAGACTTAAGTCC 
ATG^AGGMCNCGNTAGAAGCACANAATATTCCAGACMTTTTGGAAATGTATCTCCAG 
AAATGACGCTCGNNCTACATTTAGCCTGGGNCTGCCTGNGGCTTATATAANATTGGCAAG 
AAATTCCTACTTGAAGACACMTGGTATAAAMTGGGNNAAAAAATTNAANACTGTCAAG 
CCAAGGTGGCAAAAGCAAATGAN N GACCTTTTATTGGAATTTAAGANTACACCATTCTAC 
TTTCATTAAAT 
Sequence 496 

AGGT/\CTGTTCTACCAACAGCTCTGATTTGGTGAGTGCATTTTCTACGCCACTGCTCACG 

AGCTGCATNATCCGACTCCCCAAGACTGTGTTAATGCTGCCACTGACCACAGACTTGGTC 

TTCTCCACACTGCCAGTCACTGCCCCTTTGGTCTTGTCCATCACCCCTGTGATCGTGCTG 

GCCACAGMTCCTTGGCCCCAGTCACAGTANGTCCGTCACAGCATCTTTTTGCCCCAGTC 

AC/.GCGCCTTTGGCATTTGGGCAACAAATCTTGAGTTTGATGGGGCTGATTCAAGAAATA 

AG3CAAGTTCCTCTCCTCAATCCCTGGTCTAAGCCCCCTTACAGGGCATAAGGGTATTGG 

GGCCAACTGGCAAATTTTGCGGGCTTCCTAGCTTTCTGGGAATGAATGGGGCCAAGAAGC 

ACTGGGTCATGGGCCCACCGGGAGGGTGAATGGTCTTTCAACAACCGTTTCCTCNTGCCC 

ATCTTCACAACAACAGGAACTTTCAAGGGTAGGGGAATACCTGGGGTCCCTTTGGTAACC 

TTGCCCCCGGGGCGGGCCCGGTTCTTAGAAACTAAGGNGGGAATCCC 

Sequence 497 

GAGCTCCACCGCNGTGGCGGCCGAGGTACTTTTCTACCTTAAAAAAATCAGTGAGGATAT 
TTATTTAATGAACATCAATTCCTTTTAAATTTTCTTAGAGAAATNGNCTCTGTGGCTCAG 
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TTTTACCACCCATAAAGCGGAGACAGTAATTTATGGTTATTCCTTTCTGACCCACAAAGT 
ATGAAAAGTTCTTGTAANCTGTAAACTCAGTTCTGNAATCTGCATTATTGAGAT GATTA A 
TATAMGTTGTATTTTCACTGAAATGANTGTTTTGCTC 
GCCGGGCTTA I I I I I I IGAAAG 

Sequence 498 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CAGTTAATCA TTTT 

GACTGTGTCAGTAATATAATCAGAATACTGTAACGGTAAGAGAAAAATACI MINIMI 

TTTGAGACTGGGTCTCGCTCTGTCGCCCAAGCTGGAATGCAGTGGCGCAATCACACCTCA 

CTGCAGCCTCAACCTCCTGGGCTCAAGGATCCTC TAGCC TCAGC CTCCTG AGTAGCTGGA 

ACTACATGCATGCACCACCACACTCAAGCTNAI Mill' ITAAATTTATTTTTAGTAGAGA 

CAGGGTCTCACTATGTTGCCTAGGCTAGTCTTGAACTCCTGGGCTCAAGTGATCCTCCTG 

CCTNAGCCTNCCAAAGTGCTGCGATTACAAGCGTGAGCCACCGCCTGGCCCGAAAAATAC 

CTTGGGTTTTAATCAATTCCCATTCANCAATAGTGAAAAATGCTNTATGGTCCTTCCGGC 

CCNTTTAAGAAACTAGGTGGGATTCCCCCC 

Sequence 499 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACCTGGGGGGGTTTGCT 

TTCCTGCCTTTTCTCTGGTTGGTCAACATCTTCTGGTTCTTCCGAGAGGCCTTCCTTGTC 

CCAGCCTACACAGAACAGAGCCAAATCAAAGGCTAGCTGGAGATGGCCAAAGCCCGGAAC 

CAACTGGATGCTGTCTTGCAGTGTCTGCTGGAGAAGAGTCACATGGACAGGGAGCGTCTG 

GATGAGGAAGCTGGGAAAACACCCTCAGACACCCACAATAAGGACTGCTCCATCGCAGCC 

ACTGGCAAAAGGCCATCTGCCCGCTTCCCCCACCAGCGGAGGAAGAAGAGGAGGGAGATG 

GATGATGGGCTGGCTGAGGGAGGGCCGCAGCGATCCAACACATATGTGATCAAGCTGTTC 

GACCGGAGCGTGGACTTGGCCCAGTTCAGCGAGAACACGCCACTGTACCTGCCCG 

Sequence 500 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGAGCTCAGACCCGGGGAGAC 

CCTCAACGTCAACTTCCTCCTGCGAATGGACCGCGCCCACGNGGCCNNGATCCGCTACTA 

CACCTACCTGATCATGAACAAGGGCAGGCTGTTGAAGGCNGGACGCCAGGTGCGAGAGCC 

CGGCCAGGACCTGGTGGTGCTGNCCCTGTCCATCACCACCGACTTNATCCCTTCCTTCCG 

CCTGGTGGCGTACCTGCCCGGGCGGCCGAGGTACTTNCTACCACTGCTGCATGAGTATAA 

TGCTC C GG AATTATC AGAAAG C ATAATGC AG AAATACG AATTANTG G AACTTAATC ATGT 

GCCANATAAGCTTACCTAACAAACAGTTATATCCCTATTCCTCAACTGAATGTCTTGTAA 

TAAATAAGMTTNATCATTTATTTTCTGCMCTTTTTTTATGTCTCATTCT^ 

AANAACCCTTAATATTTTAACAAAAATTTCTTC 

NAAA 

Sequence 501 

AGGTACNGTGTGGCTTGGTCACTTCGTGGCTAAGGTAAGAACGTGCTTGTGGAAGACAAG 

TCTGTGGNTTGGTGAGTCTGTGTGGCCAGCAGCCTCTGATCTGTGCAGGGTATTAACGTG 

TCAGGGCTGAAGTGTTCTTGGGATTTCTTCTAAGAGGGCTGGCAAAGAACCANGTTGTTT 

TTGTCTTGCGGGTCTGTCAAGGGTTGGAAAGTCCAAGCCGTAGGACCC AAGTTT CCTTTC 

TTMGCTGATGTCTTTTGGCCAAGAACACCCGTGGGGCCTGTTACTTTGTTTTTTGAGTT 

GGGAAGCCCGGGTTTGCAATTTTANCGCCCCTGTAAAAMTGGTATTTCAATTTCTT 

TTTTTAATGGTNAAANGGN I \ \ It I M li I GGTAACCCTTNGCCCCGGGGGCGGGCCCGC 

TTCTTANAMCCTTAAGTGGGGAATCCCCCCCCGGGGCTTGCAAGGGAAAAATTTCGATA 

ATCAAAAGCCTTTTAATTCGAAATAACCCCCTNCNAACCNTTCCAAAGGGGGGGG 

Sequence 502 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGAACATACACACACAC 

ACACACACACACACACACACAGAGGCCAATTTCCAGTGAGGAACAGGCTCCACAGAAGAA 

GCTGATAGGCAGGGGGGATCAAACGCTGCTCCCCTGCTCACCGATCTACTTCTCCAAATA 

CAATATCTTGGGTACTTACAACTATAAAAAGATACAATAACTC T ACCAA7TATAAATAAT 

GTAGCATTTCATATTAAAGACATTATCGTACCTGCCCG 

Sequence 503 

CTGTTCCTACGGCAGTCAAGAGGCCCAGGCTCTGTGGGCTCCAGCTCTGCATTTCCTGGT 

TCTGGGGNTGGGGCTGGGATGACTTCCTGTTGGACTTGCTGCTGGGACTGGAACTGGAAC 

TGTTCCTCGGAGGGCCGAGGAGTCACCTCTGCAGCCAGGGGAGGATAAGGGGGTCTGCTC 

CCTCTACCCCTCCCAGGGGGTCTCCCCACCCCAGCTGCCGGCCCGTCCTATCCTACCCCT 

AGCCCGTTACCTTGATAATCATAGTAGTCTGGGTTGTCGATCTGGTCGCTATAGTGGGTG 
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TACCTGCCCG 
Sequence 504 

AGGTNCATTCAG AATCCC ACGAGCAATTCAG ATAG ACCAGTCACCAGGCC TCTAGC TAAA 

AGAGCACTGAAATACACAGATGAAAAAGAGACGGAGGGTTCTAAGCCAACAAAAACTCCT 

ACCACTACACCACCTGAAACTCAGCAGTCACCTCATCTTAGCCTGAAGGATATCACCAAT 

GTCTCCTTGTATCCTGTTGTGAAAATCAGMGACTTTCTCTTTCNTCCAAAAAAGAATAA 

AGCCAAGCCCAGCAGNGGGCTCTG CCTA AACCGTAGGGGGGCACAANGCCAGCCGTNGAA 

CTATTAAG GGAG GCC CACNCCT TGCTTTTGNAAAACTGMGAAAAGAGGGGGGACCCCTT 

TTACAGAATTTTGNTG i HIM ( GGAAATTCTCCCTATTTTTCAAGCANAAAAAAGGG AT 

TTNAGGACCGTTTCTAAAAAAAAAGGNATTGAAACCAAAATACC 

Sequence 505 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTNTTTGTCACTTAAA 

GGTNCT7TCTGTAANCTGCTTCAGATNCTTTNACTATTCAATTTTTAATNCTTAATNTCT 

GTAAAGA ACGTT MTATTCCTCTTTATAATCAATCTTTCCCAGTTNGCCTNAAAAATGTA 

TTCCCTACTTTTGCTTCAGGAGATCAT 

Sequence 506 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGMGGCACTGAATA 

AATTCACAAAGGCCGATTGGTTCACCCATTCTTTTAGNGACAACAGACACGCAATTCTGA 

CGAGGACTCCTGTTACTAAAAGACACAGCCTCTGATACAAGAGAGATATCCCTTTGACTA 

AAGCATTACCAGGGTCCCCAGGGCCCCCTCCCACTGGGGCGGNAACACTACGGGTCTCCC 

CACCATATATTCCATGTCAAAGTATCTACACAAATACAGAGGAAATTAAGCAAGTAAATA 

CGGTATGTMTTGTTATCATTTGTATTTCTTTAAGGCATATTTATAMTATm 

AACAATATGAGTGAGTGCCTTTCATTAGCTATGATCTTTCATACTGATATAT 

Sequence 507 

GGNGAATNGGAGCTNCACCGGGGNGGCGGCCGCCCGGGCAGGTACATAACATTTCAAATA 

TAAGTGGAAGGATCATCAGTAGTGTTATCAAAATGCATAGGNACAGANNCTTTTTAAGAA 

AGGATAAAAAATT ACACT CAGGACCCATAACTCTTCCTCATTATAAGCATATGTAGTGAT 

TCATTCATGCAGGTTTTTATATGTAG ATAAGGA 1 I M 1 III I NCTTTTCAAGAATTCCAT 

TGTANGCCATGMGATGAAAAATTGTATTATGGGNMTNGGTATAGCTTTTCTTCTATTT 

TGCTTTCAGTGTT AAGGG ATTTGCCTAAMGGCCTTATTTTAAAAATTCCCCAACCTGAA 

CAATAAATNGNNGTTTTTCCAAATAAANGGGAGGGAACGCTTNNCCTAACCTCCCGGGGA 

AGACGGGGGGGCCCNGGNTTTCCCCTTGGANGCCAAACCCTTTNGGNTCCGNTTTCTAAG 

AAACAT AAGTG GGGANTNCCCCCCCGGGGGCTTNGCAAAGGGAAATTNTCGNNNTATTCC 

AAAGGCTTTTAATCCGGAANANCCCGNGGCGGANCCCCCNTNANGGGGGGGG 

Sequence 508 

ATNGGAGCTCCCCGCGGTGGCGGCCGCCGCCTCCAGCCGTGTGCCGCTATGGGAGTCCCG 

GCGTTCTTCCGCTGGCTCAGCCGCAAGTACCCGGCCATCATAGTCAACTGCGTGGAAGAG 

AAGCCAAAAGAATGCAATGGTGTAAAGATTCCAGTTGATGCCAGTAAACCTAATCCAAAT 

GATGTGGAGTTTGATAATCTGTATTTGGATATGAATGGAATCATCCATCCCTGTACTCAT 

CCTGAAGACAAACCAGCACCAAAAMTGAAGATGAAATGATGGTTGCMTTTTTGAGTAC 

ATTGACAGACTTTTCAGTATTGTAAGACCAAGAAGACTTCTCTACATGGCAATAGATGGA 

GTGGCACCCACGTGCTAAAATGAACCAGGAGCGTTCAAGGGAGGTTCAGGGCATCAAAAG 

AAGGAATGGAAGCCAGCAGTCGAGAAACAGCGAAGTCAGGGAAAGAAATATTGGCAAAAA 

GGTGGCTTTTCTTTCCT 

Sequence 509 

TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCGCTTGGAACCTT 
TTGGGGATGTGACAGAGAGGAAGCAGCTCCGGGACAAGCTCCAGTGTAAAGACTTCAAGT 
GGTTCTTGGAGACTGTGTATCCAGAACTGCATGTGCCTGAGGACAGGCCTGGTTTCTTCG 
GGATGCTCCAGAACAAAGGACTAACAGACTACTGCTTTGACTATAACCCTCCCGATGAAA 
ACCAGATTGTGGGACACCA GGTCATTCTGTACCTCGGCCGCCCGGGCAGGTC U I 11 M 1 
I i M M I I I I 1 I I I I I I I I II I I I It ! I M I I I I CCATGGNAAATTTNGCTTTTNTT 
Sequence 510 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACC 
TTTAAGACTGGGATAACTATTGGAAACAATAGCTAATACCGGATATAGTTATTTATCGCA 
TGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAAAATGGGGGTGCCGGAACATTA 
GTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGT 
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ACCT 

Sequence 51 1 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNNGNNCNGNTAGCCATNAATCATAN 

ATATAGGANACAGCANCTATACANCTGACATNACCANTTAATNTTATATNATGAAAGCAN 

NNCATNTGCNTGTGCATCAAGGCCAGTCCTATTCAACCNANCTNTCGAATGCTGATANCT 

GGATAGNATGTCATNNTGMGNTGNCACATAACTTNTCTAAAAAAAaGCANTCTTTGTTG 

NNTGCTTCTTCCCTACNGATGACTTCTAAAAATATATGACAGGGTATAAAAAAATTAGCT 

ATACATGATCATATCAACACATGTAACTGCTGAAATGGCATTCTA 

Sequence 512 

CCGGGCAGGTACAGGTTCAAATGAAATTCAGGTTGTTGCAGGAGACCATGTACATNTTGC 
AGTATGGGCCGGCGAGTTATGTTAATATGCAAGGTTAAGCAGAAAAAAGCGGANCCGTAG 
GGAAACCGAGTNTGAATANGGGCGACTTTAGTATATTGGCATATACCCCCC 
Sequence 513 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTAGCACCATCTGCTAGACTT 

GAAAAATCTACTATGATATCAATTATTTTAGACTTCTTGTTTGCA 

AAATATGGAATTATTTrATATGAAAAATAGTTGTCCCACAGGTTATTTTTATCTTTAATA 

GTTATATATGAGAGGAAGGGTCAACAAAACAATATAGATAACACTGGAGTCTCGCTGTAG 

TATACTCATTTGGTTCTGTTAAAAAGAGTTTTGTATTATAGAATGTACCTGCCCG 

Sequence 514 

AGGTACTCCTTTNTTAAAAACACTGNAAAAGTAACCACACATATGTGAGGACTTACTATN 
TTAAATGGAATGGANTGAGCTCCATAGATTAGTTNTGAATATAAAGTATATAAAAGTGCA 
TCAG N G GTTTATATAGGCTTTAAAMCATGTTATCTTACAGTCCTTTAAAGCAGCCATAG 
AGTTTGTATCATTTTTCAAGCCMTTTCAGTCAGGGATTTGAATTGNTTGANTATTGGAT 
G ATAAANN GN GTCCATATCTTATNAANATGN GNTCN NGTC NCGTTCNTTNGTTAN ATATG 
ATTNAGGGGGGAATTCA I I I I I I I I I NCCCTCCATGTTAAATNTCANAAAACNNTAAACA 
GTTGGCCNTTTNTANTNGGNNTTCCTTGCNCCCCAAGGNTTNGNNNCNCCCCNNGGGGGG 
NAA AAATTTT AAAAN I Mill) i NCNNAAAAANGAGGNGGAGGNNNCCCCCCCCCCCCCC 
CCCTTTTTTT 
Sequence 515 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCTCANGCAAAAATTGCACGNGGTTG 
ATGCCTTACATGAAGTACCAGTGAANAAAGGNGAAGGTGCCGAGCTATAAACCTCCAGAA 
TATTATTA 
Sequence 516 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACCAGTGGAGGAAG 

GCCTTCCGGCGGAACATGGCAGTGAACTGCTCCGAGATGCGCTTGAAGAGCTCCTGGATG 

GCTGTGCTATTGCCAATGAAGGTGACTGCCATCTTGAGGCCACGAGGNGGGATGTCACAG 

ACTCGGTCCTTGTCGCCTGGAAGGCCCCGCGCCGGGAGCGGTCGACCTCGGCCGAGGTAC 

CCACCACCACTGTATGATGCTCATGAGCTCTGGCNTGCCATGAAGGGAGTAGGCACTGAT 

GAGAATTGCCTCATTGAAATACTAAGCTTCAAGAACAAATGGAGA 

Sequence 517 

GGCMAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCACTGGGCTTTATA 

AAGACTGCACTCAGAACCACACTGCACAGTCCAGTTTTTTAAAAAGCTGCTACATGACAG 

ACAGGTAATCCCACTGAGTGAGTTTTGAGAAACAAATCAAACGAAGTAAACAAGAAACAT 

AAA ^ACCAAATAGCAAATGAATAAAAGCCTGTTCTTGTAACTTAAAAAAAAAAAAAAAAA 

AA^AAAAAAAAAAAAAAAA 

Sequence 518 

C-GGGGCAAGGTACCATCTGAATACCTCTTTGAAAGAAGGAAGACTTTACGTAGTGTAGAT 
T":"GTTTTGTGTTGTTTGAAMTATTATCTTTGTAATTA1 1 \ \ \ AATATGTAAGGAATGCT 
T GG AATATCTG CTATATGTCAACTTTATGC AGCTTCCTTTTG AGGG ACAAATTTAAAACA 
AACAACCCCCCATCACAAACTTAAAGGATTGCMGGGCCAGATCTGTTAAGTGGTTTCAT 
AGGAGACACATCCAGCAATTGTGTGGTCAGTGGCTCTTTTACCCAATAAGATACATCACA 
GTCACATGCTTGATGGTTTATGTTGACCTMGATTTATTTTGTTAAAATCTCTCTCTGTT 
GTGGTTCGGTCTTGGTCTGGTTTGG 
Sequence 519 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGTGAAATTAACTGGCATCCTGG 
TGGGCCCAAGGGTTTTCAGGACTGGGGGCCAATGACTCACCCCCTCCTTCCTCCTCCTGA 
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TCCCTATCTCTAGC TCTTA TCACAGATTTTGAACAATTGTCTGTGAGGTTAATGATGGTT 

TCAGAGGGAAGCCCTTTTCCTCCCTGAGACTGTGTGGGGTTCAGTCAGCCTGCTGAAATT 

GCTTCCACTTATTACCCATCCTTCCTCTTAAAAAAAAAAAAAGCCCACCAAGTTAGTATT 

CTCTGTAGCTCTCAGACAGCTACAAGTGTTCCTGGCATATTTACCAAAGTACCTGCCCGG 

GCGGCCGCTCTAGAACTAG 

Sequence 520 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTTTTAGTG 

TGAATTTACATAGMTAMTTTACTTCACTTTCATGTCATCGACATGAATGACACAAAAG 

CTACTTCAT MTAC TACTTTACAATAGTTTTCAACATTTCCATATGGTGCGACCCCTTTG 

CTCTCATCMTTTTGGGTGTCATGAGAACAATAGGTATCCCGTTGGACATGATGTATTGC 

GMGAGCATATAMGCAGAGGGAAMTGAAAAAGCMGAGAAACTCATTTCAATGCTTTT 

TCTAAAAGGTAACAAATATAATmAATCMCTTCCTTGGAAAATATTT^ 

ATCAATAGAAAAAATTACAAAACATCATATGAAGCTATAAATAATTTTGAAAAACACCTC 

GGCCGCTCTAGAACTAG 

Sequence 521 

AGGTTTAACCCCCCAGGGNTAAACCCCACCCAAAGGNNAAAGANGGGGGNACACAAAGCA 

CCGAACAGCAGAGCAGNNNAANNCNNCGNNNACNAAACNCGGGGCNAGACACNAACNNGA 

AAGAAACCACCCCCCCGGAGNNAAAAGGNANAGGANCANAACANGGGCACCGAGCCAAAG 

AAAACGGGCAAGGACNAAGCCCCCCGGANCCACCCCCGNGGAACACCCIMCNAGAGGNGGA 

GAGAAAAGAAGGAAAGGCCNGAAGAGAAAGCAAANCCNCACANNCAANCCGGAAAGGCCC 

CAAAGGCAAAGGGG 

Sequence 522 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTNCTCGGTAACGGGCC 

GACTTAATGGGNAGATCAACGCGAAATCCTTGGAAATATATANCAACAGCAAACACAAAA 

ACTGNAGCAATGAGGTTCATGAGATTGGGTAAGTTCTGCCGATAAAAAGCCTCCCGTAAA 

GCTCGGACTTTGNCCGTCCTGGTGGCCAACAAATGGAACAGAGCTATGACTGCACCCTCA 

AACTCAGTACCTGCCCG 

Sequence 523 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACAGGGTTGAGTATTCCTAATCTGG 
TMTCTGAAATTTCACATGCTCCGAAAATCACAAACTTTTTGAGTGCCAACATGATGCTC 
AAAGAAAATGCTCATTGCAGCATTTAAGATTTCAGA I I I I I GGGTTAGGAATGTTGAACT 
GGTATAATGCAAATACTCCAAAATCTGAAGAAATCCAAAGTCTTAAATACTTCTGGTCCC 
AAGCACTTCAGATAAAGGATACTCMTTTGTATTCCTATTTTGGAGATAAGAGATGGAGA 
CTTACAGGCTGGGTTGGTTGGATTGGTTGCCTG I Mil ! GTTGATGTTGTAGTGGAAGCT 
AGGAATGGTTTTTCACA I Nil I AAAGTGTACCTGCCCG 
Sequence 524 

AGGTACCGGCCATTGGTGCCTTGTGTCAGGTCCCAGAGGGCTGCCTCAATGCCAGCAGCA 

ATGCCCTCCACCACCTCCTCACTCAGTACTGGGTCTGGGAGCTCCCGAAGGCTGTGGAGG 

CCAGGGCACAGAGTAAAGCTCCTGGGCTAAGCCAGTGGCCAAGCCAGAAAACAATGCTGG 

GCAGAAAAGGGTTACAAGGCAGGTAGTCTGGACACTGGATTCAACAGCTTGCCATCAGCC 

AGGGAAAGTGAAGCTGAGGATGCATATGGAGCAGGGCCAGGAAGATCAGGAGAGCTGAGG 

G AGAACAGG AAAATTGGG AGGAAGGAAG AGGCAGCTTAN C AGC AAACTCC AG G AGAAATA 

ACCAGCAGCCCTCACAGATTTGTGATTGTCTCTTAAAAAGATNTATTTTCTCTrCTTGCC 

TGAATGTACCTGCCCG 

Sequence 525 

CGAA TTGG AGCTCCACCGGCGGCCGAGGTACCACCGATAATGCTATTAGCCCAAACCGTG 

GGTGTTTTNTAAATATTAATAGGGGGGCTTGATTCAACAAAGCCACAGAG*ITAACGTTGA 

AATTTTCTTNAGGAATTTTCTAGTAACCCAGNTTCTAAAGTAGCTACAGCAAAGGGGGAA 

ATATTATGTGTGANCATTTTTCTTCTTATGCTATATCCCCAAG l I I I I I \ CAGGACTCAT 

TTTAAGTNAAAGGCTAGAGTTGAGTAAAGGAAATAGAGCCCAAATGAGGGTAAGGTTGTC 

TGAGCCATTGAAAGTNTTAMTACTGAAAAGAATGGTCCACTTTTTATTTCANNGAAAAT 

TA 

Sequence 526 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTCCACGACAAATTCT 
TCAGGC TCTGC CTCTTCAAC I MM I A CTCTTTCCATTCTG { INN iC CCAl UIHGC 
AATGTAGTTTTGTTGGAGGCCA I MM I ATTGCAGACTTGAAGAGCTATTATTCACCGCC 
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TCCGAGCTGCTCCGACCTGCCCGGGCGGCCGAGGTACTTGTGCCTAGTTTTTCAAGGTAT 
TGGCTGTTCTATAGATGCAGTGATTGTCCCAGCTAGCTCTGTTACCAGCCTTTTGGTGTG 
TCTTTATGTTCATTTGGAGAGTCAGGGCC 
Sequence 527 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAAAGACAAACTNGTAANGCA 

TGACCTGGGGAAATGGTCAGACCTTGTATTGTG I I I I I GGCCTTGAAAGNAGCAAGTGAC 

CAGAATCTGCCATGGCAACAGGCTTTAAAAAAGACCCTTAAAAAGACACTGTCTCAACTG 

TGGATGTTAGCACCGGCCAGCTCTNTGTACTCCCTGGCACCATCTGGTTCATGCCAGGTT 

TAGAGTTCATCGTGTTTGGCTTCCTGGTCTTTGAGTTTAAATTTCTCGGCTGTGACCTTC 

CCATCTCCATTCCGGTCCTGGTTGGTGAACATATTCTTCACAN N CAGGTCAGNATN NAAA 

GCCAGGAGCGAGTTTNCCTTTGCCAGATGCCACCTGGGCGTGAATGTACCTNGCCnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCCNNTTATAAAANCTTATTN 

GGATTCCCGTCNN ACCC N NG AGGGGGGGGGCC 

Sequence 528 

AGCTCCCCGCGGTGGCGGCCGAGGTACAAACTCTGGCCAGTGCCCTGTTAATTGTTAATG 

TTGGTAGAAAGCAGATTTGAATGCTTCTTTTAATTAGCTTCATTAGTCATTTCCCACAA^ 

CAAAACAGTATGTCTGTAGTTAACC TACTACTCTAGTAACC AAC AC ATG GAAG CATC ATA 

TGCACATAAAAGTTAAACATTAAAACAAAACTTAAAACTCTCAAAACAGATAATCATGAA 

TTTGTGTrTATAAAATTATATAGMTAAAGGCTTTGTGTGTGGGGACTTTCAGACCTGCC 

CG 

Sequence 529 

TGGAGCTCCTCGCGGTGGCGGCCGCCCGGGCAGGGTACAATTCGCCCCGTGAGTCTAGGA 
AGATTTGNTGAGCGTCTGCCAACTGGCAGATGAGGAACCGAAGTTCAGAGAGTTGAAAGC 
ACTTGCCCCAGGTCA 
Sequence 530 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCGACCGCCACGGAGCAGTGGG 

TTCTGGTGGAGATGGTACAAGACCACACCGGATGTCATCTTTGTATTTGGATTCAGAACT 

CATTTTGGTTGGTGGCAAGACAACTGGCTTTGGCATGATTTATGATTCCCTGGATTATGC 

AAAGAAAAATGAACCCAAACATAAGACTTGCAAGACATGGCCTGTATGAGAAGAAAAAGA 

CCTCAAGAAAGCAACGAAAGGAACNCAAGAACAGAATGAANNAAAGTNANGGGGGACTNG 

CNAAAGNCCAATGTTGGNGNTNGCCAAAAAGTGAGCTTGGNNNATTGGATNCNCNAGCCC 

GAAGGGANTTAAAAGGGGGCNNNNCANNTGNTNTTNGNNTTNGGGGCCCCCTTGGTGGNT 

NTTTTTCNCCAGGGACCNTTTATTTANCTTTMNAACNTTCTAAAAAAAM 

Sequence 531 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGACAGTATGT CTAA GATGAGAGAGTTTA 
GGGACTACTCTGTTTTAGCMGAGATATrTTGGGGGTCTTTTT^ 

GAGATTGGGCTAAAGAGAAGACGACGAGAGTAAGGAAATAAAGGGAATTGCCTCTGGCTA 
GAGAGTAGTTAGGTGTTAATACCTGGTAGAGATGTAAGGGATATGACCTCCCT7TCTTTA 
TGTGCTCACTGAGGATCTGAGGGGACCCTGTTTA 
Sequence 532 

TTAATTGGAGCTCCCCGCGGTGGCGGCCGCCNNGGCAGGTACAATNNTTTATTANGGCAT 

TTTTATTGCTCCGAATGAAMGAGTGGAGCTTT MGTA AATATGCCAAAAAAATATTCCT 

AGCACAAAAGCCTGCACCACTGCTTGAGTCTCCTTTTAAAGATAGAGATCATTTCCAGCT 

TGGCACCTTATCTCATACTCTCAACATTAAAGCTACTGGGTACCT 

Sequence 533 

GCCGAGGGCGGGGGGGGGGGTTTrGGGGGGGGGNAAAGCCCNTTTGGGGGGGGGTTTAAA 

CNCNCCNANACACAGGNANGGGNNNNNACNCCNANNAACAAAGGGCAAAAAAAAACNNCN 

NCTNNNAGGGNAAAGGANATAAGAGGGCCAAAAAAAGCAGCNCNAANNANACAGGGNAAN 

AAAAAGNAAAGCNNGCGGCCCAGGCCCGGNNGGGGACCCAAGGNAAAACGGCCCAGGGNG 

GGGNAGGGCCACAAACCAACGGGGGAATTCCCNGGACCGGAAAGAAGGGGAGNNANAANA 

NCCCAANNGNNAAAAAANAAACACNCANCCGAAACACANNNANAAGCCAGGAANCNGCCA 

TCTGCCNTAAAAGCCAGGNAACGGGNANCNCACACCCCNGGGANNNACACAACCCCNNNC 

TCTNNGGCNGNAAGNGGAANCTGGAGGGGAGGGNCGCTCTTCTCCCGGACCCCGNAAACN 

GGGCCGNNCAANAACAAAGGGGACCCCCCGGGCGGCAGGAAAGNAAAANCAAGCNAAACN 

AAACCGGNGACCCNNGNGGGGGGCCCGGNACCCANGNNNGGCCCCCTAAGGGAGGGGAAA 

ANNGNCNCCGGGGGGAAAAACAAGGGGNAAAGCGGGGGCCCGGGGGAAAAAGGGGAAACC 
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CCNCANAAANCCCACACAAAAAAACCANGCCCGGGAACAAAAAAGGGGAAAACCCCGGGG 
GGG 

Sequence 534 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCANGAGGCCTGAATTCATGTC 
CACAATGACCTGTGCTTAACTATTCCAAAGGTCGCTAAAGATACTGTTACTACTATTGAG 
ATATTATTGGCTACTTCACGTTTACATAGTAMTGTTTGCAGCATATAACATTCAATCAT 
AAACCCATAATTAACTTATAAGTGTTMTGGACAACTGTGCmGATTTTTGCCTn 
GATAAGAAAACAAAGTAGTGAMTGGGTCACTCCTCAMGCATGGMCATTTTAACTTTG 
CCTAGTAAGGAAAAACAAAACAAAATATAG CAATTACATGTGGAACCCGTAACCTG CAAA 
AAGTAACACAAATATTGTCTNAAAAGGTACCTGCCCGGGCCGGNCGCTCTAAAACTA 
Sequence 535 

CGGGCAGGTACATGAAATGGTTTTGAAACAATAGGAACAGATAAGTCCCAGATAGGAGGC 
TCACTGATA CTTAATTGG CCATGTCACCMTGTTTGTTTTTMGGGAGTTTGGTGGTTGC 
CATGTTATCAl I I i i M I ATCTTTCTTTC I I f I f CTTTCTTTCTTTCTTTCTTTCTTTCT 
TTTCTTTCTTCCTCTTCTTCTTCTTTTT^^ 

GTAAGAATACTGGGTCATAGTTTCATCCAGTGAAACTCTGTGACAATCCTTCACTAGAAG 

GAGAGTACCT 

Sequence 536 

CCGCGGTGGCCGGCCGAGGTACCTTTCTCCCTCTCTACCAGTTTTTCCTCTAACTCTCTC 

ACmCTCTTTATTTCTTTCACmCTTCTAATGCATTCTGCMGATCCTGCTT^ 

AT1TAT7TCCTCTTCCGTGACCCTCAGCTCGTGGAAATGTGNATAATTAATCCATGCTTT 

CAGGTGAGACAAAGACCGAGTTTCATCTGGCTTCTGCACACTAAGGACTTCTTTCATAAG 

CCTCCTCGGTATTTGCTCTGAAGTTTCTTTAAAGTTTCTGGGCTGAGGGTCAAGAAGCTG 

TTCTCTTGAAATCTCAAGTGG TTGTT TAAAGCATACCAAGTTCTTGCCCTTCGGGGATTG 

GAAGTTCCGACCTGTAGCTGGTTTTCTCTCTTGCTTCAAGAAGCTTCTCTAAATCTCNTT 

CTGGNAGGATCTTNGCA AAAAT CTCTTNGCAAGTTGGCTGGAAATTCTGACCANTCAATT 

G GTC AGGTAG GTGGG AATTTTACCTAAAAG AATGGTAATTAAAAG AC AAGG ATGGGAAG A 

TNGAATTTGGGAAGTCTTGGGGGGAAGAAGGGTTTCNACCAAGGTTTTCCCCAAGGGGAT 

NGGGGCCCCCAANTTAAAGTTTGGGGNGGGAATNGGGTNTGAAGNTNAAAACTTTAGGTN 

TTGCTTCCCCCCTCCTTGGGGGGNAANTTGGGGCCCCNGNAAANTAAATCTGGGNAAACC 

CCCTAATTTGGGNAAAGGGGA 

Sequence 537 

AGGTACCCTCTC ATG AGG AGG AATCCATG CTGG AGG ATTCTCATAGGC AGGC AAG AAAAC 

CACCGGGTAGTCATCATAAGGAATCCGGCTGTCCATCTCGGGCAAGGCCTTAGCTAAACA 

TCTTCTGGAT GCCA AGCAACCTGTCAGGTACGCGGGAAAGTTGAAGGCTACAAGAAGACC 

AAGGAAGCTGTTTTGCTCCTTAAGAAACTTAAAGCCTGNAATGATATCAAAAAGGTCTAT 

GCCTCTCAGCGAATGANAGCTGGGCAAAGGCAAAAATGGAGAAAACCGTCGCCCGTATCN 

AGCGCAGGGGGCCCGTGCCATCATCTATAATGAGGGATAAATGGGTTATCAATCAAAGGC 

CCTTCAGAAACATCCCTGGAATTACTCTGCTTMTGTAAGCMGCTGAACATTTTGAAGC 

Sequence 538 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCACGGTACGCGGGGGTGGCGGCGT 

TGGGTTGAGCGGGCTTTTTGGAAGNTNGNGGCGGGAGTTCTGTGATATGAGCAACANTGG 

ACCAGANGATTTTATCTCTAGCAGCNGAAAAAACAGCNGACAMCTGCAAGAATTTCTTG 

GGCAGGGCC TGGG GAATGCTTTTTTATCTCATATTAGTGCCTGTGATGGCATCTTTCATC 

TAACACGTGCTTTTGAAGATGATGATATCACGCACGTTGAAGGAAGTGTAGATCCTATTC 

GAGATATAGAAATAATACATGAAGAGCTTNAGCTTAAAGATGAGGAAATGATTGGGCCCA 

TTATAGATAAACTAGAAAAGGTGGGCTGTGAGAGGAGGAGATAAA 

Sequence 539 

GGTACAC TGATCAGGG ACTGGMTCTTCTTTCCMTTrCCATGGCATATGCTTTCACTTr 

gctgagg i nrnm I AAGTGCAAGTAGAGCTTATCTTGGTATTCTATAGGACTTGCAGT 

TGTCTCTGGAGTTTCTTCCTGGGAGTTTTCTTTAACAGTTTCTGACAMTTCTCTGAGTC 

TATATGTATAATATTTGGGTATGAAACTGAAGACTCCCCCAGCTGTGATCTCTCAGNGTC 

TGTCTTAAACGGCAGCTTGTCA TTTCC TTGGCCAAGTGGGTCTATTTCTGAAAGAGAGAT 

GCTGGCTGGTATGTCTCCCCTGTTTTCATCTTCGCCGTCATCTGAACCCACACCCTTTGT 

GAGCATAGTGTGGAAAGCCAAACTTTTTGACC 
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Sequence 540 

CCGGGCAGGTACAGGCTGTTCACCAGCTTGCTCAGAATCACGCCATTCTTTCAGCCAGAC 

CTGGAAGCCCAAGCCGCCCANCGGGTCTGGGNCGGGCCANATTCAAGGGNCCNANCTTGA 

NCTTATTGNATCCNACTTCCAACNAGCCCGGCTTCCCTTCCNAGCTTCCCTTCGGTCANT 

ACTTTCTTTCTTCGNNTTTTTGGGANCTTGGCAACTTTCGGCGGGCTTCAATGGCCCNTA 

GGGAMGGGACCCCTTGGTTGGGCCCATGGTCNTGGGGGAAAGAAGGCTTCCTCAGGGGC 

TCGCAAGGAAGGAGTGAAGACTTGTGACCTCGGCCGCTCTAGAACTAGTG 

Sequence 541 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATCTGGGTGGAAGGTGACTCCTG 

TGGGCGCGATGATGGGGTGGAGGGTGTCTGATGGAATCCCGTTGGGTCCCGGCTGCCCTA 

CGTCACCCTTTTCACCCTTMCTCCGTAGAAACCAAGTCCTCTGTTGCCTTGCTGTCCTT 

TTGGTCCAGGGGGTCCAGGTGGTCCTGGTCTCCCTGGAGCTCCAACTGGACCCATCTGTC 

CCACATGCCCGGGCGGCCGCCCGGGCAGGTACTCTCAGAGAACAGGAGATATGTGTGCAT 

GCCTTAGAAAAAGCCCTTGAGTAGGTAAGGAAGGAAGAAACCTATGGCAGACAGGATTGC 

TGCAGCCAAGGGGGCTTCAAGCAGAAATGAAAGAGATGGCTTGTGAAGCTGCCCGTGTCG 

TGT 

Sequence 542 

] ] I I i M I I I I I I i ) I l I ICGNTTGGGGGACCCAAAAACCCCCAANAACNAGGGGACCCC 

CCGGGCNGCAGGAANTCGAAAACAAGCTAAACGAAACnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnTNCTGGGGCNGGGGGGAAAAGGGNANCCGCNCACAANNCCANANAACAAACGAGCCGG 

GAGCAAAAAGGGNAAAGCCGG 

Sequence 543 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTCTCTTCTTAA 

CCATTTGGTCCTCTGATATAAAACTTGCTTAACTGGCAAGATGGC TCACTTGG TCCTCAA 

GCCAACATCAAGAAATATTTAGCTGTCTTAACACAGGTGGGAGAC I I Mill I AATGAAA 

TACATAGGAATCCATTAAGTAACATTAGCATGTCATGTGTTAAATAGCTTCATTTAAAAA 

TGTTACAGATCAACAGAAGGGAAATCAAAAACTTATTTTTCTCAGTAACTACTTTACATA 

CTAGTGTAATTAAAGGTTTAACTGGAAGAAAAATTCAGCATAAAAGTATAATTAAAGTAA 

AAAGCTGGTATTCCACATTCCTGTTCCTATGAACACCTGAGCTGGCACAAAAGCTGAGTA 

GTTTAATGCATTGGCATATAT 

Sequence 544 

GAANTGAAGCTCCACCGCGGCGGCGGCCGCCCGGGCAGGTA CAATT AGAAAACCTGCATC 

TTTTAATAC AGTGAG ATTTN GTATGCATAC ACTCTGGTGTCTTC ATTTTGC AGCC C ATTA 

TATTGGCTTAAAAGCCANAAAGGTGCTCATGCATGTAATTTATACTTGGAGCAACGAATG 

CCANTGTATGTGGNCGCGTNTGTGTGCACNTTTGAAAGGNAGGAGTTTTTAGAAGNAATG 

AGMGGAAAATTTNCTTTCTTGGGCATTCATTAAATAATTCAGNGTGAATATCCTTGTAC 

NCTTGGCCGNTCTANAAACCAGNTGGATCCCCCGGGGCTGCAAGGAAATTTCGAANTNTT 

CAANGGCTTATTTGANNCCCGGCGACCTCCNANGGGGGGGGCNCCGGGTACCCCAANCTT 

TTT 

Sequence 545 

CCGGGCAGGTACATTATTGTTTATNTGAAATTTTMTTGAACTAACAATCCTAGTn 

ACTCCC AGTCTTGTC ATTGCCAGCTGTGTTGGTAGTGCTG N GTTG AATf ACGG AATAATG 

AGTTAAGAACTATTAAAACAAGCCAAAACTCCACAGTCAATATTAGTAATTTCTTGCTTG 

GTTGAAACTTGTTTATTATGTACCCTTCGGGCGCGCTCTAGAACTANToGGATCCCCCGG 

GGCTGCAAGGAAATTTCGATATCAAGGCTTTATCGGATACCCGTTCG^.CCCTCGAGGTGG 

GNGGCCCCGGGTACCCAAGCTTTTTTGTTCCCTTTAAGTGGAGGGGGTTTAAATCTGGCG 

CCGCCTTGGGCCGTTAATTCATGGGTTCATAAGCCTGGCTTTCCTGGTGTGGAAATTTGG 

TTANTTNCGGCTTCAACAAATTTNCCAACAACCAAACATAACCGAAGCCCC 

Sequence 546 

CNGCCGGCCGAGGNACCAAAATTTGTGGGAGGGNANTNG.WVCCCGGCAGANTTTAANAC 

CNNATGCCNATAAANGGGGGGNCAGGGGANGAAGACGGGGGGCCCGGGNGAACAAAAACC 

ACACGGNCTCTANGGAAAANGNGGAGAGAACTGAGAGCGAGGTGNGGCAAGAAGCAGGCT 

CGGAGCCGGAGGAGGGNGGCTANCGGCNNTAATCTCAGGGAGAGATGGCGCNGTGCGGNC 

AATGATGCAAGCNGGNAAGGGACCNGGCGGGGAGGGAAAGGGACGAGGGAAG 

Sequence 547 
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CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAGACTGGG 

TCAGAAACAGATATAATACATTATTATTTTMTATATTGCTTGTTMTAGCTTTAAAAAT 

ACTACATTCTATGGTAGATTATTGGTTATATAAAAGGTAGAAATGTCATAAGTTTTNTAA 

TTNCCAGTCTGTAAAACCTGTCAGTTGAGTGCTTTGAAGGTACCTnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnNGNTTGCTGGGAAATTCCGNTATCAAAGCTTATTCGATACCCGTCG 

ACCTTCGAGGGGGGGGGCCCGGTTCCCCANNCNTTTTGCTTCCCTTT 

Sequence 548 

GCAGCAGGAGCCTGGGAAAGAGGCCNTCACCCTGGCGATGGCCTCTTTCCCACCTCCCCT 

TC AGTTCAN AN CC AATATGG NTCTAGAATCTG GC ACTTTACTC ATTTCCTTTGATAAATT 

AGTACCTCNNCCGCTCTAGAACTAGTGNATCCCCCNGNCNnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnNCAGTAATCATTNTCATANCTTTATTTCCTTTGTGAAATT 

GTTATNCCNCTCACAAT 

Sequence 549 

AGGTCACAGTTATGGCAAAGTAGACAAAGCATTTGTTCATTTGGAGCTTAGAGTCCAGGA 

GGAATACATTAGATAATGACACAATCAAATATAAATTGCAAGATGTCACAGGTGTGATGA 

AGGGAGAGTAGGAGAGACCATGAGTATGTGTAACAGGAGGACACAGCATTATTCTAGTGC 

TGTACCTGCCCGGGCGGCCGAGGTACCTTACATCAATGGCAAGTTTAAGAAGGATAATTA 

ATTACACAAACCCTTCACAGACTGCTCTGGTGCCTGGTGGTGCTAGCTCCTCCCACCTCA 

GCACCTGCTGATTTCTGGGAGCAGC 

Sequence 550 

CGCCCGGCAGGN ANTA \ 111 I I 1 1 I I 1 11 I 1 1 1 ill I Ml 1 11 1 i 111 1 1 11 M I 1 I HI 

Ml 1 1 111 1 1 M 1 1 1 I 1 1 1 1 1 1 11 1 1 1 11 1 I 1 1 1 1 1 H M 1 1 1 1 1 M 1 AAAANNTNNAAA 

ANTTTTNNNNAAAAAAAAAAAAAAAAANGNCCNAAAAAAAAAAAAAAN 1 It I 1 1 1 M 1 I 1 

AAAAANNCCCNTNNCNTTNNAAAANNAAAAANCCNCNNCCCCCCAANANNAAAAAATTTA 

AATNAAANGTTANCCNCNNCCCNGAGCCGGGGGGGGGNNCCCAANCCNNNTTTTTNTGGC 

NTAAAAAAAANGTNAAANNCCCCCNNANAAAAAAAAANNCNNTTTCCCAATTCCCAANNN 

NAAAACANTTNNCCCTTCCAAAAAAAAANAAAANCNCAACCNGAAAAANAAAAANNGNCA 

ACAACCNNNGGNCNAAANAANACNCNCCNNNNNNGTATNGGGGGNCCNCCNCNTTCGGGA 

AAAAAAAAAANAAAAACCCCCCCCAAAAANAAAAANAAACNGCCCAAANNGGGGGNAAAN 

NNNNTTNGGNTMNNCCCCCTTNNCCCATTTNCTNIAAAAAAGNNNGGGGNNGNTTNGGNN 

NNNGGGNGNGNNNNNNNAAATCTTCTCCCCAAAGGNNNTATTCCCCTTTTAAAAAAANTG 

GGGGAAAANCTANGAAAAAAAATTNGGGCAAAACCCNGCCCCNGCCCCGGGGCCCGTNNA 

GGGGCNNNTTTTNTANNTNNAAACAACCCCCCCCCCCCNAAAAAAAAA 

Sequence 551 

CCGCGGTGGCGGCCCGCCCGGCAGGTACTTGTTATCAACACGTTTGTATCAGAGTTGCTT 

TTCTAATCTTGTTAAATTGCTTATTCTAGGTCTGTAATTTATTAACTGGCTACTGGGAAA 

TTACTTATTTTCTGGATCTATCTGTATTTTCATTTAACTCAATATCATACTACCGGCTAC 

ATCAAATCAGTCCTTTGATTCCATTTGGTGACCATCTGTTTGAGAATATGATCATGTAAA 

TGATTATCTCCTTTATAGCCTGTAACCAGATTAAGGAATACAGCTCTTAAAAAATCAAGA 

ACTTCCTGAGTTTCACATATAAAATGGTGACAAACACCTGCTTCTGATAAAGTTTATCCA 

ATTTTGTATTTATAGTATTCTATTGTAATMTAGAATTCTATTATGTGCTTCCCTCCTTA 

GACATAGTTTTTCTCCTTTTGTACCT 

Sequence 552 

CCGCGGTGGCGGCCGAGGTACAAATTCTCTTCTTAACCATTTGGTCCTCTGATATAAAAC 

TTGCTTAACTGGCAAGATGGCTCACTTGGTCCTCAAGCCAACATCAAGAAATATTTAGCT 

GTCTTAACACAGGTGGGAGAC I I I I I I I l AATGAAATACATAGGAATCCATTAAGTAACA 

TTAGCATGTC ATGTGT TAAATAGCTTCATTTAAAAATGTTACAGATCAACAGAAGGGAAA 

TCAAAAACTTA I I I I ICTCAGTAACTACTTTACATACTAGTGTAATTAAAGGTTTAACTG 

GAAGAAAAATTCAGCATAAAAGTATAATTAAAGTAAAAAGCTGGTATTCCACATTCCTGT 

TCCTATGMCACCTGAGCTGGCACAAAAGCTGAGTAGTTTAATGCATTTGGCATATATAT 

GCCTTAAGGACTGTACCTGCCCG 

Sequence 553 

CCGGGCAGG I I M I I I I I I I I GTCAT1TTAAATGTTTATTTTATTTTGCACAATTTTAAC 

AGTAGAGTAATTTTGCTGTGGAAAATAAAATTTCATCTTGCAAGAAAGAGAAAACTCAAG 

ACCACCAACTGGCGTCTTTATTTACATATTAGAGCGGGTCAGCCAATGAAAATCCTGGGC 
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CTTCCGTGTGGCTTGTGGCTCAATCATGATGTTGATANGAGAAGGTTTAGTTGTGTCTGC 
TAGGCTCTGCCTCAGGGATTTTTGGAGTTCTTCTGGTGTTTGTACCT 

Sequence 554 

CCGCGGTGGCGGCCCGAGGTACCCCGTTCTGCCTGAGCAI (Mi ICCTAAAGGGAAGAAT 

CAATAGTTTCTGACTGTTTTAACAGCTGAAAGCTCCAACTGGAGGCAGAAGATGGGATGG 

CTTTTCACACACGTGCGTGCAAGTTTAGCCACCTCAAAGGCCTTGTTC7TAAAGCAACAG 

TGCTGTTTGCATTATGAAATGTCTCTGGAGTTCCCCTTTGGAAAGGCTGCTGGTGGGCCA 

CATGGTCACGATACTTTCAAGTCACACCCTACTTTGTGACCTTATCCTCAGAGTAAAGGC 

TTTAGAGGAAAAGGGACCCCACAGTCTCACCCATTACCTGGCTGTCAGCATCTCCATATG 

CTCCTGGCTGAGTTTTATTGAGCATCAGCTGGGGATGTGAGCAGAAACCTGAATCCTTGA 

GACAGGTGGTTTTCAAAAAGGAAGCCATAACAATGAGTGGCTTAGTACCTGCCCG 

Sequence 555 

CGAGG 1 1 1 M I 1 I I I 1 11 I I I M N I 1 M I 1 I I 1 1 I NGAAGAGAAAAGATATTTAACCATT 
TCTCAGGCTAATCTNTTAAAGCGCTCAAAGTGTTTTCACAAGTGCAAGGGGAGGGATTGT 
TCTCAGCCCAACATTfACCCAGTTNTTTTTCTTANACCACNCAAAAAACACTCTCNAATT 
TACGGTGCCTTTANACCTTTACTTT 

Sequence 556 * 

GGCAGGTACACAGCGGCAGTCGCGCCCACACGTCCATGACTGGTCGTCCTAGATTTTAGG 

TGTCGATGAATACGGCCCACTGGGAACTGCAAGCCGGCTCTCTGCGAGCGGGAAACCGCC 

TTTGTCTTGGCCTTTCCGGAGTCCTTTCCAGCCTTACCGCCAGCCATTTCGAATTCCGCT 

GAAGCTTAAAGCAAGCAAGGCAGAGAAAANGGNTAATCGGAC CCACG GNGGGNATCCCAC 

CAACNTANTTNTTTTTTTGAAACCGGGAANAAA AAAAG GGGGNTTTTTNCCCCCN 

CGGNNNCCTTTTTAAAAANCCCCCCNGGCCCNTTTTAAAAAANAGGGGGGCCCCCCCCGC 

NGGGGGGGGGGNNNTNTTTTNAAAANNTTrr TTTTTCCC CCCCCCCCCCCCNGGGGGGGG 

GGGGGGGCCCCCCNNNCNA I I I I I II I I I I II II I I i II I NNAANAAAANAAANCCCCCC 

CCCCCCCNAAAAAAAAAAAA 

Sequence 557 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATGGGCTGGAAGTGGGGATGTCCATGGCCCAA 

GAGACCCCTCCAGCTTCCAGCCCCTGGCAACAGCGGGAGAGCAAACCCAGCACCAGGGAC 

TTGGCGTGAGCCAGGGCCGGGAGGCCCCCAATCAGGCTCTGGGGTATCAGANCAACCAAN 

TTGACGTAGTGTGAAAAAATAGTATTCCTTTGATAAAAAATACTGTCCCTTGGTCTTCTC 

TAAGTTTGAAACACCCTGGGGAGCTTTATTTTTTAGCAAA GCCA ATTTCCCATACCCACC 

CAAAANTTmNCATTCCAAGTTTTAGGNTAAACAGGNANTTTTAMCCG 

TAAACTTTAAATTTTTGGAAAGGNCCTTTTGNAN 

TTAAAAAAATNANGGGGNNCTNCCCCCGGGGCTCTGGGNGGNGAATNI I I I I I NTTTAAA 

NAGTTrrTNTTANTAACCCCCCNCCCCCTTTTNGGGGGGGGGGGCCCC 

Sequence 558 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGG l I 1 I I I I i iGAGAC 

AGAGTCTCACCCTGTCACCCAGGCTGGATGGAGTGCAGTGGTGTGATCTCGGCTCACTGC 

AAGCTCCGCCTCCTGGGTTCACACTTTTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACT 

ACAGGCGCCCGCCACCACGCCCAGCTMTTTrTTTGTAGTTTTAGTAGAGTCGGGGTTTC 

ACCGTGTTAACCAGGATGGTCTCGATCTCCTGCCCTTGTGATCCGCCCGCCTCGGCCTCC 

CAAAGTGCTGGGATTACAGGCGTGAGCTACCACGCCCGGCCGAGGTACTGTTCCTGTTGG 

CCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGTGGTCACCTTTGCTCCAGTCAACGT 

TACAACGGAAGTAAAATCTGTCGAAATGCACCATGA 

Sequence 559 

CTATAGGGCGAATTGGANCTCCCCGCGGNGGCGGCCGAGGTACAAGTTGGGGTCATAATT 

ATCGAGTCTCTTGATATNATCACAATTACTGTGCCCCCTGCAC TTCCT GCTGCAATGACT 

GCTGGTATTGNGTATGCTCAGAGAGGACTGAAAAAAAACGGT^TTTTNTGTATCAGTCCT 

N N AAG AATAAATATTTG N G G N C AG NTC AATCTTGN NTGNTTN G AC AAGACTG G AACTCTA 

ACTGAAGATGGTATTACATCTTTGGGGGANNCAACGAGNGGAAAANGCNCCGATTTCTTT 

NACCANAAGAAAATGTGTNCCATGCTGNNNNTGGTNAAACCCCAGCCNGTTGCTAGCANG 

GCTACTCCCCATTCACTTACAAAA 

Sequence 560 

NCTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCG CCCG GGCAGGTACCTGAAA 
CTGCCGCCACATGCACTCCTCCACCGCTGAGAGTTGAATAGCTTTTCTTCTGCAATGGGA 
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GTTGGGAGTG ATGCG TTTGATTCTGCCCACAGGGCCTGTGCCAAGGCAATCAGATCTTTA 
TGAGAGCAGTATTTTCTGTGTTTTCTTTTTAATTTACCTTCAGTCAACTT^ 
TCCTGGATTTCCAAGATCCGCGTCTGAAAGTGCAGTACCT 
Sequence 561 

ACTACTTAGG GCGAATTGG AGCT CCCCGCGGTGGCGGCCGAGGTGAACTTGCTACTTTTT 

nTTTTTTn iGAGACAGAGTTTTGCTCTCATTGCCCAGGCTGGAGTGCGGTGGTGCTAT 

TTCAGCTCACCACAACCTCTGCCTCCTGGGTTCAAGTGATTCTCCTGCCTTAGCCTCCCG 

MTAGCTGGMTTACAGGCACGCACCACCATGCCTGACTMTTTTGTATTTTTAGTAGAC 

ATGGGGTTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCCCACCTTCAGGTGATCCGCCC 

ACCTCGGCCTCCTGAGGTGCTGAGATTACAGGCGTGAGCCACTGTGCCAGCTTGCTAATT 

TTC ACAGAA GTTGATGGCMTrCTTCACATGTAAACAGTGCCAGTGCACAGAACCTTTAT 

ATATTTTTTGAAGCCAGTACCTGCCCG 

Sequence 562 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAGTCAGTCCAACAGTTAGTGTTAATTACT 

AATAATATATGAAAACCCTGCCAACACAATTGCTGCTACATCACCAATATAATTATTAAC 

CACTGTCGGAAAAACACACATAMTTCAGGTMGACTAAAAGCTGTCTCACAAAAAGAAA 

A AAGAA ATCCAATGGATCCACTAATGCTATCAAAAGGGACATGCAGGAATGTAACATGAC 

ATTTTTAGAMTGTGTGTTTCTAAAMGAAAAAAAAATACACTAAAATGCCAGTGGACTA 

TAATTCATTCAAMCATCTTTAGTGTTCCTTCCCAAAGATCTTGATCTGCTCAGTAATTG 

CTTCACAAGATCTAT CACAG CCATCTTTTGGAGCGTATGGTTAGGCTGGTCCTCCTGTGG 

TGGTAGGGGCAGTCTTTTTGAAGCTTTAAGTATCTGGTGGTACCTCGGCCGCTCTAGAAC 

TAGGTGGATCCC 

Sequence 563 

TTNCAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCGGAAATTATCTACA 

ATGMGAAMTTTNTTGGNAAACTTATCCAACAGCCTGGGCAAGCTACCTCTCGCATGGG 

AAATTGATAAATCTGAATTTGATGGGGTGACCACAAATTCGAAACACAAATCAGGCAATG 

CAAAG AAACAAGTTTC C AAGAGAAAAACTTC AG ATAAAAAG GG AAG ATATCAG AAGG AAT 

GTCCTCAGCATTCTCCTCTTGAAGATATTAAACAGCGGAAAGTATTAGACCTCAGACGAT 

GGTACCTGCCCG 

Sequence 564 

CAGGGCGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGG rTTTTTTTTTTTTTTTTTTTTTI 
I I I I I i I III I GG NCCC AGTAATTTATNG ATTAAATC ATNGTAATC N CC AATAN AG ATTN 
CAATAGAGANCNCCAACATGATTNCATGCNTTNAGNGGANAAATNTTNCCNGGTTAAG 
Sequence 565 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGGATACAGGAG 

AGAANCNANCGNNTTTGTCTTTGATCNNAANGAATCCGCATNGTANAAAGTGGAANATGN 

NATGNGATGNACACATTAATTATNATCATNCCCTTNNNCTACCTAAAATACTCGCAGTGG 

CTCNACCTTACGCATATAAAAACTNCGCATTCCAGCCCACCGCCTTCA 

Sequence 566 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATCTCATTT 

TCATAAAMCAAAAGAAAATCATTATGGACATGTGTGTGTAAAMTGCTTTTTGGAAGGA 

C TAGAAA CTAGATCAACCA GCTAAA CAGAAGTTCCTTCTGTAGCATGGCAGCGGGTTTCC 

Al 1 I I 1 I ATTAGTTTCCTAGTTTTTTAMTATAAAAAGAATGTTTTTACAA^ 

TTGTTTATTATGCCTACAAAMTTAATAmATAmATTGTGTATATATATAAAACC^ 

AC AAACACTTAAAGTTTATACATATAC ACACTAGATATGG CCTGTGAATTGTACCTCG GC 

CGCTCTAGAACTAGT 

Sequence 567 

TACTACAGGGCGAATTGGAGCTCNCCGCGGNGGCGGCCGAGGTACTCCAATGCGTGGCTC 
TCCTGTGTGATTCTTNCTCTCCACATGGTTGTGAGTCTTGACAATCGCAGCCCCTGCTTT 
TTCCCTTCCCTGGGAGGCTAGAACANAGAAGCCCTTAC 
Sequence 568 

GAATTNGAGCTCACCGCGGGGGCGGCCGCCCGGGCAGGTACAl I 1 I 1 AATCTGTTATAGT 
TNTTTGC AGTATTAAAATGGTG AGTAAAAAN AAAAAAAAAAAAAAAAAAN NG GG CTTGTA 
CCTCN 
Sequence 569 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCGGACCTCATCCGCCTCCT 
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GCTGAAGCACGGGGCCAACGCAGGTGCCAGGAACGCAGACCAAGCCGTCCCGCTCCACCT 

GGCCTGCCAGCAGGGCCACTTTCAGGTGGTGAAGTGTCTGTTAGATTCGAATGCAAAACC 

CAATAAGAAGGACCTCAGTGGAAACACGCCCCTCATTTACGCCTGCTCCGGTGGCCATCA 

CGAGCTTGTGGCACTGCTGCTACAGCACGGGGCCTCCATTAACGCTTCTAACAATAAGGG 

CAACACAGCGCTGCACGAGGCTGTGATTGAAAAGCACGTCTTCGTGGTAGAGCTGCTTCT 

GCTCCACGGGAGCGGTCAGTTCAGGTGCTGAACAAGCGGCAGCGCACGGCTGTAGGACTG 

TGCTG AAC AG AATTCAAAAATAATG GAATTGCTTC AAGG N G GTAC C fCGCCCGAGGGCTC 

CTTCCTTATGACTCCATTCAAATTTACGGGNTATTAAAGCAGGGGCTTCAGTGGCCCGGG 

GTCCCT 

Sequence 570 

CCGCGGTGGCGGCCGAGGTACTAAAAACACAAAAGCACTTATCAGACACCCTATGGCATT 

TGTTGCTGAAATGTCTACCGGCTGGAGATTAGCCTTGAGCCGAAGCAGCCTCAGAGCCAA 

CACTGGCTTCCTCCTGTAAATCATCTCCAGCAGAGGACTCTGACTCAGCCGGAGAATTTT 

CTTCTTCTAGTTCAGCATATTCATCAGAGGTTTCGTTCTTTGTTGTATCTTTATCATTAT 

CGATGGTGACAGCTTCATCCAGGGCAGCATTTGTAACCTTGGGGGTTGTTTCAGGGGTTT 

ACTTCCGTGGGTTTCCCTCGCCGCTGCATCTGTGACCTCTGGCCCCGGTTTCAGCACTGA 

CTGCTTGTGGGTCTCCGGAGCAGCCTTCCACGTGACCTGGGACAGGCAGATGCCTCTTTT 

ATGACCACAACTTGGTAGGTACCTGCCCGGGCGGCCCGCTCTAAAACTAGGGGGGGATCC 

CCCCG 

Sequence 571 

GTATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGAAAAGTT 

TTATTCCTTTTGAGGACAAACAATTAAACCACATCCAAGGTCTTAACTTACAGACAGAAA 

CCAAAGTAGCCATTTAAAGCGTTAGATATCGGATACAAGACATACACTGGGGAGAATGCT 

TCACCATCTGAAGCTCACACCACAATGGCCCAGTGGACAGCTGTGCACTCTGCTTGTGCT 

TAAGTGCCTGGGTGTGGCTGAGGGGAAGGCGTGTCTGCAGAACAGAAGAACAGCTGTGTT 

TCACAAGTACCTGCCCG 

Sequence 572 

CCGCGGTGGCGGCCGAGGTACCTCCTAAACAACACAAACTCAAAATGGCTCTAAAAACCT 
GCAGTGCCAATCAGGCTGTGCATAGCAATTGCAACCCGTCCAGCCAATCTGTGATGGATG 
TAAAAGGTCTGTAGCTCCAAGTCCACATGAAGTGAGTGTCTTCAATGTGTTTAGAATAGT 
GAAGTAATACTAAGGTCACTTCTGAATAATATTTTTGTC I MINI I GTTTGTATGAACA 
ATCCATTGGTCACTTAGACCCCTATGTAAAAATAAAAGAATCCTCAAGGCATGAAAACAC 
CAGTAATCTGTAATCTGCAGGACTCTTCCCCTGTCTGGGAAAATATTTGCCACAATTTGC 
CTCAAG C AAATTCTTCTTG GTCACTCAAGGCTAAGAAAAAAATTC AGG ATTGTAAGGAAG 
TGGAAAN I i M l ICTCTCTCTC l I I I I I I I I I I I I I I I I I 
Sequence 573 

ATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACG 

CATTTAAAATAAACCATATAATTTTACCAGTAAGAAAGCCGATAGTTAAGTTCAATTCAA 

ATTGGAGTTCATMGTTAGTAATTTAAATCCTTAGACAAAGTTACAGAAAGTGCATCTTC 

TTATTTTCCATCTTCATAC AATGTTAA I I I I I I I I I I GGTGTTTATACCTTTTAAAAAAT 

AAAAACAGCCAAATACTTAAGCAATATGTACCTCGGCCGAGGTACGCCAAGGGCGCCCTG 

GAGACGGAGGATCCTGCGTTAATC 

Sequence 574 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTCTTATCG 

CTCAACAACAGAGCTCCCATTAATTGGGAACAGTTCTAACACTCCCCCAACTTCCTCTCC 

AACGCCTAAAAGCTTCAGAATGGTTATGTGGCGCAGACCAAAATTGCACGCCTGACCACT 

GAGGGCAGAGAGCTGCTGTGAATAGGCCCAGTCTTCATCGTTAGGAGCAGAGATCACCAG 

CMCCTCCTCCTCCACCGGAACCTGGATAGGAAGTTCTCCAGGGACTGCTTTTTGTCCTC 

TTTGCAAACAATGCCCTCCTTCTTCTGCTTCTCCATATCTTTGATTCGGGACTGGAAAGT 

ATCGATCAGATCAAACACAGACTTCATTGTTATTGGTACCTCGGCCGCTCTAGAACTAGT 

G 

Sequence 575 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACACAAGGGAGAGGCTCCT 
GGGCAGTGACGGTGGAAGCTCCACTACCTCTGGGATTAGGGGCACTGTTTCCAGAGTCTG 
TAAGGTCGTGAGGATGTCACTTATGCTGTGCTCCTGTGGCTGGTTCTCCCCTCCAGGGAG 
TCTTTCCCCAGCCTCAGCTGGTTCCTGCCCAGCCTCAACCCCAGGCTTGCCTTCAGCACC 



Page 80 of 635 



WO 01/51628 



Table 1 



PCT/US01/00798 



TCGCTCCAGGCCTGGCTGTGAGAGCCNAAAGCGTAACCTTGCCCGGGGCGGCCGCCCGGG 

CAGGTACAGGGGGTGCTGCAGGTGGCAGAGTGAATGTCCCCCATCATGTGGCCCAACTCT 

CCTGGCCTGGCCATCTCCCTCCCCAGAAACAGTGTGCATGGGTTATTTTGGAGTGGTANG 

NNGMCTTGTTTACTCATTGAAGCAAAAATTCTGCTTCCTTTTATTTTTATAGGAATAGA 

NGAAGGAAAGGGTCAGAATGCGTGCCCAACTTCTTTACCCCCCAA 

Sequence 576 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGAACTGTAGAGTCA 

GAGTTAATGTTGACAAG GTATTTTTTG ATTTAATG AG AG AAATTCG AGCG AG AAAG ATG G 

AAGACAGCAAA GAAAA GAATGGAAAAAAGAAGAGGAAAAGTTTAGCCAAGAGAATCAGAG 

AAAGATGCTGCATTTTATMTCAAAGCCCAAACTCCTTTCTTATCTTGACCATACTAATA 

MTMTTTATAAGCATTGCCATTGAAGGCTTAATTGACTGAMTTACTTTAACATTTrGG 

AAATTGTTGTATATCACTAAMGCATGAATTGGGMCTGCMTGGAAAGTCAAATTTACT 

TTAAAAAGGAMTTMTATGGCTTCACCAAGAAGCAAAGTTCMCTTATTTCATAATTGC 
CTACATTTATCA 

Sequence 577 

TTAGGGCGAATTGGAGCTCCCCGCGGGGGNGGGCGAGGTACTCATGACGGANGTTGCCGC 

TCGCCCACTTGTGCAGCAGCGTACTTGTCATTGTCCAGGTCACAGGTCTCGAAAAAGCGG 

GTGGTGCAATGCTCCATGGGGATGAGGGGAGCACGCAGTGGAGCCAGCTCGGTGTGGGAG 

AGGTACCTGCCCGGGCGGCCCGAGGTACCAGAAAGATAACGGAATGTAAAAACTGGAATT 

ATGAAATCTGGAGTTATTATTTGGGAAATGGCAGAAAAAAAGACAGTAAATACGAAACAT 

ACTTGA AAAG AACAATGCAAMTTTAGTMGATGAAAGGGGAAATCTATTAAMTGTTTG 

AGMCATTTTANAGCACGGGATTTGGNAAATTTCCCCATAGGACAGATGGA 

Sequence 578 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCATAAAGAGTGTT 
GAAGTTTATTTATTATAGCACCATTGAGACATTTTGAAATTGGM7TGGTAAAAAAATM 
AACAAAAAGCATTTGAATTGTATTTGGTGGAACAGCAAA 
CTTTGTCAAATTATACTGTTTCCAMCAT^ 

CACTTGCACTGGTTGGCAAGATTAGAACAAGAGGAACACATATGGAGTTAAA'l I I I 1 t 1 T 
G 

Sequence 579 

CCGCGGTGGCGGCCGAGGTACAATCTCAGCTCACTGCAACCTCCGCCTCCCAGGTTCAAG 
TGATT CTCCT GA CTCAGC CTCCCGAGTAGCTGGGATTACAGGCATGCACCACCATGCCTG 
GCTAAl I I I iGTAl Mil IGTAGAGACAGGGTTTCACCATGTTGCCCAGGCTAATCTTGA 
GTTCCTGAGCTCAAGTGATCCAACCACATCTTGGCCTCCCAAGGTGCTGGGATTCCAGGT 
GTGAGCCACCGCCCCCAGCCAACAATAATTTTTAAGGAGAA^TACCAAATCCAGAATACT 

GATTACCTCTGGGGAAGAAAACAAAAGAACAGAATGAGAGAGAGATACACAGGGAACTTC 

AACTGGTTTGTCTAATATTTTA 

Sequence 580 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACATCTTTGAAGG 

GATGGAGCTGCGCGGGGCTCCTCTGGTTGTCATCTGCCAGGGCAAGATCATGCTGGAAGA 

TGGCAACCTGCACGTGACCCAGGGGGCTGGCCGCTTCATACCCTGCAGCCCGTTCTCCGA 

CTATGTCTACAAGCGCATTAAAGCACGGAGGAAGATGGCAGACCTGCATGCCGTCCCAAG 

GGGCATGTACCTGCCCGGGCGGCCGCCCGGGCAGGTACCAAAA/iCCAACATGACACACAG 

GAAAAAATAAMGTGCAATTTTAATATAGTGAATGTGATACATGTAT/\ATTCCTCATAAC 

AAM TGGTC AAAACCTTTAAAAGATCCACAATAGATATC.TGAAAATCTTAGCAATGCTGT 

ATATATTTTGAGGACTAAATGATGAATTTATATTCAAMTTGGTCMATATATT 

Sequence 581 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCTCCTCGATCATCTCGCGGAGGGCCAGGAA 

GTAGGGCCCGTTTTCATTTGGGGCTCGGCGCCAGTCATAGGGAGCCCCTCGGACATCCTC 

ACCCCGTGTGTAGCCCCAGCCCACAAGGCTCTCCACCATGGTGTGGAAATAGGAACCCAC 

GCTGCTTTTGCTGGGGTCCAGGAACTCCAGTGAGAAGGTCTTCCCAAAGCCAGGGACACG 

TACCTC GGCC GCCCGGGCAGGTACTCCAGGACGAAGMGGAAGATCATGCTTGATACTTA 

GATTGGTTTTCCCAGGGAAGAGGGCGGAGCAGAGCAAAGTCACTGTGAACCCTGGGCCAG 

GCCCTGGCTGGGCCAGCTCCTGAGAGCGTCTCGTGTTGCANACCCTTGCCCACTTTACCC 
A 

Sequence 582 
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CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTCCACTTCAAGA 

AATTTTTTTTTCTTTTTn 1 GGAGACGGAGTCTTGCTCTGTCACCAAGGCTGGAGTGCAG 

TGGCACGATCTTGGCTTACTGCAACCTCTGCCTCCTGGGTTCAAGCAATTC TCCTG CCTC 

AGCCTCCCAAGTAGCTGGGACTACAGGCGCACGCCCCCACACCCAGCTMTTTTTGTATT 

TCTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGATGGTCTCAATCTTTTGACCTCATG 

ATCCACCCGCCTCGGCGTCCCAAAGCGTTGGGATTACAGGCATGAGCCACCGCACCCGGC 

CTCACTTCAAGAATTTTTTACAAGCACAGAAACTATATCTCAAGTGTATGATAAACTGGT 

ACTATAATACTATATTGGATTATAAATATACAAGCTATTTGAGGGGGGGTGATAGCTTCA 

CTAC CTTC AC C AAGCTTAGG AATATATATAATCTACTTTG 

Sequence 583 

ACTATAGGGCGAATCGGATCTCCCCGCGGNGGCGGNCGCCCGGGCAGGTACCACTTCTGA 

TGATGGAAGCAGTGACCTGGATCCCATAGAACACAGCTCAGAGTCTGATAACAGTGTCCT 

TGAAATTCCAGATGCTTTCGATAGAACAGAGAACATGTTATCTATGCAGAAAAATGAAAA 

GATAAAGTATTCTAGGTTTGCTGCCACAAACACTAGGGTAAAAGCAAAACAGAAGCCTCT 

CATTAGTAACTCACATACAGACCACTTAATGGGTTGTACCT 

Sequence 584 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTGAAAGAAGCCAGAC 

ACAAAAGGTCTCATACAATTCGATTTATATAAAATAGCAAGAATAGATAAAACCATAGAG 

ACAGAAAGCAGATTGGTGGTTGCTGGGGGCTGGAGGAAGGAAGGAATGGAGAGTATCTGC 

TTAATGGGATGGGGTTTCCTTTTAGAGGAGMCATTTTTTTGGAACmAM 

GTGGTTATGCAACATTGTTAACAATACTMGTGGCACTGAACTTCTCACTTTAAAATAGT 

TMTTTTATGTTATGTGAAGTTTACCTC 

ATATCCAACCATTAAAAAATNTMGTTACATAGCTCGTATATAATCMCTTACTTATTAC 

TGCACTATGAGGGGCCCCAGATATGTAAGACAGAAGTCCCAACC 

Sequence 585 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACATCCTTTCAA 

ATATTCTTTCTTTATTCTCCAATTCACTTTTCAGGAGAATAGATAACC TCAAT CATATTG 

ATTCTCAGC CTAATGGTCTCCCTTACATAMTTATAATG AC ATAAG C ATTTTGTATC ACT 

TTATTGTTACCATCGACCTCCTACAGTTTGGGGGAACCCAATCATTCTTTAAAACATTAG 

AATGACAACTCCAGTTGATTTGGCATATTGTATAGGTGAGTGGGGAAAGACTTGTCAAAT 

CCCAGAAAGGGGGCGTGGCACCAGCACCACCCAACCAGAAGAAAAATCTCTCTTTGGTGC 

TAGTGTCATAGCTCCCTATCTGTTCCGCTAACCCATTAAATGATGTGGTGGAGGAGGATC 

CTTTCACAATCTTTGTTCA I MM! I ACAGAGCAATGGGAAGTNNTACCTAACCTTTTAC 

NAAATTCA I MM IGGGT 

Sequence 586 

NCGGGGGGGGCGGGCCGCCGGGCCGGGTCCTTNAAAAANTNGAGGAANTTNTTAATGGNT 

CCCANTAGGGCNTTTNAAANCTNCAANCGGCTTTCTTTNTTNAATTNN AANG CTAATNCT 

CTNGCAAATCCAATAAAATCGGNCNTTrNNTATNCATTCCNGCTTTATATTTTNATATTT 

NAAANGANGCCNCCNGTAAATNCTATTTTNTTTGGCAAAATACCAAAAGGGNCAAAAAAC 

CTTTAAAAATTNCNGTAAANCATAANGTAGTTCTNAGGGGGAAANGTTATCATTCCCAAN 

GTCCTNGNGTNGTTCCTTCCTTAAT 

Sequence 587 

CCGGGCAGGTACTACTTTGAGGGCATTAAACAGACCTTTGGCTATGGCAGCACGCTTATT 

TTAGGGCCAGCGGCAGGGGACTCCAGCCTGCTGGACACTTCAGTCCAGCTCTCTCCTGAC 

TGGGGCTTGCGACTCACAGGATTGCATCGTCCCAGCTGCTAACTTGGGGCCGGGGCCCCT 

CCCTTCCACATATACCTTGGGTTTGTGCATGTTTTCTGCTGGGTGGGTTCAGAGGGCAGG 

GGACTCCAGCCTGCTGGACACTTCANTCCAGCTCTCTCCTGACTGGGGCTTGCGACTCAC 

AGGATTGCATCGTCCCANCTGCTAACTTGGGGCCGGGGCC 

Sequence 588 

CCGGGCAGGTACGTAGTAAGAGAAACATTGCCACAAAATCCTCTTTTAAAAGCTCTTTCA 

GGACACTAAATTAAGTCATTCATTTCTGAGATTTTAAAGGCCAACAATGTCTCTATTTTC 

AGACTATGTCACAAATGACATAAATGCCAAAATTATAGATCTTAAGAAGACAACACATAA 

AATACAGACTTAGACAAAACAAATGACTAAAGCTGCATAGTTCTGCTAGACTTTATTCAA 

AACTGGCTCTATTCTTAATTATTTGAGAGACCTAAAATTCTAGTGTTCAAATGTAAATAC 

CCATCATATACTGAAAAGTCATCTTCAAATTTGT 

Sequence 589 
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AGGTACCACTGAATCCAAGGCTCTCTTGGGTAGCCTATGTGCCTCTTGGATGGTATGTGA 

ACAAGGTAG G GATGAAACT GCAGATGTCTGAAAAGAGCTTAAGCATCCTGACAGCTTGCA 

rTTTTTTTTTAATTTTTTI I 1 CCTCACCGTATTCCACATACCTATGGCTGGGGAAGGTAA 

TGTTACCTCTGCAAACAGATGCTCTGGTAAGAGAAGAAAGACACAGGGAAGGCAGAGAGA 

GCAAGACAGAGGGCTCAATCACAMGCTGGGACAAGGAGAGCAGCTGTTTTTCACACATT 

CCTGCTCCTCTCTCAGCTGGGCTCATGTTCACtCCGTTTCATTCCAATCATCAG 

Sequence 590 

CCGGG CAGGT ACAGGACAGCCAGCGTCATCATTGCTTTGACTGATGGAGAACTCCATGAA 

GATCTCl l I \ ICTATTCAGAGAGGGAGGCTAATAGGTCTCGAGATCTTGGTGCAATTGTT 

TACTGTGTTGGTGTGAAAGATTTCAATGAGACACAGCTGGCCCGGATTGCGGACAGTAAG 

GATCATGTGTTTCCCGTGAATGACGGCTTTCAGGCTCTGCAAGGCATCATCCACTCAATT 

TTGAAGAAGTCCTGCATCGAAATTCTAGCAGCTGAACCATCCACCATATGTGCAGGAGAG 

TCATTTCAAGTTGTCGTGAGAGGAAACGGCTTCCGACATGCCCGCAACGTGGACA 

Sequence 591 

CCGGGCAGGTACACGGAAATCTGGACAGTGCTCCACAGATTGATACATTAGCCTTTGCTT 

TTTCTCTrrCCGGATAACCTTGTAACATATTGAAACCTTTTAAGGATGCCAAGAATGCAT 

TATTCCACAAAAAAACAGCAGACCAACATATAGAGTGTTTAAAATAGCATTTCTGGGCAA 

ATTCAMCTCTTGTGGTTCTAGGACTCAGATCTGTTTCAGTTTTTCCTCAGTTGTATATT 

GACCAGTGTTCTTTATTGCAAAMCATATACCCGATTTAGCAGTGTCAGCGTATTTTTTC 

TTCTCATCCTGGAGCGTATTCAAGATCTTCCCAATACAAGAAAA 

Sequence 592 

CCGGGCAGG TACGG TGTGNTGTCGATGAGCACGATGCAATTCTTCACCAGGGTCTTGGTA 
CCAGTGCCCCTTTT CAGAC A GANT TTGATTCGCTCTAGAC I I I I I M I M I I < l AATAGG 
GAAAAAA NTTGA TAATTTrTCTT TTTTTCT 

GGATTAA l l \ l l ATTNTGCTTNCTTTTTTTCCGCNTTTTCNTTNNCGCNCGAGNCCTGAA 

NGGNAAGAATGTTCCAGGGCAGGGGNAACCCACA 

Sequence 593 

CCGCGGTGGCGGGNCCNCCCGGNNTTNNAAANNNGGGGNNNTNTACATAGAGGTGAGGGT 
CATGCCCGTGTTTCAGCTCATNCAGTNCAGGGACTTCGCCTTGCCCCACCCATNATGGGT 
NAGGCCGGGAAGGGGGCCATTTTGGAAGCCAAG 
Sequence 594 

AGGTACTCTGAAGGTAGGATGTGCAGCTTTTTCTTAGGCAGGATAGATGTAACATAGATG 
ACTGCATAAAAAAGAGGGCAGAAAGGGCAGGAAGCCCA I rTTTTTTI I AACACTTCTNCA 
TTGTGTCTCTGCCTCACCCATAATGGCTGGTTTATTCAATTATTCATATATTCCCGTTTG 
TGTTTNCAANNNAGGTCTACACAGCACCAGCCTACAAAGGCAAGGTCCAATCCGTGTAAA 
AAAT 

Sequence 595 

CCGGGCAGGTACCGTCGCCCGGCTCTCCGCCGCTCTCCCNGGGGTTTCGGGGCACTTGNG 

GTCCCCACAGTCTNGGTCCTGCTTCACCTTTCCCCTTGACCTGAGTAGTCGCCATGGCAC 

AGGTTCTCAGAGGCACTGTGACTGACTTCCCTGGATTTGATGAGCGGGCTGATGCAGAAA 

CTCTTCGGAAGGCTATGAAAGGCTTTGGGCACAGATGAGGAGAGCATCCTTGACTTTTGT 

TGACATCCCGAAGTAATTGCTTCAAGCNCCCAGGAAAATC 

AACTNTTGTTTGGCAGGGGATCNTTNTGGGNTGACCTTGAAATCAAGAAANTAACCTGGG 

AAAAATTTG 

Sequence 596 

GGAGATTATGTCTACTTTGAGAATTCCTCCAGCAACCCATACCTAATAAGAAGGATAGAA 

GAACTCAACAAGACTGCAAGTGGCAACCGTGGAAGCAAAAGTAGTATGCTTTTATAGACG 

ACGTGATATTTCCAACACACTTATAATGCTCGCANATAAGCATACTAAAGAAATTGAGGA 

AGAATCTGAA ACAAC AGTTGAGGCTGACTTGACCGATAAGCAGAAACATCAGTTGAAACA 

TAGGGAACTCTTTTTGTCACGCCAGTATGAATCTCTGCCCGCAACACATATCAAGGGGAA 

A 

Sequence 597 

CCGGGCAGGTACTCATGGAAAAGGAGTTCCCTGGATTTTTGGAAAATCAAAAAGACCCTC 
TGGCTGTGGACAAAATAATGAAGGACCTGGACCAGTGTAGAGATGGCAAAGTGGGCTTCC 
AGAGCTTCTTTTCCCTAATTGCGGGCCTCACCATTGCATGCAATGACTATTTTGTAGTAC 
ACCTAGAATCTTTGTGAATAATGTGAGGCCAGTTCTTCCATAAGGAAGGCTGGTTATGGA 
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TATTCATAAGGTTATTTCAAAGTTAATAAAGACAAAGTGGCAACTGTAGAAAG7GTTGCC 

TCCAATCTTGGTCCGTATTTCCAAAGC 

Sequence 598 

NCGGNGGCGGCCGAGGNNNNNNNNNNTGGTTGAGGGGGAGGAAAAACCCGCCACCCNNGC 

GGGAGNGCAGNAGCACGACCNCANTTTACNGCAACCGGCAGCNCCCNCNCANAANAGANN 

CNCCNGCCNCAGCCNCCCAAGNAGCNGGGANNACAGGCANGCGCCANCACACCCANTTAA 

NNTTTNTNTTCCCAGGGGNGAGGGGGCCCCACCAAGAAGGCCAGAATGNANNTGNNCCCC 

NGACCGCAAGGGANACACCCACCNCAGCCNCCCAAAGGGCNGGGANNACAGGNGNGAGCC 

ACCGNGC 

Sequence 599 

CCGGGCAGGTACAAAGCTATAAAGGAACGTTTTTAGAGAAAGCACTGAAGACACACATTT 
TGCTG ACCTAAAAGATTTTAAAATGAATTAG AATAATTTAC ATC A7ATAAAGAG G TATTT 
AGTCTTTAAGTGGAGAAAGT 
Sequence 600 

NGNGGCGGCCGAGGNACNNCANAANTNNNTTCGGGGGNCANAAAAACCCCCCACNCCCCC 

NNNCNAGGACACCNCGNGGAAGTTTTTNAAAAGCCNCAGGGGGGGGGACCCAANAAAGNG 

AAGNGACCGNCNCNNCCAAGGGCCNCCAANANGNCCGANGNGANGGCAGCNGCNNTNNAA 

GANGANTNNTTNTNNNGCGGGGGNAACGNCCCCNANAAAAGAGGGACCNGANCGGNCNCC 

CCGCCCGGCGGAAGCACGAGGAGNNCNGNGNCCCCACGNCCACGGNNCCCGCNACNGNGN 

CCAACAANGNGNNNNNANCCGANNNCAGCANCAGGGANCNGCAGNCCAACGNGGAGCACC 

NGA 

Sequence 601 

CCGGGCAGGTACTGGTCTACAGGGACAAGCAGTCTTTAAAACGAAACTCACCTTCAGACC 

TCACTCTACGGACAGTGCCACACATAGAAAGATGACTCTGTCACTTGCAGATAGGTGTTC 

AAAGACACATGAAGATTAGAATCTTGCCAATGGCTGGTCGTGACCCTGAATGCCAACGCA 

CAGAAATGATTAAGAAAGAAGAAGAACGTTTGAGGGCTTCCATACGTAGGGAATCTCAGC 

AGCGCCGAATGAGAGAGAAACAGCACCAGCGGGGGCTGAGCGCCAGTTACCTGGAACCTG 

ATCGATACGATGAGGAGGAGGAAGGCGAGGAGTCCATCAGCTTGGCTGCCATTAAAAA 

Sequence 602 

AACNCCACCGCGGAGGCGGCCGAGGTACTGCATCAGTCGTGAGGAGGAGGGAAAACCC7T 

GGGATTGNAAGACTATTAAGATCCTAGNAAGTCNNTGGTAGNTNACNGGGATGNGACCAT 

NGAAGANCCCCCCGNNGNCACTAATCACACAAATAACAANNNNGAGAAGNAGGCTATAAA 

ACAAATTNAGGNTGCTNTTTTCNANNGGGAAANAAACCAAAAAAAAAACAGAAAAGANGN 

GAATCCCCCCCAGNNANGGGGGAAAGGNAAAGCAACACCAAANAAAAGCCCA 

Sequence 603 

AGGTACCACGCACAGGAGCCGTCACTTCTCTTGATGCAGGCGAAGAACTTGGCCTGGTGC 

CCGTTGATGTTCTTNTNTGTGACCCANTCCATCCAGAGGCACTCGTCCGGGGAGGAGATG 

TAGCACCGGGATCATGGGGCAGCTCGTGATCTTGCACTCGCAGCCCATNTGGTACCTGCC 

CGGGCGGCCGACCGGGCAGGTACTTACTTTCCCCAT AGGAAGACA GACCATAGGCAAACT 

CTGTTTTGGG ATCTNAACTCCATC ACCTTTGTTTCAATA 1 I I I i I 1 ( I CTCTCTTGAACA 

AAACTGAGATAATTTAN 

Sequence 604 

CCGGGCAGGTACAGGCACAGCATATATTTGAGAAAACATCTTACAAATTTCATTTACTAT 

AGGTTTCTCAATAATCTTTACATTTAATCAATGAGAAAAGTGATTCAGTCTC7TGAATTT 

TAAGTTAAAAAAATTAAAAAGTATTTCCAGGGACTCTTAAAGCTCTCTCCCAAAGTATAA 

AATATTATGTACTGTGGGGTCAGATTCAGCATACtTGAGGGAACGAAAGAC^rnTCGTTG 

GGGCGTGACCCGTTTGATGTTTGGATGATCTTCAAGTGTTAGCAGCCCCGCTTTCTGTTT 

TTCTTTTATCTGAA 

Sequence 605 

AGGTACAGACATTTTCAAAGTTGCCAGTGTTACTTTAATTGGACTGCCTTCGTAATTCAT 

TGCCTCTGCTTCAACAATGTGCAACTCATCCTTTGCACCAGCCCCTAAACTGACCGTTCT 

TAAAGATAACTGGTGCTCATTTTCATCATTATCCACCTTAAAGTGATAATCTTTGTCGGC 

CTTTAGTTCACAACCGAAAAGATAGTTCTGGGGCCTCAGGGGGCTCATGTCCATGTCCAT 

CACAGAATTCTTTAATATAAATACGAACGGATGGGGATAAATAATATTTAAAACTTAGAT 

TATTCAATGCTTGCAAAAAA 

Sequence 606 
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CCGGGCAGGTACATATGATCCTTAGCCACCAGGGCACAAGCTTACCAGTAGACAATACAG 

ACAGAGCTTTTGTTGAGCTGTAACTGAGCTATGGAATAGCTTCTTTGATGTACCTCGGCC 

GCCCGGGCAGGTACCTTAACATTCACATGGAAGTAGTAAAAATAAGATTCTGGGTGCAGT 

TCTCCAATGACAGGAAAAAAAACAAAGAGAATTTGAAGAATACCGTCAGAGACAAATACA 

TTACAACCAAAATTGAC7TCAAGGCACTTTTGAAGGAGATCAMTTTATAACAAAATAAT 

TTAGTGAAAGTGAAAGCTT 

Sequence 607 

CCGGGCAGGTACCTGGACCTGCTGTCCCAGCCCTGCCGCGCTGTTTACATCTTTGCCAAG 

AAGAACGACATTCCCTTCGAGCTGCGCATCGTGGATCTGATTAAAGGTCAGCACTTAAGC 

GATGCCTTTGCCCAGGTGAACCCCCTCAAGAAGGTGCCAGCCTTGAAGGACGGGGACTTC 

ACCTTGACGGAGAGTGTGGCCATCCTGCTCTACCTGACGCGCAAATATAAGGTCCCTGAC 

TACTGGTGCGGCGCACCCTTCAGGGTCTTGAGATTGAGCTGCAGTCACAGCTGAGCATGA 

AAGCTGCCTTGGAAGACACACTGGC 

Sequence 608 

CCACCGCGGTGGCGGCCGAGGTACCCAGGATGNTTTCTGGCAGGAGGGAGCNTGTGTTCC 

TGTGACCTGTGACCCACCTCCACCAAAATTNCATGGGCTCTACCAGTGTACCTGCCCGGA 

CTTCCTCTCCTACCTACAGTCCCTGCTCCTATTCCCACCTCAGCAACTGAAGCCGCCCCC 

AG AGCTCTG CCTTGAGG ATATAAAC ACATTTAAATT ACTGTCATATGCTTCCT ATTG CAT 

TGAGCATGGTGATCTGGAGCTAGCAGCAAAGTTTGTCAATCAGCTGAAGGGGGAATCCAG 

ACGAGTGGCACAGGACTGGCTGAAGGAAGCCCGAATGACCCTAGAAACGAAACAGATAG 

Sequence 609 

AGGTATTAACACATGAT GGAA AAGTCATTGTGACGCCAATGMTTTCATTGAGTATAAAC 

TCATCTACTTCAAATTTATTTTATAAGACAACCTAAGATACTCAAGATMTTATTTAATG 

GTTAGCTCTTAAGTTGMTTGGTCTACATAATGCGTGGGMGAAAACCAGATTTTTAGCC 

TTCTTGCCAAATCCAGACCTCTGGTTGATTTTTCTTTGACAGAAGATGCAAGTTATTTTC 

CAATTTCACAATTAAATGTATTTAACATGAACATTATTTTGCTTTAAAAACTAT 

TGTAGGAGAATTATAGCCAGTCTTCAGTTATAACCACT 

Sequence 610 

AGGTACTTCGAGGTCTTCGGCGAGATCGAGGAGGCGGTGGTCATCACCGACCGGCAGACG 

GGCAAGTCCCGGGGCTATGGATTTGTCACCATGGCTGACCGGGCTGCTGCCGAAAGGGCC 

TGCAAGGATCCCAATCCCATCATTGATGGCAGAAAGGCCAACGTGAACCTGGCATACTTA 

GGAGCAAAACCAAGGATCATGCAACCAGGTTTTGCCTTTGGTGTTCAACAACTTCATTCA 

GCCCTTATACAAAGACCTTTCGG 

Sequence 611 

AGGTTTTGCAGCTCGTGC CATC ATTTCAGAGCTGGTGAGCATTTCAGAACTAGCTCAACC 

ACTAGAAAGTGGCACCCATTTTCCTCTCTTCCTACTTTGTCTTCAGCAGTTAGCTAAATT 

ACMGATCGAGAATGGTTAACAGAACTTTTTCAACAAAGCAAGGTCAATATGCAGAAAAT 

GCTCCCAGGTAAGAGAATGCTGACTTGTTTGTTTTG I I I IN I AATATTATATCTAGAGA 

TTTCAGGTACCTGCCCG 

Sequence 612 

CC GG GCAG GTACANATGNGN N GTGTCTN N CAACTTTCATN AGAAAANG CC ATATCTATAC 
C AT ATTTT ATTCGGAGTCACTGAN GATGTAATGATATATCTTTTTC ATTATTATAG C AG A 
ATA l l I ( I ATGGCANGATATTTAGANG NCTTGANCATNCCTATTAAAATANTGCCAAACA 
CCAAATATGAATTTTNTGATGTGC N N 
Sequence 613 

AGGTACAGAACTTGGTCATMTATCTTGCATTTTATAGATTTATTAAAGATTAGTTTCAA 

GTTCACATTCGCTATTCAGTT GTAAA CCGAATGGATGGGAGGGGAGAAAATATAAGCTCT 

CCACACAGGTATGCTCCTCTCTTTTCTGAGAGAGAAGGCATGGGATTTTCAGCATAAATT 

CCATGTTATGTGAGTGCTGTTTGAGTTCTGAAGTTCCTATCAATATCTGTTCCTGCAAGT 

GATCTCTGTAAGACCACCTTACATGCTGGTCTTAGTTATTGTTAAAATTGCAAGGTTTCT 

TCACACCCTCTTTGATAAGAAGTGTTTAGCTGGCAGAGC 

Sequence 614 

CCGGGCAGGTACCTCTTGAGTGTCCCTTGAATTGGTAACCGGTTTGTTGTCATTATTTTC 
TAAGCGAATATGCCGTAATfGGTTATTGGGAACATCTTTGACAAAGATCCATTTAACTTC 
AAATTTGCCCTTCCACTTATCCTGAGACCAGACACCAGCATACGCATTATAGTCCACAAC 
AGACTTCATTTCAGCCACTCCACAAAAATGTCCACTGCCATTCACACTGAAGAGTAAATA 
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GAGTGGGCCTTTCCCATTCAGGGAACGGTAAGCTGCATCCAAACGCTTATTACCATGCTC 

AGTACCT 

Sequence 615 

CCGGGCAGGTACTTTTGATTGTGCACGCTTTTAAATAGAGAGCAGAGTTGCCCACTTGAA 

ACTACTCTCTTGCATGGGATATTTCAAGCTGTTTTACTATGGGCAAGGAGCAGGGACCAA 

AATGCTGCCAGGGCTTAAAAAGAGCCGTGATCAGATTAAACAGAAGTTGGAGAAGTGGAG 

GGATGTGGGCCACACACGAGAAGAAATTCAGGAAGTAAGAAGTAAGAGTGACCCTATTAT 

GCTTCTCAAGGACAGGATGGTGAACAGCAATCTTGCCAGTGTGGGAAGAACTAAAGGAAA 

TTGATGTGGAAGTGAGGAAGGAGATTGAGGATGCTGCCCAGTTTGCCAC 

Sequence 616 

CCGGGCAGGTACATGTACATTATAAAAAAAAAAGTTCAACTAGTATGAAAGGGTTATAAA 
GTMC AG AAAAAAAAAAAAAAAAAAAAAAAAGGANTTAATCACAGTGGAAGCTTGC AC GG 
TGGGCCACG 
Sequence 617 

CCGGGCAGGTACAAATGTGAAGAAAGCTTTGTGAAAATTCCTGGCGAGAAGGACTCAGTG 

ATCTGCCTTAAGGGCAGTCAATGGTCAGATATTGAAGAGTTCTGCAATCGTAGCTGCGAG 

GTGCCAACAAGGCTAAATTCTGCATCCCTCAAACAGCCTTATATCACTCAGAATTATTTT 

CCAGTCGGTACCTCGGCCGCCCGGGCAGGTACCCAAGGGATGTTTCCAAGCTTCTTGATC 

TTTAATCTTTTAGCAACCTTCTGATACTGGATTTACCTTCCTAGGTGTGGTGCCCTCCTT 

CTGGAATTTAGATCTCTGGGGCAAACCCTGTCAGGAT 

Sequence 61 8 

CCGGGCAGGTACCCTGTGGCAAAATTAGTCACTGGTAATGAGAAAGATATCACTGAAAGC 

CTCAAGTGCCTGAGAAACAGTTTACTCATCCATGGGATCTCGCCAATTGTGAGGAAACAG 

CTCAATGGCCATTTCCAGTTATAAGCAGCTTA I I I I IACTGATTGGACCTGGTTACCTAT 

CATTTCTAAAAATAACTTCTGATACAATTTGTACTTCCAATTTATMTGAATA CTTTC T^ 

AGATTTTAGGTAGGAGGGGAGCAGAGGAATTATGAACTGGGGTAAACCCATTTTGAATAT 

TAGC ATTGC C AATATCCTGTATTCTTG NTTTAC AT 

Sequence 61 9 

GGCGGCCGAGGTACTCAANACATCTTTCAATAAAAATGGGGNTCGGCTTCTCTGGAGTCA 

CGGACAGTTGTTAAGGGCACCCCAAAANAGTTACTCTNCCAGGTTGCCTTTGNGGTGGAT 

GGCCGGNGTTTGGGCTTTGGTTTATTAGNGTTCCTCCTTAGGCTGCGAAGAATATTCCTC 

NCTCCGAGGGTCTTCGNCTGTTTCGTATGGAATGACAGCATTGTCCCCTTTATAACCCTG 

GGATGCCTGATCCTNCTCTTTCTTNCGGATGGGCCCCAGCTCATCATCACTCC 

Sequence 620 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACTGAACAAAG 

GCATAGCCCTTGTGCACAGAACAGCCGGCCACACGGCCATACTTAGAGAAGATGGTCTCC 

ACATCTGATTTCTTCACCAGAGCTGTGTTGAGGTTTCCAATGAAGACTCGAGAGTTGATG 

GACTTGGGGTCATTCTTGTTGGTTACATTGCTTGCCTGAAGCTTCAAGGACATGGTGCCC 

ACTTAACAAACATTTCTTGAGTGTCCTCCTCTGTGCTGGCACTGTGGAGATGCTGGGAAA 

CACCGTGCTCACCAGTCTTCTGCTCTACTTGCTTTTCAAGAAGCCCCCTCTCTGGGAACC 

AGTAACAATGATGAGCTTAGCCAGCTGTTTCCTCCT 

Sequence 621 

TTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG I I I I I II M I I M ! I I I I I I I 

1 1 1 1 1 n 1 1 1 1 1 n 1 1 n 1 1 1 n 1 1 1 aacaaan agtttgattttattgatattn aaatat 

ATNAAATNTTTCACTGAAANACATGGTTNACCATCCTCCCCCACCCCCACAGGGGTTACN 

TTNTAAAACCAAAGCCCNCGGCCTCCCACCTNCTGACTNCTTTACCAACTGGGNGAGGAA 

AGGGACAATGGTNNCCCANGGGAAGGGCATGGNTGGCNCTNNGGNACCTGCCCGGGCGGC 

CGNTTTAAAACTAGGGGATCC 

Sequence 622 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGAGAGCACA 
ATTGGAGCGGCCTTCCTCACACAGACTGTCTGCCTGGATGACACAACAGTCAAGTTTGAG 
ATCTGGGACACAGCTGGACAGGAGCGGTATCACAGCCTGGCCCCCATGTACCTGCCCG 
Sequence 623 

CCGCGGTGGCGGCCGAGGTACATTATATAAGGGAI I I I I 1 1 I AAGTTGAAAACAACTTTC 
TTTTC1 1 1 I I GTATGATGG I I I II I AAC ACAGTCATTAAAAATGTTTATAAATCATAAAA 
AAAAAAAAAAAAAAAAAAAAAAANAGGGGGGGACCTGCCCGGACCTGCCCN 
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Sequence 624 

?fp l ( :?? TGGCGGCCGAGG ' " 1 1 1 "" 1 1 1 1 1 H H 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 i i 
iilL 1 1 1 "" " 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 
T^NNNNAAAMNNNNNTTTmTTTNN 

™™ CCC ^ CCCNNAAAAGNTTTT ^ 

TTNGGGGNAAAANCNNNNTCCCCCCCTNN U II II I I 1 I I I ' VWVWI NuyK3fa INNiTTT 
Sequence 625 

Sequence 626 

ATGTGACCAGTGCTATGATACAAACAGGATGCTACAAGAGTACCT AAGGCA 
Sequence 627 

GG ^ G ^I? GACTCCCCGCGGTGGCGGCCGCCCGGG CAGGTACA7TrCGAGCAGCGGAAG 
Sequence 628 

GGGG ^ GAGGTAC ^ G ^ AT CMCACATTTGTATCAGAGrrGCTmCTMTCTTGTTA^ 
CTGCTTATTCTAGGTCTGTA^ATTAACTGGCTACTGGGGAAATTT^ 
CTGGATCTAATCTGNNATTTTTTCATTTNAACCTCCA 
Sequence 629 

AG ^N^NAGNTGTACNNATCGCNGNNCNTAAAGANNTNCCCNNACCACNTCCTNTT 
GNGCTTNTTTATTNGNATTGnNGMCANTCTNNNNTNTGNGANTCNTTACA^ 

CA ^IS^I AnGCmAGGACTCTGGGAAGCNGNTG ATAAAAAGATAACACGAGCTAC 

ItgItcS ANATGmNCTMNC ^^ GGAG ^ TAT ^ 

Sequence 630 

GGGCG ^ GGGGCCGAGGTG C ' I I ' I I I I I I I I I I I I [ I I I | | | I I I I I 1 I I NCCAAAAT 

; A ™JI!3 A I rCAAAACGCTGGGGTCAGAG 

GA £ GG IT AGA ™ TGCTG ^ 

A J GG ^ A PJSI AAAGG ^ NGGATATATGTATNGNAA ^ AGAA ^GT^ 

TTCATTNGNGCCAAAAATATTAAAAANG 

Sequence 631 

£a GGGCG ^Z GG ^ 

TAAAGCCTGCTGATCTTAAGAAGAGAATAGAATCTTTAATTGACCGGGACTACATGGAAA 

GAGATAAAGAAAATCCAAACCAGTAGGCTGGAAGAGGATCTTGGGCGCCGCCAGTCTTTA 

GCACCAGTTGGTGTAGGAGTTGAGACCTACTTCACAGTAGTTCTGTGGACAATCACAATG 

GGAATCCMGGAGGGTCTGTCCTGTTCGGGCTGCTGCTCGTCCTGGra 

I GAGG ^ A I AGCCTGCAGTG CTACAACTGTCCTAACCCAACTGCTGACTGCAAAACAGCC 

gg^II^ 
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Sequence 632 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAGG 

ATC G ATAAAAG AGGC AAAGTAAAAGGGAC CCAAGAGATGAAGAATAATTACAATATCATG 

GAAATCAGGACAGTGGCAGTTGGAATTGTGGCAATCAAAGGGGTGGAAAGTGAATTCTAT 

CTTG C AATG AAC AAG G AAGG AAAACTCTATG C AAAG AAAG AATG C AATG AAG ATTGTAG C 

TTCAAAGAACTAATTCTGGAAAACCATTACAACACATATGCATCAGCTAAATGGACACAC 

AACGGAGGGGAAATGTTTGTTGCCTTAAATCAAAAGGGGGATTCCTGTAAGAGGAAAAAA 

AAACGMGAAAGAACAAAAAACAGCCCACTTTCTTCCTATGGCAATAACTTMTTGCTTA 

TGGGATATAAAGAACCAGTTCCAGCAGGGAGA I I I I I I I AAGTGGACTGNTTTCT7TCTT 

CTCAAAATTTTCTTTCCTTT 

Sequence 633 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCGGGACCGGGGCCGCTG 
GGGTCTGGACGGGGGTCGCCATGATCCGCTTTATCCTCATCCAGAACCGGGCAGGCAAGA 
CGCGCCTGGCCAAGTGGTACAATAAA I M M II I GGTCAAATTTAAAAAAAAAAAAAAAA 
AAAAAAAAAAAGGAGGGGGGACCTGCCCG 
Sequence 634 

GGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGN 111 I I I 1 ! Ill I 1 U i I 
M M I I I I I I I 1 M M 1 I 11 H I 1 I 1 N I ! I 1 I II 1 I I M I I 1 1 I I I I I I 1 i I I 1 I I I M 
iTTTTTTTTTn 1 ATTAAAACCCCNNCCNAAAAAAAAAAAAGGGANCCCCCNGNNNAAAN 
NNNAAATTANAANAAAAAANTNAAAANCCCCCCCCCCANNCCCGGGGGGGGGCCCCCCCC 
CCAAAANNTNNNTTCCCCATAAGAAANNNNNNAAAAAAAACCNCCCCCCNGGGGGGAAAA 
AAAANGGAAAAAAAACNNNTTNTCNNGNGNGAAAAAAAAAGGGNTTANNNCCCCANAAAA 
GGNCCNAAAAAAAAAAAAAAAACNACGGGGGGGGNAAAAAAAAAAATANANAAAAAGCTG 
NNGGGGGGGNNCCCCCAAAAAGGNGGGCCANCNCCCCNNNNCNAAAAATTTGGGG 
Sequence 635 

CACTACTATAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAGTCGGTTAA 
TTTACACTTTA \ Mill I AAAAAGTTGATTTAAAAAAGAAACAACACAAGTTTAGAATCC 
ATAAAATGTCAGCAATGCTGATGTGCACTGGACTGAAACATCTTGATCATCTTCTGATAG 
AAGTAATATTCC ATACAAAAAGATTCTTAGATTCCA 1 I I 1 I I GCTTCATTATTGTTTGTG 
GCTTGCTTTCTTTGAGCAATAAAGGGGTACCTGCCCGGGCGGCCGCTCTAGAACTAnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTAATTGCGCGCTTNGGCCGTAA 
TCATGGGTATTAGCTGTTTCCTGTGTGAAAATTGT 

Sequence 636 

CTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGC CGAGGTACAAGC1 1 i U 1 I ( I 1 M I U" 

iiiM i iMiiinmi i iiiiuinuiiiiHiiimmimi niunni 

TTTTNCCAAANNNNAAAAAAAANCCCCCCNCCCNTTTTTNANTTTTTTNA/^ 

GNCNTNNNAAAACAAANCGGMGGNTNNAAANTNCCCCCTTTNACNTTTCCNAANAAAAA 

AAAAAAAAANTTNCNNNNCCCCAAAAAAAAAAANCCCCCAAAAAAAAAAACCCCCCCGGG 

GNNAGNCTNAAAAAAAAAAAAATGGGGGGCAAAAANANNNCC CCNAAAA AAAAAAAAAAA 

AAAACNCCCCCCCANAAAAAAAAAAAAANNGGGGNNATAAANNTTTTTTACNCCCAAAA 

Sequence 637 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAACACAGAT 

GGGCACACTCTAGATATAGMCACGTTTCTTGCTGTGGTTGTGTGACAAATGGCTAGAAT 

TTTAGGAGGCTCTGGGACTGCACACTGTATATGCAATCCATCTCAGACTTTGGGATGGGA 

ATCCATTTCATCATCCAGCATGGCGAAACTTGGTTCATTCGCCCCAATGTGAGTCCTTCC 

TGGGTAAAATTAAAGGCAGACCGGNTGGGTGCTAAGGATCCCTTGGTGTGGAGAGGCCAT 

TTCCGTTGAGCAGCTGGACTGTGAAATCAATACCAAAGGGGATGGTGGGGTCTGGGCGTT 

GGGGAATTCTTTGACAACTTTCTTGTCCTTCCTTAACCATGGGCTGTATCCCCNGGAGGG 

TCTTTCCGGCTTGTGGTTCTTGGGCCTTGGGGGGGTC 

Sequence 638 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGCCGCTGGGCCTGCAGGTCTCT 

GTCGAGCAGCGGACGCCGGTCTCTGTTCCGCAGGATGGGGTTTGTTAAAGTTGTTAAGAA 

TMGGCCTACTTTAAGAGATACCAAGTGAAATTTAGAAGACGACGAGAGGGTAAAACTGA 

TTATTATGCTCGGAAACGCTTGGTGATACAAGATAAAAATAAATACAACACACCCAAATA 

CAGGATGATAGTTCGTGTGACAAACAGAGATATCATTTGTCAGATTGCTTATGCCCGTAT 
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AGAGGGGGATATGATANGTTTGCGCAAGGGTNTGCACACAAACTGCCAAATNTGGGNGAA 
AAGNTTGCCCTMCAANTTTTGCNGAANATTTTGTCCCTGNCCGGGNGGGCCNTTTTAAA 

AAACTAGGGGGGCCCCCCCGGGGCNGGNGGNAATTTTNAANANAAANTTTTANNNGNCCC 

CCCNCCCCCCTNGGGG 

Sequence 639 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAAAAAAAAAAAAAACCTCGGCCG 

CCCGGGCAGGTGTCTTGCGCCACCATGCCTGGCTMTTTTTGTATATTTAGTAAAGATGG 

GTTTTCGCCATGTTGGCCAGGCTGGTCTCAAACTCCTGGCCTCAAGTGATGCACCTGCCT 

CAGCCTTCCAAAGTGCTGGGATTACAGGCATGAGCCACCGTGCCCGGCCAAAGTCAGCTT 

TCAAAATCCAAGCCATMTTGGTGAGGGGGGAGTTTCAGAATTACATATAAAAATTAATA 

TTTGAAAAAATMTTCTGAMTTTCGAATTTAAAMCAGATGTGCTGCTTCTGGGTG 

GTAGTAAAAGTNTAGGAAAAGGAACTGTTTTCCTTAGAAGCGGACTGTGGAANGCCTTAT 

GTAGANTGTCAM GGGCA NCANGAGCCCGNGTTTTTAAATGGCATTACTGTACCCTTCGG 

CCGGTTCTAGANCTTTTTGNMNACCCCTTTACNTNCCCNAATTTTNATTNTNAAAGGCT 

TTNTCGANTCCCTCNTANTTCGNGGGGGGGGGCC 

Sequence 640 

GGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCGAGAAAAGAGCTAGGGTAGGC 

AACTAAAACTTACACAGTGCCAGTCTCAGGAGGTCAGTAGCTCACAGAACTCAACAGATA 

AACTGG ATTA AAACTTAAAAGTCTTCTTTCTATTTGAGCCCATAATGACTATTTTGAACA 

TGGCTCTTTTGCTGCTGCCTATATATAAATTTTTTATTAATTTTCTTGTATTGGGM 

CTTGA ATACGCTCCAGGATGAGAAGAAAAAATACGCTGACACTGCTAAATCGGGTATATG 

TTTTTGCAATAAAGAACACTGGTCAATATACAACTGAGGAAAAACTGAAACAGATGTGAG 

TCCTAGAACC AC AAGAG TTTGAATTTG CC C AGAAATGCTATTTTAAAC ACTCTATATGGT 

TGGNCTGCTGTTTTTTTGNGGGAATAATGCATTCTTGGCATTCTTAAAA 

Sequence 641 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCTCAGGGGTGGTCTGT 
GGMGCCTTAAACTCTCCACACTCAGAGTCCTTTGTTTCCCCAGAGGCTGTTGCAGAACC 
TCCTCAGCCAACGGCAGGTATGTCTCCCTGCAACACAGCTGTTGTACCTGCCCG 
Sequence 642 

TACTTAGGGCGAATTGGAGCT CCCCGCGGTGGCGGCCGCCCGGNCAGGNACAAGCTNTNT 
rTTTTTNTTTTTTTTTTTTTTTl CGGAACCGTACCAGAAAATTTTATTAAAAAATTAAAA 
CTATT 

Sequence 643 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTCAACATGCTAGA 

GGTCCAGCATTTCACAAATACGCCATGCAGTTACATGAGGACTGCTACAAATTTTGGAGC 

AATGGCCATCAGCTCTGTGAGGAGAGGAGTTTAACTGATCAACACTGTGTGCACAAATTT 

CACTCATTACCTAAATCAGGAGAAAAACCAGAGGCTGATAGAAATCCGCCTGTGCTATAT 

CACAATAGCCGAGCTCGATCTACTGGTGCTTGCAACTGTGGAAGGAAACAAGCACCTCGA 

GATGATCCCTTTGATATCAAAGCAGCCAATTATGACTTCTATCAGCTTCTGGAAGAAAAG 

TGTTGTGGAAMTTGGATCATATCAATTTCCCAGTATTTGAACCAAGTACCTGCCCG 

Sequence 644 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACTTCAGGGCTGT 

GAGGAAGGGGTGCCTGGAGTTCTGCAGGACGCGGTTCTCGGTGAGTGTGTGGGCCACCTC 

GTCCTTGGCCACGATGACTTCCTTCTTGAGGATCTTCATGGCCGTAAGTAGCGGCCTGTG 

GCCTTCTCCTTCACCAGGATCACCTTGCCGAAAGTGCCCTTGCCCAGCAGCTTCAGGTAC 

CACTCTTCCAACTCCTGAAGGAAGCTGGCTGTGCCCGTTGACTCTGACACCTTTCGAGTC 

GCCAGTTTGCAAAGCCGCTCAACAAAGCCAGGAATGCTGAGAAGGGCCGCTTGATTGATT 

TCAGCTGCAGMGGCCCCAAACTCAAGCTCTTCTTCTGTTGAGCATTTTTGGCAAGAAGC 

GTGTGCTTGTCATCATTCCCTGTATCCATCTCTGAAAATGGTGACAGCCAAGAATCCGGC 

AGAAATTAGCGAGCTGCTTGCTGATCGAAATTG 

Sequence 645 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACGCAGCAACGCGGAG 

GAAACGGGAGTGAACGGAGAGCGTAGTGACCATCATGAGCCTCCTCAACAAGCCCAAGAG 

TGAGATGACCCCAGAGGAGCTGCAGAAGCGAGAGGAGGAGGAATTTAACACCGGTCCACT 

CTCTGTGCTCACACAGTCAGTCAAGAACAATACCCAAGTGCTCATCAACTGCCGCAACAA 

TAAGAAACTCCTGGGCCGCGTGAAGGCCTTCGATAGGCACTGCAACATGGTGCTGGAGAA 
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CGTGAAGGAGATGTGGACTGAGGTACCTGCCCGGGCGGCCGAGGTACCTGCAGAAGGCCT 
ACAGGGTGCCAGGCACTTCTTTAATGTGTTCTTTCTTTATGTGATTATTTGATTAATCTe 
TGCCTCCCCCACTAGGACTGTAAGCTCCCTGAAGGCAAGAATCCTGTGCTTATGCTCAAT 
ATT AG CTC NTCCCTTG GC ACAG AGTAG G C ACTTCAAC AAATG CTCCC C A 
Sequence 646 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATCTACAAAGA 

CCTCTATTACATAAACACTGCTTCTTACAGGAMCAAACCTCTTCTGGGATCTCCTTTTG 

TGAAAACCAGTTTGATGTGCTAAAAGTAAAAAGTCTATTTTCCAGTGTGGTCTTGTTCAG 

AAGCAGCCAGATTTCCAATGTTG I I I I ICCCCTCCACTCAGAAACCCCTGCCCTTTCCCT 

TCAGAAAACGATGGCAGGCATTCCTCTGAGTTTACAAGCAGAGACTCACTCCAACCCAAA 

CTAGCTGGGAGTTCAGAACCATGGTGGAATAAAGAAATGTGCATCTGGTCTCTTCTGTTG 

MM IATTTCATATCAGATTAAATTTCTTTACCATGTTGGCTAAGTCTAAATATTAGAGA 

TGAGGGCTGTGCCTACTCCCTGGCCAGCTCTNCTGATAGCCTATGATGGGGTTCCAATGG 

GGAAATGACTCTTTACTATTAAAAGACAAGGGAAAGGC 

Sequence 647 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGATTACAG 

GCGTGAGCCACCATGCCTGGCCAGAAATCTATGTTTTCTTAGAACATGTGGAAGAAGGAA 

AAAGACAAAAAAGGAAGTCTGGATTCTGAGGACCACGTCTCACCCAGGGTGACATCAGGA 

ATGGGTGCTAGCCTCTGCAACACGACACCCAGTCTGAAGAAGCTCTATACAGGGTACCCT 

NGGNCCGTTTTAGAAACTTAGGTGGGATCCCCCGGGCTTGCAGGAAATTCGAATATCAAA 

GCTTAATCGATACCGTCGAACCTCGAGGGGGGGGCCCGGTANCCCAGCTTTTTGTTTCCC 

TTTAGGG 

Sequence 648 

CCGCGGTGGCGGCCGAGGTGTAGGGTGTTATCCCTGTGACATTGTCTCTTTAGTTTGCTC 
TTTCMGAGATACTTACAGATGTTGAGATGGCTGCCCTGCATTTCCAGCTAATCTCTTCT 
GCTCTAAATATTTAAAAACAGTTCTTCTCAAACATTTTCATTCAGATAGCTTTCTGAAAG 
TTCCCTATCCCTCTTTACCATAA I MM 1 AAATGTAGCCACATTGTAATAGTAAACTTCA 
TATATAATGAGTGCTTCATA I I \ I i GTTATGGGAAAGCAATATATTATGCAGCCAGTCTG 
TAGAAACATTCAGATCCCTCTTCCTTTACTCAAATACAGTTTCAAAA GGAAG ACTCATGA 
GAAATTTCATAAAATACAANG I 1 i I I AGATGTTTATGCTTTGCCTTTCTTTTTAAAGGNG 
NTTTTCCTGCTNTGGAGNCNCTAACTCTGGAAAATTTAAAAATATGTAAACNAAAGGGGG 
GTTT 

Sequence 649 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTATACATG 

CAAAACTCTTAGCACCTGAATTTGTAAACATTTACACGTATCCGTGTATTCCAGAATATG 

AAGAAAAGGACCATGAAGTTGCACTCATTTTTCCTGGACCTCAGTCTATCTCAATAAAAG 

ATATTTCTTTTCATCTGCAAAAAAGGATTCAAAATAATGTTAGAGGCAAAAATGATGACC 

CTGACAAGCCATCTTTTAAACGCAAAAGAACTGAAGAACMGAGTTCTGTGATTTGAATG 

ACAGCAAGTGCAAAGGCACAACACTGAAAAAAATTATATTTATAGATAGCACCTGGAACC 

AAACAAACAAAATATTCACTGATGAGCGACTTCAAGGGGTTGTTACAAGTTGAGTTGAAA 

ACAAGAAAAACTTGCTTTTGGCGCCATCAAAAAGGGAAAGCCAGATACTTTCCTTTCTAC 

AATTGAAGCCAT 

Sequence 650 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGAGGGGCTGCTGGA 
CAGAGCGGTAGCGATCAATGCACAGGATGAAGACACTGAAAATGGACGCTGTGCTGGCCA 
CATAGTCCATGGAAAGCCAAAAGAGGCAGAGAGGACGGCCCAGTGACCACTTGACCTGCC 
CGGN 

Sequence 651 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTCACTAAAAGA 

TTGAGACCCCCTCCTAGGACTATAGAACCCTTCCTCTCCCTCCATATTTTATCACCACAT 

TACTAAAGGCCTGTTTGCAGCAGTTCCTCTTAACTTGTGTGTCATGTCCAGTAATCAAGA 

AAAAAATTAC AAG AC ATATTAAAGGC AAG AAAAAAC AGTTTG ACG AG AAAGAG C AAGTAT 

CAGAACCAGAGTCAGATATGGCAGGGATGTTAGAATTATACACCAAGTGGGATTTATCCC 

AGGTATTCAAGGCTGGTTCAGCATTCATTCAAAAGTCAATTAAGGTAGTCCATCATCACA 

TCAGCAGGCTAAAGAAGAAAAATCATGATCATAGCAGTAGATGCAGAAAAAGCATTTGAC 

AAAATCTAACACCCACTTATGATCTCAGCAAACTAGGAAGAGAGGGGAACATCCTCAACT 
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TGATAAAGGAATATCTACCAAAACACCTATTAGCTAACATCACACTTGATGAAAAACT 
oequence 652 

™ GGGC ^"? GG A^ 

V^l^^ a ^ gagtgactgcacat ^ 
GG A^ C IT C T™ TATA ^ 

CCACCMTCCTAATAA^^ 
Sequence 653 

I^f^^^^G^GGTATGTTTCATGGACTGGAGACTTTG 

AAO I U I GTTT AAATAAACG 
Sequence 654 

SpaYp^a^ 
G ^rr^ G i A ^^ G ^ 

^ G ^ GGA ^^ GG ^ G ^GATAAGCCATCAGATCTTGTGAAACTTATTCACTATCACAAGAATA 

G J$ GG ^VPJ GG J AGATAC ^ 

TATCATrCTACCCCTGGCCCCTTCAAATCTTATGTCCTNACTATTCAAAA 
Sequence 655 

pIaaI^t 0 ^ 0 ^^ 

lf A J AGGGG J™ 

GA ^HT^ CCATGGGTGTCCCCACATACT GTAATCTTCTCTGTCTCmGAGTGTGGTTTC 

AGGCCTGTTGAi nu iCTTCGTTTTATGTTTTCCAGCAAGGATATCATAAGGACCAACT 
Sequence 656 

^^ A ^S^S^ GGAGCTCNCCGCGGTGGCGGCCGCCC GG 

TGGAACTCTGGGGCCCACTGAGGCACCATTATTGGGGATTTCAGGGTGGGCTGGGCACTG 
CA 

Sequence 657 

G ^ G A G i^TS GGGGG ^ GGGGGGG ^ AGG ^ A ^ G ^^GGGAMCAACACTTGGTTAGTCTCTT 

£ AAG JI AC ^ A /^ GCCAA ^ G ^^ 

ACCCACCAATATAAATTTAATTAAAGATATATGTTGTAAAAAAAGAAAAAAAAAAAAACC 

II GGC ^ GCCCGGGCAGGTACTC ' n ^TCCCACTCTGCGGTTCTGCCAAGCACCTTGCCTGG 

GCACTTGTGTGTGTGCAAGACTCTACAGTGTGGTGTCTGCTGCTTTTTTTTTTTTCTTCT 

Sequence 658 

AG i?i'i' ' ' ' ! ' 'GTCTGAGTCCTCTGACGTCCTGCCCAAGNGAACAGCTGCGGCAGCCC 

I^ A ^n GNAACAAGGAGAACCCACC ^ GAAGC ^ A "^NCCTANCCCNTNAGCCCACCT 

I G ?P G M GN ^ AANnACCCTG GGGGG ACCC CTTNCTTGCCCNATNGCCNTTCCANTrrA 

S AAAAAAC ^ rCCAAATTTGCN 1 I ' I I 1 1 I I I I I I I GGTTCCCAAAANNTAAAAAAACN 

CTNAGGCTTTATNCNTCTTGCCCNANNNTTTTANTCNGNANTATTTGTATTCNANAAAGT 

^ G ^ GG ^ GGGGGGGNNACNCNnGCCCCAGGGACGGGGG CNCNTTCTAAGAAACCTAAGAC 
GNGAATC 

Sequence 659 

CACTACTT^^^ 

1 ' 1 ' 1 1 ' 11 ' 1 1 1 1 1 ' 1 1 n 1 1 1 AACAGGAGGAAAAAAAATTTAATAATATATGTTTGCAC 
AGGAGTTCCACGAAATATGAGACTCCAAGAAGGGTCAGATGATTGACACTCATACNCCAT 
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CGTGAGCTATCGAAAAGAACGGCAG1TTGGGAGTTCTGCAGGGA GTTG ACCACANAAGTG 

GGAGAGTGAAGGGAAGAAGTGTGTCGTGAATAAAGCTTGGCTGGTTTTCANATAAAAGGT 

CTTGCCGAGTGGCCAGGTGTGGTGGCTCACTCCTGTCACGTCCCANCACTTTGGGAGGCC 

AAGGCGGGCGGCTCATGAGGTCAGGAGTTTCGAGACCAGCCTGGCCAACATAGTGAAACC 

CCGTNTTTACTAAAAATGCAAAAAAT 

Sequence 660 

CNCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCG GGCA GGTACTGTCTGG 
GCTGGGGGGGACACTGTCCAAGGGAGTGGCCCCTATGAGTTTATATTTTAACCACTGCTT 
CAAATCTCGATTTCACTTTTTTTATTTATCCAGTTATATCTACATATCTGTCATCTAAAT 
AMTGGCTTTC AAAC AAAGCAACTGGGTCATTAAAACCAGCTC AAAGG GGCTTTAAAAAA 
AAAAAACCAGCCCATCCTTTGAGGCTGATTTTTC 1 MUM 'I AAGTTCTATTTTAAAAGC 
TATCAAACAGCGACATAGCCATACATCTGACTGCCTGACATGGACTCCTGCCCACTTGGG 

G 

Sequence 66 1 

GACTACTATAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGG illl HIM liiiui 

i n n 1 1 1 1 1 1 n m h n i m 1 1 1 1 1 u 1 1 1 ngccncgaggggggtttnattttcatt 

ATTCANACAAANAATTTTNTANAANATCCCGGGNCAAACCGGAAMTTTGGCAGNCCNAT 

NGGGGGGGG 

Sequence 662 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATTTCAATAATAAATTA 

AAAAAATGAGTAAAGTAAGTCTGAGAGAAGTTCATAGTCACTATATTACTCAATGACATT 

TTTCTGCCTTCATTTCTAAGAGTTCATCTGATCCAAGGTCTGATCATCTTCTTGACAGAA 

AAACTTGTTTTGACAGCACATTTGAAATGCCGAGTGTTTTGTTATGTCTAAGCAGCAGAG 

TGATGTTTGTTAATAACAGAAACACTTTAAAAAGAATCAATTGCAATCAAGAAGTGTTGA ^ 

TGAACATTTGGACAANGNTACTTGGCCCGGGGCCGGCCGCCCGG 

TTCATGTCTT7TTTTAATAAATAGGATTTCTAGGAGTCAGTATATATTTAATACTCTTCT 

TCCTTAAG AAATAAG AAGTTTAG GN CCAAGTGTT AAG CTTTATC CTTG ACACTG G CCTTA 

TCTCACATGGAGGGATTAAAAGGGCC 

Sequence 663 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGAAGAGAGAAATTCA 
AAC AAAATATTG CTGTTCTTC AGTTTTGTTTGTG GAATTTG AAATTAC TCAAATTTAAAA 
TAAATTACTGGACTGTGGAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
GACCNGCCCG 

Sequence 664 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAGTTTCATAT 

TACTCTAAATCCATTACAAATCTGCTTAGCTTCTAAATATTTCATCAATGAGGAAATCCC 

AGCCCTACAACTTCGGAACAGTGAAATATTAGTCCAGGGATCCAGTGAGAGACACAGAAG 

TGCTAGAAGCCAGTGCTCGTGAACTAAGGAGAAAAAGAACAGACAAGGGAACAGTCTGGA 

CATGGCATCAGAGATCCACATGACAGGCCCAATGTGCCTCATTGAGAACACTAATGGGCG 

ACTGATGGCG.WTCCAGAAGCTCTGGAAGATCCTTTCTGCCATTACACAGCCTATGGTGG 

GGGGGTGGOXAATTGTGGGCCTCTACCGCACAGGCAAATCCTACCTGATGAACAAGCTGG 

CTGGAAAGAAA 

Sequence 665 ^ 
TAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCCGCCCGGGCAGGGTACACNGTGGATGG 

GGCAGTCGGTCTTNTCTGGGCAGGTTGTTGTTTCTNTCGGTAGTGTCCGGCGAGATGACC 

TCCTGCTGCTGGGCCTGGATGGCCCCCAGGATTCCCCAGAGCAGGAGCACGGAGCAGGTG 

CCCTGGAGCATCTTGGCAGCACCTGTGGCCCTGAAGTCCTGGTCCCGAGGAGGCACCT 

Sequence 666 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGl rTTTTTTTTTTTTTTTl 

GGCAGTGAGGACTTTATTGGGGAGGGTGACTTTGGCTTCCAGAAAAGAGGGAAGGTGGGG 

ACCTAGTAACAATAACCATTTATTCCAAGGAGGCCCTGGCCCTGAACCCGGGGTTCCCAC 

AGGAATCAGGGAGGCACCTGAGTCCCCCAGGACCAGGGCATCCCAAGGCATCATGGCAGC 

TGCGTTGTTCAAAAGGAAGTTTCATTGAGCTTCATCTTGGGAGGTGTGAGGGGGAGTGCC 

GAGACCGCTGGAGGGCCACGGGGCTGGTGTGGGTGGGCCGTCGGAGGTCTGGCCCACTCC 

GCACCAGTTCCTGGCAGCGTTCCACAAAGCTGCCCCCACCACGGCGCCGGGCCTTNAGCC 

TGCGGGGGGCTTGGGCTCCCACGGTGGCCAGAAAGGGAAGGTGTTNCTGNAGGCTNAGTG 
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GAGGCTG 
Sequence 667 

P^°°f^^^CTGACGGTATTCAGCTGCGCGTAAGTCTGGCCGGTGCCATCTGTCT 

^^?f™ G ^ ATCG ^ GACA ^ CmCGGmGMGAA TMGAAAGGAGCAAAGC 
AAC 4 G ^ G J T J A 7P™ G ^^ 

AGGAGCTAAATGAGCTGTTCAMCCTTGTAGTTGCTGCTCAAAAAAT 
Sequence 668 

™p GGT i^ 

^Ar A r^ 
rA G A^f C T^™ GG ^^^ 

TTC^CTC TCTGATGGCTTATC 
Sequence 669 

rI A £™o G J ATmAC ^^^ 

CAAGG1 ^ GAAATAAAAm ^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^AAA^^^ 
Sequence 670 

£ A ^ G f£CGM^ 
A I A JI^ GAACTGGACCAG <3TGGGTATTC^ 

T TCCGC . TTAATGTTTAAATTCAGTAACCGTC I 1 nTTTTTTTTTTTTTrrTTTTTTTTT N 
Cl 1 ""■ I I I I I I I I I I I I I II I I I II I I 1 | | | | | | | | | | | | | | | | i 

Sequence 671 

GC ™ C J A I AGGGCGAATTGGAG CTCCCCGCGGTGGCGGCCGAGGTACTGGCCTGCACCAG 
^9^1 ^ AG ^ G ^^^^TAAAAATGTGTAGCACAGAACTGCACTAGGATTAATT 

TAACATGCATCTGMTTTTAAGTTGCAMGGTATCTGMTMTTmCATGTGCATCTTT 
TGTCGAATGTTTTGGTTCAAGAAAGAATGTTTAAAGCTTTTTAAAAGACTTCAGTTCTTA 
ATGTMCTGTACAAAATTAGTTGTAAAAMTMCATMTnrACCAGTAAACCCACTCATA 

^ TGTGCA ^ GG CTrrTGATATAAAAAGGTTTGTACCTGCCCGGGCGGCCGCTCTA 

Sequence 672 

AG ?I AC J GAAGCCAGCCACGCTG CGCCCGGCCCTGCCCCGGGCCTTCCTCGTGCCTGGGA 
^^ G J r 97 AGGGATGCTCCTGACCTCCGTCTC ^ GGA CCTAAGATGGMT^^ 
?SI~ A 5 GGATTGCCTGAACCAAGAGGCCAGGAGCCCG CATGGGC 
? GG i^ CCGCCCGGGCAGGTACAAAGTGCTGCAGTAGG CGGTGAGCA^ 

TGCAGGAAATCTAACCTGCCTATTCAGCCCATC 
Sequence 673 

C ^^ GAGG J ACM ^^ 

T^GGCAAATTTAACGGAAGTGAATGCTGATGTGAACATAGAAAAAATAATTACAAAAAC 

ATGAAAAATGGAAACTGACATAAAAGTTAAAATAAAGGGATATTTCTAACTTTTAGACTG 

ATGAAAGAAATGTCTGAGGCCATTAAATGCAAAAAAGAAATACATTCTCTTTAAATTATT 
GNGTACCT 

Seq uence 674 

AGGTTTTGC I I mil iGAAAAAGGTAAGTTGCTGATTAAGTCTAATTGGAATTGATAAT 
TCCATAGTCTTAGATTAAAATGAGGATATTTTCTCCTAGATTTTCTCATGTTATGCCATG 

CAAGATGGAAGCACTTTGAATmCTrrCATTGAGAATMCTGTTTTATGTAAGAATCTG 
TAmATAACACCAGATATTAAGATAGGCTTCCATT-rnTAATGCAAGCCACTTACTTAA 
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TCTTG N ATTCTTTTTC 
Sequence 675 

AGGTACACTTCTGTCTTCAAACTGTATCTTCTTTGGATGGAATTAAGATGTAACTGTATA 
GTTTTAAG ATAAATAAATG GGAAGTTGGTC C AACTAAGATG ACAGC AG ATATATTAC ATG 
CAGGATTTMTATTTrCTMTTCTCTCTmAAAAAAMGGATGCTGTTGGATTGGGA^ 
AAAAAAGTCAAAAAAGAACCCAGATTCAATATATAAAAATGTCCC ACAATA GGTCGACAA 
TGG C AGTAAAAATAAGAGG AAAAAAGTAATCTTTCTG G AACATC ATTTTTC AATAACATA 
GAAACACTAAGTGCCAGGAAGATTCCTAT 

Sequence 676 

CCCTTTCGAGCGGNCGNCCGGGCAGGNACTTTTTTNTT^ 

CATGACACTTTCACTGATCATTAGGATGGCAGGGNATCGNTAAGAACCAGGATTATAAAG 

ATGAACAGGCCTCAGGTCTGTNTNTAACANGCTTATAATCTATGGTAAGAGGCAAGGNAT 

TTAATCTCCTTTGCTTCTACGATNATGGCTCANATATGTTTGCTCTGCTCTAAGAAAAAT 

GCATCAGAAGCACCTTAGGTTTGGCAGGNAGGGGCTCTGATGTTAACAGGATTGGGTAGA 

GAAGCTCTTGATGCTTCTGGAA 

Sequence 677 

NCCCGGNNNNNNNNGGGGNNGGGGGNNMNNNNNNNNNNNGGGGGNNNNNNGGGGGNNNNN 

NNNNNNCCCNNNNNNNNNNNNNGGCCGAGGTACCATATCAACCAGAGCCAGNCCACCACC 

ATTGGAAATAAAGNCCCAGCCAGTNAAGGTG GCCA AATTATTCCCACCATTAGGAAACAT 

TTTTTCAGGCATTAATGGACCCAATGGAAAAATTTTCCTGGTTGGCC 

Sequence 678 

AGGTACTGCTTAAATGAACACAAGATGGTGTCAGGATTCCAAGTTTCTAGTTCTTTCTAG 

GGAGTTACCTAAGGTCTAAGTTTTATCCTGGACAGAATCTCTTGGGGAACTAAAATGCTA 

CCAATGAAAAGCTCTTGATACTCCTACATCATGAATATTATTGAGCTCTAAAACTTGAAC 

TGGTCGTGTCAGGAGAACCTTTAACTGCTCATGGCAGTTCTTGTGGACTGAGAGGAAGTA 

CCTGCCCG 

Sequence 679 

AGGTACTGAACATGTTCCAAACTCAGATAGTGATGATTGCACAACTCTGCGAATACACAA 

AAAAACATCCTCGGAGGGAGTCTGAAGGTATGTAAATTACATCTCAGTAAAGCTGAAAAA - 

CTGCTTTGGCTAAAGTGGCTATCCCTCCATGGTGCTGGGTCCTGCCCGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGA 

CCTCGAGGGGGGGGCCCGGTACCCCAACCTTTTTGGTTC 

Sequence 680 

TAGNAACTAGTGGGGATCCCCCCGGGGCCTGGCCAGGGAAATTTCCGGATTATTNCAAAA 
GGCCTTTTAATTCCGGAATACCCGGTCCGGAC CCTTC CGGAGGGGGGGGGGGGCCCCCGG 
GGTAACCCCCAAGCCTTTTTTTGGTTTCCCCCTTTTTMGTTGGAGGGGGTTTN 
Sequence 681 

AGGTACAAACTATTTCATTATCAGCAAAAATTAACATCGACTGGACCGAGAAATGACTCC 
TTTTTACAAACGGGGAAAGAAACGGAGAC TCCTGGAG G AAAGCTG CTTCTG CTG G AC ATG 
TTCTCCCGCGTACCTGCCCG 
Sequence 682 

CCGGGCAGGTACTACTCAGAGAATTTCATTGCTACTGCCTGTTCGTTAACCTAGCAAGTG 

AGTAAATTGAGGCTTTAATTAAAAAAAAACGGTTGTTGTTCTTTGATATACATT^ 

TAGCTCAAGCTTCAACCCTGCCCAGTTCCTATGCTCAGACCCCATCTTCAAGGATGCTCC 

TCCTGCCTAACTCCTATCTCAGAATATTGTTTTTfCCTTTCCCGTTTAATCCCCAGGCTC 

AGCTCCTGGGACCAATCCTGGTTCAGGCTCC7GTTGCCTGCCTCATCCCACAGCCGTCCA 

TMTTTGGTGGTTACGTCATGCACCTGAAGTGGTAGTGAGCATTTATAGTAATAAAAA 

Sequence 683 

AGGTACATTTTGAACAATCTACTCTTCAAGTGGATCATTTACAAAACAAGAACATAATTC 

CCTG AATCAGTAAATTTTATTTTCATG AAAGG ATG AC AMG AAATTC C ATTTAC ACTATC 

ACGATTTACATTTTCCTACAATTCATGCAGC'nTCTTCAATTCACAATCTACAAACCACA 

ACAATGCACAAATTTGATGTTAGAGGATAC^TTAATGTTTAGTTCTCATCTCAACAGTCA 

CTGTTGCTAGTGTCTCTCATTCTCACTCTTCACTGTTACTTAAAGAACAAAGAACTAAAG 

AAGCCAGAAAGAACTAGATCAAGCGATGAGACCAATTTTCAGGGTAAGGCATAATAA 

Sequence 684 

AGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCACTGAGC 



Page 94 of 635 



WO 01/51628 PCT/USO 1/00798 

Table 1 

AAAAGGTGAAGTTCAATTAGGTCAGTTTTATCACTTCTTTTCCTACACACAAACTATGAG 
GAGAGATCATTTCTTTTCTTTCTTTTATTTATTTA'I I I I I I I I GAGACAGGGTCCCACTC 
TGTCGCCCAGGTTAGAGTACCTGCCCG 
Sequence 685 

AGGTACCTCCTGCCTTGAGTGATTTAAAGCATTTTAAGATATGCTTTGCTGCCTCTAGAT 

ATrrCTCTGGTTAGATCCCCACAAAGATACTGTAACATTAAGTTCTGGGACCATTrGAGA 

ACAAACTAGCTGCTCACAGATTCCTGAAAGAAATGCAAAACCTTTTCTCTGGCCCCAAGT 
ACCTGCCCG 

Sequence 686 

AGGTAC TCTTTTTC AAAATG CC ATATTTTGAAATTGCTATTTG CTATTATATGATGCTGT 

GTCCTTGAGAAAGAGAATACATGGATCTGGCAACCAGGAGTGTGTGTAGGTAGAACTGGC 

TTCTC TCATCATAT ACCCCAGGGATCCAATGGGAMTGTTTTCTTCCTGTCCTTTTAACT 

TCAGGi mTTTl ICGGGGGGTGGGGGTAGCGGTAGGGGGTTAGTGAGGATCTTCTAGCA 

GAGAATACCTTACAAGGATACCACTAAATCTAAAGCTCCAACTACCTNCTGGNCACTTNG 

GGCTCCTNCTGACACTAGNCCAGCNGGCAAAAAGGNTACTGAACT 

Sequence 687 

AACCCACCGCGGNGGCGGCCGCCCGGGCAGGNACTNCTTGCNNGGGATTAAAAACCCGAC 

AGCAGCAAACNGCAGAANCNNCAGACNGCAGGNTTGCNGANGGNGAGAGGGAACNCCCGC 

CCNAAACCCACNGCCACNGAACCNGGCNIGGGANACCAGNGGCCCNGGNGGANGCACCANA 

GATGTTTGAGCCNGGNTNNCTGGCCAGGGGGNGGCCGCCACCNNANNN 

Sequence 688 

CCGGGCAGGTACGCGGGGGGATTCAGAGTGAGAAGGCATAAAGGAGAATCCCCAGCTGAC 

TTGTGCAGTGGTTAATTGAAATTATTCAGGCAAGAGATGATGGTGTCTTGGACCAGGGGA 

TGAGGMGGCTACAAAATGTGTCTACCTGTATTCTGTGAGGAGAACGTGTTCCCTGGTTT 

TAGATACTGTGAAGATGGATCAGGAGAGAGTTTATCTAGACTGTTGGGGAAAGGTGTTGC 

GATTCCTTCAGCTACACAGGATTGAAAGGAGACATTTCTGMGGGGAAAAAGGAAATGAA 

AAANNNNANANNNNNNNNNNNNNNGGTACCTCGGCCCGCTCTAGAACTAGTGGATC 
Sequence 689 

AGGTACTGCAAACT AATA TTAGTAATAATTTGTTGATATCAGCTTGTGAGCTTTAAAAAG 
CC TGGT CAAACAAGTTTTATGTATTATGTTW^ 

AATT TTGAGTMnATTAGGATTTGGCTGGGACAGGGGATGGATGAGCTTTCTGTTGTAA 
TTTTGCTCTTAGTGAGTAGTAG GTAC CTG CCC G 
Sequence 690 

AG GTAC CACTTGTG ATGTGTTTAATAAATTCTGTGTGGTTTTTACATAGCCCTG AG AGG T 
AAGCAATG AGTC AGGAGTTTCCTCCAAG C ACCTGTTAAGTGGC ATC ATATAAATAG C AAT 
GTCAGGAAATCAGGTACCTGCCCG 
Sequence 691 

CCGGGCAGGTACTGGAATGAAAATACAGGTAAAGCATTTGAAAATTATTTTTCTGGCTTA 
GACTGTTTACTGGCTCGATTCTAGCTCTGTCCTGGTAAAGGCCTCCAGAACCTTCACCTA 
CTATTATCAAGTCTGGCCTCATAGGGAGGGACTTTTACTGTACCTCGGC 
Sequence 692 

GAATTTCCATCGGTTTGCTGAGMGCACAACTTTGCAAAACCCAATGACAGCCGTGCTCT 
CCAGCTGATGACCAAATGTGCGCAGACTGTGATG G AAGAACTAGAG G ATATTGTGATC GC 
GTATGGACAGAGTGATGAGTACCTGCCCG 

Sequence 693 ; 

ATTGGAGCCTCCACTCGCGGTGGCCGGCCGCCCGGGCAGGTACCATAAGAACATTAAAAG 

CTGAAATTCAGGTTGGTCTGCAGCCTGGGCAAGGGGAACAGCTCCTGGCCTCTGAGCCCA 

G CTGGTGGACCACTGG AAAATATGG AATATTTC AGTACCTnn nnnnnnnnnnnnnnnnnn 

nnnnnr.nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnn nnnnn nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnT 

C ATGGTC ATAG CTGTTTTC CTGTGTG AAATTG GTTATC C C G CTC AC AATTCTC AC AC AAC 

ATACGAGCCGNGAGCA 

Sequence 694 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACAACAAAAG 

TTCATCCTAGTAATTCTTAAAGCTCTTCAATTCCTATAGAGTTTAGCCTTTGTCAATAGC 

CAAAATATGTGCTTGAAAAATMCTTCTTTGAGTTTCAAAGCAAATGAAAACATAAAACA 
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AGCAAAAAAGGC1 1111 I'GTTGGTGNTTTTCTCTGCATATCTAGGGTT TGTTT CTTCATT 
CATAAATACGGTTTTCAAAAAGCATTGCCTCAGCCAAMTTATTGCCCTTTTTAAAAATG 
CTTTTCATGTATACACTTTCTACATAACTGCTTTTCTTTACAC 
Sequence 695 

AGGTACATGAAGGCCCCCAGTTCCCCCATGCTAGACACGTCCCCAGAAGCAGCACCTAAT 

GGGCMCACTGCGGMTCATTTTCCACCCAGATCAGGGGCATCCCACGGACACTTATTCC 

AGAAAACTGAAGCTGGGCCACAAAGAAGGCTCCCATCCTTGCTGCTATTTGCCCTGGACC 

ACTTCAAAATGTGACACATCGGGCTGCAGTGAGCTGAGATCGTGCCACTGTACCTGCCCG 

GGCGGCCG 

Sequence 696 

CCGCGGTGGCGGCCGAGGTACTGAACATGTTCTAAACTCAGATAGTGATGACTGCACAAC 
TCTG CG AATACAC AAAAAAAC ATCCTC G G AGGG AG TCTGAAGGTATGTAAATTAC ATCTC 
AGTAAAGCTGAAAAACTGCTTTGGCTAAAGTGGCTATCCCTCCATGGTGCTGGGACCTGC 
CCG 

Sequence 697 

CCGGGCAGGTACAGTGAACACGCCTGTAAAACTATCACTCATATGAAAGAGAGGACAATA 
TCGGCGCAGAATAC ATACTAGCAGATACTG CT7TTTTACTGTGATTTGGAAAC TTTTGGT 
ATTCAAGTAACCACTAAGGAATATAACTCTAACATGGACAAGGCTTAGATGCATCATGAA 
AGTCTGTCATGGGAAAGGTCTGTGGACAGAGGATGTACCT 
Sequence 698 

AGGTACAAGGCAGCAGAAGCAACAAAGGCTGTGGCCAGCAACGAAGGCAGAGTCCCCGGT 

AGGCGGAGGTCCCCGTGCACAGTGTGGTGCAGCAGTGAACCCAAGATTTTCCTTCACAGG 

GCAGGGGAACTATTTCCTAATAAG CATTGGCATTCAGG CC AC AAG G GCAGC C ATTACTC A 

GCACTGCACTGACCTTGGGAATTTGCTGGGCAAGGAAAATGACGTGGCCCTCATCATCGA 

TGGCCACACCCTGAAGTACCTGCCCG 

Sequence 699 

CCGGGCAGGTACTCTAACCTGGGCGACAGAGTGGGACCCTGTCTCAAAAAAAATAAATAA 

ATAAAAGAAAGAAAAGAAATGATCTCTCCTCATAGTTTGTGTGTAGGAAAAGAAGTGATA 

AAACTGACCTAATTGAACTTCACCTTTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGC 

ATACATTGGCTCAGTGCCAGCAGTACCT 

Sequence 700 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CCTCTGAAAA GCCTAGTG 

CACTCCAGAAGGCAGGTGGAAGGTGTTGTTTGAAGTGAACCCTATTTTATTTTTCTCAAG 

AAATTCTTGAGCCGATTCTGTGACATGAACATCTAAACAGAGTAACCAGATTCCAACTGC 

ATGACTCGAATAGGTTTACACATTTTTCTTCTGTCCCGCCTTTCAAACAAAATG GTAA AT 

AACTTCATTTCCAGGCCAGTTTAGCATTCCCA 1 (til i ACCCTG ATGAGTCCCATTTTAA 

TGAGCAATTTGCTTTATCCAGTAGTGCATTAGTGMGTGTTTCCTGATTTTATGCGGCAA 

GTGTTTAT 

Sequence 701 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCTTCTAAAAATGAGTGAACATGGTGAC 

TTACCAAAGTGTGATTTGAGTTTCCTGGACCCTCTGCATGAAATGTGAACATGAGCTATG 

CTGAGATCACAAGTTAAATTTACTTTTGAAACCAGG TATA CAGTTGATGGATGTCAA/\TG 

ATAAACGTATGCATAGTGATGAAACGGTCTTTTMGTTTTAGGTGTGC ATGTTTG CT TTT 

TTCCTGAACCTGATTCAAATAGTTGTAATTTGTACCTGCCCG 

Sequence 702 

AGGTACCTGCAATGAAGGAGAGGAGAGGACAGGGAGCCAGATTTCCAAGGACGA3AGGAT 
GGATGCGGAGCAGGTTTTCTGTAGCATGTGGAGGAGGGTAGGAGGTGGCATGTGATGACC 
TCGTGCTCCAAGAATAGTGCTCAGTACCTGCCCGGGCGGCCG 
Sequence 703 

CCGGGCAGGTACATCCTACTCCAAACAGCCTAACAAAGAGTCAATCAAAGGAG'iTCTTTG 
ACCTGCTACTTGATTTGAGAGTTCCTAAGCAGTTTTCGTGTGTGGTTCTAGAATACCCAG 
AATGCTGATTTTGGTAGGAGGCTGTGGATATGTACCT 
Sequence 704 

GCGGTGGCAGGCGAGGTACGTGACAAATGTCATCTAATAACCCTGGGAGGGCTGGGTCCA 

CCAGGCTGTCCCAGTATCCACCACCCTCTTCCTATCCTCAGGGTGCAAAGGCCAATTCAC 

TCCCTCTCCCGTTTCCTTTCATTCTTTCAGTGTTGCTGTATTATGTAGTACCTGCCCG 
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Sequence 705 

GCTNATTGGAGCCTCCACCGCGGTGGCGGCCGAGGTACGATGCAAACCAGCAAGCCAGGA 

AGCACCTCCTTCCCGTCACCACAGCAGGCCTGAGCAGCAAGGAGCGTCACCGGAGCCCAC 

TGGAGAAGCCCCACAACGGNCTNCTCTTCCCCCAGCACCGGGGACTATCAGTACCTGCCC 
G 

Sequence 706 

CCGGGCAGGTACCCTCCAGGCCCTGTGTATTATTAAGATCTTTTAGTAGCGAGTTGCTCT 
TTCTCTGGGAAATCGGCTGTTAAAGTCAGAGGGAGCTCTTAATAGTTTGCATGGTATTTG 
ATTAAATGG AAC AGTTG G ATC AGTACCT 
Sequence 707 

ATTGGAGCCTCCACCGCGGTGGCGGCCGCCCGGGCAGGACGCGCTCTCCTTCGAAGTCCG 

GAGGAGTTTCCTGGATTTGGCACTCTCGTGCAAAGCGGTCATATGCTGCAGAGTGTCTCC 

TCTGCAGAAGTCTGAGATAGTGGATGTGGTGAAGAAGCNGGGTGAAGGCCATCACCCTCG 

CCATCGGGAGACGGNGCCAACGATT 

Sequence 708 

AGGTACATCTCTAGC TGATG ATTCAAAAAAGAAACCTTTTAATCTCACTCCACTGATCAG 

CTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATTCTCAAAGAGAGTATC 

TTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAATTAGTCTCGGGCTTCT 

GmCTGCCCACAGTmCAAATTAAGMCMTGTGmGCACTTAATGAAACAACCTCT 

ACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGCGGTCCCCTGTAAG 

TACCTGCCCG 

Sequence 709 

AGGTACAGCATCGCTGGTGGTTTCAAAAAACGTAGTCATTCCTCTCACTGCAACAATGTA 
AGATMGCAGAGTAGATCTGTTATTTCCAAATTAAAGGTGATTAAGATATATGGAGAGAG 
AACATGGCATGTGAGGTTTATAGGGCTAGAAACTGCAGAACC ATGTAGAACCCACATTTA 
ACTACAGTACCTGCCCGGGCGGCCGCTCGA 
Sequence 710 

AGGTACATTTGAGATGGTCTCACGTGAGACATCAATACGGCTTGCTGGGGGGCACAGGTT 

TAGGGCAGATGAAACTCACAGGAGGGCGGGTCTGGGTTAACTGAGCTAAAGAGCTTTTCA 

AGCCACTAGAGCAACANAGCTGCCCACAGTTGAGTCAGATTAACCTGGGAAGCCTCCAAG 

TGAATTGGNTACCAGCACCACATTCACAGATCTCAAMTTTATTGAAACTGATTGAGAGG 

NTGGATTTTGATAACTAAGA 

Sequence 711 

ACCCCCTATAGGGCGAATNNTGGNAAAACCGCCCCGGCGGCCGAGGNACCCAAACCTNTN 

CATGGNCAAGAGAAMCCNTTTGTGAGGGAAANNTNTTAAAANNANGACATGAAAATGGA 

GACGGANATTAAGAGAAACAAAAAGACTCNNCNAGACCAGCATGGACAGNACCTGCCCGG 

GCGGCGT 

Sequence 712 

GGGCGAATTGGAGCTCTTCGCGGTGGCGGCCCGAGGACTCAGGATCTGCAGAGGCNAGGC 

TGCTTGGCTCCCATCACNGGAGGTTTGCTTNNCAACCCAGAAGGAGCTTTCAAACAAGCC 

TAGATATTCCTGCCTGGCCTGCTGCCATTCTCCAAGAAGGCAAGTAGGCACAATAAATTA 

GACTGGACGGTAACAGTTCACATGTTGCCCTTTMGMGTGTTGATATTTTATGGCAAGA 

GGTTTGTTTGTACCTGCCC 

Sequence 713 

CGAGGTACTCAGGATCTGCAGAGGCCAGGCTGCTTGGCTCCCATCACAGGAGGTTTGCTA 

GGCAACCCAGAAGGAGCTTTCAAACAAGCCTAGATATTCCTGCCTGGCCTGCTGCCATTC 

TCCAAGAAGGCAAGTAGGCACAATAAATAAGAACTGGACGGTAACAGTTCACATGTTGCC 

CTTTAAGAAGTGTTGATATTTTATGGCAAGAGGTTTGTTTGTACCTGCCCG 

Sequence 714 

CCGMTTGGAGCTTTTCGCGNTGGCGNCCGCCGGCAGGTACTGTATGTATGTTGGTAAAA 

TGTTGTGAAATTTAATGAGGACTATTTNTTGCCATAAGACCTTCCGGACAGAATCTATCT 

CTGAAAAAGAATTCCCACTCCTAAAATAGAGGTCTGTGAGCAACAGATGTGGTAAATGAC 

CCTTGTTCAACTGAACCAAGAATATATTGTGATCTGTATGAGGCACAATTCAGAAGAGAG 

GGGAGGTGGCTACAGCACAGCCTCAGTGTTGAGTTTGGTTGTGGGGTTTAAACATATTTA 

TTTATTTCATAAAGTTTCTCTCATAAGGCACAATGCAAATGAGGTTTTAATTAATC/^ 

TAATGTTCAGAGGATGGTCTTCATGCTGTTTATAAAACAATTTACATTTTTATTTTGTAG 



Page 97 of 635 



WO 01/51628 



Table 1 



PCT/US01/00798 



TCTAAAGAACGTGTGTCGAAAAGCATGTCAGTGAGTAGCCATAACCTTCAGAGGGGATCT 
ATGCCTGGCTCTAAAACTAGNGGATCCCCCCGGGCTGNGANGAATTCGATATCAAAGCTT 
ATCGATCCCNCGCCTCGGNGGGGGGGNCCGGGNCCCANCTTTTGGTCCCTTTAAGG 
Sequence 715 

CGCCCGGGCAGGTACATACCCACATACATACCCACACATAAACATATAMTMTATTTTT 

TAAAGAACCAAGGACAATAAA7TTTCCTGATGAGCCAACATTTTAAGGAAAAAAATAAGA 

AAATTGTATCTTTGTGACCTGACATAAATACCCAAGGATAACTGCCATCCAGCCTTCTCA 

CGGTGTGACTGAGCACAGTGGGTCCTCCCTTAGCTGAATTCTAAGAGTGAAGAATAGTTT 

ATTGTTTGAAGCAGTGAGGAGAGCACTAGGGATATATTTCATAATACATGAAGAAAGGAG 

GGAAG AC TAATGCAAG TTTG TACCT 

Sequence 716 

AGGCGAATGGACTCCCGCGGTGGCGGCCGCCGGCAGGTACATACCCACATACATACCCAC 
ACATAMCATATAAATAATATTTTTTAAAGMCCAAG 

CAACATTTTAAGGAAAAAAATAAGAAAATTGTATCTTTGTGACCTGACATAAATCCCAAG 

ATAACTGCCATCCAGCCTTCTCACGGTGTGACTGAGCACAGTGGGTCCTCCCTTAGCTGA 

ATTCTAAGAGTGAAGAATNTTTTATTGTTTTGAAGCANGTGAGGAGAGCACTAGGGATAT 

ATTTCATAATACATGAAGAAAGGAGGG 

Sequence 717 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTMTTTMTGTrTAATAATATTAAATAT 

CTTTAATATTAACGAAGTGATCTTTTAAGAGATATTGGTATTGCAAATGTTTACAAATAG 

TTTCTTAGTTGAATCAAGATTGGTATGAGTTGTAATTGTGGAGCTGGGTGATAGATCATG 

ATAATTCATCATATTGCTCAGTTTATATTTGAAATTGTATATTAAACTTGAAAAGACACT 

TTAATTTGAATTACTATTTTATCTTTTAAATGTGGTATTTCTCCTT 

AAACACTATAATTTTATCACAGTACCT 

Sequence 718 

CCGGGCAGGTACAGAGACTCCATCTCAAAAAAAAAAGAAATGAAAGATAAAAAGATTATC 

ATTGATCCCATTTTACAGTTGAGAAAACAAAGACAGTTCAGTTCCTCACTGAAAGTCACA 

CAGATTGTNAGCAGTCACACTGGGAACATAAGCTGGGCACTCCTAGTTCATGATCCTAAC 

CACTGTGCATTAATGCGGCACAGACATGAAATACTGATGAGACTATTCTTGAAAAAAAAT 

CACTCAAATAATCAAGTGTGTGTTAATTGGATTTGAC I I 111 t CAGTACTCCGTCATACA 

GCAAAGGTAAATCAATGGGTTGTT 

Sequence 719 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTTGGT7TCTCAAACAACTCACTGATTTAT 

GGTCTTGAGACCATAAACTCATTTTCCTTATATGAATGACATTTCCACATCCACAACANT 

ACCACCAAATATATGTATCTAGTTCTTACTAACTGCAAATCCTCAAAGTGAACTGCGTGC 

ATTTTAATGTTGCGTAGTTTGCTGATTTATGATTACCCTTAATGTACCT 

Sequence 720 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTTGGTTTCTCAGAC 

AACTCACTGATTTATGGTCTTGAGACCATAAACTCATTTTCCTTATATGAATGACATTTC 

CACATCCACAACAATACCACCAAATATATGTATCTAGTTCTTACTAACTGCAAATCCTCA 

AAGTGAACTGCGTGCATTTTAATGTTGCGTAGTTTGCTGATTTATGATTTCCCTTAATGT 

ACCT 

Sequence 721 

C TACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTCTGGATTCA 

Ci i i iGATAATTATGTCATGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnTCGGCAATCAGTGGTAGTGGCGGTGGCACACnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnGTTTTAAAATTAGNGGTATCCCCCGGGNNTGNAGGGAATTTNNATA 

TCAATNTTTATCCACCCCCCCCCCCCCCGGGG 

Sequence 722 

CCGCGGTGGCGGCCGAGGTACCCTTGGTTTCTCAGACAACTCACTGATTTATGGTCTTGA 
GACCATAAACTCATTTTCCTTATATGMTGACATTTCCACATCCACAACAATACCACCAA 
ATATATGTATCTAGTTCTTACTAACTGCAAATCCTCAAAGTGAACTGCGTGCATTTTAAT 
GTNGCGTAGTTTACTG ATTTATG ATTTCCCTTAATGTA 
Sequence 723 

TGTAGCTCNCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACTCTTGTTTAACCATCAGAGG 
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f^^^TCCTGCTTGOT 

™ r CCATCmA ®^ CCG1 ^ GTT CTTT<3GTACCTCGGCnnn n nnnnnnnnnnnnn 

ANGGGGGGGG 
Sequence 724 

^I G I GA J G ^^^^ 

SSSI^ 1 C ^ CAATCCCCAGTG GTGGTGGTTAGAGCTTCAATCTCCAGTGTGATGGTA 

™ GGGG TT^ 

TGAAAGAAGACACAAGAGGG 
Sequence 725 

AGGTAC1 nrrTTTTTrrn icacgtggtcccagcttgagtttactgagcctcctccagg 

CTCAGATA-T^TGCTTGCCTGGAGAMCTGNTTCCAMGGCCAAGCAGCCCTTGCCTTrGG 
!^ A i^l GCT ^^ 

CTNTTTTAG AGATTCATAAGGTACCTG CCCG 
Sequence 726 

A ??J£ CCCCGCGGTGGCGGCC C^^ 

CAAAGCTCAGGGATAAGTCTAAACAGAMGGCATTTATACAGCAACAAGAAAGTTACTGG 

GGGCTGGGGATGAGGGAGGGCTGGGGTAGGAGGATACTAAAATATTTTCTGAGGTGCCCA 

A $I G !^ GTC ™ G ^^ 

CTCCTCTACAGCTTTACAATGTTCTCTAGCAGAAGCAAAAACAGGGTCACTGCCATCATA 
GATAAAAGGATG 

Sequence 727 

AGGTACACGTTITATATCAATATTTTATCAAGCAAAMGTTGAAGCCAACACCCAAAGCT 
GCCAGATATGGAGMCCCAGTGACMTmGMTTMCCTGAGTGACCCTCTTTCTTTCA 

CCTTTTTAAAAAAMTTCTTTCTTTTCTGTGTTCTAAGCAACTGGAAACT 

CTGGCCTATCATMGGGAGAAGATGGGAGTCTTTTGCTGTCCATCTTGAATGTAATTCAC 

ATTGTCCATGAGTGCTGATCTAACCCCATATGCCTCTGCCCGACTGCCTTTATGAAATAT 

GGTAATTTAATGCTTAAACAACTGTTTCATTTGTGTACCTGCCCGGGC 

Sequence 728 

CGAGGTACCAGATCAAAACCTGGGMCTTCGTATTTGTCCTTTTCTCTCTGCCAGGAATA 

TCGTCCTCTCCATTTGCCCWGAGCCCCACCTNCTCAACATTTTTACCTNTGTGGAAAT 

CCCTTCCTTCCAGAAGCCTCCACTCGCTTTNTGCCAAGGAGCTTCTGCGGCGCCCTGCAC 

GCACCTTTACAGATGCAGGTGGCTGTTTCCTGTGTCAGACTGCAAGCTCCCCCCGCGTAC 
CTGCCCGGGCGGC 

Sequence 729 

AGGTACTCCACAAGCTTGCCTGCCATGGGCTGTCGGGATGTCCACGCAGCCACAGTCCTT 

TCCTTCCTGTGTGGAATCGCCTCAGTAGCAGGCCTCTTTGCAGGGACCCTGCTTCCCAAC 

TGGAGAAAATTACGACTGACCATTCMCAGAAACGAGAAGAACCTGACTGTTTACACA^ 

CCTGTGGGTGAAATGTGCCCGGTATGACGGGAGCAGTGACTGCCTGATGTACCTGCCCG 
Sequence 730 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATCCAACTGTTCC 

ATTTAATCAAATACCATGCAAACTATTAAGAGCTCCCTCTGACTTTAACAGCCGATTTCC 

CAGAGAAAGGGCAACTCGCTACTAAAAGATCTTAATAATACACAGGGCCTGGAGAGTACC 
TGCCCG 

Sequence 731 

ATTGGAGCTCCCCGCGGTGGCGGCCGAMCTTTATAATCTTTTAACTAAATGTAATTGTC 
ACCATAATCTTATAGACAAAGCATTGAGGTTTATTGAGCTAATGCTGAAGGTAGTAAGTG 
G AGG AGCC AGG ATG AGGTCAG AATCTG AG ATTTTAACCATG CCTATGCTG TC ACTTCTTA 
CACTTTAGAATACCTCCATGCTCATGTGGACACCTAGGAACAAATGAATATTTCTATTCT 
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TCTCCCAGAATTTTAAAACATTAAACATGTTAMCTGTATTTTTG TTTAC CAT 
TCCNAGGAGGAACMGCACTAMCACAGTCTCTGGCTTAAG GATT TT GGATG AACATATT 
TCAAAAG CCATCTGCTTC N CAGCAATTCATAATCCATACC CCTTTTCCTTTTTG GCCACT 
TATTCACCAAGAATCTNCANTAGTTCCCTCGGGCCGCTNTANAAACTNANTGGGATCCCC 
CGG 

Sequence 732 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGA 

TCCCTCTTGTTGAGTAGCTGCAGAGACGGCAGATGGAAAATATGCTCAGAAGTTGTTTAA 

TGACCTTTTTGAAGATTATTCCAATGCTCTTCGTCCAGTGGAAGATACAGATAAAGTCCT 

GAATGTGACCCTGCAGATTACGCTCTCTCAGATTAAGGATATGGATGAAAGAAACCAAAT 

TCTGACTGCTTATTTGTGGATCCGCCAAATCTGGCACGATGCCTATCTCACGTGGGACCG 

AGATCAGTACCT 

Sequence 733 

AGGTACCCAAAGAAGTAAACTTAGCAACCTCATAATTCTAGGCACCTCTCTCAGGTAATC 

TCAGCTGGCTTCTCTAACACCTCTGGTTCCTCCCCCTACCAGCATGAGGACATGAACAGC 

CATGGATGGTGTTCTGCAAATGTTATCTCAGATAGTCTCTGGACAGGACCTGCGAGCAGA 

CTTGCTCAGGGTCTAGAAAGTCTTCCCTGGACAAATTCCAGATCTCTAGCTAGCTAAGAC 

AACTGCAGCCATTTTCCTGTACCTGCCCG 

Sequence 734 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATACA 

ACGATGCTATAAATATTTGTATGCATAATTCAACAGTAATCAGTGGTGTTTATCTAAACT 

AACTGATAATCTACAGATTGCAGTGCATTTATGATTTCAATGGAATTAATCTAATTCTCC 

ACACTTAATTGTGAGAATAGCTATAAACAGATTGTCAAGAGGAGCC7TTTAGTGCCAATG 

CTTTACTTGAGGAAAAAAATTTCTTTTGGGCAAAC^ 

ACGATTCTCAAAAGTAGCTTAACAACCCCAACTCCGCTGGGTAAGTGTGGCGGCACACGC 
CTGTAATCCCAGCTACTTGGGAGGCTGAAGAGGGAAGGCTGCTTGGGGCCAGGGAGTTTG 
AAACCAGCCTAGACAACATAACAAGAGTCTGTCGCGAAAAA 
Sequence 735 

CTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTAACCTG 

GGCGACACGAGTGGGACCCTGTCTCAAAAAAAATAAATAAATAAAAGAAAGAAAAGAAAT 

GATCTCTCCTCATAGTTTGTGTGTAGGAAAAGAAGTGATAAAACTGACCTAATTGAACTT 

CACCTTTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGCATACATTGGCTCAGTGCCAG 

TAGTACCT 

Sequence 736 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACCATCCTGTCATAAT 
TC I 1 I 1 1 n I GGCCAGGGGGAGACAAGGGTCTCACTCTCTTGCCCAGCATAAAGTCCTTT 
TTAAAACTGTAAAATAGTTTATACATTTGAGCATTATTAT TATAA GCTTTTGTTTCTTAC 
CTCAGAAGMTATATTTTCAAATGATAGACTTCTGGGACTTTTGGTACCT 
Sequence 737 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATNNNTCAAACATACNAC 

NTAATGNGGGCATGANCACATACTTTGCTGATANCNGACNAATGACTGAAGATGANTGGT 

GNAACCANGGCATNAAAATGMTCTATTCAATGACATGACAACAGNTCTCATTTCAAAAA 

TATTGCCNTACCTTTCCATAGCCCCCTTTACCAAGTNCCTGCCNGnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTG 

GCGTAATCATGGTCATTAAGNCTGTTTCCTGTGGTGAAAATTGTTATCCCGCTCACAATT 

TCCACACAACATACGGANCCGGGGAGCATTAAAGTGTAAAGCCTGGGGGT 

Sequence 738 

CCGCGGTGGCGGCCCGAGGGTACCACCCATGTGGAGGAGACTGCAAGGAAGCTGTTATTC 

AMGTAGAACAGTCAGCCTTGTGCTTGAGTCCTGCTTTATGCTTGCGTGTTTCATAACAA 

AACACAAAGGCAAGTCTTCATATCAGCACTTAGTCTTGATTCAAGTAGCTGACTACTGTA 

CCTGCCCG 

Sequence 739 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTTGGAAAGAATT 
GACCCAGCTGAATTGGAAAATGTGGGAAGGGGATGGGGAAGAGGCTGCTCCACCTGAGAT 
CTGGCTCCAGGACTTACAGCAAGGGGAACTTGGGCAAGTTACAGACTGCTATGCCTCAGT 
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1TCTTTTAGCAAAACAGAATCATCCCATAAACTATAAGGTCGATGGTATCAGCGGGTCCC 

C^CTGACTGCACATCTGAGTCATGTTAACAAACACATTCCAGGCCCCACCTGAGCCCT 

CT^TCAGAATCCCTGTMGGAGGACGATGAACTTGAATTTGCACTGACTTTCCCAGCT 

?JI rC ™ CTCTGATC ^ C ^ GGGGATAGGGAC CCATTTGAGCTGCATCACATCA^^ 

MGCC^^CACMCAGCAGGACMGMTATTTTCAAGGCAGTCTCTAAAGCAGAGGAGA 

AACTGTTGAGGGAACCTAGAAAGTAAMGGAGATCTGGCTTGCTGGGCTCCATTTGAAC 
Sequence 740 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTAC I rTTTTTTTTTTTTTTl 
TT^NGGGACCAAGTTTGGCmCNTGAGAAGTGCAAGGTTGGTTCAAAATACCTACTG 

GATCACTCTGGGGTT^^GAAGGGGAGGACTGGGAACAGAGAAGGTGCATTTCAATGAA 
AACAAAGACTTGGG1 Nil I AAATTTGCCTTTATTTTTATTTT7TTATTTTAGAGACAGG 
GTCTTATTAGGATGCCCAGGCTGGGGCTTTCTTTTCTTGCAAGTC AI II 11 1 1 1 I CAAAC 
AATTCTATATGATAATCAACAGGGATGAACTAATGAANGGCNTNTTAAAAGAGGNGGTCC 
AGGTAAAAATGATTCAAGNGAAATNCCTAAATCTGGGGNGNCATGGTTTACCGGGGACCA 
GAAAMCAAAGGGGGGGTCCTTTTACTGNTGGTAAAAAGTTANTCTNTACCTGGTGGATA 

AAAATTTAAAANANCAAAGCCCCCAAAAAGGGGAAATCCCTACCAATGCCAATNCCNTTG 
NCTTTTN AGGGAATNNGNCTTNAAC 1 I 1 1 11 11 11 
Sequence 741 

CCCGCCGGCAGGTACGCCCTAGCTCCAGCTTCCNTTGGGAGACTGTGCATCTCCTGGCTN 

NACTAACATNACCTTCTTCTGACCTTCCAGCCTAGAGATGATGACTCTGCCAGCCTAGAT 

GGGCTCTGGGTTGCTCCTATTCCTGTTTGTTTGAGATTTCCCATTATGCTGTCACCAACT 

CCCCAGCCTAAGCCCTCTCTATTTTAMTTCTCAAGTGGATTATGTTCCTGATTAGTCCC 

TGACTGATATACCACTCTCCTCATGATCTCTGATTAGTTTTCCTGTTAGGTTGTTGCAGT 

AAAAAAAAAAAAAAAAAAAAANNNGGTCCCTCGGCCGCTCTAGAACTAAGTGGATCCCCC 
NGGG 

Sequence 742 

AATANCCCGAATTGGAGCTTTNCGCGGTGGCGGCCGAGGTACCTTCATGCTCTAAATCAT 

NATGATCTACTATCTGAAAAGGAAACGACAAATTTTTNAACAAAAGAATTTCACAACAGA 

TAGGCAGTTGATAGCATGAGGCACTAACATTAAACCCAAGTCTTTCAATGGCACTTGGAG 
TCCCAGGGCCTGCCCG 

Sequence 743 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTGTATCTCCC 

ACTAGGATGTCAACTCCGTGAAAGTAGGAACTTACTGGTCTTGTTCATGGCCCTATTCCC 

AGCTCCTAGAATACAGCCTGGCATGTTGTAGGTGCTCAATAAAATCCTTGTTGAATGACT 

GACTGAATGAAACCTGGCCAAGGTCTAAGCATCACTTCAATGGGATTCACCTGGAAATTG 

GCACCCCCCTCCCCCGCGTACCT 

Sequence 744 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTATCTCCCACTAGGATGTCAACTCCGTGAA 
AGTAGGAACTTACTGGTCTTGTTCATGGCCCTATTCCCAGCTCCTAGAATACAGCCTGGC 
ATGTTGTAGGTGCTCAATAAAATCCTTGTTGAATGACTGCTGAATGGAAAACCTGGCCAA 
GGTCT/yVGCATCACTTCAATGGATTCACCTGGAAATTGGCACCCCCCTCCCCCGCGTACC 

Sequence 745 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACCTGATTACCTGAAA 

CAGCAGCNNAGNCATGCAGAGGAGGATCAGCAGCAAGAAGGGGTGAGCCATGAGTTCCTG 

AAGCCAGGAGGGCTCCATTCTTCTCAGAGGTCCTGAGCTCTGGAAAGCCTGAGAGAACTC 
CCCGCGTCCT 

Sequence 746 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCTCCCACACTCTTCCTGACAA 
AGCAGNTGAGGGACTTTCTCTGCCAAAATGAGAGAGTAAACTTAAAAAGAAGACAAGGTG 

GGATATTGCAAACAAGAGATACAACACAGGAGGGAGCCGAAGAGAATCCCTTGATTTGAG 

GTGAAGCAGCTGTACCTGCCCG 

Sequence 747 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTAGGCTCTCCAGGGAATGC 
CATCAGTAAAGGCAGCCTGTTCTGATGTCATGACTATAGGGAATGTGATGCCTTATTTGA 
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ATTAGGCGCTGTTTGTTCCACTCTTTATCTTTTTTCCTCCAGMCTGAGGCTAGTTGATC 
'TTCCCTTGAAAGTTCAGTCCCCCTGAGTAGCTAAGCCCAATCCTGTCTGTACCTGCCCG 

Sequence 748 

NATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAACCTGGGCGACAGAGTGGGACCC 
TGTCTCAAAAAAAATAAATAAATAAAAGAAAGAAAAGAAATGATCT CTCC TCATAGTTTG 
TGTGTAGGAAAAGMGTGATAAMCTGACCTAATTGMCTTCACCTTTTGCTCAGTGACA 
GATAAAGCTTTCCTTGATAGCATACATTGGCTCAGTGCCAGTAGTACCTGCCCG 

Sequence 749 

ANGGCGAATTGGAGCTTNCCGCGGTGGCGGCNCGCCCGGGCAGGTACAGTAGTCAGCTAC 

TTGGATATCAAGACTAAGTGCTGATATGMTACTTGCCTTTGTGTTTTGTTATGAAACAC 

GCAAGCATAAAGCAGGACTCAAGCACAAGGCTGACTGTTCTACTTTGAATAACAGTTCCT 

TGCAGTCTCCTCCACATGGGTGGTACCT 
Sequence 750 

GCTNCCCGCGGTGGCGGCCGCCCGGGCNGGTACCCACCGTTCATCTGTAACATGGCTGTA 

ATAATGGCACCTACTCGTTGCTGTTAGCATTGAAGGAGCCAACGTATGCAAGCCAATCAC 

TTAACAAATGTCCATTGTTCTGCTTGTCATTTCTTTGCTTATACTTCCTGACAACAAGCA 

GAAACCTTAGAAGAGTGGTCTGTTCACTTTAGGGAAGGCTAAGTCAGCATTAGGTGTCTT 

CCATCTTAAAGCACACAANACAGACTGG'GTAGAGCAGCANCGTCACTGATCGCCACTCTA 

ATAGCTCTCTGGCTAAGTTTCTCA 
Sequence 751 

GCGGTGGCTTTCGCCCGGGCAGGTACTCGGGCCTAGAAATTATTTAANNTGGCGACTGAT 

ACGTCTCATGGTGAACTCGTTTNTNCTAAGGCACTCCCACTTATAGTAGGAGCTCAGCTG 

ATCCACGCGGACAAGTTAGGTGAGGAGGGGCTGACATATTCTTGANTCACTCTTGGATCA 

ATCCCTGACTTCAGGCCCTGCTGGGTCCTTCTTGGTACCT 

Seouence 752 

CCGCGGTGGCGGCCCGAGGTACAATATAGGCAGACAGTTTGCCTTCAGAAATTCAGAAAT 
GCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGCTACTAGAGTTGAAGATGATTCAG 
C C ACTTITATTCC AGCCACTCCATTTCTAG C ATAC ACTAAGAGAAGTGTATATTTAAAGA 
GGCCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCCCCCGCGTACCTGCCCGGGCG 

GCCGCTCG 



Seouence 753 

GGCGAATTGGAGNTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAATACTTAAGAAA 
TACTTTTC AC AAGTG NC AATAG ATTTATGTGTTATGGACTC ATTGC AGANG GCTG AGG AG 
CAGTAAAGGAGGGMGGTTCAAGAAAGCCTTCAAAAAGGAAATAACACTTTACCAGAGCC 
CTGCTCACCAGTMGAGCAAAMTTGTGCAACTTTGATATTTAAGAAATGGAGGGTTACT 

- — . » ^» » i ^.-m s^-m a a ^TA att^T AT ATn &PTA A AriP.ATf*. A AAflTHTAATAAG A 



CTGCTCACCAGTAAGAGCAAAAA M GTGCAAu l I IbAini I i/wummmiooauwi 
GCTAGTCGTAGNGGTGGGTGAAGTAATTCTATATGACTAAAGCATCAAACTCTAATAAGA 
GCATTATAAAAAGAGGCTAGAAAATTGGACAGGCMCATTCAA CATATC TTATAATTAAA 
GATTTGGTAGTNTrATGTTAGAGGTGATAGACACATAGGAGAGTTTTTTAAATGTGGGM 
GTAAAATTGGTCAAGACTTGAC TTTTT AGAACAGTAATAAAGGAGTGCCAGNTAt I I > n 
GGGAATTCATTTATGGTGACCCTTTTAA 
Seouence 754 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCACAGGGMGAATACCTCAGTTATTCACCTT 

TTGTTTAA^TGTTTTGGAATMCACAGACACAGCAATTATGMGTTTTTC 

TTAGATGAACAGATTCTGTTAACTAAAATGAAGGCAAATGAATGAAGAGAGTTCTATGTT 

ATATATCTGAGAGCAGGGTCTCTATTTTTGATACACTGACCATTGTTT CTCTGT AAAAAA 

TACAATTAATACTTCATCCTCTCCGTAACTCAGTTTTAGCTGAATACTAi \\\\ lAGTCC 

ATTGTATATGGCAGACTTTATGTTAGGCACGAGGGACATAAAGATTCATTCATTCCTTCA 

CGCCAGTAAAATANTTGAGTTGGGAATCTTCCTATGTTCCGAGGCACTGAGTTCAAGTGC 

TGMGGTTTAACAATGAGCAACACAGACACAGTTTTGCCTTNCTGCAGGGTACCTCGGCC 

CGCTCTAGAACTAGTGGGATC 
Seouence 755 

CGAGGTACTGTMTTTTGGGGAGCAAGCTAACACATTTGACTTGCGGCTGAGCTCTTAAC 
TAAGCAATACCTCAGTATGCTCCTTCGGGAAAMTTAAAGGTTCAGTAGTCAAATACTTT 
TGG AAATGCTGGGCCATTATGCAC AG AG AAG GCCGC AG TMGGAACATTTTAAATTTGAA 
CAGAGAACATCCAAATCTAATTCATCTTAGAATCCATTTGCTATGGAATGTACCTGCCCG 
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Sequence 756 

GTAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAGCAATGACGTGGGGAGAGAGCGC 

NGAGGAAGGGAGAGATCCAGAAAGGTAGATTCTTCTGTGTAGGTGAGTGGATGAGATAAT 

GAAGCAAGTTAATTTGGGCTCTGCTGACTTATTTATCTGTCCCGGGCACTTTTNTCTGTA 

CCTGCCCG 

Sequence 757 

CGAGGTACTGCGGGC NCTCT TGATTCCAGTCTACTGGACTGCTTGTTGCTTTGGGTATTT 

GAGTATTTA TGTGT TTCTTTTGGCTCCTGTTTGCATGGTCACTGGCTGTCATAGGGAATG 

GTGATCGGCTTTTCCAGTTGCTCAGGNCACAAAAGAGGTTGGCTGGTGTCATGGACAGCA 

CATGGGATTTGGAGTTAGAAGGTCTCAAGTTAAAAACCCTCAGGCCCCTGCATTCTAGCT 

GGGGGACCCTCGGGCAGGTTGCTTGAACCTCTCACCTTGACTGGTTTGTTGAACTTGCAG 

AGTGCTAGCAGCACCTGCCCCCAGCGTGGGGGAGTGGTTTTGCAGATTAAAAGTGNTGTG 

AATGCACTTCCTCGATGGAGGGCCTCTANCACCATGGTTTCGTCATCCTGTATCTGTCAC 

TCACCCTGTANGTNAAGGGGTTTTTCACATTTGNNGCATCAAAAAAGACCCCTA 

Sequence 758 

GCGGCCGAGGTACATGGTGGCTACCACATCCTGNTGTTTGTCTATACTGGGAGAATCAGG 

CNTTCCNAGTCATCTTGGCTGCCCTAGCTCTGAGCTTCAGGGGATATATGACTCAGAAAT 

GCTATGCTTTCTGGAATTTGGATATTTCA I I I I 1 ATTGTTCTTGGTAAATTCTCTTTTGA 

CTTAGGAGAAGCTAACTATTTGGAAAGGTCTCTCAGAACTCTAATTACAAATATATGGTA 

CCTGCCCGGGCGGCCGCTCG 

Sequence 759 

CGNCCGAGGNAC TGGN TTGACANTGTGTTTAAAGTCAAAAGATTAGGCTTGAGATCTCTT 

TCTAGTG NGATGGTTTTACAAGTATATACCGTATGTTAATGN7TAAAAATTTC AC AC CCC 

AAAMTGTGCAGNATACCAGATGTTMTTATATCTCATAAAGCTATTAAAATTTTATCTC 

AAMTTATAGCTTTATTGCATTTAGGGCATTATCCAATTTTGMTCTAGTCCAGTTATCA 

TAGCTTAATGCAGTATTATGAAAATAATGCCTATAAAGGGCCAGTTCCTCAAACACCCTT 

GGAACCAATTTTGCCATCTATATTAGNNACCTTGGGCTGCNATAATGAAGTACCCTCGGC 

CCGCTCTAGAACTAGNNGGGATCCCCCCGGGCTGCAAGGGAATTCCNATATCAAGCTTAA 

TCGATACCCGNCGACCCTCGAGGGGGG 

Sequence 760 

CGCCCGGGCAGGTACTTGTAGCAGTCCACAAAGAGAACAGCCAGAACATTCTCTATGCCA 

CTGCCTGCTTCTGGGTGAATCCCAGGTGTGAGTAATGAGGCTCACATGAGTGGGTATATC 

ACTAACACGGCTTAGGAGCCCCATCTCNAGTCATTATTTTGCTTGACAACCATGAGCTTC 

CAGGATCCCGACAAGGCACATGGCAAATTACAGGAGTCCTCACACTAAGAGAAGCCCTAA 

G CTATGG CCAGCACAGGTATTTATAGTTCCTCCCAGTCCTTTCGCAGTTTACCAGTGGGT 

CAM n l ACCATAAG C AGTGTTGC CTAGTAACATAGCTGAC ATTCTG CCT<3TATG GTTTC 

CACGGGAACAGAGCTGATAGCTGGGTTAAACTGAATCCAAGCCCAA 

Sequence 761 

TGGCGGCCCGCCCGGGCAGGTACACTTTTTTG G CTTATG G GTATCTTAGTTTAAC CTTTT 

CTNTTTGAGNGAACTGTGTCATTTCAAAAGCCTGAAGACATTGTGATGACTGCTGCCTCC 

ATAATGGCTACATTCTAGGGGCTTTGCCCTGAATCGAATATTAACTCAAAAAGCAAACAG 

TACCT 

Sequence 762 

CTCCTATTTGCGAATrGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGCCACACCAAAA 

TCTGGGAGAAGACT^.TTTCAGGAAAGGAGGAGGAAATTTGAAGACCTGGGGCTAGAATGA 

GCTCAACATATTCTAAGATCGGCAAGTTTAGTGTGGCTGGAAAAGAATGAGTAAAACAGA 

GAAACCTGANATAAGATGAGGTCACGGAGAAATGCATGGGTCAGACCACGTATAACCTTG 

CAGGTC ATGGTAAAG GCATCTGG ACTTTATAC TG AGTACCTG C C C G 

Sequence 763 

CTTAGG GCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC M I i I I I I M I" 
till) IGGGTTTA1T TGCTTA GGATAATGGCCTCCAGCTGCATCTATGTTGCTGGAAAGG 
ACATGATriTGTTCCTTTTTATGGCCTTGTAATGGTTCATGGTGTATATGCACCACATTT 
TCTTCATCCATTCCAACGTTGATGGACATCTAGGTTGATTCCACGTCTTCACTATTGTGA 
ATAGTGTTGCANTGAACATATGAGTGCATGTGTCTTTTCAATAGAACTTTC \ Mil I CNT 
TT TGGGA TATACCCCCCANTAAATAANAATTGCTGGGGTCNAAATGGGTATTATCATTCC 
TCllll \ AGTAAACCAACTTATAAAAAGATTTTAAATCAAAG 
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Sequence 764 

TTAGGAATTTNTATATCAAAGCCTTATCGAAACCCGCCNACCTCNAAGGGGG.GGGCCCGG 

GANCCAACTTTTTGGTCCCCCTTAANGGAGGGGGTAAAATGCGCCCCCTGGGCGGAAAAC 

AAANGGCAANAAACNGGGNTCCCCCGGGGGANAAAAAGGGNATACCCCGCACANATTCCC 

CACCAAAAANTANCCCGGCCCCGGGGAGCAANAAAAGGGGAAAAAAGCCCNGGGGGGGGC 

CCTAAATGAAGTGAAGCCTAACTTCACAATTAATTGCGGTTTGCGGCTTCACTNGCCCCG 

CTTTCCAAGTCCGGGGAAAACCCTGTCGNGCCAAGCTGNATTAATGAAATCGGGCCCAA 

Sequence 765 

CCGGCAGGTACATTCAAATTTTTGTTATCCTCTTCAAGAAAGTNACCTNGGTAGANTNCA 

TATATATCCCAACATNNTAGGAACTCATAAAATGCATACTTTTTTGGTGTGATCAGAGAT 

GGGTMCTCCTTATTACTTGTGGGTAGATAGTTTTGGAAATAGCCTGGAGTCATTGAGAG 

CCAATCCTTATCAACAAAATGGCATAAATGGACAGGAATGGGTTC7TCTTCACTTTAGAG 

ATAGCTCTGAAGACATGTCTCAANTGGGATNAGTTCAGNTGCATATATGAATAGCAGTC 

Sequence 766 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATTCA 

GAGTGAGAAGGCATAAAGGAGAATCCCCAGCTGACTTGTGCAGTGGTTAATTGAAATTAT 

TCAGGCAAGAGATGATGGTGTCTTGGACCAGGGGATGAGGAAGGCTACAAAATGTGTCTA 

CCTNATTCTGTGAGGAGAACGTGTTCCCTGGTTTTAGATACTGTGAAGATGGATCAGGAG 

AGAGTTTATCTAGACTGTTGGGGAAAGGTGTTGCGATTCCTTCAGCTACACAGGATTGAA 

AGG AGAC ATTTCTGAAGGGGAAAAAGGAANTGAAAAAAAAAAATNN N ANN N N NN N AN AAA 

GTNCC 

Sequence 767 

TCGCCACCCATNTGTGAGGCACAAACTGTNGNNGNCTNCATTTCACAAATGCAGAAACTT 

AGGACCATGATAGATGAATGATTTGTC 

Sequence 768 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACATGGGGACCGCCAGG 

GGCCTCNAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTAAGT 

GCAMCACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCCGAGACTAATTT 

TTCCATTT 

Sequence 769 

TTAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTAATGGCAAAAGAGA 
ATCTTAATCCCTAAGAGCTTTTTCTAATTGATGGACATCATTTTCCAATTGMCAAATTG 
GAATTTATTAGTGGTGACTGTGGCTTTGMTCTGAGCTAGTTTATTCTTGCAGTCACAGA 
G AGTGTCCTTATAG AAAATAAAAAGG GAAAAG CTC AGTTCTGCTG AATATAATC AATATA 
CACMTCAATTGATAATTCACAACTACCTCCTTTGCCTTCTTGCACACTCTTCCTGCCAC 
TCAGAAACATCCATACTTACTCTTCAGGATGGG I UN I GAGCCTGAATTATCCAGATGG 
GCTGTTTCAAC I I I I" I CAGCCATACCTTAAAAGTTGAAAAGGGATG GCCCTAATTTCCAC 
CTCTAATTCCCCTTCCAATTTCTGGGTTTGAAGCAAGACTNGAGAACTTGGG 
Sequence 770 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGATTTCCTGA 
CATTGCTATTTATATGATGCCACTTAACAGGTGCTTGGAGGAAACTCCTGACTCATTGCT 
TACCTCTCAGGGCTATGTAAAAACCACACAGAATTTATTAAACACATCACAAGTGGTACC 
T 

Sequence 771 

TNCTATAGGGCG/sATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCTGCGCCAATCAT 

CACAGCCTGGT/GTATCAAGTGTGTGCCAAACTGCCTGCGCTAACAACCAAATGCAAATG 

TCATACGTCTCTGTCACTAGCAGCATAGTTGCTAGAGAGACACGTGTGAGCCACACCTCA 

AAGACAAGCCT.'CATCATCTTTTCCTGGAGATAGCTGACTAGCAATCAGATGTTCAAATA 

MGCTACTTTCTTGTACCTGCCCG 

Sequence 772 

CCGGGCAGGTACTGNTTGTGATCCAGCTACTGAGGTCTTGAACCGAACAGACCTGGGAAG 

NGTCTCANAGCTGTCTCTCAGGCTCTGAGAGAAAGTAGCCAACACTACAGTAGACAGAGA 

GAGACAGAGATCAAGACAGGAGACAGAGACAGCATAAGACAAAAATACTTTATGAAATTG 

CCAAGTCCTCAAGACAATTCTAGNTAACNTCTTTTCAGCNAAANTGGNTGGAACCCAGGG 

CTTTTAACCTCCAAAGGTNNCACAAANNAAATNAAAACTAAATNANNAANAATNGCTTCN 

CTNNGGGCGCGNCTTCTTATGTAAACNTATNGNGGGGNTATACCCCNCCCGTGNGCCNTG 
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GCCATGGGGAAATTTTCCGCMTTTTCCANAGGCCTTTTATTANTGNATNATCCCCGNTG 

CANAAACANCTANTNNAGGGGGGNGG 

Sequence 773 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCAGTCACTTAATGGCCCCATT. 
ACCGCTCCTCTGTCTCCCGTGCTGCTAGCTCCTGCTTCTAACTCTCATCCCTAATCTTAG 
TGTCTATGCCCTGTTCTTTGATGTTTAATCMTTGCTTTTGTGA' l iTTTTTTTTTTTTTG 
CTATGCTAAATGCCATATTTAAGTTTCTCAAGA I I I I M (I I AACCTCAGGAGCAGAAAG 
GTG MTTAAT CAGACTCTTTAGCCTGTGGCMTTTTGTTGTATAAGGTCCACCTGTCTTG 
TCTTATTTTTCCCTTTATGCCAGATTCCACCACCCATCAGTCCCTGGCCTATCCCAACAA 
GGCACCCTACACTTAA 
Sequence 774 

C TNCTT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCAATTAA 

TATTTTATTGAGGTGCCCTGTMTCTGACATTCCAGGGAGCTTTCAAGGAAACTTTGTCA 

CACTAAAAAAAATTGAGGATTGTTTTGTGGGTTATTCCGCTCCTTTGTCTTCTAACCCAC 

AGAAGTTCTGTCATGGTTCTTAGGGGTTGTAGGGTGTGTCCACTTATTTTAAAATGTCAA 

AGACCAGAAGAGTGGAGAATGGGAAAGTGGCAAAGTGTTAGCACTGGAAAGATGTTTAGA 

GATC AC C GTAGTCCAACCCTTAAATTTTATAGATG AATAAACCTG GGTATCTAAG AAAAG 

CTTAAGTGACCCTGCCCTGGAAATCACACCAACTAGGT 

Sequence 775 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCT 
GAGATTTTGGAGTCTGGTGGGTTCCAGTCTTGCTTTCCACCAATMTTAGCTTCACTTGA 
CTTCTTTGGGTTTCCGTTTCCTCATTTGTATGAGAAAATAGGGATCATGAGAGTACCT 
Sequence 776 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTGGCACTGAGC 
CAATG TATG CTATCAAGGAAAGCTTTATCTGTCACTGAGCAAAAGGTGAAGTTCAATTAG 
GTCAGTTTTATCAC TTCTTTTC CfACACACAMCTATGAGGAGAGATCATTTCTTTTC^ 
TCTTTTATTTATTTA I II 1 I I I I GAGACAGGGTCCCACTCCGTCGCCCAGGTTAGAGTAC 
CTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnGAGGGTTAATTGCGCGCTTNGGCGTAATCATGGTCATAAGCTGTTTCCTGNGG 
Sequence 777 

CCGCGGTGGCGGCCGCCCGGGCAGGTCTCAAACCCCAAGACTGAAGTGATCCTCCTACCT 
CAGCCTCCTGGGTAGCTAGCACCATGGCATGCGCCACCATGCCCTA i \ 1 1 H H i 1 I AAA 
GACCCTC ACCTCTTAAAAAAAAAAAAGTAG ATTAAAAAATACC ACAATTGCTC AG G TAG A 
TTGAAAAACAGGCATATAGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnTAGTGAGGGTTTAATTTGCCGCCGCTTGGGCGTTAATCATGGGTC 

Sequence 778 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACCATAGATATCC 

CTGGGGGATGTAGTGGATTCAAAACATGAGTTGGATTCTCCAACGTGCCATCGGAATTCT 

TCTAATATTCAGCAGCTTTGATTGTTGTTTTACATCTGACCTGTATTTAGATTTCTCTGA 

TTCCTGGAATATTTGTGGAAAATAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTGCGCCGCTTGGCNGTAATC 

ATGGTCATAAGCTGTTTCC 

Sequence 779 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTATTCCAGT 

TTTGCAGATGAAGAAGC TGGG ATTTAGTGAAGTTAAATAACTAGTAAATAGAGAGGGCTA 

TGTTCCATCCTAAGTTGCTTTTGATTCCAAGACTGCTATGGTATTTTGAGACGGCTGATT 

TGGTACCTGCCCG 

Sequence 780 

CTATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTTGTAGATG 
GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 
TTGTAGGACCCTATACNGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 
AAATGGTAGCTGAG AC CAATGATGCCG ACCTCCCTC AAAAG CTGCATTTCTGAATTTCTG 
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AAGGCAAACTGTCTGCCTATATTGTACCTGCCCGnnnnnn'nnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCNTCGACCTCGAGGGGGGGG 
CNCCGGTACCCAGC I I I I I GTTTCCCTTTAGTGGAGGGGTTAAATTGCGCC 
Sequence 781 

CTTANGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTGGTGAGAA 

GGATGTTCATCAGGAAGGGCATCATATTGTGTAAAGGGCACAGGATTGAGAGG AAAAT TC 

ATAAGGCAATGAATGTATATTCACCTAGCCAGCCCAGTCATACCGTCAGAGACATTTTTA 

ATTCCAATATGTTTGGTTACGTTTCTTAAAATTCCAACCTATGCTCCTTATATGATACAT 

TCACCTCTTTTGTAAGCATMTCTCTTTACCATTACCAATTAATTGCAGCCCATCCTATT 

AGCTGTAGAAGAAAGATGTGGCAAATTTGGGGAAGTAAAGAAAAAAGGGGATCAAGGAAT 

AGACATAAAAGATTTTGTGATCACCTGGCGTATATCTACCCAGNTACCTTTGGGTGGTTA . 

CTTCCTTTGGAAAG 

Sequence 782 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGAAAAGATCCC 

AAACCTGAATCTMTAATGAAGAAATACGATACAAATGAAAACTGAGAAACATTCTACAA 

TGTAACTGAACTGCAATATTCAAAAGTATCAAGGTCATGAAAATCAGGGGAAGACTGAAG 

AACTGTTCTAAAATGAAGAGGATTAAAGAAACATAACTAAGTATATAACCTATAATTCTG 

ATTAGATCCATTCCCTACACAGGGCATTGAAACAAAAGCCAAAATGTGTACCTGCCCGGG 

CGGCCGCTCG 

Sequence 783 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAG 

GCAAAAAAGCAATCAGTATGACTTGAACTGCTGGGTTTAATTACTCAATATAACACTTGC 

CATTTAAAAATCCATATGCCCATCAGCATGGCAACAGTCTCTCATAAAGATTCCGGTATC 

ATATGGCACAATTTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn nnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGTACCCAGCTTTT 

TGTTCCCTTTTAGTGGAGGGGTTTAATTGGCGCCGCTTTGGCCGTAATCATGGGTCTTAA 

GNCTGTTTCCTGTGGTGAAAAATTGGTTATCCGCTCACAAATTCCACACCAACATACCGA 

GCCCGGGGAGCATTAAAAGTGGTAAAAGCCTT 

Sequence 784 

CCGCGGTGGCGGCCGAGGTACGCGGGATGTGACAACCCTGCTGAAGTGCCACCTACTATA 

TTTGGTCTCAGCGATTAAACGAAAAGAGATGGTAAGGCAACAAAAATTATCAGCATATAT 

TTTCAGCTTCTTTGAGTTTTGCAGATTAGTATAGTTCAAAGGATAGACAAATTCACTTGT 

TTTCATTTTGTCTTTAAGATAMTAAATTTTGGTACCTGCCCG 

Sequence 785 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTTTCACTCTCATCAAACTAGACTACCGACTG 
TTTCTCGAAAAACTTTTTCCACCTCTTCACCTTTGCTTATGCTTATCTTT CTGTC 
AGGTTCCTTCCACCATTCATCTCTCCATTTAAAAATATTTTTTGAGCACTTTTT 
GAAAAATTTCAGTGCAG AGAAGTTGAAAGAGTAGTACCTn nn nnn n nnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGGTTAATTGCGCCGCTTGGCCGTAA 
TCATG GGTC ATTAG 
Sequence 786 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGOGGCCGCCCGGGCA GGTAC TGGGAT 

TATAAGCATATAAGCCACTGCGCCTGGCCCTATATACCTTCTTTATTCCTTTATTTTAGT 

GGCATTTCAAGAGTGTTGAAAACATATGTTAGATCTGGCATGTTTAAATAGAAATCCTTC 

TGAGCCCTACTTTGGGGGCAAAGGGAGCCGGCCGATTTATAGAGCTCTTATGATGTGGTA 

GACAGCGGCAGTACCT 

Sequence 787 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCAl'GTGCACGTGCGTGTGGTAGTAGGCAAC 
TGCTCAAATGTCAGTTTTGAGATGTTGTTGGTGGGCCTGTGTTTATCTGTAACTGTATTT 
ATGTAGGTGTCTCTTCAGAAATATGTCTGAACTGGGCAATTTGTTGTACCT 
Sequence 788 

AGGTCTTGAGTCGACCCACGCGTCCGTGCCTTACAACTGGATCTCATGAGACATTTCCTC 
AACTGTGGCTTATAGTCTCTGATGAATCTANGTTTGTGCC AAAGT TAAT ACAA AATCAGC 
CAATATGCCTGGATTTTAAAAGTAATCTATTTAGGCTTTACTTTTTGA 
CCCGGCGGCCGCCAC 
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Sequence 789 

AGGTACTTATATGCTAGGAGCCTGGGACATACTTTTATTATTTCATAATrTTACATCTTT 
AAATCTTAAAG AAACATCGGTAATATCTGC ATGAATCAGAATTCAAAC G CAGCCTTGCTT 
TCACAATCATAGATTTTCAGGATCAGAAATTTATTCTGTCAAGAAGAAGGATGACCAAAA 
ATGCCGGACGCGTGGATCGACTCAAGACCTGCCGGGCGGCCCGCCAC 
Sequence 790 

GGGCAGGTGTCTTAGGTAGGATCTTGCTACTCTGTGTGCA CTTATC TTGGGAGTCAGAGT 

AAGTTCAATTTGTCTACTTTATTCTGTGGTAG 1 I 1 1 1 CTTGTATT TTTAC ACTACGTCCT 

GTGTGATTTCATTTTCATTGTAATTTAGTC 1 I I ! I GTTTATTGGGTTTTTCTTCTTTCAT 

ATTGGCTTCATCTMGGAACAATMTGTCATTGTTCATTTCAGAACCAAGAAAATATAGA 

ACACCTAGAAAATCAATCAACCAAACAAACAAATGAATAGTAGAAAATCAAACACACATG 

TATTGCCTCTTGTTTCTTGCTGNAATTTTTCCCTC 

Sequence 791 

AGGTACACTTTTTACACCATAACTGCCATGTGGTGATAAGCTAAGTCATCTCATTCCTTC 

TCCCTTTCTCTGTGCCTGTCTCCATGGTCATTCGGTCCTATCATCTGTTAACAGGTAGGA 

GAAGGCATGTTTGCATTTGAGAATGATGGTTACTATAAGCAGTCAGTTGCTTATACTGGT 

CTATGAAGCATCTAGGACTGTCCCACAAACAAGAGAAGGAAAAAAAGCCAAATGGAGCGG 

ACGCGTGGGTCGACTCAAGCACCTGCCCGCGCnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnTGTTCCCTTTTAGTGAGGGGTTAAATTGCGCGCTTGGCCGTA 

Sequence 792 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTTACT 

ACCCAAATTGATATTACTACCTGAGCATTTTTCATTTTCTT^ATAA 

TCATTTTATCTACCACAAAATATTAAATTGAATAAMCATTTTTGCTGAATTTGTGGAGC 

TGGGCTTTTAG AC CGGTTTTAAC ATTTC AGTGTAAT ATG TAATG C AG CTC AAAAGCTTTG 

TGCAAGTGGT 

Sequence 793 

AGGTACCCTCGGTTGCAAGCACAAGCAAATGTGCCAGGGTGGTTGATGCAGCTGTGGTCA 
CAGGTCCTATCCAAAGAGCACTCGTCCACATCTTGGCAAGACTTCTCATCTGTTAATAAT 
TTAAATCCTTTCTT 
Sequence 794 

CGAGCGGCCGCCCGGGCAGGTACATCATGCCTCCAGTTCTGGAGCTAACACAGATTTCTC 

CAATTTCATCTGTTTTGCAGAGCTGGGGAGGTCCATCTGGTTTC ACAATG CACATCATCC 

CACCAGGCATTACATGCCCTACATCCTGGACCGTCAGTGCTGMTTTTTATCTTCAGTAT 

TGACCCGTATTACCCCATAGCTCAATCCATTCATTGAGAG AATGG CTCTTCCTG GCAAAG 

GGGCTCCTGGAACTCCAGGCCTGCGGATTGCTACAGTCATGGCTTCAGCAGACGTGGCGC 

ACGGTCAGATGGCCTCAGGCTTCAGTCCATGACTTTGGAACAG 

Sequence 795 

AGGTACCCTCGGTTGCAAGCACAAGCAAATGTGCCAGGGTGGTTGATGCAGCTGTGGTCA 
CAGGTCCTATCCAAAGAGCACTCATCCACATCTTGGCAAGACTTCTCATCTGTTAATAAT 
TTAAATCCTTTCTT 
Sequence 796 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGGCAATCAGAGGGTGAATGATACGCACACC 

TGTGTATCTCCCATATTCTGCMCTTCTGTTTTTATTAGAAGTGTAGAATAAGTTCCCAT 

CTTGTCTTCTAAGCTTTCAGGTTCCCAGGGTGTACCTCGG 

Sequence 797 ■ 

CCCTTTCGAGCGGCCGNCCGGGCAGGTACTTCC! rTTTTTTTTTTTTTTTCTTTl IAGCT 

ACCTGGGTATCTAGCTTGGGACCCTGTCCTCTTCCCACTCC ACTG TAGGGTAAGATNAAA 

CTTTTCTAGGATTNTTATTAATGAAATTATATAGCATGGATACTTTTATATCTGGCTTNT 

TTTGGTTAGCATAATA 1 i 1 1 1 GAGATTTATCCACGTTGCTTGTATCAT TAGTTT GGTCAT 

TTTGCATTGGTAANCAGCATTTCATTGGGTGAACATAATAAAACTTTAI I I I I CTGGTGA 

TGGACATTTAAATTGGTTTNCGATTTGGATMCT GGAMT AAAAACTGCTATGAACACAT 

ATGGTCAANTCTTTATGTGGGCAGATAAATGCTATTTTTTAAATTTAAGAAAAACTAAAC 

AACAGAAGTGAATTGANTAGCAAGTCACCCATATTTAAAGTTGGAATCATCTGAAAAGAT 

TATATATCTTGGTTAAAAACTAAAAACT 

Sequence 798 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTATCATCCATAATATCTTACTCTCTAATCCA 
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CATAATGGAGTTGTGCCAGACTCCAAACTGAACATAATATTTATATATACATATATATTT 
ATATATATACACACATACATAAACTATAAACCTGAATATCTTAGCATACAAAACTTATCT 
AACAACTAATAATTTAGGATTCCTTTGAGAGGGAGGGGTTAGAAGAAACTCTTCTTAGTC 
TTTGAAATAGTN CTGACAGTAAGC AATAAAC AACATTATTAATTN CTG G GTTCTCATTTA . 
TTCAAGTGAAAGACAAATAAAATAAAAATTCACTTCTTGAGCCACTAGAAATTTTGAAAT 
C ATAAAGG ATTATTTGG G AAATGAGTAAGTCTTTAAG AACAATTTAGTG NTATCTAAAGT 
AAAGTGGATTTTGGTAAGATACAGAGAAAAANTCTATGCATAAATNAATT CTTGN TTCAT 
TGAGCAACTATAATCGCTAAGGGGGGTCTTTNTAAGNATCTCAGGAATGTTTTTAAAATA 
TTAATAGTG AATAAATT AN ATTG AGG ATAG GG ACC AATC 

Sequence 799 

AGGTACTCAAAAGGCATTTCCGCTTACAATTTGTAGAMCACAAAATGCGTTTTCCATAC 
AGCAGTGCCTATATAGTGACTGATTTTTAACTTTCMTGTCCATCTTTCAAAGGAAGTAA 
CACCAAGGTACTGGGTAGATATACGCAGGTGATCACAAATCTTTTATGTCTATTCTTGAT 
CCC IHiM CTTTACTTCCCAAATTTGCCACATCTTTCTTCTACAGCTAATAGGATGGGC 
TGCAATTAATTGGTAATGGTAAAGAGATTATGCTTACAAAAGAGGTGAATGTATCATATA 
AG G AG CAT AG GTTG G AATTTTAAG AAACG TAAC C AAAC AT ATTG G AATT AAAA 
Sequence 800 

CCGGGCAGGTACTACCCAAGGTCCTCTGTGACTCGCCGCCCACTACCCAAGTGAATGAGT 
CTCCCCTAGAGCTTTGCTACTCAGAGGGGTCTGAGGACAACAGCATGGGCCAACACGTGC 
ACTCGAGCTGCCTGGAGATCTTGTTCAAAGGCAGATTCTGAATGAGTAGGTCTGGGTTGG 
AGCCTGAGAGTCTGTACCT 

Sequence 801 

CCGGGCAGGTAC M I I I I M I I I I I I I I l I I I I I M GTAAATAAAATTTTAATGGAACAC 

AGCCATGCCTGTTCATTTACATATTGCATAACCCCTACCCCTAGAATTGAGCAGTTGCAA 

CAGAGGCCATATGGCCTGACAMGCCTAAAGTGTTTA CCTAT TGGGTTCTTTACACAAAA 

TGTTTGCCCACCCCGGATTATATCATGGACTCGACTTGTTTTGGTTTCATATTCATAGTC 

TGAATATATTTTGGTAGCCTTAACAGTTCTACAGGGAGAGAATATACAAGTCAGGCTATT 

CTAGGTTTTCTGTAGTTTCACAGATTTGTCATTATAATCAGAT 

Sequence 802 

AGGTACCAGGACCTCTAACTCCCCCTGACACAGAGCAATTAGACTCCCATAACAATGGTA 

TCAATTATACCACTCCATTGGAGGGACTTCCTTTATGTGTCACCCAGGATACATTGCTCA 

ACTGCAGTTGCCTTGCAGT1TGATCCCAAGCATGGTTGAGTTACCATAAAAAAATTATGT 

ACCTGCCCG 
Sequence 803 

CCGGGCAGGTACCTGCTAAGTGCTGGACAGCCTTTCTCACATTTCCTCCCAGCAATCCTA 
GAGGCAGGCCTGGTTGCCTTTGTGAAGCTCAGAGTGGTTAAGTAACTTGCTCAAAATCAC 
AGAGCTACTAAGTGGTACCT 

Sequence 804 

CCGGGCAGGTCrrCCCAGGGATAGTTTTCCATTTGATTAAAGTTTrGTTCTTATGTTACT 

TTTTACTGTTGTTTTTGCAGTTTACCTAATGCTAATAGGGTCTCAGGAAC TGTA TTTGAT 

GjjAAAGTGTGGTTTTTCCAGMGATGACAGATAATTGGTGGTCTCCCCTT TTCCT CAGC 

AACATAGTTTGTACAG CATACTG ACTC AATTCTTAAGTCTGATTTGTG CAAATTTTTATC 

GTACTTGAGAGTTACAAAGCAAGTGAGAACTTG AGGGA TCAAGATCCTGGAGAGAAGGAA 

ACCTTAAAAGGGTAAACCCAACATTTGGCTCTACTTTTCCCCTTGAGGTAT 

Sequence 805 

CCGGGCAGGTACAAACCCCAAGTGATTATAGAAAAATCAATGTGGCAGCTACACTAGAGA 

TGTCCAACCCCAAGGCTATGGGCCGTTGCTCCCTCTTTCCCCCCAATCCCAATCCCGCGT 

ACGCGGGGCCTCTTTTCCGTGGCGCCTCGGAGGCGTTCAGCTGCTTCAAGATGAAGCTGA 

ACATCTCCTTCCCAGCCACTGGCTGCCAGAAACTCATTGAAGTGGACGATGAACGCAAAC 

TTCGTACCT 

Sequence 806 

AGGTACCCCTAAGTGGGAACCTACCAGGACATTCAAAGCAAGAGCAGTAAGT TCTGA ATG 

TTCTGGGACAACCTGGGTGATATGCATGGATATGGGCTGTGGAGGCTGAGCATTTTAATG 

ATAACTTAGGGAAACGAGGCATGGCCATGGTGTAAAACTCTCAAATCCCAAGCCCTAATC 

CAACCTTAAAATCCGAGTC1TCTAAAGGGCTGTTTTAACCATGAAAGGACCATAAGAAAG 

GCAATTCACAGAAAATGAAGCCATGTGGCCAAGAAATATAAGAAAAACAGTAAAAGCCCT 
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TAATCTCAATAGCAATAGAGTGGATGCAAATGAATATAAT 1 
Sequence 807 

CTCTCCAGTCTTTGATTGTCCCGCAACAGTATTACAAACAAAAGGCATTAGGCAAAGCAT " 

GCTGAATTGATTGGAGTCCCTTGGTCAAAGGTATTATTGATTGACGGCAATCAGATCCAC 

TCCCTCAGAAAGGATTTGAGTAGGCCTNTTCTGTCCATCTGCAGAAGGTTCCCCAAAAGG 

GGCAGAGGGCCGGGGCCCTGGTGGGAGGGTGCCATGGGAGTTAGGGTGACCCTGAACCAG 
GAGTAGCAAG 

Sequence 808 

AGGTACTGAATTCTACCCTGGAAAAACAAAACCCAGGTGTCTCCTCAGCTTCAAAAACTC 

TCAGGGAA TGAAT CCCTGTGTCCTACACCCAAGTATGTGGAATTTAAGAACCTGCTGTGG 

ACCTACCTATTTTCTTAGAAATATGCAGCTGAATATAACCATTTTTGGATATTTGAGATC 

ATTATGTACCTGCCCG 

Sequence 809 

AGGTACCAAATAATGGAGCTAGAATTCCTATCAAAATAATGGAGCTAGAATTTCCATCAA 

CATATAAAGTCCATATGTGAGCCTCATATAAGGCAAACTGTAAAATCAGTCAAGGTTCTA 

^ 3J!SZIIG!y ^"^^ C AAG ATCTGG AAAG AGTG ATTG AGC ATTC G TTATTTTAAATTAC G G AC T 

ATTTTT1 1CCATACAAGGAAGTTAACATCTAGAGCGATCATTCTCAAACTTTATTGTATA 

CCAGMTCATTTGGAGGATTTATTAAMCACAAAGTGCTGGGCCTTACTCCTGAGTTTCT 

AATTCTGTACACTCTGCCCCCATCCCGGGATGAGCT 

Sequence 810 

CCGGGCAGGTACAGGTATGGGGACCACAGGAACAGTTAAATTCATGGCATGGCTGGTCTA 

CCACACAGTCGGGGGAATTCTTTAAATAGAGCCTGTCACTCTCTTGGCCCATCAATGGGA 

TTTCCTTCTCGAAACTGCTGATTCGTTCAGGTACCTCGGTCGCTCTAGAACTAGTGGGAT 

CCCCCCGGGCTTGCAGGGAATTCGATATTCAAAGCTTATTCGGATACCCGTCGACCCTCG 

AGGGGGGGGGCCCCGGTNCCCCAGCTTTTTGGTTC 

Sequence 811 

CCGGGCAGGTACTTAAGAACTCTCCAGGTAAGGAGCATGGCTCTAATGGGAAATCCTTCA 

GGTCGGGGTGAAAGAGGAAAAAAAAATGGTTACCAAGGATGAACTGTAGCTCACTTAGGA 

ACTGAGCTTTCAGACTCCCATAAGAGCTAAATGATAAAAGAA I I I I 111 I I I l AAGAAAA 

AAGGAAAAGAAAGAAAAGAAGAGGAAGAAACCAGGAATTCAATAATGTCCTCAGAAAACA 

AAGCAGTGATTCCTTCCTCATTACTTTTGAGCAATGAGACCCCTACGCTCCCGCGTACCC 

TCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCT 

Sequence 812 

AGGTACTTTATTCTCTTATCCAGGACTGGATCAAATGATTTATGGCATTGTGCTGTTTTA 

ATGTTCTCATCCACAGGGTCGATTCCAATAACTTGAAGCCCCAAGCCGCCCTAGAGGTTC 

AGTTAACAGCCCACCACCACAGCCMCGTCAAGMTCTTCATCCCCAACAAAGGTTTTTN 

CTGGCTGGTGATTTAGGGAATTTGTTTTCAGAAGATTGTCCCTAATTAAAATGNCACCCC 

TCNAGGTCATTNCATNGGAAATNGAAGAGGNGCATATACTCCCTTGGTTCATNCCCACCA 

TTTGTGNAGCCANGGCCCAAGAAAGGTT 

Sequence 813 

CGCGGNGGCGGCCGCCCGGGCAGGNACTNTTGCCNGGGANGAAAAAACCCCGACAGCAGC 

AAACNGCAGAANCNNCAGACNGCAGGCTGCNGATGGNGAGAGGGAACNCCGCCCCANACC 

CACNGCCACNGAACCNGGCNGGGANACCAGNGGCCCNGGNGGANGCACCANAGANGAGGA 

GCCCGGGTNTTCTGGCCAGGGGGCNGCNCGCACCNCGGCCGCNCNAGAACNAGAGGACCC 

CCCGGGCNGCAGGAANNCGANANCAAGCNNANCGAAACCGNCGACCNCGAGGGGGGG 

Sequence 81 4 

GCATTGGAGCTCCA CTTTT GGTGGCGGCCCGCCCGGCAGGTACTCATCAGATGGAATGTT 

TTACCCCGCCGAGGTTTTAGTCATGATGTGCTGAGCTCTCTGCGTCTGACGTGACTGACT 

GGTAGCTGGGCGTTGGCGTGACCCTTCTTTCCCTCTAGGATCACATCATGGAGAATTCCA 

CCTATGGGAAAATTAAAATGATTGACATGCCCGTGGAAAGGATGCATCCCCATCTGTCCA 

AAGGCTATGCGTACCT 

Sequence 815 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATCACAGGATTAAAAC 
TCCAGTTAAG CAACT GAGCTAATCATTGAAGTAAATTAAAAATACCAAGCTTCTTTAACC 
TATCAATG CTGTTTTAG AAGC ATC ATCC GAAC AAATAG AG ATTTAGTTATAAATTG N CTG 
GGCTACATTCTGTGATGAGAATTTTGCTTAGTACCACTAGAGGAAGAGAGACTAGAGGCC 
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CAAGAAGAAGGTGAAAGTAGAGCTATGTGCCCAAGAAAAACTTGTAGAGATGGAATAAGA 

ACTGAAACAATAGTAGAAACACTTGCCCTGAAGAAAAAGGATGGAAAAATGAAATAGATT 

GA1 m I AG CTG GTAGGG AAG AAAGTTAACATAGTCTTAGTCG GTTGTGTTTTCTC AAGG 

AACTGAGAAGCAGGATTAGTTTTTAGAAATTTGAANAAGGTAGTGAGAGGTGAAGCCCAG 

C TGGGTCG AGTGGGGACTTGGAAAACTTTTNTGTCGTACCTGCAGGCCTNCTACACCTAC 

CTTTTTTi iGGGC 

Sequence 816 

AGGGANCCNGGAGNCAAAGCAGCAGCCCCGGANGGNGCACNCCCCGGGGGAGACANGGGG 

GAGCCGCAGCCACCCTGCCTGGCNGGCACGGCACACNGGNNNGCAGACAGGCCCACGNAC 

CNGCCCGGGCGGNCGNCAAGAACAAGGGGACCCCCGGGCCGCAGGAANCGAAAAnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCNNNNGGNCCCNNAAGGGAG 

GGGAAAANGCGCGCCNGGCGGAAACANGGCAAAAGCNGGNCCCCGGGGGGAAAANGGNAA 

CCGCNCAACAAANCCACACAACAAACGAACCGGGAGCAAAAAGGGAAAACCCOGGGGGCC 

CAAAGAGGGAGCCAACCCACAANAAANGGGGGGGGGCCCACCGGCCGCNNCCAAGCGGGA 

AACCCGGCGGCCAGCCGNAAAAAAAGNAACGGCCAACCCCCCGGGGAGAGGGGGGGNNGG 

GGAAAGGGGCCCCCNNCCNNNNCCCCGNNCANAGACCCNNNGGGCCCGGGNCGNNCGGNG 

GGGGGGAGGCGGGAANANNCNAACCAAAAGGNGGGAAAACCGGGAANCCCAAAAAAAAGG 

GGGAAACCCCAGGAAAAAAAAAGGGAGCNAAAAGGCNNCAAAAGGCNCGAACCNNAAAAA 

GGCCCGGGGGNGGGGGGGNNNCAAAAGGNCCCNCCCCCGGGCNGGGAAAAAAAA 

Sequence 817 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATTATA 

CTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCCCCGCGTACA 

TGTGTGAAAACAATATTGTATACTACCATAGTGAGCCATGATTTTCTAAAAAAAAAAAAT 

AAATGTTTTGGGG 

Sequence 818 

GACTTGTAACTGGCATCTGAAGTGTGGGGAGTCCTGTGGGATTGAGCCCTTAACCAACCC 
TAAATACTTGTGCTAACTCCAGGTNGTTAGTATCAAAATTGAATTACATTGTTGAACATC 
CATTTGGTGTCTGGAGAATCANAGAATTGGTTGCT 
Sequence 819 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGTAAA 

GAGGTGCAAGAAAACCAAAAACATAGGCATCCAAGGTAGAAGGCGTATCTCCAAAGAAAA 

CTGAGATGTTCCCAATCTGTTTGATAGAAGATTTAGGCACTCCTTGGCATCTCTGTATAT 

CTGTGCTTCCACTTCTCGGAGGTGGTAGAGGGGAGGCTGTCCTCTGGTCAGGAGAATCCT 

ATTCAGTGCTCCCTTAGACATTCTTCCAGGCAGGATCAAACTCAAAGGAAGAGGAATTTG 

TGAAGCAA 

Sequence 820 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTTAAGCGGAAGCATACGGGAATATGAGTGC 

AAAAGTGTGGCTGAGCCGCGTATGCCCTTGATTGGTTTTGGGAAGCCTGAGGGAGGCAGC 

CTTCCTGGCTATGAGCCATCGCCTGCCCAATCAGGCTAAGGGTGGTGACTGGGGTGGTGA 

AGGGGCAGCTCTGCTGAGCATGGTCTGCCTTATGGCCTGAATTGTCCTCAAGGGGTGTGG 

ACTGCAAATGGTGTTCACATGAACCGGAGACATCACTCTTTTAGGATTCTACTGGCAGCC 

CCTGAATTGGCTCAACGTTT 

Sequence 821 

CCGCGGTGGCGGCCGAGGTACTGTGGGATTTGTCTTTTATGAGTTACCACGTTCAGAGAC 

TAGTTCATCTTTGGTTGTATAGGATGTTAGGATCTCAGTTGGCATTCACAGTTAAATCAT 

GCACCTTCAGGAACTGGGAGCATTTTGATCCAGAGTCACAATCATTCCTTTCTTATCTTC 

ATCCCTATGGTATGTGTGTTCTGAAGTTTAACTGACAGATGGCAGCTGGTACCTGCAGGC 

CTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGATGA" r GTGGCTCTGGAAGG 

CGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGAGGGCTACGAGCGTCTCCT 

GAAGATGCAAAACCAG 

Sequence 822 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGAGGTCACTGACTTACNCCCTTCCCAC 
AGCTACAGATAAGGGCTCGCAAAGTTGGCCTCAGAGACACATCANGAACCAAGGTGGACC 
AGCAGGTGCCGAGCCTGTGTATCTGCTTGGAGGAGACGTTCCAATGTGCTGCTTGTTCAG 
AGATGGTGAGTTGCAAGAAACAGAAACCCACCACAATTTCTNAGGCAAAAAGGGAGTTAA 
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TTATAAGGACATAAGAGCACAAAGTTCCAGTGCAAGAGATACATCCAGGCTGCACAAGCT 

CCGGGAGTGGGGCCTGGCAAGCCAAAAGAAACCAAAGTTTGTCTTGCCTTCTGTrCCTCT 

TTCTGAAGCCACATAACCCTTTATTGACCNGNGTNTCTTTGCATCGCTTTTGTTTTCTTT 

TTATG 

Sequence 823 

NCGCCCGGGCAGGTACCACAAGAACTATGAGCTGGTTATCCACTTCATGTGGAATCATAA 

GCGTCCCAAAGTGACAATACATATAGATTGCCAGGCAGTGAAACAGTTAAGATGCCACCA 

TAGCTTTCTTTTCAACATCTTTCTAAA^^ 

CCT 

Sequence 824 

GCAAAGGGACAGAAAAGCNGAGATGAACAGNGCTGCAGCAAACACAAGCCGGGCAAGGGG 

GCNCCGAGCCNCGCANCCCCCGGCCGNNCNAGAACNAGGGnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCNATN 

GGNCCCCNAAAGGAGGGGNAAANGCGCCCCNGGCCGAACAAGGCAAAGCNGGGACCCGGG 

GGAAAANGGGAACCCCNCACAAAGCCCACAACAAACCAACCCGGAGCCAAAAAGGGAAAA 

CCCGGGGGGCCCAAAGNGGGNNGCAACCCACAAAAAANGGGGNGGGGCCCACCGGCCCNN 

NNNCAAGGGGGNAACCNGGCGGCCCANCGGNAAAAAAGNAANNGGCCACCCCCCGGGGAG 

AAGGCNGGANNGCNAAANGGGGCCCCCNNCCNNNACCNGGNAAAAAGAANCNNNGGCCCN 

GGNNNNNNGGANCNNNNAGCCGGNNAANNCCCCNAAAGGGGGGGAAAACGGGGNNCNCAA 

NAAAANGGGGGAAACCCNGGAAAAAAAAGGGGGNNAAAAGGCCCNCAAAAGGCCCCGAAC 

CNAAAAAGGGCCCNNGNNNGGNNNNNNNAAAAGGNCCCCCCC 

Sequence 825 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGAGAAGCAGCTGA 

AGTTTCAGAACATGCTGTTATGCTTGCTAAGGATAGCAGAGAAGAGTTGACTCCAGATGC 

TAACATCAGAGAAGTTGGCCTTGAGGGGGCATTGTTGATCTTACTAGACAAGGAGACAGA 

TGAGAGATTATGCCATGATATCAAAGAGACTTTAAATTATATGCTTACATCTATGGCAGT 

GGAAAAACTCTCCCTGTGGTTAAAGCTTTGTAAAGACGTACCTCGGCCCGCTCTAGAACT 

AGTN 

Sequence 826 

GGGCNAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATTATACTGCT 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGCACCAT 

ANATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTTAGTGGAGGGTTAATTGCGCC 

GCTTTGGCCGTAATCATGGGTCATTAGGCTGTTTCCCTGTGGTGAAAATTGGTTATCCCG 

CTCACAATTCCACACCAACATACGAGCCCGGGGAGCATAAAAGTGTAAAAGCC 

Sequence 827 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAGCATGGGAA 
CCATTAACCAGCAGGCCATGGACCAACTGCAGTATCCTTTGTGACCCAGAACAGCACCAA 
GGTCAGAGGGCACCTGTATTCATAAACCAGCTGCCTGACTGTGAATCCTGATGAATCAAG 
CTCAAAAG G AGAAAAC ATAAAATN C ATAAAGTAC CTN 
Sequence 828 

TGGCGGCCCGAGGNCAACTTTAAACTGTATTGTATTCATGTTGCTAAACAATATTGGCCT 

TCTCGA TNA'r TTTATTCATGTTGCTCCAAAGTTAAAACCCTGTAGAACTAAGTAGGTGAA 

GAGATATTTTGTATAAGTGCCCAGAAGAGAAAATTAATAATTAAATAGTGAATTGAGCAT 

CACTAGAATAAAATAAAATG AGTA GGCATTCCTAAGATGTGAAATGATCACCTAAGATAT 

AC ATGCTC C AACC ATATTG ATTTTAG AAC AAAACACAG C AGC CC ATAAC AGTTTGTGG CC 

TCTACTA/>CTGNCCTCTGCTGCCCATCTAGAGGTTATGTTTCTCTATTTTTAAAATAAAA 

TGTA 

Sequence 829 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTC rTTTTTTTTTTTTTTTTTTTTI 
GGTTGAAAGATTCTAAAGTGTGTTTTGATCATGTTTO 

TTACTGAATCCAACCATT CCATC TGGAACTTTGTATTCTTCTGTCATTACAGATCTGCTA 
AGAAAACAGAAAGCACCATnTCCTTCAATGAAAGATACTTTGTTTACAAATAAAAGATA 
ATCTATTAATAAAAATGGAAGTCTAATGTCTCCCAATGGCTTAAACTAAGGTTCTAGGCT 
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cagactcatgtcaaaaaatatgatcttattacttgagtagttaaattmgaaatttaaga 
tgacacatcgagtgaagaagggatgatgaatggaacagtctgaaggcattaactgtaaAg 
aagttgaagtggagtttagagaatgtcatcttaacagttactctaattcac CCCTGT . 

Sequence 830 

GNGGTGGCGGCCGAGGTAC I 1 I I I I I I I I I I I I I I I I HI M I I 1 t I ! IGTTTCTNGGNT 
TCTGAGTCCAGTCCCCTTNGCATAAACNCCCGGCGNGGCTNTCTCANCGTCCCCGCACCT 
TCATGCNCAGNAAATTCTTGGACAACtTTGTCTTTCTCT 
Sequence 831 

AGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACGGGAAGGCG 

AAGAAMGAATAGAGMGATAGGGAMTTAGAAGATAAAAACATACTTTTAGMGAAAAA 

AGATAAATTTAAACCTGAAMGTAGGMGCAGAAGAAAAAAGACAAGCTAGGAAACAAAA 

AGCTMGGGCAAAATGTACTGTTTGAGGTTTCAGGAGTTCACTGGGGGCCTTGGAATGTA 

TCTCCCGAAGATAAGGGGGAACTACTGTAAGCAAAATCGAAAGCTATACAACAATCAGAA 

ATGGGAAGAGAGCAAATCCAAGTATAAGTGAAGACATTCCTCTGAATTCCCAGCATTCAT 

CGAGCATCAAAMCGACCMTCTTTTAGTCC7TTTCTTGTACCTCGGCCGCTCTAGAACT 

AG 

Sequence 832 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGACATTCCAAATGGC 

TGCAAACTAATCAGGAATCGATCGGATTGTAAGGACATTGATTGGACGACATATACCTGT 

GTGCTAGGATTTCAAGTATTTGGTGTCTGGCCAGAAAGGATCTGATGGGACAGATATCAA 

TGCACTGGTGCGATCCCACAATAGAAAGGTGATAGCTGTTGCCGATGACTTTTGTAAAGT 

CCATCTGTTTCAGTATCCCTGCTCCAAAGCAAAGGCTCCCAGTCACAAGTACCTGCCCGG 

GCGGCCGCTCGACTGGCG I I I ( ICCATAGGC 

Sequence 833 

CCGCGGTGGCGGCCGAGGTAC I I I 1 1 I I I I I I M I I I \ I I M I I I I I I I ICCTAGCCTTC 
GAGGCCCACCTTCTNTNTGAAGTCTTCCCTGGCCACCCTGGGTCACACTGTGCTGCCTAG 
GAACTCTACAGCAGNGAATGCCTGCATGATTNATTTGGNACCTGCCCG 
Sequence 834 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACGCGGGACAAAATA AGAA GGC 

TTCCTGAAAGCACTGCTGCTTGGCATACTTCTTGTAGTAACCCTGTCACCGTCTGCTTTT 

TGAAGCTAAACTAGATCCAAGAAATTGACATAAATAATGAGCAAGAAAATGATTATAAGA 

GGATATTGCCTCATTTTCCACATATATTCCTTATTCTGAAGACTACTTTGAGGTCAACAT 

TCCAACAGACCTACGAGCAAAACATTCTGGGGAAATAAGTGAGAGAAAGGAAATTGAAGA 

ACTATCAGAAGCTTCAAGAAACACCATACCACTAGCAGTGGTGCTTCCCACTGAAATTCC 

ATGTGAGAATCCTGGTGAAATATTCATAATTTTGAGAGATGAAGTAATTGGTGATACTGT 

AGAGGTTGAATTTACATCAAGTAATAAGCGCATTANAACACGGCCAGCCCGTTGGAATAA 

GAAAGTCTGGTGCATGAAAGCTTTAGAGTTTNCTGCTGGTTCAAGTCCATGTCAATGTCT 

ACTGTGATGGAATCGTTAAAAGCTACAACCAAAATTAAGTACCTGCC 

Sequence 835 

GAGGTACAGGTGAATGCCANNCTAGTCATCTCCCATGAAGAANAATTAATNCCTTTGCAC 
AAATCCATTCCTTCTGATCCAGAAGAMGAMGCAGTGAGTGATTTATTTTCTCCATTGT 
AAG A I ] 1 I I GTTATTC AATAATTGC AATG AATGTG CT GAAT TTTTATGGTGTTATTGGCT 
AGTTTTGAAAAAAAAAAGGAAGAAATAAAGCATATTATTTTGAAGTTCTAATGACTCACG 
I I I I I CAAAATTAA I I i I I CAGTTAAGGCAATTACAAACTTATCATTAGTTATTAAC 
Sequence 836 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCAAGGCCCTAATCTCAATACCAT 
CACATTGGTGGTTAGGGCTGTAACATTGGGATATGGGGCAAGACACAAACATTCAATCCA 
MGCAAGCATGTCCACACTGTTGGGACTCCAATCCCACTTTTGCATTGAGTCATTATTTA 
AC C ACTGTAC CT 

Sequence 837 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAATTTTAATTTACAACTGTTG 
GAAATAAAAATCACTTAATTT7TTTCC AGTG CTTCTC C CTC ATCTGGTTATTC AAG AAAA 
CAAAACAGTCTCTGAGATATTTAGCTTTTCAAACTGTAAATAGATGCTCTAGTGTTATTT 
A M I I"! I AATCCCACTTGTATTATTTTACCTCTA GAGC ATCTTGTATTAGGACATGTTAT 
ATTTATGCCAGTGGGAAATMGTTATGGCCAAGTTTTGCAAAAACAGGAAGCA GTGA GAT 
ACTTG I I I I I I I CTCCTC ACTAAATATCAG TAATTGTC AGG AATGGTATTAC CTATTTTC 
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ATTTCCTCTTTTCAGCTTTAAGCTTTGNTGGATTGGGACACTAAAACTGATGTATACCTG 
AG GG AAAAA ATAGAATGTG CTCATGGTTAGGG AAAATTATATTA 1 U 1 IA A I1 1 1 1 I AAT 
TTGTTTTTTAACCAAGTAGGAATTTGGGTGTATGGATAAGANGCCAATCTGCTTCTGTAG 
GCTATAGAAGAAAACCAGTTGTATTTTATGG NC ATAAT 
Sequence 838 

CGCCCGGCAGGTACTGGGGGTAAGCTGAGAACTATCACGTTCGTTTCTTCCTCGTCCTCC 
CCCTTNTCTGCCTTCTCCTTGAGAACGTCTTTGAAGTTGGGTCCACTGTTTAGGGTTGAC 
TGCCTGTACCTCGGCCGCTCTAGAACTAGTGN 
Sequence 839 

GNACCCAGCGGGGGGGCCAT7TAGAGAGGGGNAAGGGCGCGCTAGNCGAACANGGCANAG 

CNGACCNGGNAAAGGNAGCNGGNCACAAANCCACACAACAAACGAGCCGGGAGCANAAAG 

NGGAAAGCCNGGGGAGCCAAANGAGNGAGCGAACNNACAANAANNGCGNNGCGCACACNG 

CCCGCGAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnCGGANNGCGGAANGGGGCGGCACAAANCCGCANACACGCGNAACAGANNNGC 

NGGAGCACGGACGNAACGGCAGNGGCGAGCGGAACAGCNAACTCAAAAGGCGGGAAAAAC 

GGGNGAGCCACAGAAANNAGGGGGAAAAACGGCAGGAAAAGAAAAANGGGGAAGCAAAAN 

GGCCAGCAAAAAGGCCAAGGAAACCGAAAAA 

Sequence 840 

TNGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGGGGGCCAGTTATTATACTGC 

Tnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCCAGTGGCCCTGGTGGATGCACANG 

ATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGTCGAGCGGCCGCCCGGGCAGGTACCTGA 

CTGTGATTTCAAACCCCAG GTGGTTACTGGAGCCCATACCTAG GAAAG GAGGC AAGGATG 

TATTCCAGGTAGACATCCCAGAGCACCTGATCCCTTTGGGGCATGAAGTGTGACAAGTGT 

GGGCTCCTGAAAGGMTGTTCCANAAAAACCAGCTAAATCATGACACCTTCAATTTGCCA 

TCATGACGCAGACCTGTATACATTAGGTTAAA 

Sequence 841 

AGGTCAACTTTAAACTGTATTGTATTCATGTTGCTAAACAATATTGGCCTTCTCGATGAT 

TTTATTCATGTTGCTCCAAAGTTAAMCCCTGTAGAACTAAGTAGGTGMGAGATATTTT 

GTATAAGTGCCCAGAAGAGAAAATATAATAAATAAATAGTGAATTGAGCATCACTAGAAT 

AAAATAAAATG AGTA GGCATTCCTAAGATGTGAAATGATCACCTAAGATATACATGCTCC 

AACCATATTGATTTTAGAACAAAACACAGCAGCCCATAACAGTTTGTGGCCTCTACTAAC 

TGTCCTCTGCTGTCCCATCTAGAGGTTATGTTTCTCTAl I I I I AAAATAAAATGTAGTTA 

AATTAGCCTGACGGATGTTTCCTCTCTATCCCTTATCTCAGACACATACAAAACAGACAG 

ACAATAACAAAAAGACATCATCTTTATCTGTACCTGCCCG 

Sequence 842 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACCTGCCCG 

Sequence 843 

CCGCGGTGGCGCGGCCGCCCGGGCAGGTCAGCAGCTTTGCATGTTGACAAATCCACTCTC 
TGCTGCAGATGCCTAGGGGAAGTTGCAGACTTAAATTTTCTTTTGTAAAATGGGGGAACA 
CAAACAGATCTTATGTCACACTGGTTACTCCAAAAAAAAAAAAAAAAAAAAAAAAGTACC 
T 

Sequence 844 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGGAGAGGACCATGTGAAGAGAGAA 

GCTGAGACTGAAAAGGATTTATGTATTAATATTGACAGAAGCCAAGGAACACCATCTGAA 

GTTCTGACGGCAACATCAGMGCTAAGAGAAAGGCATGGAAAAGATTCTNACCTAGAGCA 

TCCAGAGGAGAGGTTGGTCCTGCAGACACCTTGTTTTCTGACCTCTGACCTCCGCAACTG 

TG AGGG AAGAMTTTNTGTTGCTTAAAG AC AC AC AGCTTGTG GT ACCTnn nnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGATNTCAAAGCTTATTNGATACCCGTCC 

ACCTCNAAGGGGGGGC 

Sequence 845 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTCTACGAAGTCCCATGGCTTTTCCAG 
TTC TCTC TTGCCTTGAGGCTCAAGAACAGTTTGTTTTAACTGAGAGTCTGGGAGGAACTA 
CAGTTTTCCCCAGAGTAGTCTTGGCGGACTACTGGAAGTCACAGCCAAAGAAACTCTGTG 
ATTACTGCAAGTGCTGGATAGCAGACAATAGGCCTAGTGTTGAATTTCATGAAAGAGGAA 
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AGAATCATAAGGAAAATGTGGCAAAAAGGATCAGTGANATTAAACAAAAAAAGCNTGGAT 

AAGGCAAAGGAANAANAAAAGGCATCAAAAGGAGTTTGNTTNCAATGGAGGGCANTTTGC 

CNTNGAAAGNCTTTCCNAGNNGGNTTTTTTNAAAAANACNTTNGNTT TAAAAAAT TNAAA 

AANTTTTTNGGGNCCCCAGCCTTTTNANCCCCTTGTAANCCCNNGTC1 I I I I I ICCCCCN 

CNTNCCCTTGGNATTTTTNNAANTTCCCCNNGGGGGG 

Sequence 846 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATTTCCTAAGTATCACA 

TGACTTAACAGGTGCAAGCTGGCAGGCATCGGACTTGGTATCCGTAAGACCTGTTGCCCC 

TTTGCCTTCTGCCATGATTGTAAGTTTCCCGAGGCCTCCACAGAAGCTGAACAGATGCCA 

GTATCATGCTTCCTGTATAGCCTGTCGCCTTTGGGCAGCTGGACACCTCACCCACGCTTC 

AGGCTGGAGCCTTCCTCATCTTTATTAGGTGTCTTAGAGGCAAGTGCCGGAGGTA ACATC 

CTCCTCTGCATTTCCTACACTGACAAGGAGAATGCTGTGGCTCTACGCAGTCATCCTTTT 

ACMCTTTTTTGCCAGGGAGCAAAAAGTCTTGNCACATNAAGTCGAATGAAGTACCTTGC 

CCCGGGGCGGGGCCGCTTNTANAAACTAAGTGGGGATTCCCCCCGGGGCTTGGCAGGGAA 

ATTTTCGAATTNTTNAAAAGCNTTTAATTNGATNACCCGGTTCNAACCTTCTAAGGGGGG 

GGGGGGC 

Sequence 847 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTGTGTAGCAAAC 

AGCCCTAAACCMGTGACTTAAMCAACCATTTTCTTTGTTACAGTTTTGCT 

TTGGGCTCAGTTGGGCGGGTTTTGCTGGTGATCACTCTMTGGCTGCTTTTCTCTCCAAG 

TAGCCTTTCCAGCAGAGTTGCTGGCCTTACCTGGTAGCTGATGGCTTAGGGCTCACAAGA 

GTACCT 

Sequence 848 

CCGCGGTGGCGGCCGAGGTACGCGGGGTCGGCAACTTTGGGAACCACCAGTAGGATGTGG 

TTMGATTCAGTTCTTGCTGAGCTAAGGAAGCATTT CTCA CTTCTTTTrAATTGTCTGGC 

TCACTTCTAGTCCCTAACTAAATGCTCACTCAAGAGTTTTAGCTTGAATGTCAAATGTCA 

AAAAATTAATTG G G TG ATC TTTCTC C ATTTC TAGG AT AAG AAG AAAG AG AAG A AATG A AG 

TGACCATCCAGCCTTTCCCAATTAGACTTCCTCTCCTTCCACCCCTCAT TTCCT TTTTGC 

ACACATTACAGGTGGTGTGTTCTGTGATAATGAAAAGCATCAGAAAAGCTTTTGTACCTG 

CCCG 

Sequence 849 

CGACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTGTTTTAAGC 

TTAGTTCAGTCTCAAGTGTTTGCAGCCACATCTGAAGACCAATAAAGCAACTGCTGGTTT 

ATCCTTTGGGAGCTGACAGAATTTCTTCTCCCAAATACATACACAGTAAAATCATAAGCC 

TGGMTGAAGAAAAAAAAATCTTACGGGCAATGCAATGGCTGCAAAACTATAAGGATTAG 

AAATGTGAACCCACATTTTAATCCAAATTAGGGCAATTTAGAGGTGGTAGCCGTAAAGAA 

TAGCTTGCTGTAATATACGCCATGCCTGGATACANAAATTGGGCTTTTGGCCCTTGTCAA 

AAAATTAAAATTGTGCCTTTCNTGTATTTGATGGGTGGGGCATGT 

Sequence 850 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTG 

GG ATAGA AGTTATTC AGC AGGC ACAC AACAG AGGC AGTTCC AGATTTC AACTGC TC ATC A 

GATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATTTCCACCTTGGTC 

CCTnnnnr.nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn nnnn nnnnnnnnn 

nnfinnnnCGCGCTTGGCCGTAATCATGGGTCATAAGCTGTTTCCTGTGTGAAAATTGTTT 

ATCCGC7CACAATTCCCACACCAACATACCGAAGCCCGGGGNAGC 

Sequenc/i 851 

CTACtATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCAA 

AGTTGOTGCCAACACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn . 

nnnnnnnnnnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAGA 

GTAGCAGGAGGAAGAGAAGCTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACTGA 

GCAGTTCCTCCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCCTGCA 

GGAnnnnnnnnnnnnnnnnnnnnnnnnnnCGTTCGACCTCGAAGGGGGGGGGCCCCGGTA 

CCCCAAGCTTTTGTTCCCTTTAGTGGAGGGGT 

Sequence 852 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGCAAAGT 
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Sequence 853 
Sequence 854 

^GCTCCAAAGTTAAMCCCTGTAGMCTMGTAGGTGMGAGATAT^ 
Sequence 855 

n„„J CTeCTnnnnn '- nnnnnnnnnnnnnnnnnn nnnnnnnnnnnnnnnnnnnnnnnnnnnn 
T^I?^ n ^ nnnnnnnnnnnnnnnnnnnnnnnnnnnn n"n"rACCAGTGGCCCTGGTGGA 

Lnnnnn A AGATGAGGAGCCTGGGAGCCTGGCCAGG ^ CT ^ 

^aaI^t^S CTCGAGnnnnnnnnnn nnnnnnnnnnnnnnTGTTCCCTTTAGTGAAGGGG 
JA^GGCGCCGCTOGGCGGTMTCAATGGTCATTAGCCTGTriT 

^ GA IE]^ k T GGAAGGAGGCA ^ 
£J A ^^M GGGG r G T ^^^ 

NAANTATTTGTTGGGGGGGGTTTMNAAAAAAAAAAAAAAMTATT^ 
j^TTWCCCCCCCCCCGGGGGG 

Sequence 857 




' 1 1 " 11 1 1 'NATAA NNNCCCCCGCCCCCCAAAAAAANTTTTrTTKI 

CCCCCC-l I I I 1 1 I 1 1 1 1 I NNCCCCCCCCCCCCGGGGGGGGNGTTAAAAAAAAA* ACTrrr 
CCCC GGGGGGGNGGGGGGG GGGGGTNGCCCCCN I Tl I Ti n ' nTf^ ^^^ 

NNNNNNTANACCCCCCC I 1 1 I I I I I I I I I | | NCCCCCT TI I I I I I I I 1 1 1 1 TA AAXAAAA 
TTTAGTCGCTTTCAGGGTTCTATGAGCACCCTTTCACAGACCCGTTC^cS^GGGTAAG 
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TTGTNACACCCAAAAAATTTTTTTNTTTTTTTGGGGGG GGANNCC CCNATAAAAAAGNGG 
NNNTGGTNCCCCCNCNCCCCTTTTTTTGNGGGGGGG I I i I I I I TNAAAANAAAAAAATNA 
AAAAAAA? I 1 1 I I M G GGG G TNTTTTTAAAANTTNAAAAATTNNNANCATTGNAAAAAAT 
! I M i 1 t I I AAAAAAAATTTT 
Sequence 859 

CCGCGGTGGCGGCCGAGGTACCAATAAGCATATTGCTTTGGCAATGCATNTCCAGAGCAG 
GTGACCCTGGCCGNCTGTCCTGGGGACACTGACACCNTAGGGTGGCTGTGTCAGTTCATA 
GGGAGNGTACTCACNAGCCCTGTTGGCACTANTAGNTGNAANGTATTGGGTTGG GGNTT A 
GNGTTTTTTGNGGGTTTTAAAAAAAAGGGGTNTTNA 

I 1 I 1 I 1 M 1 I ANAAAAMGAGGGGGGNGNAAAAAAAATTTTTTTNNGGGGGGGGGGGGGG 
GGNTTTAAAAAAAAN 1 11 1 I M 11 I I I NCCCCNCCCCCCCC CNAAAAAAAA GGNGGGNGG 
GGGGGGNGTTNCCCCCCCTTGCCCCCCCCNAAAAAAAAA 1 I I 1 I 1 I M 1 GGGGGGNGGGG 
GGGNCNCN 1 1(111 1 GNGGGGGNTTTTTNANAAAAANGGGGGGNGNNAAAAAAAAAACCC 
CCCCCCCAAAAAAAAAAAAANCCCCCCCNGGGNNCCCCCCCCNTNNCCCNCNNCATNAAA 
AAAAAAANGGGGGTTTTTTTTTTNAAAAAAAANAAGGGGGGGGG 
Sequence 860 

TCGCACTCATTTACCCGGAGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGACC 
CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCCTGCTCTTGTTCCACC 
GGTGNAGCCTTGGCTGTAGTAAAAAAGGGTGGGGNTTTTTNAAAAAAA AAAAA GGGGGGN 
NAAAAAAAAAAAAAMGGGGGTTTGGGGGGNTTTNAAAAAAAAGGGGGTTTTCCCC 
TCNCCCCCCCCCGGGGGGN 11 1)1111111 1 1 NCCCCCCCGGGGTTNGGGGGGGGGTTTN 
ANAAAAAAANGGGGGGGTN 1 I 1 11 1 1 I I I r NCCCCCNCCCCCCCCCCAAAAAAAA NAGG N 
GGNGGTTTTNCCCCCNNTAAAAAAAANTCCCCCC CCCGG GGGNTNTGGGGGGGTTTTNTN 
TTTTTTTAAAAAAAAAANNNNNTTNNGGGGGGGG 

ATTTTTTTTTTTn I NNGGGGGNTTNNNTGGGNNNNAAAA TTTTn 11 II 1 I 1 I 1 IC CCC 
CCCCCCC n MINI 1 NNN 1 1 1 1 1 1 1 1 GGGGGGGGGGGGG \ rTTTTTTTTTTTTl (AAAA 

Sequence 861 

CCGGGCAGGTACGCGGGGGAACTGCTCAGTTAGGACCCANACGGAACCATGGAAGCCCCA 
GCGCAGCTTCTCTTCCTCCCGCTACTTCTGGGCTCCCAGAATACTCACCTGGNAGTAAAT 
ANGTGGATGTACCTCAANTTCTTCNCAAGNNCCCC NTACCC CCCNA AAAAATGNC CCCNG 
NNCCCCCTGTGCCCCCANTTTTTTGGGGNGNA I I I I I I I I INC CCCCl mill I i GGGG 
GGGGN niUUI GGGGGGGGTNTNTTTTTTTA TTCCC CCCCC I n u i n u i u NCTT 
AGAACCCCCCCAAAAAAANAANNNATN I I I 111 I I I I I GGGGNNTTTATNGGNTNTTTTN 
CNAAAAATTTTAAAAAATNTTTTATAGGGGGGGGNGNGTGNNAAAAAAAAAGGGGGNGGG 

GGTCCCCCCCTTNCCCCCNAAAAAAAAA 
Sequence 862 

GCGAGGTACGCGGGCATCCAGCAGAGAATGGAAAGTCAAATNACCTGAATTGCTATGTGT 
CTGGGTTTCATCCATCCNACATTGAAGNTGACTTACTGAAGTAATGGGAGTAGCA GGAAG 
T^GJAAACAAGGTGGGNAGACATATGCMGCATCTTGNGTTTTTNNNGNCCCCCNTTTTT 
TTTTrTTAAAAATTTTTTTNGGCCCCCC^ 

AANACAAAAAAAAGGGGGTTTTNGGGGGGGGNTTNAAAAAAAACCCC CCCCCC I l) m M 
TTTTGGGGGGGGNGGGGGGGGGN I I I I I I N Ml I I IN CCCCCCCC i mini iGTTTN 
TNNNNTTTTTTANGNCCCCCCCn I Ml MTG AAAAAAAAA t II I 111 I I' GGGNCCCCCC 
CCTTTTTTTNAAAAACCCCCC C I M Ml M I ', I M l I NGGGGGGGG CCC l liliiimi 
AAAAAAAAAAAAAACCCCCNTTTTTTNCCCCCCN 1 M I I I I II M 1 NGGGGGGGGGGGTN 
CNCCCCCCATNGTGGGGNCCCCCCCGNGGCCCCCCCCCCNGGNAAAAAAATGGGGGGNTT 
TTTTTGTAAAAAAANGGCCCCCCCCCCCCGGGANANNANTGNGGGGGGGGNGTATTNATT 
TTNCTNTTTAACCCCCCCCCCNGGGGGGGG 

Sequence 863 _ 
CCGGGCAGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGC 

AGTTCCAGATTTCAACTGCTCATCAGATGGCCGGGAAGTATGAAGACAGNATNGGGTGCA 

GTCTCACCAGATTTCCGTTTTGNATTCCTTTTTTC 

GGNTNTNNCCCCCNA 1 111111111111111 I GGGGGGNCNNNGGNNNNAA \ rTTTTTTl 
TATAACCCCCTTTNCCCCCAANNCCCTTNTATCCCCCl MUM fNNGGGGN NGGGG GGG 
GGGGANTTCCCCCCCCNNCCCCCCCCNAAAAAGNTAAAAAATTTGAAAAANATTTrTAAA 
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AAANGGGGGGGGNT I M I Mi 1 1 1 NGGGGGGGA I Ml M 1 I I I ATNANAAAAAAANCCCC 
CCCCCNCNCCCCCCCNll II I U 11 1 1 ACCCACCGGGNGNAANANGGGGNNTTNNNTTTT 
CCCCNCCTNNN NANCCCCC CCNGGGGGGNGGNGGTAGTNCCCCCNNAA I I I I 1 1 III i I T 
TTGACCCCCCCN ITTTTTTTTTTTTI I AAAAAAAAAAAGGNGNGGGNGNTGGAAAAAAAA 
ANGTNNNAAANACTTTNCCCCCCCCCNC I MINIM I NTTANAAAAAAAAAAAG 
Sequence 864 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGG 
CTCCTCATCTATGGNGCATCCACCAGGGCCACTGGTATCCCAGCCAGGNTCAGTGGCATG 
TTGGGGTCTTGGGGACAGTAGGTTTCACCTCTTCACCGCATCAAGNCAAGGCCTNTTGGG 
GGTTCCCCCCNAAAAAAAAAGGGGGN I M M M M M I CCCCCCCCTTTTTTTGGGGGGG 
GGANAAAAA^ 

CCCCi Mil l IGNG GGGGGGGNTTCCCCCCAAAAAAAAAANNGGGGCCCCGNTATNNGAN 
TNTTNT TTA \ I I I I i AATAAAAAAAAAANCCCCCCCCC I fTTTTTn 1 AAAAAAAANCCC 
CCCCCCi I m 1 i I IGGGGGGGGGGNTTNTTTTTTTATNTCCCCCTNAAAAAAANNNGGGG 
GGNN GCCCCCCCNCC NNNAAAAAAAAANGNGNGAGTGNATCTTATTTTATNTCNAANAAA 
AATNN I I I I I l l l l l l > NTANAATN AAANAAAANANAAA I M i M I M I I I I ATAATAAN 
ANAAGAAATAAATTTTGCTCCCCCCCCCN I M M M 
Sequence 865 

AGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGG 

GCCACTGGTATCCCAGCCAGGTTCAGTTGGCAGTGGGTCTTGGGGACAGNAGGTTCACTT 

CTGCACTCATCAGGCAGNCCTGNCAAAAGGGGNN 1 H I I 1 1 I 1 CCCCCCCTTTTTTTGGG 

G GGGNTNNAA AAAAAAAAAAGGGGGGGNGGNAAAAAMNTTTTT^ 

Ci u i.i u n lGGG GGGGNNCCCCCCCNAAAAAAAAAAGGGGGGGGNNNTTTTTTNNNNT 

* n n i n n n n i aaaaaaaa ancccccccc m i m m i i i i naaaaaaaaacccccc 
ccni rmrm ingggggggg i i m t m m m i icccccccaaaaaaaaggggggnnggn 

NCCCC CCCCCAAAAAAAAAAGGGGGGGGN rTTTTTTTn i A AAAAAAAAAAA I 1111(11 

mm iaaaaaaaggnnaaaaaannggi n n 1 1 n intngnaaaaaaaannggnaaaaaa 

ATNGGTNCCCCCCCCCCCCC I I 1 I I I 1 1 I I I I i 
Sequence 866 

TCGAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATNCACC 
AGGGCCACTGGTATCCCAGCCAGGTTCAGTGGCAGTGGGTCNGGGACAGTAGTTTNACTC 
TGCACTCATCANTCAANGCTTTGTCNNATGGNANTNTTTTATTATNCCCCCC I I M M M 
GGGGNGGT NNAAAAANTT AAAAAGGGGGGGGGTGAAAAAAAAATTTTTNTG M I M I I 1 I 
TCCCCC CCCU Mil l ITTNGGGGGGGGTTCCCCCCCTCANAAAANAAGGNNGGAGNTNN 
Mill I NATATA TTTTNTCNNCT TTAAAAAAAAATATCCCCCC I I I I I M M I AAAAAAN 
AAAATCCCCNCI I I M I I II I I GGGGGGANNAAATTTTNNTNANTNCCCNCCAANAAAAT 
AANNGGGGGGGGGNCNTCCCCCCCNTAAANAAAAAAANANNNTNNTTG I I M M II I AAA 
AAANAAAATA I M M M I I I I 1 CTCC 
Sequence 867 

CCGGGCAGGTACACTTTTGGTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCA 
CCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATNCTGGGAACATGTCCTC 
NT GTNTGTGTTGC NCTGNCTGNTAAAAANAAAANTTTTTTTAAAAANNAAAAATNCCCCC 
CAiiMil NTTTTNIGGACCCCCTNTTTTTTTAAAAAAA I I I I II II I M I M CCCCCG 
GGGCCCCCTTTNCCCCCGGNAAAAMGGGGNGNNTTTNAAAAAAAAMNGGNGTTTTT 
AAAA AAGG GGGNNTTNGGGGGGGNGTTTTCCCCCCTTNTCCCCCCCCCAAAAAAAGGNAA 
AAAA TTTTAAAAAAG GGGGGGGNCCII I I I I I 1 I AAAAAAAAAN ACCCCCCCCTGATCCC 
CCAN I" I II I M I 11 ICCNCCTCGGAGAAAMNGGGNGGNNNTNTTTTTNTCCTTTTTTNC 
CCCCCCCGG GGGGGNGGNTTNNNCCCCCCCCTTTN I I I M 1 1 II 1 Y r CCNCCCCCCNTTT 

nrnrm iaaaaaaaaaaaaggggggggggggggngggaaaaaaangnntnaaaantt 

TGGGCCCCCCCCCCNC M M II I I M I 1 NAAAAAAAAAA 
Sequence 868 

GGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGTCCCCTGG 
CCAAAAGTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCTGGCATGGAATTT 
CNNATATNCAAAGGCTTTATTCGAATAACNCG I M I M 11 I ANCCNTGGGGGTTTTAAAA 
AAAAANCCCCCNGNNN CCCC CCCN I II 11 1 M M IATNCCCCGGGGGNGTTTAAAAAAAA 
AAAANNATATGGGGGNTTTTGGGGTGNANTTNGGGGGTTAANTGGGGNTTTGGGNGTNTT 
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TGGGNGI Mil 1CCCNCCTI I 1 1 NNCCCCCNTTTCNCCCCCGGGGGGGGGGGGGGGGGNT 
I I I I II I I II I ' TATAAAAAATATCTCCCCNNNTGTCCCCCTCTNGTCTCCTNAAAAAAAA 
NAGGGGGGTNGTNNNCCCCCCCN i I II M NTTTATTTATNGAATTTATNGGATATTANNG 
GGGGGGGGGGHGGGGGNAG I MM I ATATAATA I Mil I ANTANACCCTNTNTNTTTANT 
CCCCCCNATATACCCNCCGAAAAl MUM INTN I Mill I ANGNGGNGNTTTGTTCTAN 
AAAAAAAAAAAAGNGGGGGNNTG I M I II II M 
Sequence 869 

AGGTACGCGGGATGGCACATGCAGCGCAAGTAGGTCTACAAGACGCTACTTCCCCTATCA 
TAGAAGAGCTTATCACCTTTCATGATCACGCCCTCATAATCATTTGNCCTGTATGCTTGG 
NTTCCTTAGNGCCCTGTTATNGCTCCNGCCCCCG M M I M NTTTTTAATTTTTTTANTT 
TTTNNCCCCC CTNCC CCCC I I M M I I I A AAAAAAAAAAAAANNACCCCCCNNAAAAAAA 
NCCCCCCCC M I M M I M I CCCCCCCAAAAAAAAAACCCCCCCCNTAAAAAAAAAAAAA 
AAAAN GNCCCCCCC CNTAAAAAAGGNAAAAAATTGNNAAAAAA ATTNAATAANN TACCCC 
CCCCCI M M M M I AAAAAA AAAAAAAAAAAANNCCCCCCCCC I I M M I ITTTAA AAA 
AAAAAAAAAAAAAATTTT7TTNTTAAAAAAAAANCCCCCCCCCN I M M M ITTTTTAAA 
AAAAAGGGNA AAAAAAGG GNNCCCCCCCCCCCCAAAAAAAAAAAA I M M I I I II M M I 
CCCCCCCCCCI M M M I M M M I CCCCCCCCAAAAAAAAAAAGANGGGGNGGNTNGGN 
ANAAAAAAAAAAACCCCCCCCCCCGGNGGGGGGGGGNGGGGCNCCCCCCCCCTTTTTT 
Sequence 870 

GAGCTCNCCGCGGTGGCGGCCGAGGTACACTTTTGGCCAGNGGACCAAGCTGGAGATCAA 
ACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAAGCAGTTGAAAT 
CTTGGAACTGCCCTCTGTTGTGTGCTCTGCTGAATAACTGTCTATTTNNCCCTNCCCCCC 
CCCCCNAAAAGGGGNNGNAAAAANGGGNNAAAAAGGGNGGGGGNNTCCCCCCNCCCCCAA 
AAAAAAAAAAAAAAAAAAGGGGGGG I I M I M M I AAAAATTTCCCCNCNCCCCCTTTTT 
TTTNGGGGGGGNGCCCNAAAAANCCCCCCCCCCCNNGGGGGGGGGGGGGTTTANAAAAAA 
AAACCCCCCCCCCCCCC 1 1 M M II IGGGNNGNANMCCCNTNNNCCCCCCCCCCNCCTN 
NN I M M tG GGGGGGTTTNNNTANTTNNNCCCCCCCCGGNGNNG GGGGG GGGNNGNAAAA 
NCCCCNCNCCCCCCCCCCGGGGGGGGG I I I I I CCCCCC I I M 1! M M ICCCCCCCCCTT 
M M I I I M MA 
Sequence 871 

TACTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGACTTTTGGCCAGGGGACCAAGCTGGA 
GATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGNT 
GAAATCTGGAACTAGCCCTCTGNNGNGNGCCCTGCTGAATAACCTTCTATCCCAGNAGAG 
GGGGGGGGNGCCCCCCCCCCCCCNAATTTNAAATTAAAAGGGGGGGGTTTNNNNAAAATC 
CCCCCNCCCCCCI M I M M M INCTTGGGGGGGGGGGGGGGCCCCCNANNCCCCCCGGG 
GGGGGGNCCCCC \ MM INTTCCCCCNI Mill I NNAAA AAAAGGGGG GGGGAAAAAAAA 
AAAAAAANCCCCCCC I M I II M M NAAAA AAAA NGGGGG I I M M II M N GGGGGGG GG 
GGGGGGGGGGGAAAAAAAN I M II M M I M M INCCC NCN CCCCC CCCN M M I M NAT 
ANATANCCCTTTNCCCNCCNAAAMNAMNGAGGGGATTTTCNTTNNTTTTTNCCCCCCC 
CC Ilim i l GGGGGGGGGGNGGCCCCCCCCCNCNAAAAAAAAAANGGGTGGGGGGGGGG 
G 

Sequence 872 

CCGGGCAGGTACCTGGATCTTCCAAGCACAGCCACTTNTGTGAACATCCCTG/^CCTGCTT 

CCTGGCCGAAAATACATTGTAAATGTCTATCACNATATCTGAGGATGNGGGG^GGCANNA 

GTTTNGATTCNCTGTTCATAACTTTACCAAAGGAAATNGAAAAANNCCCCCTT/vAAAAAA 

AAAAAAAAANCCCCCCCCNAAAAAAANGGGTNGTNTTTNCCCCCGGGGNNTMNNNTCCCC 

CCCAANCNCCCCCN M HUM IGNGGGGGGNNAAAAAAAAANTTTTTTTNGG GGGG GGG 

GGTGTGCCCCCCCNTTCCTCCCCNTCNCCCANNTAAACTAANAAl M M M INTTTTNNA 

AATAAATCCCCTCN I M M I M I GNGGGGGGGGGAAAAAAAAAAA ATCCCC GTGTTTNCC 

CNNNNNCCCNGGGGNGGGGGANMNAAAAAAAAAACCCCCCCTT7TTTTTTNGGGGGGGG 

NTTTTTTT 

Sequence 873 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCTG3TCAGTTATGGACCCAGA 
GGGAACCATGGAAACCCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATAC 
CACCCGGAGAAAGTATGTGTTGATCGCANTCTTTCAGGG CCNAAAAAGCCCAN ACCCCCG 
NTCCCCCI MM I NGGGGGGATA I I I I INNCCCCCCNM M M II M M M I M INGGGG 
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GGGGGTTTTNTNTCCCCNNTTTANTNCCCCCCCCCCCCCAAAAANNNAAAAAGGGGCCCG 
GGGGGGGGGGGTCNANAAAAAAAAAAAAATAAAAAATNGGGGGGGGGNANAAAANGNGGG 
GTNTCCCCCC CCCCCCCC CNAAAAAAANNCCNGGTNCCCCNNNNCCCCCC 1 1 rTTTTTTi 
TTNC CCCCC Ci I I I II II ICTTTANCNTNCGCCCCCCN i lllllll \ GTGGGGGGGGGGG 
GGTGI I N ICTTTNNNCAAAATAAATGNAGGNGGGGGGGGGNTTTA 
Sequence 874 

TACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGTCAGGTTGTTCATT 

TGAGCCAACMCAGATTTCTTGGTTATTGTGCTATTGCCACACCTGGGTTGGTGGTTTTA 

TAGCCATT TCCATC ATCAGTTTCCTGTCTCTGCTGGGGG7TATCTTAGTGCCTCTCACGA 

ATCGGGTGTTTTTCAMTTTCTCCTGAGTTTCCTTGTGGCACTGGCCGTTGGGACTTTGA 

GTGGTTCGAGCGGCCGCCCGGGCAGGTACTCATCACTCTGTCCATACGCGATCACAATAT 

CCTCTAGTTCT TCCAT CACAGTCTGCGCACATTTTGGTCATCAGCTGGGAGAGCACGGCT 

GTC ATTTGGG GTTTTTG CAAAAGTTG TGCTTCTC AAGC AAAC C G ATG GG 

Sequence 875 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACCTGCCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnTAGTGGAGGGGTTAAATTGGCGCCGCTTTGGCCGTAAATCATGGGTCATAAGC 

CTGTTTCCTGTGTGAAAAATTGTTATTCCGC 

Sequence 876 

CCGCGGTGGCGGCCGCCCGGNCAGGTACATCAAATCGACTATGCCGAGTTGTGCAGCGTA 

NCAAGGAAAAAGGGAAAATCAGCTCCCTCGTGAAAGATGCTTCTGTTCCTCTGATTGATG 

TTACAAACCTCCCTACTCCTCGAAAATTCCTTGATACCTCTCACTATTNTACTGCTGGAA 

GCTCAAGTGTGAGGGAGATAAATCTGCNGGACATNAAGGAAGATTTAGAATTGGATCCAN 

AGGAAAACAGCACCCTGTTTATGGGTATCCTCATTAAGGGC 

Sequence 877 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCGCTTCTTGGATG 

TGTTTGTCTCTCTTGGCTGCACTGGCCTGCTCTGCTGGAGACACATGGGCTTCAGAAGTT 

GGCCCAGTTCTGAGTAAAAGTTCTCCAAGACTGATAACAACCTGGGAGAAAGTTCCAGTT 

GGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnn nnnnn nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnGAGGGTTTMTTGCGCGCTTGGGCCGTAAATCATGGTATTAAGCTGTTTCCTG 

TGTGAAAATTGGTTATCCCGCTCACAAATTCCACACCAACATACGAAGCCGGGGAGCATT 

AAAAAGTGGTAAAAGCCTTGGGGGG 

Sequence 878 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTGCAGTATACAAGGA 

CTAACAGTTAATATTGACCCAATCTTATATACGTGGCTCATCTATCAGCCTCAGAAACGA 

ACAAGTAGACATATGCAACAGCAGCCTGTGGTAGCTGTTCCTCTTGTTATGCCAGTTTGT 

AGAAGGAAAGAGGATGAGGTGTCTATTGGAAGTGCCCCCTTGGCAAAGCAACAATCATAT 

CAGGCCTCTGAATATGCCAGCAGCCCTGTAAAAACAAAAACGGTAACAGGTTGAAGAAAG 

TTCTCCTGGATMTATCCTGAAGACTATTTTCCAACCTTGGTTCCATTTTTCCANGGTCA 

CTTTTCA 

Sequence 879 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCAAAG 

GCTGGAAGAAAAT TGAGC C AAGAG CTGGAAGCCAGAAGTTCCCCATGCCTGCCCTACACA 

TCCAGGCTCTGAGTTTTTACTTTTTCTACTATATTGCCATGCAAGTAACTTTAACTTTGC 

TTCTCTGTTCTCTGTCACTGAGAAATACCATGTCTCCATGGCAATAGGAATGCATTTACC 

TTCACCAAACCTGCTCAAATCCTTGCTGAACAGGACGTAGCAAGGAGTTGACTTTCCAAT 

GGTGGGAGGTTCAGAGTACCTGCCCGGGCCGGCCGCTCGAGGGGGGGGCCCCGGGTACCC 

AGCTTTTTGTTCCCTTTTAGTGGAGGGGTTTAATTGCGGCGCTTTGGCGTAAATCATGGG 

TC ATTAAG N CTGTTTTC 

Sequence 880 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGTGAAATATT 
CTTGGTAATATCATTCAGGTGGACTTTGGCATCATAATCAAATAATTTGATTrCTACTAA 
CTGTTG AAGAACTTTGTTAGCC ACTTTG CTTATTC ATTAC ATTTTG GGTGATCTTAAGC G 
CACATTAATTACCCTGTCTGTTTAGCATGTCTTGCTGTTCATCTTGGMTTATTTGTTGT 
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TTAGGGACACTTATCTAGTTTCCAGCAGTCTGCTTTTAACCGGGTGGTGnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTTATCGATACCCGTCGAC 

CTCGAGGGGGGGGCCCCGGTACCCCAGCTTTTTGTTCCCTTTTAGTGGAGGGGTTTAAAT 

TGCCGCCGCTT 

Sequence 881 

GACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATT 

ATACTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGA 

TGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGCTTCTGCTGGTACCTGCCCGGGCG 

GCCGCTCTAGAACTAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCGGTACCCAGCl I l i i GTTCCCTTTTAGTGGAGGGG 

TTAATTGGCGCGCTTTGGGCGTAATCATGGGTCATTAAGNCTGTTTCCCTGTGTGAAAAA 

TTGTTATCCGCT 

Sequence 882 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGACAACAAGA 

GCATTTTAGAGTAATTAATTTAATAAAGTAAATTAGTATTGCTGCAAATAGTTAAATTAT 

ATTTATTTGAATTG ATGGTC AAGAG ATTTTCCA I MINI I ACAGACTGTTCAGTGTTTG 

TCAAGCTTTCTGGCATAAATATGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGTAATCATGGGTCATTAG 

CCTGTTTTCCTGTGTGGAAATTGTTATCCCGCTCACCAATTCCACACAAACATACCGAAG 

CCCGGGGAGCAT 

Sequence 883 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTTAAACTGTATTGTATTC 

ATGTTGCTAAACAATATTGGCCTTCTCGATGATTTTATTCATGTTGCTCCAAAGTTAAAA 

CCCTGTAGAACTAAGTAGGTGAAGAGATATTTTGTATAAGTGCCACAGAAGAGAAAATAT 

AATAAMTTAAATAGTGAATTGAGCATCACTAGAATAAAATAAAATG AGTAG GCATTCCT 

AAGATGTGAMTGATCACCTMGATATACATGCTCCAACCATATTGATTTTAGAACAAAA 

CACAGCAGCCCATAACAGTTTGTGGCCTCTACTAACTGTCCTCTGCTGTCC CATC TAGAG 

GTTATGTTTCTCTATTTTTAAAATAAAATGTAGTT^ 

CTCTATCCCTTAT 

Sequence 884 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTG 

GAGTCAAAACGAATCAGTTTTATATAATCAATTATTTTAGACTATCCGAAGCATAATGCT 

TAAGTTTAAAAGAGGAGACAAAAATAGGAGGACAAGATACTTGAGAGGCTGTGGCACTAG 

GACTGCTTGAGAGCCCAGGAGTTGGAGGCCAGCCTGGACAACATAGCAAGACCTCATCTC 

TGGTCAGACATGATGACGGTGGCTTCACCCGGGGGTCTCCGCACAGCAGCGGCCTCGGGT 

AAGCAGAACCTCGCTCCGGGGTTTACAAATCCTTCTCCCTTCCCCACAGCACAACACCCG 

CGGCTCCCCGCGTACCTCGGCCGCTCTAGAACTAGGTGGGATCCCCCGGGCTGCAGGGAA 

TTTCGATTATCAAAGCTTTATCG 

Sequence 885 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACA 

TAATGCAGGATCTGATGTCCTCCTCTGGTAAGCAGCATGCACCGCATAAACAGGTCCTTC 

ATGGCCTTGAAGATGCACTGCTTTTAAAAGCTGATTATCCTCTATTTCCCAGTGAATCAC 

TTGATTATCAGATCCTTCCAGAAAACTAATTCAAGTAGAAGGGGGAGCCATCCTGTTTAC 

AAATCCACTGTATGCAATTGACTCGGGCCGGTGTGACCATTCAAGTTGGTAACAACAACC 

CTTTTCAGGGGGTCATAGAGCACCACGGAGCAGGGACGTGCCAAAGGCCCAGAAAGTCCT 

CCCCCGCGTACCCTCGGCCGCTCTAGAACTAGGTG 

Sequence 886 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACATTTC 

TTTTCCAACACAAGTGAGGTGAACAGCCAGCCAATCCTGAGTATTTGAAAAAATGTATCT 

GCAGCAGGTTATGATCAGGGAGCTGTATTTTCAAGTAGTAACAGCACTGCGAAGAGCAAG 

CMTGAAAGGCCAGCATTCTTAATTGGAAAAGAAAGAAGAGGAAATTTCCCTCTGCAAGG 

GGGCTAATTTGCCAACTGGTGATATTGTTCAGTTCAAGGGAATGAAGAATTCAGAATAAT 

TTTGGTAAATGGATTCCAATATGGGGMTAAGAATAAGCTGMCAGTTGACCTGCTTTGA 

AGAAACATTACTGTCCATTTGTCTAAAATAATCTATAACAACCCAAACCCAATCAAAATG 
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AATTCAACATTTATTTTCCCC 
Sequence 887 

ATTGGTTCTCCCCGCGGTGNCGGGCAGGTACMCTTTAAACTGTATTGTATTCATGTTGC 
TAAACAATNTTGGCCTTCTCGA TGATT TTATTCATGTTGCTCCAAAGTTAAAACCCTGTA 
GAACTAAGTAGGTGAAGAGATATTTTGTATAAGTGCCACAGAAGAGAAAATATAATNAAT 
TTAAATAGN GG AATTGAGC ATCACTAGAATAAAATAAAATG AGTAG GC ATTCCTAAG ATG 
TGAAATGATCACCTAAGATATACATGCTCCAACCATATTGATTTTAGAACAAAACACAGC 
AGCCCATA ACAGTT TGTGGCCTCTACTAACTGTCCTCTGCTGTCCCATCTAGAGGGTTAT 
GTTTCTCTAj \ I i lAAAATAAAAATGTAGTTAAATTAGCCTGACGGGATGTTTCCTCTCT 
AT 

Sequence 888 

CCTATAGGGCTNTTTGGNGCANACCCCGGTGGCGGCCGCCCGGGCTGGTACTTCCTTCTT 

TTTTATGACTAAATNCTACTCTATTGTATGGACATGACACATTTTGTTTAAATTGTATAG 

GT^TGCCATACTTTAAGTTTGTTCCATAAATCACTTCATTAACATTTGGGT^ 

TTTTTG G G T AATTMTT AATTT AATT AATTTATTT ATTTTG AGATGGAAGTCTCG AAAAA 

AAAAAAAAAAGAATGTG TATC TACCCGCAGTTGTCAGGCGCAGTATCCATATATGTCATT 

TAGGTCMGTTTGTTGNTTTTCAACTCTTCTGTATTTTTACTGACTTTTT^ 

TGTTTAATCATTCACTGAGACAGGTGTGTTAAATCTCTCATTGGGGTGATGATTTATCTA 
GTTC CC ATTTTAATT 
Sequence 889 

AGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGATGAT 

GTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCGCGAGGGC 

TACGAGCGTNTCCTGAAGTATTGCAAAACCAAGCNGTGGGCGGCNCGNTCTTAGAAACTA 

GTTGGNATCCCCCGGGGCCTGCCANGGAAATTTCCGAATAATTCAAAGNCCTTTATTCNG 

GATTACCC GGTTC GNACTCTTNGTAAGGGGGNGGNGCNCCCCGGGNTTACCCCCAAGCNT 

TNTNTTTGGTTTTCCCCCCTTTNTAGGTTNGNAGGGGGGTTTTAAAATTNTGGCNGCCCG 

NCNTTTNGGGTCGGNTNAAATTNCAATTGGGGGATNCNANTAAGGCNTTGGTTTNNTCCC 

CTTGGGTGGGTTGNAAAAAATTTTGGTTTTTAATTC 

TTCCCAACCAANCCAANACCNATTTAACCCGGTAAGGNCNCCCGNGGGGAAAGNNCCAAT 
TTA 

Sequence 890 

G GCCGC CCGGGCAGGTCGCTCGTGATCTAGATAGTGAGCGGACGCGTGGGTCGACTCAAG 

ACTTTTAAAG ATTTATC AAAATTTG GTGAGCC GAATCTC AGAAAATTGGTG AATTC GGTT 

AGTTCCCAATAGGCCGCAGTAGTMTAAGTGGAGTGTTCGGGATATAGAAAAATTTTCAC 

GAATGG AATT AAMTTTCTAGGCGAATTCAGTGAGTTCCCAATGGGACATGATTGTATGA 

GTGAGCTTTTCAATATATAGCAACATTTTAGGTTCAAGAAACTTAAGTATCATGGTGAAT 

TCAGTAGGTTCCCAATAGGATTCGCATGTAATAAG 

Sequence 891 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAAACATGTGCCACGTCACCACACAAAACCAA 

AGTCTGCTCAGAGAGGTGGGCTATGGTGTGCAGGCTGCAACCTTTCTCTGCAATTGTTAA 

G TCTTC AAAAATCTGAGTTCCTCACATAAAATTCTGTGCTGTGGCCAGAGCTCGTTTTAC 

CATTTTCTTAGATTGGATCACTTTTAGGATCAGCTTCGTTGTTCTTTGCGTAGACAAATG 

ACTCTCACAGCTTTCTCCAAGTGTNCCAGAAGCACTAACTTACTGAAAATAGAATCTCAT 

CAAAGCTTAACATATTCACTCTGAAAACAGCGGANCTGCTGGGTCGCTTAAGGAAAGCTG 

ANAACCTNAAACCTGTGGAAGGAAAACCAGTGACCACTTGGGGCCTTATAAAAGTTTGAT 

TGGCAGGTGANGAAGGGGATCTCAAGAGGAGAATCCCNAATTTCAAAAGACATGGGAGAT 

TFrGTCCCTAAATGTTTTATACTAGTGCTCTTGNNAAATGGAAAACCCT 

Sequence 892 

CCGGGCAGGTACAAACATCCACACCAGAAGAGCAAGACTTAGAAATGGCATCAGAGGGAG 
AGCAAAAGAGGCTTGAAGAATATGAAAATAACCAGCCACAGGGAGAGAATGGGTCATAAA 
ATCAACCCAACTGGCTATCAAGAGMTTATACCTTGCAGAATGGCACCTTTGGTATTAGC 
GTACCT 

Sequence 893 

AGGTACC CCAG CCACATGGCCTTCATCCTTATGACCTAGCAGGCAGAACAGGGACCAAGC 

AGCTTCTATTTTGTCAAACTCCTTTGGACAAATATTCAACATTCAACAACAAGCTTTGTA 

AACCTAACGCTAAACAAGTCATGGCAAGCAAACTGGATTTTCTTAAGAAATGAGGAAAAG 
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TGCAAGTGATCTCAGTACCTGCCCG 
Sequence 894 

AGGTACTTCTTTGCAGTATACAAGGACTAGCAGTTAATATTGACCCAATCTTATATACGT 

GGCTCATCTATCAGCCTCAGAAACGAACGAGTAGACATATGCAACAGCAGCCTGTGGTAG 

CTGTTCCTCTTGTTATGCCAGTTTGTAGANGGAAAGAGGATGAGGTGTCTATTGGAAGTG 

CCCCCTTGGCAAAGCAGCAATCATATCAGGCCTCTGAATATGCCAGCAGCCCTGTAAAAA 

CAAAMCGGTAACAGAATCCCGTCCATTGTCAGTTCCTGTTAAAGCCATGTTGAATATAT 

CTGAAAGCTGTAGAAGTCCTGAAGAAAGAATGAAGGAATTTATTGGA 

Sequence 895 

CCGGGCAGGTACTTACTTGATGTGACTCTCCTCTCATGCCTGGGCCCTGCTTACAGGTGT 
GATTGTGACACATAGCTTGGCCTAGCCCCTAGGTTATGTTACTCTCCTCTTATCCTTCAG 
TTATTTTCACAAGGGGCATTGTGACATATTGCTGGACTGGGAACCCAGGTGATGTGACTC 
TCCTCTACTGCCTGAAATACAACCAAAAAAAAAAAAAAAAAAAAAAAGGTCCCT 
Sequence 896 

NGNGGCGGCCGAGGNACNNNNGNCANTTGNTGGGGGGGAAGNAAAACCCCCCCCCCNCAA 

CANAGGCAGNNCCAGANNNCAACNNTTTANCAGANGGGGGGAAGACCCCCNACA AAAGG N 

GCAGCCACAGNNCGNNNGANCNCCAGCNNGGNCCCCNGACCAAAAGNGNACCNGTTTTNN 

CGGCCNTTTTTAGAACGNGGGGGGNCCCCCCCC 

Sequence 897 

CCGGGCAGGTACGCGGGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAG 

AAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAAT 

GTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCC 

AGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAAC 

CTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAAC 

TTCCAGAACTGTGAGAAGTATATGTTTGTGGTTTAGTCAATGGTCTATGGTAATTTT 

Sequence 898 

AGGTACGCGGGGGACGCGCGTCTGTGGAGAAGCGGCTTGGTCGGGGGTGGTCTCGTGGGG 

TCCTGCCTGTTTAGTCGCTTTCAGGGTTCTTGAGCCCCTTCACGACCGTCACCATGGAAG 

TGTCACCATTGCAGCCTGTAAATGAAAATATGCAAGTCAACGAAAAAAAAAAAAAAAAAA 

AAAGTACCTGCCCG 

Sequence 899 

AGGTAC 1 nTTTTTTTTTTTrnTn I CAGGTCTCCAGGAAAGATAGTTAAGGCTATGCT 

AGTGTAAGTCTTTATGGCTTGCATGGAACCAGGCAGGCTACTGCTTTCTGCTG TGTG GCC 

TGGATCCTTTTGTTGTAGAATAAATAACTGCAATAGGAAAGAAACACATATATAATTTTA 

TTCTCCATTC ACAGCCCTTTG GGTTATAGTCTTTTCCCATATAGCAGATACTATACT AAA 

AAAAATTGGGGTGGAGGAATCTTCACTGACATCACAAGGTGATTTTTGAATCCGACTTTT 

ACAGTCTGAAATTCAGACTTCCTGCCCTACTGCAATGGGAGTAAAAACCACCTGAACC 

Sequence 900 

AGGTACGCGGGGGTGGCGCCAGGGATTTGAACCGCGCTGACGAAGTTTGGTGATCCATCT 

TCCGAGTATCGCCGGGATTTCGAATCGCGATGATCATCCCCTCTCTAGAGGAGCTGGACT 

CCCTCAAGTACCTGCCCG 

Sequence 901 

AGGTACAACAGTTTTTTAAAAGATTTCACTGAC ATTTGCAACAA i mil i CTAATTTCT 

TTTGGTGCCATTTTAGTAACTTAACAATTGCTTTMTAGACAAAACTCAATCATGCATGA 

TCAATCACAGGTGAGAAAATAGAGGCGTGCTTACTAATGGCAGCAAAAGGCTTCTGTGGG 

GGTGTGCGGGATGGAACAGCTAGCTAGCTAGAAAGGGGCTTCTTTATAGAGGGACTCCAG 

TGTCAGGCAGGGGCACTGAAATCTAGGGGTCACCAAGGGCCACAGCACAGGGGACTGAGG 

GAGAATCCAGTGGCTGGAACTCTCCTCACC 

Sequence 902 

ATGACATGGNTCAGCTTCGGTTTAAAAAAGGTCAGTGTCTATCTGGAAATTTCT ACGT GA 
GAGGTGATAGGAACCAGAGTTTACTTCTTCACACAAGAGGMCTGGACACGCATTTTCAC 

CACTTGCTGGAC , 

Sequence 903 

AGGTACGGCCACAAGAGGGTAAGAAATTATCGCATGGTCTAATACCAAATTTTCCCCCAA 
ATAGAACCTACCAAGAGATCGAGCAATCAAAGCGTTATCTGTCAAAGCAATTCGCTTCTT 
ATTGATTTTGCCATAACCACGCTTGTAGATTAGTTCATTTACTGACTTCAGATTGGGGTA 
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CCTGCCCG 
Sequence 904 

pu^a^™ 
G N GGA NN G NNCNNCCA^^ 

GACCCNAGCCCANNCCCACCCCCNGCfiAAAriMriArr Ar Anr-MMnr> ArrA uf+t+n a a , 



)AGCC 
TTTGT 




GCCNNCNCAGNANNGGGCAAGGNAAAGAGC 
Sequence 905 



oequence yuo 
?rArTrr A T G A^™ 

I;^™T AGG ^ CAGCTGCC ^ GAGCGGAC ^ ACC GCTATTTmCCCTCGTGCTGA^ 

T^^^I^CATGGATTATGCAGCTCAGAAGTATTGTTGGCACCCATGATTTCAGG 

A ^ C JI9I GTAMATGGATGTAG CCMCGGTGTGATTMTTTTCAGAGGACTArTCTTATC 

T^TTCAAAGTACCTCGGCCGGCTNTAGAAACTAAGTGGGATCCCCCGGGGCTGCAGGN 

ATrrCGATATCAAGCTTTATCNGATACCCGTCCNACCCTCGAGGGGGGGG 
Sequence 90S 

^^ G J GGCGGCCGCCCGGGCAGGTACTGGAGGAGTGAGT CCCTATGCTGACCCCAATA 

CTTGCAGAGGTGAGAGAATGCTCTTTGGTTGTGCTACAAGTGCCCAAGGCCCAACAGTCC 

TI™T?I ACAG £^ CTCCTCTCC ^ 

^o?^ G JSS^^ GA ^ GAAG ^ GG ^ G ^ AA ^ GAGG ^ GGGGAG ^ A GTGGATGTCTGCAAAA 

ACCAGAAGCGGCAAAAGCAGGTACCT 

Sequence 907 

CCGGGCAGGTACAAAGTGAGGAGGCAAGACAGGTGCACAGAGCATAGCTTTGTCCCATCT 

CAGGAACCCTGGGTTCCACCCCAGCTCCTGATCCCAAGAGATACG1TTCCCGGGACTCCA 

AGGGAGAGCTGAAACACTGGTCAAGCTCAGAGCCCTGAAGCTCTrrCCCACTCCCCGCGT 
ACCTN 

Sequence 908 



oequence yuo 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAATGAGGCTGTATTA 

AGGAGAGTAACMGTTCTAATTCTTGACCCATCAAATTCTTAAGGTGAAGCTGAGGACCA 

GGMGAGAAAGAAGATGCTGAGAAAGAAAACATTGAAAAAGATGAAGATGATGTAGATCA 

GAATGGCCCATTCTCACTGGAGCAGCAACTGGACGATGA 
Sequence 909 

GGGGGAGGTACGC ^^ 

^?5 GGCGCCCTCCGCCACGCAGCCGGCCA CCGCCGAGACCCAGCACATCGCCGACCA( 
CTGAGGTCCCAGCTTGAAGAGAMGAAAACAAGMGTTCCCTGTGTTTMGGCCGT^ 

]7 G ^ GAGCCAGGTC ^ 

GAGGACTTCGTACCCTCGGCCGCTCTAGAACTAGTGGGATCCCCCNGGGCTTGCAAGGAA 

TTCGATATTCAAGCTTTATTCGATACCCGTCGACCTCGAGGGGGGGGCCCGGTAC 
Sequence 910 

CCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACAGAGGTTACCCAGTGTTGCATTACTTT 

I^^ GATC ^ AAGAAGCATACAG CTGATGAAAATCCAGACA^ 

CTATTGGATCACTC^^ 

CTCAAAAGCAAA'fTTTTGATGTTGTTTATGAAGTAGATGGATGCCCAGCTAATCTTTTAT 

CTTCTCACCGAAGCTTAGTACCT 

Sequence 91 1 

AGGTACCAACTGGCCTCATCCTATATTCACTTTCGGCCCTGGGACCAAAGTGGATATCAA 
A ^ G ^ CTGGNGG CTTGCACCATC^^ 

AAATCTGGGAACTGGTCACTNGTTGTGTGCCCTTGCTGG 
Sequence 912 

AGGTACCACCCATGTGNAGGAGACTGCAAGNGAAGCTGTTATTCAAAGTAAGAACAGTCA 

GCCTTGTGCTTGAGTCCTGTTTTATGCTTGCGTGTTTCATACCAAAACACAAAGGCAAGT 

CTTCATATCAGCACTTTAGTCTTTGATATCCAAGTTAGCTNGACTACTNGTACCTNGNCC 

GGGCGGCCGCTNTAGANNTAGTGGATN 

Sequence 913 

CCGGGCAGGTACTATAGCTGTAAGGAGAAGCTGAGAAATGATACCCAGGAGCAGCAGGCT 
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TTACNGTCTTCAGCCTAAAACCTAAAAAAAA^ 
CTGAAAGCATCTGTAACAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 914 

CGTAATACCGACTCACTATAGGGCGAATTTCGAGCGGCCGCCCGGGCAGGTACAGGTCAG 

AGTCTTCTTTTCTTTTCTTTTTGAGATGGAGTCTTGCTCTGTTGCCAGACTGGAGTGCAG 

TGGTGCGATCTGGGCTCACTGCAATCTCCACCTCCCGGGTTCAAGCGATT CTCCTG CCTC 

AGCCTCCCGAGTAACTGGGACTACAGGTGCGCGCCACCAAGCCCAGCTCATTTTTGTATT 

TTTAGTAGAGATGGGGTTTCACGATGTTGGCTAGGATGGTCTCGATCTCTGGTCAGAGTC 

TTTTCTGTAAATATCCTTGGTAAAGAAGCAATTTTAGACTGTAGCTGTTGCAMTGCTTT 

AAGGAAGAAGCAAAACAACTGTCAGT 

Sequence 915 

AGGTACCATCGATCCTAGTGGGACGCGATCCAAAAATATGCCTATTAAAGATAATTGCTT 

TGGTTATGTTTAATGGGAAAGNCTATCTGTTTGGCTTAAAAAGGGGACAGATGTTCTGCC 

ATCACAAATTGACCAACANAATTNTTGTTTCTCCTGNNACTCCAGTAAGAAAAGACNCGT 

TTACAGACAGTGGGGTTCAAGGNNCAGNCACAGAA 

Sequence 916 

ACCGCGGTGGCGGCCGAGGTACAAAGTGTCTTTNTGTCCTGTGTTTNANCCNTTTAACAT 

ACAGAACCTAATTTTACTGGCATTTTMTGTTAATTCTCCACTCGAA GGTG AACATGGGA 

TGTTTGTAACACATATGTTTTGNTTATCAAGCACACATATGACCCCCTTTTCATGAATAT 

TCATAGGTTCCC 
Sequence 917 

ACCGNGGTGGCGGCCGAGGTACACTTTGCCTGCTCCAGCCCTGGAATCAGCCATTGCTGC 

TGTGCTATCCTTGACAGAACTATAACCTGAATTTAACAAGGAGGAAATATCAGGCAAACA 

CAMTGGAGTGACATTCTANTAAACAACTTGATCAGCACTCTTTAAAAATGAAAGACAAA 

AGAATGACTTGAGGAATGAGTTTC 

Sequence 918 

CGGCCGCCCGGGCAGGTACGCGGGGCAATTTGGAGAAGATAGAAGTTTGAAGTGGAAAAC 
TGGAAGACAGAAGTACGGGAAGGCGAAGAAAAGAATAGAGAAGATAGGGAAATTAGAAGA 
TAAAAACATACTTTTAGAAGAAAAAAGATAAATTTAAACCCTGAAAAGTAGGAAGCAGAA 
GAAAAAAAATNATTTTTTNAANNNNAAAAGTACCCTCGGCCCGCTCTAGAACTAGTG 

Sequence 919 A 

CCGGGCAGGTACTTCATGAAAACTACCATAACGTAAATTCCCCTTCTCTCCCCTGGATGG 

AAGTATCAACTTGGAAGTTACTAGGAAAAGGTAAGGGAACTGACTGAGGACACTAGGCTT 

ACCTTTTAATTTCGAAGAGTAACTGGATTATTTCAGGTTGCTACGATTTCTGCAAACGGA 

GACAAAGAAATTGGCAATATCATCTCTGATGCAATGAAAAAAGTTGGAAGAAAGGGTGTC 

ATCACAGTAAAGGCAAGTGTGTTTGTATTTTTA MGA TAATTTTGAGTTATCTTATGATC 

AAAAGTTTGAGTTATCTGATGATCAAAACTGAATTTT 

Sequence 920 

AGGTACGCGGGGCCGTAACTTTTCTATCCGTNCGCGTCAGCGCCTTGCCACCCTCATCTC 
CAATATGNCGTGGTCCGACCCCCAGTGGCACTAACGTTGGGANCCTTCAGGGCGCTCTTC 



CCAAG 
Sequence 921 



Ocuucntc Ofc i 

AGGTACTTTTTTCC I M I I I I I I I I I H I I AAGTGCGTTCATTCTCACTGCTGTTATTGT 
TTTCTGACAGCATGTCTGAACCAGCTAAGTCAGCTCCTGCTCCGAAGAAGGGTTCCAAGA 
AGGCTGTGACCAAGGCGCAGAAGAAGGATGGCAAGAAGCGCAAGCGCAGTCGTAAGGAGA 

GCTACTCCGTGTATGTGTACCTGCCCG 



Sequence 922 

CCGGGCAGGTACCCAGTTCCAGGCCTTGACTCTTGGATGGCATTTCTGGACTTGCCCTGG 
GCCAGAAGGAAGCTCACTGCCCTGAATGGAGAGTTCCAGGCCTGGCAGCATTCACCACAA 
ACTGACTAAAGAGGCCCTGGGCCTGAAGTGAACATCAGTGGTAGTCTGGCAGTATTCCTT 
ATGGACCTGTGGTGGTGGTGGCCATAAGGGTGAGGCTCCTCTGCCAGTGGAAGGGGGAGG 
GAAGAGTAGGAAGGACTGCATCTTGTNAGTTTAAGGTTCCAGCTCAGATGCAGTAAANTA 
GAGCATGNCCGGNNGATTTCTAATATTTTTTACTTNNAGTCCCTGGCTCCTGGAAA 

Sequence 923 

GGAGCATAAAAGTGTAAAAGCCTGGGGTGCCTAAATGAGTGAGCTTAACTCACATTTAAT 
TGCGTTGCGCTCACTTGNCCGTTTTTTCAGTTCGGGAAACCCTTGTTCGTGCCAGCTTG 
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Sequence 924 

AGGTACATATCACAGGATTAAAACTCCAGTTAAGCAACTGAGCTAATCATTGAAGTAAAT 

TAAAAATACCAAGCTTCTTTAACCTATCAATGCTGTTTTAGAAGCATCATCCGAACAAAT 

AGAGATTTAGTTATAMTTGCTGGGCTACATTCTGTGATGAGMTTTTGCTTAGTACCAC 

TAGAGGAAGAGAGACTAGAGGCCCAAGAAGANGGTGAAAGTAGAGCTATGTGCCCAAGAA 

AAACTTGTAGAGATGGAATAAGAACTGAAACAATAGTAGAAACACTTGCCCTGAAGAAAA 

AGGATGGAAAAATGAAATAGATTGATTTTTAGCTGGTAGGGMGAAAGTTAACATAGTCT 

TAGTCGGTTGTGTTTTCTCMGGMCTGAGMGCAGGATTAGTTTTAGAAAATTTGAAGA 

AGGTAGTGAGAGGNTGAAGCCAGCTGGGTCGAGTGAGGNCTTGGANAACTTTTTTNTCGN 

ACCTGCANGCCCTTCTTANACCCTACCTTTTTTTTNGG 

Sequence 925 

CTATAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNTNTTTTTT 
nTTTTTTTTTTn I NNAAGAACAAAATGTTTATTTTTTGGATATAGACATTTAAAACAT 
TCATNTCCAAGAACAG CTTCAAT CAGGTATACAATAAGAAATAGACTTTAAATCCTAATC 
AGAAAAGGN AACA GAATTTTTTTGAATGCTGCTGTANACACTAGTGTTAGAAAAAAAAGT 
TTATAAATGGTTTTGCAC CAAAAAAATATGCAAAGAAACTTG AAA 
Sequence 926 

GGCGGCCGCCCGGGCAGGTACTACGAGTGAGTGAGGCTGGGAGGAACACCAACCTAAGCC 

AGGGTAATGAGGGGGGACTCTTTACCCAGGACCCTGCCCACTGGCCTTCCTCTCTTCCAA 

ACACAGGTTCCGGCATACCCAGGTGTGCAAGGCCTCAGCACTGAAGCATGGNGGGGATCT 

GGCACAAGACCCAGCCTGGACAGAGATCTTTGGTGTTCTCTCTGNGGCCACCATCAAGTT 

TTGAGATGCTGAGCACAGCCCCACAGAGTCAGCTCTTCCTGGCTCTGGCTGACAGCAGTA 

TCTCCACGAAGGGCACAAAGAGNGGCACCTTTGTCATGTATAATTGTGC 

Sequence 927 

CCGGGCAGGTACTACGAGTGAGTGAGGCTGGGAGGAACACCAACCTAAGCCAGGGTAATG 

AGGGGGGACTCTTTACCCAGGACCCTGCCCACTGGCCTTCCTCTCTTCCAAACACAGGTT 

CCGGCATACCCAGGTGTGCAAGGCTCAGCACTGAAGCATGGTGGGGATCTGGCACAAGAC 

CCAGCCTGGACAGAGATCTTTGGTGTTCTNTCTGTGGCCACCATCAAGTTTGAGATGCTG 

AGCACAGCCCCACAGAGTCAGCTCTTCCTGGCTCTGGCTGACAGCAGTATCTCCACGAAG 

GGCACAAAGAGTGGCACCTTTGTCATGTATMTTGTGCCCGTCTTGCCACACTCTTTGAG 

AGTTACAAGTGTAGTATGGAACAAGGTCTGTCCT 

Sequence 928 

a i n i n 1 1 1 \ 1 1 1 1 n n 1 1 1 iggcngcttttcctgattttttaactactcatacaata 

GTCTGGT AACT GGT NTTCT TTTTAAGCAGGGAACTCCNGGATTCAAATNCCTGATAATTA 

AAGGATCTTTTGATATTTTGGCAGTTNCTCTCACTAAAAGMGTTNCATTCANCAGATTA 

NAATGATTCATGAGAAATTCTTGGNTAGTAAATATTTAATCCAGATTTTATAATTGCCTA 

AATCTCTTTATAGGTTATTTCTTGCAATATTTCAAGATCCTGAGTCAGCCATGCTTATAC 
AAGC AAAC TTTTATTTA 

Sequence 929 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACCTCCTGGCAGGAGGCCAACCAACACAAAA 

CTAGTGCAATAAACAAAACTACAACTAAGGATCCTCACAAGAGCCCATTTCACTCCCCTG 

CCACCTCCACAACAGCAGGTGCTTGTATCCATGGCTGAGAGACCTGAAGACTTTCACATT 

ACAGGACTCTATGCAGACTCCCCCAGTACCT 

Sequence 930 

CCGCGGTGGCGGCCGAGGTACTTCTATGTCATCATGGGTCTCAAAGGTGTTGTCAAAATT 

GAAMGATACTCAAACTTGTCACTGGAGATGCTGCA^TCTATGACTGTTTTTCTGCTTGT 

ATTGATGAT TATGAA TGG CAGCT GAATGGCANAGTrCAGAGCCGGCGGNCCCTGGGTTTT 

GCTGCTTNATrrTTGTNAATTTTTTTGTACCCTGCCGGGGNCGGCTT^ 

NGGNNCCCCCCCNNGGGGNNNGGGAAGNANTTTrTATTTNTNTATTTTTTTNNTCCCCCC 

CCCCCCTNGGGGGGGGGGGGGGNGGNCCCCCCCCCT AI 1 H 1 1 i I M 1 

Sequence 931 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGATTCT 

GATTCACTGATAATCGACTCATCTGTTACTAATACTTCCCCATCTGGCTTTCTGCGCAGC 

ACCTTTCTCTTCATCACTGGCTTTCGGAGTCTTTGGGGAGGGCCTCAGAGACTGTTTCTG 

AAGGGACATTACTTTCTACATGTGGGTACCT 

Sequence 932 
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TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCA GGTAC TGTGCAATGCCT 
CAGGTTG CAC CTG ACTTATATGCTG AACTACAG AAGGCAC ATTTAGTTTTATTC AAGG GT 
GATTTGAATTACAGGAAGTTGACAGGTGACAGAAAAATGGGAGTTTTTCTGTTCCATTTC 
ATCAGGCTCTGAATGGCTTCCATCCTGCACCACTCTGTACCT 
Sequence 933 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCGGGGCTGGCTCC 

TCCAGTGTCTTCTCAGTCCTGAGCAACAGTGCAGAGGTGAAACGGGAGCGCCTGGAAGAT 

GTGGTGGGAGGCTGTTGCTATCGGGTCAACAACAGCTTNGGACCATGAAGTACCTGCCCG 

Sequence 934 

AGCTTTACCGCGGTGGCGGCCGCCCGGGCAGGTNCTTTCAAACAACGCGGTAGGCCTTCC 

CTTTNGGGTCTGCCATGACAACGATACCCCAGGTGGCAATGCTTGAAGCCGATGACCATC 

TGCGTAATGGNCCATTGGACTNTGGGGGGTATTCCATCTTTAAATGGGCCTTTNCGTGCC 

ACCCATTACCTTNGGCCCGCTTTAAAAACTAAGTTGGGATCCCCCCG GGCT TTCAANGGA 

AArrTTNGAATATTAAAAGCTTTATTNCGAATTCCCCGGTCGGAACCCTmNAAAGGGG 

GGGGGG 

Sequence 935 

TAGGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTG TATCA TCGC 
AGTCTTGC U I I I I GAGTAATGGATTCCTAGATTCTATGAGGATACCACAACCACTTTTA 
AAGAGGTTTCTAAGGCCAGGTGCANTGCTTACGCCTGGGAGACCAANGTGGGAGGATCAC 
TTGAGCTCAGGAGTTTGAGACCAACTTGTACCT 
Sequence 936 

NCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACAAACACGAT 

CGCAGATCCAGGTTTCTCAAACTGGAGCATCTGCTTAATTTTCCCATAAAATCAGTCTTA 

TTCTTTCTGACAGCTCTGAGACTCCTCCGGCCACGACTAGGTGCTGTCCTGGAGGAAACG 

GTGGAGGA 

Sequence 937 

AGGTACGGGAGAGACTCATAACTCTCCAGGCCCAGTGCTGGGGGAACCCCTCATGGGAGC 

AGCTGTAATGAGGCCTGGAAACTGGACACTGCTGAGCCCTGACATCCAGACCTCAATGCC 

CTGACTCAGCCCATGGCAGCGACCCCTTCAAACAGGCTCCATGTGGGCCTGTAATCAGGA 

AAGACTAGAGCATCCAGGAGTGGAGATTTGATTCTGAGATGCTGGGAGTTGTTCCTGGTT 

CTTGCTCCTGTCTGGCTGGTGCTGACAGCCAAAAGCGGGGAGTTGTGTGAGCTTTGTCCT 

TCAGCACTAGAGGCTTCACTCAACCCATGAGAAGGGGATGCCCCAGGCTTGCCCATCACA 

AGCACTGGCCAAGCTGACTTCCCGCTGCTGAGGGAAAGA 

Sequence 938 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTCTGGATTCACTTTTGATAATT 

ATGTCATGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGGCAATCA 

GTGG N AGTGGC GGTG GC AC AC ACTACGC ATACTTC 

Sequence 939 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGGGCCTGA 

GMTTTACATTTCTAGCCAGCTTCTAAGTGATGCTGATGCTGCTAGTTTGGAGACCACAC 

CTTMGAACCACTGCTGTTG.GTCTTGTCTGCACAGTAGATGCACCAAGGCAAATGTTTAG 

AAGTACCT 

Sequence 940 

CGCCCGGGCAGGTACTGCTGACTGCCCTGGCTGGTCAGTGGGGCAAAGCCAGGCATGAAG 

AAGTGCAGGTAGGGGAATGGGATGATGTTCATGGTCAGCTTCCACAGGTCAGCATTCAGC 

TGGCCTGGG/vATCGCAGGCAGGTGGCAACCCATTTATGGTGGCAGATACTAGGAGGTTTA 

GGTCACTCT^vGGTGGGCATGGGCAGCTTTAGGGTTCTGAAGCAGATGTCATAGAAGGCTT 

CATAATCMTGCAAAAAGTCTCATCTGCATTTGCTATGAGACTTGGAGGGTGAC ATTGT G 

GGGCTCTAGCATGGTGCCTGACACCTTGGGCAAGMCAAAAATGCrJNAATNNGTTTTNNA 

MNCCTAT7TAAAATGGGTCTGAATMCTGGTGATGAATTTACTTGTATAATCAGAATTC 

MGATTAAW.TTNAGACMCTCTT1TGTTTNCAACAGAANGGAAAANNANNATTAANTN 

TTNNANNNNGTCCCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTNGAANGAATTC 

NATATCAAGCTTATCGATNCCCGTCGACCCTCGANGGGGGGGCCCGGTNCCCAACTTTT , 

Sequence 941 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTGAGACAATAGC 
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ACCAAATTCAACAG AGAG AAAGAATGAGTGAGAGAGCACTTTACAC CAAG GCTCTGCACA 
TAATTGGTGCAATTTGAAATTGAATGGCTCAGAAGACTGCTCTGTGAGGAGCAGATTGGA 
GAGGATANACTCATCTTGCCGCGTACCT 
Sequence 942 

CGCCCGGGCAGGTACTCCCCTCCCCAAATAGAAACCTCAAAGACTGATCCATTTCCCCTA 

GGGCCTGGGCCAGGAGTAGCTCACTGCTCACTGCTGAGGAGAAAGGCACAAGATATAATG 

TCATAAGAGCAGGAGAGTGGCTCACCTACAGAGTTCCCTATAGGGGAAAGAAGGCAGGAA 

ATAGGCGCAGGGTCTGGTCCTGTCCCTGCACCACCCTGAGCAGCTAGTCTTGGGAAGGGA 

TTACAGGCCCT GGGC CATAGGCTGCTCGCCATTCTGCTTTCCTATCCTGTTTCTCTCCCT 

GTGCTGCTCCCTTTTAGCCAGGGCTGAGAAATGTTCAGCACCTGAGGCAAAAACTGCCAT 
AATACCT 

Sequence 943 

TGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCATGGTACCACGTGCATGAAACCCCTGTCA 

CATATCCCCTTGCTTGCTCAAATC AATCACGACCCTTTCATGTGAAATTTTAGTGTTG N G 

AGCTCTTAAAAGCGACAGAAATTGTGCACTCGGGGAGCTCGGATTTTAAGGCAGTAGCTT 

GCCAATGCTCCCAGCTGMTAAAGCCCTTTCTTCTACAGNTCCGNGTCTGAGAGGTTTTG 

TCTGNGGCTTGTCCTGCTACAGTCCTTGGTTCCCTGACCATGGAAGCGAGGTTGACTGAC 

AAGCCCTGTGGAGCGTCCCTGNANAGGA'CTNCGGCCTGCNTGAATGACNCAATCCAAAGA 

GCGCTTNCCGGNTAGGAAATGGNCCTGGTGGAATGCC 

Sequence 944 

CCGCG GTGG CGGCCGAGGTAC rTTTTTTTTTTTTTTTTTTTTl GACCATCCACATTATCT 

GTTTTATTTTGGCAAAATGAAAG CATCTTAAAAAAAAAAG GC AATCAC AAAATACAGGGC 

TTTCTTAGAAAGCAACATGCATCATATCTTGMCAAGTTACCCGAGCGGTTTCTCCGTCC 

TACAAACAGACATGGATCAATCTCTCCTTCTGGGAGCTTCTCTGTCTTTGCCCAGGTCGC 

TCCCTGAGAGGTGAGGTCTGAACCCACTGGGAAGCAGGACGATGTTCAAGGCTTGTACCT 
GCCCG 

Sequence 945 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAAGTTTACACCGTAA 
GAGGCAACATGGTCAGCCACAATGTCTTCACCTCCACAAGGGCTCATNACGGTGGTCAGG 
GCGAGGGCCCCCAGCATCAGAGCTTTGTTTAGGATCATNCTCTTTCCAAGGCAGCCTCAN 
C AGTTGCTGTTNTGAG CTGN AG AGC AATTG N N C C C GC GTAC CTGC C C G 
Sequence 946 

AGGGCNAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCTGCCTGGAGCCTGGTAGACT 
TGCTGGGTGGCTAGATTCAGAAGAGAGAGAGCAATCACTACAGCTCAGCTCTCAGGAAGC 
CACATCCATAGGAAAAGGGAGAGAGTACCTGCCCG 
Sequence 947 

TTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC nTTTTTTTTTTTTTTTTl 
TTTTTNTTN NG AGTTACAGTAATTTTACTTTAN CAATCAN AC ATGTATTCTGTGTCTC AC 
G GGCCA CCTGCTGAATGANAGGACTCCAGTTGAAAGGTCAAGAACATAAAACCACAAAAG 
Cm u l GAGTGGGTCTTN AACTTANTTAAAATAAAAATAAAGGACCTN CCGCGTACCTGC 
CCG 

Sequence 948 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAATTCATGTGGAG 

GTCAGAGTGGAAGCAGGTGTGAGAGGGTCCAGCAGAAGGAAACATGGCTGCCAAAGTGTT 

TGAGTCCATTGGCAAGTTTGGCCTGGCCTTATCTGTTGCAGGAGGCGTGGTNAACTCTGC 

CTTATATAATGTGGATGCTGGGCACANATCTGTCATNTTTNACCTATTCCGTGGAGTGCA 

AAGACNTTGTGGTAGAGGAAGGGACTCNTTTTCTCATCCCGTGGGTACCTGCCCGGGCGG 

CCGCTCTANAACTAGGTGGGATCCCCCGGGCCTGCANGAnnnnnnnnnnnnnnnnnnnnn 

nnnnnCGTCGACCTTCGAGGGGGGGGCCCCGGTNCCCANCTTTTTGTTCCCT 

Sequence 949 

GCTCCCCGCGGTGGCGGCCGAGGTACTACTCAGCTGATCACAAGCTGCTTGATGGGAACC 

TACTAGATGGACAGGCTGAGGTGTTTGGCAGTGATGATGACCACATTCAGTTTGTGCAGA 

AAAAGCCACCACGTGAGAATGGCCATAAGCAGATAAGTAGCAGTTCAACTGGATGTCTCT 

CTTCTCCAAATGCTACAGTACCTGCCCGGGCGGCCGCTCGATATTATTCTTCAATTCAGC 

TTGTTAAACCTCCTTCAGGATTCTAAAACCTTTTAGACTCTTAAATTGCAGCCTTCCATG 

TCCCTTGTCCTGCCTCCAGCACACTCTTCAGTAAACAAAAGTCAACAGCACCAGGGCA 
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Sequence 950 

ACCCTACTATAGGGCGATTTTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACCAACAGA 
AAACCTGGCCAGGCTCCCAGGCTCCTCATCAATGATNNATCCGACAGGGCCACTGGCAnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnTTGCAGTTTATTACTGTCAGCAGCGTAGCAACTCGTACCTGCCCG 
Sequence 951 

GCGAATTGGAGCTCNCCGCGGTGGGCGGCCCGAGGTACTTTGGCCTCTCTGGGATAGAAG 

TTATTCAGCGGGCACACAACAGAGGCAGTTTTCAGATTTCAACTGCTCATCAGATGGCGG 

GAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGCCCCTGGCCA 

AAAGTGTAAAGATTTGTAGAGCTTGCATGCAGnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTCAGGGA 

CCCCGGAGGCCCGATTAGAACCCAAATAGATCAGGAGCTGTGGAGACTGCCC 

Sequence 952 

ACMCTTTGGATTGTCAATGAMGCTTGNTTGTGGATACTNATCAAACNNTTTTTNGTCT 

ATTTANTCCTTGNTGAAGCTAAAAAGGAAAGAGAGAGTGAGAGGCAAAGAAAGACAAAGG 

AGGAAAGACCTGCCCGGCGGCGTCA 

Sequence 953 

CCGGGCAGGTACAGATGATGACTGCAAAATGAAGACCTACTTTCAACTNCTTTTTCCCCC 

CTCTAGAAGAATCAAATNGMTCTTTTACTTGCCTCTTGCAAAAAAAAGAAAAAAAGAAA 

AAAGTTCATTTATTCATTCTGTTTTTATAGCAAAACTGAATGTCAAAAGTACGTGCCAAG 

CATCCTCGTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGC 

Sequence 954 

AGNATTTGGTATCTGCGCTCTGGTGAAAGCCAGGTACCTTCG GAAAAA AGAAGNGGGGAG 

GCTCTNGAACCCGCAAAAAAAACACCGCTGGGAGCGGGGGG TTTTT TGGTTGGAAGCAGC 

ANAATACNCNCANAAAAAAAGGACCTAAGAAAAACCCTTGAACTTTTNTACGGGGGCNGA 

CCCTCANNGGGAACAAAAACNCACGNTAAGGGAANTTNGGCATGAANAATANCAAAAAAG 

GANCNTCACCCAAAACCTTTAAAA 

Sequence 955 

CCGCGGTGGCGGCCCGAGGTCAACGGCCAGCCAATCACCAAAATGGCCTGTGGGGCTGAA 
TTCAGTATGATAATGGACTGCAAAGGAAACCTCTATTCCTTTGGGTGCCCTGAATATGGT 
CAGCTGGGACACAACTCAGATGGGAAGTTCACCGCCGGGCACAGCGAAGAGTACCTGCCC 
GGGCGGC 

Sequence 956 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTAGATGCAAAGTTTTAGATGTTGGTATTTG 

AATGGGATACAAMTTGGGAACTTGGAATTTGGGTTGAGACATCTTTCCTATGCAGTTGG 

GAACTTGGAATCTGGGTTGAGACATCTTTCCTATCCTTGCAAAAAAAAAAAAAAAAAAAA 

AAGTACCT 

Sequence 957 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGG CCCGAG GTCGCGGGGAGGAATTAACATA 

CACTAACTTTGAGGAAGATGTGCTCATATACATTATTTTTATAATTCAGTCATATGAGGC 

ATmATTCACATCCTACATGAGGAGATTGAGA^AAAATATGTAATGTAGATTCAAGTT 

GACAGTGAGTGAGGCCAGAGTTCATACCCCAGTCTCGTGGTCTTTCTGCAGCCTCATGAT 

AG AGC AAGGATATTGTCACCTG AGAAACCAAC A TTG AACCTCCTTCTCTG AAGCC C AGTG 

AGTGCCCTTCCTCTAGACTCTGGCCACAGCTGGGCCTCCCTCCATCAGCCCACATAACTC 

ATTGCAGTGTGCTTATCTGTTCACTACTCCTTGTCTTCAAAGGAAACTCCAGAATTGTAA 

TAGCTGTATCCTCTTCATCTCTGAACCTCCAGG\CTTCAAAAGAAGTGACTGGACGTAGT 

AGGTGCCTGGGAAAGGTTTTTGAATGAATG 

Sequence 958 

GTGTGCCACCGCCACTACCACTGATTGCCGAGACCCACTCCAGCCCCTTCCTGGAGTCTG 
GCGGACCCAATTCATGACATAATTATCAAAA.GTGAATCCAGAGGCTGTACCT 
Sequence 959 

CCGCGGTGGCCCGAGCGGCCGCCGGGCA3GTACATTTAACAATTGGACATACAGCACTGT 

AAACAATTCAAAGTCCATTGAGCTTACCATGAAAGAACTGAAACGTATGTAGGAACTGTG 

TCCTGCTGCAGACAAGCAGCTGCTCTGGCTCTGGTGCGGTGCGGTCANGAGGTGAGGTGG 

GCAGGTGTGGAAGGGAGAATTCCTGAGGCCAGTCACAATTATTGAAGCTAAATTAATTTC 

TTCCTGACACTTTAGCATTAATTCTAAGTAGTTCTAAGTTCTAAACCATTAACTGTATAG 
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TTAAI Mill IAAAT CAAA GAAGAAAAAGGCTTTCACACAGATGAATGTATAGGAAAACA 
CAA GTAATG ACCAATTTTCAAGAGCCTTATTGTAAAAAAAGAAAGAAAGAAAAAAGGACA 
ATTATTTTTGGAGTTACTGTCTCTAATTCTTAAAAAGTAAGTACCT 
Sequence 960 

CCGGGCAGGTACTCATCAGATGGAATGTTTTACCCTGCCGAGGTTNTAGTCATGATGTGC 
TGAGCTCTCTGNGTCTGACGTGACTGACTGGTANCTGGGCGTTGGCNNGACCCTCCTTTN 
N CTNTAN N ANC ACATN ATG N AGAATTTN NC ACCTATG GGAAAATTAAANTGATNG AC ATG 

CCCNNNGAAAGGATGCATNCCCATCTGNNCAAAGGCTATGCGTACCTCGGCCGCTNTAGA 
NCTANT 

Sequence 961 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGGTTTTAACTTC 

ATGTGTCTGAAAGCAGCACTGAAGAGGTTAGAAAAGACCATCTTGATCTGAGGATGCCAC 

CCCTCTCCCATCTCCCAGCAGCAGCCACCGTGGCATGGAGAGAATCTGTGTGCTAGGGAG 

AGGGACAGCACAGTGATTGTGAGATACTGCTTTGAACTCAGTGCTGCCCTGTCACAGCTG 

AAAGCAAAACTGGGCTGAACTCAGCCAGCACCCATCCACACAGGGAGCATTTAGATGAGC 

TCTAGCCAGAGAGGAATCGCCCATCCCAGCAGTCTGAATATGAGTTCCAGCAAGCCCCGT 

CACTGTGGGCTAAATTGGTCCAAGACCTTAAATAAACTTGAAAGGGCAGTCTAGGCCATA 

AAGACT 

Sequence 962 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTGCCAGTTTCTTCT 
CCACTGCACTCTTATATTCTACCAAAAATTTCTCCATAGCACCAAATCCCGCCATTTCCG 
AGCTACGATCACCCGCGGCTGAGGAAGGACGAATCCCCCGCGTACCTGCCCG 
Sequence 963 

CCGGGCAGGTACTCCGCCATTTTACGTGAGAGACTTGAG CATTCTTGGATTTG GTATC CT 
CAGGAGTCCGGGAACCAGTCCTCCATGGATATCMGGGATGACTGTTTGCCCGTGTTTTT 
AGCCTTTGTATCTTTGCTCAAGAAATCCCTAAAACCTGGAGTACCT 
Sequence 964 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTC rri I ( I I iT CC Tl I N I I I 
AAGAGTATAAGGTTTACACAATCATTCTCATAATGTGACGCAAGCCAGCAAGGCCAAAAA 
TGCTGGAGAAAATAACGGGATCTCTTCCTTGTAAACTTGTACCTGCCCG 
Sequence 965 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTCCTTGTT 

TTCCCTCTGATANGGGTTTGGATCTGTGTCCCCACTAAATCTCATACTGAATTGTGATCT 

CCAGTGTTGAAAGTGGGGCCTGCTGGGAGGTGATGGGGGATCATGGGGGTGGAGTTCTCA 

TGAATGGTTACCACCATCCCCCCTTCATACTCTCTCTTGCTCCAGCTCTGGCCATGCGAT 

GTGCCTGCTCTCCCTTCCCCCTCCACCATGATTCTAANTTTCCTGAGGCCTCCCCAGAAG 

CTGAGCAGATGCCAGCCATGCTTCCTGTACCTN 

Sequence 966 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGCAATCGGGTTGCCAGC 

AAAGCACTGGATGCAAGCCTTGCCTTCCAGAAGCTTACCAGTCGGGTTGCCAGCAAAGCA 

GTGGATGCAAGACTTGCCCTCCAGGAGCTTACCATCACAACGAAGAAGACAAATAAATGC 

ATAATATACAGACGACATAAATCCATACTGCACTCCAGCCTGGGTGACAGAGCAAGACTC 

CATCTCAAAAAAAAAMGAAAAMGAAMGAAAAAAMGAAAGAAAAAGAAAAAACAAM 

AACCANAAGAGCAAGGGAAGGCTCACAGGGCTGAATCTTGAAGAATGGGGGTTCTCAGGG 

TTACAGAGAAGGGGGAGGACATTCTGGACAGAAATGGAAATGTGTGAAGATATTTGTGTA 

CCTGCCCGGGCGGCCGCTCTAGAACTAGGTGGATCCC 

Sequence 967 

AATTGGAGCTCCCCGCGGTGGCGGNCGAGGTACCAATAAGCATATTGCTTTGGCAATGCA 

TCTCCAGAGCANGTGACCCTGGCCGTCTGTCCTGGGGACACTGAC 

Sequence 968 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCTAATGAAACCCGTCTTTGG 

CCTGTGGCTGAGACGCCATCTGTAGGCGGTGAAATCTTTCCAGCCTATGTGACGAGGGTC 

ATGCCACAGGGTGCGCACCTGAAGGAAAAGGAAAAGGAAAGGTTGAAACACAGCTCCTTC 

AGGGCTTNTTCTTACCTGGCCAGGGGTGTTTCCTGTGTGCCACAGGGCGTTCCGCAGGTG 

CTCGCCAGGCCCTGTGGTGGAGTTTACAACTTTCACAGAAAGGCCCGAGTATCCCTGAGC 

CCTCGTGGGGTTGGTGTCCCAGTAGGACTGGGTGACTTGCTTCACATCACAACATAAAAG 
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CGGCTGCTGGACTGGTAGCCAAAAGACAAATNCAGCATAGTCATCGTCCCTTTTCGGTGT 

TGATGAAGAAGGTGCCACTTGAAGTCCACAGCATTAAACTCATCATAACCTGGAAGAAGA 

ACCCAGAGCCAGGTTAAAACCTGCAGCCTTCAGAAAGGTTCTTCCGTCATGTTTCCTAGA 

CATTCCTCAACACCCACAGGGGATAGGTACCCTGNCCCGGGCCGGCCGCTTNTAGAACTA 

AGTGGAATCCCCCCCGGGCTGCAGGAAATTCNATATTAAGGCTTTATTGATACCCGTCNA 

CCTTNAAGGGGGGGGCCCGGGACCCCAACTTTTTG 

Sequence 969 

TGGAGCTCCCCCGCGGTGGCGGCCCGCCCGGGCAGNTACTi I I i 1 i I I I I I NTTC Mill 
TAAGATTATTNTTNTAAAAGGGGAGATAGGTAGGAGTAGCGTGGNAAAGGTGATGAGTGT 
GGGGAGGAATG 
Sequence 970 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACCTCATCCTGTCTC 
CAGGCCACTGAATAACACCCAAAAGAGCAAGCAGCCCCTGCCATCGGCACCTGAAAATAA 
TCCTGAGGAGGAACTGGCATCAAAACAAAAAAAAAAAAAAAAAAAAGTCCTCGGCCGCTC 
TAGAACTAG 

Sequence 971 

ACCNAAAGCATTCTACCACAGTTGTATTTGACTCCCACTTGTAATMCTCCTTTTAAAAA 

TTCCATGTTTAACCATATGACCCTGCTTGCTTACTCATATTCTCCCTCCCTCTCCCCTTC 

CTTTCTCTTCTTCCAGAAGTCATTTGCTGTTGAAATATTTTGTAGGGATTGCTTATTATA 

TTATTTTAGCTGATGAACCTCAGGACAACCGTCTACACACACACACATACATACACGCAC 

ACAAAATCTCAGCTGTTGAAGAGTGGGCTTGGAATCAGACTTCTGTGTCCAGTAAAAAAC 

TCCTGCACTGAAGTCATTGTGACTTGAGTAGTTACAGACTGA TTCCA GTGAACTTGAATC 

TAATTrCTTTTGATCTAATGAATGTGTCTGCTTACCTTATTTCCTTT^ 

CCAAGTAGTTGCTAATTTTrTGACMCTNTAAAATGAGTTNCATTCACTTCTTTTACTTA 

ATGGTTTAAGGTATAGACCTGCCCGGGCGGGCGCTCTAAGAACTAGGTGGATNCCCCGGG 

GCTGGCAGGGAATTCGAAATCAAGCTTAATCGATTNCCGNCCAACCTCGANGGGGGGGG 

Sequence 972 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACI I I I I I I II I I I I ICTCC 

ATGGTTCTTTNGGTATCTCTCCTTCCTTTCCCTGTCAGCCACAGTTCTTTATATTAAATT 

CTTTTGATGGCGAGAATCCTGGAGTAGGAGTGAGAACTGGACTGTAGATGCACCTCTCCC 

ACTAACCCTGTTNGGGGCAAATACGTCTGTCTCGTTG GGAGATGA ACTTTACAAGTTCAA 

TTACTAAATCTGAAATATCTGGATTAGAATATTTCTAGG I I I I M I I CTGCTCTTGACAT 

TCTATGATAACTCCATGACAAATGTTCTTTTTCCAAATAATTAAAACTTAAAACATTTCC 

GTTTAAACCTTCAAAGACAAAGACMCCTTNTTCTTTTTGCATTTGGGGTCATAA^ 

CTAAACTAATAAGGTTAGATGTTTTAGGGGAGAAATCTATTTGCCTACAATAACCACCCC 

CCCTCCAAAAAGCC 

Sequence 973 

CGAGGTACCTATGGGCC AATG AATCCCACCATTTTC AAATAAATC NTTN NTCAN ACTCC A 

CTCAGNTGGAAAGTTTTGCTGAAAATATTTCTGGATAAAAAACAAAAAACCTCATGTTGG 

AAATCAACTGTCCTTCCCATTTGGAACATTAAGAGGAGTCCCCAGGGAACACTGGTAATA 

GGTGAAAATGGTTCCTGGAGTCACCAGTGTTGTACAGAGCCGCCAGGCGACTCAATTCAC 

GTGCTCCTTTTCTGGGGATGAAAAGGTGCTCACCAGATCTTTCTATGCAACTGAGAGCTC 

CTAAATCAAGTCAAAGGGGACCATCGTACCTGCCCGGGCGGCCCGCTCTAAGAACTAGTG 

GATC 

Sequence 974 

CCGCGGTGGCGGCCGAGGTACTCAGAAGTGTCTCTAGGAGGCTGGGCCAGTTTCCTCTCT 

TTTCTGCAGGCGTCTCCAGAGCACCCTCTCACCAGCTGTATCTACTCACAATCGTCTGGC 

ATTTGGAATCTGGTTGAGTTTGGGTCCTCAATACCCAGAAAATAGAGCCTCCAGGACCCG 

CCCCTAAGCAGGAATTTTTCAGATCTCCCTTCTGGGTCCTTTGGTCCCTAAGTCTCTGGC 

TTTGGCATTCCTGGTGGGAATCCTTGCGGAGAGCCATCCTGGTACCTGCCCG 

Sequence 975 

CGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCCNGGTACANTTNAATACATTANTG 
TAGTAAGNTATTAANTGGTGCCCCTATGATCTNCGAGAGGTAATACACTATCACGTGTTC 
CAAATTTTNACAGGAAAAGAATCATAGANTCCTATANCTGAAGGGGGCTNTACCGGGNTC 
TACAAANGCCTGCCAGGTGCTNGGATNTCCTN CATC AC ATNC AN CCATG AN AAGTT AC TT 

GTGTCATGGTACCT 
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Sequence 976 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGAAGTGTCTCTCAGTCAGGACAC 

AGCNTGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTGCnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTCTGCATCCAGTTTGCAAAGnnnnn 

nnnnnnnnnnnnnnn nnnnn nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnCAACCTGAAGATTTTGCAACTTACTACTGTCAACAGAGTTACAGTATCCCCCTCACTG 

TCGGCGGAGGGA 

Sequence 977 

ACCCACTATAGGGCGAATTTTNAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTG 
GGTGGGGGCTGGGGGGGGTNCCGCCGACCGTTTTTCTCTTCCGGCCAGGTGCTTTTCTGT 
CAATTTCTATGGAATGCAAMGGAGGTTTTTGTTTTATTTTGTT^ 

GAAAAAAATCACATCTTATACTTGAGCCTCCATACTTAAAAAAAGAAAAGAAAAGAAATC 

AATAAAAAGAAACTGGGGCGCAGTT 

Sequence 978 

GGCC GCGGGGGCTGCGTMCCATGTTTNCATGAAACAAGCCAGCCANTTTGAGGNGTGCT 

TTTTGAGTGTAAGGCCATTATCATGGACTnTTCCCCAGCTGCAGGGTTATCTACAACTC 

CGGGTGGAAGTCAAAGATTGGAAGAGGATGCTCTGATCCTGGAACTGCATTGCTTGGTGG 

TCAGGAGGGGTTCTTAGTAAGCAAACTCAGTTTAATCCCAGAGTCGGAAGTGGGTGTCAA 

CMGCCAAAGCAACATCCTTTCAGCTCTGCTGCTTTTTTGTCTTGGTGGAGGAATAGCCA 

ATGG AAG AAGATGGAGAGTCCGGACCAG AAAAAAAAAAAAAAAAAAAGTAC CTGC C CG G 

Sequence 979 

GGTGGCGGCCGCTCTATAACTANGNCCACACCCTGNTATTCCTGCAGGTNGCCTTTTGCA 
GCTNCAGGCNNTATGGAACAAATGAAGGTCNATGTCGACTCTAATAGAAGTAATNGTNGA 
NNG GTGTT CTTCACATCCACTTCTGTTNNTGATTGAGTTAGGCATCTCTTTCATGGTAAA 
ACCCTTTTCATTAAACACAAAGAAAGC [TTTTTTTTTTTTTTTTI 
Sequence 980 

GAATNGGAGCTTTTCGCNGTGGCGGCCCGAGGTACAATAACTNGCCTGAANTTCTATGGC 

AACAGGCATTTANTNAGCGGAGCGGTTTTATGGNCTCATCTGTATCTGGGATGCAAAGAA 

ATGGGMTGCCTGANGTCAATTAAAGCTCACAAAGGACAGGTGCCTTCCTTTCTATTCAC 

CCTTGGCAAGTNGGCCCTGTCAAGTTGGTACCTGCCCG 

Sequence 981 

TAAAAATCACGTCACTAT ATAGG CACTGCTGTATGGAAAACGCATTTTGTGTTTCTACAA 

ATTGTMGCGGA AATG CCTTTTGAGTACGCGGGATTTGGATGATNGACATAAGGTTTTTA 

GCATGTNCCTCCTTTrCTTCACCCNCCCCTTTTrTCTTCTATTAATCANGAGAAACTTCA 

AAGTTAATGGGATGGCCGGATCTCACAGGCTGANAACTCGTTCACCTCCAAGCATTTCAT 

GAAAMGCTGCTTCTTATTAACCATACAAACTCTCACCATGATGTGAAGAGTTTCACAAA 

TCCCTCAAAATAAAA 

Sequence 982 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTTGCCAGACCAGA 
GTCAGTTGTATCATCCTGGGGCAAGTATGAGGAGAAGCTCACGATTACCAGGCACCTCAT 
TGTGAACATGCTTTCTGCAACGCCTGCATCACCCAGTGGTTCTCTCAGCAACAGACATGT 
CCAGTGGACCGTAGTGTTGTGACGGTCG CCCATNTGCG CCCAGTACCTGCCCG 
Sequence 983 

AGGGCG AATTGG AGCT CCCC GCGGTGGCGGCCCGAGGNC l I I I I I I I I I I I M I I I I I I G 
TCTGMGTTTTTTCCCATTTTATATCGTCAATATC 

TCCAAGTTGATGTCCTGACCCAAGGCACCCCAGGTTTCCAAGGATTCGTAATCTCTTTGG 

AGCTGACCTATAAAAAGCAAAAGAGTTACTGAGAGTCACTTTCAGGGAAAGCTGGGTTGG 

GACTTCTTTGACTCTTATGCTTACCTTTGGAAGAAACATTGAAAATCAGAAGGAAGCTGG 

CTACAAATTTACATGGAAAGCTCATCTTAGGGGTTGGGTTCCTCTTTTGATTCTCCCCGC 

GTACCTGCCCGGGCGGCCGCTCTAGAACTANTGGGATCCCCCCGGGCTGNNAGGAATTCG 

ATATCAAAGCTTATCG 

Sequence 984 

TCGAATTG GAGC TCCCCGCGGTGGGCGGCCGAGGNCAGAGCCGCCAGGCGACTCAATTCA 
CGCGCTCCTTTTCTGGGGATGAAAAGGTNCTCACCANATCTTTCTAtGCAACTGAGAGCT 
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CCTAAATCAAGTCAAAGGGGACCATCGTACCTGCCCGGGCGGCGTCTAGAACTAGTGGAT 
CN 

Sequence 985 

CGCGGTGGCGGCCCGAGGTAC I rTTTTTTTTTTTTTTTTTTI I GCGCGCCTGCTGCCTTC 

CTTGGATGTGGTAGCCGNTTCTCAGGCTCCCTCTCCGGAATCGAACCCTGATTCCCCGTC 

ACCCGTGGTCACCATGGTAGGCACGGCGACTACCATCAAAGTTGATAGGGCAGACCGTTC 

GAATGGGTCGTCGCCGCCCCCGCGTACCTGCCCG 

Sequence 986 

ATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGGAGACCAAGGGCTA 

ANGCTGGGAGGTGAGTCTGTCACCTTGAGCCGGGCGAGCGCTGTGGGCCAAGCAGGGGTT 

GCAGGGCAGTAGGAGTGCAGACTGAAAAAATGCAGACCGCCGGGGCATTATTCATTTCTC 

CAGCTCTGATCCGCTGTTGTACCT 

Sequence 987 

TATTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCATGGTACAAGCACCA 

GCACCAAGGGATAATCAATGGGTGGCCACCACGCATTCACAAGTATGCCTAGGTTTGAAG 

CTGTGGAACCTTTTAGCTCCAAGTTCTCTGCATGGGTATTGGGGCAACCTAAACTCCTAA 

TCTAGGAGAGTGAGTACCT 

Sequence 988 

GGCGGCCGCCCGGGCTTGTACCTACTTANTTCCAGCTGATGNTTGACCAAAGAATTTGCA 
GGTGGATGGTTTGGTATCACTGTAMTAAAAAGAGGGCCTGAAAAAAAAAAAAAAAAAAA 
AAAAAAGGAGTACCCAAGATC C CC AAATAAATTAGTAAAAG N GCC ATTAAGAATT ATC AG 
ACATAGAATGTGTTTCAAATACAAATTNGTTAAATCCAAAAGATACAGTAATTTTATTAC 
ATAAAATTTCTCCTGCGNGCTAAAAATGCTAACTTACAAGACATAATATTTGTAGTAGAG 

Sequence 989 

GNGGGNCTGGGACAGAGTTCACTCTCACCATCAGCAGCCNGCAGNCTGAAGAANNTGCAA 

GAATANNACNGGGCAGCAGCATAATAACTGGCCCCCGNACCTCGGCCCGCTCTAnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnANNCGANAACAAGCNNATCGATACCCGGNNACCCAG 

AAGGGGGGGGCCC 

Sequence 990 

TAGNGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCATGACGCTGGAGGAGA 
TGGNCTTCGAGAGGGGAATCTGGTCGGCAGCCCTGAATGGAGACCTGGGCCGAGTGAAGC 
ATTTAATCCAGAAGGCCGAGGACCCAAGTCAGCCCGACT 

Sequence 991 

CCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTTTTTTT Tl ITTT TATCATGAANAACA 

CAAAATATTACCCAAAGATGAAGATTTAACAAAATTAATAATTTTTACTGTTTCACCA^ 

GACTTTATAGTGAAACCGAC I I I I I M I I I I I I ACACTGCAAGTATGTGGCAGTCAANAA 

CNCAATGGCTACTAACTAGCACCGTTTCATGCCACTAATGCTGATTCACACTACAGTGCA 

AGTAATTTMTTTTACCCATGGNGCTNTTGCNCCATCAGCACAAATGTCAACACAGTGGA 

ANAGCCAAACAACATTTTAATTTTTTTG 

Sequence 992 

CTACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGAATTTCCATCGGTTTGC 

TGAGAAGCACAACTTTGCAAAACCCAATGACAGCCGTGCTCTCCAGCTGATGACCAAATG 

TGCGCAGACTGTGATGGAAGAACTAGAAGGATATTGTGATCGCGTATGGACAGAGTGATG 

AGTACCTGCCCGGGC 

Sequence 993 

CCCGGGCANGTACATCAAGCTTTTTTTACATCAAACCCCAAGGCTCTCATTGCCACCTTT 

A/\TTCATGATAATCTATTGCTTCATCTTTGTCTGTATCAAATAGTTCAAAAGCATCTTTA 

ATTTCTTGTTTTGTCCTCAGACAGTTCTCTGANGNTTCCTCTTTGTTTTGTCCACTAGAA 

GCTCACTTCTCAGAGCTAAACTCATTATCTCTTCGCACAGAGACGTTCCTCTCAAGAACG 

ATTTTAACCCCCTACCCAAGGCAGCAAGACGCCCACAGCCGTTCAACAGACACGAACGAC 

CTCAGCGGCCTCCCCCGCGTACCTCGGCCGCTCTAGAACTAAGTGGATCCCCGGGCTGGA 

AGGAATTCGATTTCAAAGCTTATCGATAC 

Sequence 994 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACTTTTCTAGAAAAGCCTACA 
TGTATATACTTAGTTACAGCTGCACTTCCCCATTACTTATTTTTAGGAAGGTTATAGAGA 
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TGGAATAGATGCTGGCAAAGCAGTTACTCTTCAACAGGGCTTCAATCAAGGTTATAAGAA 

AGGNANANGTCANT NTANA CTATGGACNACTCCGAGGMCATTGAGGTAATTTTTAAAGT 

CTAAATGCTGAATCAT TTTM CCTCAATACTACTGGAGGATGTTTCTGTATAAATAAAGT 

GTTTAAACTGAAATGCTTTTCCTGGTGCTAAATACACTAAAGCGTGTCGCAGATCATAGA 

ATTATATTGCCT TCAA AAGGTCAAAATCNTNATNAGNCTCATCTACTTTAATGTGTGAAC 

TACAATATTGTCTTTTCGTGCAAAGAAATGGTAAAGAAGATGTATACTTCTGCTACCTGA 

ACAATTATCTATCTCATTGAAAGGTCTTCAGATTTTGA 

Sequence 995 

GAATTGGAGCTTTCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGTTTTTAGCTTTTT 

AMTCTGCTTTGGTATACTCCATANNNTTTGTGCCATTGGATTATTCTGTTCCTATAGAA 

ATCCCCACTATAAAATGTAAACCAGACAAACTTCCATTATTCAAACGGCAGTATGAAACC 

ACATATTTGTTGGCTCAAAAACTGCAGATCCTTGCTGTTACGGTCACACCCAGTTTCATC 

TGTTAC CTGAC AAATTAAGAAGGGAAAGGCTTTTGAGACAAAACTGTGCTGATCAGATAG 

AAGTTGl I I I I AGAGCTAATGCTACTGCAAGCCTTTTTGCTTGGACTGGAGCACAAGCTA 

TGTATCAAGGATTCTGGAGTGAAACCAGATGTTACTCGACCTTTTGTCTCCCAGGCTGTG 

ATCACAGATGGAAAATGCTTTTCCTTTTTCTGCTACCAGCTAAATACTTTGGCACTGACT ' 

ACACAAGCTGATCAAAATAACCCTCGT 

Sequence 996 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTTCACCGTGTCAGCATTTGTTGAATTNGCACTT 

ATTGTTNAATTTAGCTCTGGAACMTGCAGGGMTTTGMGTTTCTTGTAAATAACCACA 

ATTAGGAAAAAACCATACAGCTCAAGGAAAATCCACTAGTATANCCAAGATACCCTAAGT 

TCTTCAAGAGACACAGANGGGAGAATTATGCCAAAGGTAACTATCACCACCAGAACGCGG 

CCATCCACGTACCTCGGN 

Sequence 997 

CCGCGGTGGCGGCCGAGGTACGCGGGGGACGT7AGGTGTCCGCCGGAGGTGTCGTTGGTG 
TGTTGCGCGACTGGCCTTGAGGGAGAGCTGGGGCCTGCTCCCGGAGAGATACCGGCTATG 
TCGATCGAAATCGAATCTTCGGATGTGATCCGCCTTATTATGCAGTACCTGCCCG 
Sequence 998 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTNCCAACAGAAACCTGGCCAGGCTC 

CCAGGCTCCTCANCTATGATGCATNCAACAGGGCCACTGGCATNCCAGCC 

Sequence 999 

CCGCGGTGGCGGCCGAGGTACTCACGCCTGGTCGGGTGGCAGCTCGCGGCGCAGCTCGTC 

CATGGTAATGTAGTTCTTGTCCCCAGCCAGGATCTTGAAGGAAGCCATGACTTGGTCTGC 

TGTATCTGTGTCGGCTGTCTCGCGGGACATGAAGTCAATGAAGGCCTGGAATGTCACTAC 

CCCCAGGCGGTTGGGGCCCACGTGGCATCGATCCTCCCTGCCCGCGAAGTGACAGTTTAC 

AAAATTATTTTCTGCAAAAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGCCGCTNTAG 

AACTA 

Sequence 1000 

CGGCCGAGTGTGGCTCTTCTGGTGTTTCAGCTTGGGGAGAGAGGAGTGGCCTTCCTCTTG 

CAGTTGAGGCCGGCGCCGAGCCGGACTTCAGCCGGATCTCGTGGCGGAGCCCATCTTGCT 

CCCTCTCCAGGCCTTTATCCGCTCCCTAGGATTCCCGGGCCCTGTAGGTGGGAGTTGGGA 

GACGACAGTACCT 

Sequence 1001 

AGGTNCCCCTGCGCCAATCATCACAGCCTGGTNTGTATNAAGTGTGTGCCAAACTGCCTG 

CGCTAACNACCAAATGCAAATGTCATACNTCTNTGTCACTAGCANCATTAAGTTNCTNTA 

GAGGACACGTTGTGAGCCACACCTNAAAAGACAAGCCTTCATCAATCTTTTCCTGGAAGA 

TAGCTTG ACCTAGC AAATTCATTATGTTC AAAATAAN AG NCTATCTTTTCTTTN GTACN C 

TGGCCCCGGNGCNGGGCNCCGCTTCTATNGAANCTAAGTTGNGTATTCCCCCCGGNGCCT 

NNCCAGAGAAATTTTCTTATTATTNCAAAAGNCTTrNATACCNATTTNCCCCGGTCCGNA 

CCCCTTTTGGAANGGGGGGGGGGTGCCCCCCGGGGGTTAACCCC 

Sequence 1002 

CTA CTTA GGGCGAAT TGGA GCTCCCCGCGGTGGCGGCCGAGGTAC I I M I M I I I I i I I i 

TTG7TTTAAAGTAGTATTTTATTTCTCTGGGCTGTGCACACTTAAGGCATTAGATCCACA 

GATGGGCTCATCCGCTGGGATGCTACTTGTGGTGATCATCTCCTAAGCCCCACGTTTTAG 

CCTTGTACCTGCCCG 

Sequence 1003 
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CCGCGGTGGCGGCCGAGGTAC 1 rTTTTTTTTTTTTTTTTTTn 1 "I ACCAAGTCTTACAGT 

GATTATTTTACGTGTTTCCATGTATCTCACTTTGTGCTGTATTAAAAAAACCTCCATTTT 

GAAAATCTACGTTGTACGTGGGATACAGGTCACGGGCAGAGCTCCTGGCCTCAATGATGC 

CTCCTGATCTATCGCTGGGCCTGGACGACCAACACTGGGATGATGACNAGCAGAATGGTC 

ATGAAGATGCTCAAAATCAGGGCCCANATGTTCAGGCACTTGGCGGTGGAGGCATAGGCC 

TGGGCCCCG 

Sequence 1004 

TACTATAGGGCGAATTNGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCTCTG 

AAGAGGAAGCAGCTGGTTGGACAGGATTTCTTGAAGAGCCAGGTGC TAAG GGCATCAGGT 

CGACATCCATAGTAACCATGTGCCATAACATCTACACATTTCCACTTGTTTTACAGACAA 

GGTAACAGGCAGAAGGAAAATCCAGAGTCCTGCAGTAAGCAGATGACAAAACTTCAATAT 

GCTTGGGCACCACTTAGGTGACCCCAGGGAGATTTAGTGTGGCCTTAGGAAAGCAAAAGA 

GCACTTTTTATTGGAAATATGAGCTTGTCACTGGGAAAGATTTGTAAAATTGATCAAGAA 

CTTGATTTATAATTATGCCTCAAAAAAAAAGTTCTCATTTAGTAGTGGAGCAATCTAGAA 

AACATACCI MM IGTT 
Sequence 1005 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAAACCC 

CTCAAGAAAGGGGCAAAAAGGCCCCGAGTCACTTCAGGTGGTGTGTCAGAGTCTCCCAGT 

GGATTTTCTAAGCACATTCAATCCAATTTGGACTTCTCTCCAGTAAACAGTGCTTCTAGT 

GAAGAAAATGTGAAGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCGCGCTTGGCGGTAATCATGGGTCATAAGCCTGTTT 

CCTGTGTGGAAAATTGTTATTCCGCTCACAATTTCC 

Sequence 1006 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTTTTT 

I n I M I M M M I M M l M I I ACTTTGAAATTACTTTAATTTAAAAATANAAAACATN 

TTGAAAGGAAAAAAAAACCCACAAAACATACAGGCAGATT TGTAT TCTGTGAGTTTCTNA 

CACCCTCACACTTGTTCACCAATNTNTMNATGAAAAACTNTTTTCCAAAATGTATTCAG 

CCNCCAACAATGGNGTTATTAATAGGAATATGGATCAATTTCACAAANAANCCAATGAAT 

CCCATTNTA 
Sequence 1007 

GGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCAAAGACTTATGAA 

TAGATTTATGAATTGGAGTAGAAAACATAGACATTTCTACCTTGAGGATATAGAAGGGAA 

CTTAGGAAGTGAGAAGTCAGATTACCCTAGTCCTCTAGGGTGCTTAAAGCTAACTAGTCC 

CGTGAATGAGAATCTAACCTATGTGAAAATCTCCAGCCTGTACCTnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnGCTrrrrGTTrCCCTrTAGTGAGGGGTTAATTGCCGCCGC 

TTGGGCCGTTAATTCATTGGGTCATAAGC 
CCGCGGTGGCGGCCGCCCGGGCAGGTACM 

TATTACCAGTTTGGTGGTCTTCTAATGAGAAATCTAAAGCACACCTCCCCACCCACTCGG 

ATATCTGTTTCTGATGATCACGTGACATCTAGCATATACGTCTGTAGGAAAAGGGAAAAA 

GCAAACCCATTTCACAAATGGCTAAAGTCAATCCCGCGTACCTnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn nnnnnn nnTCGATACCCGTCGACCTCGA 

GGGGGGGGGCCCGGGTACCCAGCCTTTTTGTTTCCCTTTAGTGNA 

CCGC^GTG^GGCCGCCCGGGCAGGTACATCTTCTTCCGGCTGATMTTGCATTATGGAT 

GAACACAAACGAGAAATTGCGGAAGCTAAGCAAATTGTTAGAAAGCTTACGATGGTTGTG 

TTGTCTTCTGAAAAAATGGATGAGCGATAAAAAAAAAGAAAAAAAAAAAAAAGTACCTnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnrnnnnnnnnnnnnnnnnnnnnnnNGAGCGGTTAA 

TTGCGCCGCTTGGCCG TAATC ATTGGTC ATAAG NCTGTTTCCT 

Sequence 1010 _ 
CCGCGGTGGCGGCCGAGGTACCATGATGATGACCTGGCAGATTTGGTCTTTCCCTCCAGT 

GCGACAGCTGATACTTCAATATTTGCAGGACAAAATGATCCCTTGAAAGACAGTTACGGT 

ATGTCTCCCTGCAACACAGCTGTTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTGCGCGCTTGGCCGTAATCAT 

GGTCATAAGGCTGTTTCCTGTGTGAAAATTGGTTATTCCCGCTCACCAAATTCCACACCA 

AACANTACCGAAGCCC 

Sequence 1011 

CTNCTTAGGGCNAA7TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTAGGAT 

AAGGAGGAAGGCAGTAAAGCTGCAGTATGTCCTTGAAGCGTGTCAAAGTGGTATGGTAAG 

GAAMGGAGAGTTTTATCTCACAMGCCATAMCACTAAACAACTTMTT7TCCATTCCA 

GAAMTCAGCAGTCATCAAGACAGCACTTGGTAAAACATTTAAGATTTGTGATAATAACA 

AGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGTTACCCAGCCTTTTGTTCCCCTTT 

TAGTGGAGNGGTTTAATTGCCGCCGCCTTGGGCGTAAATCATGG 

Sequence 1012 

CCGGGCAGGTACGCGGGGTGTGAGACTTCGGCGGACCATTAGGAATGAGATCCGTGAGAT 
CCTTCCATCTTCTTGAAGTCGCCTTTAGGGTGGCTTGCGAGGTTANAGGATTTGGTNGGT 
TTGGGTGGNGCC TGTTTCACCG NGGNAAAAAAGGGGTTGNCCCCCCCGGGGGGGGNAAAA 
AAAAAACCCCCCC I I I I i 111 I IGGGGGGN I I I I M I i M I CCCCCCNGGGGGNGNTAAA 
AAANGGGGGNNN TTTNNAAAA AAAAAI I I I I I Ml I NCCCCCCCGGNGNNGGTTNTCTCC 
CCCTTTTCCCCCC I I I I I I I I I NNAAAAAAANAANGNG' l HI I I I ll I AAAAAAAAAAAN 
''"Ml UGGGGG GGGGGGGNTTTTTTA I I J I Mi I A ACCCCCCCCCNTTTTTTTNGGG 

ggggag i rrm innggggggnnngnnaaaaaaaaaaaaaaaaaaaacccccccnaaaan 

AAAA AANNCCCCCCCCCCCCAAAAAATNTATAAAAAANANATAAAAAA 1 1 I J 1 1 1 1 I 1 M 
TTTTNNAA MNTN NNTAAAAAAAAATNNNM 

ctttnatcattttattanggggggggggggggtaataaaaaaaaaaaaaa i inmm 

TT 

Sequence 1013 

ccgcggtggcggccgaggtacaccaaggcgaccacagcagctgcaacctcagcaatgaag 

atgaggaggaggatgaagaagaacgtcacgagggcacacttgctctcagtcttagcacca 

tagcagcccaggaaaccaagagcaaagaccacaacgccggctgcgatgaggaagtagccc 

acgttgacaaactgcatggcactggacgacagtggcccgaagatcttcagaaaggatgcc 

ccatcgattgacacccagatgcccacttgccaacagggcttgcaccacacaggaaagatg 

aagccaaattgmgaaggatcatcattgggtctttaatgaaagcctggaagncacntgcc 

atgggtgggctccctgtttcaggg 

Sequence 1014 

actacttagggcgaattggagctccccgcggtggcggccggggaaggtcagcgccgtaat 

GGCGTTCTTGGCGTCGGGACCCTAGCTGACCCATCAGCAAAAGGTGTTGCGGCTTTATAA 

gcgggcgctacgccacctcgagtcgtggtgcgtccagagagacaaataccgatactttgc 
ttgtttgatgagagcccggtttgaagaacataagaatgaaaaggatatggcggaggccac 

CCAGCTGCTGAAGGAGGCCGAGGAAGAATTCTGGTACCTCGGCNnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnNAATTCGATATCAAGCCTTATCGATACCCGTCCGACCTCGAA 
GGGGGGGGGCCCCGGTACCCCAGCTTTTTGTTCCCTTTTAG 
Sequence 1015 

TTCGAATTCAAGAAGGGCAGCCAGCAGAGGATGAACACAAAACCTTAGGTCACTGGGTGA 

CAATCAGNTGACACAANAAAAGTAGGCACAGCACNTGACNNGNTGTATTTGACNNCACTT 

TGCAAGAAACACTGACTTGANNTCANCTTTNAAANGTNTAGCAGTTNCCTATAATNANAT 

GGCCANTACCCTNTTCCATC CCCTC TNCTTAGGANCANTATTCCCNTTTCCACNCANTAN 

AGGTAAGGAt4CAATTCCTNATTTTTNTTAAAAAGGGGNCAAANNAACCTTGGGNTCCNCA 

ANANANAANCN GGTTN TCTCGNTTGNTCNANTTTCTTAGTTNCCAAACGTTNCTTTCCNG 

ATNGGTTATGN I I I I I NTAAACCNAAANAAANANAANTTTCCGGTNGGGGTTTCTTTCTT 

TNTTAATTN 

Sequence 1016 

TGGAGCTCCCCGCGGNGGCGGCCGCCCGGGCAGGTACTTTTTTTTTTACAAGAGTGCAGA " 
AGAGAGCGAAAAACTCGTGAAAGCTGAAAGAAAATTCATTGAAGATAGGGTTAAAAAAAT 
AAT AC G AACTG AAAAG G A AAGT CTGTG GC N G ATTC AC ATAAAG G ATTTGTTGTTATT AAT 
CAAAAGGGAATTGACCCCTTTTCCTTAGATGCTCTTTCAAAAGAAGGCATAGTTGCTCTG 
CGCANAGCTAAAAGGAGAAATATGGAGAGGCTGACTCTTGCTTGNGGTGGGGTAGCCCTG 
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AATTCTTTTGACGACCCTAAGGTCCTGGACTGCCTTGGGGACATTGCCAGGGACTTTGTA 

TATTGAGGTATACCATTTGGGGGAGGAAAGAAGAAAGTTTTA 

Sequence 1017 

ACGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGCAAAACAGTTTTCCTAATCTCAGCAGTATCCAGTGAGTGAAGAACACTTGACTGACTC 

TTGGGCCACCTCTGTTACTTACTGTACTATGGAAGCTCCTGGTGMTGTTTACAATTATG 

GGATGTAGTATTTCTATTTGTACTTTAAGTCAAATGCTTATATGAAATATGTGACAACAA 

ATAGAGAAGACTGGCTCTGTTAGTAATTATGCAGTATGTACCTnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCTTATTCGATACCCGTNCGACCT 

TCGAGGGGGGNGGCNCCGGCTACCCAGCTTTTTGTGTCCCTTTTAG 

Sequence 1018 

NTGAGGGGTTMTTTGCCGCGGCTTGGGCGGTAAATTC ATTGGG GGG rTTTTTTI CCCCC 
CCAAAAAAAI 1 S M 111 1 AAAAMGGGGGGGGG NCCCCNT TTTTTGGGGG GGNATTTN NA 
rTTTTTNNTTTTI I ANAAACCCCNNACCCCCCCCTTTTTTTGGGGGGGGGN l l l l l H iG 
GGGGGGGNNI rTTTTTTI I GGGGGGGGGGNNAAAAAAA ANGAAAAA AAGGGGAAAAAANT ■ 
rTTTTTTTTTTTTTTn I GGGGGGGGGNGG NNTNTTTTNN I M III I 'I AAAAAAAAAAAT , 
TTTTTTNTGNCCCCCCNCCCCCCCCCCGGGGGGGGGGGCCCCCCC I I I 11 1 I I I 1 1 11 TT 
CCCCCCCCC AAAAAAAAAAAN CCCCCCCCCC AAAAAAAAN N N AAAAAAN NNN N NTTTTTT 
TNNNTTTTTTTANNNNNNNNCCCCANNNCCCCCCCCNAAAAAAAAAAAAACCCCCCCCCC 
AAAAAAAAAAAAAANNCCCCCCCCCCCAAAAAAAAAAAGGNGAANAANGGNGGCNCCCCC 
CCCC 

Sequence 1019 

TGAGGATCCTGTGCGCGATCGCAATGGACAGCTACTAATGGTCACAACGACCTCTTTCTC 

TAATCCTTACTTTATACATCAACCTTCAGGTGGCTCCCCAAAATGTTATACCAAATCTGT 

AAAATGCAAATAAAACAAAAGACAAATCTCCATCACGTACAACATGAACTGAACAGCCAT 

TTAAAAATTCAATGGGTGAGMCTTCTTGGACCTCATTGGCTAGATTCCTTTCTTAAAAT 

CCATACACCCGNAATTTAACAGGGAAGGTTCCCCATTCCTTGGAAGGAGCAAACTTTCAG 

NATGGTACCAAGACCTTGGAGNTGAAAATTATATTTT 

Sequence 1020 

NATACCAGGCGTTTCCCCCTGNNAAGGCNCCCNTCGTGGCGGCTTCTTCCATTTGGGGGG 
I I I I I I N I I l I M I I NCCCCCNCCCCCCNGGGGGGTNGAAAAAAAAANCCCCNGGCCCCC 
CTNCCCN J Mlll l 1 NNGGGGGGGGGNNNCCCCCCCNGACCCCCCCGGGGGGGGGNNNAC 
CCCCCN I M I I J I II I I I I I 1 I I 1 1 I A AAAAAAAAAACCCCCCCCCNCCCCCCGGGGGGG 
GGGGGGGGGGGGTTTAAAAAAAAA I I I I I I I I I I AAAAAAAATACCCCCCCATCCCCCCT 
lllllll I NNGGGGGGGNN I \ I I I I IN I I I I CCCC CCCCCC CCC CCCGG GGNGGGGNGA 
TTTNTTTNTNTCCCCCCCCN N N ATTTTTTTN N AG AATTTTTTAAN N ATTTTGTN N ANANN 
CCCCCCC 1 1 I I I I I 11 1 1 I I NTTNNNNAANNNGNCCCCCCCNNTTCCNCNNN A I un iN 
TTTTTNATTATANTANCCCCCCCCCNCCNATATTTNAGGGNGNTTTNTNNGNTTTTTAN 
AAAAAAAAAAAAAAA 
Sequence 1021 

CCGCGGTGGCGGCCGAGGTACTTACAGGGGACCGCCAGGGGCCTCCiAGAATCGGTATCCT 
GAGTCCTCTTGAAGAGCAGTAGAGGTTGTrTCATTAAGTGGCAAACACACTTAGTTGCTT 
AATTTGAAAACNGGGGGGACACNAATACNGGCANCTCGCGNNGGGGCiGANGGGGGNTTTT 
AAAAAATTTCCCCCCGGTTTTTCTNAANAAAAAAAAAAAAACCT(TNAN NCNTA TAATAA 
ATTTA 1 mm I NAATNTNGGNNCCCCNCNACCCCCNAAAAAATNTTITrTTTNTAT TT 
TGGGGGGGGGGGNCCCCCCCTTTTTNTAAANAATANAAAAAACNCNCC CCCC NCCl Mil 
TTTTTNCCCCN I I I I I I I I I I ANAAAAAANNGGATTGGGNAAAAAAATrTNTT TTGA TTT 
TTTTTNNCCCCCC I 11 1 111 I AACCCCCTNNGGG GGGGNGGNGG GGGNGNTTTTNNCCTC 
NTTGNGCCCCCCCCNNAAAAAANAANAACCCNC H 11 M I I I I 1TGGGGGGGGGN GNNGG 
NGGGGNGNANTANAAAAAAANGGGGGGGGGGGN I WWW I NTN;MTTCCCCCNCC t l l l I 
r illl NGGGGGGGGGGGAATAAAAAAAAANAAAAAAAAAA 
Sequence 1022 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGACCTNTAACTCCCC 
CTGACACAGAGCAATTAGACTCCCATAACAATGGTATCAATTATACCACTCCATTGGAGG 
GACTTCCTTTATGTGTCACCCAGGATACATT GCTCAA CTGCAAGTTGGCCTTTG GGGG GC 
CCCCAAAAAAGGGGGGGN I I I I I M I I I I I M I I i I I GGGGGGGG AAAAAAAAN ITTTTN 
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T NCCCCCCC CCCCCCCCCCCCTAAAAAA AAAAAA AAAGGGGGGGNNCCCCCCCCNAAAAA 
A I I 1 1 i I 11 1 NGGGG G GGGGGGG GGGGG I I M I I 1 I I 11 1 1 1 I GGGGGGGNNAAAAAAAA 
AGGGGG GG1 I I I Ml I 1 HI 1 1 I I ANAAAAAAACCCCCCCCCCCCCCCCCAAAAAAATTT 
MIIII 7'AAAAANNMNAAANTNTNTNNAAAANNNGNAAAAAAGGGGGAAAAAANTNAAA 
AAANCCCCCNAAANNN TN I M I 1 I I I I I I 1 I I I I 1 I i N AAAAAAAAAAA I rTTTTTTTT G 
GGGG GGGGGGGG I I I 11 I I I H I AAAAAAAAAAAAAACCCCCCCNTTCCCCCCCNTTTTT 
TTTTTTGGNGGGGGGGGGGNNCCCC 
Sequence 1023 

CCGGNCAGGNACTTGACCTTNATCTNCTAACCATTCTTTTATGTCATCAAGAGTTGCATC 
ANNCGGGAAGCCTTTAATATNAACGGATCTGNNNTNNTACATCATTCNTATACTCANCAG 
TT TCACCTTTCATAG TTTAGTGGAG I I I I I I I I I ATTTTTTGGGGNANAAAACNCCCCCN 
NC I n I I I I I I I I I INTNNAGGGGGGGGGGGGGGNTTTTTTTAAAAAAGGGGGTGNTGNA 
AAAAA ANNGNCCCCCCCGNAAACCCCCC 1 1 I I I I INNGGNN 1 1 I GNCCCCCCCCCTTTT 
TTTTTNGGNGGGGGGGGGAAAAAAAAAGNTTTCCCT A I I I I I NGGGGTTNTGTN I I I I I I 
AANAANAAAAAAAGGGGGGGGGGGN Mill ) NTTTTAT NNANTNTCCCCCCGTNTTTTT . 
GNAAAAAAAAATNNTTTNTTTNGGCCCCCCCTNTTTTTTTTNNNNCCNGNGCCCCCCCCN 
AAAAAA AAAAAANCCCCCCCC I IllMlllll I NGGGGGGGGGGGGNTTTAAAAANAAAT 
TTTTTTNNT AGNTTGNTNTA GNTAAATAAAAA AGAGG GNCCCNNNTTGCCCCCCCNANAA 
AAAAAAAAAl MI NIMI I NGGGGGGGGGNNTTTTTAAAAAAAAAAANNGGGGGGNGTT 
I I t I I I I 1 I 1 1 I I I 
Sequence 1024 

CCGGGCAGGTACTGACCTTGGAAGCTAACCCCTGAGTATGATGCAACTCCACTCTAATGT 
AAATTAAAATGCCATGATCTTAAAAATGCCATAATATNTGTCAGGTATAATTTTAATTTT 
C CCAGGNT TN AGGTT TCCCAATC I l l M M I NTCCCCAAAAAAAAAACCCCCCCNAAAAA 
AAI I I I I I I NNTTTTTTNN TTTTTTTAAAAA A 

GGGGGGN 1 1 M M 1 1 TTTG GGGGGG GNG I HU H I GNGGGGGGGGN M I M I M l IGGG 
GGGGGGG I M M II II U ICCCCCCC! I I I M I I NNGGGGGGGGN I I M M I M IGGGGG 
NNN GGGGGGGNATN TNTTCCCCNTNTTCCCCCCCCGGGGGGGGGG l MINI M NNCCCC 
CCCI M M I M I M I INCCNCCCCCCCCCCCI MINIMI I NGGGGGGNNNNNNNNNNN 
NNN'l I I I I I M I INGGGGGGGGNGTGTTCCCCCCCCNGNGTCCCCCCCNAAAAAAAAAAA 
AANGGGGNGGGGTGGNCNCCCCCCCCCNAAAAAAAAAAAAAA I I I N I I I I NNN I HIM 
TTTNI MINI I NNNNNNCCCCCCCCCNAAAAAAAAAAA 
Sequence 1025 

CCGGGCAGGTACAGCACTGGAACTTCTTGAGCAGGAGCATACCCAGGGCTTCATAATCAC 
CTTCTGTTCAGCACTAGATGATATTCTTGGGGGTGGAGTTGCCCTTAATGANAAACCAAC 
C AGTAATATTTT GGTGGNTGTCACTCAN N7TGGGG GGGTTA I INN I GGGGGGGGNTTT 
lllllllllll I GGGGGGGGGGGGGGN l INI I AAAAATTTGTAAAATNTTAAAAATNTN 
AAAAAATTTAAAAAANNTCNCCCCCCNNAAAAMNNTNNCCCCCCCAAAAAAAAAANAAA 
AAAAN I I M M I I M II M I M I INNAAAAAAAAI NUN I GGGGGGGGGGGNNTTTTT 
TTTTT AAAAAAAAAl llllllll I GGGGGGGGGGGGCNCCCCCCCNAAAAAAAAAGNGGG 
llllllllllll I NGGGGNTTTGGGGGNATTTTCCCCCCCCC I lllllllll NNTTCCCC 
CCNNN AAAAA AAAAAAA I I N II II N I NGGGGGGGGGNNNTCCCCCCCCCCCCCCNTTT 
TTTTTNAAAAAI lllllll I GGGGGGGGGGGGGGNTN lllllllll! I AAAAAAAAAAAA 
' NNGNGGGGGGGGG.I II I I N N I! I M I I M I NCCCCCTATTTTCCCCCCCCTTTTNNCC 
CCTAANAAAAAAAAAAAAGG 
Sequence 1026 

TGGTCACTGGA ACCTGGC CTGGGGATACCCATNNTGGGCTCCTTAAGTTNGGG GGGGGGN 
NTNTTAAAAAAN TTTTTTTGG GGGGGGTTCCCCCCCCNAAAAAAAM 
CCAAAAAAAAAA I lllllllll I AAAAAAAANNGNGGGGGTNNNAAAAAMNTTTTTTTN 
GGGGGGGGGGGNAAAAAAAAAANCCTTTGNGGGGGGGNGTTAAAAAAAAAAGGGGGGGNN 
NGTCCCCCCNTTTTCCCCCCCNI lllll I NTTNGNNGTAN I I MINI I GGGGNTTTATT 
AAAAAAAAAAANGGGGGNNGNTANCCCCCCTCTCTCCCCCCCl lllllllll IGGGGTGN 
ATNTANATNTNNTTCTCNCTNCCCNTTTCNCCCCCCCCANAAAAAAAAAAAAGTGNGGGA 
NNNNGGNNNCAANNNCTTTTTNTTCCGNNTNTTTTNTT^ 

CCCNTNI II I II I I NGGGGGGGGGGGGGGTTCCCCCCCC I I I M I II MINI IGTNNGG 
NTTN AATTAATCNTTAT A I IHIIIIlllll 
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Sequence 1027 

CGAGCGGCCGCCCGGGCAGGTACAGAGATAAGATACAAAGATAAAAATGTTGGTATCAGT 
TGATTTTGAAGCAGGTAATTACTGTGCCACCTCACAGTGCTCAAAAATAGTTCAGNNCCT 
GGAATAGGGTTATNTCAGNCCCCATGCAAAATTTTTTAAAAAAAAAAAAAA 
CCCCCCCNCAAAAAAAAI rTTTTTTTTTTTI 1 GGG GGNNNGGGGN NNNNGGG GGNNTT TT 
1 1 1 1 1 II 1 I 1 M M I NGGGGGGGGNGNCCCC CCCn 1 II II i 1 TTTCC CCCCCCi M M i 
TTTTAAAAAAAAAAAAAAAAANCCCCCCCCC1 rTTTTTTTTTTTTTn INGNNCCCC CNN 
CCCCCCCNN rTTTTTm 1 'NTNCCCCCAAAAAAAAAAACCCCCCCCAAAAAAAAANTTTT 

rmrrm i ccccnncccccccc i 1 1 n 1 1 1 1 1 1 1 1 i a ctntntnnnccccccc nnncnc 

CCCNNNNNNCCCNTNNGGNCCCCCNTTAANNCCCCI rTTTTTTTTTTTTTn I NANCNNN 
NNCCCCCCCCCCNNNTTTTTTGG 1 I I I I I I i I NNAAAAAAAAAAAACCCCCCCCC 
Sequence 1028 

CCGGGCAGGTACTTATGTGAAAGGTAAAAAAGATCTCATAGAAGTAGAGAGTGGCTCATG 
CCTCTAATCTCGGNGCTTTGGGAGTCTAAGGTGGGAGGATCGCTTTGNCGGGCCTGNGTA 
GTTTTTGGTNACTCAAGNCTCTGNAGGTTNNCCCCCNAAAAA AANAAAAAATA TTCCCCC 
CAAAAAAAAA l rTTTTTTTTTTTTi I GGGNTNGTNGCCCCCCCN I i I I M I II IC CCCCC 
C N N AAAAAATATTAN NTNTG NTAAAATTTTTTAAAAG NGN N G AAAATTTTTTTT 
Sequence 1029 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCACACCGAAAGTAAAGAAAGATAAACA 
QAACATCTTTGAATTTTTTGAGAGACAGACTTTGTTACTTAGGCCTGGAGGCAAATGGTA 
TGATCTGGTAGGTACCAGGAGGTGAACANCTGGAACCAGGATTCACCAAAAAAAANGGGG 
GGGGGGGCCCCCAAAAAANANCCCCCGGGGGGGGGNA AAAAAG GGGGNNGTTAAAAATTN 
AAATTTAAAANGGCCCCCCCCNAAAAAAA I J I I * I I I J TNNT I I i I I AAAAAAA AAAAGG 
GGGNGNNNNTTTTTTN I I I I I I I I AAAANCCCCCC i rTTTTTTTTI ICCCCCCN MUM 
TTTTCCCCCAAAAAAACCCCCGGCCCCCC1 111 I I II M I ATNTNCCNNNNCCCCCCN NN 
NCCCCNTTNNCCCCNTTTNCCCCAMGNGGGGNTNCCCCCCCCGGGGGG GGGNAAT TTTT 
rTTTTTTI 1 CCCCCCCCN 1 I M 1 1 M 1 I NTTTGNNNCNNNN ANNCCCCCCCCC 1 11 11 ii 
. TTNN I HUM 1 NNNANCCCCCCCCCCGGGGGGGGGGTG M I H M I I 1 H 1CCCCCCCC 

Sequence 1030 

TAATTCAGGGGGGATAAGCGCAGGNAAAGNAACCATGNTGGAGNCTANTAAATTGAAAAA 
NNAAAAAAGGGGTTTGGGGGGNTTTTTCCCCCCTTTCCCCCTAAAAAANGGGNGGNNGCC 
CCCCTNTAAAATTTAAAATANNNTAAAAAATTTAAAAATGGGGGNNNTGGGGGGGGGNTT 
TNCCCCNTTTTTNCCCCCCAAAAAAAAANGGGGCGGGGGG GGGGGGNGNNA AAAAAAAAA 
AAAAAAAAATNCCCCCCCCCTCCCCCCCCCGGGGGGGGGN I M ll I I M I I I NANAAANT 
GTNAAMTXTTNAAAAMNTNAAAAAMTTTNAAAAAGGGGGGGGNGGGGGGGGGGNTTT 
TTTNNCCCCTTNTTTTCCCCCCCCCGGGGGGGNGNNTATANTCCCNCNNGGGG GGGGG GG 
GNGANN 1 1 i M I I 1 I ANANTTTTTTTNGGGGGGGGGGGGGNATTCCCCCCCCCN 1 I mm 
1HIUH I 'GGGGGNATTTGGGGAGNTNTTTTTTCCCCCCCCCCCGGG 

Sequence 1031 

CCGCGGTGGCGGCCGAGGTACCACCTGCTCCTCATCTTAGGAGTCTCCTTTTCAAATAAT 
TAGGCTCTGTTCCCATTTTAAMCTCTGATATTGGCCTTCACCTGTGACT GGACACCTT T 
ACNTANGAGG CGCC ATGTTTCAN CTATAAN C AANTA TAG AAAAG A AAAAA I U M M uN 
NCCCCCNTTTTTTNAAAAAAAGNAAAAAGNGAAAAAN1 1 1 M I 11 I NT NAAAAAG GGAAA 
AAAGGGGAAAAA A 1 1 1 I 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 GGGGGGTTTNGGGGN mm \ AAAAA 
AAANAAAAAAAAANGGGGGTTTTTGGGGGGGGNNTTAAAAAAAAAAAAAAAAN mini 
TTTTNAAAAAAANAAAAAAAAANCCCCCCCCCCNAAAAAAANNNNAAAANAGGNCCCCCC 
CCGGCCCCCCCCNAAAAAAAAAACCCCCCCGGGGGGGGTNNGTAAAAAAAAANNAAAAAA 
ANNNTNTTNTNGGGGGGGGGGGNTmNAAAAAAANm NAAAAAANNTNNTAA AAAAAN 
NGGGGGGGGGGGGGTGGGGAAAAAAAAANNNAAAAANN1 1 1 1 1 1 1 1 M 1 1 1 1 TAAAAAA A 
AAAGGGGGGGGGGGGGGGNAAAAAAAAAAAAAGGGGGGGGGGGGGN M M M M M NNTT 
TTTTTTGGGGGGGGGGGGGGGGGGGGG 
Sequence 1032 

AGGTACNCGGGGNCTTAANAACGAACGGNTTGGGCGCGGACTGGTATCCGGGGACTGTGA 
CTTGCAGGGTCCGCCATGGAGCCAGANCAGATGCNT GGAGGGGTA CAAACGTCAAGGTTT 
TGTC AAGTAAAAATTCCTTACNTCTG AAAAAN G NG G NG rTTTTTTTTI NAAAAAAATTNC 
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CCCNNAANCCCCNl I 1 I I II i I I JGGGGGGNNTTTCCCCNTNNATCCCCCCtTTCNCCCC 
CTATGNGTGNAATNGGGGNTA I I I 1 I I I I I NTTTCNCCCCCGNGGGGGGGGGGGGGGGNG 
NGN TNTTTCNTATTN NCCCCCCCGGGGGGGGCCCCNNCNCC ) I fl M M ) 1 1 I i CTCCCC 
CCCNT TTTTTTTTTTAAA NAACAAGGGNGNTTNGTTAAAAAAAAAAAAAAAAA^ 
CCCCNI rTTTTTTTn I AATAAAAAAAGGGG I I I M I I I I I I I I I I NGGGGGGGGGGGGG 
GGAG GNTTAAN AAAAAAAAAANA I II I Hi I INTNTTANCCNTTTNTTCCCCCCCCNCTA 
TATTAI I 1 II nNTTTNTGAANTANGNTCCNCCNNNNNTGTTTTTTATTTNGAGGNATTA 
TTATTTGGTNTATTTANCCNCCCCCCCCNCN 1 1 1 1 M ) 1 I 1 I NN 
Sequence 1033 

TGAACTCTGTCCCAGGACCCACTGCCACTGGAACNCTGGGCTGGNGATACCCAGGTGGNC 
CCCTGGTTGGNGGGG TNTAAAA AAATTTTTTTGGGGGGGNGNCCCCNNAAAAANNCCCCC 
NTCCCCCCAAAAAAAN I II II f i'NTAAAAAAAAGGGGGGGGGGAAAAAAAAA rTTTTTTI 
TGGGGGGGGGNNAAAAAAAAMNNNTTTTTNGGGGGGTTTTTAAAAAAAMGGGGGG 
TNCCCCCCCGCCCCCCCC I I I I I I I I IGGGGGGTNNNGGGGNTTNGGGGGTNTTNTAAAA 
AAAAANGGGGNGNTTNGCCCCCCCTNNCCCCCC I rTTTTTTI IGGGGGATNNNNNNATTN 
TCCC CCCCNTN CCCCCCCCAAAAAAAAMNGGGGGNNNNNGGGGGGNNNNNNNNTTTTTT 
NNNNlTTTTTTNN TNTTTnTNNN NNNCCCCCCCCCC I I I I I Hi I I GGGGGGGGGGGGA 

Nccccccccccc i nrmrn intggggtnnnnnggnnnnnnn i rrnrnrrrn i aaa 

AAAAAAAANCCCCCC 
Sequence 1034 

ATGAANNCTGGNC CTGGGGG ATACCANNNGGGGNCCNTCCCCCCCCCCC I MINI I NGG 
GGNGGGGGGGGN I I I I ) I I IGGGGGGGGGGGGGGGGTANAAAMNTTTTTTTNGGGAAAA 
ACCCCCCCCNAAAAAAAAANNCCCCCCTNCCCCCCCCAAAAAAAAAA I rTTTTTTTTTTA 
AAAAAGGGGGGGGCNCAAAAAAAAAATTTTTTTGGGGGGGGNGNNAAAAAAAAAAANNNC 
NCCCNGG NGGTn'nTAAAAAAAAAANNGGGGNGGNGGNNCCCCCCNNNNCCCCNTTTTT 
I I I I I I I I GGNAAAAAAAAAAAAATAGGGGGGTNTTTNAAAAAAAAAAAANNGTNTNTTT 
TTTNNNCCCCCNNNTAACCCCCN I I I I M I I I I I I I I I I GGGGGGGNTTNNAAAAAANTT 
ANNNNNNCCCCNTNNNTNNCCCCCCCAAAAAAAAAAAANAGGGNGNNNNGGNNGANNANA 
NNNNNNN7TTTTNNCCCCTNCNNTTN I I I I I I f I I NTTTTCCCCCCCCC I I I I I I I I I I I 

Sequence 1035 

TCGANGT ACAT GGGGGTCCGTGCGGGCANAACCCAGGGCATGAAGATCCAAANGGGCCTG 
GTTCAGCTTTTTNTCCAGGGCCATGGCAGCTTTTCATTGGCCTTNTTGAAGG I Mil I AN 
CCCC CAAC CTACATTCTTTCCAANCTTTGTGTCATTTACCCCI rTTTTTTTTTTl \GGGG 
GGGG TTTTNAAAAA AAA I I I I I I I I I NGGGNGNNAA'l rTTTTTTI I AANNCCCNTTTNCC 
CCCCNI {(Mill IGGNGTGNTGTGGNNNTNTTTAAAAAAAMNNAAAAAAAAANGNGGT 
NTNNGTTTAAAAAANGGGGGGNNGTTTAAAAAAAAAANGGGGGGGGGTCTNCCCNCCCNG 
GTGGGGG NTN CCCNAAN GNG N ATANTG GG NTTG I I II I I i I I CTAATATNTCCCCCCNGN 
GGANGGNGGANNCCCTA I I 1 1 I I I I I I I I I I INCCCCI I I I I I I II I I AAAAAAAANAGG 
NTGGTTNTTNAAAAAANA TAAAA AAAAAAATCCCCCCCCCCCl I I I Ml I I I I AATAAAA 
AAAAAATGGGGGNTNCTN I I I I I I I I I I I INGGGANGI I I I IATNGTA! I I i I I I IAAAA 
AAAAAA 
Sequence 1036 

CCGCGGTGGCGGCCGAGGTACCCCCATGCTGTGGCACGGCTTCAGCTGTGATTGGGTTAT 
TATACCCCTGCATTGACAGACATCTAGGAGAACCACATAAATTTAAAAGAGAGTGGTCCA 
GTT GTAATGCC GGTGGTGTAGNCAGTCTTTTGCCTTTTGGGGGGGGGGGGGI I I 1 1 I I I A 
AAAA l I I I I I l NTAAAAAGGAAAAAGGGAAAAATN M I > I I I I NCCCCCCAAAAAAATTT 
TTTTNGGGGGGG GGGCCCCC CCCCCCCCCCAAAAAAAAGGGGNN I I H I I M I NGGGGGG 
NGCCCCCCCCCN I MINI I I I AAAAGNNAAAAAAANNAAAAAAGGGGGGNN rTTTTTTI 
TTGGGGGGGGGGGGGGGGGNNAAAAAAAAA I I I I I I I I I 1 1 I I 1 1 NNNNTTTTNNNCCCC 
CCCCGGGGGGGGGGAAAAAAAAAAAN mTTTTTTTI NAAAAAAAAAAAAAAAAAACCCC 
CCCCCCCNAAAAAAAAAAANAAAAAANAGNCCCCCCCCCCAAAAAAAAAA I rTTTTTTTN 
AAAAAAAAAAAACNCCCCCCCCCCAAAAAAAAAAANGGGGGGGGGGGNNNNTTTTTNNNN 
NNI I n I l 1 1NGGGGGGG 
Sequence 1037 

AGGTACAAAAATTAGCTGGGCATGGTGGCGCACAACTGTAGTTCCAGCTACTCAGGAGGA 
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TCAGGCAGGAGAATCACTTGAACCCAGAAGGTGGAGGTTGCAGTTGACCCGTAGATCATT 
GCCCANCTGNCATTTNCCAGTCNCTGGGGGGGGGNNCCCCCGGGGGNGGNAAAAANNCCC 
CCNAAAAANGGGGNGNNAAAAAAAGGGGN I I I 11 I I t 1 NGGGGGGTNAAAA AAGGGGG GN 
TTNTNAAAAAAANCCCCCCC I I I I I 1 I 1 ICCCCCCC1 1 1 I I I I I I GGGGGGGGTTTTTTT 
TTNNCCCCCCC I I I M I I I I 1 1 I NCCCCCCNCCCCCCCCCCNAA AAATT TNAAAAANNNN 
N AAAAAANTTTAAAAANTNTAAAAAN NTTAAAAAAAATTN AAAAAATTTTN AAAAAN NTN 
AAAAAAN N NN AAAAATTNTN AAAAAAN NTTTAAAAAAAN NTN N N AAAAAANTNTN AAAAA 
AATTTTTTN AAAAAN NTN TAANAAAAATTTN 

AAAANNN I 1 IN I I t I 1 f NNNTTTTNTNNNNNGNAAAAAANNNNNAAAAAAAAANNNTNN 
NAAAAAAl Mil I AAAAAAN NGGGGGGGGGGGGG 1 ITTTTTTTTl 
Sequence 1038 

NCACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCNAGCGGCCGCCCGGGCAGGTA 
CTCAGGGCTTGTGGTTGTCTGCCTTGTGTTTAGACATACCCAGGAATCTCATTACTTTGC 
ATTGMGACTGTTTTTATTTTTGGAATGTTCTGAATGTTTCCAAACTGCTGAACTGTGCC 
TGAAANGGGGNI I I I i NAAAAAAl III I I Hi U I NGGGGGGGAAAAAA AAAAAA AAAAA 
AATTTTTNNCCCCC I I I I I I I I l I I I l INNNCCCCCCl M I I I I I i I I I I I I I I M INCC 
CCCTTTTTTGGGGGGGGGGGGGGGGGNAAAAAAAAAAAA AAAACCCC CCC I II I M I I il 
TTTTTTGGGGGGGGGNCCCCCCCAAAAAAAAACCCCCCCN I M II I I I NCCCCCCCAAAA 
AANT I I I I I I I I I NNAAAAAAAAAAAAAAAAAAAAACCCCC CGTNCCCCCCCCAAAA AAA 
AAAAACCCCCCCCCI I i I 1 1 I I i NGGGGGGGGGGGN I H II I I 1 1 I 1 1 I I 1 1 I I i I ICCC 
CCCCCCCC l I I I I I I I I I I I I I I I I I I AAAAAAAAAAAAACCCCCCCCCTT 
Sequence 1039 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGA 

CCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTC 

ATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACC 

AGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAG 

CCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTCAAATAA 

TCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAAACAAGC 

CATCCACCAGCCTATCT 

Sequence 1040 

CACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGGAGCCT 

GCGGAGTTCGAGACCATGCTGCTGTTCTGCCCCGGCTGCGGGAACGGGCTGATCGTGGAG 

GAGGGACAACGCTGCCACCGCTTCGCCTGCAACACGTGCCCCTACGTGCACAACATCACC 

CGCAAGGTAACAAATCGGAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTAATTGCGCCGCTTGGGCGTAATCATGGG 

TCATAGGnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTCACAATTCCACACCAACATACC 

GAAGCCCGGGGAGCATTAAAAGTGG 

Sequence 1041 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAGTGCAAAAG 

TTAGTTGMCTGGTTCATTCATCTCTATGGTAACAGCTTCCTCCTCTTTATCGACATTAC 

TTGTCTGTGACAATTTAATGTTTCCATTTCCAAGTTCTCCACTTGCAGAAMTTTCACTC 

CGTCTTTTGCACAGGAAATTACAACAGCATCTCCAATATGGCTGAGATCTCGGCATATAC 

GTGCAAATTCACC AG AAGGCATCTTTACTACACAGCTGTACCTGCCCGn nn n n nn n nn n n 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCG 

ACCTCGAGGGGGGGGCCCCGGGTACCCAANCTTTTTGTTCCCTTTAGTGGAGGG 

Sequence 1042 

TGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATAGCTGTAAGGAGAAGCTGA 

NAAATGATACCCAGGAGCAGCAGGCTTTACGTCTTCAGCCTAAAACCCAAAAAAAAAAAA 

AAAAAAAAAAANGTACTGGACCGGGTTGCTGAAAANACTCACACCCNAATACCCGCGTAC 

CTGCCCG 
Sequence 1043 

CCGCGGTGGCGGCCGCACGAGGATGCTTGGCACGTACAGGATCTTGCCATCACCTGTTCG 
GTCAAACAGCTGGAAGGCCTCCTTGAACTCTGCGGTCTGGTCTTCGGTGAAGTCACACAT 
CTTGACTGCTTAGCTCTCCGGGACCTTTTCCTGCAGTAATGGCCCCCGCGTACCTnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnCGAGCCCGGGGAGCATTAAAAGTGGTAAAAGCCTGGGGGTGCCT 
Sequence 1044 

GNGGCAGCAGTCACTGTGGCNTAAATTTCTAAACTTACATANCNNATCNTGTAATAGCAA 
ACCTTAA AAAGGGC AAGGNAN AGAG GACTTAAAAATGAANAANAAAAGNGATAGGGAGAA 
TGAAGAAI Mill I AAAAGCTGTTTTAGATCAGGGTAAGACCGNAGGCTTAAACAAAAAA 
AAAAAATAAAAANGTNCCT 
Sequence 1045 

GACTN CTTAG GGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCTTTTGCCGCT 
TCTGGJ I I I IGCAGACATCCACTACTCCCCAGCTGATTACACCAACTTGAATGAAACGAC 
TTCTCTTGTG AACTATC AAG G GG CC GCC AG AATC ACCTCTGC AAGTATTG GG GTC AG CAT 
AGGGACTCACTCCTTCCAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnn nnnnnn nnnnnnnnnnnnnnnCGTCGACCTCGAGGGGGGGGCCCCG 
GGTACCCAAGCTTTTTGTTCCCTTTTMGTGAGGGTTAAATTGCGCCGCTTGGCGTAATC 
ATGGGTCATAAGCTGTTTCC 
Sequence 1046 

CACCCTTACGACTCCTATAGGGCGTTTTCGAGCTCCCCNCGGTGGCGGCCGAGGTACGCT 

GGATAGCCTCCAGGCCAGAAAGAGAGAGTAGCGCGAGCACAGCTAAGGCCACGGAGCGAG 

ACATCTCGGCCCGAATGCTGTCAGCTTCAGGAATGCCCCCCGCGTACGCGGGGCCCGCGG 

AGTATACACCATGAGCAAAGCTTCCCCCTCCCGAAGTTGAAAAAATTTATGGACAAGAAG 

TTATCATTGAAATTAAATGGTGGCAGACATGTCCAAGGAATATTGCGGGGATTTGATCCC 

TTTATGAACCTTGTGATAGATGAATGTGTGGAGATGGCGACTAGTGGACAACAGAACAAT 

ATTGGAATGGGTGGTAATACCGAGGAAATAGTATCATTCATGTTAGAAGCCTTGGAAACG 

AGTATAAATAATGGGCTGTTCAAGCAGAAGA 

Sequence 1047 

GGCCGAGGTACATTCCCATNCATTGCCTTCTTNTTTNACCNCTTTTNTTTGTAAANCTNT 

GNAAGATGCAAAGAGGTNGGATCAAGTTNAAATGACTGNGCTGTCCCTTTCACATCAAAG 

AACTACTGACAACGAAGGCCGCGCCTGCCTTTCCCAANNCGGCCAANCTATCTAGGCTGG 

GCAGGGGAAGGAAAGAACTTGCATGTTGGNGAAGGAAGAAGTGGGGNGGAAGAAGTGGGG 

NGGGACGACAGTGAAATCTANAGTAAAACCAAGCTGGCCCAAGGTTGTCCNGCAGGCTGC 

AATGCAGNTTN AATCAGA GNGCCATTTTCTTTTNTGGNNCAAANGACATTAATTATTTGG 

GAMGCACAMTTTTTTTNTTAAMTATGGNCAAAATAAAAAMGGN 

TCCAAAAAAAAAA 

Sequence 1048 

CCGCGGTGGCGGCCGAGGTACTGTGGGCTGGAAATCCCAAATCTAGATTCCAGCAGAGTT 
GGTTCTTTCTGAGGTCTGCAAGGAAGGGCTCTGTTCCATGCCTCTCTCCTTGGCTTGTAG 
GAGGCATCTTGTCCCTATGAGTCTTCACATTGTCTTTATGTACCTGC CCn nnnn nnn n nn 
nnnnnnnnnnnnGATCCCCCGGGCTGCANGAATTCNNnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnGCCCGGTACCCAGNTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCTGTGTGAAATTGTTATCNCGCTCACA 
ATTTCCACACAACATTACNGAGCCCGGGAAGCATAAAGTTGTTAA/^GCCTGGGGT 
Sequence 1049 

AGGTGTCGACCCACNCGTCCGGCTACTTACTGGCTTGCTTTTGriTGCCCTGCCTTGCTC 

AGCTTGCTTT TTTAT AGAACCCAAGACTACCAGCCTAGGGCTGGCACCATCCACAATGGG 

CCCTCCCACCTTTTGATCACTAATTTGAANAAAATGCCNTTACAGCNTGGATTCTCATGA 

AGGCATTTCCTCAAGGGGAGGCTCNCTTTTTGAAAGAAC 

Sequence 1050 

AGGTACGTTTTCAGAGATGTGTTGACTGAAGTAACCTTCAGCGGGCCTGAGTAAACCCAC 
AGCCAGTATCC AGCC CTGCCTGGGAGTCCCTGAACTGGGAAANTATNNTGTGATTTCTTT 
CACCTGGTGGCTTTTAGNTTATTTAAGGCCGTTNTTGGGAGGAAGGGCNCATTGGCCTNA 
NNCCATCTTTAAAAGAMGTTTCTTTrCTCCCCTT^ 

TTTCCTNNTGGTAGNNC CANGNAA CCNCANCNCCCCCTTTCTTTTGGNAAGAAACNTTTT 
NCAACCCNTTCCCAATl Mill)) I NGGAANCCAAAACCANCCTCTAACNCCGGGGNGNN 
CNAA NGTAANNCAAAAATTCTGGGGGTTNTTAATTTGGGTTTTTAAAGGGGGTTCCTTTC 
TTTTCCANTNGGNGGNMGATTTTTTGGGGGNGGGNNAAAAGGTAANANTTTTANGGAAA 
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Sequence 1051 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCAACATATAT 

TTTTACGAGGGTTATTTTGATCAGCTTGTGTAGTCAGTGCCAAAGTATTTAGCTGGTAGC 

AGAAAAAGGAAAAGTATTCTCCATCTGTGATCACAGCCTGGGAGACAAAAGGTCGAGTAA 

CATCTGCTTCACTCCCCGCGTACCTGCCC 

Sequence 1052 

CGGGCAGGTGGCCGCCCGGGCAGGTACTGCTAGTGCTATCGGTTGAATAAATAGGCTGAT 
AGTTTCGATGATGATTAAGTATiSGGAATTAGTGAGATGGGGGTTCCTTTCGGTNANGAAG 
TGGGCTAA 
Sequence 1053 

CGGGCAGGTACAGATACTTGMGCAATTCTGCAAAGACCCTGAGTTTTCACAGGGCTGGT 

TTTGGCTATCGTAAAAAAAAAAACAAANACACGCACACACACACCTATAGTTTTCCTGCT 

TGCCATAGGATCTCTCTTGGGAGATGGACAACCCTNAAAGGCACTGATTTGGTTGACATT 

GGTAGCTCTGGTCCTTACTCAGGCAGCCAGCTCATCCAAGGCCCGGTCCAAGGGATCCAC 

TGACCAGTAGTCATTATCCCAGCTCGAGGAACCAGCCCACAAAGGAAGGAGGCCTNAAAA - 

GCCTTTGC CTT AACCG AC AGTTG ATG G GG GGTC C 

Sequence 1054 

GGCCGCCCGGGCAGGTCGCTCGTGATCTAGATAGTGAGCGGACGCGTGGGTCGACTCAAG 

ACTTTTAMGATTTATCAAMTTTGGTGAGCCGAATCTCAGAAMTTGGTGMTTCGG^ 

AGTTCCCAATAGGCCGCAGTAGTMTMGTGGAGTGTTCGGGATATANAAAAATTTTCAC 

GMTGGAATNAAAATTTCTAGGCGMTTCAGTGAGTTCCCAATGGGACATGATTGTATGA 

GTGAGCl I I l"l CAATATATAGCAACATTTTTAGGTTCANAAAACTTAAAG 

Sequence 1055 

CGGCAGGTCTTGAGTCGACCCACGCGTCCGCAAAATTCTAGCCAGAATAGCCAGACAAAA 
AAAGATCTAGGGGATACAGAAAGGCAAAAGGAAGTCAAAATATCACTATTTACATATGAT 
ATG ATAGTATATTTC AG TG ATCCC AAAATTTCC AG NGG AG AACTCCTAAACCTG ATAAAC 
ACCTTCAGCAAAATGGCTGTGTAAAAAATTAACTCAAATAAATCAGTAGTCTTCATCTAC 
AGAAAAGACAAACNAGCCAAGGAAAGGAGATTAGCGAAAAGACACCCTTCATAATAG 
Sequence 1056 

AGGTACCATGTTTCACGTTAAAATGCCAAAATGTGGTGCCTTCAAAGAAACAGTCAATGA 
AACAGAAAGCAAAAATAAGCAGAAAATTAGAAACGTTATTTGGCATCATGAAGGGCAACA 
CCAAATTTCACTACTAACTGCAAGGATTGATAAACACCATGATTTCAGATTTGAAAAAAA 
ATGGAAAACTGCTAATGCACAATGTTAAGAGATCTCCATGAGAACCAGGCATTAATCCCA 
TATAGCAATGATTTAATATTGTTACCAATTTCAGGAAAATGA I Mil I GATAAATGGGCT 
ATTGTAAGATATACC I I I I I ATTTCTTAGGAGCATGTGACCTGCATACGTAAACAG 
Sequence 1057 

AGGTACAGAGCCAGCCAGTGTTGGGCAGCAGGCTCACAGCCTCAATAGGGAGAAAAGACA 

AAGGCCTCAAAATGACAGGCAGCCTGACAGAGGAAGGAGTCTGACACCTCAGCTTGAGGC 

GTCTTTGGAATTCCTAGCTCATCTCAGAATTATATCTTAGAGTGATAATATGGGGTGGTA 

GCCAGTGGCCAAACAGCAAGMCTAAGAGTGGGCCCTTGCAAAAAAAGGTTGGGAAAGCT 

GGGCCCATATTGCCTGGTAAACCCTTGAGCCTGATGCTCATACAGCTGTCCCTTGTTTTA 

GCCAGGTCTTGACAGAAGGGTTACCAGCA 

Sequence 1058 

AGGTACAAAAAGTCAAGCCCTGMGTAGTTTTTCCCCCAAMGTTCTGATGTTAAM 

TAAAAAAAAAAAAGAGAGAGGTAGTGTGTAAAAATATTTTTACTTAAAATGTTGAACCCC 

TGCACCACTTATATTATAAATGAMGTTMCCTTCTACATACTMCAT^ATTTTGCCATT 

CMGAGTCAAATTTGTAAGACTCACAGMTGTTAAAGCCGGAATGCCATTAAAAATAGCA 

TATAAATGGAAGAACTTCGGTGAATTCAACAAAAAGTACCTCGGC 

Sequence 1059 

CCGGGCAGGTACTCAGTATAAAGTCCAGATGCCTTTACCATGACCTGCAAGGTTATACGT 

GGTCTGACCCATGCATTTCTCCGTGACCTCATCTTATACTCCAGGTTTCTCTGTnTACT 

CATTCTTTTCCAGCCACACTAAACTTGCCGATCTTAGAATATGTTGAGTTCATTCTAGCC 



Page 142 of 63 5 



WO 01/51628 



PCT/US01/00798 



Table 1 



CCAGGTCTTCAAATTTCCTCCTCCTTTCCTGAAATAGTCTTCTCCCAGATTTTGGTGTGG 

CTCTGTACCT 

Sequence 1060 

CCGGGCAGGTACGNTGGGGGGAGGATTGCATTCAGTNTAGTTCCTGGTTTTNGGCTGAAA 

TAACCTGACCGAGAGCATCACCCCCTGAACATGGACTTGCAGAATTCCACAGAAGAGAGG 

AGACTGGCCTAGACAGACAGCCAGAGCTGAGGGCCCAACAGGCTTTCTACCCTGGATGCT 

GCTCCCATGCCCTGACACGAGGCCCACTACAATGCTCTCCAAAATGTCCACAAAAGTGAC 

TTAAGTCAGGTTCCCCCAAACCAGACACCAAGACAAGAATCCATGTGTGTGTGACTGAAG 

GAAGTGCTGGGAGAGCCCCAGCTGCAGCCTGGATGTGAACTGCAACTCCAAAAGTGTGTC 

CAGACTCAAGGCAAGGGCACTAGGCTTTCCAGACCTCTACTAAGTCATTGATCCAGCACT 

GCCCTGCCAGGACATAAATCCCTGGCACCTCTTGCTCTCTGCAAAGGAGGGCAAAGCAAC 

TTCAGGAGCCCTTGGGAGTCCTCCAAAMGAGTCTAGGGTACCTN 

Sequence 1061 

CGAGGTACCTTCATGCTCTAAATCAGATGATCTACTATCTGAAAAGGAAACGACAAATCT 
GACAACAAANGAATTTCACAACAGATAGGCAGTTGATAGCATGAGGCACTAACATTAAAC 
CCAAGTCTTTCAATGGCACTTGGAGTCCCAGGGTCTGCCCG 
Sequence 1062 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTATAGCGACATACAC 

AGAGAATGCAGTTAAAAACTCTACCTGTGATACATGAAAGCTTCAGGGGGCTATATATAG 

CCAACAGTAATTGACTATTACACMGMCCAMGAATTGGACCTTGAAAGGAAATATTCT 

ATCTG G AG ATTAAC AC AAACTACTTAGTAAC AAAAATG AACT ATAAC AAAC ATT AAG GAT 

AAGTAGTCTTCTGTGGTAATAAMCTATCATTTTGAMGGACAAAMGATTATrrrTAAC 

C CTGCTTAACTNTTATG N NGTMCTACATTCC AATTTAAC AGC ATTTAAATTAAAGAAAA 

GG 

Sequence 1063 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCACACCAGGAG 

AGAAGATCATTATTAATGTGCTMTAGCAGCATTTTATTTTGAAACCCACTCTGCATGGT 

TACAGGGCTCAAAACAACATATTCTAACAGGAAGATACATTACCGAMTATTTTAATGAG 

AATATTTAATATGCATTGAGAGGTCCGCATTTTCTTGCAGAGACCTTGTAGGTAGCTCTT 

TGAGATTTCTGTCTCTATGCATTTMGTGMGGAGTTGGTTGGGTATTTTAGTTGGCAAA 

TTTTGCAGACATGTAGCTTTGGTAGTGGAGGGGGTAATAGTTACNCTGCCCGGGCCGGCC 

CGCTCTTNGGAANCTAGTGNGNATCCCCCGGGGCTG 

Sequence 1064 

TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCTTTTAAGT 
TTGCTTGGCTTTCTCAG ATTCTG GTTTCMTTAAC ATGCTTATGATTTTCTGGGTTTG CA 
GGGCAGATTAGTAGGCTTTGGTGTGGC I I 1 1 n CTTTCTTCCTTTCTGCCTTTACGCCGT 
TTGGGTTGATGCATCTGTGTACCT 
Sequence 1065 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGAACTAGCTCCTTCTGGTTAATTTGTTGATT 
GGATTGGAAATTAGAACATGGAGCTGGTCAATGCACGGTATCTGGTAATTGTGGATGGGG 
GAGATGACTGGGGCAGAACTGAGCTCATTTTTGCCAACATAGTACCT 
Sequence 1066 

CCGCGGTGGCGGCCGAGGTACCTCTGTGGAAATCCCTTCCTTCCAGAAGCCTCCACTCAC 

TTCTGATGCCAAGGAGCTCTGCGGCGCCCTGCACGCACCTTTACAGATGCAGGTGGCTGT 

TTCCTGTGTCAGACTGCAAGCTCCCGTGAGCTGGGTTCATTGCTCATCGTTGACTTGGCC 

TCCCCGCGTACCTGCCCG 

Sequence 1067 

CCGCGGTGGCGGCCGGTGCCACACTGGGCCAGGGTAGTCATCCTGAAATACATGTCCAGG 
GTCTTGTTTGTCTATGATGTGGGTGAAAGCTGCCTCAGCCCGCACCACAGTAGAGAGCGG 



Page 1 43 of 635 



WO 01/51628 



Table 1 



PCT/US01/00798 



GACCACCTCACGAAAGTTTTAGCAAACTCCCAGGGTCTAACCTGAAAGCAGCCAGGAACA 
AAGACCTTTCCAGAAAGAAGGACATGAACAAACGCTTAAAGAACGACCTGGGCTGCCAGG 
GTAAGAACCCTCAGGAGGCCGAGAGTTACTGTGCACAGTACCT 
Sequence 1068 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGCTCTTGTTAATTCCATTCAATAAA 

CACACACAAAGCCCACTCTCCTTCCTTCAGACCCTATTAATTCTAGTGTAAGAGAGATGG 

GTGATGTTCCACAGAAGAAGCTAAGTTGGACTTTGAAGGATAAAGAGACGTATGGGAGGA 

AAAGGCAGGAGGTAGAATTCTAGCCACAGGAAGAACCAGGTGAGAATAAGCTTGAAGATG 

TTTAGGGGAGAAGGAGTTGTCCCTTAGCCTTGATCCTTAGAAACAGACGTAGAGGTAGAG 

GAAGCTGTGGCTGCAAAGGCAGTAGGCAAGAAAGTGAAGCCAAACGCCACCCAAGTGGCA 

CTCTCATGCCCCTCTCTAATGTCAGGATATGTTTACAGCCAGCATGATTTCCATATACAT 

TCCTCCTTCACTTC 

Sequence 1069 

AGGTACTCACTTATAGATGGTTATTGGCCAAAAATACAGACACCATGCTATTATCCACAG 

ACTCAGAAAAGCTACACAAGAAGGAATGCACAAGTGAGGATGCCTGAATCTCACTTAAAG 

GAGTGCATAAAGCAGTTCCTAAGAGGAAGACGGAGGGAGTGAACTGAATAGAGAAGAGAT 

ACGGAGGGCTATGGGATGGGTTTGGAATCAG 

Sequence 1070 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATACGCGCATTTGT 

GTGTGTGTATCTATGAGAGGGGGAATTAACAAATTACATGGATTTGTGTATGTGTGTGTG 

CCTAAGAGACAAGGAAGGAAAACAGTGCAGTTGAGAAAGGTGAAGCAAATGTGGCCAAGT 

ACCTGCCCG 

Sequence 1071 

AGGTACl rTTTTTTTTTTTTl I CCTGAGGGAGATGGATAGGTGAAGTAATGGTTGACAAG 

CATAAGTGGGAAGCTTCMGTCCCAGGTTCTCCCATCAAATCATTTTCTCACAATCTTGC 

TCAGTATTCTCTTCTAGTAGGCCANAATTTGAAGGATTAGCTCTCTGTATATCTAATCTA 

AATCCTACTCCTTGTTTTAAGCTCTTTGATTTCAAATATTAATGATCTGGTACAGCATCG 

CTGGTGGTTTCAAAAAACGTAGTCATTCTTCTCACTGCAACAATGTAAGATAAGCAGGGT 

AGATCTGTTATTTCCAAATTAAAGGTGATTAAGATATATGGAGAGAGAACATGGCATGTT 

GAGGTTTATAGGGCTAGAA 

Sequence 1072 

AGGTGCTAGGTTGAGTCGACCCACGCGTCCGCTCAAGGAGAAGGAAGACCAAAGTATGCA 

GGAAGGACCCTAGGGCCTTTCACCAGCTTACATCCACCCACCTTGGGCAAGGATAGGTTA 

GATTCCAGTTTCCTGCCTCCCGCCATGGCTAGGCATTCTCAATCCACAGGAACATTCTTG 

AGTAG/\ACATTCCCAGAATGATCTGACTGATGCTTGCTAGCCAATAGATTTAAAGGTTAA 

TATGCTTAGCCTATAAGTTAGAAATGTAACCTTTCTGAGGTAACCTGTGCCCTTAAAAAG 

TATAAG 
Sequent 1073 

/vGGTCACGCGTCCGCTGCTGGGCTGAACCAACCAAAGATGGAGTGGCAGAACCCTGGGAG 

CTGATCGGGGTTCTTTCTAGTAAGTAGTTGCCAGGCCTTAGAACTGATTGGTGTCACCTA 

CTTC/vGTGTTCATTCATTTCAATGACAAGATCAAAATCAGATTTTACAGGATGACCAAGA 

TTCTGAGTTTCTCCCATGGCAACAAGTTCTCAGGTTAAAGTCTAGGAATGGTCATACTTG 

CAGCTTGTTCTTCCAGAAGACCTGACTTTGGTTCCCAGAACCCATGTTGGGTGGCTCACA 

ACCATCTATAACTCCAGCTCCCGAGAGCCAATGCCTCTGACCCCCAAGAGTGCCTGCACT 

CATGTGTATATATCCCATAATTAAAAATAATAAAATAAA 

Sequence 1074 

CGGACAGGTTGCAGAAAGGGAAGAAATGCCAGAAGGAACCAATGAAGGATGAGCCCAGCT 

TCCAAATTTTGATGAAAGCTTTCGTCTCTACATCAGAGAAATTCTGCAAGATCTTAGAAG 

GATGCGCTCAGAGAGATTCACACCTAGAAACACCATACTCAAAGGGTAAGGAGAAGAGAG 
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CCCCAGGGAAGAATACAGAAGGGGACACATGACTGGCCACACAGAAAGCTCATCAGGAAG 
AAAACATGACTCCCTGTCAGAAACCCAGGGTCCAGAAGGAGTGGGATGACGTTTGCTAAG 
TTATAAAAGC GAG CTGTGCACCC AG ATTCTATAC 
Sequence 1075 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCCAATCATTTGCCTGCATCCTTCTCTTCTCA 

ACAGACTCAAGAAG TAG AGACGCTGACTTTTTTG TC TTTG TATCCCTAGG AC TTAGCACT 

GCCACCAAATACMCCAMGTTTGCTGAGCTGACATGATTAAGAGCCTATCTGTGGAAAA 

GTATGATCGGTCTTTGTGMGTTGCAATCTCTCAGAGACTTGTTTTGTGCCTTTACAAAT 

AATAGATTTTTTT 

Sequence 1076 

CCGGGCAGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCA 
CTGAGCAAAGGGTGAAGTTCAATTAGGTCAGTTTTATCACTTCTTTTCCTACACACAAAC 
TATGAGGAGAGATCATTTCTTTTCTTTCTTTTATTTATTTA I MUM I GAGACAGGGTC 
CCACTCTGTCGCCCAGGTTAGAGTACCT 
Sequence 1077 

CGGNCAGGTGCTTGAGTCGACCCACGCGTCCGCACTAAGATTNTAAGTGAATACCACCAA 

ACCCAGGAATAAACTGA I ((Mill I CCCTCTTAACTGGAAAATAACCTGAATGTNTACN 

CAGNTAAAAAATGAGATTGGNGGGAACCNTAAACGACCCGGGCTGNTGTTTGCTCTCCAC 

AACCTGATGGCAAGGCCCTGTTGNTGGAAATAACACCCACACGGTTNACTGAACATGGGA 

AGTGGAGCTGGTGCNTTTNTAGAGTCTTNAACCCCATTGGNTGGCGTTCTTGGAACCAGA 

GGTAGCCTGCATGC 

Sequence 1078 

AGGTACCCAGAAGAAGGTGCAGATATCTGATTGAAACTGTTCATCTTACCMCTCTTCCT 

ATTGCAGAGAAGACTAAGGTCCAGAGAGGGATAGTGACTTTCCCAGGTCACACAGGAAGC 

CACAAGAGCACATTGAAGCCAGATACACATTGTCAAAGCACGCATGTCCAGCCGAAGCAA 

GGACCCGAAATGCTGATTGTTTCCCCTGCAAGGAGGATCCTGAGTTTGTCCCCGGGAAAG 

GTTTAAGAGMCTGGCAGGMCCAGCCTTCATTCTTCTAGGTCCGGTTACCCTTG 

Sequence 1079 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACCACCACCTCTGGATMTTTTAC H ' I I I I 

TATTTGAGACAGAGTTTCGCTCTGGTTGCCCAGGCTGGAGTGCAATGGCGCGATCTCAGC 

CCGGGTGACAGAGCGAGACTCTGTCTCAAAAAAATAAAAAATAAAAAATAAAAAACAGCA 

TTCCTTGGGAGGCTAAGCGATCCTCCTGCTTCAGGCTCCCAAAGTGCTGAGATTACAGGC 

GTTGAGCCACTGCACCTGGCCTCAACTA I i I I I l I AAATCATGTCTG AATCTTAATTCTC 

CTAGGGATACTTATTCCCCAAACTAACACAAGTTGAGTGACGGGATGGGCCCACTACATT 

TANTAAAAAGAAAGGTACCTCGGCCCGCTCTAAAAACTNGTNGGATCC 

Sequence 1080 

AGGTACAGCTAGGACAAACCTTGACTCCGGGTCGAGCGCTCATTGTATAACATGCAGTGG 

CATGGCCCGTGGTTAGGAGGAGTCTGAAATCAGATTGCTGAGGGTTCCGGATCTGGTTCT 

ACCACTCACAAGTGACTTCGTAGGCCTTTCTTATATTCTGATTTCTTATATCTGAACTGG 

AGCTTACACATGCACCAGAATCCTACAGGGATACTGTGGGGGACACACAGCTTGAGGTAA 

AGTGGTTACCAAGAGGCCAGGCAGTTGGGGGATCACCCATATCACAATAATTTGTCATTG 

AGCATGTAGTTTCTCTCTTCATGGGATGACA 

Sequence 1081 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGCTAACACTTATCC 
TCCAGTGCTCTACTAATTGCTTGGTGCTATGTTGACTACTGGGAATGAACCACAAGATAC 
" ATCACTGTAAACTGCAGGGAGNGCTTCACTGATCATGAGGTNAAAATCTCCCCATCATCA 
NAAAGAGAATTCTTTACTTAACTTAGTGGTACCTGCCCG 
Sequence 1082 

AGGTACACATTTTTTGTTGCAAATAGTCCTGCAGCTGGTTTTTCTAAAAAAAAAACCCCA 
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AGAATACATATAAATCTGAGTTCCCTATGGCGCTAGTNTACTGGGCAGACTTCTGGTCTG 
GGGTTCTTTCTTTNTTTTAATAGCCTTCTGA/^^ 

GTAGCACCTAGTTCTAAGTGTGTGCTATGCTTTGGTCTAATATCCAAAGAGATGtCTAAA 

GCAANTGTGCTTGCTTTTCATGA 

Sequence 1083 

TTAGGGCAATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTGGTCTCAAACTCCTG 

AGCTCAAGTGATCCTCCCACCTTGGTCTCCCAGGCGTAAGCACTGCACCTGGCCTTAGAA 

ACCTCTTTAAAAGTGGTTGTGGTATCCTCATAGAATCTAGGAATCCATTACTCAAAAAAG 

CAAGACTGCAATGATACACAGTACCTGCCCG 

Sequence 1084 

AGGTCGACCCACGCGTCCGTGTATCTCCTACAGTTCATTATGCAGCAGACACGGATGTM 
ATGGGGACAAANATGAAAATACCNACTCAGACTCTTTGTGAGACGTTCANCA fTTTT I'CT 
GAAATGGCGCAATGTATTCATTTTCCA 
Sequence 1085 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTACA 

GTGAAACTGmCTACTGTGGCTTMmGCAAMTCCAAGCAGATATTACTTTTAAAAA 

TCCAACTATCTCTTTCAAAGGATTTGTAAAGTTGCTTACAGACATAAGGTGACATCAGAA 

AAGCATGTTGGTATAATGCAATTAGCTAAGGTTTCTAGGMGCTTTGGMTGT7TTGTTT 

TTAAAGAAGGCATANATAAGTGCAAGTCCTACAAGTAATGAACTGTCCTAGAAGCAAAGA 

MTTCATTAMTAGATCTCCTTGATTTCTCCTTCATGGCCCCATGTCCATAGAGATTGGG 

GAATGTACCT 

Sequence 1086 

CGGGCAGGTACTGACTTAAAATAACAGAGTAACCTGTCTGGCTTACTTGTCTCTGAAAGT 

GAATGTTCAAAGTGGGTGGAATNGAAATCACATATTAACGATAGTCCATACTTATATGCA 

MGCCATATGATTCCCTAAGCTACTTAAACTGGAATAAAGACACTAATAAAAATTTCATA 

AAACATTGAACCTGGTGCCAGCCTCACATTAAACCCATAGGTTCTTCACGATTAAGGCTG 

ACAATTTTAGCTACTGCTTAGAAAGTTGCAAACACAAATAGGGGGGAGTTTAATGTTCCA 

TCAGAAATTAACCCAANCTGGTTTTGAAACTCTACTTGGCACTTA 

Sequence 1037 

AGGTCTTGAGTCGACCCACGCGTCCGCCTAGGTCTAAGAGCTGAAGCTATAACTGCTAAG 

AGGCCTGGGACTCTGTGACACTGAAGTTCAGCAANTGTCTTAANAAACMNAACAGNAGG 

GGCTTGGAGAGATGGCTCAGNNGGTTAAGAGCACTGACTGCTCTTCCTGAGGTCCTGAGT 

TCAAATTCCAGCAACCACATGGTGGCTCACAACCATCTGTAACGATATCCGATACCCTCA 

TCTGGTGTNGTCTGMGACAGCTACNTACATAGMTAAATCTTAAAAAAAAACAAAAACA 

AAAACAGAGAAATAGGAAAAGGAATGGGGGAAGAAAGGGAAG 

Sequence 1088 

CGGCAGGTCTTGAGTCGACCCACGCG\ r CCGCATTTATTGTGGGCCTTACTTCATGCCTTG 
CTAGGCMNTAAAGATACTTTAANACAACCTATTNA^ 

AAAGAACCTTATTTTATTGGGGTGNTTGGGNCAGGGTTAMCGNGCCAGGAAAATNGGGA 

GGNAAACCATGNAAAAAGNAATTAANCNATTTCCACCCATATTTTGGGGGGTTCCAAGAT 

GGGANAAATTGGAAAATTGGTTTTAAGTAAGNAATTTCNGGGGTNGGTTCCTTGGCCCAN 

ATGGCCTTTTAGGGGGTNNGGAAAAATTAACC I I I I \ GGGGTTTGGGAACCCTTTTNGGA 

AGGNAAGGGGGMTTTGCAMNGNGGMGNGGGGGGCCTGTTTmCTTTGGGGTAAGGC 

CAAAAAANGNGGGGNAATTTGNGG'l UN I GGCCAATTGGGGGNGGTTGTCCTTTNGNGG 

CCNNAGGNACCCGGGGGANAATG3AANGGGGGGAAAAANGAACCANTGGTANGTCCCAAN 

NGGNAAAGGNAAAANNGGGNAAAGNGGGGGNG 

Sequence 1089 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATGACACAAAGTAACTTCTCAT 
GGCTGGATGTGATGGAGGATATCCAAGCACCTGGCAGGCATTTGTAGAGATCCCCGCTAA 
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AGCCCCCTTCAGCTATAGGATTCTATGATTCTTTTCCTGTGAAAATTTGGAACACCGTGA 
TAGTGTANTTACCCTCTCCAGATCATAGGGGCACCATTTAATAACTTACTACATTAATGT 
ATTAAA TTGTACCTGCCCG 
Sequence 1090 

AGGTACGCGGGGAAGCAATGACGTGGGGAGAGAGCGCGGAGGAAGGGAGAGATCCAGAAA 
GGTAGATTCTTCTGTGTAGGTGAGTGGATGAGATMTGAAGCMGTTAATTTGGGCTCTG 
CTGACTTATTTATCTGTCCCGGGCACTTTTCTCCGTACCTTGCCCGGNCCGGC 
Sequence 1091 

CGGCAGGTACAGTAGAGACACCATCTCTGTTAGGAAACTGTTAGTGGCTACCATGTGTCC 

CTAAACTCCCTAGCTTGGTAGTCATAGCCTCACGTTCTGCTTCCAGTTACTTACCTGTCA 

ATCCTTTTCTTCCTACTGATCCCTAGTGCACTTTTAACACATCCTTATCCTCTTTGTCTA 

TGTCTGTTTGTCCTATTCTCTGCACCTGCCTTTCTCTCCCCTAGTCCTACTCCCGCATAC 

CTTAAATGCTGCCAATAACTTGATTATGGCTTAATTACTTAATTATGCCAACTGAAAGGG 

GTCTACCGTGTTAAAAGGGCTTTGGCCCCTA 

Sequence 1092 

NGATTACCCTNGTNCGCCC I I I I l CTTCCCTTTTCGNGGTAAAGCCGGTGGGNCGCCTTT 

TTCTTrCANTAAGNCTTCAACAGCCTNGATATGGGTNATTNCTTNCAANTTCTCNGTGTN 

GATMGTGGTTCTGTTTTTCNGNCTTCCCCAMTGACCTTGGGAGGCCTTGGTTGGTAGT 

CCAACCGAANACCCCCCCCACANGGTTTACAANNCCCCCGNACNCCGGCTTTGTCGCCTC 

TTTMTTTNCNGNNTTAAANCTTATTCCGGTTACTTTGTAGGTTTCCCAAACCCCCGNGN 

TTATAAGTAACCAACCNGAANCCTTTTAATTTCCGGCC 

Sequence 1093 

AGGTACTCTTCTGGTTGCCTTGACTCTAGTCCTGATCAGAGTGCCCAGCAAAGAACATTA 

TACGCACTGTCCCGGCACTGTCCTGTGACATAAAAAGCAATTGACCTGATATCAGATTCC 

CTGGGATAGGACCTGAAAAAGAGTTGAGGGCAGGCTCTGCATTTTGTCCTTTTCTGATGC 

AAATG AGAACG AAATAA I I I I I I I CAACTTACTTTATCCTATGGTGAGTAGAAGCCACCA 

TGTTTCCTGTGGAAACCAGGGCGGTGGCATGGCTTCTGCTTCTGTCTGAGAGAACATTGC 

TTTCCTGGGACTTATCCCCCTTTTA 

Sequence 1094 

GGCCGCCCGGGCAGGTGCAGAACGCTGGAGAGGGTGCAGTCTCTGTGAGGGTGAGCGGTG 

TGGTGCCCTGGAAGGACTCCTTCCCACCAGTCACTAAGCAGCAGAGGCCAGGCAATCACA 

GGCTAGGAGCCTTCTGACCCAGGAATAGGCATTCCCGCATCCTTAGACCACACAATGCAT 

GCCCACTGTGTCACTGCAGAAGAAAGGATAAAGGTGGCCATGGCTAGCTGGCCATGCAAG 

GCACTCACCAGCGTTAATGAGACCTGGCCGGNCAAGGCAGTTACTCTGATGGTAACTAGG 

GCTTGTCTCCTTGCCATGAGAAGAGTTAGGGACACAC 

Sequence 1095 

CTGAGTGGCTGGGACTACAGGTATGCGCCATACCTAGCTAATCTTTTTGTATTTTTTTGG 
• TAGAGAGAAGTTTTCGCCTTGTTGCCCAGGCTGGTCCGAACTCCTGAGCTCAAGTGATCT 
GCCTGCCTCAACCTCCCAACCTGCTAGGATTACAGGTATGAGCCACCCTGCCTGGCCAGA 
TTTMTGAAACTTTATTAATCACTTAGTTN^^ 

AATACAACGCTATTGTAGTTAATACCGTTG I I I I I GCGGTTATTTGCTTACATGTCTGTT 
TCTGCTACTGAATTGTAAATCCCTTGAATATAGGGAAAATATATTTGTATCCCTACCATC 
TGAAAAGAGTACCTTCGGNCGCTCTANAACTAGTGGATCCCC 
Sequence 1096 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGCCCTGCTTGTCTG 

CAGATCCAGAGAGCCTTAATTGACAGATTCAACTGTGCCTATTCCTGAACCAATTCTGAT 

CAAGAAAGTGGTCCCACCAGTCAGTGCTGACAGGMTTAGTCTTTTGCTGTCAACTAATG 

GTTTCCAATTAAGGGCMCTTTTTCCACCCAGAGATATTTGTCAATGTCTGGAGACATCT 

ATGGTTGTCACAGCTGAGGGAATATTAATTAATAGCACCTAGTGGGTAAAGGTGAGGAAT 
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GTCATTAAATACCCTACAATGCAGAGGGTMCCCCCCACCAACAGAGTACCTCGGCCGCT 

CTAGAACTAGTG 

Sequence 1097 

CAGGCAGGTCTAGCTTGAGTCGACCCACGCGTCCGATTTAGAAGAGCCAGAAATAAAATA 

AAATATC7TGGTGTAACTTTAACTAAGTAAGTAAAAGAACTGTATGACAAGATCTTGTCT 

TTGAATTAAAAATTCTGAAGATGACAGTAGAAGATAGAAGATGCTCATGAATTGGTAGGA 

TTAACATAATAAACATGGTGATCTTACCAAAGGCAATAAACAGACTCAACACAATAACTA 

TCAAACTTCCAACATTTTTTTATTMTCTTGCAAGAACACAACTTAACTT 

ACAAAATAAAACAGTATATGTCCAGCCTGGCTTGGCCCTTCAT 

Sequence 1098 

TCTAGAGCGGCCGAGGTACTCAAAATCCTAACCAGAGCATTCAAACAACAAAAAGTAGGT 

TAAAGGGGATACAAAACTNGGAAAATGAMGNAAGGTCAATTAATTATTCACCTANTTT^ 

GCCAGTATGGAATNATGNATTAGGTTTTTACCTTTTAAAGNTGACCNCCCCAAAAAAGGT 

TTC N C AACTCG G G AAG M AAAAC CT AACC T ATATAG N C NTTG AATTAAAAAAC C AAAN CTT 

TTCCGAACCAAAAGGGGTNANGCCTTGGGGGGTTAATTAAAAAACCNTAAAACNTTTCCA 

MNTTTCCAATATTAAAGGNCCCCTTTTCNCCI Mil I NAACCTTCCAAAAAAGGGGGAA 

TTNAAAATACCMGGGGGNTNTTGGGAAGGNAAAAAANAAAAATATAATTTTAAGNGGGG 

GGNAAAAAANTTGGGAACCCAACCCCCCCNTTTTTAA 

Sequence 1099 

AGGTACACACACACACACACACACTCTGCAGAAATTACTCACAGTTTTGGCCTGTAAGCA 

GTATTCATTCTTCCATTAAATCTGCTAGTATTAATATAGGTTTTCTCCCATTTTATATT^ 

CTTTGACTATTAAAGTAAGAATTCAGGACTTGGAGAGAAAGTGGAAAATTTCCATTTAAT 

TCATTAAGTTTTAAATTATTAGCTATACCAGAACCCTTTATTATTATTTTATGGTGTATG 

GATATTTTGCCTATATGTATGTTTGTGCACCATATCCATGACTGGTTGCTTGAGGTCAGA 

AGTAGGCATT 

Sequence 1100 

GGGCAGGTCTCGTGCGGACGCGTGGGTCGACTCAAGCTAGCANATGGAAGCTAACTGCCC 

ATTGATAAGTAAAG I I I I l ACAAGACGTAGAACAGTNTTGTTCTNGTNATACATTATCTT 

TGGTTNNATTCTTATGAGAATCATATTGMNCATCAGGCCATNNTTTTAAAGGGTGGATA 

TTGGCGCGTGNAACCCTTTGGCTCCCATTTCTTAAATACCAACCAGGTATTCCGTTTGGA 

ATAGNTTTGGTAACCAGGAAGTACCTATTCTTGGACCCCCATAAAAAAACCTTTAANGGT 

TCTTACACNNTGGTTTTGGAAGNATCCTTATTAGGAAAAAAAAAGGGTNTTGGCTTTTTA 

TAATTNCTNTAACAGNTTCGGTTTTAATTATTGTTGGTTGGGGGGGTTTGGT 

Sequence 1101 

cgggctgagccgggagcctgaagcccagctcngagttatattggggacagggctcctttg 

gctcntggtaatggagnaaagggccttgangaatttaaangggaacnctt.gaaggggaac 

caaannaaaagnamggccaccccttccaattttaatnttggggttggttgcittcccctt 

cggagaaataccgaaagnggcnccatntnttggttggggttgncctttan;ttcnaattt 

tnccccttnccccgcgggggggctttaaaaaaaccaa 

Sequence 1102 

GACGAGGGCAACCCAACACCCTAGCCTAAAGNCCCGTTGACAACTGCAGhiCATGGGTTGG 

CTTGGCCCACCGCCTTGGCANCCGNTCACGNAAGGAAAAAAAGNCCGCGGGCCCTTAAAA 

GGCGGCGGAGGTTCTTGGGTTGAACCTTGGGGCANCCCCACCCGGTTCCCANGGCCTGGA 

TTGGGGTNACCCNCAAAAGGCCTGTCCCCAAGGCCCGNAACCTTGGGGAAAANGAAATGG 

TTCCTTTTNGNGNNAAAAAAAAAAAATGGMCCCCGGTTGGGGGNANGGCCCCTTGGGGG 

GGCCTTTGGGGAAGTCCNCCCCGNAAGGGGTTCCCCGGGCCTGNTT3GCCGGGGNCCCCA 

AAAANTCCAAAATGGNCCANGGGGNTGGGGNCCAACCCCCG 

Sequence 1103 

AGGTACCTAGAATATTTATTTCAGATAATATCAAGATTTACCTATATTTCTACCAAAGAC 
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AATGATAGGTTTTTTTTTATATAGTTGGAGGGCAGGGATAGTGTTTAAAGGTTTATGATG 

TATAAGCACGCTTAGACACACCAGGAGAAAGTATCAGATCTCATTACACATGGTTGTGAG 

CCACCATGTGGTTGCTGGGATTTGAACTCGGGACCTCTGGAAGAGCAGTCAGTGCTCTTA 

TCCACTGAGCCATCTCTATAGCACTTTAAAAAAAATATTAAGGCATCATACAGCNTTTCT 

GCCTCTATGTATGCCCTGCA 

Sequence 1104 

CGGGCAGGTCTTTGTCTCTCAAACTTTTATACTGTCATAGTTCATTTTTCTTGTACAGAT 
GCTCTTTCAAC I ((III IGAGAGTATATACATAACAATTGCAGTTGAAAGTAGGGTGGGG 
TTTAGTCTCCTGGTCACAGTTCAATAAGTACGGACGCGTGGGTCGACTCAAGCTAGCTTG 
TACACCTN 
Sequence 1105 

TCGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGTT 

GAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGGATGAGCTACTGAAAACAAAAAGAGG 

GTCTTCTACTC AG CCTCTACCCCTAATATTTATATCAG AAG CAG AG ATTAACTGTCCTTA " 

CTCATTCACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCC 

TGCAAACTTTATGCT1TTCTGTTACATAATCAGGCAGGGGCAAGACCTAMCTATTTTGA 

ATTGGTGGTGTTGAGGCTAMTTCTCTGCTATTGACAGAATTGAGAATGTGATCAATTTC 

AGAGTAGCCATGTTACAAATTTTGTCCCAATTTCAATGGGGGAGGAATTATAACCAAATC 

AAGTAGTGGTTGGGAAGA 

Sequence 1106 

CCGGGCAGGTACGCGGGGAGTTCTCTCAGGCTCTCCAGAGCTCAGGACCTCTGAGAAGAA 
TGGAGCCCTCCTGGCTTCAGGAACTCATGGCTCACCCCTTCTTGCTGCTGATCCTCCTCT 
GCATGTCTCTGCTGCTGTTTCAGGTAATCAGGTTGTACCT 
Sequence 1107 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGAT 

GGAAGGAAGAACTCGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTA 

CTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGG 

AAAATGCAGTAAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAG 

AAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAA 

ACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGA 

TGGTGTTTGGGAGGTGGAGCCCTTCAGAAAGGTAATTAATGCCCTTGTAAGAAAGAGGCC 

AG N AG AGCTTG CCGCACC 

Sequence 1108 

GGGCAGGTACTTGTCCTGATGGGTCATCCAGAAGAGTTTCATGGAGATCAGTTCACAACA 

TrGGACAGACTATGAAACAAAGTCTGTAATATCATTTATACTCAGTATTGGTAGGCTTTT 

TCCTTAATACCCCCACTCCAACAAAACAGACCTGCCTTCACTTCTCTTGAAAATATTATT 

ACCATCAGTAACTGCTTGTCTACTATGTCTCCAAGGCAAGCCACAGGCTATAAGCCAATT 

CTGCAAAAGGGAGCAGCAAAAGAAATGCTTAGTGCAATTCCTCTATTTAAAAGTTTGCTT 

GAGTTTTGATTATTTTTGTTATTTTG 

Sequence 1109 

AGGTCACGCGTCCGACCT AGCTTG AGTCGACCCACGCGTCCGATTTCTGGTTTCTTTGG A 

GACTCGGTTGTGCCATGTGATGCTTTTCTGGAAACTGTCTTATGAGAGGMGCTTTTGCT 

GAAGCAGACACATGGGAGAATGTTTTGTTAAGAACAGACATGTAGTATTTTTCTAGAGGC 

AGCTTGAATAAAGAGCATGTGGTATTTTGCTAGAGGGGACACTTGAGAGAACATGTGACA 

TTTGGAAAGGATATATGTATAACCTAAAAGGACAGTGGCCAATGCTGTGTGGTATTTGGG 

TGTGCCTTACCATTCTTTGCTGGC 

Sequence 1110 

TCGTGATCTAGATCCCTATAGTGAGCGGACGCGTGGGTCGACTCAAGCTAGGTCGGACGC 
GTGGGTCGACTCAAGCTAGGTCGGACGCGTGACCTTCCCCAAAATTCTTCTTTTTTCAGA 
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TGAAAGTCATACACTCAGCTTTCATATTTTAATATACATAAAACAGCTCAGTGACTGGGA 

CACTTCCTAACCTCCATTTGGGTAATTCACTTCCCTCCAATAATCCTGAGTTATCATTTA 

TTATATTTTGTATTTTACCTTGGCTGCACTGGAGCCAGTTGGGCAGCCCCTCTTGGGCCT 

CACTCTCATGTTCCTACATGGGCAG 

Sequence 1111 

CGGGCAGGTCACGCGTCCGCATGCCTACCCGGGCTTTTTCTGACTCCGTGAAGCTGATCT 

CTTCCTCATTCGAGGMGGGTAAACCCTTTGTAGATGGGTAAACCCTTGTCAGTCAGAGA 

CAGTAGCCATCCCCTGTAATTCCAGAGCGCAAGAGACTGAGGCAGGATAATTACCTAAAC 

CTAGGAGTCCAAGATTTGGCTAACGTAGTMGACCTGTCTCGAMCAAAACAAATCTTGA 

GGGATGTGATGCTTGCTTAGTGTTAGATTGTGTGATTAGGATACACAGGGCACTCAGCTT 

CC TATCTCC AC TG AAG C AGTC AATTA 

Sequence 1112 

CTACTGAGACCATTACAAAATTAAATGGTGACTTTTTAATCTGGAATTGCAAACCGACAG 

CANCCTGTCTGGGGAGCTCCATGTNATGAGTGTATGTCTCCAACAGNCTTCAATGACTGA 

CAGGTGAAAAGCTGTGCCACAGCCTGGAATTATTATGCTGACACACAAAAGAGGGCGGGA 

GTGACAGCGTGGTGATCTACCANTGCTGTCTGTGGGGNTAATGGTAAGCTTTCCCACTGC 

TMTGGTTTATNNANAGMCAAAGAAMGGAAAACGGTGCANGGGAAAATCAATANNTAT 

TAACATAGTCATGGTAATGAATGCAATCCNTTATTTCTGGGAT 

Sequence 1113 

AGGTACATAAAACTTGTAGAGATTTACACACGCCAGCCGCACATCGCCCCTTCCCAATGG 
CTTTACATCTTGGTGACCTTTCAGAAGCCMCGGCACCCTTCCTGTTGTTACCCGTAATT 
GGAAAAGCATCAGAGATAAAAAAAAGTGC 
Sequence 1114 

CGGGCAGGTGAAACATTTTTATTATTTATT^ 

TGAAAGCCAAGGCAGAGTTCATGTGTCTGGGCTGACATTCCCATGGCCTATGAGCACTGA 
GCAGCCACAGGCTTTTCCAAGGAAGGCCCAAGGCCATCACATC CCCA TCTACATCTACAA 
TACTTGAATGTATTTACATGTGTTGTTCTTTAAAAAGGATTACACATTTTATTCT 
AATCTAAAAAATGATGGAGAAAAGGAACACAGTTTATCTTACACATTTGCAGGATATACA 

AGGGTAACT 
Sequence 1115 

TTGTGTGGATTTACTTGTTGCCCACTGTCCCCATACCCTCCACCCCCACATGGTGTGCTT 
CCTCCAGAAAGGGACTACTTCTGCTGACCACACAGAAGACGTGTGTAAAGTCTGTGTATC 
AATGAATGGATTCTCATCTTTCATAG I I M H M I AAATAGTTTTATGTAGTGTTTAACT 
AMTTTCACTTAAAMGATATTTACCAGAAGCTGANAGTANGGTGTGATGAGGTTGGGTT 
CANGAAGGACTGNTATCACATGGCTTCCCTAAGGTTGTATATTACATTGCTAGGACACCT 

GACA 

Sequence 1116 

CGGGCAGGTACTTTTCTCTTAACTTACCTGACTGAAGGCATCCAATCAGAAAGGAAAGTG 

AACAAGTATAATGTTTCTAAGTGAGTCCAGATGGTAACAAGMTTCAAACATTATTCTTT 

TTGTATGTTTAAATAAAACAAATTTTACATTTTATTTGACCAGCAGATATMCTTCCTGG 

GCATAATTTTATTAGCAAAGAACTGATGATACTGGTAATAAAAAAAAAAAAAAAAATGAA 

AGGAAAAAGGAGAATGCACCTCAAACCAGMTACCATAACATGTTTTGTGGAAGCTGCAT 

TTTATTTGAAAAC 
Sequence 1117 

AC I M I M 1 I I CTCTGTAGGAAGTCAACGATAGAGTGACATGTTGGCTCTrTCATTTCAA 

CTGANACACCTGGTAGCTATTGGATGGCATTTCTGTAATTATGAATAGTAGTTGTGATAG 

GCTTATTTGCACCTTGTACATTCTCTTACTTGGGAACGCCATCTCATGCCATTTCTACGT 

TTCTTATGCAGTTAGTTTGGGGCATGNGGTANTCCACTGGAAAATGACTCAAGCGTGGAA 

TGTATTTATGATCACAGATAAGGAAGTGGCCGTCCACAAAGCTTTGGATGGTTGTATTTG 
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TCATTTGAACA 
Sequence 1118 

AGGTACCTGCTTTGTGATACTGTGAAGATTAGATAAAGCAAATATCCACACAGGAAAGCC 

CTGGGTGTTAGCATAAAGCCGGAGTTGAGMGGCCAGCTTTCTATTTACGTGCTGAACGA 

TGAATAGTTAAATAGTTTCCGTCTGGCCCACGTCCTTACACACACCCCTTGGGTTTTCTT 

CTCTTCCCTGGTAGGACTGCATCGGACGCGTGGGTCGACTCAAGCTAGACCTGCCCGG 
Sequence 1119 

GCGGGCAGGTGTCCGCTCACTATAGGGATCTAGATCGGACGCGTGGGTCGACTCAAGCTA 

GCTTGTACAAAAACTTTCAATAAAATGCTTCAAATAATGATTGTCCGAAGCTTGCTAGAC 

ACTTTGTAMTGTCAGGMCACATTAAATATGTTAAATGAGCATGTCCAATATCCAGAAA 

AC AAC AC G AATG ATTTACTATC AATTTC AC AC ATGC CTG G G AATGTCTC CCTG ATG C AAG 

ACAAAAAGGTCCTCTTTGACCAGAGCATACTGACCCCAAGTTTTGGTCCTGGCACAGTAT 

TCATGGGTGCCATTCAT 

Sequence 1120 

GGGTACAATGGAAAAGGGGCTGTGTTACTAAGACTGACCAAAGAAGAGATTCCAATATCT 

TATGATTTGTATGGTCACTGTTAACTGCAATTGTAACATGAACATATTGCCTATAAGCCT 

GTGCTCTCCCCTTTAAACACAAAATAAAAAACAAGAAAAACAAAACAAAACAAATGAACC 

AAAATAGAAAACCTTTTACAGCGTGAGTTGTTTCTACTTAGGATTGGACATCAGTGCATT 

GATCTGAGTCTGGAGATCTTAGGCCAACTTAACAACCTGTGAAAAC7TCTCAATGGGGTA 
A 

Sequence 1121 

CGGGCAGGTCTCCGGCGGACGCGTGGGTCGACTCAAGACTTTTAGCCTCTAAGCCATCTC 
TCTCACCCATTGATTTAAAATATTTTGCTTTGTGAATTTTGAAT^ 

ATAATTCTATCATTTTCTACCTTCATTTTTCTCCTTTCMTCGCTTCTATCTATTCTCCA 

ATTTTGCTGG 

Sequence 1122 

CTNCT ATAGGGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCTGTTCTATAT 

CTTGAI ITTTTTTT1 I CTACTTTACACATCTCATG AGTTTTATC C C C AG C ACTGGGTATA 

GAATTGCCCCATTCTTTTCAAGGGCTTGCTGTAAAGTATTCTGGGAACTGGGCGCACTCT 

CTTTAACTCTTCCTCTGTTGATTGTTTTCAGTCTCCTTCTTTCW 

G AAACATTCTTTTAACAATG ! I I f 1MCMGTCTTAGC AATG GTTTTTATTATCTG TAG G 

ATAATCTAGGAGTAGAGTTGCCAGTTTTTAGAGTATGTGCTTITGAAATTTTGTTTGTAT 

TCACTTGGGCAGCAGTMGTCATCTTTTGCTTATTGTCGNCGCTGTCTCGACCTGAAATC 

CTTTC7TTTTATATTCTTGCCTTC TTTT 

Sequence 1123 

GCNCGAGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCAC 

TGAGCAAAAGGTGAAGTTCAATTAGGTCAGTTrrATCACTTCTTTTCCTACACACAAACT 

ATG AGG AG AG ATC ATTTCTTTTCTTTCTTTTTATTTAATTTTN NTTTTTTTTGG AAN AAC 

ANGGGGNTCNCCACNTCTGTCCGCCCATGGTTTAAGAAGTTACCCTGGCCNCGGGGCNGG 

NCCGGNTACTTAGTMCCTAGTTGGGATTNCCCCAGGGCCTGCAAGGGAAATTTTCGATA 

CTCAAAGCTTTATCCGGATACCCGCTTCCGACACTTNGAAGTGGGGGGNGGNCNCCCGGG 

TAACCC 

Sequence i 124 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAACCTGGGCG 

ACAGAGTGGGACCCTGTCTCAAAAAAAATAAATAAATAAAAGAAAGAAAAGAAATGATCT 

CTCCTCATAGTTTGTGTGTAGGAAAAGAAGTGATAAAACTGACCTAATTGAACTTCACCT 

TTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGCATACATTGGCTCAGTGCCAGTAGTA 

CCTGCCCG 

Sequence 1125 
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TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTCTC 

TCAGATTAAGGATATGTGTCTGCTAGGTAAAGCGACATCTGGATTCATTGTGTAGGATGA 

AAGAAACCAAATTCTGACTGCTTATTTGTGGATCCGCCAAATCTGGCACGATGCCTATCT 

CACGTGGGACCGAGATCAAGTCCTGGCCCG 

Sequence 1126 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGATC 

TCGGTCCCACGTGAGATAGGCATCGTGCCAGATTTGGCGGATCCACAAATAAGCAGTCAG 

AATTTGGTTTCTTTCATCCTACACMTGAATCCAGATGTCGCTTTACCTAGCAGACACAT 

ATCCTTAATCTGAGAAGAGCCCCCGCGTACCT 

Sequence 1127 

CCGCGGTGGCGGCCGAGGTACTGTAGTTAAATGTGGGTTCTACATGGTTCTGCAGTTTCT 

AGCCCTATAAACCTCACATGCCATGTTCTCTCTCCATATATCTTAATCACCTTTMTTTG 

GAAATAACAGATCTACCCTGCTTATCTTACATTGTTGCAGTGAGAAGAATGACTACGTTT 

TTTGAAACCACCAGCGATGCTGTACCTGCCCGGGCGGC 

Sequence 1128 ^ 
CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGCATTTTTGCTAACTGTAACTTCCCACTT 

AATCAACAAAAACAAAAACACTCATTTCTGAACATTCAGTGCATCCATGATTAATCTTAA 

TTACACC ACAAAGGTATTTTTC AATGGTGATTTTGC G GGAGTG GGGTAAC AG TTTC G AAA 

GCAACATTGTCAGAAACATAGTTGATTTTAAAGGTTCTTTCTGGTGACTTTGACTTCTGC 

TTTTTTAGAAGACCTTACACAGAGTTGTATTTATTTC 

AGAGTGAAAGGGTATACATTCCMTTTGCGTATGA GATAA MTTTAGTTACATTGAGAAG 

CTATTTTCTTTAGTTACAGGGAAAAAATTGTAGGGCTTTTC 

ATAGGAGGA 

Sequence 1129 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGTGCCACACTGGGCC 

AGGGTGGTCATCCTGAAATACATGTCCAGGGTCTTGTTTGTCTATGATGTGGGTGAAAGC 

TGCCTCAGCCCGCACCACAGTAGAGAGCGGGACCACCTCACGAAAGTTTATAGCAAACTC 

CCAGAGTCTAACCCTGAAAAGCANNCAGGAACAAAGACCTTTCCAGAAAGAAGGACATGA 

ACANACGCTTAAAGAACGACCTGGGCTGCCAGGGTAAGAACCCTCAGGAGGCCGAGAGTT 

ACTGTGCACAGTACCT 
Sequence 1130 

GGCGGCCGAGGTACAAGGGCAGCAGAAGCAACAAAGGCTGTGGCCAGCAACGAAGGCAGA 

GTCCCCGGTAGGCGGAGGTCCCCGTGCACAGTGTGGTGCAGCAGTGAACCCAAGATTTTC 

CTTCACAGGGCAGGGGAACTATTTCCTAATAAGCATTGGGCATTTCAGGCCACAANGGGC 

AGCCATTACTCAGCACTTGCACTGACCTTGGGAATTTGCTGGGCAAGGAAAATGACGCGG 

CCCTCATCATCGATGGCCACACCCTGAAGTACCTGCCCG 

Sequence 1131 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTTTCGTGG 

CTTATGGGTATCTTAGTATAACCTTTTCTGGTTGAGTGAACTGTGTCATTTCAAAAGCCT 

GAAGACATTGTGATGAGTGCTGCCTCCATAATGGCTACATTCTAGGGGCTTTGCCCTGAA 

TCGCATATATTAACTCAAAAAACAAACAGTACCTCGGC 

Sequence 1132 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGAAAACCCTAAAGCAAATGAACA 

GCTTAACCGGATATCACAAAGGCTACAACAATTAACAGAGGTTTCAAGAAGGTCGTTACG 

CAGTAGAGAAATTCAGGGTCMGTTCAAGCAGTTAAACAGAGTTTGCCACCAACTAAAAA 

AGAAGCAGTGTAGCANGTACCCTTGCCCGGGCGGCCCGCTCTAGAACTA 

Sequence 1133 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAACTTAAAGTAT 
AATAATAAAACTAAAAAAAAAGTTATCCATGGCCTAGACAAATCAAAAAAGCATGTGACA 
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CTATATGACATCACATTTCAATTTCTTGTAGAAAGGMCAAGAGTAGTTTAACACCAGCA 

GCATTCATATGCAGAGAGCATCATAAGTACCTGCCCCGGGCCGGCCGGCTCTAGAACTAG 

TGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

n n n n nn n n nnn nnnTGTTTCCTGTGTGAAATTGTTTATCCGCTC ACAAATTC C AC ACCAA 

CATAC 

Sequence 1134 

CCGCGGTGGCGGCCGAGGTACCTACATTCTAAAAGTATAATAAAAATATACAACAATAAA 

MTGTAAACTGTAGGCTATAMGAAAAATAAGMTGTTAAAATAATAAC AAATTGTG C C A 

AGAAGTTATAAATCATCTGGTAGGCAGTCCACTGGTAAAATTAAGAGTTATACATAGAAA 

TGACACACTGAMTAAMCTTTTATTTCTGCCTCATTTCTCAACTAAAACACCAGTGACA 

CTTAGCACAAAGTTCACATGCACAACTGTTCAATCACAGTTTATAAAAGCACCTGAACAT 

AACACAAAGCCAAAGCAATCCTGGACAAATGGACTAAAGCTGGAGACATCACACTACCTG 

ACTTCAAAATATAATACAAATCTATAGTAGCCAAATCAGTATGGCACTGGCATAAAAACA 

GACACATAGACCAATGGGAACAGATAGAAACTCA 

Sequence 1135 

AAC AC AC ATACC ATAG G G ATG AAGATAAG AAAGG AATGATTGTG ACTCTGG ATC AAAATG 

CTCCCAGTTCCTGAAGGTGCATGATTTAACTGTGAATGCCAACTGAGATCCTAACATCCT 

ATACAACCAAAGATGAACTAGTCTCTGAACTGGTAACTCATAAAAGACAAATCCCACAGT 

ACCTGCCCG 

Sequence 1136 

AGGTACCCCCTCTCGTTCCCTTTCAGCTTTATCCGTGCTACATAGGGTTCACGTCACCCA 
TCTG AATGTC C ATCAC ATC A TAACATCAG ATC ATG GTTG AG AG CTG ATTACATGCTAATT 
GCATAAACTCATTGAATTTTCACAACCAACAGACAAGATAGATATGGACTGACATCGCCT 
CCATTACACTCATGGT ACCTGCCCG 
Sequence 1137 

GACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCGATCCT 

AGTGGGACGCGATCCAAAAATATGCCTATTAAAGATAATGCTTTGGTTATGTTTAATGGG 

AAAGTCTATCTGTTGGCTAAAAAGGGGACAGATGTTCTGCCATCACAAATTGGCCAACAG 

AATTCTGTTCTCCTGATACTCCAGTAAGAAAAGACACGTTACAGACAGTGAGTTCAAGTC 

CAGTCACAGAMTATCCAGAGAGGGTTGTAAATATTGTTTTGGCTAAAAGTAAATCTTCC 

CAGATGGAGACAAAATCACTTTCCAATACCCAGCTTGCTTCCATGGCCAATCTAAGGGCA 

GAGAAGAATAAAGTGGAGAAACCATCTCCTTCTACCACAAATCCACATATGAACCAATCC 

AGTAACTACTTAAMCAGAGTAAGGACTTTATTCACAMTCCAATCTTTCCAGTTGGGAT 

TTAGTACCTGCCCG 

Sequence 1138 

CGAGGTACTCTAACCTGGGCAGCAGAGTGGGACCCTGTCTCAAAAAAAATAAATAAATAA 

AAGAAAGAAAAGAAATGATCTCTCCTCATAGTTTGTGTGTAGGAAAAGAAGTGATAAAAC 

TGACCTMTTGMCTTCACCTTTTGCTCAGTGACAGATAAAGCTTTCCTTGATAGCATAC 

ATTGGCTCAGTGCCAGCAGTACCTGCCC 

Sequence 1139 

CCCTGAAGGCCGCTTACTTGGAACCGTATCTTCTAAATGCTCTGAAAA.CCTATCCAAAGG 
CAATTAAAATCACAAATATCATTGAAAACATCTAAGAACTACGTGTAAGCACTTCGGGAT 
AGTCAGGTAMGGCGGCCTCTAATTATGAAGCATTTTGATAATTAATAGAATTCTAAGCC 
A AAGC TTAAATCCGGATACACATCATAATGGTTAAGCCAAATMCACTTMTCATTTTAC 
CT TTTAT TATTGATAAAGACAACACACTGAAGTAGACAGACAGTAAAGAAAAAGTCCACA 
TCI I I I I ATTGGGTCCAAATAACCTGATTGNAAATACTTAGCCTCTGATTTTCATGTTTC 
AGAATTTGTATGTTAATGACCTTAGTAAGATGTGGCTTTGGCCAGCCTCAAAAAGGGTTG 
CTCTN C AGCC AGGGCCCCTTTTTTCC AGTTACTGCTCCC ATCTN CTTTAAGTATATTC AA 
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CTGGTGAGTGGAACTCTTTATCCCGTGTGACAAATGCGTATTGAGGCCCTAACATGCCCC 

CAGACCTCAAGATGTGCTGCTCATACATTTGAAGGTAGCCCTGACTTGTTTNGGGGGGNG 

TGGGGGNGGGCCTTTTCCATCAGTTGTNTTGGANGNTGTGGCCAGCAATCCATTCCAAAT 

TGNCTGAGTTTTCTTNATTTTACTTTNGGA 

Sequence 1140 

AATTGNAGCTCCCCGCNGCGGCNGCCGANGTCAATACTAGAGAAGGGAGAGAATATAGTT 

AGGACMTAAAT^ATTGMCAAGTGArrTTrCTGTGATGACT.^rACCCATmCCAAAAA 

CTAATTTCATTTTTGGTGTGTGTGTGMGAAAGTTTGGAGGCTTCAGTTTCMTCACTGT 

CTCAATACTTAATTTTCCATTATTCACAGAGAAATTG I I I I l CTGGAAATTGGCAAGAGG 

AAGGAAGTAATAATAAAAGATGACTTATGCAGATCCCATGTGACCATCTTACATTCTTAC 

TCTTGTGCCACTGTTTCCCTGGCAGTCCGTTACTGGTTTAAAGAGCACTGGGATATAACT 

CTAGAMTACGGATTATTTTCTCACCTCTGACACTCATTCATGGTGGAGCCCTTACCTTA 

NTGCCAAGGTTATTGGCATTTAACTTTACTATTTAACCTTGCTGAACTACAGTGTCCTTC 

CTCAATGTCAGAAGAAGCTTAGATAGTTAAATAACTTCTAAGGATTATCTATCTGGATTA 

TTAGGATTTTTCATCATTCTTGGCGTGATTGTCACAAAGAAAACAGGACATAGCTAGGTG 

GCAAGTGCATCCTCACTCAAAGTTAC I I I I! I 

Sequence 1141 

TCACTATAGGGCGAATTGGGAGCTCCACCGCGGNGGCGGCCGAGGTACCAGTGAGGCATT 

TCCTGATTTTTACTCATAATATTTACAAAGATTAAAAGAGATAAAATTCCAMTGTGM 

GCTTAAATTGTTCAAATCCTTACTCAAAGAAATAGTGGGAAAGGTGGGTAAGTCAGAGTC 

TGCTGCGTGAAGGAAAAAAATCAAGTTAATCTCTTCTGAAATTATCATGATTTGTTTGGT 

TTAGATTCCCAGAAACANAAAAGANNAACANAAAAAAAGGTCCCT 

Sequence 1142 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGAATGAGCTATAGCCTG 

CAGTGACCAGAAATCTCCAAGTAAGGGAGTATACACTGAATGTCATGCCCTTCATTAAGT 

TAAGGCCTGTAGTTNAAGTTCTTTCAGTATGTGCCATGAACAACAATAAACAATGTCCTT 

GGTGTACTCCTTTGAAAGATGGACATTGAAAGTTAAAAATCAGTCACTATATAGGGCCTG 

TTGATNGGAAAACCCATTTGNGGTTCTTACAAATTGTAAGCNGGAATGCCTT 

Sequence 1143 

CCGCGGTGGCGGCCGAGGTACACCGTAGCTTATTTCAAGTGACACAGACCAGGCAACTGA 
CAAATACCGACAAGTAAATATTGTTCATTTTCCTAGGTAATCTCTATAATCAAGGGGACT 
G A Mil I ATTCTGGAATTTTCCCCAAATGTTTTCTT^ 
TGTGTGCATGGTAGTACCTGCCCG 
Sequence 1144 

CCGCGGTGGCGGCCGAGGTACTCTAACCTGGGCAACAGAGTGGGACCCTGTCTCAAAAAA 

AATAMTAAATAAAAGAAAGAAAAGAMTGATCTCTCCTCATAGTTTGTGTGTAGGAAAA 

GAAGTGATAAMCTGACCTMTTGAACTTCACCTTTTGCTCMGTGACAGATAAAGCTTT 

CCTTGATAGCATACATTGGCTCAGTGCCAGTAGTACCTGCCCG 

Sequence 1145 

AGCGGCCGCCCGGGCAGGTACTCTGACGGAGTTACACACCCCCAGCTATGAAGTAGTCTT 

GCCAAANCAATTGAACATGACATTTCATNAAGCTCTTANAACCGACTATACCAATTTATG 

G N ATAAAC AG AANG ACC ACGTG AAAC AAAAN CTC ATTAAN GTATG CTC ATAN N GC AAAAA 

TTCACGACTTATAAAAAANACTTTTNACTGGGACCAATAGCATCNGCANGTTTTCTTGNC 

ANACAAAAATATTGANANAAANTGTCAATAANGGGGTAATAGNAGGAACANGTANTAANT 

GTAGATAACANNAAATTGGAATTATGGAGGGNGNAGTAGTAAGGAATTCAAAGCGCATNG 

TCTANCAAACCGGTGGTTGGGGAATGGTTTAATTTTTNGGTACCCCCCTTAGATTTTGGA 

AANANAAAACCAAGGTTACNTTAAAAANAA I I ! I I NGNAATTTCCCCTCTNTTTTNTTTC 

CTCTTANTAGGAAAAACAAACCGGCANAACCTTCCAAGGGTTTTTNAANAANANGGATNA 

ATTTTTn I ATTAAGNCCCTTCCNGGTAATTGGAAACNTTTTGGGTNTTTTNTTCCAATT 
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NTCCAATTNTTAAAATTAAAAT 
Sequence 1146 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCMTTATGATTTT 

TAAAAATGCAATGGCATGAATAAATAAATGAGTATACCCATTTGTTGATCTTATTCCCTA 

TTACTTCAAATTTGTAAGTATCCTTAGTGGAAAAGA 

ATGTCTCAACATGATTGGGGTCTACTCAAGGGGGAGGGTGGGAGGAAGAAGAGGAGCAMA 

AAGGATAACTATTGGGCACTGGGCTTAACACCTGGGTGGTAAGATAATATGATCAACAAA 

CCCTTGTTACACATGTTTACCTATGTAACAAACCTTCACATGTACCTGCCCGGGCGGCCG 
CTCTA 

Sequence 1147 - 

CC GCGG TGGC GGCC GAGGTACTTGCAAGAAACCTCAGGACTTGAGTAACAGCAACATGGT 

AAGTTTTCTAAGTTTTCTTTTTGTCTCCCATATACGCTGGGCTGTGCTGGAATCACCAAC 

AGGCACAGAAAAAAATGACAACAAAACAACAACAAAACCCCCAAGAATATCCTGTTGTCT 

TTGGCCAAAGTTCAGGAAAGGGGAGCCCCAACAGAGACCCAGTAGGAAGCTCTAGCCCCT 

GTTTTGTACCTGCCCG 

Sequence 1148 

TNGGAGCTCCACCGCGGNGGCGGCCGAGGTACTTAGCTTTGTCGTCTTTCTTTCCTCCTC 
TTGTT NGAGT TTAGTATTAATAAAAGTTGGACTGAGAAA 

GGTTATTTTTAGTGTAMCGTTTTAGAAGTANAATATACATATAAAAACTGCACAGATCA 

AA TGTGTG CATCTCAMTGGTGTTCCATTTTCAAAATATGAATACATATGGGCAGCATTT 

ATA II I 1 lAAAAAGTCAGAAGGTGCCTCCTCATGCCCTTTCCACTTCTCACTCATTGTCC 

CTCAACCCAGGCAT.AACTACTCTCCTGACCTCCAACATCATAAACTAGTTTCACAAGCTT 

TGAAACTTTATCCAAATGAGTCATACAGGATAGATGTTCACAGAGCCTACTTAGATATTA 

TGGATCCTTGCCAGGTAT 

Sequence 1149 

AGCGGCCGCCCGGGCAGGTACCTTGCAAGTATTAGACTTAGAATAAAACTGTGTTGACTG 
ATATAACTGATGCATGGTTGGAATAGTAGGTTCTTGGGTAAATTAATCCTAACATCAATA 
GCTCAATTCCATCATAGTCCTTGAACAGGCAAAATGTATGCTAATCACAACAGCCGTACC 
T 

Sequence 1150 

AGCGGCCGCCCGGGCAGGTACCTTGCAAGTATTAGACTTAGAATAAAACTGTGTTGACTG 
ATATAACTGATGCATGGTTGGAATAGTAGGTTCTTGGGTAAATTAATCCTAACATCAATA 
GCTCAATTCCATCATAGTCCTTGAACAGGCAAAATGTATGCTAATCACAACAGCCGTACC 
T 

Sequence 1151 

AGGTACTATTTTCAGATGGTAGGGATACAAATATATTrrCCCTATATTCMGGGATTTAC 

AATTCAGTAGCAGAAACAGACATGTAAGCAAATAACCGCAAAACAACGTA1TAACTACAA 

TAGCGTTGTATTCAAAACATTATGGAAAGATAGTAATTAGCAAAACTAAGTGATTAATAA 

AGTTTCATTAAATCTGGCCAGGCAGGGTGGCTCATACCTGTAACCCTAGCAGGTTGGGAG 

GTTGAGGCAGGCAGATCACTTGAGCTCAGGAGTTCGGACCAGCCTGGGCAACAAGGCGAA 

AACTTCTCTCTACCAAAAAAATACAAAAAGATTAGCTAGGTATGGCGCATACCTGTAGTC 
C 

Sequence 1152 

CCGGGCAGGTACCCTTGGTTTCTCAGACAACTCACTGATTTATGGTCTTGAGACCATAAA 
CTCATTTTCCTTATATGAATGACATTTCCACATCCACAACAATACCACCAAATATATGTA 
TCTAGTTCTT ACTAACTGCAAATC CTC AAAGTG AACTG CGTGC ATTTTAATGTTG CGTAG 
TTTGCTGATTTATGATTTCCCTTAATGTACCT 
Sequence 1153 

CCGCGGTGGCGGCCGAGGTACCAACCAGGGCTTTGATTTTCATGCTGTCCCAAAGTTGCA 
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GACATGTTCACGACACAGCTGCGTTTGTTCCTTCATCTTGTTCAGCAACCTTTGGTAAGA 
AACTACCATATTTAAGATTGGTATTCTAGATCCAAAAGAAAAATGCATGTGTTTGCTGTG 
TGTGCATGGGCACATGTATGTGTATGTCTGTCTGTAAGGATAGAGAATGGCTGAAAATTC 
ACAGGTTTTTAAATTCTTAAAAACAGGAACGAAGGAATTCTAACAGCTATGACTCAAAAC 

AAGATTCTTAATAAAG TTNC ATG AC AAN G TAAAAC AAC TAT7TNAATTG AG AATG TTAAA 
TAGCCTTGGTACCTGCCCCCGGCCGGGCGNTCTTAGAACTAGTGGGATCCCCCCGGGCCT 
TGGAAGGGAATTTNGATTNTTNMGCCTTTATCCGAATANCCCGTCCCACCCTTNGAAGG 
GGGG 

Sequence 1154 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGGCAAAAAAGCAATCAGTATGACTTGAA 
CTGCTGGGTTTAATTACTCAATATAACACTTGCCATTTAAAGATCCATATGCCCATCAGC 
ATGGCAACAGTCTCTCATAAAGATTCCGGTATCATATGGCACAATTTGTACCT 
Sequence 1155 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATACGTAACAGGGACTGGACCACTCCTCCCTT 

AGTTTGTTGTCCATTTCTCCAGTCAACATGTATTTACTGAACACCAGTTATGCATAAGGC 

CTTCTGCTGTCTCTCATTAGCAACGGGAGAGTGAACCAGACTAGAAAGGACAGACAGGTA 

ACTGTCTATATCTMGMCTCTTTCTTTTGGAAATCAAAACTCATCTGCAGAGAATTAGT 

CTTGCCTCTGTCCAATTTTCCTCCTTATGGCCCCCAACTTCCACTAAGGGTCATGAGGTT 

GGAGGAGGGCTCGAGGGGCCACTGAGTGAATCCCTCTGCCTAGAATCAGCAGCACACTTA 

TAGGATCCTCAATGCACAAGAACCTGTCAACACANAACAGCTTTCAGAGAATACCTTCCC 

T 

Sequence 1 1 56 

CCGCGGTGGCGGCCGAGGTACACGAGGGAGGCACGTGAAGTCTATCATTCCATGCTCTCA 

AATGTTAACTTTCTGCTTTATATACATAGCAATGCCTCATTTACCTAAT GTTGG GGAGTA 

AGGTATTATATGTGAAAATTTTATTTTAGCTATACCTTTTGTGGAAAAATTT^ 

TATGCTTTGGTTATTATTACTAGCATACTGGCAGTAGTTGAGTGAGGCTATGAGATGATC 

GAAAAATGTAGTTTGNTGTCCACTGCGCTTTTTGAATTATTGAATCTACTTGAGTCGATT 

CAGTAGTCAGTCTTTGTCTTCTTTCAGTGGNGGAAAGCCGCTCTCTTGCTGCTGTTAGTA 

TCCTTGCTGCCCTCTTTTAAATTATGCAGCTTCTTATCTGTGGAGTAGGAGCTTCCTGAC 

ATTGAATGTAAAACAGCTAAAATGGC 

Sequence 1157 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCATACAACATAATT 
XTAAAAGCTTCACTGTTTCAATAAAAAAACTAATATAATCATGTTTA TCTGA CAGACCCA 
TTGCTTTATTCATTATAATCAMTAGAAGMGAGGAAATTTATGGTGTTTTGAGAA 
TGCAGGTnTCTTTTTGTTTTTTCTTTTTGAGGTGTGAATACAGTAA 

AATGCTTG/\AAGGCTTGAAGAAGGGACTAGCGGTGGAGTGCAAGGACAAGAAGAGATGAG 
aagc i rrm I I CTTTTCTTTTCCTTTTGGTTTCAGAGGG AAGAAGCTATGTAAATTGGA 
CTGGGAATTAATATCCAGCTTCAGTCAGAGAATAAGAAATAGCTTGAAAGTACCTGCCCG 

Sequence 1158 

CCGCGGTGGCGGCCGAGGTACTTTTAAAACATTTTTTCCTTGAGAAATCATTGCCTAAAT 

CTCATACATATGTGCTTCTCTGCAGAAAAAAAAATGAAATAAAATAAAAATCTTTATTTC 

CAATGAGCTAATGAGGAAAAAGGGGATGATGAAAAAACAGGGAGTGGGAGCAATTTTGGA 

AATGG/vAAGAATGCTGAATACCTACATGATGCC TMTTCAGAA GAACATTCTGGTATGTA 

GGACATTATTTTCAACTCTATCACTTACGGCAGA1 111 M I I I I ATATGCAGATGTATCT 

CTACTCTCTAAAGATGTACCT 

Sequence 1159 

CCGGGCAGGTACTGTGTATCATCGCAGTCTTGCTTTTTTGAGTAATGGATTCCTAGATTC 
TATGAGGATACCACAACCACTTTTAAAGAGGTTTCTAAGGCCAGGTGCAGTGCTTACGCC 
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TGGGAGACCAAGGTGGGAGGATCACTTGAGCTCAGGAGTTTGAGACCAACTTGTACCTCG 

GCCGCTCTAGAACTAT 

Sequence 1160 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTCAAATGGTTA 
AAATGGTAAATTTTATGTTACATGTATTTTACCACAA I lllll r ATAAGACCAGATTTTA 
AAGAGAATCAAACAGMCCTGCAGAAATAAMGACATAATAATTGAAATTAGTTCGATGG 
ATGAATCAAATAGCAGATTAGCTCAGCTAAAGATGACTGTGAATACACAAAATGTAGCAT 
GGTGATACAAAAATTGAATATATCAAACAGAAGTTATGCAACATGAGGACAGAATAGAAT 
GTCCAACATATCTAMTACCATTCAAGAGGGAMGAAMGAAAAATGGAGAACCGAGGCA 
ATATTTTGGAGGATNTTATTGCTNGNACTTTTCCAAAMCTGCTGTAGTTTCTGGM 
Sequence 1161 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTATTCCCATTTTATAAGCAAAGAGGTAGGAC 

TGGAAAGGTTAAGAGCTTACTCAGCGAGAGAGAGCTGGTAAATAAAGGAGTGAAGATTCT 

AACCCAGATGGAATGTCATGCCGAAGCTATTAGCCACTAGAAGATAATACCTCACTTGCA • 

CATTCTGCTGCTGCTGCTGCTGCACCTGAGGCCTTCACCTGGGCTCGGTGGTGAATGCCC 

TMGATGGTTCAAAMCTGCATGTGAAAAACTGTTTTAGGCCAAGCTATCAACAAATCTT 

CTGGTGATATCAAACCTGGTGGCACAGATAAGAGAMTTTGAMGTGACTTTCAAAAGCA 

GGGTGGAAGCCGTAGTACCT 

Sequence 1162 

AGGTACTCCATAATGGATGTGGGCCAGGGTAAAAGCAGTTGTTAATGTATACAGGAGAAT 

GCTCTC AACGTAAGAGGCTACTCCTAGGTTTAC C ACTAAG AC AAC C AAG AAG AGAG G AAC 

CAGCAACCAATTCAAAAGTTGGGACACCGGGTACCTTGCCCGGGCNGGCCGNTCTAGAAC 

TANGTTGGATCCCCCGGGGCTGCAGGNANTTCGATATNAAGCTTATCNNTANCCGTCGAC 

CTCNAAGGGGGGGGCCCGNTACCCAGCTTTTT 

Sequence 1163 

CCGCGGTGGCGGCCGAGGTACATTTTGAACAATCTACTCTTCAAGTGGATCATTTACAAA 

ACAAGAACATAATTCCCTGAATCAGTAAATTTTATTTTCATGAAAGGATGACAAAGAAAT 

TCCATTTACACTATCACGATTTACATTTTCCTACAATTCATGCAGCTTTCTTCAATTCAC 

AATCTACAAACCACAACAATGCACAAATTTGATGTTAGAGGATACTTTAATGTTTAGTTC 

TCATCTCAACAGTCACTGTTGCTAGTGTCTCTCATTCTCACTCTTCACTGTTACTTAAAG 

AAC AAAG AACTAAAGAAGCCAG AAAGAACTAGATC AAG C GATG AGAC CAATTTTC AG G GT 

AAGGCATMTMCATTAGCTAGTCAAAMCTGGAGTATTTCTATCATTTAATCCTGCAAG 

ACAATAACCTNCTGTTTTCCCTCACTTTCCTTCCAATAACTTGACAGAAGTCTGGGGAAG 

GATGGATCCCAAGCAGAAAACCTTCACAAATGTGAAAGAAAATGGAAATCCTAGCATAGT 

CACATGACTAACTAGAAATCCCAGMCTCTGTCMCMCCTTTGTTTTCAATGCAGAAAT 

CATTTTTGTACCTGCCCGGGC 

Sequence 1164 

ATTGGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACTGATCGTTTCTGCAGCTGCAGATT 
CTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAGTAATGTGCCTTCAGACTTTAAACT 
TTGGAAATGGATTTGCTTTGGCTCTCTGCTTATC r I I \ I I GCCAAATACTGATGTACCTG 
CCCG 

Sequence 1165 

CGAGGTACGTTGTCAGGACATTTTGTAATTCGGTATTCTTGGTCTCCTGGCCCCAATTCT 
AAAAAAAGTTCACGTTTTGATGAAAAAAAGATATGGGCATTATGGATGTTGAGGAGATCA 
CAGCTCAAAAGATTGACAGGGGCCCCAGGGCATATAAGAAACATATGGTACCTGCCCG 
Sequence 1166 

CGGCCGCCCGGGCAGGTACTGGTCTGNCTGCAGAGGCACCAAGATGATATCTTCCTCCAC 

TGTCAGAGTTATTTANGAAAAATAATCTCTGGCATANGCCGAGTGCTCATAAATACTTCA 

NGATGAGACAATAAAGCCGCTCCCAGGTCCTGGCATGGCCCAACCTCAGGACACTGGCCT 
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ACACATTTGCAGGTAGAAGCAAAGGCCTGCGTCAGGCCTGTGGCCAGCTTTTGGCTGTGC 
GGGAGGCTCTAAAATGAAGACAATCTTACTCCAGGTCTCCTTCCACTTTGTTGCCTCTAG 
AATGTCATCTCCCTCCCCTCCTCTNGGAAAGGAGTCACTCCACCCCAACCAT 
Sequence 1167 

CTCTGCTCTCATTTCCTGCAGATTCTCTGAAGCCTATCTGTGCTTCTCAGTATTCTCTAG 

CGGAGTTGAAAAACGCCCCTTAGAGGTGCACAGTTAATTATAGAAGCTGTTAGCTTTCCC 

ATCTGTAGTCATGCCCTGGAGCTAAGAGAAGGACTTGAATACAGAAGAGGAGATGCCCCC 

TGTACCT 

Sequence 1168 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACTCAGAGAATTTCATTGCTACTGCCTGTTC 
GTTAACCTAGCAAGTGAGTAAATTGAGGCTTTAATTAAAAAAAAAAACGGTTGTTGTTCT 
TTGATATACATTTTGACATAGCTCAAGCTTCAACCCTGCCCAGTTCC TATGCT CAGACCC 
C ATCTTCAAG GATGCTCCTCCTGCCTAACTCCTCTCTCAGAATATTG I Mil I'CCTTTCC 
CGTTTAATCCCCAGGCTCAGCTCCTGGGACCAATCCTGGTTCAGGCTCCTGTTGCCTGCC. 
TCATCCCACAGCCGTCCATAATTTGGTGGTTACCGTCATGCACCTGAAGTGGTAGTGAGC . 
ATTTATAGTAATAAAAATC ATAGTCTC AC ACTG C C ATT 

Sequence 1169 

CCGCGGTGGCGGCCGGGCGCCATGGCTCACGCCTGTAATCCCAGCATTTTGAGAGGTCGA 

AGCGGGCGGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACACCA 

TCTCCACCAAAAATACAAAAAACTAGCTGGGCGTGGTGGCGGGCGCCTGCAGTCCCAGCC 

ACTTGGAGGCTCAGGCAGGAGAATGGCGTGAACCCGGGAGGCGGAGCCTGCAGTGAGCCG 

AGATTGCGCCACTGCACTCTAGCGTGGGTGACAGAGCAAGACTCTGTATTAAAAAAAAAA 

AAGTAACTTTGAGTGAGGATGCACTTGCCACTAGCTATGCCCTGTTTTCTTGTGACAATC 

ACCG CC AAG AATG ATG AAAAATCCTAATAATC C AG ATAG ATAATC CTTAG AAGTTATTTA 

AC 

Sequence 1 1 70 

CCGCGGTGGCGGCCGAGGTGCTTrCTTTTTCTTTCTTTCTTCI M M I M IAGCCCTGAG 

ATGTATTTGACTTCTGAACACCTAGAACCCTTTGTTrTCTTCTTTCCTGTATTCCCTGAC 

CCTTTATTCCAGTTCCAGCTTGCTATTTTTGTTTGATACATTTGCAGATTCACTGAAGAC 

CACCCAGCCCATATATAGCTAGCTCTTCTGTGAGATGTGTATTTAACCCTGAAAAAACTA 

ATGCAAGAAAATAGGAGAGATGACCCGAAGGGGAAAGACTCATAGGATTCTTTTCCCCCC 

TTGGCATTTGGCCAGGGATTTTCATACATTCTCNACCACTTAAGTTGAAGANGGCTTGAA 

GCCTGCTTTGGCTTTTAATAATAACCTGGNAACCCCTTGGCCCCGGGGCCGGGCCNGGTT 

CTTAGAAACTTAAGTGGGGNATCCCCCCGGGGCTTGCAA 

Sequence 1171 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAACATCAATAA 

GTCGAGAAAATTATATTAACTGAAAGAAMCA/^MTAATAGAGAATTT^ 

TTCTAATGTTTCTCTTCATGTTTGGAGAAAAGCTGCCACATAATTAAAACAATTCTTACC 

CTGTAAAACTGATTGTCTTCCAATCTCAGGAG3TTTTACATTAAACAGGGAATATAGAAT 

AANGAAACAGGCCTATGGCCCGGGCTCCGTGGCTCACGCCTGTAATCCCAACACTTTGGG 

ATGCCGAGGCGGACGGATCACCGAGGTCAGGAAATCCAGACCATCCTGGCTAACCGCGGT 

AAAAACCTAGTTCTCTTACCTAAAAAAATNCANNAAAANANAAAAGGAATGGAAAGGAAA 

AGAAAAAAAAAGTTACCTGCCCGGGGCCGG 

Sequence 1172 

CCGGGCAGGTACCCTCCAGGCCCTGTG1ATTATTAAGATCTTTTAGTAGCGAGTTGCTCT 
TTCTCTGGGAAATCGGCTGTTAA/vGTCAoAGGGAGCTCTTAATAGTTTGCATGGTATTTG 
ATTAAATGGAACAGTTGGATCAGTACCT 
Sequence 1173 

CCCTTGTTGAACAGGCGATNTNTNNACCATGCNCACAATGAAATCAATACTCAGAGAAGG 
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CAGATAATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTNCCA 

CCAGACGGNGTGATATTTAGTAGCCCNTAAAGCTGCCANGGGGTTGAATGACACTATCTG 

AAGATATGAACCANTTTGNTCTCCATAGGGAGGATTTTATCAACAGGAAACANATGCCTG 

GAAGGCATTGGATTT 

Sequence 1174 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGATCTCACTCAATCTTACTCCCTTGTA 

GAACAGGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATAAT 

TCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGG 

TGTGATATTTAGTAGCCCANAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATG 

AACCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCAT 

TGGATTTGCTMGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTG 

AGCTGACTCAAAGTCAAACCTA 

Sequence 1175 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCNGGTACTGAACTTATT 
TTAAGGTCTCCAATTTATTTGAATATAA^ 

TATGTGTATTCTGATTGTTTATCTGTGTTTGGCATATAATAAAAGTTGATTGAGCTGAAG 

G ACTTA AAMCCTGMNCACACCTTCCATGCTTTTCATTTGCTTTACCTNTGCATGG 

ACTTTTAATCTGTCTCTCCTGTGTGAAACTTCTCTGAAACCCTTACCAGCTGNCTGTGCT 

GCTACTTGTCTGNJGTACCT 

Sequence 1176 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACAGGGGACCGCCAGGGGCCT 

CGAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGT7TCATTAAGTGCAAA 

CACATTGTTCTTMTTTGAAAACTGTGGGCAGAMCAGMGCCCGAGACTMTTTTTCCA 

TT GCTA ACTCTAGATTCTCGGCCACTGGAGTCTGAAGATACTCTCTTTGAGAATGCATAT 

TATTTT GCTCAC AGCTAAMCATTTAAGTATCATAGCTGATCAGTGGAGTGAGATTAAAA 

GGTTTC i Mil iGAATCATCAGCTAGAGATGTGCAAGGGAGACAGCAATGCAGGGTGTGT 

TTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTTAAAAATGTGATNGTTGGTATA 
TCATTC 

Sequence 1177 

CCGGGCAGGTACAAATATTGAAGGGGMGAAAATATAATTGTCCTTTCTATCTTTTTTCT 

CATAAT7TAGCTCTTTCTCCACCACGTCATGCAGCTGAATTGAGCCAAGCATTAGTGGTA 

GGGATAAAAATAATGCCGGMTCACTCTCCCCCATTTCCAACTTCCACTCTCTTCATGTG 

CGCAGCTGCACTTCGTACCT 

Sequence 1178 

AGGTACAATACTAGAGMGGGAGAGAATATAGTTAGGACAATAAATTATTGAACAAGTGA 
rm ICTGTGATGACTATACCCATTTTCCAAAAACTMTTTCATTTTTGGTGTGTGTGTG 
AAGAAAGTTTGG AGGC7 TCAGTTTCAATCACTGTGCTCMTACTTAATTTTCCATTATT 
ACAGAGAAATTGl I 1 I iCTGGAAATTGGCAAGAGGAAGGAAGTAATAATAAAAGATGACT 
TATGCAGATCCCATGTGACCATCTTACATTCTTACTCTTGTGCCACTGTTTCCCTGGCAG 
TCCGTTACTGGTTTAAAGAGCACTGGGATATAACTCTAGAAATACGGATTATTTTCTCAC 
CCTCTGACACTCATTCATGGCGGAGCCTTAACCTTAGTGCCAAGGTTATT 
Sequence 1179 

TACCCAATTGTCCCAAAMCCTTTATTAACAATCAAAATCAACCTTGACTCCACTCAACT 

ATTATTCCAACTTCACCTAAMCTAAATTCACAANGTGTATATGGGTCTATTTGGGAATC 

TNTGTCCTGTCACAC TGGC CTGTTATTCCTGAGCTGAGCACTGTAGCATGGACCANACCC 

CACA CACCTCTCTTCTTTTGCCAGAGGCTCATGGCACTTCTGTAGGTC Al I I II M I CTT 

TTTTGAATATACCCTTGGTACCTGCCCGGGCGGCCGCTCTAGAACTAGTTGGATCCCCCG 

GGCTGCAGGGAATTCGATATCAAGCTTTATCGATACCCGTCGACCTNGAGGGGG 

Sequence 1180 
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GGTACATGGAGGCAAAAAAGCAATCAGTATGACTTGAACTGCTGGGTTTAATTACTCAAT 
ATAACACTTGCCATTTAAAAATCCATATGCCCATCAGCATGGCAACAGTCTCTCATAAAG 
ATTCCG GTATCATATGG CACAATTTGTACCT 
Sequence 1181 

AGGTACATCTCTAGCTGATGATTCAAAAAAGAAACCTTTTAATCTCACTCCACTGATCAG 

CTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATTCTCAAAGAGAGTATC 

TTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAATTAGTCTCGGGCTTCT 

GTTTCTGCCCACAGTTnCAMTTAAGAACAATGTGTTTGCACTTAATGAAACAACCTCT 

ACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGCGGTCCCCTGTAAG 

TACCTGCCCGGGCGGCCGCTCTAGAACTAGT 

Sequence 1182 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGATCGTTTCTGCA 

GCTGCAGATTCTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAGTAATGTGCCTTCAG 

ACTTTAGACTTTGGAMTGGATTTGCTTTGGCTCTCTGCTTATCTTTTTTGCC^ 

GATGTACCTGCCCG 

Sequence 1183 

TGAATNAGGGGATAACGCAGGGAAAGAACATGTTGAGCAAAAAGGCNCAGCAAAAAGGCC 

AAGGGAAACCCGTTAAAAAMGGGCCCGCGTTTGCTNGGCCGT^ 

NTCCCGCTCCTACCTTGTACCGTAGCCATTACACCAAAAAAATTCCGACCGTCTCAAAGT 

TCNAGNAAGNGTTGGGCCGANANACNCCCGAACCAGGGACCTTATTAAAAGGAATACCCC 

AAGGGCCGT 

Sequence 1184 

NGGTGGGCGGCCGAGGTACCTNTCCACAAGCCTGAACATGGAGTAGATGCATGGCACGGC 
CAGATTAATGCAGGACGCAACGAAAGGCAGTAACAGCACCGCCCCAGGGTTACTGTGTGT 
CTTCAGGAACTCTAAGTTGTACCTGCCCG 
Sequence 1185 

ATGCATGGCACGGCCAGATTAATGCAGGACGCAACGAAAGGCAGTAACAGCACCGCCCCA 

GGGTTACTGTGTGTCTTCAGGAACTCTAAGTTGTACCTGCCCG 

Sequence 1186 

AGGTACAAAGGGAATAGATAACAGGGACATTGATCTAAAGGGAGGTTAGGGAGGACTTCT 

TGGAGGAAGTGGTATTTGTGTCTGAGTTATATGGGATGTGTGGGATTTCTTAGGTGAAAG 

AGGGCAGAGCAATACGGTGGTAGGCTAGATCTGTGCAGCCGGCTTATTAGACAAGACTTG 

TTACCCTGCCCGGGCCGGCTCGCTCTTAGTAACNTAGGTGGNATTCACCCCGGGGCCTGC 

CAGGGAATTTCCCATTATCAAAGNCTTATTCGCATTACCCGTGCGAACCTCNTAGGGGGG 

GGGGCCCCCGGGTACCCCAGNCCTTNTTTGTCTCCCNTTTTMGATGMGGGGGTTTAAA 

TTTGN CG NCCG CCTTTG GGCNGTAAATCATTG N GTGCAATAAGACTTGTTTTN CCTGGTT 

GNTGG/AAANTTNGTTTAATTNCCGCCTTCMCCAATNTTCCCACCANCAAACNANTAAC 

NGNAGGNCCCGGGGNAAGGCCATTAAAAAAGNTNGGTAAAAAAGNCCCTGGG 

Sequence 1187 

NTCGGNCGTTCCNGCTGCGGCGAAGCCGGTATCANCTCAACTCAAAAGGCGGGGNAAATA 

CCGGGTGTATTCCCACCAGGAATTCCATGGGGGGAATAAACCGNCAGNGGNAAAAGAAAC 

AATGGTNTAGTCAAAAAAGGGCCCCANNCCAAATAAGGGGCNCACGGTAAAACCCGGTAA 

AAAAAANGGGCCCCGNCGATNTGGCCTGGGGCGG I I I I I I I I I NNCCAATAAGGGCCTTC 

NCCrjGCTCCCCCCCCTTGTAACGGAAGGCCAATTTCAACCNAAAAANAAAATTCCGGAAC 

CNNCCTTTCAAAAGGTTCNAANGAAGGGGTTGGGGNCGNAAAAAACCCCCCGGNACCAGG 

GGGAACCTTAATTTATANAGGGAATTAACNCCAAGGGGGCCGGTTNTTTTCCCCCCCCCT 

TGG 

Sequence 1188 

CGAGCGGCCGCCCGGGCAGGTACCTTGGCAATAAGTTGCTAGTTATCTCAGCCTATAAAA 
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TGTAGGGCATATGGACATTAAGATTATAAATTAGCTTTGGTGTMTCAAATTTTATAATT 
TGTAAACTGGACTATGTTGTGCATCCTTTCATAGTTTAGGATGATTAAGAGTTTGGACCT 
TATAAGTAAACTGCCACATTTTGAATCATAGGCCCCAACTGGGTACCT 
Sequence 1189 

GGCGGCCGCCCGGGCAGGTACTACTGACACTGAGCCAATGTATGCTATCAAGGAAAGCTT 
TATCTGTCACTGAGCAAAAGGTGMGTTCMTTAGGTCAGTTTTATCACTTCTTTTCCTA 
CACACAAACTATGAGGAGAGATCATTTCTTTTCTTTCTTTTATTTATTTAI I I I I I I IGA 
GACAGGGTCCCACTCTGTCGCCCAGGTTAGAGTACCT 
Sequence 1190 

NCACACAACAATACAGAGCCCGNNGTANCCATTATANTNTGTTAAAANGTNCTNGGGGGT 

GCCCCTAAATGGAGTTGTAGCCTAACTCCACAATTTAATTTTGCCGTTTGCGNCTTNACN 

TGCCCCTCTTNTCCNTGTCNTGGTAAACCCTGTNNTTTGCCAAGCCTNACAATTANATGN 

AAANTNTGGCCAAACNGTNTCGTGGAGTAGGAGNGGCGGGTNTTGGCCGGTAATTTGNNG 

CCGCCTTCTTTTCCCG 

Sequence 1191 

AGGCCAGCAAAAGGCCCANGAAACCCGTTAAAAAAGGCCCGCNGTATGCCTGGCCGTTTT 

TTCCATAAGGCCTCCCGCCCCCCCCTTGACTGAAGNCATTCACCAAAAAAATCCGGANNG 

CCTCAAANTTCAANTANGGGTGGGGCGTAAAAACCCCCGGACCAAGGGAACCTTNTTAAA 

AAGGAATANCCCMGGGCCGGTTTTTTCCCCCCCCTTTGGGAAMGACCTTCTCCCTTCN 

GTTGGCCGGCCTTCNTTCACCTTGNTTNTACCCCGMCCCCCCTTGGCTCCNGCNTTTTT 

ACCCCCGGGAATTAACCCCTNGTTCCCCCNNCCCTTTNTTCTTCCCCCCTTTT 

Sequence 1192 

GGGCGGCCGAGGTACCCATCCAGGGCTCCAACATGAACCACACAATGGACCTTAGGCCCA 
TTCCAGCACACAATAACTCAACAAAGGATGCTTAAAACATTAAAAGGATCTTTACTATCC 
TCTTGGCAAAGTACCTGCCCG 
Sequence 1193 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGATGTGGGCTGAG 

AGAGCCAACCAAATCCAAGTGGGGTTTTGTCAGGAACGGTATTGGAAACTCTGAACTAGA 

TAAATAATTCAAAGGATATTAGCTAAAGTGATCAGGGACCAGGACAGGAAGGTACCTGCC 

CGGGCGGCCGCTCTAGAACTA 

Sequence 1194 

GGCGGCCGCCCGGGCAGGTACANGCATCGCTGGTGGTTTCAAAAAACGTAGTCATTCTTC 

TCACTGCAACAATGTAAGATMGCAGGGTAGATCTGTTATTTCCAAATTAAAGGTGATTA 

AGATATATGGAGAGAGAACATGGCATGTGAGGTTTATAGGGCTAGAAACTGCAGAACCAT 

GTAGAACCCACATTTAACTACAGTACCT 

Sequence 1195 

CGCGGTGGGGCGGCCCGCCCGGGCAGGTACCCTCCAGGCCCTGTGTATTATTAAGATCTT 

TTAGTAGCGAGTTGCTCTTTCTCTGGGAAATCGGCTGTTAAAGTCAGAGGGAGCTCTTAA 

TAGTTTGC ATNGGTATTTTG ATTAAAATGG AAC AGTTGG GATCAAGTAC N CTCG G N C CGC 

TCTAGGAAACTTAGGTGGGANTCCCCCNCGGTGCCTGCMNGGAAATTTCCGAATAATTC 

AAAGGCCTTTATTCCGGNATTACCCCGGTGCNNAACTCTACCGGAGGGGGGGGGGNGGGG 

CCNCCCNGNGNTTAACCCCCCAAGACCTTrTTATTGGGTNTCCCCCmATANTANAGTT 

TGGAAGGGGGGGTTTTAAAATTTTGGGCNGNCCGGCCNTTTTC 

CMTTGGNGGTfCCAATNAAGGCCTTTGGTATTTTTCCCTTNGGTGGGTNGGAAAAMTT 
ATTGGTTTTAATTTTCCCCGNCTTTCTACCNAAATTNTCNCNAANCAACCAAACCTATT 
NNCNGTAANTCCCCCGGGGGGANGGCCANTTAAAAAAGNTNGNGTANAAAAAGNCCCTTG 
GGGGGGGNTGGCCCCTAAANTTGGAAGGTTGGGANGNCCCTTAAACCTTCC 
Sequence 1196 

TCGAGGTACGCGGGGGGATGTGGAGAGGACCATGTGAAGAGAGAAGCTGAGACTGAAAAG 
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G ATTTATGTATTAATATTG ACAG AAGCCAAGG AACACCATCTG AAGTTCTGACG G CAAC A 
TCAGAAGCTAAGAGAAAAGGCATGGAAAAGATTTCTCACCTTAGTAGCATNCAGAGGGAT 
GAGGTTGGNTCNCTGCANNACAANCNCTNTGNTNTTATCTTGACCCTTCTNGACCCTTCN 
CNNAAAACCTTGTAGNAGNGNGAAMGAAAMTTTTTC^ 

CCANCAACCAGGCCTTTGGTNGNGGTTACCCCTTGCCCCCCGGGNGNCNGGGCTNCGTCN 

TTCTnAGTAAAACNCTAAGGTTGGNGNNATCCCCNCCCCCCNNGGNCNTTGGCAANNGG 

AAANTNTTCNNAATTAATTTNNAAAAGTCCTTTTATTTCNGAATTACCNCCGNTNCAGAN 

CNCNTTCTGATAGGGNGGGGGGGGGCC 

Sequence 1197 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCAGAAGATATACACCATCCT 

GTTGCCATATATCTAAG I I ! i IAACAAI Mil I CTTAATATCTATCTCCTCCCCAGATTC 

TGAAACAATAGATGCACAAATATGAAACGATTAAACTAACTTGATTAGCTGAAGCCAATA 

ATMCTACCCAAACAAAAGCTGAACACTTAAACCTACTCAGCCATAAACAGAAGAGCTTG 

AAAGCTGCAGGCTTTGTCCCTAAAGCTAGGGCCTGCTGTTGTGGTGTTAACATCGAGAAG 

CTTGAGATAAAGGTGTCTGTCCCATAGAAGGCTCAGTTAGGTAGGCAGTATCATTNGCTC 

ATTTGCTCCAGCCAGTNGGAGGGTAAGATTTTCCATTCTTAATTTTCTCATCCACTAGGA 

TCATCAGGTAGAGAATGGAGAGATTGGGGAACC 

Sequence 1198 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAACTTG 

TGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTAT 

ACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGTAAGCCCT 

TGGTTAAATTTATTGGCTTCATTTATGTATCT 

AGGAATAGAAACTTACTTTTGGAAATACAGTACCTGCCCG 

Sequence 1199 

AGGTACAGGCAAAACACATTTCTTGTTTCACACTTCCTTATAACACCCAGCAAAATACCA 

GGGAAAAATAGCAMTATGAATCAATCTTTAGTAAGTTTTATTCTAMCAAAAGGAGTGA 

CTGATGAATCACATGGNAACAGGAGGGGGATTGMTTGTTTTTG I I I I I l AAAATGGNGA 

TNCAAATTCCAAGGGAGGAGAAGATGAAATTATTACCTAAAACCCAGGCTTCTCCNCAAT 

TATTGNATCCCTTAAMCCTTTAGNCAACCCATTTTTTGCCCCGCCCTTGTTGGAM C^ 

GGCTTCTTTTTNCAAGGMCCTCCGTTCCAAATTGGGGTNGTTGGAAMG NTGG CCTTTT 

TCCTTGGAATGGTTGGTTTTCCCCTTrMTTCAAMCCCCCCACCMTTCCTTTTGGNCT 

TAAGNCCAACCTTCCCCTTAATTAACCTTTTCCTTCCCCTTTTTCAAAAAACCTTC 

AMTTTTGGGGGGTGGGTTNCAACCNCNTTTCCCCTTGGTTAAGGGGGMTTGGCCTTTA 

AACCCCCCCCCAAAAAAATTTTCNCCC 

Sequence 1200 

AGGTACGTATCTAAACCCCGTGAGCTAGGCACTCCCCACCCATCTGCATACACTTTGTCA 

TCTTTGCAACTACAGCGTTTAACAAGGGCrGGGAGATTTCAACTAAAACAGACAGAATGA 

ACTTTCTTTTAGGGATCTTAGMCATTTTGCATGGATAGGTGGAGAATTTGAATAAAATG 

ACCTATAGGGATAACTGTCTTTCTAAAGGTCTAAAAACAAACTGTCCCCAGTTCATTAAC 

CMCATATTTATTGCCACTAGAAATCAAT1TTCAAAGAGTGGTAAAAATGACAGGGACTG 

TAGCCACTGCCCAAAACCCGACTGACACiGGTCTCAGCACTAAATCAAGTAAAATTTAATA 

AGTGTCACTGTCCCAGGTGTGACAGCTAGGATA 

Sequence 1201 

CCGCGGTGGCGGCCGGTGCCACACTGGGCCAGGGTAGTCATCCTGAAATACATGTCCAGG 

GTCTTGTTTGTCTATGATGTGGGTGA,AAGCTGCCTCAGCCCGCACCACAGTAGAGAGCGG 

GACCACCTCACGAAAGTTTATAGCA/'sACTCCCAGAGTCTAACCTGAAAGCAGCCAGGAAC 

AAAGACCTTTCCAGAAAGAAGGACATGAACAAACGCTTAAAGAACGACCTGGGCTGCCAG 

GGTAAGAACCCTCAGGAGGCCGAGAGTTACTGTGCACAGTACCT 

Sequence 1202 
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CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGTATG 

TTGAAAG AGGTAAATAAATGTATG AAAATATAAAG C ATTC AG AAG AC AATTGTC CAT AAA 

AAAGAACCAAGTGGGTATCCTGGAAATTCAAAATACAATGTCTGAAGTTGAGAATTAGAT 

TTGTTACAAACCAGATGGCACACATCAGAAATCTGAATGAGAAAACTCAGAAGCAAGTTA 

ACAGAAAATATCCAAACTGAATCACAGAAGAAGACTAAAAGAGAGCATGGGAGTTATGTG 

AGACACAGACAAAAGTTCCAACATCGATGAAATTGGAATCCTGGAAGAAGAGAGAGGGGA 

AGGGCAAGAGAAATATTTAAAGAAAAAAGTAATTGAGAATTTTCCAAATCTGATGAGAAA 

CTTTAAATCACTGAATCAAAAAGTTCACTAAGCCCAACAGGGTGGACACAAGTGCACACC 

CAAACNTTCCACACAAAACCCATCCAAGTACCTCGGGCCGCTCTAGAACCN 

Sequence 1203 

ITTTTTTTTTTTI NTNTTTTCGAATGACAGCCTANTCTTTATTGCAGG Mill I ATTTTA 
ATTTAATACCAGAGAGCCCTAAGCCAGGATTTATTTTTCCCATATGAAAATATTTNGGTC 
TGACTCTGAAGTCTCANANCGGTNGCTGAATTCTTCAAAGTGAAGAGTGAAAAGCTTTGT 
AAGTTTCTTCAAGGCATTTCATGTGTCATTGTTTTNTCCTCCCAATAGANANTGAANGTA 
ATGGTGATGTCCNCCAGCACATCACCTGGATTAAATTTNTGCTGNGCANGAAATGCATCN 
GCACCTTTACCGTNACCTAATNAATTTAACNGCCCAGGCCCNTGGGNAANCAATNCCNGC 
AAGGTTCNCA I Nil ICTTNACCAAGAANGTTTNTTACTTTTCAGATCTCGATGGGGCAA 
ATTGNTGGTTTGCCTTGAAGTNCCCTNGCCGNTTCTANAACTANGNGGGATNCCCCCGGC 
TGCAAAGGAANTCNAATTTCTAANCNTTANTNNANTNNCCGGCAAACNTGAGGGGGGGGG 
C 

Sequence 1204 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAGACTTTATTCTCTATTTCAGGGGCCTTGTT 

TATTAAGCAGACAGAACTCTGCTTGATTATTCCATGAAGTCGCCAACCATTAATCATGCA 

CATTTGTTTAACAGGTCAAATGAAGAGAGGAAATTTGTGTGCAGTCACGGCAAATACTGC 

CTCATTTTACCCACTGTGATACTAAAGGTACCT 

Sequence 1205 

CGCCCGGGCAGGTACCTCTAGGCAATTCATTACAGTAGCATTAAAAGACACCAAGGCCAA 

ATGCTTTGGGGTCACTCTTCTAAAACACCGTGTCTAGACAAGAGTCATCAAGCAATGATT 

GCTTGACTGATTTTTTAAAATTATAAGTCACACAMCACACAGTCATCTAATAAAAAAAA 

ACTGATTTTCTTAAGATTCTCAGAGTGGTAACACTCACAACTGTCATGTCAGCAAGTGGA 

ACTTTCTTCTAAGTAMCATCAGAATTCCMTCAGCCTTTGAACCACTGCAAATATTTCA 

CAGTCTCAAAATTCACCTACCTTAAAACCAAGCGCGTACCT 

Sequence 1206 

ATTGGACTCCACCGCGGTGGCTGCCNACGTACCATGTCTGCACCAANAGTGGCAATAAGG 
AGGAGGTGGCATGTGGTGGACNGANGGANAGCCANA1TCTGTTTCAGGTTACANCATGCA 
AANAAGGGCGATGAGAATNTTTGCATTATACATA 
• Sequence 1207 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACCACTATTTGAGTTCTGCAGTTGGCCAGGGTT 

GTGGGAGGGGCCTCAAAGTTAAGTGGAGACACTGAGATGGACAAGACCAGATGCTATGCA 

CAATGGAAATGCATGGTTGAAGTTTTCCTTCCTGCCTGTGTGGGGCCTTAGGGCCACAAG 

GGCAGCCATTACTCAGCACTGCACTGACCTTGGGA/\TTTGCTGGGCAAGGAAAATGACGT 

GGCCCTCATCATCGATGGCCACACCCTGAAGTACCT 

Sequence 1208 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACGATCTCTTTTGTGGTCTTCTATGTCAACT 

CTCAAACTAATCTCCTACAACTTTTCCACCCCCCACGCACCGTGCCACAAATCATCTGTT 

ACCTAACACATMGTGACAACACAATTCTGTTCCTGCCATTTTCTGCTGAAGATTGTTCC 

AATTTACTTCTCTGTTGACATATAAAGTAGCCTTTAAAGCTTTTAAGGTCTTTAA^ 

TAAAGTAGCTTCAACATGATTTTGATTCTGC \ I I I I ATTCTCATTAAAATGAAATATTTC 

TGGGATTGAACAATGAGAGNAAAATAATAGAAGAGTAGGTGGTGAAAAATTAAAAAAGTA 
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GGGCATAAGCAAAGTNGTCCCTATTGGTCACAAGACCGTCAATTAAANGGCAATGAAAAA . 

TACAGAAGCNTGAAATGGTGGAATTTA 

Sequence 1209 

CCGCGGTGGCGGCCGAGGTACTGATGTGGGCTGAGAGAGCCAACCAAATCCAAGTGGGGT 
TTTGTCAGGAACGGTATTGGAAACTCTGAACTAGATAAATAATTGAAAGGATATTAGCTA 
AAGTGATCAGGGACCAGGACAGGAAGGTACCTGCCCG 
Sequence 1210 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACCATAGATAT 

CCCTGGGGGATGTAGTGGATTCAAAACATGAGTTGGATTCTCCAACGTGCCATCGGAATT 

CTTCTAATATTCANCAGCTTTGATTGTTGTTTTACATCTGACCTGTATTTAGATTTCTCT 

GATTCCTGGAATATTTGTGGAAAATAAGTACCTGCCCG 

Sequence 1211 • 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATGCTGTTTGGATTACTACAGC7TTGTAGTA 

TATTITGATGTCAGGTGGTATAATGCCTCCAACTTTGTTTTCTTTGATCAAGATGGCATT 

GGTGCCAGCCTCGTGGCTCACACCTGTAATCCAAGCACTTTGGGAGGCCAAGGCAGGTGG 

ATAACTTTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTA 

TCAAAAAATACAAAAATTGGCTGGGCATGGTGGTACCTGGAMCCCAAGATTCTTGAATT 

TACCTCCTGAATCTTCATTGCTTGTMGCTGAGTTTCTCAGCTCTACTGTAACTTGCTTC 

TTMGTTTCTAGTTATAAACTTCAGGTTGTAGACCCTGCCTTACTTAGTTGCAAAGAAGT 

TGTGAAAATGAGTGT 

Sequence 1212 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAACAAAGGATCTG 

ANAAAAGTTGGGAAAGAGTANCTTGGGTCAGGAAAGGAGGAAGGTAGGTGGCACTATGGA 

AGTTGACCCTGTCAAGAAGAGAATGGTCANAAATCAGGAAGGGAACTTGTCTTCTAAAGA 

ATAGGTGGCTAANAGAAAGTAACAGTGACCTGAGGTGAAACCAAGTGGGTAACCATTGTA 

AACTTGTGAGTGCTGGACTCTGACAGCNCCCTGGGGTGTGGCTGTACCTGNCCGnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTAGGGAATTCNANTATAAANNTTTATCAA 

TACCGTCTNACCTCTTNGGGGGGGCCCNGTTACCCANCTTTTTGTTTCCCTTTTA 

Sequence 1213 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGGAAGGTTATTGC 

ACAGCTCCTTAACTGATCGGGGTCAGGGCAGAGTGGTCACTTTCCCCACAGCCAGGCTCT 

GGATTTGCCTCCTGTGAAGACACCATGCCTAGCACAGGCTGACGGGGCGGCTGCAGTCNA * 

ACCTTGCCTCCAGATTNATGAAACCAGTTAAGTAGCACAATTTCTCGTTGGCTACTTTCA 

CTTCAAGAGTGTCATGTTTATTGATGTGGAGCT 

Sequence 1214 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTATTGTCTCTCATCTTGTCCAT 
CATGTTGTTTCTCC I I I I I AGGCGTATATATATCTATGGAAGAAAAAAGTTTAAATAGAA 
ACACTCATTTGATGAGGMTTTTGGTGTAAAGAAAGGGTAAAAATGGGAAATKNATAGGA 
AAAAAAAGTTTNCT 
Sequence 1215 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACAGCAACTTC 

CTTGATCCCTGCCACGCACGACTGAACACAGACAGCAGCCGCCTCGCCATGAAGCTGCTG 

ATGGTCCTCATGCTGGCGGCCCTCCTCCTGCACTGCTATGCAGATTCTGGCTGCAAACTC 

CTGGAGGACATGGCTGAAAAGACCATCAATTCCGACATATCTATACCTGA/\TACAAAGAG 

CTTCTTCAAGAGTTCATAGACAGTGATGCCGCTGCAGAGGCTATGGGGAGATTCAAGCAG 

TGTTTCCTCAACCAGTCACATAGAACTCTGAAAAACTTTGGACTGATGA1GCATACAGTG 

TACCTGCCCG 

Sequence 1216 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGG 
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GCTTCTATTTCGACCGCGATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGMT 
TGGCCGAGGAGAAGCGCGAGGGCTACTGAGCGTCTCCTGAAGATGCAAAACCAG 
Sequence 1217 

AGGTACCCAAGTGATGTCATCTCCCCATCCTCTTGAGAGTGTCTGAGGAGGCCTCTTTTC 

CTTTTTTATTGCAATGGCAAGGTTGGAAGAAACTGTGACNAGTAAGAGGCAGAGACCCAN 

AGCTGAGTGTAGCATCATGTCTTCTAGAGACTCTCNCNNAANAAANCTGACTTNGGCCAG 

TGCTNTGGTTGGAAATGTATTCTGGATCCCCGCAGTACCTGCCCGGGCGGNCGNTCTAGA 

ACTAGTGGA 

Sequence 1218 

GCMTTCTNCAGTGGGAGTAAACCACAGACTACGAGGGAGTTTGACAGCTATTAAAACCA 

GGGCTCCACAGTTAGGAGGTAGCTTTGCAGTTTGGGGAGGGCTG7TTTCCATGATTGACT 

GTATTATGGTTCAAGTCANANGAAAGGAAGATCCCTGGAACTCCATCACAAGTGGTGCCT 

TAACGGGAGCCATACTGGCAGCAAGAAATGGACCAGTGGCCATGGTTGGGTNAGCCGCAA 

TGGGTGGGCATTCTCCTAGCTTTAATTGAAGGAGCTGGTATNTTGTTGACAAGATTTGCC 

TCTGCACAGTTTCCCAATGGTCCTCAGTTTGCAGAAGACCCCTCCCAGTTTGCCTTCAAC 

TCAGTTACCTTCCTNACCTT 

Sequence 1219 

CCGCGGTGGCGGCCGAGGTACAGGTCCGAAAGTATGAAGAACACAGAAGGCAGGCCAGGG 

GCACTGTGAGATGGTAAAAGAGATCTGAAGGGATCCAGAATTCAAGCCAGGAAGAAGCAG 

CAATCTGTCTTCTGGATTAAAACTGAAGATCAACCTACTTTCAACTTACTAAGAAAGGGG 

ATCATGGACATTGAANCATATCTTGAAAGAATTGGCTATAAGAAGTCTAGGAACAAATTG 

GACTTGGAAACATTAACTGACATTCTTCAACACCAGATCNCGAGCTGTTCCCTTTG 

Sequence 1220 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTGGACGGCGTGGAGGTGCAT 

AATGCCAAGACAAAGCCGCGGGAGGAGCAGTACTCAGNTAAAATATTTCTTCATAAATAG 

TTACAAAACCTGCCAAAACACAAAGGGGAMGTATTmC^ 

AATGTNACNCAACACAAGAAACAATGCTTAGAGACATGTTATTGTTTCCAGAAGAAAATG 

GTCAGTAAACTACTTAGCTGAAGACAAAAG 

Sequence 1221 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGTCGCACTCATTTACCCGGAGA 

CAGGGAGAGGCTCTTCTGCGTGTGGTGGTTGTGCAGACCCTCATGCATCACGGAGCATGA 

GAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGTGAGCTTGCTGTANAGGAAGAA 

GGAGCCGTCGGAGTCCANCACGGGGAGGCGTGGTCTTGTAGTTGTTCTCCGGCTTGCCCA 

CTGCTCTCCCACTCCACGGCGATGTCGCTGGGATAGAAGCCTTTGACCCCCGCGTACCTN 

GGCCG 

Sequence 1222 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATACCTCACCGTGGCTGCTGT 

CTTCCGTGGTCGGATGTCCATGAAGGAGGTCGATGAGCAGATGCTTAACGTGCAGAACA/V 

GAACAGCAGCTACTTTGTGGGATGGATCCCCAACAATGTCAAGACAGCCGTCTGTGACAT 

CCCACCTCGTGGCCTNAAGATGGCAGTCACCTTNATTGGCAACAGNACAGCCATCCAGGA 

GCTCTTCAAGCGNAT 

Sequence 1223 

CCGCGGTGGCGGCCGAGGTAC I ITTTTTTTTTTTriTTTTTI J GTTGCAGTTAGTTTATT 

GAAATGGAAATGAATTTGAGGTTTCAACAGTTGTCTAGCAGTTTCCACATAAAAAGACAA 

CACANGAAAGAAAAGTGGTTTCTAGCAATCAACAGATGAGTTCTGGCCATAGTCTGTAGC 

CCTNTGAGCCAAACGCCTTGGGTAAAGTTMTTACTCTTCATA7TACACCAAATGCTGTC 

GTACCTGCCCG 

Sequence 1224 

CCGCGGTGGCGGCCGGAAGGAGGATGGTATCACTCAGGCTCTCAGGGTGACACTGAAGCA 
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AGACACTCATGGGGTAGGACATGACCCTGCCAAGGAGTTCACAAACCACTGGTGGAATGA 
GCTCTTCAACAAGACTGCGGCCAACTTGGTAGTGGAAACTG.GGCAGGATGGAGTACCTGC 
CCG 

Sequence 1225 

AGGTACACTTTTGGCCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCT 
GTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 
CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 
Sequence 1226 

CCGCGGTGGCGGCCGTGGTACTCCATCCTGCCCAGTTTCCACTACCAAGTTGGCCGCAGT 
CTTGTTGAAGAGCTCATTCCACCAGTGGTTTGTGAACTCCTTGGCAGGGTCATGTCCTAC 
CCCATGAGTGTCTTGCTTCAGTGTCACCCTGAGAGCCTGAGTGATACCATTCTCCTTC 
Sequence 1227 

AGGTACTTTTATTAAAAAATAGTCACGCAGACAGTGCCCTGGTGGCTCTGCCCCGCATCC 
CAACTCTGGGGTGGGGGAAAGGGGTCAACGTTTTCGCAGCCCCAAACCGGGCCATCACTT 
GCCCACCGAGTCGAATATGATGCGGTTCTGCTCGGCGCGCTCCCGCGTACCTGCCCG 
Sequence 1228 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAAGTTCCCTATGGCTTATGGA 

GAGTTATTTATTAATTMCTTTATGGTAGGGCTAGTATGAATACCTTTTTAACAAT^ 

TGCTATTACAACAATGAAGATTCAAATGACTCCGCTTTGAAGGATGTTTTCTCTATATGG 

TAAAAT ATATATG AAG AAGTCTTG ATTAC N GTGG AAGATC ACTTG ACTCAG AATACTTCA 

ATGTATTTTGTTCACATTACCACTAAGCATATTATCAGTAAACTATTAACTGACTGCACA 

TTATGTAA 

Sequence 1229 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCTTGTGGATAAACGCTTCTAGTTCTTGG 

CG TG TTTGG GTTGC TTTTCCTCC TAAAAACTGAATG G AAC TAG AAC TTTC TAG G AAC C CT 

TTCAAGCTGCAGTAATGATTTACTGAAGCAGAAAGAACATGGTTATTTTGAGTCTCACNA 

NAATAAAANANANATAAAAGTNCANATTTNCANCTNTTTGAATCAGGTTCAGGAAAAAAG 

CAAACATGCACACCTAAAACTTAAAAGACCGTTTCATCACTATGCATACGTTTATCATTT 

GACATCCATCAACTGTATACCTGGTTTCAAAAGTAAATTTAACTTGTGATCTCAGCATAG 

CTCATGTNCACATTTCATGCANAGGGTCCAGGAAACTCAAATCACACTTTGGTAAGTCAC 

C ATGTTC ACTC ATTTTT . 

Sequence 1230 

CATTAGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGCCAGGTACTCCATCTAGTC 

TGGGTGACGGAGCGAGGGTCCATCTAAAAAAAAAAAAAGACATNGAAGNNCNGNAGAACT 

GAGTTAGAMCCTTCTTCNATTCCGTGTTTGGACCTTGGGAAGGCTGTTAAGCCTCCTGG 

TGCCTAGTTTTTCTCATCTGTAAACAGGCNTMTNMCACCATTTGCACATGGTTGGTAC 

CTN 

Sequence 1231 

AGGTGCGGTATCCTTGCACCTTGCTTCTCATACTTCAAGCTACTGCTTGGGTGGGTCCCG 

TTGTAAACTANATGAACACTCATCTGACTATGGATATCATCACTCCTAGATACTCCTGCT 

CATATGATCCACTTACCAGAGNNTCTGTNTGGACATCTTTTAACCAAGCTAGTAGTAACT 

GCCCG 

Sequence 1232 

CCGGGCAGGTACCANGCTGTAACCAATACGATTCTGGGGCAGGTTGrGGGCGAGTAGAAG 

AACCTCCTTCCCCTCTGCGACATTGAACGGCGTGGATTCAATAGTGAGCTTGGCAGTAGT 

GGGTGGGTTCCAGAAGGTTAGAAGTGAGGCTGTGAGCAGGACCTCCTTCCAGGGGACATG 

CAATCTGCAGGGAGGGGCTGAGGGGGGTCCCATGGTCTCTGCTGTCTTCTCTGTCCACCT 

CTTTGTAGAGGAGCTTGAGCTCCAAGGAATGCTCTGGTCAGGGCTGCTGTGACTGTTGGC 

CCTGCTGTCCTTCCTTC7TCTTGTCCCCGCGTACCTTCGGCCCGNTTCTTAAAAACTNGT 
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GGANTCCCCCCGGGCCTTGCAGGGAATTTTCGAATATTCAAAGCCTTATNCNATTNCCCN 

GCCGAANCCTTTCNAANGGGGGGG 
Sequence 1233 

TGGCCGAGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGC 
AGTTCCAGATTTCAACTGCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCACGCCA 

CAGTTTCGTTTTGATTTCC 
Sequence 1234 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACGGCCTCAATTTCTCGGGCAGCATCAGCTC 
CACTTTATCGAGGAGCCGGTGGTAGGTGGCCCGCAGGCCTCGGGCACCGGCGGCCGACAT 
CGCCGCCGAGGGATCGTTGGCAGCCGGGTGGGAGCGTGGCTGTGTTCTCGTCCCTGGCTG 
ACAACGAAGGGGAGCTAGTCACTTTTCCTGCCACGACGACTCGCGTCCGCCTCTCGCCTG 

GAGTACCT 
Sequence 1235 

CCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTTTTTTTTTTI I GNGTGGAGCATGGG 
ACCTTTATTCGTTAAGACATCAGGCTCCAGATATGAACTTTCAGCAAAAGCGCTTGCCGG 
GAGCAAAGGGACAGAAAAGCTGAGATGAACAGTGCCTGGCAGCAATCACAGCCGGGCAAG 

GGTGCTCCGAGCCTCGCATCCCC 
Sequence 1236 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGCAG 
AAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTATC 
CCAGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnTTTGCAAGTTTATTACTGTCTCCAGTATAATAACTGGCCTCCGTACCT 

Sequence 1237 ^^^.^ 
CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGAACTAGCTCCTTCTGGTTAATTTGTTGAT 

TGGATTGGAAATTAGAACATGGAGCTGGTCAATGCACGGTATCTGGTAATTGTGGATGGG 

GGAGATGACTGGGGCAGAACTGAGCTCTATTTTTGCCAACATAGTACCT 

Sequence 1238 

GGCGGCCGCCCGGGCAGGGACTTTCTTCTGTGTGACAATTACTGCACAGTCTTTCCCTCT 

GACAGCTACTGATGTAAGGCCACCCTGGTTAATAGCCTTAAAAGCATATTCTACTTGGTA 

GAGCCGACCCTCGGGTGAAAAAATGGTAATGTGGCGGTCAAAACCGGCGCTGGAACCACG 

GGACATGTTGGTACCT 
Sequence 1239 

AGGTACACTTTTGGCCAGGGGACCAAACTGGAGATCAAACGAACTGTGGCTGCACCATCT 
GTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 
CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 
Sequence 1240 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTGG 

GATAGAAGTTATTCAACAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTCATCAG 

ATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGTCC 

CCTGGCCAAAAGTGTACCTGCCCG 

Sequence 1241 A ^„_ 
CCGCGGTGGCGGCCGAGGTACAACTAAATATTAGCTAATGGTAGCACAAGTAGAACCTGT 

TTAAAATTTTTAAAAAACAACCTGTTCACTGTAAGMTGAGATACCCAAGGAAAGAACAG 

GAAAGAGGTGTATTTTTMTCCTGCCTTTTGTATTTAGTGTGCCCCATGACTAGCTTGTT 

CATCTAACATGCTTCTTGGTTCTCGGCAGGTGAATGGCTATGTCGTGAACCTTTTCTGGG 

CATTTGACACCCACAAGCAAGAGAGGAGAGCTTTGAACTTNCGAGGAAAGCATATTGTCA 

CACAAAGGTAACACTTTTTGGTGCTGGTCAGTGTTGTGTGCCAGGAAGGGGGTAAANGGC 

TNGGACCCAATNCCTTTTCCCCGCCCGTAANAAGCCTTTGAACTNGGGGGGTTCCCCAGG 
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GGAAAAGG 
Sequence 1242 

CACTACTATAGGGCGAAATTNGAACCTCNCCGCGGNGGCGGCCGAGGTNCGGGGGCCAGT 
TATTATACTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACTAGTGGCCCTGG 
TGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCG 

Sequence 1243 . 

CCGGGCAGGTACGCGGGGGAGGAACTGCTCAGTTAGGACCCAGACGGAACCATGGAAGCC 
CCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATACCACTGGAGAAATAGTG 

ATGACG 
Sequence 1244 

AGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCGCGATGAT 
GTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAACTGGCCGAGGAGAAGCGCGAGGGC 
TACGAGCGTCTCCTGAAGATGCAAAACCAG 
Sequence 1245 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACAGCTATTCAACATGTTTAAACAGATGTTTGC 

TGACAGCACTGACTTCTCCAAGAGCCTAAGAGAGGAGAGCAGCCAATTAAACTCCACTGT 

CTTCTAAAATTTTAATCCTAGATTGGTTTCAGATGGGTTAAGACAGTCATTCAACCCACA 

AGTATTTAAGGAAGGCACCTGGGACTGACTCAGTTCCTAGATTTAAACCTATTCTGTGAA 

CTGCCCACCCCTACCTTCAAAGTACCT 

Sequence 1246 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAGTTACAGCCCTTATGACATGTTGGAAAG 

CATCAGGAAAGAGGTTAAAGGAGACCTGGAAAATGCTTTCCTGAACCTGGTTCAGTGCAT 

TCAGAACAAGCCCCTGTATTTTGCTGATCGGCTGTATGACTCCATGAAGGGCAAGGGGAC 

NCGAGATAAGGTCCTGATCAGAATCATGGTCTCCCGCAGTGAAGTGGACATGTTGAAAAT 

TAGGTCTGAATTCAAGAGAAAGTACCT 

Sequence 1247 

TCGCACTCATTTACCCGGAGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGACC 
CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGT 
GAGCTTGCTATAGAGGAAGAAGGAGCCGTCGGAGTCCAGCACGGGAGGCGTGGTCTTGTA 
GTTGTCCTCCGGCTGCCCATTGCTCTCCCACTCCACGGCGATCCCGCGTACCT 

Sequence 1248 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGGGAATAGA 

JAACAGGGACATTGATCTAAAGGGAGGTTAGGGAGGACTTCTTGGAGGAAGTGGTATTTG 

TGTCTGAGTTATATGGGATGTGTGGGATTTCTTAGGTGAAAGAGGGCAGAGCAATACGGT 

GGTAGGCTAGATCTGTGCAGCCGGCTTATAGACAAGACTGTACCTGCCCG 

Sequence 1249 

CCGGGCAGGTACGCGGGGCCGCAGTAGTTGGAGTCTAAGGACTCGTGACAATCTTCGGGT 
GCCCTTCGAGAGAAAAGGGGAGGATGCCACTGGAGTCATCCTCTTCAATGCCACTATCCT 
TCCCATCTCTCTTACCCTCAGTACCT 
Sequence 1250 

CATACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCACCATGTCCGCCTCGGCT 
GTCTTCATTCTGGACGTTAAGGGCAAGCCATTGATCAGCCGCAACTACAAGGGCGATGTG 
GCCATGAGCAAGATTGAGCACTTCATGCCTTTGCTGGTACCT 
Sequence 1251 

ATCACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGACATCATTT 

TCAAGAAGTTTTGCCATAAAAGTGAAAAATGGTAATAGCTGAAAAAATTTAGCGTTGACA 

GAAGGCTTTTTTTAAAGACAGGAAAAATTATGGCGCAGAGGGAAATGCTGATGATACAAG 
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ATGGAGAGAGAGAAATTACTGGAGTAATGACTCTGAGCAGATGTGGATGGCATTAATAA 
Sequence 1252 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTACTGGCACTGAGCCA ATGTA TG 
CTATCAAGGAAAGCTTTATCTGTCACTGAGCAAMGGTGAAGTTCAATTAG GTCAG TTTT 
ATCACTTCTTTTCCTACACACAAACTATGAGGAGAGATCA7TTC 
TATTTATTTTTTTTGAGACAGGGTCCCACTCTGTCGCCCAGGTTAGAGTACCTGCCCG 

Sequence 1253 

CCGGGCAGGTACAGTAGTCAGCTACTTGGATATCAAGACTAAGTGCTGATATGAAGACTT 

GCCTTTGTGTTTTGTTATGAAACACGCAAGCATAAAGCAGGACTCAAGCACAAGGCTGAC 

TGTTCTACTTTGAATAACAGCTTCCTTGCAGTCTCCTCCACATGGGTGGTACCTCGGCCG 

CTCTAGAACTAGT 

Sequence 1254 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAACTTTTGGAAACACCGTCCCTTCAAATGC 

TACCTTACTTAAAATATGTAGACATTAGTCCATCCAACCTCTGCTACCAGGCAGTAACCT 

TTCATAGATCTTCCAAAGAAAATGTTCTTGATAAAACCAGAAGAATCTTCTCAGAGAGAC 

CCAGGGGAGTGATCTGACCCTCACTGCTATCTTGACTTAAGTTTTAGCTGCAGCCATGGC 

GACTCAAGTACCTCGGC 

Sequence 1255 

NCCGGGCAGGTACTAACCCAACAATAGAATTACCTATCCNTAACTCGTTAGCCCAACACA 

GGCGTGCTTTMGGAAAGATTAAAAAAAGTAAAAGGAACTCGGCAAACACGAACCCCGCC 

TGTTTACCAAAAACATCACCTCTAGCATAACAAGTATTAGAGGCATTGCCTGCCCAGTGA 

CTNAAGTNAAANGGCCCCAC 

Sequence 1256 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAATTTACCAGTATTTTCTCTTACGGCTTT 
TGGATTTGACTCATAGTTGAAAAGGTCTACCATGTTCAAGGTCATTAAGAAAGTATCCTG 
TTTTTCTCCTAGATCTTGGGTAGCTTTTGGTGTTGACCTTTACAl Mil lATTCATTTGA 
AATTTGGCCTGGTGTAAGACAGGCTTTCCCCTAGAATGCTGCCCAGGTAGCTCAGTATCA 
AATATTGAACTATCCCTATTTTCATTCCACTCTGATTTATAAT GCMT TTTTATCCCAC 
TATATTTGG ATCTATTTCTGAAC 1 rTTTTTTTI I AACATAC ATATT TTTACACGTTCCAA 
CTCTTAGAAATGTTCCATGCTGTTTCTGTTTTTANGCTCCTTTTANGTACCT 
Sequence 1257 

CCGGGCAGGTACTCTTGTTTAACCATCAGAGGTGATTCCATCACCTTCACAGCCCCAGCC 
TCTGCTCCAGTCCCTCCCCAGCGAAAAGGGCCGCCCATGCCATCCTGCTGCT GGTGA TTT 
G CTTTGTGGTCATGGACTCAGTGGAC ATCATTATTTTATTAAC C GTGTGGTAG GTTTTTA 
ACTCAGTTATCCTGGATATCCAAAGGTTTGTGGTCCATCTTTAGGCTTCCGTTTGTTCTT 

TGGTACCT 
Sequence 1258 

CGNGGTGGCGGCCGAGGTACAAGATGCACTTGGTCACCCGGTTGTAGGCTCCAGGCTCCA 

TGAACCAACAGAAGCTTCTCTAAGTAGGAAAACTGTGCGATGGCGTAATCCGAGTTGTTG 

GTGGCCTGCATGCCTTCATTCCCACTGATTCCCACACCCACGTGGGCTGTCTGGATCATC 

CCGACATCGTTGGCGCCGTCTCCGATGGCGAGGGTGATGGCCTTCACCCGCTTCTTCACC 

ACATCCACTATCTCAGACTTCTGCAGAGGAGACACTCTGCAGCATATGACCGCTTTGCAC 

GAGAGTGCCAAATCCAGGAAACTCCTCCGGACTTCGAAGGAGAGCGCGACCTGCCCGGGC 

GG 

Sequence 1259 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCGGTTGCAAGCACAAGCA 
AATGTGCCAGGGTGGTTGATGCAGCTGTGGTCACAGGTCCTATCCAAAGAGCACTCATCC 
ACATCTTGGCAAGACTTCTCATCTGTTMTAATTTAAATCCTTTCTT 
Sequence 1260 



Page 169 of 63 5 



WO 01/51628 



PCT/US01/00798 



Table 1 



AGGTACI lill I I GTTTG I nTTTTTTl I CACCTTAAGGGAGGATTTAATTTGCTCCCAA 

CTGATTGTTCACTTAAATGAAAATTTAAAAATGAATAAAAAGACAT 

Sequence 1261 

AGGTACATTATATACAAATATATATACGATGCATATGTAGATGTATACATGTATGTATAT 

ACTGTTGTATAGAGATATGAGTATACATGTATGTGTATATCTCTTTATATAGATATAGGT 

ATAGATATAG ATATAATC ATATAGGTTGTATAGC GTTTCC AAG G CTCTTTC ATAAC TAGT 

TCTCAACTGGGAGCAATCTTGCCTCCCATGGAACATTTTGGCAATGCCCAAAGATATCTA 

TATTTGTCACAACTGAGGGGGAGGGTTTACTAGTGACAGGTACCr 

Sequence 1262 

AGGTACTTTGATCTTGGACTTCCAGCCTCCAGAACTGCAGTGTTGGTCTGCAAGCTTCAA 
GAGCCAGTGCCCTGACTGCCAAGTGATTTGCCGAAGGGAATTATGGGGAATTCGCAGGTA 
TTCCTGATCTGCTATGTATTCCAGAAAGGCCATGGGCAAAAGGAAGTACCTGCCCG 
Sequence 1263 

AGGTACCGATTTAGCGTGGTCCCAGGCATGTCGGATGTGGGAGCATTCGATCCGCGGCGC 

TGCCTTCCGTCGAGCATTCCGGTTCAGACGCTTTCGGTTGACACTGTAACCAAACTTCTG 

CCTCCGGGTTTTGCCCTTGGCCTTGGGGCAATTNGCGCTGNACACACCCGGCACNCAGNC 

AAGACTCAAACAACGCTGCNCTCTGTACTNTTCAAAACNCTNNGTGTTCCCANGCCGATA 

ACCTTGCACCGGGGCCGGNCCCGTTTCTTAGGAACCTTAGTTGGGGAATNCCCCCCCGNG 

GNCTTGGCAAGGGNAAATATCCGAAATATTCAMNGCCTTTTATTCNGNNATAACCCGNT 

NCAGAACNCTTNTGAAAGNGGGGGGGGGGNCCACTCGGGNTTAACCCCCAAGANCTTTTT 

TATTGGTTATNCCCCTTTTTAAGNTTGGGAAGGGGGGGT^ 

GNCCCTTTTGGGGGCCCGG 

Sequence 1264 

AGGTACCTATGATTCGATAATTGACAATGGTTATCCGGGTTGTCATACACTTGTGCTAGG 

AG AATTGG TTCTTG TTACTC ATATTAAC AGTATTTC ATC TATG G ATC TATAG ATTAG CCC 

AATTTGTAATATAGGAATTTATTGAGGTTTGTGGAATTAGTGTGTGTGTAAGTATGTATG 

TTGAGCTTGAACGCTTTCTTTATTGATGGCTGCTTTTAAGCCTACAATGGTTAAGTGGAT 

TGTAGTTGTTTTATTCACTATTTAAGGNTTTTTCCCG 

Sequence 1265 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTAAAAACTCTGG 
ACATTACATTAAAGCAAACATAACAAAACATTGAAGGGTGGGGAAGAAAAAGCAAGGTGG 
CTAGAAATGACATGGTGGTAAGTTCCCTGGGTTTCCTTTTTGCCCAATATATCCCAGACT 
TGAAGCTGAAGAAACCAGTAACCTAGAAACGCCAATGGGTACCTGCCCGGGCGG 
Sequence 1266 

AGGTTCTTCAGCMGCCAGCTTCTCGGATGGCGGCTGGGCCATATTCCACTCCTTTCTTT 

TTCTGTGAATCATACAAATCCCACAGACATAAGAGAAGGTGAGAAAAGGAGCAGAAACAC 

TTTACACACAMTCTCTGATGCTACACGGAGTTCCTAACAGATTCCTCTGCAAATCTTTC 

ACAGCATCAAMCTTCTTCGGTAGAAGGCTCGTCGGTTCATTAAATATTAATGAATGGAG 

GAAGGAAGATTTTGTTCTTGAAAGCAAATCAACCTTCTGGGTCCCTCACTGAGAGCCAGA 

CCAGAGGAGTCTGTCTTCCGATCGGGGAAAGGGCT 

Sequence 1267 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnNGCGCTTGGCCGTAATCATGGGNCATAGGCTGTTTTCCT 

GTGTGAAAATTGTTATCCCGCTCACAATTTCCACACAACATTA 

Sequence 1268 

ACGGGCAGGTACACATATTACAAGAGCCTGCTACTGGGAACTAACTGAATTCACAAAGAA 
ACAGTGATTCAAGTTCGTTAAAACTTTTGCTCTATCTTGAATAACAAGCTTATTACATGC . 
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GAAACCTATTGGGGACCAACTGAATTCACCGTMTACTTAGATTCCGTTCTTTAAATGTT 

GCTATATATTTAAATGCACACTCATATAAGCATGTCCCATTGGNAACTCTCTGAAATCGT 

CTAGAAATTTTGACTCCATTCGTGAAAATTTTTTNTACATCCGGGAACAGTCCACTTATT 

ACTTTCTGTGGCCTA 

Sequence 1269 

GGGGCAGGTACATGCGAATCCTATTGGGAACCTACTGAATTCACCATGATACTTAGATTC 

CGTTCCACAAAATGTTGCTCTATAATTGAAAAGCAAACTCATACAAGCATGTCCCATTGG 

GAACTCACTGAATTCGCCTAGAMTTTTGATTCCATTCGTGAAAATTTTTGTATATCCCG 

AACAGTCCACTTATTACTACTGCGGCCTACTGGGAACTAACCGAATTCACCATGTTACTN 

AGATTTCGGCTCACCCAAGTTTGGATAAATCTTTGA 

Sequence 1270 

CCGGGCAGGTACGCGGGGACCCGGCGCTCCATTAAATAGCCGNAGACGGAACTTCGCCTT 

TCTCTCGGCCTTAGCGCCA I I I I I I I GGAAACCTCTTGCGCCATGAGNAGCCAAGTGGTA 

GGAAGAAAGCCGAAATGTTGCAANGCNTGAAANCCGCCAAANAGNAANGNAAAG^ATTGA 

AGGGCAGNTAGTGTTCCAAAGNTAAAAACNCGCCTAGNCTTTGTTTTGCCACCCCGCTGG 

GNAGNGCCCANCAAGGNAGTCCANNAAAACANTGTGNAAATTGNCACAAGAACTGCTTGG 

GGGGGANTGCCTTGGNTAANGCNTTCGNGGCNCGGCCTTCTTAGTAAACCTTAAGNTNGG 

GAATCNCCCACCCGGGGGTCTTGGCTANNGGAAANTTTCGGATTAATTCCAANAGGCCTT 

TTAATTCCGNATTAACCCCGTTCCGTAANCACTTTTGTAGGGGGGGGGGGGGCCCCCCNG 

GGTTATCCCCCCAAGACC I MUM IGGGTG 

Sequence 1271 

CCGGGCAGGTACACTTTTGGTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCA 

CCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTTGAAATCTGGAACCTGCTCTCT 

GTTTGTGTGCCCTGCTTGAAATTAACTTTCCTAATCCCCCAGAAAGNAGGGCNCCAATAA 

GATATCNCTTCGGGGGCTCGNCNTNCTTAAGAAACCTTAANTTGGGGTAANTTCTCCCCC 

CGGGGGGCNTTGGCCAAGNGGANANTTATCCGNAATTAANTTNATAAGGNCCTNTTAATT 

NCTGGAATTAANCCCCGGNTGCCTNANNCTCCTNCNGTNAAGNGGGGGGGGGGGGGGACC 

CCCCCGGGGGGATNMCCNCCCCNANGCCCTTTTNTTTTGGTATTTACCCCCCTTNTTAT 

NANGCATAGGNAAGGGGGGGGGTTT 

Sequence 1272 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTTAAACTGTATTGTATTCATGTTGCTAA 

ACAATATTGGCCTTCTCGATGATTTTATTCATGTTGCTCCAAAGTTAAAACCCTGTAGAA 

CTAAGTAGGTGAAGAGATATTTTGTATAAGTGCCACAGAAGAGAAAATATAATAAATTAA 

ATAGTGAATTGAGCATCACTAGAATAAAATAAAATGAGTAGGCATTCCTAAGATGTGAAA 

TGATCACCTMGATATACATGCTCCAACCATATTGATTTTAGAACAAAACACAGCAGCCC 

ATAACAGTTTGTGGCCTCCACTAACTGTCCTCTGCTGTCCCATCTAAGAGGGTTATGTTT 

CTCCTATTTTTAAAMTAAAATGGTAGTTTAAMTTAGCCTGACGGGATGTTTCCT 

Sequence 1273 

ATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTACT 

TACCAGCCCTTGAACCATTTCTATATGATGCTCCTCCCAATATTCTGAAACATGTGGTTG 

GACAGTTCAGTAAGGTAAGAAATGCCAGCCTCTAAGGGGAGGAAGAAAATTAAAATGATT 

TGTTGTTA I M I M M I CTGTTGGGTCAGTAGCTTTTTCTGTAACACGGGGTCATGGTTT 

TACTG CTTAACTCTC ACC AATCTGTATGG AC AG AG AAGTT ACCTTAATAAAATTCAC ATG 

TAATTAAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCCT 

Sequence 1274 

AGGTACGCGGGGCTTTTCACAAGATGGCGCCGAAAGCGAAAGAAAGGAAGCTCTGCCCCC 
TTCTTAAAGCCTGAAGCCCCAAAAGCCGAAGGGCTTTTAAAGGCCAAGAAAGGNCAGTGT 
TGAAAGGTNGTCCACAAGCCACAAAAAGAAGAAGATCCCGCACGTCACCCACCTTCCnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnCGACCTTCGAGGGGGGGGCCCCGGTACCCAGCCTTTTTGTTCCCTTTAGTGAAG 

GGGTTAAATTGGCGCGCTTTGGCCGTAAATTCATGGGTCATTAAGCCTGTTTTCCCTGTG 

GTGGAAAAATTTGGTTATCCCGCTTCAACAAAATTTCCCACACCAAAACATTAACCGAAG 

CCCGGGGAGGCATTAAAAGGTGGTTAAAANCCTGGGGG 

Sequence 1275 

CCGGGCNGGTACCACTCATGAAACCTGGCCAGGCTNCCAGGCTCCTCATCTATGGNGCAT 

CCACCCANGGCCACTGGTATNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnCNCCCTGAAGATNNTTACAAATTTTAATTATTGNCAGCC 

AGTTATTAATAAACCTGGNCCCTGGTGTACCCTCGGGGCCCGCTTCTAAGAAACCTAAGT 

GGGAATCCCCCCCGGGGCCTTGGCMGGGAAAATTTCNGMTATTCAAAGCCTTATTCGA 

ATTACCCCGTTCNAACCTTCNAAGGGGGGGGGGGCCCCCGGGTNACCCCCAAGCCTTTNT 

TGGTGTCCC I I I I IMGTTGGAGGGGTTATAAATTTTGCCGGCCGGCTTTTGGGGCCGAT 

AAAATNCAATTGGGGTNCNATTAGGCCTGGTTTTTCCCCTTGGTGGTTGGAAAAATTTNG 

NTNTNATTCCCCCGGCNTNCACCAAAANTTTTC 

Sequence 1276 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGCGG 

GGGGATAGCTGTCCAAGGTCTCCCCCAGCACTGAGGAGCTCGCCTGCTGCCCTCTTGCGC 

GCGGGAAGCAGCACCAAGTTCACGGCCAACGCCTTGGCACTAGGGTCCAGAATGGCTACA 

ACAGTCCCTGATGGTTGCCCGCAATGGCCTGAAATCCAAGTACCTGCCCGGGCGGCCGAG 

GTACCAACAGAAACCTGGCCAGGCTCCCAGACTCCTCATCTATGATGCATCCGACAGGGC 

CACTGGCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnTTTTGCAGTTTATTACTGTCAGCAGCGTAGCAACTCGT 

ACCTGCCCGGGCG 

Sequence 1277 

CGAGGTACGGGGGCCAAGTTATTATACTGGCTGAAAGTAATAAAATTGCAAAATTTTAAA 

ACTGCAGGCTTGCTTGATGGTGAGAGTGAACTTTGTnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnTACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAAGCCTGGC 

CAGGTTTTCTGCTGGTACCCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnTCGATATCAAGNCTTATCGATACCGTTCGACCTTCGAGGGGGGGGGCCC 

CGGTACCCAGCTTTTGGTTCCCTTTTTAGTGNAGGGTTTAAATTTGCCGNCGCTTGGGCG 

TAATCAATGGGTCATAGCTGTTTTCCTGTGTTGAAAMTTGTTATCCGCTCACAAATTTC 

CACACAACATTAACCGAGCCCGGGGAGCATTAAAAGTGTTAAAAGCCTTGGG 

Sequence 1278 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTGTCTTGATTAT 
AGTAGCTTCATAGCGAGTCTTAAAGTCAGGTAGCAGGTAGCATMTTTCTCATTTGTTCT 
TTTTCAAGA1TGTGGTAGATATTATTTTTATTATTA I I I I I I I I GAGACGAATTCTC ACT 
CTTGTCCCCCAGGCTGGAAGTGCAGTGGTGTGATCACAGCTCACTGCCTCCGCCACCTCC 

TGGGCTCCCCGCGTACCTGCCCG 

Sequence 1279 

CCGCGGTGGCGGCCGAGGTAC 1 1 I I I I I I I AC I ! 1 I I I I 1 I i 1 I I I I I I I AAGAAACTGG 

GTCTCTCT3TCACCCAGGCTGGAGTGCAGTGGCAACGATCATAGCTAACTGCAGCTTTGA 

CCTCCTGGGCTCAAGTGATCCTCCTGCCTCAGCCTCCTGAGTAGCTAGGACTACAGGTGT 

ATGCCACCATGCCTGGCTAATACAAAAAAATTNTTTTAAGAGATGGGATCCC TCCC TTTT 

AAMTC/iGMCTTGTTCACATGGTGGTTGCTTGTGGCAAAACGGAGTTCAAATTTTGCTC 

TCCTATrGCTATAATTCTGCTAGCAATCTGTTGAGGTGAAACTTGGGATCTGACTCTTCA 

GCAAGCAGCAAATGACCTAGTAACTCAGGGACAACTATTTTTGAACTTTAAGTGCCACTT 

TAATGCAGTTAGTT 

Sequence 1280 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCTATGTGGAGGAG 
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ACTGCAAGGAAGCTGTTATTCAAAGTAGAACAGTCAGCCTTGTGCTTGAGTCCTGCTTTA 
TG CTTGC GTGTTTCATAACAAAACACAAAGGC AAGTCTTCATATC AGCACTTAGTCTTGA 
TATCCAAGTAGCTGACTACTGTACCTGCCCGGGGCGGCCGCTCTAGAAACTAGTGGATCC 

CCCCGGGCTGCAN 
Sequence 1281 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTGGANAGGAC 

CNTNTNAAAAAAAAAACNTNAAACNGAAAAGGATTNNTGTTTNAATATTGACANAANCCA 

AGGAACACCATCTGAAGTTCTGATGGCAACATCAGAAGCTAAGAGAAAGG CATG GAAAAG 

ATTCTCACCTAGAGCATCCAGAGGAGAAGGTTGGTCCTGCAGACACCTTGTTTTCTGACC 

TCTGACCTCCGCMCTGTGAGGGAAGAAATTTCTGTTGCTTAAAGACACACAGCTTGTGG 

TACCTGCCCG 
Sequence 1282 

C'CGGGCAGGTACGCGGGGAAGGAACCAGAGGCCACTTGTATATATAGGTCTCTTCAGCAT 

TTATTGGTGGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCC 

GAACAGCTGAGGAATTGCACNTGCAANCCCTTNAANTGANNAACCAANTAAGAAATTTCC 

CTTTGGGNANAAGCAANGCCCTACNGGGCAAACCTTAAAAAANTGNCCCANTNTTCCACC 

CNTGGNAAAACCTTTGTATTGGNGAAGGGGNGGGAAAAAACCTCCCCTNTNG GAATAG NA 

NTTTTTTGGAMGGAACCMNAGNTTTNTTTTm 

CCAGGAAAATTCTGTTTGAAAATTTCNAAAAAGNCCCCANCCAANTNGATTGGCAAAACN 

CTTAACNTTGGGGGCCCCTTAATTCCNTAAAACAGNCCAACCCCTTCCANNAAAGGGGGG 

CCAAAAAAAACNNAAAGGGCCAATAGCCCCCCNTTGGGNAAAATTGGCTTNTNANCCGTT 

TAAAAAAGG 
Sequence 1283 

CGAGGTAC \\\ t \\ \ \ M U H I i I I I I W > 111 T 'GGGGGNAATTGGGCATAATTTACTA 

GCCAATTTAGGAAGTTCCCCTCACATCAGTAACATACAGTACATCACCCAGTATGTCAGA 

GGACACAATGGCATAAGTTTGCCTTTGCAAGGTTTGAGGGATGGCCATTTCCCGACCTGA 

CTCAGGAAAGTCTGTAGCTGATATCCATCTTCAAGTTTGNGGNTCTTTCTCTCTATATAT 

ATATTTGAGTN C AG CAGTCATGCTGG AGTCC AG AGT AGGTG AT TCTTTCTGCTTTAGCTT 

GACTCCTCCTTAAGATTGCAACTCTCTCAGTTTTACATTTTTTTG 

Sequence 1284 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTCCTCCTAGCGTTTCTGGTTAACTATGT 

GAACACACAGGCAGACAGATCAAGCCGCTATGTGAGGGACAAATAAACATACTGAAGACC 

TAAAAATGTGAAGTCAGCTGAGCCTCAArrCAATCCAAACCTTTGGGGGGCTTATGCACC 

ATTAGTCTGTATCATGGCCTCCAGTCATTTGTCCCTCTAGAACAGGAACTCTGAAAGGCC 

TGACAMGAAGMTGGACMTCACAGATGCTGGTTTTGCCTTCTCATTCCCTGCAAGTCT 

ACTGGATGCAGGAAGAGGGTGGGGGGGTTCAAGCTGC CTGCT TAGGAGCTTAAACATGAC 

MGGGATTTTCTAATGAAAAGCTAACCCAGAGATTCCTCTTTTTCCTATGATACCACAGC 

ATCCATCATTCAAAAACCGAAAAAGGGAC 

Sequence 1285 

CGAGGTACAAGGACCACCAGCATCAGCATCACCTGAGAACTTTTTAAAAATGCAGAATCC 

CAAGTCAGCTGAATCACAGTGTGMTTGTTAACTAGGTTTCCCGCTGATTTTCTATGAAA 

ATTTCTGGTCTATACGGAGTGTATATGACAAATATATACATGTGTGGCCATGCAGACATG 

CTTATTCTCACCTATGGCAAATAGAACACAGCTCTCCATGGTCAGGTGCTTCCATCCCTA 

ATGGCTTCCCACCAGTGAGAAAGACATTAAGAGTCAGAATACTTTTGCTGTAATTCTCTG 

GTAAATTTTGGTACCTGCCCG 

Sequence 1286 

CGAGGTACAGAAGAAACTTGATTCTATAATTGATTTGACAAAAGAAGGCCTATCCAACTG 

CAATACAGAAAGTCCAGTATCCCCCCTGGAGTCACATTCGAAAGCTGCTTCAAACTCAAA 

GGAAACAACCCCATTGGCACAAAATGCAGTCCAGGTTCCTGAGTCCTTTGAGCACCTGCC 
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ACCTCTCCCAGAACCACCAGCACCACTACCTGAATTAGTAGACAAAACCCGAGACACACT 

TCCTCCCCAGAAGCCTGAGTCGAGCGGCCGCCCGGGCAGGTACTTTGGCCTCTCTGGGAT " 

AGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTCATCAGATG 

GCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAACTTGGGCCCCT 

GGCCCAAAGTNTACCTTCGGNCCGNTCTAGAACTAGTG 

Sequence 1287 

AGCTCCACCGCGGTGGCGGCCCGAGGTACTGGAGGACACTGTGCTTGGTCTCATTGATCC 

TGTTAACTTGACTGACTGCTCGATGGTCCAGCCACATGGTGACGTTTCGATGGGAATCCC 

CTTCCTGGTTGACTGGTAATGGGTGAAACTGCTTATCAAAACAACCAGAGAACACGTGGC 

ATCAMCCCMGTCCTCGMTTTGGTTTAAATCAATCCCTTGTACCTGCCCGTGTNTNTT 

AAATCTANAATGATGNAAAGTTTTGAATAAGNNGACTATCTTACTTCATGCAAAGAAGGG 

ACACATATGANATTCATCATCACATGANACAGCNAATACTAAANGTGTNNTTNGATTATN 

AGAGTTTAGATAAATNTNTGAAATGCANGACCACNNNANGGNATG7TTATGGGG 

Sequence 1288 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCAAAGAGCCAGTTGGG 

TTGGCTGTGTTCTTTCTATTCGTGGAAAA.CAGCTGTGGGTGCCGGGTGTTGAAGTCAGGC 

TTTTGTTCCATGCTMGGTAGGAAGGTTTGTTCTTGAGACCTATCA CTCT TCTGTGGAGG 

AGGAGTTCACAGAATGGTGAGCCGCTTTATCACCATCTCGGAGGCTTTTGAAATATGTTG 

ATAATGACCTGGGAAAGAGGGCCGAGTGGGATGAGGGTGGGAGTAGGAAAGAAACATTCG 

GCCCACACAGTGTTCATATCTTCTCTTGTTTTTGTATCTGTGGCAACTGCAGATTAAATG 

AATTAGGTTATTCTTTTCCATTGCCTTCTACCAAATAAGAAGTACCTGCCCG 

Sequence 1289 

CCGCGGTGGCGGCCGAGGTACT I M I I I I M II I I I I I M I I I I H I H I AA GGAACTGG 
AAACTAAAAACTGCCCTGGCCTATCATAAGGGAAAAGATGGGAGTCTTTTGCTGTCCATC 
TTGAATGTAATTCACATTGTCCATGAGTGCTGATCTAACCCCATATGCCTNTGCCCGACT 
GCCTTTATGAAATATGGTAATTTAATGCTTAAACAACTGTTTCATTTGTGTACCTGCCCG 

Sequence 1290 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGAAGAACCTGT 
GTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATA 
CGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATG 
TCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAAAAAA 

AAAAAAAAAAGGTC CCT 
Sequence 1291 

TACCATTGCCCCTGGCNNNATGCACNATACATNAGGAGCCTGGGATCCTGGCCAGGTNTC 

TGCTGTTACCTGCCCGGGCnnnnnnnnnnnnnnnrnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnAGTGAGGNGTTAATNGNGCGCTNGGnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGAGCCATAAAAG 

TTGTTAAAGCCTGGGGGTGCCTAATGAAGTGAAGCTAACCTCAACATTAATTGCCGTTNC 

CGCTTCACTGNCCGATTTNCCANTTCGGGA^AACCTGTTCNTTGCCAAGCTTGC 

Sequence 1292 

AGGTACTGTTGAATGTAACATTAGAAATTGGGCGGTTAATTAACATTCCCTAAGTCATAT 

GCTTTMTTTGATMTGATTGCMTCCMGOTCTGTGGTTCCGAAAATGAGATTTGTAAA 

TTCAGTTTGCATTCCCCCATCTTTAGGAGTACCTGCCCGGGCGGCCGCCCGGGCAGGTAC 

AGAAAGAAATATGTCATCTACATCGAGCGGGTGCAGCGCGAGAAGGCCAACGGCACAACT 

GTCCACGTGGGCATTCACCCAAGCAAGGTGGTTATCACCAGGCTAAAACTGGACAAGGAT 

CGGAAAAAAATTCTTGAACGCAAAGCCAAGTCTCGACAAGTTGGAAAAGAG 

Sequence 1293 
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CCGGGCAGGTACTCATCAGATGGMTGTTTTACCCTGCCGAGGTTTTAGTCATGATGTGC 

TGAGCTCTCTGCGTCTGACGTGACTGACTGGTAGCTGGGCGTCGGCGTGACCCTTCTTTC 

CCTCTAGGATCACATCATGGAGATATTTTCCACCTATGGGAAAATTAAAATGATTGACAT 

GCCCGTGGAAAGGATGCATCCCCATCTGTCCAAAGGCTATGCGTACCT 

Sequence 1294 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAMCTTTTACTTTGGTTTTGTTATCTGACTAT 

TGGGTCTTTGACACTTAATGGGCACTTC \ (TTTTTTTI I ATTTAATTTAGAAGTTATGTA 

ACTTCTAAAGCTATGGTTCTCAAGTGTGATATCAGGATTCCTGGAATTTCCTGAGAGCCT 

TTCATGGTGTCTATGAGGTCCTTGTTTTTCCMCTATGTATCTGTGTCAGGTCAGGTTTT 

CTTCATGTACCT 

Sequence 1295 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGGGATTTGTCTTTT 

ATGAGTTACCAGTTCAGAGACTAGTTCATCTTTGGTTGTATAGGATGTTAGGATCTCAGT 

TGGCATTCACAGTTAAATCATGCACCTTCAGGAACTGGGAGCATTTTGATCCAGAGTCAC 

AATCATTCCTTTCTTATCTTCATCCCTATGGTATGTGTGTTCTGAAGTTTAACTGACAGA 

TGGCAGCTGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTCGACCG 

CGATGATGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAATTGGCCGAGGAGAAGCG 

CGAGGGCTACGAGCGTNTCCTGAAGATGCAAAACCAG 

Sequence 1296 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGAAGTTTGCGTTCATCGTCCACTT 

CAATGAGTTTCTGGCAGCCAGTGGCTGGGAAGGAGATGTTCAGCTTCATCTTGAAGCAGC 

TGAACGCCTCCGAGGCGCCACGGAAAAGAGGCCCCGCGTACGCGGGATTGGGATTGGGGG 

GAAAGAGGGAGCAACGGCCCATAGCCTTGGGGTTGGACATCTCTAGTGTAGCTGCCACAT 

TGAI I I I I CTATAATCACTTGGGGTTTGTACCTGCCCG 

Sequence 1297 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAATCTGCAAGCCAGCTTCACCCCT 
CCCATCGAGTTCTGGTCAGGAGGCTTCTGGTTGAGTTCAAATTGCTATAAAGTTCAGCTG 
GAGCCTTCCTTCTCCCTGTGGCCTTTTCCCAGTAC I ITTTTTTTCTTTTTNTTTTTITTT 
CCAATACTTGTCTTAGCTTGTTmCATm 

TTTTCTATCTTGNTTANTATCTTCCAATTTTCTTCTAAGTTTGTACCT 
Sequence 1298 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACI I N I I I I I I I I I I I I I HI 

INI IGGMTGGGTTATGAGGMTTGTGGTCAAAAMCGAGAACTTCTTTTTGGTGTTAA 

CAGGGCTATGTCTGCTTTGCTCTAGTGATTCCTAACCTACTGTGCGAAGGAGGCATAGCA 

AGGCAAGACTGTTTAAGTATAACCATAAAGGGACTCACGCACTCTCAGGTGAAGTGCAAA 

AATACTGATCACGGTGCTCAAACATTCTGGAGGAGTAAGCATTCAAATTCAGAAATGTAA 

GACGATTTCTCAAAGAAAAGAGGAAAAACCTATCATTAATTTTTTAAAATATACCTTCCA 

GAG GAG GTTTCTC AAC AATTTTACATAGMG ACG AGTATTTTTGTCTCCTTAAG ATTTGG 

ACTTAAGGTCAAAGTTGTTGAAGGTTGATCGGCTATTGTAAGCTTCAGGTGAAACCTANA 

GAGGAGAAGGGAAGAAGATAAATCTACTATCCTAGATTCTTTTT 

Sequence 1299 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTGGG 

ATAG AAGTTATTC AGCAGGC ACACAACAGAGGC AGTTC C AG ATTTCAACTG CTCATC AG A 

TGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTTGGTCCC 

CTGACCAAAAGTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnr.nnnnnnnnnnnnnnnnnnn 

nnnnTGTTCCCTTTTAGTGAGGGTTMTTTGCGCGCTTTGGCCGTAATTCATGGTCATAG 

CCTGTTTCCTGTGTGAAAATTGTTATCCGCTCACAAATTTNCCACACAACATACCGAGCC 

GGGGAAGCCATTAAAAAGGTGTAAAAGCCCTTGGGGGTGGC 
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Sequence 1300 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATCCATGAATACATTTCC 

AACAAGAGCACTGGCCAAGTCAGCTTCTTCTGAGAGAGTCTCTAGAAGACATGATGCTAC 

ACTCAGCTTTGGGTCTCTGCCTCTTACTCGTCACAGTTTCTTCCAACCTTGCCATTGCAA 

TAAAAAAGGGAAAAAGAGGCCTNCTCAGACACTCTCAAGAGGATGGGGAGATGACATCAC 

TTGGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnGCCGCCGCTTGGCCGTAATCATGGTCATAAGCTGTTTCCT 

GTGTGGAAATTGTTTATCCGCTCACAATTTCCAC 

Sequence 1301 

CCGGGCAGGTACAGATCTCTGCAGTTACAGGAACATCGTCTGCTTCATGGTAAATGCAAT 
TTCCTGTTCAGAAACCACACTMCATTACTTCTTTCAGTATTTCTGCCTCAAAGTTGGTT 
TAAGATTATCATCATTATAAAGAAAAATTTTGTNCTANTTGCTGGNCCAAAGGGAAAA 
Sequence 1302 

AACTCCTATAGGGCGTTTTGGAGCNCCACCCGCGGTGGCGGCCGCNCTAGAANTAGCGNA 

NNCGGCNTTCNGNAGGANTANTTTATCA,^GCTNATCNATACCGTNGACNCCGANGGGGNG 

CANNGGAACCCANACTTTGAGTAAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNANTAAN 

CATGGNCATAGCTGGGACCNGTAGTGAAAATTGTTATTCCGCTGCACAATTCCACACAAC 

CATACCGAGC 

Sequence 1303 

AGGTACACTTTTGGCCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCT 

GTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGC 

CTGCTGAATAACTTCTATCCCAGAGAGGCCAAAAGTAACCTTCCCNGGGCGGGCCCGCTT 

CTTAAGAAACNTAGGTTGGGATCCCCCCCNGGGNCTTGCCATGGGAAATTTCCGAATTAT 

TCAAAAGGCCTTTATTCCGAATTAACCCCGGTTCTGAACCCCTACNGA AGNGG GGGGGNG 

GGCCTCCCGNGTTTACCCCTCAANGCCN I I I I I I I TNGTTTTNCCCCC l nil lAAGGTC 

GGNAAGGGGGGTTTTAAAATTTTNGGCCGGCCCGGCCTTTTGGGGTCNGGTTANMTTCC 

AATTGGGGNTACCAATTAAAGGCCTTGGGTTTATTCNCCTTGGGTGGNTNGGAAAAAAAT 

TTGGGGTTTTAATTCCNCCGGNCTTTCNAACCAAANTTTTCCCCAACCCAACCAAAACCC 

AATTANCCCGGANGGCCTCCGGGGGGGTAAGGCCAATTAAAANAAGGTTGGTTAAAAAAG 

NCCCCTTNGGGGGGGGGNTTGNCCCCCTTAAAANTGGAAAGGTTGGGAAGCCCTTTAANA 

ACCTTCCAACCAATTTTAAA 

Sequence 1304 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGCCAAGGTCCT 
CTGTGACTCGCCGCCCACTACCCAAGTGAATGAGTCTCCCCTAGAGCTTTGCTACTCAGA 
GGGGTCTGAGGACAACAGCATGGGCCAACACGTGCACTCGAGCTGCCTGGAGATCTTGTT 
CAAAGGCAGATTCTGAATGAGTAGGTCTGGGTTGGAGCCTGAGAGTCTGTACCT 
Sequence 1305 

ATTGGTCTCCCCGCGGTGGCGGCCGAGGTACCATGCAGAAACCTGGCCAGGCTCCCAGGC 

TCCTCATTTTTGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCTGCAAGTT 

TACTACTGTCAGCAGTATAATAACTGGCCCCCGTAC CTGCCC Gn nnn nn nnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnGTCATAGCTGTTTCCTGTGTTGAAATTGTTATTCCGCTCACAATTCCANCA 

CAAACATACGAGCCCGGNTAGCATAAAGTGGTAAAAGCCT 

Sequence 1306 

CCGGGCAGGTAC I i I I I ill I I II I I I I II I M M H i 'NGCTATAAAATTGGTATTTGTA 
TCTTCCTCCCATTCTGCGCTCCCCCAACCCTGCAGGAAGGTGGATTCTTCAAGGGAACTG 
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GAGGAGACTCTTCTGTTACCACCAAGCCAGGAAGTTTCATGGCTTGGGAAGCCCCGCGTA 
CCTN 

Sequence 1307 

AGGTACGCGGGMCACATTTCTTTGGGATTTTGCCCTTCCTGGGGTATAGGGGATCAGAA 

ATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCTAAACAGACTAAAATCTTTCTC 

TCTAGTCT7TCTCACTTGTACCTGCCCGGGCGGCCGCTCGACCGTCTTTCCCTTTCGCCC 

TCCACCTCCTTCCTGAGACCAGCTGTGTTTCAGCTCCCCCTTCCCTGGGCCCACACTGAA 

TCTCTGTTACCCTTACAAACCCCTGCTGCCTCTAAGAGTAAGGACCCCTATCCCATCCCA 

AACTCAAACCTMGCACATGCAGCCTCATCCCATTTCTCCAAAAC ATGTG AAAGCAAAAG 

AAMTAAAAAACTMCTCAGCMTCTTGTTCTTTCCTACAGTGGNCTTTTGGCTAC 

Sequence 1308 

GATCATATAGGGCGAAATTGGANCTCCACCGCGGTGGCGGGCCGCCCGGGCAGGTACGCA 

GGGGGCCTCTGTAAGGCACATGGAGGTGATGATGTTTCTTAGAGAGAAGATCACTTGAAG 

AAACTTCTGCTTTAATGACTTTACGMGCTGGCAATATTACMTCCTTGACCTCAGTTGA 

AMGCAGTCATCTTCAGCCGTTTTTCCAGCACCTATAAGCCACCCCCCAAGTGTGGTNTA 

TTGCCCTCCCTCGGATTGCCTCCNGTTAGCTTCGGGCNCGCTTCTAAGAAAACCTAGGTT 

GGGAATCCCCCCCCGGGGCCTTGCCAGGGAAATTTCGGMTNATCAAANGGCCTTAATTT 

NGAATACCCGGTTCGAACNCCTTCNGGAGNGGGGGGGGGGGCCCCCG GGTTA ACCCCCCA 

ACCCTTTTmGGTTTTCCCNCNTNTTTAAGTTGGMGGGGGGTT^ 

NNCGGCCTTTTGGGGCCGTTTAAAMTNCCMTTGGGGTTTCCMTTMGNNCNTTGNTT 

AT^NCCCCTTGGTTGGATTGGAAAAMTTTTGGTNTAATTTCNCCGGCCTTCCAACC 

Sequence 1309 

GCGGNGGCGGCCGCCCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCA 

TCTATGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTGCAGTTCTGGAAGAT[TCTTGCAG Mil 

ACTTACTGGTCANGCCAGTATAAATAAAACTGGGCCCCCCGGTACCTTCGGCCCGCTTCT 

AANAAACTTAGGTTGGGAATTCNCCCCCNGTGGNCTTGCCAAGGGAAATTTTCGGAATAA 

TTCAAAAGCCTTTAATTCGNAATTACCNCGGTANCCGAACCCNTTNCGNAA GGGG GGGGG 

GGGGGCCCCCCGGGGTTMCCCCCCAANCCTTTTTTTNNGTTTTCCCCCN I l i I lAAAGN 

TTGGAAGGGGGGTTTAAMTTTTTGNCNGGCCCGCCCTTTGGGGNCCGGTTAAAAATTCC 

AANTTGGGGTTTCCAANTTMNGCCTTGGTTTTTNCCCCNTTGGTTGGTNGGNAAAAAAT 

NTTGGTTTNAATTCCCCGGCCTTTCCAACCAAAATTTTTCNCACCCAACCCA 

Sequence 1310 

TGCCAGTGGCCCTGTTGGATGCATCATAGATGAGGAGCCTGGGAGCCTGGCCAGGGTTTC 
TTGTTGGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnTAGTGAGGGTTAATTGGCGCCGCTTGGCCGTAAATCATTGGTCATTMGCTGTTTTCC 
TGTGTGGAAAATTGGTTTATCCCGCTTCACAATTTCCACACAACAATACCGAAGCCGGGG 
GAGCAT 
Sequence 1311 

CCGGGCAGGTACAAACTTATGAAGMGGTCTCTTTTATGCTCAAAAAAGGAGACAAGGTC 

TCAGGAAAGGAGTGGGTAAAACACTTATCTCCAGAAAGCCAGCAAGAGGGAAGAAGGAAG 

AAAACCCCATCCTGCATGGTATGATGCTGAGATTCCTCACATATCTTCAAAGCATCCTAA 

GAAAAATTGAAGGACCCATGTTTTCAGATCAAGATGCTCCAGCTAGTAACCTGAACAATG 

GAGAAAAAGGAACTCACAGCAGAACATCTACCTGGAAAATGCAACCACTTTGGGAAAAGG 

GGAAATACTAAAAGTTTCAGAGAACAGAAGTGTGTCACATACAAAAGACCAACAATGAGA 

ATGGCTTTTAACTTTTCAATATCAGCTCTAGATGTTTAAAGACAATAA 

Sequence 1312 

AGGTACGCGGGGACACTTCCGGGCGCGGCGCCGGCTTGCTGCCACTGCAGAGCCCCGCCA 
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TGGAAGACACGCCGTTGGTGATATCGAAGCAGAAGACGGAGGTGGTGTGCGGGGTCCCCA 

CCCAGGTGGTGTGTACCTGCCCGhnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGGCTT 

GCAGGAATTCGATATCAAAGCTTATCGATACnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGGTTCATA 

AGCTGTTTTCCCTGTGTGAAAATTGTTATCCCGCTCACAATTTCCACACAAACATTACCG 

AG CCCGGGG AG CCATTAAAAGTTGTAAAAGCCCTGGGGGTTGCCCTAATGGAGTTG AG CT 

AACTCACATTTAATT 

Sequence 1313 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGGTTTC 
TGCTGGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTGGCGTTAATCATTGGTCATAGCTGGl i I 
CCTGTGTGAAAATTTGTTTATTCCGCTCACAATTTTCCCACACCAACATACCGAAGC 
Sequence 1314 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTATTATACT 

GCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGCAC 

CATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCGGGCGGCnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGAAGGGGTTAATTG 

CCGCGCTTGGCGTAATCATGGGTCATAGNCTTGTTTCCTGTGTTGAAAATTGTTTATCCG 

CNTCACAAATTCCACACAAACATACCGAAG 

Sequence 1315 

TACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGATGTGGAGAG 

GACCATGTGAAGAGAGAAGCTGAGACTGAAAAGGATTTATGTATTAATATTGACAGAAGC 

CAAGGAACACCATCTGAAGTTCTGACGGCAACATCAGAAGCTAAGAGAAAGGCATGGAAA 

AGATTCTCACCTAGAGCATNCAGAGGAGAGGTTGGTCCTGCAGACACCTTGTTTTCTGAC 

CTCTGACCTCCGCAACTGTGAGGGAAGAAATTTCTGTTGCTTAAAGACACACAGCTTGTG 

GTACCTGCCCGGGCCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTTGCAGGGAATTT 

CNATATCAAAGCCTTATCCGATACCCGTCGACCTCGAAGGGGGGGGGGCCCCCGGGTACC 

CAAGCTTTTGTTCCCTTTT 

Sequence 1316 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGATGTGGAGAGGACCATGTGAAGAGA 

GAAGCTGAGACTGAAAAGGATTTATGTATTAATATTGACAGAAGCCAAGGAACACCATCT 

GAAGTTCTGATGGCAACATCAGAAGCTAAGAGAAAGGCATGGAAAAGATTCTCACCTAGA 

GCATCCAGAGGAGAGGTTGGTCCTGCAGACACCTTGTTTTCTGACCTCTGACCTCCGCAA 

CTGTG AGGGAAGAAATTTCTGTTGCTTAAAG ACACAC AGCTTGTGGTACCTnn n nnn nnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTN 

CGACCTCGAGGGGGGGGGCCCCGGTTACCCAGCNTTTTGTTCCCTTTAGTGAGGGGTTAA 

TTGCG 
Sequence 1317 

TACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnCGCGCTTGGCGTAATTCATNGGTCATAGCCTGTTTCCTG 

TGTNGAAATTGTTATTCCGCTCACAATTCCAC 

Sequence 1318 

CCGCGGTGGCGGCCGAGGTACGCGGGATTGATGATGGGAGGCAGTGAGTCTTGATGATAA 
GGGTGAGAAACTGAAATCCCAAACACTGTTTTGTTGCTTGTTTTATTATGACCTCAGATT 
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aaattgggaaatattggcccttttgaatmttgtcccaaatattacattcaaataaaagt 

gcaatgggagaaaaaaaaaaaaaaaaaaaaggtacctgcccgggcg gccg ctcgatgctt 

ggtgctgmgtataattaatatttgagtgtttgctatgtgccataaacttttctaaatgc 

cctttacatacattagttatcaaaacaaccctntcctgggtaggggcatttatccccatt 

cttcaagatgagggamctgaggccaccaaagamtggtttccaaggtttggtaccctcg 

GCCG 

Sequence 1319 

CCGGGCAGGTACCATGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATC 
CACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCAGTATAA 
TAACTGGCCCCCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnAGTGAGGGTTAATTGCGCCGCTTGGCGTAATCATGGGTCATAGCCTGTTTTCC 
TGTGTGGAAAATTGTTATCCCGCTCACAATTTCCACACAACATACCGAGCCGGGGAGCA 

Sequence 1320 

aggtac i i n 1 1 1 n 1 1 1 1 1 1 1 1 1 n 1 1 n h n \ i gtttcttggtttctgagtccagtc 

CCCTTTGCATAAACGCCCGGCGTGGCTCTCTCAGCGTCCCCGCACCTTCATGCNCAGCAN 

ATTCTTGGACAACTTTGTCTTTCTCT 

Sequence 1321 

AGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGG 

GCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCAGTATAATAACTGG 

GCCTGTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnNGGTTAATTGCCGCGCTTGGGCCGTAATCATTGGGTCATAGCNTGTT 

TTCCTGGTGGTGAAAATTTGTTATCCGCTCACAAATTCCACACNAACATTACCGAAG 

Sequence 1322 

AGGTACAGATAATGAGGGTTGTAAAATGAAAGAGAGAGGAGAGGTGAGACGAAATGAAGA 

CAGCTAATTTTTCATTCGACAAAATCAGGGTTCAGAGACATTTGTAGCTACACTGCCCAA 

TGCAGCTACTATTAGCCACATGTGGCTACTTAMTrrAAATTAATTAAAATTAAGTACCT 

GCCCG 
Sequence 1323 

CCGCGGTGGCGGCCGAGGTACCAAGTCAGGTTGTTCATTTGAGCCAAC AACAG ATTTCTT 
GGTTATTGTGCTATTGCCACAGTAGGTCCTATTTGCATTGAATTTGTGTGTTTTTAGCTT 
CTCTCCTTTTACGTGTTGCTTTATGCTACTCTCCAAMGTCTTTCTTGTAGAAGTTTATT 
AAGTTGTATAATTTTTCTTTGCTCTGTTTTGTTTTGTTTGACA 

TATAAGCCATTTCCATCATCAGTTTCCTGTCTCTGCTGGGGGTTATCTTAGTGCCTCTCA 

TGMTCGGGTGTTTTTCAAATTTCTCCTGAGTTTCCTTGTGGCACTGGCCCGTTTGGGAC 

TTTGGAGTGGTGGATGCTTTTTTACACCCTTTCTTCCACATTTCTCATGCCAAGGTCACC 

CACCAT 
Sequence 1324 

ATAAAGATGATGTCTTTTTGTTATTGNCTGTCTGTTTNGTATGTGTCTGAGATAAGGGAT 

AGAGAGGAAACATCCGTCAGGCTAATTTAACTACATTTTATTTTAAAAATAGAGAAACAT 

AACCTCTAGATGGGACAGCAGAGGACAGTTAGTANAGGCCACAAACTGTTATGGGCTGCT. 

GTGTTTTGTTCTAAAATCAATATGGTTGGAGCATGTATATCTTANGTGATCATTTCACAT 

CTTAGGAATGCCTACTCATmATTTTATTCTAGTGATNGCTCMTTCACTATTTMm 

ATTATATTTTCTCTTCTGGGGCACCTTATACAAAATATCTCTTCACCTACTTAGTTCTAC 

AGGGTTTT 
Sequence 1325 
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CCGGGCAGGTACCTTCTTGGCAGAGCCAMGTCACCCGTTCTGCTTTTGAGGGAAGGAAT 
TATGAGAAATTCCAACTTCTGCCTTCAAGCAGGAACTTTCCCAGCCATTTCCTTGAAAGG 
TTGTTTACAAGGTCACAGGCAGCCACAAACATGATAAAAATTCATTCCTGTACCT 
Sequence 1326 

CCGGGCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGAATGGAAAGTC 

AAATTTCCTGAATTGCTATGTGTCTGGGT1TCATCCATCCGACATTGAAGTTGACTTACT 

GAAGAATGGAGAGAGAATTGAAAAAGTGGAGCATTCAGACTTTGTTCTTTCGGCAAAGGA 

CNTGGGTCTTTCTATCCTCTAGTACCCTTNGGACCGCCTCTTAGAACCTANGTTGNGNAT 

TCCCCCCCGGNGGCTTGGCMNGGAMTTTCTTATTAATTCAAAGNCTTTTATTCNGCAT 

TAACCCGNTCCNGAACCCCTTCTGNANGNGGGGGGGNGGGGGGCCCCCCCGGGGTTANNC 

NCNCCAAGGNCC I I I I I i I I I GG ! I M I I CCCCCCTTNTNTTAANGGTTGGNAAGGGGGG 

GNTTTTAANAAATTTTNGGCCCGNCCCGGCCCTTTTGGNGNCCNGGTTAAAAANTTCCAA 

TTTGGGGTTCCCNATTTNAAGGNCATTGGGTTNTTTTNCCCNTNGGGTGGNTTCGGNAAA 

AAANTTTTGGTTNTAATTCCCCGGTCTTTCCAACCAAAA 

Sequence 1327 

CGCGGTGGCGGCCGAGGTACTGNAAATCGTCAAGGGTGAAAGGTCGATCGCNTGCCAGCT 

TGCTTCCACTNGATTCTCGGA I fTTTl I ACCGGTANTGATGTTGTAACATTCGTATCCAA 

GGCTCCGCAAAGTTNGNCACNTCGGTATGTGANAAGGCTNTCCTCATTCCCACNTATATT 

GGNGTTACCCTTGGCCCNCGGGTCCNAGGCCCNCCTTAATTAANAAACNTTANNTTTGNT 

ATTCNCCANCCGGGNCNTGNCAANNGTAMTTTTTCGTAATTAATACNANANGNCTTTTA 

ATTCCGNNATTANCCCGGTTCTNTAACCCCCTANGATAGGGGTGGNGNGTGNGNCCACAC 

CGGNGNTTANCACCCNAANNATCC I I I I I I NTGNATATTTNACCCNTNTTTNNATGNTTG 

GNAAGGGGNGGTAATTAAAAANTATNGACNCNTCCNTCCNTATTGGGAACNGAATAAAAA 

NTTCAAATTGNGGGATTCCAANTTNAGTCCATTTGNCTATTNTTCCCCCTTGGTTGGANT 

TGNTAAAAAAAA 

Sequence 1328 

GGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACTTACA 

GAGAGTTCCAAGAAACTATGATCCTGCTTTACATCCTTTTGAGGTCCCACGAGAATATAT 

AAGAGCTTTAAATGCTTNCCAAAACTGGAACGAGTATNTTGCAAMCCATTCCTTGCTTC 

GNTGGATGGTCACCGTGATGGAGTCAATTGCTNGGCAAAGCATCCAGAGAAGCTGGCTAC 

TGTCCTTTNTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAATTG 

TATCCGTACCT 

Sequence 1329 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAGCAGAAGCAGAAAAGGGTTGTTGT 

TACCAACTTGAATGGTCACACCGCCCGAGTCAATTGCATACAGTGGATTTGTAAACAGGA 

TGGCTCCCCTTCTACTGAATTAGTTTCTGGAGGATCTGATAATCAAGTGATTCATTGGGA 

AATAGAGGATAATCAGCTTTTAAAAGCAGTGCATCTTCAAGGCCATGAAGGACCTGTTTA 

TGCGGTGCATGCTGTTTACCAGAGGAGGACATCAGATCCTGCATTATGTACCT 

Sequence 1330 

AGGTACAGATACATGGACACAATCATGGCAGCCAGCTCGAGGCCCCCAATTCCAGCTGCC 

ACACCACCCACGGTGACTNGCATTAGTTCGGATGTCATACAAAAGCNTGATNTGAAGCAA 

CACCTCTACG 1 1 I I I GGTCGTAGAGCTCTTTTTGCTTTGNGNTGCAAGGATTTTCATTTT 

GGGGCTTGTTGNTTTGGGATTGNACNGNTTTGTTCATTTTGNNAANACCATNAAATTGGG 

TGTGTGTAAANMNGGNCCAACCTTGATTTTCATTCTTNTTTGGNATAGTATATGGGNGN 

TTGTAAGGTTTCNCCTTTCAAAAAAMNTTCCCCGNTTTANTTAGNATTTGGNGT^ 

AMNGACTCCMCCATGTNCAAACCTTTTGGAAGGCCCCCCTTTT 

Sequence 1331 

AGGTACGTGGGCCTGTCTGCAAACCAGTGTGCCGTGCCAGCCAAGGACAGGGTGGACTGC 
GGCTACCCCCATGTCACCCCCAAGGAGTGCAACAACCGGGGCTGCTGCTTTGACTCCAGG 
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ATCCCTGGAGTGCCTTGGTGTTTCAAGCCCCTGCAANGAAAGCANGAATGCACCTTTCTT 

GANGGCCACNCTCCAGCTGACCCCCGGCACTGNTTNTAGTAACGTAGCTGGGTAATCTCC 

CCGGGTGCCTGGCNMGGGAAATTTTCCGNATANTTCAAAAGGCNTTTATNCCNNNTTAA 

CNCGGNTNNGTACNCCTTCNGTAAGNGNGGGGNGGGGCCNCTCNGGGTTTACCCCCAAAT 

CTTTTNTATGGNTTTTCCNCNTTTTNAAGCTTGGAAGGGGGGTTATAAAATTCTGGACCG 

TCCGGCCTTNTNGGGCCNGGTTAACATTTCTAATTGNGTNTCCAATTAAGGGTCCNTGGN 

TTTTATTCTCCTTGGATTGGTTGGA 

Sequence 1332 

TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGTGAGGCATTTCCTGATTTT 

TACTCATAATATTTACAAAGATTAAAAGAGATAAAATTCCAAATGTGMGGCTTAAATT^ 

TTCAAATCCTTACTCAAAGAAATAGTGGGAAAGGTGGGTAAGTCAGAGTCTGCTGCCGAT 

GMGGAAAAAAATCAAGTTAATCTCTTCTGAAATTATCATGATTTGTTTGGTTTATNATT 

CCCAAAAAAAAAAAAAAAAAAAAAAAAGTCCCT 

Sequence 1333 

AGGTACTAAAGCATTCATGGAGGCTCTTCAGGCTGGTGCAGACATCTCCATGATTGGGCA 

GTTTGGTGTTGGCTTTTATTCTGCCTACTTGGTGGCAGANAAAGTGGTTGTGATCACAAA 

GCACAACGATGATGMCAGTATGCTTGGGAGTNATTCTGNTGGAGGTTTCCTTTCNACTG 

TGGCGTGCTGACCCATTGGGTGGAGNCCCATTTGGNACAGNGGGCTACCCTTGCTCCGAG 

GNCNGGCANGCTTGCTACTAGNMNCTTAGATGNGATCCCCCCGNGGNCTGGCNAGGGAA 

CTTNCGTATTATCNAAGGCCTTATTCCGGATANCCCGCTCCGTACNCCTANGGATAGGGG 

GTGGGCNCCCCGGTTACCACCAANCATTTTNTGGTTTCNCCCTTTAAAGGATGNANGGGG 

GTTTTAAAATATNGCTGCCGGCCTTTGGGCCGGTTAAAATTCCAMTTGGGGTTCAANTA 

AGTCCTTGGATNTTNCCCCTTGGTCGNTTNGAAAAA 

Sequence 1334 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGGTCCTGGTGGGATGGTGAACCGTGACGA 

GTACGCCATCAGGGTAAAGAACTCGTCCAGACTGTCAACTGTGATCCTGGACTCGCTGTA 

GGTGAGTAGCGAGTTCTTAGATCCTAAGAGACTGATGCATACATGGGGAAAAACAAATAT 

AAAACCTGGCAGTTGTACCT 

Sequence 1335 

CGGGCCCGCCCGGGCAGGTACATTTCCTTGTAGACTCTGNTAATTTCCTGCAGCTCCTGG 

TTGGTTCTGGAGCAGATGATCTCAATGGAGAGAGTCCTCGTCGGTTCCCAGCCCCTNTCA 

TTGCGAAAGCTTTTTANCNTCANAAAGCCGTTCATTACCTGAGGCCAGGGTGGTCTTTCA 

AATAAGGGCCCCAAACAATTCAACCGGTNCTTCCAAGGGGTGGGGCCAAGNATAAANGGG 

CNTGGANCTTNTCAAGCTGGCTTGNATTGNCAAAGNTNTCCCI I I I I I I I NGGGTGCCCT 

TACTACTNGGTAAGGGGCGNAAAAGGGCNAAATTAATNCCCTNGTTNCTTCTTGNTCGNC 

AANANTGNCNTNGCCGNGGTATTGGNNTCCANNAAAATTGGTNTGGAACCAAAATGGGGT 

NGGAACCCTTTCAATTCTCCAACCAACNCTTTTTTGGGNTACNTTAGGCAATNGNGGCCT 

TGNNTTTCCCAAAATTGGTTTACCACANAAGGCCAMTTCNCCCGNCCTTTCAAGANCAA 

TTTCNAAAACCGGTTTNAAGGTNAATTAAGGGGNNCTTTTTTGGGANCANGGGAACCCCC 

CNATTAATTTGNCCAACCTTTTGGGGGGGG 

Sequence 1336 

GTGGGCGGGCCNGCCCGGGCAGGTACAGCTGAAGCCTCGTAACCCAGACTACAGAAGCAA 

TAAACCCATTATTCTCAGAGCTCTCTGTGACTACAAACAAGATCAGAAAATCGTGCATAA 

GGGGGATGAGTGTATCCTGAAGGGACAACAAACGAGCCGCAGCAAGTGGTACCCTCGGCC 

CGCTTCTAAGAAACTAGGTGGGATCCCCCCGGGGCCTGCNAGGNAATTTCCGAATATTCA 

NAGCCTTATACGAATACCCGTTCGNACCTTCTGAAGGGGGGGGGGCGCCGGNTACCCCAA 

GAN I I I I [ l TGTTTNCCCCTTTTNAGATGAAGGGGGTNTAAAAATNTGNCGCCGCCNTTT 

GGGCCGTTAMTTCMTGGGGTTCCATTAANGCCTGGTATTTTNNCTTGGTTGNTTGGAA 

AAAATTTGGGTNTAATTCCCCGGCCTTCCAACCAAAATTTTNTCCACCAANCAAAACCAA 
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TTANCNGNAAGGCCCCGNGGGGMGGCCMTTAAAAAAAGGTTGGTTAAAAAAGGCCCCT 
TGGGGGGGGGTTGGCCCCCTTAAAATTGGAAGGGTTGGAAGGNCCTTAAAACCCTTCCAN 
CAATTTTA 
Sequence 1337 

CATATAGGGCGAATTGGAGCTCCAACCNGNGGGGGCGGGCCGCCCGGGCAGGTACCAACA 

GAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGATGCATCCAACAGGGCCACTGGCAn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

AGTAGCCTGAAGATTTTGCAGTTTTATTACTTGTTCAGCAGCGGTAAGCAACTTCGTACC 

CTCGGCNCGCTTCTAGNAACATAAGTTGGGAATCCCCCCCGGGGCCTGCANGGAAATTCC 

GCATANTCAAAGCCTTAATCCGGAATAACCCGGTCCGGACCCTTCNNTAGGGGGGGGGGG 

GCNCCCGGGTTACCCCNCMGACmNmGTTmCCNCTTTTTMNNTTNGAAG 

7XTTAAATNTTGGNCGGCCNCCCTTNGGGCCGGGTAAAATTTCMTTGGGGTNCMTTAA 

GCCTTGGNTTTTNCCCCTGGTNGGNGGNAAAAAANTTGGTTTAATTNCCCGGCCTTCAAC 

CAAAATTTTCTCAACCCANCCAMNCCATTTAACTNNAGGCCCCGNGGGNAAGGCCATTA 

AAAAAG NTTGGTTAAAAAAAGCCN 

Sequence 1338 

AGGTACAGCTGGAGCAGAGTTGCAC7TCACTGTGTAGAATGGCTTTATCTGAGCCACTAC 

ATTCTGCCATTGACTGCGTTNCTTCNCAATCTTTCCCAAGATCTCCCACAAAAAAATGCA 

TTTTTCCCTGTCAGGGTNATGTTCATNAAACATACNCTTNTCAATAACAN TTTC CCAAAG 

GGTTTTGNTTTCCTTCAATNCCAACAAGGGNTCANACCGGNAGTTAAGN7TTTNGCCATT 

CCATNCAAATAAAAANTCCCCTTTTCATTCTTCCACCCACCTTATNTTCTCAACANANAN 

ACCCGGAACAGTTTANTTANAAACCCNAGGGNAAAANGAACCAAAGGAGGAACGNGGCCC 

ACCCAAAGCCTAATGGTC7TGGGGGTCCCTTTANGNAAATTANTGCCAAACAAANAAATG 

GTACCANAGTTTGGGGGNAGGAATGNGGAACCCCCCCCTTATCNATTCNAGCCTNGCGTT 

TNCCCCAAAGGGGTGGGCTTCAAACC 

Sequence 1339 

TCCTATAGGGCGAATTGGGAGCTCCACCGNGGTGGCGGGCCGAGGTACTATGGGAACAGG 

XTCAAAATGAATATAAGGGAGTTATTTATATATTAGGTAAAGCACAAAAATGATAATTAC 

TGGACTGCACACTCACAGGTTAAAGGTCATTTGGGNGCAGNAAAAATGTCCGGACCCGTT 

TGGGGAAACAAGAACCATTAAACCTGCTTGTAAAAAGGAAGCCCAAGGGGGGAGGGGGTA 

AACCCAAAAAAGCCGTTATTTCAAGAAATANGCTTGGTGGGAAAAGCTTGGGGGGGGAGG 

ACCACCCTTGNAAAAAAAATCCTAAGCCANATTAAGGCCGTTGGAAAAAAATTCAACCTT 

TAAAAGNTTAAAAAAAAAATTAAAAAACCTTTTTCAAGGAGGGGGCCTGGAA^ 

TTTTTTGAAAATTTTAACCCANACGNAA 

AAAGGAAGGGGAATTTMGGTAAAAAAAATTTTCNTTTCCCATTGGCCNNAAAAAATNAC 
CAAACCAAGGTTAAATTGGGGCNTCCCCAATTCCAACCGGAACCATTTTGGTTANAAAAC 
CCCCCAAAANGCCCCCGAAGGAGGTTNCACCNTTATTGGTTAAAGGTTGGAGGAAGGTTT 
TGGAAGGTTGCCANAATTTTTGGTTAAATTAAAGGGAAAATTAAAAAAAAACCAA 
' Sequence 1340 

CCGCGGTGGCAGCGGCCGCCCGGGCAGGTACCATCACCCCTTCATGCTGGCCCTAAGCTT 
TCTCCAGCAGTCCCACTTCCTGATATTCGTTCTCTTCAGCAGCCTAAAATACAGCTTTCT 
TCTGTCCCCAAAGTAAGCCGCTGTGCTCATTGCCCTAATGAACCCTCCACTTCGCCAATG 
CGTTTTGGTGGTGGTGGTGGCGGTAGCGGAGGNACCTCGGCCGCTCTAGAACTAAANGGA 

TCCCCCGGGCTGCAGAGAATTCGATATC 

Sequence 1341 

CGCGGTGGCGGCCGAGGTACCAACATGTCCCGTGGTTCCAGCGCCGGTTTTGACNCGCCA 

CATTACCATTTTTTCACCCGAGGGTCGGCTCTACCAAGTAGAATATGCTTTTAAGGCTAT 

TAACCAGGGGTTGGCCCTTACATCAGTAAGCNTGTCAGGAGGGNAANAGGACCTGTTGNC 

ANNTANATNTGTTCACCACCAGNAAAGAAAAAGTTACCCTNGCTCCNGGNGCGGGCCCGC 



Page 182 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



TTCTTANGAMNCTTATGNTGGGATTCNCCNCTCGGNGCCTTGCCAAGGGAAATT7TCTG 

ATTATTTCNNANGNCTTTTANTTCNGGAATTANCCCGNTCGGATCCCTNCGGAGGGGGGG 

GGGGGGCNTCCCGGGGTAACCCCCCAAGGC I Mi l l 1 GGTATTCCCCCTTTTTAAGATTG 

GAAGGGGGGTTTTAAAA I I I I IGGCCTGGCNGGCCCTNTTGGGGCCGGTTAAAATTNCAA 

TTGGGGTTNCAATTMGGNCTTGGTTTTTNCCCCTTGGTGGGTTGGNNAAAAMTATTGN 

GTTTAATTTCCCCGCCTTTCAACCAAAATTT 

Sequence 1342 

TCACTATAGGGCGAAATTGGAGCTCCACNCGCGGTGGCGGCCCGACATCCGCCGCCCGAG 

GGATCGTTGGCAGCCCGGGTGGGAGTGTGGCTGTGTTCTCGTCCCTGGCTGACAACGAAG 

GGGAGCTAGTCACTTTTCCTGCCACAGACGACTCGCNGTCCGCCTCTCGCCTGGAGTACC 

TCGGCCGCTCTAGAAACTANGTGGATCCCCCGGGCCTGNAGGGAATTCGATATCAAAGNC 

TTATCCGAATACNCGTCCGNANCCTCTGAAGNGGGGGGGGCCCCGGNTACCCCAGNCTTT 

NTTNGTATCCCC I I I I I AAGTTGAGGGGGGTTTAAATTTTGGCNGCCGCCNTTTGGGCTG 

TAATATTCAATTGGGGTTCCANTAAGCCTTGATNTTTCCCCTGTTGGTGAAAAAATTTNG 

TTTAATNCCCGGNNTTCAACAAATTTGCCCACCAACCAAACCAATAACCGTAAAGCCCCG 

GGGGAAGGNCCAATTAAAAAGNTNGTTAAAAAAGNCCCTTGGGGGGGTTGGCCCCTTNAA 

ATGGNAAGTTTGGAAGNCCTTAAATCCTTCCAACCAANTTTTAAAA 

Sequence 1343 

CCAATGAAAATTCCTGTTGCCATTCCAAGAATACCGCCAGCATTGGCCTTGACAATGCCA 

TTATAAAAAGGGACAAGCCCTGCNTTAMCCGGTTCATTGCCCAAGCCCTTTTAAGTTAA 

GAAACAAGGCAAAAAGGGTNCACCCCTNMTTTTCAATTGGAGGGGCCAAACC7TCCATT 

GGACCAAGGAAACCACCAAGGCCAAMCCMGNAAAAGGTTAACCCCCTTTGGGGGNNCC 

GGCTTTCTTTAGAAAAAACTTMNGTTGGGGGAAATTCCCCCCCCCCGGGGGGGCCTTTG 

CNAAGGGGAAAAATTTTCCCGGAATTAATNTCNAAAAGGCCCTTTTAATTTCGGGAAATA 

ACCCCCGGGTCCGGNAACCCNTTCNGGAAGGGGGGGGGGGGGGGGGCCCCCCCGGGGTNN 

NCCCCCAAAAGCCN I I I I I I I rGGTTTTNCCCCC I I I I i I NANGTTGGAAGGGGGGTT 

Sequence 1344 

CGAGGTACGCGGGAGGAACTGCTCAGTTAGGACCCAGAGGGAACCATGGAAGCCCCAGCT 

CAGCTTCTCTTCCTCCTGCTGCTCTGGCTCCCAGATGCCACCGGAGACATnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nGTCAGAGTGTTAGCAGCTACTTAGCCCTGGTACCTTGCCCGGGCCGGCCCGCTCTTAGA 

AACNTAGTTGGGATCCCCCGGGGCCTGCCAGGNMTTCGATTATTCAAAGNCTTTATCGG 

ATTACCGTTCGACCCTCGNAGGGGGGGGGCCCCGGGTACCCCAAGACGTTTTTTGTTTCC 

CCTTTTNAGGTGGAGGGGGGTTAAAATTTGCTGCCGCTTTGGGNCGGTTAAATCAATTGG 

GGTCCAATMGCTTGGNTTTTNCCTTGGTGGTTGGAAAAMTTTTGGTTNTAATT^ 

GCTTCCAA 

Sequence 1345 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCATTACTGCAGGAAAAGGTCCCG 

GAGAGCTGAGCAGTCAAGATGTGTGACTTCACCGAAGACCAGACCGCAGAGTTCAAGGAG 

GCCTTCCAGCTGTTTGACCGAACAGGTGATGGCAAGATCCTGTACGTGCCAAGCATCCTC 

GTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGCCGGCCGTTGCACTCATTTACCCGGG 

GACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGCCTCATGCATNACGGAGCAT 

GAGAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGTGAGCTTGCTNTAGAGGAAG 

AAGGAGCCGTCGGAGTCCANCACGGGGAGGCNGTGGTCT 

Sequence 1346 

CCGGGCAGGTACCATCACCCCTTCATGCTGGCTCTAAGCTT7TTCCAGCAGTCCCACTTC 
CTGATATTCGTTCTCTTCAGCAGCCTAAAATACAGCTTTCTTCTGTCCCCAAAGTAAGCT 
GCTGTGCTCATTGCCCTAATGAACCCTCCACTTCGCCAATGCGTTTTGGTGGTGGTGGTG 
GCGGTAGCGGAGGTACCT 
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Sequence 1347 

GGAGCTCCCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGC 

CCTCGCGCTTCTCCTCGGCCAATTCGCGGAAGAAGtGGCTCACGCCTTCCAGAGCCACAT • 

CATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGGCCTGCAGGTACCAGC 

TGCCATCTGTCAGTTAAACTTCAGAACACACATACCATAGGGATGAAGATAAGAAAGGAA 

TGATTGTGACTCTGGATCAAAATGCTCCCAGTTCCTGAAGGTGCATGATTTAACTGTGAA 

TGCCAACTGAGATCCTAACACCCTATACAACCAAAAGATGAACTAAGTCTCTGAACTGGT 

AACTCATAAAAGACAAATTCCCACAGTNCCCTTCGGGNCGGTTNTAGNAACTTAGTGGAT 

CCCCCCCGGGG 

Sequence 1348 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGATCTCTGCAGTTACAGGAACATCGTCTGCT 

TCATGGTAAATGCAATTTCCTGTTCAGAAACCACACTAACATTACTTCTTTCAGTATTTC 

TGCCTCAAAGTTGGTTTAAGATTATCATCATTATAAGAAAAATTTGTCTAGTGCTGGTCA 

AGGAAATGGGTCTGTGTTTCTTTGGCATGAGCTTTGGAATGGCTGTATTTGGAGGAGGGG 

TAAGGCAAGTCAGATCCACTACTGCGTATCATGACGCAGATGCACATGCACACAGATGCA 

CATGCTCGAAAACAGGCACCTGGAGATTTTAGNAAAATTCATTTGGGGGAAGATTNGCCT 

AACTTrCCTTTCTACTTCAGTGGAATGAATTACCATTTCT 

Sequence 1349 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACACTCAGGGCAGTTTCCAGCTC 

CTCTCACAAACAGTAAATCTACACAACTTTCACAGCCAGGTCAGCAACTGGTTCAAGAAC 

CGCCGGCAGCGCGACAGGAACCCCTCCGAGACCCAGTCCAAAAGGTGAGCGCCAACTTTC 

CTCCTCCCCCGCGTACCT 

Sequence 1350 

CCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCC 

ACCAGGGCCACTGGTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCTGCAG l I i I AACCTACCTGTTCAGCCAG 

TANTACATAACCTGGCGCCCCNGTTACNCTCGGGGCCCGCATCNTAGAAACCTTAGNTGG 

GNATTCTCCCCGCGGGGGNCTTGGCAGGGAAATTTTCGNATTATTCAAAAGNCTTNTATT 

CCGNATTNCNCGGNTCNGTACCCTTCTNAGGGGGGGNGGGGCCCCCCGGNTNACACCCAG 

GCAI ! II ( I NGTTTTCCCCCTTTNTAANNTGGGAAGGGGGTTTTAAAATTTTGNCCGGCC 

GGCCTTTTGGGGGCNGNTTAAAAATTCCAAATGGGGGATTCCAATTAAGNCCCTGGGGAT 

TTTCNCCTTGGNTGGGTTGGAAAAAAANTTNGGTNTTNATTTCCCCGGGCGTTCCAACC 

Sequence 1351 

CCGCNGTGGCGGCCGCCCGGGCAGGTACATCTACATACACATCAGTGCCGTGCCTTCCAT 

TCCCAGGGAGGGCAGGGTCACTAGACAACATCATTAAAATGACCCTTTATCAGAGAACGT 

TTTTCTGTTTAAGGAAATAAACTAGAGGTACAATTTTGAAAGTTTCTACCTGTATATM 

GGATGTTAACCA3TATCAATAAATCACTTCGTATAATCTCAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 1352 

ACGCGGTGGCG3CCGCCCGGGCAGGTACCAGCAGAAACCTGGCCAGGCTCCCAGGCTCCT 

CATCTATGGTGCATCCACCAGGGCCACTGGTATCCCAGCCAGGTTCAGTGGCAGNNGGGT 

CTGGGACAGAGTTCACNTCTCACTCATTAAGTCAGCNCTGGCAGNTCTTNAAGNATTACT 

TGCANATTTTA.CCNACCTTGGTTCAGGCTAGTTANTANATTAAACNTGGGCNCCCCNCTG 

NTACNCTTCTGGGCNCGCTTTCCTTAGNAAACCTTANGTTGGGGATTACTCNCCCCTGNG 

NGNCTTGTCAAGTGGNAAAmCCTGAAATTANTTCAANMGNCCNTTTTAATN 

TGANCCGCCGGTTCCGTAACCTCNTTCCGTNAAGCGGGNGGNGGGGNGTCCCCCCTGNGN 

TTAACCCCCCAAAGGNCTTTNTNTTTTGGTNNTTC 

GGGGTTTTNAAANTTTTGNCCAGCCGGCCNTTTTGGGGCCGGTTAAAAATACCNAATGGG 
GNTTCCAATTAAGGCCNTTGNTTTTTTCNCCTTTGGNGGGTTNGGAAAAAAAT 
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CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCATGGTACCACGGGCCT 

GGGCACTGCATCTTCCCAGAGAAGCAGACTCCACAGGGGCCTCATGACCATCTTTCTCTG 

GAGAGGCAGGCAAACTCCCTCGACAGCCAAAATCCAGAACCTCCCCCCGCGTACCTCGGC 

CGCTCTAGAACTAAGTGGATTCCCCCNGGGCTTGCANGGAAATTTCGATATCAANGCTTA 

TNCGTTACCGTNCGNACCCTCGAAGGGGGGGNCCCGGTACCCAGCCTTTTTGT 

Sequence 1354 

GGCGGCCGAGGTCGCGGGGGAGGAACTGCTCAGTTAGGACCCAGACGGAACCATGGAAGC 
CCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATACCACTGGAGAAATAGT 
GATGACGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCCACCCTCTTCC 
TGCAGNGGCCAGTCAGAAGTGTTGGCACCAACTTANGCCTGGTACCTGCCCG 
Sequence 1355 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGCAACGTGCCA 

TGCTACAGCCCCTGCTTGACMCCAGGTTGGTTTTAAGAACATGCAGAATGTGGAGCATG 

TTCCGCTGTCCTTGGACAGAGCCATGCNGCTGGTGAAAGATGTCTTCATTTCTGCGGCTG 

AGAGAGATGTGTCCTGCCCGGGCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnTCAATAACAAGCTTANNTNTACNATGGGCCTNCATNGGGGGGGGCCGTTCAACCT 

TTCTGTTTGCTGTGTGCTGGCCAGAATGCCTNTTGGGCCGTCCTGTTGTTCTCTTCCCGT 

GTTGTACCTGCCCGGGCGGCA 

Sequence 1356 

GCCTGGGATACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCC 
AGGTTTCTGCTGGTACCTNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnATNAAGCTTATCGATACTCGNCCGACTTTNCCAGGGGGGGAAACCTGTCGTGCCA 
GCTTGNATTAATTGAATCGGCCAACGCCGCGGGGAGAGGCGGTTTTGCGTATTGGGC 
Sequence 1357 

CCGCGGTGGCGGCCGAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGTGGCAGGA 

GAAGGATGGGTGATCCCTAAATTACCTTCCGAAGAGCCATGCATGCGTCCTCTGGCTGCT 

CCAGGTGTAAAAAATGCCAGGGAGGGGACAATCATGAGTCCTAGGCAGGAAAGGTCCTCT 

GGAGAGAGGAAATGGCACACGCGCAAGCACGGAGGCATGAGGGGCCAGGGTCCTGGAGGA 

CACTGGGCCGGGACAGATCANACTCCAGGCAAGGAGGGAGATGGCGCCAGACTGTGATCA 

TAAGCTTCACGATCCCCGCGTACCTGCCCGGGCGGCCGNTCT 

Sequence 1358 

CTCATAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATCI I I I I ! I I I I I I I II I I 1 1 

AACCTTTAATCTCTTTGGACTTTATCCTTGGGCTTTTCTGAMGCACTTTCTGTCCAGCC 

CTTGCTGTCACCCAGCCAAACCTAGCTACCTCACAGGCCCCCAGCAGCTTTCCTGGCTCT 

GCTTCNAGCCCACCGGCCTTNCAACCCTTCCCATAACAATTGGTGCTACCAACCCTTCCN 

AATAACCCCCTGGTCCCCCC/\ACCCCTCCCCAATTACCTGGGNTCCCmCAACCCTTTC 

NCAACNAACNTTNGTTTCTCCNCMCCCCCNCNTTTTTMCCTTNTTTTTGGCCCTNC 

CCCTTCNTTAACCT iT! I I 1 1 GGCCTTCCCCATNGGGNTGNCG I I I I \ I I CCCAATTTTT 

TGGCCCMCCCCTTTCNAAA/yTTC^ 

TANGGGGTT7TTGGAAAAAAAGGN I I I I I 1 NCTTAAGCCTCCNTGGGGGTCTTNTAAAAA 

AANNAATNCANCCTNTTTTTTGTAAAAAANTTT 

Sequence 1359 

CCTCCTAAAGGGAATAANCTTGCGGCCGCTTCATTAATAAATAGACAAACTTCTTATATC 

TCTGCCTTCTGACCATCAMCTGAATAATTTrAATTCAACAGGTCTTATTTTCATTTCCT 

TTGAAATTTTCCTAATTGCCTTCCTTTC 

N G G AATAACCGC AAN AAAAAGAAAANGAANTG G G AAAAAAANTAAAAGTTAAAGG GGTAA 
GGGGAAATTANTAAAAAGGAAGGGTTTCCCTTCCCTTGGATTTNCNTAAAGGTTAATTCN 
GNGGAANCCCCAAAAAAACCTTGGGTGGGTTGGTTAATNCC Mill TCCAAGGGGAAAAG 
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GGGGGGGGAAGNAAGGGGAATTGGNGGCCAACCNCCCAATTCNTTTCTTCCCCCCTTCNA 

AAATTTGGCCCCAAGGGGAACCNCCTTCCCCCCCCTTTNCTTNCTTTGGGAAGGGGAACC 

CNTTAANCTTTCCAACCNCCANTTGGCCCTTTCCCCCTTTCCCCGGGAACCCAANNAAAG 

GGGGCCCCCAATGCNAACCTTTTTCNAANTAAACCNTTGGGNTTNTTGGNAAGNGGGGNG 

GNGGAAAAAGNCCCCCAANGGNCTAAACCCCAACCAAAAAAAGNCCCTTGNCTAAANGGN 

GGAAGGGAAAAANTAAAGGGGGTTTTGGGGGGGAAAAA 

Sequence 1360 

CCTCCTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC I I i I I I I I M I I I I I I I I 

TTTTCCTTTNAAAACAGCGTCATACTAAAGTTTATTTTTCTTTACACA^ 

AG G C AG AC AAGG AG AC AT ATATTGG AGTTTTC C ATTTTTAAT ATTTG GTTAAAC AAAATA 

CATTNTTTTTGNTAAANCAAACCGNCANGCACCAAAGGGCCAAACNAACCMTAAANGGG 

AAATAAAAAANGCCANAGGAAANAAAACCAAATAAAAANGCCCTTAGGGGGGTTTGGCNC 

CCGTTATATTGTGGGNGNCAATTCANGGGGGTTCAAGGGGCCCNTTGNGTTNCAANCTTT 

CCCCTTTAAAAAAAGGGGAAAAGGTNNGGGGGGGCCTTNTTTNTTTNGGCCCCCTTNTGG 

GGNGNCNANTTANAANGNCATTTGGGGGGGTTCCCCTCTTNGGTTTNCCCNTTAAAAGGC 

CAACCCCAAGGMNATTTTGGGGGAATNTATTNGNANACCCANGGNANGTAAAAAACCC 

Sequence 1361 

CTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC I M U I I I I I I I I I I I I I I MG 

CAGGTTGGACGAGTTTGTATTAAGGCATTCTGACTCCTTCACTCTCCCCAGCTCCCTCCA 

TGTCAATCCCTTGGCAGAMCACTGTAGACTTATAAATGGATGACAGATGCAGGCTTTTG 

AGAGGGGACCAAAAAGGGAACCACCTTCTTCCCACTTGGGGGGTGTTGGGGGGTCAAGTC 

CATTCCTTGGGGAGGGGGNGCTTGGACCATTTAGAAGGGGGCNAAAAGCTTCAAGGGGNC 

TTTACCAGGCCCACNAGGCCCACCATTATTGCCCCATTCTNTCTTCCCTTGGTGCCAGGG 

CTTC AATTTATTN GG G G C AATTTTTC CTTG C C AAG N CTTTTCC 

Sequence 1362 

CNGCCCGAGGTACTTTTTTTCTTAA I I I I ( I I I I I l I I I I t I GAACCAAGCAGGTGTATT 
CTGGAAGAAAATCAGACACCCTCCCTGGTTAAGMTTAGTTAACTATAATGTATTGGTGA 
ATTGAGTTTATAGTCTTTTGGTGGGGGTAAAAGGGCCCTCAAAGTCATAGCAGGTTTAGT 
TGTACCTGCCCG 

Sequence 1 363 

ATNGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATTTCCTTTCTGTGGTTTTCTTTTGCT 

TCTTAGAAATTCTGTTAGTGGTTAGTAMGAATTTGAAAGTACGCGGGGCTTTCTTTCCG 

CGCCGATAGCGCTCACGCAAGCATGGTTANCNTGCCTAAAACCCGCCGGACTTTCTGTAA 

GAAGTGTGGCAAGCACCAACCCCATAAAGTGACACAGTACCTGCCCG 

Sequence 1364 

AGGTACTCAGAGTGAGGATTTTCTGCAACCTGCG'.TTGTCCCTCCAGCATCTGCTCTGGC 

TCCATGGCGGACCCTGCAAGTCACAGTCCCCGGATACCAGTCTGGGCGCCGGGAGCACCA 

CGAGCAGTAGAGCAGCAGAGTGGGACAGTCCACGACTGGGCGCCTACATGGGGTCTGGAA 

ACTCTACGACAGTCCCAAGTACCTGCCCG 

Sequence 1365 

AGGTACCTTGGTG7TAC7TCCTTTGAAAGATGGACATTTGAAAG7TAAAAATCAGTCACT 
ATATAGGCACTGCTGTATGGAAMCGCATTTTTGTGTTTCTACAAATTGTAAGCGGGAAA 
TGCCTTTTGGAGTACCTCGGCCCGCTCTAGA/XACTAGTGGATCCCCCCGGGGCT 
Sequence 1366 

CGNGGTGGCGGCCGCCCGGGCAGGTACTGTTTACTACACACTTAAAAACCCGAGTGTCNA 
GTGCCTTTAGGGGAAGAACACTTCTGCCTGCATGAAAATGTCTTAAAAGTGAAGGAAAAT 
TTGAAGAACGCTNTTGATCAGAAGTCCAGACTGTAAACATAATC 
Sequence 1367 

GGNGGCGGCCGAGGNACGAGGGCCTTGGTTNAGNGGGGGGAAAACCCCAAACCGCAAAAN 
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CNNCAGACNGCAGGCNGCNGNNNGTGAGAGNGAACGGGGNCCCACNCCCANANACACNGA 
ACCNGGCNGGGANACCAGNGGCCCNGGNGGANGCACCANAGANGAGGAGCCNGNGAGCCA- 
GGCCTGGTTNCNGCGGGGACCNGCCCG 
Sequence 1368 

ACTATTAGGGCGAATTGGTAGCTCCACCGCGGTGGCGGCCGAGGTAC iTTTTTTI GTTTG 
I I J 1 1 I I I II I I CACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATTGTCACTTAAAT 
GAAAATTTAAAAATGMTAAAAAGACATACTTTCTCAGCTGCAAATATTATNGGAGAATN 
GGGGCACCCACAGGNAATGAANAGAGGAAAGCANCTTCCNTAACCTTCAAAACCATT 
Sequence 1369 

CCGGGCAGGTACCAAAATGG I I I I I GAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTAG 

TGGGTAGCCCCAATCTCTCCATAATATTNTGCAGCTGNAGAAGTTATGNCTTTTTATTCA 

TTTT 

Sequence 1370 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGCTCTGACCACTATTGGTTTTG 

AGACCACGATGTTGGGAGGGTATGTTTACAGCACTCCAGCCAAAAAATACAGCACTGGCA 

TGATTCACCTTCTCCTGCAGGTGACCATTGATGGCAGGAACTACATTGTCGATGCTGGGT 

TTGGACGCTCATACCAGATGTGGCAGCCTCTGGAGTTAATTTCTGGGAAGGATCAGCCTC 

AAGGTGCCTTGTGTCTTTCCGTTTGACGGAAGAGAATGGGATTCTGGTATCTAGACCAAA 

TCAAGAAGGGGAACAGTACCTCGGCCCGTTCTAGAACTAGTGGATCCCCCCGGGCT 

Sequence 1371 

CCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTG 
GG NG CCCCAATTCTCC ATAATATTNTG C AGCTGG AGAAGTATGTCTTTTTTATTCATTTT 
TAA Al I t 1 I CATTTTMGTGACAATCAGTTGGGGAGCAAATTAAATCCTTCCTT 
Sequence 1372 

NCCGGGCAGGTACTTACTTGATGTGACTCTCCTCTCATGCCTGGGCCCTGCTTACAGGTG 

TGATTGTGACACATAGCTTGGCCTAGCCCCTAGGTTATGTNACTCTCCTCTTATCCTTCA 

GTTATTTTCACAAGGGGCATTGCGACATATTGCTGGACTTGGGAACCCAGGTGATGTGAC 

TCTCCTCTACTGNCTGGAATACAACCANAAAAAAAAAAAATNAANANAAAAAGNNCCCTN 

CGNCCGCTCTAGAAANTAGTGGGATC 

Sequence 1373 

CCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTG 

GGTGCCCCMTTCTCCATMTATTTGCAGCTG 

ATTTTCATTTAAGTGACMTCAGTGGGAGCAMTTAMTC^ 

AAAAAAACAAACAAAAAANGTACCT 

Sequence 1374 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACiCTGATTGCATTGCACTT 
AMTGCAGATTATTTTGGAGTTTGAAAAGGGACTATTAATGAMTCTTTCTT^ 
TTTCTCTTTTTCCCTTCCCCGCCACTGATTCAGTGAGCTGGAGATTGGATCACAGGTATA 
ATTCAAGCTTTCATGTAGTCATGTAGATCACTAGACTCCTTGGTGTACCTGCCCG 
Sequence 1375 

ACTACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTTTGG 

CCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACT 

GCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTCTCATCTCCA 

GCCTGGCCCCTACCAAAAGTGTGCGGCCAGTTATTATACTGCTnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGG 

Sequence 1376 

AGGTACCTGNAGGCCTCCTACACCTACCNCTCTCTGGGCTTCTATTTCGACCGCAGATGA 
TGTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCAGAATTGNCCGANGAGAAGCGCNGAG 
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GGCTACGAGCTGTCTNCTGAAGATGCAAAACCAGCNTGGTCGGCCGCTNTAGAACTAGAT 

GANATCCCCCGGGCATGCAGGTAATTCGATATCAAAGCCTT 

Sequence 1377 

CCGGGCAGGTACCAAAATGGTTTTGAAGTTMGGGNGCTGCTTTCTCTCTTCATTCCTGT 
GGGGGCCCCAATTCTNCATAATATTTTGCAGCTGAGAAGTATGTCI I I I IATTCAI Mil 
AAATTTTCATTTAAGTGACAATCAGTTGGGAGCAAATTAAATCCTCCCTTAAGGTGAAAA 
AAAAAAAAACAAAAGTACCTN 
Sequence 1378 

ATCACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTT 

TAAACTGTATTGTATTCATGTTGCTAAACAATATTGGCCTTCTCGATGATTTTATTCATG 

TTGCTCCAAAGTTAAAACCCTGTAGMCTMGTAGGTGAAGAGATATTTTGTATAAGTGC 

CACAGAAGAGAAAATATAATAAATTAAATAGTGATCGAGCGGCCGCCCGGGCAGGTACCA 

GCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGG 

TAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnTCTTGCAGNTTACTACTGTCAGCAGTATAATNACTGGCCCCCGT 

ACCTTGGCCGCTTCTAAAACTAGTGGGATCCCCCCGGCCTGNAAGGMTTTGATNTTTNA 

GCCI I I 1 I GATNCCCGNCGACCTTCANGGGGGGGGCCC 

Sequence 1379 

CCGCGGTGGCGGCCGAGGTACTTTTTCCTAGTAGTGACCCAGTAAATATCTNTNCAACAA 

ACTTGGCTTTGCTGGTGTGCAAAGTGT^TATAAGGAAAATTAGTGACATTTTTTTGCCT 

TTTAGTGTGTAAC NTGG G AATGTATANGTATAG AAAAC AG AG AG ATAGTTATATATTTTA 

ACAACTTGAAATTCATAGGTGTTAATAGATTCCTNTTTTTGAAATTAAATATCTCCTAGG 

ATTTTGATCACTTACAGTGTTGTATGCACTACCTTA 

Sequence 1380 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I M I I i II I I I I I I \ I I I 
TTCCAAAAGCCTTG I I l I I ATTTATATAGAGTCCTAACCACTTCGGTGGTAGGAGGGGTG 
GGAGAGGCTCC I I I I I CAA 
Sequence 1381 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGTATCATTGCA 
gncttgc nrm GAGTAATGGATTCCTAGATTCTATGAGGATACCACAACCACTTTTAA 
AGAGGTTTCTAAGGNCAGTTGCAGTGCTTACGCCTGGGAGACCAAGGTGGGAGGATCACT 
TGAGCTCAGGAGTTCGAGACCAACTTTGTCNTT 

Sequence 1382 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTNNTTNTTTTTTTTTT 

TTTTTCTTCTTCTAGTTGTTTCTTGGTTTCTGAGTCCAGTCCCCTTTGCATAMCT^ 

GCGCAGCAGGTTCTTGGACAACTTTGTCTTTCTCTCGGCCGCCATGGTAGGTTCCGCCCA 

GCGCCGCACTC 

Sequence 1383 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACCAGCAGAAACCTGGC 

CACGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTAnnnnnnnrifjn 

nnnnnnnnnnnnnnnnnnnnnnACAGAAGTTCACTCTCACCATCAGCAGCCTGCAGTCTG 

GAAGATTCTGCAGTTTACTACTGTCAAGCAGTATAATAACTGGCCCCCGTACCTnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTAGTGGAGGGTTAATTGC 

GCGCTTNGGCGTAATCATGGGTCATTAGGCTGTTTCCCTGTGTGAAAATTGGTTATCCGC 

TCACAATTCCACAACAAACATACCNAACCCGGGGAGCANTAAAAAGTGGTAAAAGCCCT 

Sequence 1384 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACAGAAACCTGGCC 
AGGCTCCCAGGCTCCTCATCTATGATGCATCCAACAGGGCCACTGGCAnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnAGCAGCGTAACCAACTGGCCTNGGTTMCTTTTCGGNGG 

ANGGACCAAAGGNGGAGATCAAACGAACTGTGGCTTGCACCATNTGNCTTNATCTTCCCN 

CCATTTGNTGAGCANTTGAAATTTTGGAACTGCCTTTTGNTTGGNGGGGCCTGCTGAATA 

ACTTCTATCCCAAGAGAGGCCAAAGTTACCTGCCCGGGGCCGGCCCGCTCTAAAACTAGT 

GGGATCCCCCGGGCTGCAANGAATTCGATATCAAAGCTTATCGATACCCGTCGACCTTCG 

AAGGGGGGGGGGCCC 

Sequence 1385 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGGGGACATTCATGCCCGGCGG 

TTCGGGGCGACGGACTCTCCATTCCAGAACCATGGCCCAATTTGTCCGTAACCTTGTGGA 

GAAGACCCCGGCGCTGGTGAACGCTGCTGTGACTTACTCGAAGCCTCGATTGGCCACATT 

TTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTAATCATGGGTCATTAGCTGTTTCCTGTG 

TGAAAATTGTTATCCGCTCACAAATTCCACACAACATACCGAAGCCCG 

Sequence 1386 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTNTTCTGTGTGACAATTACTGCA 

CAGTCTTTCCCTCTGACAGCTACTGATGTAAGGCCACCCTGGTTAATAGCCTTAAAAGCA 

TATTCTACTTGGTAGAGCCGACCCTCGGGTGAAAAAATGGTAATGTGGCGGTCAAAACCG 

GCGCTGGAACCACCGGGACATGTTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTTAAATTGCGGCGCTTTGGCGTA 

ATCATGGTATTMGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

ACATTACCGAGCCCGGGGAGCATAAAAAGTGTAAAGCCTGGGG 

Sequence 1387 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGAAATGCATGAACTGGCAANTTATT 

TGTGACTGCAACAGCCCGTGCGTGTTCAGGAGTGTGTCTTCCAATCTGNCCTAAGGCCCA 

AGCAGCTGCAGCCTTAATATGATCTTCCGGTTCTTCTGACAAGCANACTGACAACTGGGG 

TACCTGCCNG 

Sequence 1388 

CTTNGGCGGCCGCCCGGGCAGGTTCTTTGGCCTCTNTGGGATACGAAGTTTTTCANNCAG 

GNTCACAACACGAGGCAGTTCCAGATTTCAACTGCCCATCAGATGGCGGGAAGATGAANA 

CAGATGGTGCAGCCACAGTTCGTTTGATATCCACTTTGGTCCCAGGGCCGAAAGAAGGCA 

GACCAAAATATTGCTGACAGTAATCCCGCGTACCTNGGCCGGACCA 

Sequence 1389 

AGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACGCGGGAGGAACTGCTCAG 

TTAGGACCCAGACGGAACCATGGNTTCCCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTG 

GCTCCCAGATACCACTGGAGAAATAGTGATGACGCAGTCTCCAGCCACCCTGTCTGTGTC 

TCAGGGGAAAGAGCCACCCTCTCCTGCAGGGCCAAGTCAGAGTGTTGGCAGCAACTTTGC 

CTGGTACCTGCCCG 

Sequence 1390 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGANGTCCNNNNACACCCNNNGTGC 
ANNGGTCTTGCGTNGCCNATGCCNCCACCATTGTCCANAAGACTCAGTTAACATTATTTG 
CAGTACAGACGGTTATTGATCCACGATGATAGAAGAGGAT 
Sequence 1391 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTGGCCTC 
TCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTC 
ATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCAGCTt 
GGTCCCCTGACAAAAGTGTACCT 
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Sequence 1392 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGCAATGCCTCAGGTTGCACCTGACTTATA 

TGCTGAACTACAGAAGGCACATTTAATTTTATTCAAGGGTGATTTGAATTACAGGAAGTT 

GACAGGTGACAGAAAATGGGAGTTTTCTGTTCCATTCATCAGGCTCTGAATGGCTTCCAT 

CCTGCACCACTCTGTACCT 

Sequence 1393 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCAGGCTAAGGAGGTGCTAA 

CACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnTCCGGTGGTATCTGGGAGCCAGAGTAGCAGGAGAA 

GAGAAGCTGAGCTGGGGCTTCCATGGTTCCCTCTGGGTCCTAACTGAGCAGTTCCTC CCC 

GCGTACCTGCCCG 

Sequence 1394 

CGAGGTACTGTCGTCTCCCAACTCCCACCTACAGGGCCCGGGAATCCTAGGGAGCGGATA 

AAGGCCTGGGAGAGGGAGCAAGATGGGCTCCGCCACGAGATCCGGCTGAAGTCCGGCTCG 

GCGCCGGCCTCAACTGCAAGAGGAAGGCACTCTCTCTCCCCAAGCTGAAACACCAGAAGA 

GCCACACT 

Sequence 1395 

GACTTTGTTTG I rTTTTTTTTTTTTI ! CACCTTAAGGGANGACTTNATNTNTTNCCNTNT 

GACCGCCACTTAAATGAAAATTTAAAAATGAATAAAAAGACATACTTCTCAGCTGCAAAT 

ATTATGGAGAATTGGGGCACCCACAGGATGAAGAGAGAAAGCAGCTCCCTAACTTCAAAA 

CCATTTTGGTACCT 

Sequence 1396 

CGGCCGAGGNACGCGTTGCTTTTCACAATTTGGCNCCGAAANTTTAANAACGAAGCTCCT 

GCCCCTCCTAMGCTGMGCCAMGCGAAGGCTTTAAAGGCCAANAAGGCAGTGTTGAAA 

GGTGTCCACAGCCACAAAAAGAAGAAGATCCGCACGTCACCCACCTTCCnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCNATATCAAGCTTATCCGATACCCGT 

NCGACCTCGAGGGGGGGCCCCGGGACCCCANCTTTTTGNTCCCTTTAACTGAGGGGTTAA 

ATTTGCCGCCCCTNTGGGCCGTAANTCATTGGGGCAATAAGCCTGGGNTCCCTNGGGGGG 

AAAAANTGGGTATTCCCGCCTCNACNAATTTTCNCCACAAACAATTACCGAAGCCCGGGG 

AGCATAAAAAAGGGGTAAAAACCCCTGGGG 

Sequence 1397 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGCCAGGTACTGGAGGAGTGAGTCC 

CTATGCTGACCCCAATNCTTGCAGAGGTGATTCTGGCGGNCCCTTGATAGTTCACAAGAG 

AAGTCGTTTCATTCAAGTTGGTGTAATCAGCTGGGGAGTAGTGGTGTNTGNAAAAACCAN 

AAGCGANAAGCAGGTACCT 

Sequence 1398 

ACTACTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCGCGGGAGCTTCTCC 
TTGCCAGTTTCTCCCAGCAGGACCCTCTTCTTGTTTTGAAAGATGGTCGGCTGCTTTTGG 
TAGGCACGCTCAGTCTGAATGTCCGCCATCTTCC 
Sequence 1399 

GTACGCGGGGGAGGCTCGCGGTTNGAGTGCGGCGCTGGGCGGNTCCTACCATGGTTTTNG 
AGAGAAAGACAAAGTTGTCCAAGAATCTGNTGCGCATGAAGTTTATGCATAGGGGACTGG 
CTCAGAAACCAAGAAACAACTAGAAAGAAGAAAAGAAAATCATTAGTGAAGAGCACTGGT 
ACCT 

Sequence 1400 

GCGGTTTGGCCNCCGCAGGTACCAACANAAACCNGNCCAGCTCCCANGCTCCTCATCTAT 
GATGCATCCAACA I I I \ I CACTGGCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTTNAGNGTATTACTGNCAGC 
AGCGTAACAACTCGTACCTCGGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnNGATATCAAGCTNATCGATNCCGNNAGACCTCGAGGGGGGG 
Sequence 1401 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACl rTTTTTTTTTTTTTTTTTl 

AGAGATGGGGGTTTCACCGTGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATC 

CACTCGCCTCGGCCTCCCAAAGCGCTGGGATGACAGGCGTGAGCCATTGCACCAAGACCC 

GATAGGTCTAAGGGGTCTCATTTGAGTCATTGTATATAAAI I I M I I i I CAATG1 I I I I I 

GTCTAAAAAACTTTAGACAGGGAATGCACTCTCCAATTACCTTCAACTGCTACCTNTCCT 

TATTTTMTNNCCNAAAAACNNATT1TGGCCTTTAACMTTTTM 

AAATTTGTTTNTTNTTAAAAMTTN I I I I I I I ATTTNGGNNGGGGCCNANCAAAANCNGG 
GGGGGGACCCNNGGGGGGGGTTCANTTTTCNCCNCMTTTTAACCNGGGTTTCCCCNGGA 
AAACNGGNNGAAAAAAAGTGNGGGGGNCNTTTTCCCAATTNAAAAAANTNGGGGAAANAA 
ACCCTTTTAAANNANGGGGG 1 I I 1 I I I I NCTTTTTNACCNNNNCAAAAAATNNTTTTTTT 
TTT 

Sequence 1402 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGAGGAACTGCTCAGTT. 
AGGACCCAGAGGGAACCATGGAAGCCCCAGCTCCACTTCTCTTCCTCCTGCTACTCTGGC 
TCCCAGATACCACCGGA 
Sequence 1403 

AGGTACGCGGGGGAGCTACAACAGGCAGGCAGGGGCAGCAAGATGGTGTTGCAGACCCAG 

GTCTTCATTTCTCTGTTGCTCTGGATCTCTGGTGCCTACGGGGACATCGTGATGACCCAG 

TCTCCAGACTCCCTGGCTGTGTCTCTGGGCGAGAGGGCCACCATCAACTGCAAGTCCAGC 

CAGAGTCTTTTATACAGCTCCAGCAATAACAACTACTTAGCTTGGTACCTGCCCGnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTGCAGGAATTCGATATCAAAGCCTTATCGA 

TACCCGTCCGACCTCGAGGGGGGGGGCCCCCGGTTACCCAAGC I M M GTTCCCTTTAAG 

TGGAG 

Sequence 1404 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTT 

CTCTCTTCATTCCTGTGGGTGCCCCMTTCTCCATAATATTTGCAGCTGAGAAGTATGTC 

TTTTT ATTC ATTTTTAAATTTTC ATTTAAG TG AC AAT C AG TTG G GAG C AAATTAAATC C T 

CCCTTAAGGTGAAAAAAAAAAAAAAACAAACAAAAAAAAAGTACCTnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCGACCCT 

CGAGGGGGGGGCCCCGGTACCCAGCCTTTTTGTTCCCTTTTAGTGGAGGGGTTTAAATTG 

CCGCCGCTTTGGCGNT 

Sequence 1405 

GGACCCCTTATCGACTNCTATAGGGCNNATTGGAGCTCCCCGCGGTGGCGGCCGAAGGAG 

AAATAGAACAGTGCAGGCAAAAGAAGAAAGGCGCGGGCTGGGTGGGAAGAGGATTCGGAC 

TCGTCACACTGCAGAGCAGCAGAGCGAGAAAGGATGAGAAGAGGCAGAGAAGGCGACGGC 

AGAAGAAAAAAGGAAAAACTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnGCGCCGCTTGGCCGTAATCATGGTCATAGCCTGTTT 

CCTGTGGTGAAAATTGTTATCCCGCTTCACAAATTCCCACCACCAACATACCGAAGCC 

Sequence 1406 

CCGCGGTGGCGGCCGAGGTACAGAAACATTTTAGAAAAAGAAAGGGAACTTCAGCAACTG 

GGAATCACAGAATACCTAAGGAAAAACATTGCTCAGCTCCAGCCTGATATGGAGGCACAT 

TATCCTGGAGCCCACGAAGAGCTGAAGTTAATGGAAACATTAATGTACCTGCCCGnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCGACCTCGAGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnCGCTTGGCGTAATCATGGTNTAAGCCTGTTTCCTGTGTGAAAATTGTTATCCCGC 

TCACAATTTCCACACAACATTACNAGCCCGGGGAGCATTAAA 
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Sequence 1407 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGCCT 

CTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCT 

CATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATCTCCACCT 

TGGTCCCTCCGCCGAAAGTGAGCGGGGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTGCGCCGCTTGGTCGTAATCAATGGG 

TCATAAGCCTGTTTTCCTGTGTG 

Sequence 1408 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGCCTC 

TCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGATTTCAACTGCTC 

ATCAGATGGCGGGAAGATGAAGACAGATGGTGCAGCCACAGTTCGTTTGATATCCACTTT 

GGTCCCAGAGCCGAAAGTGAATATAGGATGAGGCCAGTTGCTACGCTGCTnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGAGTGAAGTCTGTCCCAGACC 

CACTTGCCACTTGAACCTGGCTTGGGATTGCCCAGTGGGCCCTTGTTGGGATGCATCAAT 

AAGATTGAAGGNAGCCCTGGGGNAGCCTTGGCCCAGGGTTJCNTGGTTGGG 

Sequence 1409 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCCAGGAAGTGAAGTAAAACCTGGTCTTGGT 

TGATAGGCCCCAGGTTGGCTTGGAGCCATTCCAGGTTGAGAGGCAGGAGCCACAGTATAA 

TTAGTAGGCTGAGAAGTTTGGGCAGTGTAAGTTTGTGCAGGATAATTGCTCGCCTGGTAC 

CTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnGCCGCCGCTTGGCCGTTAATCAATGGGTCATTAGCCTGTTTCCTGGTGTGGAAAA 

TTGTTT ATT C C G C 

Sequence 1410 

CACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAGTT 

CGCGGAAGAAGTGGCTCACGCCTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCA 

NAGAGAGGTAGGTGTAGGAGGCCTGCANGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGGT 

CATAGGCTGTTTCCTGTGTGNAAATNTGGTTATCCGGCTCACAAATTCCACCA 

Sequence 141 1 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGCCTTTGCCTTGAGTCCA 

TCCACGCTTTAGCAGCTGCTCCTCAGCAAACTTCATCCCACGACTCTTGACCTCTGGGGT 

GGCATTCATGGTGAGAAAAAAGTCTCTATCCTCAGACTCTCACCAGAGAAGAAGAGGTGA 

TATCATCCGTCATTGCTGGGTCCGCTCCGCGGAAACATGTGCCGGACCTGACTTGTGCGC 

CGCCATCTTCCCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnCTTATCGATACCCGTCGACCTCGAGGGGGGGGGCCCCGGGTACCC 

CAGC l I I ) I GTTCCCTTTTAGTGGAGGGGTTAATTTGGCGCCGC 

Sequence 1412 

CGAGGTAC II II I I I II I II M I II I I I I I I I I I AAGNGATTTGCAGTTAGTTTATTGAA 
ATGGAAATGAATTTGAGGNTTCAACAGTNGNCTAGCAGTTTCCACATGAAAAAGG ACAAC 
NCNAAGTAGAAGTAAAAGNTGNGNTTTTCTTAGGNCCCCCNAAAAAANNTANNANTTTTT 
TTTGGGCCCCCGNAAAAAANANAAAAATTCCCCCCCNNAAAANGGGGNGGGGGGGNAAAA 
NA TTTTTTTTI I GGGGGNNGNNAAAAAAAAAAGGGGNGCCNGNTTTTTTN I ( I Hi I ING 
NNCCCCCCCCi WWW I GGGGGGGGGGGGGGGGGGGGGGGGCCCCCCCCCCCCCCCCTA 
AAAAAAN T IXTTTTTTTI I AAAAAAAANNGGGGTNNTG I nTTTTI I GCCCCCCCCCTTT 
TTTTTTGGGGGGGGTN I I I I I I I I I I AAAAAAAAAANNGGGGGNNTNNNNCCCCCCNTTT 
CCCCCCCCCNGGCCCCCCCNN I I I I I I I I I I I U I I CCCCCCCCTATTTTTTTGGGGGGG 
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GGGGGGGGTAAAAAAAAAANGGGGG T"l Mil INGNCCCCCT III! INCCCCCCCT 
Sequence 1413 

ATACGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACC 

AGCAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCGTCCACCAGGGCCACTG 

GTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCAGTATAATAACTGGCCCCCGT 

ACACTTTTGGTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCT 

TCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGGAACTGCCTCTGTTTGTGTTGCC 

CTGCCTGAAATAACTTTCTATCCCAGGAGNAGGCCAAAAGTTACCTCGGGCC 

Sequence 1414 

AGATGATGTC I I I I IGTTATTGTCTGTCTGTTTTGTATGTGTCTGANATAAGGGATAGAG 
AGGAAACATCCGTTCAGGCTMTTNTAACTACATTTTATTNTTAAAAAATAGNAGGAAAA 
CCATAACNCTTNTTTAAAAAGGGNGTNNNNAAAN ! I I I I I I I I GGTTTNTGGGGGTTGNG 
GGTTTTTAAAAAANAAACCCCCCCCAAAAAANGGGGGGTGTNTCCCCCCCCCCAAAAAAN 
NGGGGNGTTTAAAAAAAMGTGNGNNNGGGGGGNNTTTTAAAAAAAANNCCCCCCCCAAN 
AAAANGGGGGGGGTN M I t M I U I 111 l } l I AAAAAAAANNAGGGNGGGG I I M I I I M 
TAAANANNNGGGGGGGGANAAAAAAAANGGGGGGGGGGGGGGGGGTGNGGGTCCCCCCTN 
C CCCCC N C N AAAN AN N N CCCCCCCCC CC NTTCTC ANTTTTTTTTATTATNTNT AAAAATT 
TTTCCCCCCCCCCCCN I ! I M I I I I I I NGGGGGGGGGGGGGGGGCACTNNTTNTTCNTTT 
TTTTTNNCAAAAAAAAA I I I I I I I II I I I I I I A TTTTNTNTAGNGGGGGGGNTATTTATA 
N i I I II IATACCCCCCCCCCC 
Sequence 1415 

AGGTACCTGCAGGCCTCCTACACCTACCTCTCTCTGGGCTTCTATTTTGACCGCGATGAT 
GTGGCTCTGGAAGGCGTGAGCCACTTCTTCCGCGAACTGGCCGAGGGAGGAAGCGCCGAG 
GGGGCTACCNAGGCGTNCTCCTGGAAAGATTGGGGGNCCCCCCCAAAATNTTAAAAGGAA 
AAAANNNAAAAANNCCCCCNCGCCCCCCAAAAAAANNGGGGGNTNCCCCCCNGGGGGNAT 
I I I I I I I I GGGGGGGGGGGGGGGGGNNTNCCCCCCNGGNTTTGGGGGGGGNNTTTNCCCC 
CNTCNCCCCCCCCNGGGGGGGGGNNCCCCCNI I I I I I I I I ICCCCCCCCl I I I I I I i i i I 
AAAAAAAAGGGGGGGGGGGNAAAAAAAAAAAAAAAAAAAACCCCCCCCTN I rTTTTTTTA 
NAAAAAAAAGGGGGGGNN I I I I I I I i NNGGGGGNNNNGGGGGNNNNNNNAAAAAAAAAAA 
nTTTTTTTTTTi I NNNCCCCNNNNNNCCCCCCCCCNNNNNCCCNNNNTGNNCCCNTTAT 
AATNNCCCCGGGGGNGGNNGNGGGGGGGGGGGNNNNNNNANNTTNTTTTTANCCCCCCCC 
CCCC ITTTTTTTTTn NGGGGGGGGGGGGCCCCCCCCCCCNNAAAAAAAAANGGGGGGGG 
GGG 

Sequence 1416 

AGGTAC II I I I I I I I GTTTG rTTTTTTTTTTTl ICACCTTAAGGGAGGATTTAATTNGCT 

CCCAACTGATNGTCACTTAMTGAAAATTTAAAAAGTGAAGTAAAAAGNACATTACTTCT 

CAGCCTGGCAAATTATNTATNGGGAGGAANAAAAAAA I I I I I I I I I I I I I I IGGGGGGGG 

NGGGGGGGTATMGGGGGNNTNNGGGNTTTNCCCCNCTAANAAAAANTCCCCCNTTTNCC 

CCCCTTNNCCCCNTAAAAAAAAAAAACCCNCCNNAAAAAAAGGGGGGNNGGGGGGGNGNT 

TNANAAAAAAAAAANAAA A I I I I I I I I I I NGGGGGGGGGGGGAAAAAAAANANNANAAAA 

AGGNGGNTTTNTAAAAAAAAAANNNGGNGNTGAAAAAAAAANNGNGGTCGNTNTAANAAA 

AATNANATAANTNNTTTAAAATNGGGGGGGNNTTCCCCCCCCANAAAAANAGGNGGGGGT 

GNCCCCCCCCCCTCl I I I i I I I I I II I I I NTACNNANGGCCCCCCCCTTTNCCTNNTTTT 

I I I I I M l I I I I I AAAAAAANAGAATAATTNTNNCCCCCCCCCCTTTTN I I I I I I I I GTN 

NNTANNGNGANCCCCCCCCCCCNCNAAAATATCNNCACTNTTGNTGNGAAAAATATTTCC 

CCCCCCCCCCCCCCCC 

Sequence 1417 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAATGGTTTTGAAGTTAGGGAGCTGCTTTC 
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TCTCTTCATTCCTGTGGGTGCCCCAATTCTCCATAATATTTGCAGCTGAGAAGTATGTCT 

TTTTATTCATTTTTAMTTTTCATTTAAGTGACAATCAGTTGGGAGC 

CCTTMGGTGAAAAAAAAMCAAACAAAAAAAAAGTACCTnnnhnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGnn 

nnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTAGTGGAGGGGTTTAAATTGGCGCCGCCT 

TGGCGGTAAATCA 

Sequence 1418 

AGGTACATGGGGGTCCGTGCGGGCAGAACCCAGGGCATGAAGATCCAAAAGGGCCTGGTT 

CAGCTTTTTCTCCAGGGCCATGGCAGCTTTCATGGCGTCTGGGGTTTTACCCCACTCATC 

TTCAGCTGGCTTCTTGATGTCCTGGAAGAGAGCnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTMTTTGCCGCGCCTTGGCGTAATCATT 

GGTCATAAGCCTGTTTTCCTGTGTGAAAAATTGTTTATCCCGCTCACA 

Sequence 1419 

TATAGGGCGAATTGGANCTCCCCGCGGNGGCGGCCGAGGTAC I I 1 1 I T I I GTTTGTTTTT 
TTTTTTTCACCTTAAGGG AGG ATTTAATTTG C TCCC AACTG ATTGC CAC TTAAATG AAAA 
TGTTAAAAATG AATAAAN AAGTACANTACCTTTTTTC AAN C NTGGC AAATAAN AATTTAA 
TNGNGNAGAAAATTTNTGGGTTNGGGGTTTGGGGGTTNTTTTTNCCCCNTAAAAAAA^ 
TACCCCCTNNNCCCCCTTTTNNCCCCCCTATAAAAAAANCCCCCCCCNTAAAAAAAAANN 
TTTATGGGGGGG I Mil! I N AAAAANATAAAAAAAAN AN I i 1 1 I i I i i GGGGGGNTGTGT 
CNAAAAAANNCTAAAAAAAGGGGGNGGTNTNCCCNAAAAAAGGGGGGNGNTNTTAAAAAA 
ANGNGGGGGGGGGNTGTANAAAAAATAAATATTTA I till I AGGGGGGGGGGGTNGNCCC 
CCCCCTNTNAAAAAANNNANNANGNGGNCCCCCTNAI I I I I I I I I I M NCCCCNCTTTCC 
CCCCNTNTCTACNTC 1 I I I I 1 I I I I I I I I I I I I GTAAAAANANNGNAATTAATNNCCCCC 
CCCCCCCCNN rTTTTTTTTTl I NATCTNTNNNNCCCCCCCCCCCCCCTAANTTGTNCTTA 
ANATTATTNATAATAMNATNGTTTTTNTATTTTNCCCCCCCCCGTNNGTCTCCCCCCCA 

Sequence 1420 

CTTAGGGCAATTGAGCTCNCCGCGGTGGCGGCCGAGGGACCAAGGTGGAAATCAAACGAA 
CTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAA 
CTGCCTCTGTTGTGTGCCCTGCTGAATAACCTTCTATCCCAGNAGNAGGGCCCAAAGTAA 
AANTAAAAAGGGGGGN I I (MM I AAAAAACCCCCCCNCCCCCC I rTTTTTTTI ICCNNG 
GGGGGGGGGGGGGGNGGCCCCCCTTCCCCCCCGGGGGGGGGCCCCNAI i M 111 M I ICC 

ccci mrrm i naaaaaagggggngnnaaaaaaaaaaaaaanatccccccn mrrm 
tnaaaaaaanggg i i i i i i i i t i i i i ggggggggggggggggggnnaaaaaaattttttt 
tttnccccnncccccccnnccctnttncccttnttntccccgggggtnttngggggnnta 
atnnnggn i i i i i i i ccccccccntn i i i i i i i i nnggggggggggnccccccccaaaaa 
aaaaaaanggnnggggggggggggggggtttaaaaaaaaaaaaaaaaaaa 

Sequence 1421 

AGGTACCTTGGTGTTACTTCCTTTGAAAGATGGACATTGAMGTTAAAAATCAGTCACTA 
TATAGGCACTGCTGTATGGAAAACGCATTTTGTGTTTCTACAAATTGTAAGTCGGNAAAT 
GNCC i INI i GTAGTATCCTTCGGGNCTTCCCCCGGGGGGGGNCCCCC \ I Mill ICCCC 
CC l 1 I II I I I AAAAAGGGGGGGGNAAAAAAAAAAAAGCCCCCCCNTTTTTTTNAAAAAAG 
GGGGN I 11 I II n I I I GGGGGGGNGGNGGGGGGGTTTTNAAAAAAAN rTTTTTTTTTTNN 
CCCCNNNGCCCCCCNNGTCCCCCCTTNGNCCCCNNNTNCCCCCGGGGGGNNTNGGGGGGG 
NNTTNGGGNl II I I INCCCCCCCCCI 1 I I I U I I I GGGGGGGGGGNNNCCCCCCCCCNAA 
AAAAAANATNTTTTGGGGGGGNTTTTTAAAAAAAAAAAGNGAAAAAANNN 1 I II I I I I 1 I 
TNNNII (Ml I NNNNNGNCCCCCCCCCGGGGGGGGGGNGANAAAAAAAAAAAAN I Mill 
TTTNAAAAAAAAAAAI M M II M I M I NCCCCCCCCCCCCANAAAAAATTNNAAAAAAN 
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GGGGGGGGGGGG 1 I I I HCNCCCCC 
Sequence 1422 

AGGTACCTGCACAATGTCTACCCAGAGATGTTTGTTCCTGACCTGACGCCCACCTTCTAT 
GGTGCCATCAAGAACCTNGGCACCAACCAATGCCTGGATGTGGGTGAGTAACAACCCGCG 
GNGGGGGMNGCCCCCTCAATCAATGGGNTTTTTNAAAAAAAAACCCCCCTNCCCCTTTT 
TTTTGGGGGGNGTNCCCCCGGCCCCCTTNCCCCGGGGATTTGGGGGGTTTNG I I I I I M I 
CCCCCCCGGGGGGGGGGGGGGHH I rTTTTTTTTTI I CCCCCCCGGGGGGGGGGGCCCCCC 

c 1 1 1 1 1 1 1 1 1 n icccccccn i nrrrm i aaaaaaagggggggggggganaaaaaaaa 

AAAAAAAAAAANCCCCCCCCC I I I I I I 1 I k I AAAAAAAAGGGGGGGN I I I I i I I I t I I GG 
GGGGNNNNGGGTNNTNAAAAAAAAAANI I I 1 I I I 1 I I 1 I 1 I 1 NCCCCNNGGNNCCCCCCC 
CNNNNNCCCCTGTNGNCCCCCTNTTNNNCCCCCGGGGGTNNNNTNNNGGGGGGGGTTTTN 
NNGNI MUM ICCCCCCCCCI rTTTTTTTTTI I NGGGGGGGGGGGGGNCCCCCCCCCCC 
NAAAAAAAAAANANATTTNTTTGGGGGGGGGNTTTTGNANNAAAAAAAAA 
Sequence 1423 

AGGTACATATTTATGCCAGAAAGCTTGACAAACACTGAACAGTCTGTAAAAAAAATGGAA 
AATCTCTTGACCATCAATTCAAATAAATATMTTTTMCTATTTTGCAGCCAATACCTAA 
ATTTTACCTTTNTAATTTAAANANTTANA I I I I I I I I I I I I ANAAAAAAACCCCCCCCCT 
ITTTTTTm ICCCCCCC MINIUM NAAAAAAAGNCAAAAANGNCAAAAANGTAAAA 
CCNI M I M M I I GGGGGGGGGGGNCCCCCCCCCN I M M M i M INCNCCCCNI M I M 
I M M M M I NGGGGGGGGNGNTAANTATNTTCTTCNCCTGGGNGGGGGNN ! M I M M I 
TTTCCNCCCNCCAAAAAAAAANGTGTTTNGGGGGGGNNAATNNTTTTTTN 
AAATTNCCCCCCNNNNGCCCCCNN I I I I III I I I I I NCCCCCCCCCCNTNGGGGGGTNTT 
TTAATACCTCCTNNNCNCCCCCCGGGGGGGGGGGGGGNTTNCCCCC I nTTTTTTTTTNT 
CCCCCCCCCNU I I M I M I I I NANAAAAAANAANGGGGGNNNGGTNGNGTGGCCAAAAT 
NGNTTTATTTNTTTTCCCCCCCCCCC IMIIIIII 
Sequence 1424 

AGGTACTTTC rTTTTTl GTTTG I M M M I M I CACCTTAAGGGAGGATTTAATTTGCTC 
C CMCTG ATTGTC ACTTAAATG AAAATTT AAN AAATG AATAAAAAGG AC ATAC CTTCTCA 
GCNTGGCAAAATAATTTATTGGGGAGGGGNAAAAAAAAAAAAAA I M I U I INTTTTTGG 
GGGGNNGGGGNTTNNAGGGGNTTGGGGGNTTTCCCCCCCCAAAAAANAANNCCCCCCTTN 
CCCCCNNTTTNCCCCAAAAAAAAATCCCCCCCNAAAAAAAANNNANTTTTGGGGGGGGGN 
GNTTTAAAAAAAAAN AN AAAAAA I M M M M 1 NGGGGGGGGGGGGGGGGGNGTNTTAAA 
AAAMNGGGNGNTTTTNANAAAAAAMNGGGGGGGNGNTAAAAAAAAANGGGGNGNNGT^ 
TAAAAAAAATAANAAANAN1TTTAAAANGGGGGGGGNNNGNTNCCCCCCCCCCAAAAAAA 
AAANGGGGGGGNGGGGCWCCCCCCCCCN I M I M M M M I NNANNCNCNNNNNCCCCCC 
CNACNANTCCCCC 1 I I I II ! I II Mill II I NAAAAAANNNTAAAAANNNCCCCCCCCCC 
CCCCCTTTTTTTTTTTTTTrrTNACACCCCCC 
Sequence 1425 

CCGG NCAGGTNCACTTG \AACCAMNTTCTAAAACTNGTTTTTCTTAAAAAATAGTTGTT 
G NAACATTAAACCATAACCTAATC AGTGNGTN CACTATGCTTC C ACACTTAG CNCAGACC 
NTTCTCACNACATTNUTACTTGGGG I IIMMII I CCCCCNAAAANTTTTAAAAAGGGG 
GNTTTTTTTTT1 M NCCOCN1 I I I M M 1 I I NCCCCCAAAAAAAAAAAAAANANGGGGNN 
TTGGGGGGGGNTTTCCCCCNAAAAACCCCN I M I M I 1 I I NGGGGGGGGGGGNAAAAAAA 
NCCCCCCCTTAAAAAAATGGGGGGGGTNTAAAAAAAAAAACCCCCNCCTAAANAAAAGGG 
GGGNGCCCCCNAAAAAAAAAAAAAMNCNNNTTTTTTGGGGGGGNCNGGGNTTNCCCC 
CCCCCCN 1 M I II M I M M 1 I I I GGGGGGGGGGGGGGGGGTTTTTTTAAAAAAAAAGGG 
GGGGGNNGNTGGANAAAAACCCC I MM IINNNN1 I M M M I I NNNGGNNTGGGGGGNT 
1 1 M I ' M M r GGNNGNTGTGNNAGTNTTTCCCCCCNTTTTTTTCTCC 1 I I I 11 I I AN NAG 
ACCCCCCCCCCNTT 
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Sequence 1426 

CGCCCGGGCAGGTACMCCATCCAGCAGGTCCCAGAACAGTTTTCTTCTGGGCTCCAATT 
ATGAAATGGGGGTTGGTGTGTGCTGGATTGGCTGATGTNGGCCAGNACTCTGCAGGAANA 
MCTTAGGCACMGGCNTCAATCTGTCTTGGGANNGGGTTTTTANTTTTTNNNTTTTAG 
GGGGGGGTGAAAAAAAAN I I I I I NGGGGGGGGGGGGNGTNCCCCCCC I I I I I I I I I NAAA 
AAAAAANCCCCCCCCAAAAAAAAAMNNTTTTGGGGGGGM^ 

AN I I II I I I I N I I I I I NNAAAAAAAAAAAN I I I M I I I I I I I I I CACCCCCCCNNNGGGG 
GGGGGGGGGGGGGGGGG \ I I I I I I M I I I I I CCCCCCCCAAAAAAAAAAAAAAAAANGGG 
GGGGGGGGGTAAAAAAAAAANA I I 1 1 1 I I I AANAAAAANCCCCCCCCC I I I 1 I I I I I I I I 
TTTCCCCCCCCCCAAAAAAAAAAAAAAACCCCCCCCCCCNA1 1 I I I I I I NTTATTTTGGG 
NGGGGGGGGGGGGG ITTTTTTTTTTTTTTT1 1 AANNTTTTAAAAAAAAT 11 1 II 1 1 1 1 II 
TATTTTNTTNTTNAAAAAAAAAAAAAAA I MINIM I NNTGTTNTGTNGGGCCCCCCCC 
CCCC 

Sequence 1427 

TACTTAGGGCGMTTGGAGCTCNCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCA 

GGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAGTTCGCGGAAGAAGTGGCTCACGC 

CTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGG 

CCTGCAGGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGTCATAnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnCGCTCACAATTTCCACACAACATACGNAGCCCGGGGAGCATAAAGGTGTA 

AAAGCCTGGGGGTG 

Sequence 1428 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGATACCTCACCGTGGCTGCTG 

TCTTCCGTGGTCGGATGTCCATGAAGGAGGTCGATGAGCAGATGCTTAACGTGCAGAACA 

AGAACAGCAGCTACTTTGTGGAATGGATCCCCAACAATGTCAAGACAGCCCGTCTGTGAC 

ATCCCACCTCGTGGCCTCAAGATGGCAGTCACCTTCATTGGCAATAGCACAGCCATCCAG 

GAGCTCTTCAAGCGCATCTCGGGAGCAGTTCACTGCCATGTTCCGCCGGAAGGCCTTCCT 

CCACTGGTACCTCGGCCGCTCTAGAACTAGGTGGGATCCCCCGGGCTGCAGGGAATTCGG 

ATATCAAAGGCTTATCGATACCCGTCCGACCTTCGAGGGGGGGGGGCCCCCGGGTACCCC 

AACTTTTT 

Sequence 1429 

ANCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGTCGAGAGGTGGTTAC 

ATTCGTTGAAGGACACCAGCTGCGGAATTTGCGGCTTTGGCAGATTGAAATCATGGCAGG 

TCCAGAAAGTGATGCGCAATACCAGTTCACTGGTATTAAAAAATATTTCAACTCTTATAC 

TCTCACAGGTAGAATGAACTGTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGGTAATCATGGGTCATTA 

GCCTGTTTCCTGTGTNAAAATTGTTATCCGCTCACAAATTTCCACACCAACATACCGAAG 

CCGGG 

Sequence 1430 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTTTCATTTA 

GTAATTAAATTTTAGTCAGATTTTGCCCAACCTAATGCTCTCAGGGAAAGCCTCTGGCAA 

GTAGCTTTCTCCTTCAGAGGTCTAATTTAGTAGAAAGGTCATCCAAAGAACATCTGCACT 

CCTGAACACGCCCTGAAGAMTCCTGGGAATTGACCTTGTAATCGATTTGTCTGTCAAGG 

TCCTAAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCGTCGACCTCGAGGGGGGGGCCCGGGTACCCAGCTT 

TTTGTTCCCTTTTAGTGGAGGGTTTAAATTGGCGNCGCTTGGCCGTAATCATGGGTCATT 

AAG 
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Sequence 1431 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCCC 

AGTCGTGGCCCTCTGGACAAGTGGCGGGCCCTGCACTCATGAGGGCTTCCAATGTGCTGC 

CCCCCTCTTAATACTCACCAATAAATTCTACTTCCTGTCCAAGAAAAAAAAAAAAAAAAA 

AAAAAAAANGTNCCGTCCGGTGTAATGGCGGCNCGGTGGAGCTCTAGGTGATGTTTGCAG 

ACAGAAAAAGTAAACTGTCAAGGAATTCANAGCATTGGGTACCT 

Sequence 1432 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAATGGT 

TTTGAAGTTAGGGAGCTGCTTTCTCTCTTCATTCCTGTGGGTGCCCCAATTCTCCATAAT 

ATTTGCAGCTGAGAAGTATGTCTTTTTATTCATTTTTAMTTTTCA 

AAGTTGGGAGCAAATTAAMTCCTCCCTTAAGGTGAAAAAAAAAAAACAAACAAAAA 

AAGTACCTCGGCCCGCTCTAGAACTA 

Sequence 1433 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCG 

AAGTTGGTGCCAACACTCCGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAG 

AGTAGCAGGAGGGAAGAGAAGCTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACT 

GAGCAGCTCCTCCCCCGCGTACCT 

Sequence 1434 

MCGACTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTCTTTT 

TTGTTTG i ITTTTTT! ICACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATTGTCACT 

TAAATGAAAATTTAAAAATGAATAAAAAGACATACTTCTCAGCTGCAAATATTATGGAGA 

ATTGGGGCACCCACAGGMTGMGAGAGAMGCAGCTCCCTMCTTCAAAACCATTTTGG 

TACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGTCATAGGCTGTTTCCCTGTGTGA 

AAATTGTTATCCCG 

Sequence 1435 

CGGCCGAGGTAC i I I I I I I I NTTTG I I I I I I I I I I I I I CACCTTAAGGTNNGATTNAATT 

TGCTCCCAACTGATTGTCACTTAAATGAAAATTTAAAAATGAATAAAAAGACATACTTCT 

CAGCTGCAAATATTATGGANAATTGGGGCACCCACAGGAATGAAAGAGAGAAAGCAGCTC 

CCTAACTTCAAAACCATTTTGGTACCTGCCCGG 

Sequence 1436 

TCATACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCAG 

GCGAGAGGCGGACGCGAGTCGTCGTGGCAGGAAAAGTGA<;TAGCTCCCCTTCGTTGTCAG 

CCAGGGACGAGAACACAGCCACGCTCCCACCCGGCTGCCAACGATCCCTCGGCGGCGATG 

TCGGCCGCCGGTGCCCCNAGGCCTGCGGGCCACCTACCACCGGCTCCTCGATAAAGTGGA 

GCTGATGCTGCCCGAGAAATTGAGGCCGTTGTACCGTATATAAAAGACAATTGCTCACAA 

TGATAGCACTGAAGCACTGAGAGATATCAAAGTACCT 

Sequence 1437 

CTATACNGCGAATTGTTTTCCCCGNGGTGNCGGCCGCCOGGGCAGGTACACGGGAGATGA 

AGGTGCGCATCCTCTCCGNNCGTAAGAAGCCTCTGGACATTGACTACATGGGGGAGGAAC 

AGCTCCGGGAGAAAGCCCAGGAGCTGTCGGACTGGATCCACCAGCTGGAGTCTGAGAAGT 

TCGACCTTGATGGGCNAAANCCTGAAACAGCA/.GAAAATATGAGATCAACGTGCTGTACC 

TCGGC 

Sequence 1438 

GACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCNTGCA 

GAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTAn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnTCTTGCAGTTTACTACTGTCAAGCAAGTATAATAACTGGCCCCCGTA 

CCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGG 

TTTAATTGCGCGCTTGGGCCGTMTCATGGGTCATMGCCTGTTTfCCTGTGTGAAAATT 
GTTATCCC 

Sequence 1439 

GACTNC TATA GGGCATTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAGAATATCAAC 

AGAGTCTTTTAMTGAAANTGATGGAAAATCAAGATGGTTCTACTCACCGTGGTTGTTGG 

TAGAATGTTACATGTATCGAAGAATTCATGAAGCAATTATCCAGAGTCCACCAATCGATT 

ACTTTGATGTATTTAAAGAAATCAAAAGAGCCAAMTTTCTATGGGTCACAGGAATCCAT 

CATTGCi I I ATGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGGTCATTAGGCTGTTTCCC 

TGTGTGAAAATTGTTATC 

Sequence 1440 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAAACTGTT 
CAATCTTCTCAAAGTGATGGACTCGGAAGATGCCACGGGTGTCACGGCCATGGGAGCCCA 
CCTCCTGACGGAAGCAGGTAGACAGGCCAGCATACTTGATGGGCAGGTCCTC 
Sequence 1441 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 

GGGGATCCAGAATACATTTCCAACAAGAGCACTGGCCAAGTCAGCTTCTTCTGGAGAGTC 

TCTAGAAGACATGATGCTACACTCAGCTTTGGGTCTCTGCCTCTTACTCGTCAAGTTTCT 

TCCAACCTTGCCATTGCAATAAAAAAGGAAAAGAGGCCTCCTCAGACACTCTCAAGAGGA 

TGGGGAGATGACATCACTTGGGTACCT 

Sequence 1442 

TCACTACTTAGG GCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTGTT 

rTTTTTTTm II CACCTTAAGGGAGGATTTMTTTGCTCCCAACTGATTGCACTTAAAT 

GAAMTTTAAAMTGMTAAAMGACATACTTCTCAGCTGCAAATATTATGGAGAATTGG 

GGCACCCACAGGAATGAAGAGAGAAAGCAGCTCCCTAACTTCAAAACCATTTTGGTACCT 

GCCCG 

Sequence 1443 

ATC ACTACTTAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I iTTTTTTT GT 
TTGI I I I I I I I I I I I CACCTTMGGG AGG ATTTAATTTGCTCCC AACTG ATTG C ACTTAA 
ATGAAAATTTAAAAATGAATAAAAAGACATACTTCTCAGCTGCAAATATTATGGAGAATT 

GGGGCACCCACAGGAATGAAGAGAGAAAGCAGCTCCCTMCTTCAAAACCATTTTGGTAC 

CTGCCCG 

Sequence 1444 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAAACCTGGCCA 

GGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGNA 

Sequence 1445 

ATMTTGCGCGCTTGGCNGTMTNCATGGTCATAGCTTGTTTTCCTGTGTTGAAAATTGT 
TANTTACGCTCACTAATTTCCAACACAAANCAATTACTGNAGCNCGGGGNAGTCATTAAT 
AGTTGGTTAAAAAGNCTCTGNGNGGNTGNCNCTTAAAATGGAAGNTGCAGGCCTTAAACN 
TCAACCATTTTANATTTGGCNGTTTTGACT 

CMGNTNCCGCGGNAAAAMCCACTTGGTTCCCNTTGNCNCCAGTCTTGGTCMTTTTAT 

ANTGNAAAATTCNGGGNCCCAAAACTGACNGTCCGTGNGGGGAAGGAAGGGNTCGGGGTT 

NTTTGGCCGGTNTATTTNGGGGGGCCNGCCTTACTTTTTCCCGGNCTTNNCTCCTTCGGN 

NTTCCAACTTGG 

Sequence 1446 
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CCGGGCAGGTACACAGTAAACAGCATTCCTCACTTCGCTGAACAGCANNAGGCAACTCGT 

TTCTAAGTTCCAAGCTCACTCTTCATAGAGNCTATCCACCTTGGGCTNG 

Sequence 1447 

ATTGGTTATCCGCTCACAAATTCCACAACAACCATACGAGCCGGGGAAGNCATTAAAAGT 
TGGTAAAAGCCTGGGGG7TGCCCTAATGGAGGTGAGNCTTAAACNTCACMTTTAATTTG 
CGTTTGGCGGCNTCACTTGCCCCGGCri I I I I CCAAGTTCGGGGGAAAAACCNTGGGTTC 
CNNTTGGCCCAANCNTTGCCAATTTTAAATTGGAAAATTTCG 
Sequence 1448 

TACCAGAGGCCCTGGTGGAGGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTCTCT 
GTTGGTACCTGCCCGGGCGGCCGNTCGACTGTTGGACTTGAACCAGCGCGCGTTCAAGAA 
GTTTGCCGGGCAACCGCACCAGCGCCTTNGNCGAACTTCGACCGGCCAGCC 
Sequence 1449 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCTGAGGCACCAAGG 

TGGCCTTCGCTCTCTTGACATCTTGGTCGTCTGCGTCCGGCGCTTTCCTCTTGACGGACT 

GAGGCGCTGAACTGGCGGACGAACCCTGGCTGGCTTCCTTCGCGCCTTTCAGCTCGGCCA 

CTTTCTCTCGGTGCTGCTTTCCCAGGACGTGAGTCTGCCACAGGAGCTCGCTCTTAACCG 

GAGTGTTACACAGGGCACAACTCAGCTGCCCCAAACGGTTGTACCTGCCCGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGT 

CGACCTCGAGGGGGGGGCCCGGTACCCAAGCTTTTTGTTCC 

Sequence 1450 

TACCANTGGCCCTGGTGGATGCACCATTAGATGGAGNGNAGCCCTTGGNGAGCNCTNGGC 

CAAGGATTTTCTTGCTTGGGNTANCCCTNCGGGCCCGNTTCTTAGAAAACTTATGTTNGG 

TATTCCNNCCCGGGGNCTTGCNAAGTGNAAATTTCGNAATAATTCCANMGCCTTTTATA 

CGGTATTAACCCCCGGATCNGAACNCNTTCNTAAGGGGGGGGGGGGG 

Sequence 1451 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTT 

i n u m n i n 1 1 1 1 1 1 1 m i nggcacagtatctcactctatcagccaggctggagtg 

CGGNGGCATGATCACCGCTCACTACAGCCTTGAACTCCTTGGCTCAGGNGATCCTCCCAC 
CTCANCCTCCCAAGTAGCTACNACTACAAATGCGTGCCACCACACCGGGTAATTTTT^ 
G ATTTTi I GTAGGGACNAGGTTTCACCATATTGCCCAGGCTGTATCCCCTNTTTANACTC 
ATGC ACTCTNTGTGGAGC AGGAACTACACTACTCTAAG N G ATTCTG ATTCTCCTCTN AAA 
TCCCATCGGTCACCCCAANAAAAGCACTCACTACAATTCTTTCCCATAAGGAGAAATAAT 
TAT 

Sequence 1452 

CCGCGGTGGCGCGGCCGCCCGGGCAGGTAC I 1 1 I II I I I I II i I I I M I M I IGTCCATC 

TGCCCAGTTTTATTGGGAACAAGGGCATTATAACTGCTATCAAAGAGAAGGGAGCCCAAG 

GGCTCCTTCTGTANCAAGATCCTTCTTCANAGTTTNAGCCAGGGCTGGGAGGGTAAGANA 

CCC I I II I I CANGCAGGGTCACACTACCACCCTCAGCATGACTTCCCCAAAAAGTTATCC 

TCCTTTAGCTCAGCACTTGGCACTTNAGGAAGAAAGGACCAA 

Sequence 1453 

CGGGCAGGTCGAGCTCCGGCGCACGATGCAGAACCTGGAGATCTCGCTCTTGCCGCTCTG 

CACCAGTTCGCTGTTGGTGGCCACCTCGCGGTTCAGCTCCTCTGTCTTGGTGAAGAACCA 

ATCTTCGGCATCCTTGCGGCGGACGCGTGGGTCGACTCACCTnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCGAAGCC 

GGAAGCATNAAAGTNTAAAAGCCTGGGGGTGC 

Sequence 1454 

AGGTACTCCATCGCCTGCTGCTTCAGCTTCTGCAGCTCCCTAGTGGCCCCACAGCTGCGG 
CCGCCGCCTTCTTCCTCCATGATGAAAACATCTGAGACTCAGGGCTAAGCACCTTGCCCA 
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AGGCCACACAACAAGTAGGTGATGGAGAGTGTTGGCGTGTACCTGCCCG 
Sequence 1455 

CCGCGGTGGCGGCCGAGGTACCANNGACCTCTAACTCCCCCTGACACAGAGCAATTAGAC 

TCCCATAACAATGGTATCAATTATACCACTCCATTGGAGGGACTTCCTTTATGTGTCACC 

CAGGATACATTGCTCMCTGCAGTTGCCTTGCANTTTGATCCCAAGCATGGNTGAGTTAC 

CATAAAAAAATTATGTACCT 

Sequence 1456 

CCGGGCAGGTACNAGACAGAGTCACCATCACTTGCCGGTCAAGTCAGACGATATCTATGC 

ACAGTCTGCGAGCTGAAAACGGACCCTCACTTTTGGCCGGGGGACCAAGCTGAAGATCAA 

CNCGNAACCTGTNGTGCTGCNCCMTCTGGTTCTTTAATCATTCCCCGCCCANTCTTGAT 

GNAGNCACGNTATGAAAMTCCTGGGAAACTGGCCCTACTGGTTTGTTGTTGNCCATGTC 

TTGAAAATTAAACCTTTCTTATCCCCAANNAAGNAGNGCCTCANAANGNATNNCGCNTTC 

TGTGTCACCGNCTTTCCTTATGGAAAACCTTAATGTTNGTGTANTACNCCCCCCACTGTG 

GGTCCTTNGTCNAATGNGGTAAAA I I II I CC 

Sequence 1457 

CGGGCAGGTACTTCCTGTGGAACTCCCTGGAGATGAAGCCATGTTCGTCCTGGCGCTCTT 

CGTGCTTGCCGTGCACCTCAATCACGTTTCCCAGAACCTTGACTTTGAGTTCCTCTGGAG 

AGMGTGCTTCACCGTCCAGNTTCACAGGAGMCCCTGTNCTTTTCCATACGCATNTTCT 

GAATAGCCCAGTGTTCAATTCCAACTAGGGTGCCCCNC 

Sequence 1458 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGATGTCACCGGCAGCATAGATATCAGGAAGG 

GATGTGTGCATATGATCATCCACTTTCAGGCCACCATCTTCCCCTAGATCAAAACTGTTA 

CCATGGAGAAAAGGTTCTACATTTGGTGTAACTCCTGTANCACTGACAATGAAAGCGCAG 

CCATATATCTTTTCATTGGTCAATTCCACATAGACAGGCCACATCTCTGTATCGGCTGTA 

ACTGACTTATGGTCTCTTGGAAAAGTGAAGGACTTTTTCTTCAAMTTCTAAACTCATCC 

TGAAGGTAGATTTTCTTTACTTCACACATAGTTTCAAGGTGAATCTTATGAGAAAACTCT 

TTTGTTCCTTTAAGATTCAAGCCTTCATGCCAAATCTTGG 

Sequence 1459 

TCGAGGCCGAGGTCTTGCGGATCTGGCGGACCTGCTGGTGCTGCGCATAGGATGTCTTTC 
TMTCTGGTTGTTACGTTTCTTAGTGAAACCAACACAGAAAAGTCGGAGCAAAATA 
Sequence 1460 

CCGGGCAGGTACATAAAATGGCAAGAGTGCATCCTTTAAAGCTTGCACATGAGAGACTTG 

GAAACAATCTTATTAGMTTGTGAAAATTCTAAGTAGTCAAAAAAMTAAAAAAAAATAA 

AAAAGAAAGTCAAGTACGTGCCAAGCATNCTNGTGCGACCGCGAGAGCCCGGGGAGCGGG 

GGCTTGC 
Sequence 1461 

TAGGGCGNTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGAATGTAATTCATAATT 
TAAAAGGAAAACTAAAAACTATTTTGATTTGGGAAA 

TACACTGAGAACTAGCCTCAGGCTTAATATTCTCATTGCATTTGCAAGATCTGAGCAAAT 

AAGATTAAGTAAAACAAATCAATTGTATATATAATTGACCTTTTTGTGGAACATGTAGTT 

TATAGAAAGTATACTCTAMGGGAATTTGCCGAAGACCTTTTACTGATTGAACAGTTGTG 

CTACAATCAACTTTTCATAGTACCTCGGCCCGCTCTAGAAACTAGTGGATCCCCCCGGGC 

TGCAGGAATTCCGATATCAAGCTTTATCGATACCGGTCGACCTCGAGG 

Sequence 1462 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCGGATTTTG 

AAmCGTGATTTTGGTRGGTATAAACGGTAAAAGTGTGTGTGCCCTCTTTACCTGTTC 
TTTGTCTTGTGGTATGTGTGGTGTGAGCGTGGTATTTTGTCTGGAAAAAAAAAGAAAAAA 
AAAAGTA 
Sequence 1463 
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CGGCCGCCCGGGCAGGTACTCAGTGGATGACGAGTGCTTGGTTGAAATTGTTGAAAGGCC 

TGTG TTCTGAAATACCTGGGCCGTGTCCAGTGAGGCCNAGGAGAATTTTTATNGAGCATC 

TTTTGCCAATGAANAAGAANATTAAATATTGACCACTTACCTTGATCCCAAACGCTCCTG 

ACTGGAAGANTGGCACCATGNCGTGTTTTTATTNGAAGCTAAAGGCNANTAAAANNGTAA 

AGTAGTGCTCATTCANMNNTATNTTNGGNAAANTTTGTCNCNNANTCCAAANAAACTTT 

ACCANANGCAANATTTNACCTTTCCCAA 

Sequence 1464 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATAGCTGTAAGGAGA 

AGCTGAGAAATGATACCCAGGAGCAGCAGGCTTTACGTCTTCAGCCTAAAACCTAAAAAA 

AAAAAAAAAAAAAAAAAAAATTTAAAACAGCTATTAM 

AAAAAAAAAAAAA^AAAAAAANGT^ICCTN 

Sequence 1465 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGATCACTTGCACTTTTCCTCATTTCTTAA 

GAAAATCCAGTTTGCTTGCCATGACTTGTTTAGCGTTAGGTTTACAAAGCTTGTTGT^ 

ATGTTGAATATTTGTCCAMGGAGTTTGACAAAATAGAAGCTGCTTGGTCCCTGTTCTGC 

CTGCTAGGTCATAAGGATGAAGGCCATGTGGCTGGGGTACCT 

Sequence 1466 

CCGCGGTGGCGGCCGAGGTACTTTTCTTGAACTCAGCATTGCTGCCCTGAACCCTCCCCC 

AGCTGCCTTCCCTACTTTGGGAGCCTCTATTTCTAGAACAGGAAAACGCTAGGCTAAATA 

TATGTCCTAATGGTCTCTTCCATCGGTGTGAAGGAAAGAAAATGGGCCCAATGTTGTTGC 

MGGATTAGGGAAGGTCCTGCCTTGCCACCCAGTGGGTTTAGAAATCCATCTTTACGGCC 

TCCAGAGAAGCTGGTGATCAGGTTCCCAGACATCAGCAATCCTGTTTCCTGAGGAGCTTG 

TGGGGGCTCTCTCTGGGGATGAATCTGGGTCCTAGACAAAGAGCCACAACTGATGGGGGT 

CACTGGGATTGCAGGGGGCCATGGGCTGGC I I I I I GTCCTGG 

Sequence 1467 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACGGCTAATGTTGCTG 

CTAACACTCTGACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnCGTCATCACTATTTCTCCAGTGGTATCTGGGAGCCAGAGTAGCAGG 

AGGAAGAGAAGCTTGCGCTGGGGCTTCCATGGTTCCGTCTGGGTCCTAACTGAGCAGTTC 

CTCCC 

Sequence 1468 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAGGAGTGAGTCCC 

TATGCTGACCCCAATACTTGCAGAGGTGATTCTGGCGGCCCCTTGATAGTTCACAAGAGA 

AGTCGTTTCATTCAAGTTGGTGTAATCAGCTGGGGAGTAGTGGATGTCTGCAAAAACCAG 

AAGCGGCAAAAGCAGGTACCCTAGCCATGCTGAGCTGCAGAGGCCACAGGACAGCTCTCC 

CCGCGTACCTGCCCGGN 

Sequence 1469 

CCGCGGTGGCGGCCGAGGTACTGCGCAGCAAMCAATTCTATCTTCTTAATAGTTCTTCT 
ATGTCTTTCCCAGTGTTTACTGAAGAAGCTTCACCCTTGAAACGTCCT7TCTCCACCTCC 
ACCTCTACCTAACAAAGTCCCCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTA 
Sequence 1470 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAATGGTTTTGAAA 
G7TAGGGAGCTGCTTTCTCTCTTCATTCCTGTGGGTGCCCCMTTCTCCATAATATTTGC 
AGCTGAGAAGTATGTCTTTTTATTCATTTTTAAATTTTCATTTM 
GAGCAAATTAAAATCCTCCCTTAAGGTGAAAAAAAAAAAAACAAACAAAGTCCT 
Sequence 1471 

CCGGGCAGGTACCACCATGGACACCTATAAGGAACTGATTGAGAGGATTCCTGAGTTGAA 
CAAAGTGGCACGAGCAGCTGCAGAGGTCGCTGGTCAGTTCGTGATTAAACTAACCCATGA 
TGTTGAATTGAACCTGGACTATGAGAGGTACTC rnTTTTTTTTTNTTTTTn GAAAACT 
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TTTTATTGCTCTTTTGGATAGAMCGGGMTTTATTTGCCAGGAAGGATGATCCCATCAT 

ACTTCTGCTGGAACCAGCGCATGGCCTCCTCTTTGCTGATTCTGTGTTTGGCCCCAATGC 

AGCCTGTCCTGCGCTTCTTGTCTGCGATGCTGAAACCTGGCCTACCCAGCACCACATNNN 

AAGTCCAGGGCCGTTAGATACCAATGCTT 

Sequence 1472 

CGGGCAGGTACTTACAMNTGCCCCATTCTNTTCATCTGAAMTGTCCCCGAGCCTTTTG 

GAAACAGGNGCAGGGCATNCANANTNTTCCCAN7TTTTATNGMCTTGANGNACGGTTNC 

AACNANMNNNNCTTMTTTCTACANACATCTTTNTTGGGTNATTTGTTACCGAAGNTCC 

TCCTTATTTGAGACCCCACCTAGACACCGACCTTGNANATTGGAACAAGTAAGCCCCTNT 

NCNTTCNTACAATTGNAAATGGTANCGTGGGAAGAAAGCCTTGGTTNGMTTTTANAACC 

CCCTGANTTNATNGTTGNCCCATGGNACCCAGGGGAACCAATTGGG / I /MM IAANAAA 

AAA 

Sequence 1473 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCAACCTCTTGGTCAAACATGGCATG 
CCACCATCTTTTTCAMGCCCAATATTGAAGGTGCCCCAGGGGCTCCTATTGGAAATACC 
TTCCAGCATGTGCAGTCTTTGCCAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGC 
Sequence 1474 

AGGTGTCGACCCACGCGTCCGCAAATAATGTTAATCCACACACCAAAAGGACGAACCTGA 

GCCCTAATAATTGTATCCCTAATTATATNTAATCGGGCTCAACCAACCTTCTAGGGCTTC 

TTCCCCATACATTTACCCCTACCACTCANCTATCTATAAACCTAAGCATAGCCATCCCCC 

TATGAGCAGGAGCCGTMTTCTAGGCTTCCGACACAMCTAAAAAATTCTTTAGCCCACT 

TCTTACCGCAAGGAACCCCCATCTCACTAATTCCCATACTAATCATCATCGAAACTATTC 

AAGCCTATTTATTCAACCGATAGCACTAGCCA 

Sequence 1475 

AGGTACCGCTTCCCTGAAGAGCTCACCCAGACCTTCATGAGTTGTAACCTCATCAGTGGA 

ATGTTCCAGCGMTGGACAAACTGAGAAAAGAATGCCTTTGCTAGTGTTATCCTGTTTAG 

AAACMCMCAGCAGCTCCATTTCTGGGGTTTGGGTCTTCCGAGGCCAAGATCTTGCCTT 

TCTGCTGAGTCCAGATTGGCAGGTGGACTATTGAGTCATATACATGTCGGAAACTGGGAT 

CCTGGTAGCCGAGGAGACCTAGACCCTCGTTCGAGACCTGCCCGGGCCGGCCCGCCCCCG 

CGGGTGGGAGCTCCAGCTTTTTGTTC 

Sequence 1476 

CTATAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTTTTTTI 
TTTTTTm IGGGCANCTAAAATNTAGGATGGNGNNAAGCTTCTTCATTTTGNCAATTAT 
ATATAAAAAATNACAMCAACATGAATCNGCATTTCTTGGATGAGATANNTAATAACAAA 
CTATTTCTCMTATTTGTATACTAAAMCTAGTAMGGTGNTATGTGmCAGTATCTTA 
TCTNTTATTTGAACATGGGTTTCTGAAAGGAGCCTATNTAATAATATAAATGGTATGTAG 
TAAANGAGGCNCTGCCTTGGCTGGGACTGCTNTAAAAAAATTACCATAGACCATTGACTA 
ANCCACAAACNTATACTTCTCACAN NTCTGGAA 
Sequence 1477 

CGGGCAGGTGTACAAGCTAGCTTGAGTCGACTCAAGCTTAGGGATCTAGACCACGAGCGG 

CCGCNnnnnnnnnnnnrnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnGCGNAATCATGGTCATAnnnnnnnnnnnnnnnnnnnnnnnnNATCCGCTCACAAATTC 

CANACAACATACGAANCCGGAAGCCATAAAGTGTNAAAGCCTGGGGTG 

Sequence 1473 

AGGTACTTCAGGGAAATACCACCTCCACAAACGTTGAGCCAATTCAGGGGCTGCCAGTAG 

AATCCTAAAGAGTGATGTCTCCGGTTCATGTGAACACCATCTGCANTCCACACCCCGCTT 

TNAGGGACAAATTTNANNGCCA7TAAGNGNNAAGAACCATTGCTTCAAGCCAGAAGCTNG 

TCCCCCTTTTNATCCCANTCCCCNAGTTTCACTCACCCCCTTTAAGNCCTCTGGAATATG 

GGNGACCNAGTGCCGGAATGGGCCTTCNAATTAAGNCTCNACGGGTAAANGGGGCNTGGC 
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TCCTNNNCCCNTTCAAGNGGCNTTATNCTCCCNAAGATANANCNCCNAAANTTCTATANG 
GGGGGGGCNMTTTMCCGGNCCGNGGCCTTTCNANNTCCCCCAACCAACCNTTTTTTGG 
NCAAN CCTTTTC AATTAAAT I I I I CCCCCCGGTCTMTTGGGCCTTTTCCCCCGNCCTTT 
TAAAANAATTGGTTAANCCCCTT 
Sequence 1479 

AGGTAGTTAATNTTCGGATGGTTCTGAAGCTTGGCTTGAGACCCCCCTCTCCAACACACA 
CACACACACACACACACACACACACACACACACACACACGCACACA7TATCCAAGCGGAC 
GCGTGGGTCGACTCAAGCACCTGCCCGG 
Sequence 1480 

CCGCGGTGGCGGCCGAGGTACTGTGGAGCTGTTGGGACTGGGCAGGCGGCAAAGATCTGC 

AACAACATGCTGTTAGCTATTAGTATGATTGGAACTGCTGAAGCTATGAATCTTGGAATC 

AGGTTAGGGCTTGACCCAAAACTACTGGCTAAAATCCTAAATATGAGCTCAGGACGGTGT 

TGGTCAAGTGACACTTATAATCCTGTACCTGCCCGGNATNTTAATGAACCGNTGCATGCA 

AATCTTNTTTACTTCATNTCCTGTCAGGGCATACTTAANTNTGTTCCTCAGGAAAATGAT 

GAGGGGGAGACACTNTCTNAACTTGNGGGGACCNGGTGGATGGACG 

Sequence 1481 

CCGCGGTGGCGGCCGGGATCGCGGAAAGTGATGGCTGTCGTCCCGGCGTCTCTCTCAGGA 

CAGGACGTGGGATCATTTGCATATCTTACAATTAAAGACAGAATACCACAGATCTTAACT 

AAGGTTATTGATACATTGCATCGACATAAMGTGMTTTTTTGAGAAACACGGAGAGGAA 

GGCGTGGAAGCTGAAAAGAAAGCTATCTCTCTCCTTTCTAAATTACGGAATGAATTGCAA 

ACAGATAAACCATTTATCCCCTTGGTTGAGAAATTTGTTGATACTGATATATGGAATCAG 

TACCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCGGGCCTGCAGGGAATTCGATATCAA 

GCCTTATCGGATACCCGTCGACCTTCGAGGGGGGGGCCCCGGGTACCCAGCT 

Sequence 1482 

TACCAGTGGCCCTGGTGGATGCACCATAAATGAGGAGCCTGGGAGCCTGGCCAGGTTTCT 

GCTGGTACTGTTAACAACCGGTATTAGAAATTGAGATGTTTTTATTTTGACTATTTGA 

GTAACATTGTAGATMTGTTCCAAACCATCATCAAAAATGCAGAGCAAACTTTTCTGTAT 

ATAAACTGTACCTGCCCG 

Sequence 1483 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCATNCCTTGTTTTCTGC 

TCATTTCTTNCAGCCTGAGGAACAGACGTCAGAAAAAAAAACAACCACCCAGACAACTTC 

CCCTTTTAGCTGAGCTGTCTTCTGACCAATCAACAGGGACCAGCCAAAAAAAAAAAAAAA 

AAAAAAAAGTNCCTGCCCG 

Sequence 1484 

AGGTACGCAGGCGGCAGCCTCACTCTGAATGGACCAAGGCGCCAGCTCCTTGTCCAAAGC 

CAAAACCTTTGGGCCCAAAGTTTTTGGCATAGCATCCTTTGCAATAGATCTCACCATCCT 

TGTCTGCCAGGGTGGTCGACTCGAGGCCTTTGCCACATTTGGCACATCGGAAGCAGGACT 

TATGCCAGGACTTCCCGGCACNCGATCACCTTCTCCGCTGCATAGACTGCCTGGCTTGCA 

TCGCGGGACAGCGCTCAGAGCCGCCCGATCTTCTGAAGCAAACTTGGATGCCATTGGGGT 

TGGGTGGGTAGCGGGACGCGTTGGGTCCG 

Sequence 1485 

CCGCGGTGGCGGCCGAGGTACGCCTTCGCAGTTTCCTGCCTGATCCTTCCTCCAGCAAGT. 
AGGTCACATCAATGGCA I I I I I ICCAGGGAI III! I CAAGG AGATTGAGCAAG ATCCGGT 
TGAGTCGTTCCCGTTGGTTATGCCGTCGTTCCTCAGGGGTACCTGCCCG 
Sequence 1486 

CGGGCAGGTCGACTCCTATAGGGTCTAGATCACGAGCGGCCGGCCGCAC I I I ! I I I M II 
n I M I M I n I I II 1 1 I I I I I I ! I I I I I I I I I I I TAAANACACACACACACACACACAC 
ACACACNCNCACACACACACACACACACACAC 
Sequence 1487 
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CCGCGGTGGCGGCCGAGGTACTGGGAAAAGGCCACAGGGAGAAGGAAGGCTCCAGCTGAA 

CTTTATAGCMTTTGAACTCAACCAGAAGCCTCCTGACCAGAACTCGATGGGAGGGGTGA 

AGCTGGCTTGCAGATTCCACAGGTGGGGAMGMGGAAGGCTTTATTTGCTTTGCAGCTG 

GGAGGTGGGTACCTGCCCG 

Sequence 1488 

AGGTACAGTGGTTAAATAATTGACTCAATGACAAAAGTGGGATTGGAGTCCCAACAGTGT 
GGACATGCTTGCTTTGGATTGAATGTTTGTGTCTTGCCCCATATCCCAATGTTACAGCCC 
TAACCACCAATGTGATGGTATTGAGATTAGGGCCTTGAGTACCTGCCCG 
Sequence 1489 

AGGTACGGGGGCCAGTTATTATACTGCTGACAGTAATAMCTGCAAAATCTTTAAAATTG 

AAGGCTTGTTGAATGGTGAAGMGTGAAACTTTGTCCCAGAACCCACTGCCCACTTGAAA 

CCCTGGCTTGGGATACCAGTGGGCCCTGGTGGATGCACCATAGATGAGGAGCCTGGGAAG 

CCTGGCCAGGTTTTCTGCTGGTACCTTGCCCGGGCGGCCGCTnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnGCTTATCGATACCCGTCGACCTCGNAGGGGGGG 

GCCCCGGTTACCCAGCTTTTTTGTTCCCTTTTAGTGGAGGGTTAATTGGCGCCGCTTGGG 

CGTMTCATGGTCATMNCTGTTTCCTTGTGGTGAAAATTGTTTATCCGCCTCACAAAAT 

TTTCCACACCAAACATACNA 

Sequence 1490 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTGTTCCACCAAG 
TTATCCAAAACACCAGTGAGGAMTCTTTGCCCAGGGATTCCAACACCTTAAAGTGGGCT 
MINI I GTGGGTTGGCCTTCTGCCATAGGGMCAGCCTCCGTCCTTTTTTACAGCGTTG 
GAAAACCCCGCGTACCTGCCCGGGCGGCCGCTCGAATCA 
Sequence 1491 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAATAAGCATATTGC 

TTTGGCAATGCATCTCCAGAGCAGGTGACCCTGGCCGTCTGTCCTGGGGACACTGACACC 

GAGGGTGGCTGTGCCAGTTCATAGGAGGCCTCAGAGCCTGTGCAGAGAGTGAGGAGGGGG 

AGAAGTAGAGGGATCCAGGCCATGGTGAGACACCCAGAGTTCTGCCTCCTGGACCCACCC 

CCGCGTACGTGCCAAGCATCCTCGTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGC 

Sequence 1492 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAGCTGGAGACGGCCTGG 

GTGCTGGCGAAGCGGAGGCCGGANTAAGAAAGACTGTTAGAAATGCCCCTCGGGTAACAC 

AGMGGCTTGAGAGATCCTGACATAAATCCTTGTTTGTCGGAATCTGATGCTTCCACCAG 

ATGTCTGGATGAAAATAACTATGACAGGGAAAGGTGTTCCACTTACTTCTTGAGGTACCT 

GCCCG 

Sequence 1493 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCTTCACTTGGG 

CCACCCACCGTGATTTCTGTCCTGGATTCCCTGAGCCAAATACCCATGATGGGATGGTTC 

AGTGATGTGGATGGATGGCTATTTGTGTAAGGGCCGAAMCTAGTTTTCTAAGATGGGAG 

AGCTGGGCAGTTACTTATACTCTGMGCCCTTCTCTTTCTCTGCTGGGGCTTTCCCAGCA 

GTTCCTAATAGTTCCTTGTATTTCAGATAAGTCATCATGA7TGTTAGGAAAAAAAGAAAA 

AAGAAAGTTAG I n I I AGCCTCCTGACTTCTTMGTCAAGTGAGACTTTTAAAATTTAGT 

AMTAATAGCTCATTATTTTTATGGTAGCTATTTGATTATTiTAATTCTTTAGCAG 

GAAGAAATTCAGAGAAGTTCAATAGAGAGTAAT 

Sequence 1494 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGTGTAAG 

MGGCCATCTACAATCAAGATTGAGAGTGGCTCTAACAAGTGCCATTTTTCCTTGTTAGC 

TTTCATTTCTCAGCCCTTTACMGATTAAMTAGTCTGCAGTTTAATCTCTCCAAAGCTT 

TACGGACAGTGATTCTGTCCTAAACAAGACAGTGACTCCAGGATTTCTGAAGACTATTGT 

GGAAGAAGCATCCATTAAGGCCAAGCTATAACATCAGAAATGGTGAATGAA 
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Sequence 1495 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAGGAAGTGAAGTCC 

CTATGCTGACCCCAATACTTGCAGAAGGTGATTCTGGCGGCCCCTTGATAGTTCACAAGA 

GAAGTCGTTTCATTCAAGTTGGTGTAATCAGCTGGGGAGTAGTGGATGTCTGCAAAAACC 

AGAAGCGGCAAAAGCAGGTACCTGCCCG 

Sequence 1496 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACTGGCCTCAT 

CCTATATTCACTTTCGGCCCTGGGACCAAAGTGGATATCAAACGAACTGTGGCTGCACCA 

TCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTG 

TGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 

Sequence 1497 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGGAGCTTCTCCTTGCCAGT 

TTCTCCCAGCANGGACCCTCTTCTTGTTTTGAMGATGGTCGGCTGCTTTTGGTAGGCAC 

GCTCAGTCTGAATGTCCGCCATCTTCCCGGCCGCTCTAGAACTAGTGGGATCCCCCCGGG 

CTGCAGGAAT 

Sequence 1498 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGATTGCATGCAC 

TTAMTGCAGATTATTTTGGAGTTTGAAMGGGACTATTAATGAAATCT^ 

CCTTTCTCI I I I ICCCTTCCCCGCCACTGATTCAGTGAGCTGGAGATTGGATCACAGGTA 

TAATTCAAGC I 1 1 I f CATGTAGTCATGTAGATCACTAGACTCCTTGGTGTACCTGCCCG 

Sequence 1499 

ATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCATT 
ACTGCAGGAAAAGGTCCCGGAGAGCTAAGCAGTCAAGATGTGTGACTTCACCGAAGACCA 
GACCGCAGAGTTCAAGGAGGCCTTCCAGCTGTTTGACCGAACAGGTGATGGCAAGATCCT 
GTACGTGCCAAAGCATCCTCGTGCGACCGCGAGAGCCCGGGGAGCGGGGGCTTGC 
Sequence 1500 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATATCAACCAGAGCA 

ACATACATGAATAAGCCAGC AGTAAGTG CAAATATC C AC ATAG AAACATTTTC AGCATAA 

TGACCAATGAAAATTCCTGTTGCCATTCCAAGATACGCCAGCATGGCTGACAATGCATTA 

TAAAGGACAGCCTGCTTAACGGTCATGCCAGCCTTTAGTAGAACAGCAAAGTCACCTAAT 

TCATGAGGCAACTCATGACAGAACACAGCAACAGAAGTACCTGCCCGGGCGGCCGCCCGG 

GCAGGTACAAGGGATTGATTTAAACCAMTTCGAGGACTTGGGTTTGATGCCACCGTGTT 

CTCTGGGTTGTTTTGGATAAAGCAGTTTCACCCATTACCAGTCAACCAGGAAGGGGGATT 

CCCATCGAAACGTCATCATGTGGCTGGACCATCGA 

Sequence 1501 

CCGCCCGGGCAGGTACCAACAGAAACCTGGCCAGGGCTCCCAAGGCTTCTTCAATCTTAT 

GATGCATCCCGACANGGGCCCACTTGGCATCCCCAGCCCAGGTTCAGTGGGCAGTGGGGT 

CTGGGACAGACTTCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnAGCAGCGTAGCAAACTCGTACACTTTTGGCCAGGGGACCAAGCTGGGAGATCAA 

AACGAACCTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGGCAGTTGAA 

ATCTGGAATTGCCCTCTGTTGTGTGCCTGCTTGAATTMCTTTCTTATCCCAGAAGAGGC 

CCAAAAGGTACCCTTCGGCCCGCTTCTAGGAACTAGGTGGGATTCCCCCCGGGCTTGCAG 

GGAATTTCGATAATCAAGGCTTATTCGNATACCCGTCGACCCTTCGAGGGGGGGGGCC 

Sequence 1502 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCATTCGTATTGCGCCGCTA 
GAGGTGAAATTCTTGGACCGGCGCAAGACGGACCGGAGCGAAGAAAAAAAAAAAAAAAAA 
GTACCT 
Sequence 1503 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTCACTTCCGGCTTCCTTCAGTCCGCTGGTCCC 
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GAGCACGAGCTGTGAGGGGATTCACTTGTGTGCGGAACTCCTCGGAACCATGGCGTCCCT 

TTCCCTTGCACCTGTTAACATCTTTAAGGCAGGAGCTGATGAAGAGAGAGCAGAGACAGC 

TCGTCTGACTTTCTTTTTATTGGGTGCCATCGCCATTGGAGACTTGGTAAAGAAGCACCT 

TGGGACCCAAAGGCATGGACAAAATTCTTCTAAGCAGTGGACGAGATGCCTCTCTTATGG 

TATCGAGCGGCCGCCCGGGCAGGTACCCTTAAACTGGCAGGACATTTTTGAAATCACAAA 

TTTGCACATAAAGAATGTCACGAACAGCCATGTATCCATATACAGCAATCAAATAAGGAA 

CTTATGACCTAAAGCAAAGGTAAACTTTCTTGAAACT 

Sequence 1504 

NAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGTGGTGATGCAGTCAAA 
GCCCAGGACAGTCGTATAAAGGAAAGCAAGACCCATGCCAGCCAGAAACACAGGCTGGTT 
GTAGTAGGAGACCCATCCATCTCGGAAGGTACCTGCCCG 
Sequence 1505 

AGTCGGGGAAACCTTGTTCTTGCCCAGCTGCATTAATGAAATCGGCCCAACCGCGCCGGG 

GAAGAAGGCGGTTTGCGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCAGAATCAGGGGATAACCGCAGGAAAAGAACATGTGAGCAAAAAGGCCANCNAAAA 

AGGCCAGGGAACCGTAAAAAGGCCGCGtTTGCTTGG 

Sequence 1506 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAAACCTGG 
CCAGGCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTAnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnTCTGCAGTTTTACTACTGTCAGCAGTATAATAACTGGCCTCCGTACCTGCCCG 
Sequence 1507 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCAGGACAA 

GGATGGCGCTCCAGTGGCCACCAATGCCTTCCATAGTCCCCGCTGGGGTGGCATTATGGT 

ATATAATGTTGACTCCAAAACCTATAATGCCTCAGTGCTGCCAGTGAGAGTCGAGGTGGA 

CATGGTGCGAGCGATGGAGGTGTTCCTGGCACANGTTGCGGTTGCTCTTTGGGATTGCTC 

AAGCCCCAGCTGCCTCCAAAATGCCTGCTTTCAGGGGCCTACGAAGTTGAAGGGGCTAAT 

GACCTGGGAGCTAGACCCGGCTGCTCTGGGCTCGGTCAGTGGGAGAACCTGGCCACAGCC 

ACCACCACCCTTACCTCCCTGGCGCAGGCTTCCGGGCAAGATCAAGCAACATTTGTCATT 

AAGGACGACGTGGCATCTGAGGTGTACCTCGGCCGCTCTAGAACTAGT 

Sequence 1508 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCCGAGATGGATGGG 

TCTCCTACTACMCCAGCCTGTGTTTCTGGCTGGCATGGGTCTTGCTTTCCTTTATATGA 

CTGTCCTGGGCCTTGACTGCATCACCACAGGGTACATCTTTCAATTCTGGAATATCTGTG 

TCTMGCACAATATCTTCACACTGTGCTGTATTGCTGCTGMCTAAATGCACTTTTCCCC 

ACATATGGGGCACTGGCTTCAAACMTTCAGTTCAGTATCATTACTTTTAATCTCATCTT 

TCCTTTCTTGGTAGTTGTTAATACAGTTATGGAAAAGAGG CAC ATTG C ATAG AAGCCATT 

GGGGAGTTCAGTGGAAGTTCTGTAAGAATGTGCATGTCCTGCCCG 

Sequence 1509 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCAGAAACCTGGCCAG 

GCTCCCAGGCTCCTCATCTATGGTGCATCCACCAGGGCCACTGGTA 

Sequence 1510 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATGGGGGTCTCACTACAT 

TGCCCMGCTGTGCCAGAATTCCTGGATGCAAGCAATCCTTCTGCTTTGGCCTCCCAAAG 

TGCTGGGATTACAAGCATGAGCCACCATGCCCAGACAAAGCCCTCAATTTCTTACATATC 

GATTTAAGGGCCTGAATAAACCMCACATTAAAAAGGAAAAAGMTAATTCACATACCAA 

GCCGTGAGTAAATTCAAGACTGAGATGATATAATCGGTACCTGCCCG 

Sequence 1511 
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GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC1 Mil TGTTTG I 1 I I I M I I I I I 
TCACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATCGTCACTTAAATGAAMTTTAAA 
AATG AATAAAAAG ACATACTTCTCAGCTG C AG ATATTATGGAG AATTGGGG CACCC AC AG 
GMTGAAGAGAGAAAGCAGCTCCCTACTTCAAAACCATTTTGGGTACCTGCCCG 
Sequence 1512 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCTCTTTCACTATGA 
AAGTGTGCATGAGTCCCAAGCATCGGATGTCAAACAAGAAGCAAATCACCTGCAAGGATC 
G G ATG G GC AG C ANTCTGTC AAGG AAAG C AAAAAAAAAAAAAAAAAAAAAAAAAGTTC CT 
Sequence 1513 

AATTGGAGCTCCCCGCGGTGGCGGTCGAGGTACATCATTTCCAGAGCAGGCACTGGCAGC 

GAGATAGGGTTGGAGGAGAAGTAGCGCCGGGACTTCCGGATGGCAAACTTCTCTGTGGGT 

AGAGATTTCCCAGCAATCTTGAGCTTCAGGCCTGGCACAGCTCGAAATAATTCCACTTCG 

TCGTCCCCGAACGGCTTGTGGTCCTCCTTCCCAAACATGCTGAGGTAGGCGGCCTTCATG 

TAAATGTAGGTGGCC \ I I I I AAGTCAGATCATGTCAGTTCCTTCTGGAAATCTGGTTATA 

TTCCATCACACTCAGGAGACATCTCCTACAATTTCCTTGACACCTGCAGCACTCCAGCCA 

CACGACGGCCCCCGCGTACCTGCCCG 

Sequence 1514 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCGGGGGATTCTTNAGAG 

GTCAGCCAAGTCGGTAACATTCTGTGAAGCAGATTAAATACAGTAAAATCAAACAAACGG 

AGGTCTTTGGAAGTGATGCTGGGCCCANGCAGTGAAAATCAAGAAAAGAGGAGAAAGACC 

AAGACCTGNATGTCCTGGACGACCTGACCCAAAAANGGGGCACAAAAGCAGCAGAGCTTC 

ACCAGGTGACTGGGGTGGGCTTCTTTTGTCCTCATTTAAATGTACCTTGCCCGGGCGGCC 

CGCTCTAAGAACTAAGTG 

Sequence 1515 

CCGCGGTGGCGGCCGNCGNACTCATTTACCCGGGGACAGGGAGAGGCTCTTCTGCNTGTA 

GTGGNTGTGCACAGCCTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCT 

GCTNTTGTNCACGGTGAGCTTGCTATAGAGGAAGAAGGANCCGTNGGAGTCCAGCACGGG 

AGGCCNTGGTTCTTTCCGCGTACCTCGGCCGCTCTAGGAACTANTGGATCCCCCGGGCTG 

CACGG AATTNGAATATCAAACCTTATCGATACCTTNCGACCTCCN N N 

Sequence 1516 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCAGGTACAGCCCACANANG 
GCTGATNGGCTTTTGNTGNTTNTTCTNGGAANATANTGATGAGTTCCTCNAAACNCNTTA 
ATTTTACCTCTTCCGGGACATCATCCTT 
Sequence 1517 

CCGCGGTGGCGGCCGAGGT^.CACTNTTNGTCAGGGGACCAAGCTGGAGATCAAACNAACT 

GNGGCTGCACCATCTGNCTTNATCTTCCCGCCATCTNANGAGCAGTTGAAANCTGNAACT 

GCCTCTGTTGTGTGCCTGCTGAATAACTATCTATCCCAAAAAGGCCNAAGTACCTGCCCG 

GGCGGGCGGCTCTATANCTANTGGATC 

Sequence 1518 

CCGCGGTGGCGGCCGAGGTACGCGGGGGGAGGAACTGCTCAGTTAGGACCCAGACGGAAC 
CATGGAAGCCCCAGCGCAGCTTCTCTTCCTCCTGCTACTCTGGCTCCCAGATACCACTGG 
AGAAATAGTGATGACG 
Sequence 1519 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTCTGGAG 

AAAATAAGTGTGTTAGATAAGACTTGTTCATGCATGAGTGATAGTGCTGTTAGTGTGAAT 

TCAATGTTAATGAATCAATAAAATATATGAAATAAAGTGTCTTTAAACAGTAACACAAGG 

TTGTATGTTCTCAGTTGACAAAAACATTGTGACCAGAGGCTCACAGGAGTCTTAAGTCTG 

CATTTCCCCAGGAGCAATGGCTTAGCATTTACTAATCGAGTATTCCTGGCAACTTTATAG 

AACATAACTACTGTAAATAATGAGAATCAACTGGATATTATAAATCCCCAAATATATATT 
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TTGCTTTAC ATMTTG AGTGTTTTTCAAAGG C ATATAG A 

ATGTACCTCGGC 

Sequence 1520 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAG 

AGCCACCCGGGCTGGGGACCGGGGAGCGGAGGCTGAAATCCCCAGACGCCGGTTTTCTGG 

GCTGGGCTTTCTGCCTTACTCACTCCTTCTCCCTCTTTCAGGATTTTACCGACCGCTCGC 

GAAACAGCTTTCACACAATGGAGCTTCATGTCCTCGTGCAGGAAGTACCT 

Sequence 1521 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAGAAATTC 

CTAGACTACAAGATGTCATCCTTCTAGAAGAAGACCTAAAGCATACACTGGGAGGAAAGT 

TTTACAGGGGMTCTAGACTACTTMTTTTTTCTTCACAGTAACAAATGACAAAGCACCA 

ATATCTAAAGGTTCCCAACACACAGGATAAACAATAAGGAAGTTCTCCCAGTGTTTCCCA 

TTTTAGGCAAAAAAAAAAAAMGGATTTCAGCTAGCACAGTGCATCCAAATGACCAAAAA 

TACTACAATGCCGATGTTTTTTAAAAATAMTATAA 

TGG AATAATAAAATTAATG GTGAATGGGTACCT N 

Sequence 1522 

GCCGGGGAGAGGCGGTTTTGCGGTATTGGGCCGCTCTTTCCGCTTCCTTCGCTCACTTGA 

CTTNGCTGCGCTCGGTCGTTCGGGCTTGCNGGCGAAGCGGGT 

Sequence 1523 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGGGGGCCAGTTATTATA 
CTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGGATGC 
ACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCG 
Sequence 1524 

CCGCGGTGGGCGGCCCGAGGTACGCGGGGATCCAGAATACATTTCCAACAAGAAGCACTG 

GCCAAGTCAGCTTCTTCTGAGAGAGTCTCTAGAAGACATGATGCTACACTCAGCTTTGGG 

TCCCTGCCTCTTACTCGTCACGGTTTCTTCAACCTTGCCATTGCAATAAAAAAGGAAAAG 

AGGCCTCCTCAGACACTCTCAAGAGGATGGGGAGATGACATCACTTGGGTACCTGCCCGG 

GCGGCCGCTCGACCGCCTGGACCAACTTGTCGGAAACCACCTTTCTGATGCCGCCCACGG 

CAGCG 

Sequence 1525 

AGGTACGGGGGCCAGTTATTATACTGCTGACAGTAATAAACTGCAAAATCTTCAGACTGC 

AAGGCTGCTGATGGTGAGAGTGAACTCTGTCCAAACCCACTGCCACTGAACCTGGCTGGA 

TACCAGTGCCCTGGTGGATCACCATAATAAGGAGCCTGGAANCCTGGCCAGGTTTCTGCT 

GGTNCCTGCCCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnCGACCTCGAGGGGGGCCCNGTACCCANCTTTTGTTCCTTTA 

AGTGANGGGTTAATTGCGCNCTGGCGTAA 

Sequence 1526 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCACCATGTCCGCCTCGGCTGTCTTCA 

TTCTGGACGTTAAGGGGCAAGCCATTGATCAGCCCGCAACTACAAGGGCGATGTGGCCAT 

GAGCAAGATTGAGCACTTTCATGCCTTTGCTGGTACCTnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnAGCTTATCGATACCGnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCCTGTGTGAAATTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATTGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACCTGCGTNGCCAACTGCATTNAATGAAT 

CGGCCAACGCGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNTTCA 

nnnnnnnnnnnnnnnnnnnnnnnnnCGGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGATAACGCANGAAAAGAACATGTGAACC 
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AAAAAGGCCAGCAAAAAGGGCCAGGAACCCGTA 
Sequence 1527 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCCGAGGTCAATATAGGGCAGACAGTTTGCC 
TTCAGAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGCTACCAT 
TTTCCTGCAGGATGTTNATAAATAGTTCCTGGCAAAGGAGGTTGCGTATAGGGTCCTACA 
AG 

Sequence 1528 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAMTGGTTTTGAAGTTAGGGAGCTGCTTT 
CTCTCTTCATTCCTGTGGGTGCCCCAATTCTCCATAATATTTGCAGCTGAGAAGTATGTC 
TTTTTATTCA I I I I I AAATTTTCATTTAAGTGACAATCAGTTGGGACAAATTAAATCCTC 
CCTTAAGGTGAAAAAAAAAAACAAACAAAAAAAGTACCT 
Sequence 1529 

TATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCTTN I I! I I I II I I I II I I 

TTTTGATTTGCAGTTAGTTTATTGAMTGGAAATGAATTTGAGGG7TTCAACAG 

AGCAGTTTCCACATAAAAAGACAACACAAGAAAGAAAAGTGGTTTCTAGCAATCAACAGA 

TGAGTTCTGNCATAGTCTGTAGCCCTCTGAGCCAAACGCCTTGGGGTAAAGTTAATTACT 

CTTCATATTACACCAAATGCTGTCGTACCTGCCCGGGCGGCCGTCTAGAACTANGTGGAT 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGACCTCGAGGGGGGGCCC 

GGTCCCAGCTTTTGTTCCCTTTAG 

Sequence 1530 

CTACTATAGGGCGAATTGGAGCTCACCGCGGTGGGCGGCCCGACATCGCCGCCGAGGGAT 

CGTTGGCAAGCCCGGGTGGGAGCGTGGCTGTGTTCTCGTCCCTGGCTGACAACGAAGGGG 

AGCTAGTCACTTTTCCTGCCACGACGACTCGCGTCCGCCTCTCGCCTGGAGTACCTCGGC 

GCTCTAGAACTAGT 

Sequence 1531 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATAACAAACATGGCGAAAAAGGAGATGTTTGA 

AACCATCTGCA I I I II I I CTGTGATCGGTCTTTAAGCTCACTGTAAATTGGCAGGACTGA 

CGGGTGGCAAACAAATGCAAATGCMTGGTGGGTAAAGCATACACCGGTCTTTGAATTGA 

AGGTAACATA i i i i GGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnTAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnTCACAATTCCACACAAACATACGAGCCGGGAGCATA 

AAAGTGTAAAAGCCTGGGGGT 

Sequence 1532 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI II 1 1 1 I ATCI II I 1 1 I 

mm IGAGATGGAGTCTTGTTTTGTCGCCCAGGCTCTGGAGTGCAGTGGCGCTCACTGC 

AACCTCTGCCTCCCGGGTTCAGGTTATTCCTCTGCCTCGGCC/iCCCTAGTAGCTGGGATT 

ATAGGCTTGAGCCACTTGCGCCCGGCCCAATTTTTA1 I 1 1 I I AATGGTTGTTATGGACTG 

AATGTATCCCTCTAAAATTCATATATTGAATCTCTAATCCCCAA.TGTGATGACACTAGGA 

AGTAAAGCCCTTGGGAGATAATTAGGTTTANATGAGGACATGAGGATGGGATCCTCATGG 

TGAAATTAGTACCTGCCCGGGCGGNCGCTCTANAACTAGTGGGATCCCCCGGGCTGCAGG 

GAATTCNATATCAANGCTTTATCGATACNCGTTCGACCTTCAAGGGGGGG 

Sequence 1533 

CCGGGCAGGTAC I I N N II I M 111 I I I I M < I II M C TTCT CATACTG NG AATNGTTC 
TTGACTCCTTTTNTTGACATTNAGTTTTCANAATTTNCATC1 I I I I I NTGGAACTAANGT 
GCTGTTCTCTTGACTGCCTGNTGGGCCANCATCCGATTGCCANGNCCANAAACGTAAANN 
CTTGNCCAANATGGCCAAGNTACCTNGGC 
Sequence 1534 

GAGCTCCACCGCGGNGGCGGNTCGTGGCACTCATTTACCCGGAGGCAGGGAGAGGCTCTT 
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CTGCGTGTAAGTGGTTGTGCAGAGCCTCATGCATCACGGAGCATGAGAAGACGTTCCCCT 

GCTGCCACCTGCTCTTGTCCACGGTGAGCTTGCTGTAAAGGAAGAANGANCCGTNNNGGA 

NTTNCAANCACCGGGGAAGGGCGTTGGGTCTTGTTAGATTGAATNCTCCCGGCGTGCCNC 

ATTTGGCTTCTTCCCCACCTCCCACCGTGCGNATGGTCCTCNTGGGGAATAAGAAAGNNC 

CTTNTGGACNCAAGAGCCMGGGTCAANGGCCNTGMCCCCATGGGCTTTTCTTTTGGGT 

TCCAAGTCCTTCAANTCCNCNNGGGGGGAATNGNGGGNGNGGCTAGGGNGGNTTGGTTAC 

CTCCTTCCGNGTCTCCGGCCTTTNTTNAGGGAAAACCTTANGGTNTGGGGGGNATTCCNC 

C CNNC CCCGGGNGNACmGGNCCAGTGGGMNMmCCCNAATTATTTNCAAAAAGGC 

CTTTTAATTCCGAAATANCCCCGGTTCGGNMNNCNTTTTGTAAGNGGGGGGGG 

Sequence 1535 

TCATAC TTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGGTTGGGGAG 
AACCCGTTTTGA1TACMGCAGATMTTATCTCAGTGAGATGGGGGTTAGTTCAAGGAAG 
TAAGGAGGGGGGAGGATGTGAGGMGTTAGAACAACCCMTGCTTATTTGATGGGCTGAA 
TAAACTATTCCAGGACTGAACTATTTTTGAGCACTGTGAGGTGGCACAGTAATTACCTGC 
, TTCAAAATCAACTGATACCMCATTTTTATCTTTGTATCTTATCTCTGTACCTGCCCG 
Sequence 1536 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAAGTT 
GATGCCGAGCGCAGATCGCCTGCAGCTTGCTAGCTGTGTGGGCTGGGAGGTCTGGTAGGG 
CTGAGCTTGCAAG AGG ATCAACATG CCTTTGG CTAGAGATTTACTAC ATCCG TCCTTG GA 
AGAGGAAAAGAAAAAACATAAAAAAGAAACGCCTAGTACCTGCCCG 
Sequence 1537 

ATCTTTTMCTAAANGTAATNGTCACCATMTCTTATAGACAMGCATTGAGGTTTATTG 

AACTAATGCTGAAGGTAGTAAGTGGAGGAGCCAGGATGAGGTCAGAATCTGAGATTNTAA 

CCATGCCTATGCTGTCACTTCTTACACTNTAAAATACCTCCATGCTCNTGTGGACACCTA 

NG MCAA NTGAATATTTCTATTCTTGTCCCAGAATNTCAAMCATTAAACATGTTANACT 

GTATTTTNGNTTACCATAANCCTTTCCAGGAGGAACAAGCACTAAACACAGTCTCTGGCT 

TAG GATTT GGATGAACATATTCAAAAGCCATCTGCTTCCCAGCAATCATAATCATACCCT 

TTGCTTTTGGCCACTATCACCAAGATCTCCANTAGTACCTCGGCCGCTCTACAACTAGTG 

GATCC 

Sequence 1538 

CCGCGGTGGCGGCCCGAGGTACCTATGATAATCATGATGGAGATCTGGGGAGGGGAGAAC 
GTGGAAATGTCCTTCCGGGTGTGGCAGTGTGGGGGCCAGCTGGAGATCATCCCCTGCTCT 
GTCGTAGGCCACGTGTTCCGGACCAAGAGCCCCCACACCTTCCCCAAGGGCACTAGTGTC 
. ATTGCTCGCAATCAANTGCGCCTGGCAAAGGTCTGGATGGACAGCTACAAGAAGATTTTC 
TATAGGANAAATCTGCAGGCAGCAAAGATGGCCCAAGAGAAATCCTTCGGTGACATTTAN 
GAACGACTGCANCTGAGGGAACAACTGCACTGTCACAACTTTTCCTGGTACCTGCCCG 
Sequence 1539 

CGAGGTACGCGGGGCAGGTGATGTTTGTTTTCACGATGGTCTTCAGATGCCCACGTGGGC 

ACTGCTGAGAMGCCACTTGGTAAMCTGATGCCGGAMTGGGGCTTTTTGGGATCCCTG 

CTCAGCTGCTTCTGAGTCCCAGCATGCCCTGGTTACCTATGGCCCTCTTTCCCATGGGAC 

CTGACCTATGATCGGCCCCCGCTCCTAGGCGTGGTGACCAAGATGAAGATGCAGAGGACC 

ATTGTCATCCGCCGAGACTATCTGCACTACATCCGCAAGTACCTGCCCG 

Sequence 1540 

GGCCGCCCGNGCAGGTACAGCATCNCTGGTGGTTTCAAAGAACGTAGTCATTCTTCTCAC 

TGCAACANTGTANGATAAGCANGGNAGATCTGTTATTTCCAAATTAAAGGTGATTAANAT 

ATATGGAGAGAGANCATGGCATGTGAGGTTTATAGGGCTAGANACTGNACANCCATGTAC 

AACCCANATT 

Sequence 1541 

TTAACTGGACAGGTCAATTATCTGAGCTGTGTAAATCATTGTCTGGCACTACAGTTGAAT 
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CACCTTGACACTGAGCTCAGGAGCAGCCCATTTAAGAAGCATCTTGTAAATAACAAAGTG 

ACACCTTCGAGATTACAGTTGCAGCTATGCAAGTCATTCACAGAGTAGCTGAGGGCTCTC 

TTCTGAAGGAGGACCCATTACATGAAGGTAGACCTTTCATGTCTTGTTGCCGACGTCCAA 

GTCTCTTGCGGTCCAGAAATGCCAGGATGGTTGTTTCAAATGCATTGGGTATGAACTGCA 

CCTTGAATCTGCCCTTCCCTT7TTCGATTAAACCTTTTCACATAGCTGCTGCTGCTCCT 

Sequence 1542 

CCGGGCAGGTACGCGGGGGAGACCAAGGGCTAAAGCTGGGAGGTGAGTCTGTCACCTTGA 
GCCGGGCGAGCGCTGTGGGCCAAGCAGGGGTTGCAGGGCAGTAGGAGTGCAGACTGAAAA 
AATGCAGACCGCCGGGGCATTATTCATTTCTCCAGCTCTGATCCGCTGTTGTACCT 
Sequence 1543 

CCGCGGTGGCGGCCGAGGTACAGAAGGGCCATGCTGTTATTACTCTTACACAAGGAGGCA 

GCCCTCGAGCCACAGGGTCCAGCTGTTGGCTATAATAGCCTACCGGTCTCTGATGATCAC 

CATGTTTCTGGAATTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCTGGATTAAAACTGAA 

GATCAACCTACTTTCAACTTACTAAGAAAGGGGATCATGGACATTGAAGCATATCTTGAA 

AGAATTGGCTATAAGMGTCTAGGMCAAATTGGACTTGGAAACATTAACTGACATTCTT 

CAACACCAGATCCGAGCTGTTCCCTTTGAGAACCTTAACATCCATTGTGGGGATGCCATG 

GACTTAGGCTTAGAGGCCATTTTTGATCAAGTTGTGAGAAGAAATCGGGGTGGATGGTGT 

CTCCAGGTCAAATCATCTTTCGGTACCTGCCC 

Sequence 1544 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTC 

AGGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAATTCGCGGAAGAAGTGGCTCACG 

CCTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAG 

GCCTGCAGGTACCAGCTGCCATCTGTCAGTTAAACTTCAGAACACACATACCATAGGGAT 

GAAGATAAGAAAGGAATGATTGTGACTCTGGATCAAAATGCTNCCAGTTCCTGAAGGTGC 

ATGATTTAACTGTGA 

Sequence 1545 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCTAACACATTATTA 
TGTTTATTAACTTTGAGACCCAGAMTAAATTCTTTTCTTTTCTTGATT 
AAATACAAAAAAAAAAATGTTTTGTTTTGTGTTA I I I I I GGTTTGTTTATTGGGGGGCTT 
Tm I AATTGTCAGGATTATGATCTTGCTG 1 1 I I I CTTCAATATGTATACAAGGTGATGT 
GAAAAGATGACTTTGGGCAGAGGAGTAAGAACAAAGTAGGCTTGNTTCTTCTACTTTGCT 
TCAGAACTCAGTTTAATGCCCAAAAGCCGGGAGAATCAAAGCCCATGTTTGACCGTTCTC 
GTTGCTNCACCTGCATTTTTCCCAGGAGAGTGTGACNACTCATGCNAGTCCCCTGAGAAA 
AMTAAMTTNAGGGACCATTACTTTCTCCTTTTTAGTC i I I i J I AAAAAATTTC AAAAA 
AACGTTTTAGTCNCAAAGGGGGAAC I I I I 1 1 I I ATGCCTATTCAGGGAAAAGGGT 
Sequence 1546 

NCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTTCTATAAGAACGTGGTCCTGTATATTA 

TTGAGCTTTGGTTCGCCTTTGTTMTGGATTTTCTG 

GCATCGGCCTGTACCT 

Sequence 1547 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTCATTGAGG 

ATCTAATGGACCCCTATCTTCAGCATGGGGAACAGGGTATAATGTTCACCACCTTGAAGG 

CATGTTACTACCAGATTCAGCGTGAGAAGCTTAACTAGGCTGCATAACAGCTTTGAAAAC 

TGG ATTATCTACTAC AG AGTGTTATAACACC ATCTGG AGTCTTCCTGTAGTN GG C AAA>^ 

MGAACAAGTGTTGAMTTTGGMGGGGAeTTTGTGTTATTTNAGGTTGTTAGAATGAGC 

CTTACCAATAATAAGMGCCCTTGAGCCCAGAAAAMGGACTGTTTAGTTTAAAGGGAGG 

ATTGAMGGNAGNGTAAAAAMTCNNATTAGACCAGTTNTTGGCCGCTCTAGAAACTAGG 

TGGGATCCCCCGGGCCTGCAGGGAATTTCGATNTCCAAGCTTATTCGATTCCGNTCTGAC 

CTNCNAGGGGGGGGGGCC 
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Sequence 1548 

CCGGGCAGGTACCACAAGCTGTGTGTCTTTAAGCAACAGAAATTTCTTCCCTCACAGTTG 

CGGAGGTCAGAGGTCAGAAAACAAGGTGTCTGCAGGACCAACCTCTCCTCTGGATGCTCT 

AGGTGAGAATCTTTTCCATGCCTTTCTCTTAGCTTCTGATGTTGCCGTCAGAACTTCAGA 

TGGTGTTCCTTGGCTTCTGTCMTATTAATACATAMTCCTTTTCAGTCTCAGCTTCTCT 

CTTCACATGGTCCTCTCCACCCCCGCGTACCT 

Sequence 1549 

"I I I I I I I II M I I I I I I I □ I I I I NMTATTTTAAAGTATTGCACTGCAGTGGCATTTGT 
CTTGATGGCTGAGTGTTTGGGCACCTCTTGAGATTTCCACCCTAGTGCCAGCTCTGTCTG 
GACGTCCATGGAGGACCTGCCCG 
Sequence 1550 

AGGTACCMTTCTTCTTTGAATGTCTMTTGMTTCAGCTGTGMTCCTTCTGGTTTTAG 
C C CTTTTTTGTTGTTGGCMCTTTCTTTTTATAAC C AATTCC ATTTAGCTGCTTGTTATT 
GGGCTGTTCATGGTTTCTATTTCTTACTGATGTMGACGGTTGTATATTTTGAAGGATCT 
ATCMTTTTCTCTAGATTTCCTAGTCTGTATGTGTAAAGGTGTTCACAGTAGCCTTAAAT 
AAIIIII 1ATAATTTTTGTGGTATTGGTTGTMTATGTCCAGTT1TATTTCTAACTGAGC 
TTATTTAGATTTCTCTTTTCCTT7TCTC 

CTTTCAAAAAAATAGCTTTTGGTTMTCTTTCCNATTTTTGGTTCCMTGGCA 

CGTACCTGCCCGGGCGGCCGCCCGGGCAGGACTCAAGGCCCTAATCTCAATACCATCACA 

TTGGTGGTTAGGGCTGAACATTGGGAAATNGGGGCAAGACACAAACATTCAATCCAAAGC 

AAGCATGTCCCCACTGGTNGGGACTCCMTCCCCTTTTGGCATTGAGTCATTAATTAACC 

ACTGNACCTCGGNCCGTCTANAACTAGTGGANCCCCGGGCTGCAGGAATCNATATCAGCC 

TAATCGATNCCGNCNACCTCNANGGGGGGGCCCGNNCCCACCTTTTGG 

Sequence 1551 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGAGTCAGTCTCAGTCAGGACACAGC 

ATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTACTCTGGCTCCGAGGTGCC 

AGATGTGACATCCAGATGACCCATTCTCCGTCCTCCCTGTCTGCATCTGTAGGAGACAAG 

AAGTCACCATCACTTGCCGGGGCAAAGTCAAGAGCATTAGCAAACTATTTGAATTGGTAT 

CAMCAGMTTCTAGGNAAMGCCCCTTAAGCTCC7TGATCTATGCTNCATCCAGGTTTG 

CAAGGTGGGGTCCCA 

Sequence 1552 

CCGCGGTGGCGGCCGAGGTAC I 1 1 I I 1 1 I IGTTTG I I I I I I I I I I I I CACCTTAAGGGAG 

GA7TTAATTTGCTCCCAACTGATTGTCACTTAAATGAAAA7TTAAAMTGAATAAAAAGA 

CATACTTCTCAGCTGCAAATATTATGGAGAATTGGGGCACCCACAGGAATGAAGAGAAGA 

AMGCAGCTCCCTAACTTCAAAACCATTTTGGTACCTGCCCGnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCGCTTGGGCG 

TMTCATGGTTCATAGCNTGTTTCCTGTGTGAAMTTGTTATCCCGCTCACAATTTC 

Sequence 1553 

NACCTGTCCGTGCCAGCTGCATTAATGGAATCGGCCCAACCGCCGCGGGGNAGAGGCCGG 
TTTTTGCCGTATTTGGGCCGCTTCTTCCGCCTTCCTTCGCTCACTTGACTTCGCTT 
Sequence 1554 

CACTACTTAGGGGGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGT 

GATCGGAGTATAGTTGGAGCGCTTTGTCATGATTTCCAGGTTGGCTTTGTCCACAGCTAT 

GTTGGCCAATGCACCTTGAGCCTCAAAGCTGGCAAATCGTCCAAATTCTTCAAGCCGCCA 

GACCGTCTCCTTCTTTGCCATATCCACATGGAAAATCTCATCACCCCGCGTACCTnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nCGCTTGGCGTAATTCATTGGTCATAGnnnnnnnnnnnnnnnnnnnnnnnTATCCGCTCA 
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CAATTTCCACACAAACATTACCGAGCCGGGAGCATTAAA 
Sequence 1555 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTGAAAAGTGCG 
CCCTACGAAAGGACGGTCGACTCCGGCGGGGCCCAGGTGCCAATGGGTCATGTTTGGTTA 
GAAGGTGACAATCTACAGAATTCTACAGATTCCAGGTGCTATGGACCTATTC CATATG GA 
CTMTAAGAGGACGAATCTTCmAAGATmGGOT 

GTGCCAGCCCTMTGGCCACAGATTTTCTGATGATTAGTMGCATTTATTCTTTTGACTT 
GATTATTTGTCTCCTTTTCATGTGAATTTTATT^ 

TTGCCCGGGGCGGGNCCGCTTCTANGAACTTAGGTGGGAATTCCCCCCCCCGGGGCCTGC 
CAANNGAAATTTTTNGNATTATTCCAAAGGNCTTTTATTNCGMTTANCCCCGN^ 
ANCCCTTCTGAAAGGGGGGGGGG 
Sequence 1556 

CCGCGGTGGCGGCCGAGGTAC1 I I I IGTTTGI I I I I II I I I I I CACCTTAAGGGAGGATT 
TAATTTGCTCCCAACTGATTGTCACTTAAATGAA 

CTTCTCAGCTGCAAATATTATGGAGAATTGGGGCACCCACAGGAATGAAGAGAGAAAGCA 

GCTCCCCTAACTTCAAAAACCATTTTGGGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAAATTGCCGCCGCCTTGGG 

CGGTAAATCATGGGTCATAGGCTGTTTrCCTTGTGTGAAMTTGTTTATCCCGCTCACAA 

TTT 

Sequence 1 557 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I II I I I I I! M I I M I I I I CTTTTCAATGTT 

CACAGCTACTTCTCCTTTMGAGAAGCAGAAAGGCAGTTTTAGGTGTGGTCTGTCTTTAT 

TCCCCCGAACTTGATCAATGTATACAATGCTTACACATTCAAATATGTGCAAGATTCAAC 

AAGGAACACTCTCCACTGCTGTGCTTGAAGTACCTCGGCCCGCTCTAGAACTAGTGnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCGCTTGGCGTAATCATG 

GGTCATAGCTTGTTTCCTGTGTGAAAATTGTTMTCCCGCTTCACAATTTNCACCACAAA 

CATACCGAG 

Sequence 1558 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTCC 

CAGCTCCTGGCTGACTTCTAGTCTTCTGGTTGAAGCTGCGCCTTTAGATGACACGACCCT 

ACCCACCCCTGTTTCCAGCGGATGCCCGGGCCTGGAGCCCACAGAATTCTTCCAGTCCCT 

GGGTGGGGACGGAGAAAGGAACCGTTTCAGATTGAGATGGCCCATGGGCACCACCACGCT 

CGCCTTNAAGTTCCAGCATGGAGTGATTGCAGCAGTGGATTCTCGGGCCTCAGCTGGGTC 

CTACATTANTGCCTTACGGGTGAACAAGGTGATTGAAGATTAACCCTTACCTGCTTGGCA 

CCATGTCTGGCCTGTGCAGCAGACTGTTCAAGTACCTGCCCGGGCCGGCCCGCTCTAGAA 

CTAGTGGGATCCCCCGGGCTGCAAGGAATTCGAT 

Sequence 1559 

CCGCGGTGGCGGCCGAGGTACCCATTCCCTTGATGTCTACAATATCACCTTTCTTATAGA 

TTCGCATATATGTGGCCAAAGGAACAACTCCATGTTTTCTAAAGGCCTAGAGAACATATA 

TCGGGTGCCTCTCCTCTTTCCCT7TGTGTTCGTCATTTTGGCGAATTACTGGAAAGATGG 

CGGTTCCCGGCCCGCTCTAGAAACTAAGTGGGATCCCCCGGGCTGCAGnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnhnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnAGTGAGGGTTAATTGCGCCGCTTGGCGTAAATCATGGTCATTAGCTTGTTTCCCTGTG 

GTGAAMTTNGTTATCCCGCTTCACAMTTCCACACCAACCATTACCGAAGCCCGGGGAG 

CATTAAAAGTNGTTAAAAGCCCTGGGGGTGCCC 

Sequence 1560 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAATGATAAAACCATTGA 
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TGAGGAACTAGMCGGGACMGAGGGTCACTTGGATTGTGGAGTTCTTTGCCAATTGGTC 

TAATGACTGCCAATCATTTGCCCCTATCTATGCTGACCTCTCCCTTAAATACAACTGTAC 

CTGCCCGGGCGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnCGCGCTTGGCGTAATCATTGGTCATAGCTGTTTCCTGTGTnnnnn 

nnnnnnnnnnnnnnnnnnnnnCACACAANCATACCGAANCCGGGAGCCATAAAGTGTAAA 

GCCCTGGGGTGCC 

Sequence 1561 

AGGTACAAAAMTATTCATATAMCGTTGTTCACACGTAGGTCCTAGATTACCAGCTTCT 

ATGCAAAAAAAGGAMTGAAGAAAAATAGATTTGTTMCTAGTAT^ 

TGCCTGGCTTAAAACCTCCCTCACGCTCGTCTGTCCCACACAAATGTTTAAGAAGTCACT 

TGCAATGTACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnGAAATTGTTATCNCGCTCACAATTTCCACACAACATACCGAAGCCGGGGAGCCATA 
A 

Sequence 1562 

CCGCGGTGGCGGCCGAGGTACAAAACTTTTTAMTTTTATATTATT^ 

TGGGTTTGTGGCACCCTGGCCACCCCATCTGGCTGTGACAGCCTCTGCAGTCCGTGGGCT 

GGCAGTTTGTTGATCTTTTMGTTTCCTTCCCTACCCAGTCCCCATTTTCTGGTAM 

TTCTAGGAGGTCTGTTAAGGTGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCNACCTCTTAGGGGGGG 

CCCGGTACCCANCTTTTTGTTCCCTTTTAGTGAGGGTTAANTGCCGCCGCCTTGGCCGAT 

AATTCATGGGTCATAGCCTGTTTTCCTNGTGNGGAAAATTGTTTATNCCGCNTTAACAAN 

TTTC C ACC AC AAA 

Sequence 1563 

CCGCGGTGGCGGCCGAGGTACTATTTACAAAGCATGAATTCACTACAATGCTCAACTGTT 
TGTTTAGCTTTATCTCACTTGGGGAATTTATTCCTGTCTGCTGCATTGTAGGTAGCTGGG 
TAGGATATATTTCCACTTGCTTTTTAMTTMGTTCTTCACCTCCATTGACACCTCGGTT 
TTTTNGGTTTTCTCCCTATAGTGTTGGGTTGGGTGCTAAGACACCAGTCTGACCCCCAGA 
AATGG GGAGGTTATTTC N ATCCATCTTTTCCTCC ATCCTTTCC AAAAACCACATATCTAC 
ACAAGGGAAAAATTTAATACATCTAGGAA I I I I I I l I I I I AATTACAAGCTATTTAAAGA 
GATGAATGTGGCCAAAGTTTTACA 
Sequence 1564 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTGAGCACATAGC 

ACCAGMTATTGGMCAGATGCTTCTCCAAAGCCAAGGGTTGCCTTACATTTAGAGTGGG 

AAAAGAGGAAAAT GAGTT AGCAGAGGAGCAGCGGGGAGGGGGGTAAATTACAGGACATAA 

TTTGCTTCTAAAMT7TTGTG1TC 

TGCCCGGGCGGC 

Sequence 1565 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGAGCATTATAATATTGCTnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnTCGGTCAGGGACCCCGGATTCCCGGGTAGATGCCCAGTTAAT 

GAGCAGCTTAGGAGACTGTCCTGGTTTCTGCTGGTACCTGCCCG 

Sequence 1566 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGG 
GGGTTCCCAGGATGGTGATCCGTGTATATATTGCATCTTCCTCTGGCTCTACAGCGGGAA 
GAGATGGATCAAGATGGTGAAATAGAAGACTCCACCAATCATCCCCTTCACACGAACACC 
AATTTAACAACTATCTACATGCACACACACAAAAGCTTCATCAATTTAACAACTATCTAC 
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ATGCACACACACAAMCCTTTGTAAGAACTCAAAATCAGATTAAGAAGAAACAACAAGAT 
GTGCTTGGTTTCCTAGMGCCMCAAMTAGGATTTGAAGAAAAAGATATTGCAGCCAAT 
GAAGAGAATCG GAAGTG GATGAGAGAAAATGTACCT 
Sequence 1567 

CTATACGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTTTTTGTTGGTTTTT 

TTCTCCCTCAAAGTGAAATGNNAGACTATTGTAGGGTTTTGAGCCAAGGAGTGATATTAT 

CTGAATTCTGTTTTAAAAGGNTCATTNTGGCTCCTGTGTGGAGGCAGGGTAGCTACTTAN 

AAACCAAGCCATCTCAGCCTCCCGATTTGTAAGAGATGTGGTCTCACTATGTTGCCCAGA 

CTGGTCTTGAACTCCTGGCCTCATGTGATCCTCCCACCTTGGCCTNCCAAAATACTAAAA 

TGATAGATATGAGCCGCCACACCTGGCCATAAGCTTCAGCCTTGAGCAGATACTTTCTAA 

ACACTTTATATAGGGAAAAAAAACTM7TCTTTGCGTAGCCCAAGGTGTTCATTTTTGCC 

GAGAATAACCGTAGCCATANGTGTTTACAATTACGTGGGTGGTAATTT 

Sequence 1568 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAAGGAGAAATAGAACATGT 

GCAGGCAAAAGAAGAAAGGCGCGGGCTGGGTGGGAAGAGGATTCGGACTCGTCACACTGC 

AGAGCAGCAGAGCGAGAAAGGATGAGAAGAGGCAGAGAAGGCGACGGCAGAAGAAAAAAG 

GAAAAACTGCGGCCCGNGATCANAGCCTGAAAGTGTCCCGCGTACCTnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnTNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTT 

GGCGTAATCATGGTTCATTAGCTNTTTCCTGTGTGAAATTGTTTATTCCCGCT 

Sequence 1569 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGAGCGC 
TTTAGGCTGGCCGGCGGCGCTGGGAGGTGGAGTCGTTGCTGTTGCTGTTTGTNNAGCCTG 
TGGCGCGGCTT 
Sequence 1570 

CGAGGTACCATCATGGCTTGAGTGCTCTGAAGCCCATCCGGACTACTTCCAAACACCAGC 

ACCCAGTGGACAATGCTGGGC I MM I CCTGTATGACTTTTTCGTGACTTTCTTCTCTGG 

CCCGTGTGGCCACAAGAAGGGGGAGCTCTCAATGGAAGACGTGTGGTCTCTGTCCAAGCA 

CGAGTCTTCTGACGTGAACTGCAGAAGACTAGAGAGACTGTGGCAAGAAGAGCTGAATGA 

AGTTGGGCCAGACGCTGCTTCCCTGCGAAGGGTTGTGTGGACCTTCTGCCGCACCAGGCT 

CATCCTGTCCATCGTGTGCCTGATGATCACGCAGCTGGCTGGCTTCAGTGGACCACCTTC 

CTTTGGAGACTGCTCCATCAGTGCCGAGGTGTGTGGGAACAGGCTTCACTGCACCGCCAT 

CTTACTGAGTTGCTTCACGTGAGGAAAAGGGGGCTTTGGCCCTGTGACTCAGTTCCACAT 

TTTGGATTGCATACTGGAAAAGAACCAATCTTCTTGCTAGTAAACCAGCAACCCGGCTGT 

ATACAGTGGTGACCCAAGCAATGGATATAAACCTAAAAATCTGANGGAAGGGAGAGGTNG 

MTACAGTANTTCTTGGAATCTGMGTCTNCTATTTGATCAAGGTTATTTCCTG 

Sequence 1571 

CGCGGTGGCGGCCGCCCGGGCAGGTACATGATACATGACCCCAGGTTCCAGTGTAGAACC 

TGAGTCCCCCATTCCCCAAAGCCATCCCTGCATTGATATGTCTTGACTCTCCTGTCTACT 

TTTGC ACACACCCTTA A I i I l I A ATTGGTTTTCTTGTAAATACAGTTTTGTACACTTTTG 

GTCAGGGGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCC 

CGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACT 

TCTATCCCAGAGAGAACAAAGTACCTTGGCCGCTCTA 

Sequence 1572 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGAGTTGCTACGCTGCTn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGCCAGTGGCCCTGTTGGATGCATCATA 

GATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGTTGGTACCTGCCCGnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCT 
TGGCGTAAATCATGGGTCATAGCTGTTTCCTGTGGTGAAAAATTTGTTTATC 
Sequence 1573 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACTCTCCTAGATTAGGAGTTT 

AGGTTGCCCCAATACCCATGCAGAGAACTTGGAGCTGAAAGGTTCCACAGCTTCAAACCT 

AGGCATACTTGTGAATGCGTGGTGGCCACCCATTGATTATCCCTTGGTGCTAGTGCTTGT 

ACCTGCCCG 
Sequence 1574 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAGAGAGAGGGAGAGAGA 
GAGAAAGAAGAGCANGAAAGATCCCGAAAGGAGGAAGAGGTGGCGAAAAATCAACTGCCC 
TGCTGGATTTGTCTTTCTCAGCACCTTGGCGAAGCCTTGGGTTTCTTTCTTAAAGGACTG 
A I I I ( I AGAACTCCACATTTGAGAA 
Sequence 1575 

CCGCGGTGGCGGCCGAGGTACANGGAGATGTGGAACCTGGTTCCAGGCAAACCAACGCTT 

CCAACTCCAAAGAGTTGGGGGTTGTTAGAGAGCCCTTTCCCAGAAAGCCCGACACTCATG 

TCTTTAGTCTGGCAGCTGCACTAGTCACTTTCAACTGGCTGACAGGTGCCCAGTATTTAG 

CCCCCAAATTCTAAAGGAAAAAGTTNGGACAAGAATAGCAAGTGAAAGGGGATCCGATGG 

TACCTTGCCCGGGCGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnAGCTTATCGATACCGTNCGACCTCGAGnnnnnnnnnnnnnnnnnnnnnnnnTGG 

TTCCCTTTAAGTGAGGGGTTAMTTTGCCGCGCTTGGCCGTAAATCATGGGTCAATAAGC 

TGTTTTCCTGGTGTGAAATTGT 

Sequence 1576 

AGGTACTGGCTGTCCACCCCTGAGCCCATGCCCATGTCAATGGCACCCATCACGGGGGAA 

AACATAAGACCATTTATTAGTAGGTGTGCTGTGTGTGAGGCGCCTGCCATGGTGATGGCC 

GTGCACAGCCAGACCATTCAGATCCCACCGTGCCCCAGCGGGTGGGTCCTCGCTTGTGGG 

ATCGGCTACTCTTTTGTGATGCACACCAGCGCTGGTGCAGAAGGCTCTGGCCAAGCCCTG 

GCGTCCCCCGGCTCCTGCCTGGAGGAGTTTAGAAGTGCGCCATTCATCGAGTGTCACGGC 

CCGCTCTAGAMCTAGTGGGATCCCCCGGGCCTGCAGGAMTTTCCG ATAT CAAGCTTAT 

CGATACCCGTTCGACCTCGAGGGGGGGGGGCCCCGGTTACCCAAGCTTTTGTTT 

Sequence 1577 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGCACACCCCCACTCC 

CACGGTGTGGGTAGTGCCGTGAATTGGAAACCGCAGATACCCTGGAGGCCTGGTAGGCAG 

CTGACTGCCCAGCACTGGGCAGACTGATAGTCCGTAAGTCTGATGGCAATGACNACCTAT 

AGTTCTGAGGAAMGGAAGGTGGAACTCAAAATTTCCTTGTGCTTGGACTTGAGAAAACT 

GNAATCTTGAAGGTTTGGCCGAAGTT 

Sequence 1578 

ATCGACGCTCAAAGTCAGAAGGTGGCCGAAACCCGACAGGGACTATAAAGATACCAGnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnrnnnnnnn 
nnnnnnnnCGCCTTTCTTCCCTTCGGGGAAAGCCGTGGCGCCTTTCTCATTAGCTCACCG 
CTTGTAAGGGTATTCTCAGTTTCGGTGTTANGTTCNGTTCCGCTTCCAAGGCCTGGGCCT 

TGTGTTGCACGAAACCCCCCCGTTC 
Sequence 1579 

TCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCGGCCCCAGAG 

CTCCTGCCAGTCTGGGCGACAGCTAGCTCTCCGCCTCTTCAGCAAAGTTCGGCCCCCTGT 

TGGGGGTATCTCTGTTAAGGAGCATTTTGAGGTAAATGTGGTGCCTCTCACCATCCAGCT 

GACACACCAGTTCTTCCACAGMTGATGGGCTTTTTCTTTCCTGGCCGAAGTGTGGAAGA 

TGATGAAGTTGGTGATGAAGAGGATAAGTCCAAACTGGTGACTACTTGGAATACCAGTGG 

TGAAGCCTCGGCAGCTGATTGCAACAAGATGATGCAGTACCTTGCCCGGGCCGGCCGCTC 

TAGAAACTAGTGGGATCCCCCGGGGCCTGCAGGGAAATTCGATATCAAGCCTTATCGATA 
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CCCGTTCNACCTCGAGG 
Sequence 1580 

TCGCACTCATTTACCCGGAGACAGGGAGGGGCTCTTCTGCGTGTAGTGGTTGTGCAGACC 

CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCCGCCACCTGCTCTTGTCCACGGT 

GAGCTTGCTGTAGAGGAAGAAGGAGCTGTCGGAGTCCAGCACGGGAGGCGTGGTCTTGTA 

GTTGTTCTCCGGCTGCCCATTGCTCTCCCACTCCACGGCGCCCGCGTACCTnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNG 

ACCTCGAGGGGGGGCCCGGTTACCCAGCTTTTTGTTCCCTTTAGTGAAGGGTTAAATTTG 

CCGCCGCTTGGCCGTAAATCATGGTCATAGCTGTTTT 

Sequence 1581 

CTATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGG 

AAAGAGAGAGCCTTCTGGTGAAAATCAAAGTCCTGGAGCAAGACAAGGCAAGGCTGCAGA 

GGCTGGAGGATGAGCTGAATCGTGCAAAATCAACTCTAGAGGCAGAAACCAGGGTGAAAC 

AGCGCCTGGAGTGTGAGAAACAGCANATTTCAGAATGACCTTGANTCAGTTGGAAGACTC 

AATTATTCCCGCAAGGGAGGAAGGCTACTAGGAAGNATAGAATCGGAAAGAGAAAAGAGT 

GAGAGAAGAGAAGAACAGTCTTAGGAGTTGAGATCGAAAAGACTCCAAGCTGGNGATCCA 

AGMGAATTTGTAGNAGAGGTGCAGGCGTTTANGCTTGGAGGGATTTCTTCCCCAGGGAG 

GACCCCCAGACCNCCANTTTTTGNAAAACCAGGAAA 

Sequence 1582 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACr'l 1 1 I I M I II i I I I 
I I I I I I I I I I i GGTTTGAGGGGGAATGCTGGAGATTGTAATGGGTATGGANACATGTCAT 
ATAAGTAATGCTA 
Sequence 1583 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGCCTCTGGATTCACTTT 

TGATAATTATGTCATGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTC 

GGCAATCAGTGGTAGTGGCGGTGGCACACACTACGCAGACTCCGTGAAGGGCCGGTCCAC 

CATCTCCAAGAGACAACTCCAAAGAACACACTATATTTGCAAATGAATAGCCTGAGAGCC 

GACGACAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnATTTGNCGCCGCTTTGGCGTTAATCATGGGTCATAAGCNTGTTTTCCTGTG 

TG AAATTTGTTTATCCCGCTCAC AAATTTTN C ACACCAACATAC CG 

Sequence 1584 

NGGAGCTCCCCGNNGTGGCGGCCGCCCGGNCAGGTACACGCCCAATGAGATCTGCGTCAT 

CCAGAGAGGAATGCGGTTCAGCATAGATGTCTTTGAGGAGACCAGGGGCTACATCTTGGA 

GGTCTATGGTGTCCACTTTGAGTTACCTGACCTTGGACCAATTGGGGCCAATGGCTTGGC 

CAATCCTCGTGATTTCTTGATACCCATTGCCTGGGTATGAGGGATCGCCAAGTTACCTTn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnCNACCTCNAGGGGGGGGGCCCGGTACCCCAAGCTTTTTTGTTCCCTTTTAGTTG 

AGGGGTTTAAATTTTGCCGCCGCCTTGNGCCGTNAMTCATGGGTTCATTAAGCCTTGGT 

TTTCCC 

Sequence 1585 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGAGCGGCCGCCCGGGCAGGTACC 

TATGATTGAAAACAGTAGTTGGTCTATGACTTTTGAGGAGAGGGAGAACCGAAGATTACA 

GGAGGCCAGCATGAGGTTGGAACAAGAGAATGATGACCTTGCCCATGAACTAGTAACAAG 

CAAAATTGCTCTACGGAATGACTTGGATCAGGCAGAAGACAAGGCAGATGTGTTGAATAA 

AGAGCTCCTTTTGACCAAACAGAGGCTGGTGGAGACTGAAGAGGAGAAGAGGAAGCAAGA 

GGAAGAGACTGCCCAGCTAAAAAGAAGTCTTCAGGAAACAGCTAGAGAAGGCAGAATATG 

GAAATAMGAMGACTACAGCTATCATTGCTTGMGTTATAAACAGGATCTTGTTCCGCA 

GTTTGAAGTACCTCGGCCCGCTTCTAGAAACCTAGTGGGATCC 
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Sequence 1586 

CTACTATAGGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTATTTTTCT 

CAAGAAATACGTTAAGTCCAACTGCTGAACATACATCTTGAATCTCTGTAGCTGTAGGAT 

TTTCMCAGCCTTACTTATGGGGATTCGCCTTCCCTCTGCGATGGTCTTCTTATTATTTA 

AATAAGCAGGATAGATACAAATAAACCTGTCCTGGTCGGCCCGCTCTAGAACTAGTGGGA 

TCCCCCGGGCTGCAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCGTAATCA 

TGGGTCATAGCTGTTTTCCTGTGTGGAAATTGTTATCCCGCTCACCAATTTCCANCACCA 

ACCATTACCGAAGCCCGGGAAGCCATTAAAAGTGTAAAAGCCTGGGG 

Sequence 1587 

CCGCGGTGGCGGCCGAGGTAC I I I ! I I IGTTTG J i I I I I M I I CACCTTAAGGGAGGATT 

TAATTTGCTCCCAACTGATTGTCACTTAAATGAAAATTTAAAAATGAATAAA^ 

CTTCTCAGCTGCAAGTATTATGGAGAATTGGGGCACCCACAGGAATGAAGAGAGAAAGCA 

GCTCCCTAACTTCAAAAACCATTTTGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnniiGTTAATTGCGCCGCTTTGGCCGTAA 

TCATGGTCATAGNCTGTTTTCCTGTGTGAAAATTGT 

Sequence 1588 • 

CCGCGGTGGCGGCCGAGGTACTGATTTCAAAACGTGGACGTTCGGCCAAGGGACCAAGGT 

GGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCA 

GTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGC 

CAAAGTNCCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGTCATAGCnnnnnnnnnnn 

nnnnnnnnnnnnnnnCGCTCACAAATTCCACACCAACATTACGAGCCGGGGAGCATTAAA 

GTGTAAAAGCCCTG 

Sequence 1589 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCCGGGCAGGTACTATCTGTATCA 
TTCTGC AAACCTTTTNGGN N AATTN NN NCCCCCCTC AACMTTTCTGGGGG ATNCTNTTT 
NCTGGTTAAATACATGTGG 
Sequence 1590 

AGGTACCTAATGAAACCCGTCTTTGGCCTGTGGCTGAGACGCCATCTGTAGGCGGTGAAA 

TCTTTCCAGCCTATGTGACGAGGGTCATGCCACAGGGTGCGCACCTGAAGGAAAAGGAAA 

AGGAAAGGTTGAAACACAGCTCCTTCAGGTGACAGAACAGATGTCCCATGGTGACCAAGG 

ACCCCAGCCTCCATTGACTGTAAGAAGATGAACCACAGGCAACAATTAAGATCAACTTTC 

AAATCCCTCCCTTGTCATAGACTCTCCTAAGCCCGTTCAAGAAACCTCTGCTTCTCTGAC 

ACACGGAAAGMTGCTGGGGAAGAGGGTGAGTTTGTTnTCATTAAAAGA 

Sequence 1591 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTTTTGGTCAGG 

GGACCAAGCTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCAT 

CTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATC 

CCAGAGAGGCCAAAGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGTCATAAGCTGTTTC 

CTGTGTGNAAATTGTTATCCCGCTCACAATTCCACACCAACATTACCGAAGCCCGGGNAG 

CATTAAAAGTGTAAAAGCCTGGGGGTGCCCTAATG 

Sequence 1592 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACNTGCNCATCAATGT 
ATGATGATGACTATTTGGCTTAAATGGGATACAATATTCGACCACGCTCAAGAACCTTNA 
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GAAGCTTGACTGAAATTTCTAANACTGATATTTCTTACTGAAATGTGNCCTTTGGGTTTN 

AANCTGAGGTTNCTATNGTTTGCTTNCNACAGACATATGTTTGACATACAAAATATGGTA 

AAGATCCACTACTTNTGCAGAGGTATATTGGCCATATTATTCCCTTATNGGCCTGGATTN 

CCTCATCAATAGCACAATTCATTCAAA 

Sequence 1593 

CCGGGCAGGTACTACAATGAGACAGTTGGATCCAI I I I I CCAGGGTTAAGCCTGCAATAC 
CTGAAGCTCCAGACCTGAAACCCAGAGCTCCTGCCAGCTGCTATGCCTGTGAACCTGTGG 
AAGACCTGAAGTACCT 
Sequence 1594 

TCCTGTGTGAAATTGTTATTCCNGCTCACAAATTCCACACAAACATTACCGNAGCCCGGA 
GGAGGCMTAAAAGNTGTTAAAAGCTCTGGGGGGTGCNCTAATTGAAGTTGAAGCCTAAA 
CATCACCATTTAMTTTGCCGTTTTNCGACTTCACCTTGCCCNGCCTTTTTCNCM 
CCGTNGTAAAAACCCNTNNTTCCGTTGGCCCCAGGGCCNTTGCCAATTTTAAAATT 
Sequence 1595 

AGGTACCTCATTACTGCTCCCCATGTGGCCTCCTTACCCAGGAAAATTATGAAAGTCTGA 

ACTCAGTCTCTTTNGACACCATCTCAACAGGAGGGGAAGGGGCACCTCATTATTGCCTGG 

CAAGGGAGGAAGTCTAGGTTCCTCACTCAGTCTTTGCTGGTGACACAGGATTCTTTGGGA 

GGCCACTTCACCAGGCTCCACCCATTCAGCCCAGCAGGCCGTGCTTGGTTTGCGCTACTG 

GCCTGGATCCCATATCCTGGGTAAAATGGTGAAGGGTGTGTGTGCAAGTGGGTGTGAGGT 

CTGACCACAGTGCACAGCCNGNCACACAGGCTGCTTGCAGTTGGGGCTGGGCAGGCAGCT 

CCAGATGCCAGCACAGATGCCTCCTTCATGTTGAAAGCNTGCCNGCTGGACTAGGC 

Sequence 1596 

ATTGGAGCTCCACCGCGGTGGCNGGCCGANGTACAGCGGGGGGGCGGGGGGATAGCTGTC 
CAAGGTCTCCCCCAGCACTGAGGAGCTCGCCTGCTGNCNTCTTGCNCNCNGGAAGCAGCA 
CCAAGTTCACGGCCAACTGCCTTGGCACTANGGTCCAANAATGGCTA 
Sequence 1597 

CCGGGCAGGTACTGGGCGCAGATGGGCGACCGTCACAACACTACGGTCCACTGGACATGT 

CTGTTGCTGAGAGAACCACTGGGTGATGCAGGCGTTGCANAAAGCATGTTCACAATGAGG 

TGCCTGGTAATCGTGAGCTTCTCCTCATACTTGCCCCAGGATGATACAACTGACTCTGGT 

CTTGGCAAGGNTGAACCT 

Sequence 1598 

CCGGGCAGGTACGCGGGCTACMGCATMGGCATCATGCCTGGCTAATTTTTTTACATGT 
I I I I I I I I IGTCGAGATTATGGTATACACTATGTTGCTCTGGCTGATCTCAAATTCCTGA 
CCTCAAGTGGATCTTTCNTGNCAC/TNNCTTCCTAAAAAGNTGTCCANGGGATTTANCAT 
TGCNATNAATGGCCAACCCAATTGCCCTTAANCCGATAANATGGTAT7TTTATCMNTTT 
AATTGTTATTC AAATAN NTTCTTTAVANTNTTNTTAAN N GTAAG N CCTC ACTTNTTAATN 
TTNG N ATCTTTTTTNTGGGC NCCTTAAATAATN ATATC NTTC N AAATTAAATTATAANTA 
ATTTCCTTNTTCGTAAANAACCTTrTTA \ 1 1 1 I II f I ! CG AATCCAAAAAN AATTATNTT 
TNGTGGGGNGGCTAATTGGMTTrNTTATTTNTACGTAAGNTAAAAGGGGGGTGTGGTTT 
TTTTTGGAAAATAACC ATATTNTA NCTTTATAATTAAAAATGG 
Sequence 1599 

AGGTACATGTGTAGAAAGCAGTGGGAACTG TGATTGGATTTGGCAACATGTC AGCTTTAT 

AGTTGCCGATTAGTGATATGG3TCTGATTTCGATCTCTTCCTGATGTAAACCATGCTCAC 

CCATATCCCACTATACAAATGCAAATGGTTGCCTGGTTCCATTTATGCAAGGGAGCCAGT 

ACCTGCCCG 

Sequence 1600 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGAGGCCAGT 

TATTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCCTGGGATACCAGTGGCCCTGG 
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TGGATGCACCATAGATGAGGAGCCTGGGAGCCTGGCCAGGTTTCTGCTGGTACCTGCCCG 
Sequence 1601 

AGGTACGCGGGAGATGAAGGTGCGCATCCTCTCCGAGCGTMGAAGCCTCTGGACATTGA 

CTACATGGGGGAGGAACAGCTCCGGGAGAAAGCCCAGGAGCTGTCGGACTGGATCCACCA 

GACTGGAAGNTCTGANAAAGTTCGACCCTNGATGGGCGAAAGCTTGAAAACAGCNNGNAA 

ATATTGNAANATCAAACGGTNCCTGGTACCCTGCCCCNNGGCCGTGCCCGCTTTTTAANA 

AAACCTTANNNNNGGNANTTCCCCCCNCNCGNGGNCTTTGNCCACTGNGAAANTTNCNNA 

ATTANATTCTNAMGCCTTTTAAATNCCGAAATTNCCCCGGTNCNTACCCCNCTTCGCTA 

NGNGGGGGGGGGGNGCC 

Sequence 1602 

GACGGAAAGCTTAGATATGCCAACAACAGCAATTACAAAAATGATGTCATGATCAGAAAA 

GAGGCTTATGTGCACAAGAGTGTAATGGAAGAACTGAAGAGAATTATTGATGACAGTGAA 

ATTACAAAAGAAGATGATGCTTTGTGGCCTCCCCCTGGATAAGGGGTTGGGCCCGACAGG 

GAGNCTTGAAATTGTAATTTGGGGGATGAAGCACATATCTTTTTACCCACCATCAAAAAA 

ATTAGGGTNTCTCTTTATTGGATGGTAAAATCCAGTCCAAAGGGATTCCTGGAAGGCCCT 

TCGNAGGTAl I 1 1 I I ACTTATTTTGGGTTACCCTCGNGGCNCGGCTTCTTAGAAACCTAG 

GTGGGATTCCCCCCCGGGGCCTGGCAAGGGGAATTTCCGMTANTTNAAGGCTTTATCCG 

AATACCCGTTCGAACCCTCNAAGGGGGGGGGGGCCCCCGGGTTACCCCCNAGCC I I I I II 

GGT 

Sequence 1603 

ACTATAGGGCGAATTGGAGCTTCNACCGCGGGGGCGGCCGCCCGGNCAGGGTACTCCTTC 

GTNATACCCATCTGGGCAGTCCAAGACACCATTGCACAGCTGGGATAAATGAACACATTT 

GTTGGTACNCGGGATGATAATGTCCCGATCACCTTCGGCCAAGGGACACGACTGGAGATT 

AAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGNTGAAA 

TCTGGAACTGCCTTTGTTGTGTGCCTGCTGAATAACTTNTATCCCAGAGAGGCCAAAGTA 

CCT 

Sequence 1604 

GATCNTATAGGGCGAATTGGAGCTTCANCNCNGGGGNCGGCCGCCGGANGGNGGGTGACG 

TGCGGATCTTNTTC I I I I IGTGGCTGTGGACACCTTTCAACACTGCCTTCTTGGCCTTTA 

AAGCCTTCGCTTTGGCTTCAGCCTTAGGAGGGGCAGGAGCTTCCTTCTTCGCTTTCGGCG 

CCATCTTGTGAAAAAGCCCCGCGTTACCTCGGGCCCGCTCTAAGAACTTAGTTGGGATTC 

CCCCCCCGGGGCCTTGTCATGGAMTTTTCCNAATMTCAMGCCTTTAATTCNAATTAA 

CCCGGTTCCNNAACNCTTCTAAANNGGGNGGGGGGGCCCNCCGNGTAACCTCCNAAGNCT 

I I I I I IGGTTTNCNCCCTCTTTAAAGATCGAAAGNGGGGTTTTAAAATTATAGCCAGCCC 

GCCTTTTGGGNCCCGG 

Sequence 1605 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATGGACCCGCTACGG 

AACGAGGAGGAGGTTCGAGTGMGATCAMGACTTGAATGMCACATTGTTTGCTGCCTA 

TGCGCCGGCTACTTCGTGGATGCCACCACCAtCACAGAGTGTCTTCATACTTTCTGCAAG 

AG7TGTATTGTGAAGTACCTGCCCG 

Sequence 1606 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTATAGTTACGTTA 

TTTCTCACCAGTCACATACTCAAGAATTCTTTTACTCAGACCCATGACTTGACCACAGTG 

ATAGCAGAAGCTGGGGTGGGCACAAATGCTTATACACAATATTACAAAAGACGACATTCA 

CTGGCTTAGAGAAACTG ATGAATTTTCATTACTTTTACA 1 1 I I I ATAAGGCTAGCTAACT 

TTAACAGTTAACTTTAGATAAATCTACTTACACAGTACCTGCCCG 

Sequence 1607 

GGGGGCGGCCGAGGTACCANAACTTCTATGCACACCTCCCTGAGAGTCTGGGAACCTTCA 
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CCGCTGACCTGTGTGAGATGTTCCCAGCAGGCATTTATGACACCAAATATGCTGCTGAGT 

TTCATGCCCGTTTCGTGGCCTCCTACTTAGAATATGCCTTCCGGAAATGTGAACGGGAAA 

ATGGGAAGCAGCGGGCAGCTGGCAGCCCACGCCTTACCCTGGAGTTCTGCAACTATCCTT 

CCAGCATGAGGGACCATATTGATTACCGCTGCTGCCTGCCCCCAGCAACCCACCGTNCTC 

ATCCCACCAGCATCTGTGACAACTTCTCGGCTTATGGCTGGTGCCCCCTGGGACCACAGT 

GTCCTCAGTCTCACGATATTTGACCTTATCATTGACACTGATGAGGCTGCGGCAGAGGAC 

AAGCGGCGACGG 

Sequence 1608 

TGGCCCGAGGTACGCGGGGGAGGAAGGACCTGGTAGTTTTGATGACCGCTGTCCTGTCTA 

GCAGATACTTGCACGGTTTACAGAAATTCGGTCCCTGGGTCGTGTTC 

Sequence 1609 

CGAGGTAC I rTTTTTTTTTTTTI I GAGTTAAGTGTC I I I I I AATTGAAAAGCTAACTGTC 

CAGTTACATTTCTCCCAAAAAACCACAAACTGGGTAGTAACTGAGTCTCTAGGCAATATA 

TTTAAAACTAAGAGGATTAAAAATAAAGAAAAAGAAAACMGTCCTCAGATTGCAATGGA 

AGGGGAGCCCTGCTACATTACAGGCCTAANTCAAATACAGTGGGTTACCTGCCCCGGGGC 

CGGCCCGCTTCTAGAAAACNTAGGTGGGAATCCCCCCCGGGGNCTGCAAGGGAAATTTCG 

GATATTCAAAGCCTTTATTGGGATTACCCCGNTCCGAACCCTNNGGAAGGGGGGGGGGGC 

CCCCGGGGTACCCCCCAANGC! INN I I NG7TTCCCCCCTTTTTAGGGTNGNANGGGGG 

NTTTAA 

Sequence 1610 

CGCTCACTTGCCCGCTTTCCAGTCGGGGAAAACCTGGTCGTGCCAGCTGCATTAAATGAA 
ATCGGCCAACGCCGCGGGGAGAAGGCCGGGTTTTGCGGTATTGGGGCGCTTCTTCCCGCT 
TTCCTCGCTT 
Sequence 1611 

GTGTGCCACCGCCACTACCACTGATATGCCGAGACCCACTCCAGCCCCTTNCNTGGAGTA 

NNGGCCGGNACCCAATTCATAGACATAATTTATCCANATAAGGTGAANTCTCAGTAGGGC 

TTGTTNCNCTCTGAGCCCGCTTCTNAGNNAACCNTAAGNNGGNATTNCCCCCTCCGAGAG 

TCNTGGCAANGGTAAANTTTNCNNATTANNTCCNAAAGCCCTTTTAATTCCGGAATAAAC 

NCAAATTCNGGACNCCTTTNNTAGNGGNGNGAGGGGNGNCTCCCGGGTNTAACACCCAAG 

GACATTTATTTGGATTTCCCCCCTTNTATTANGGTTGGAAGNGGGGTNTNAAAATTTTNG 

NCCNGCCCGGCCCCTTTNGNGNCCCGTTTAAAAAANCCAAATTGGGGGTCCCAAATTAAG 

CCTTTGNTrrTATCCCCTTTGATNGGTTGNAAAAAAATTA 

Sequence 1612 

aggctcccgcccccctnacngagcatttcacaaaaaaatcgnaacgc;tccaaagttcaag 
gaaggttgggcgaaaaacccccgacnagggacctattaaaaaagaataccccaaggccgt 

TTTTTCC 
Sequence 1613 

CGCTTACNCGGATTACNCTGGTCCCGCCTTNTCTCCCTTTCGGGAAAGNCGTGGGCCGCN 
TTTTCTTCATAAGTC ATC AACGGCTTGTAAGGGGTATTCTTCAAGTT TCGGTTGGTTAG N 
GTTCGGTTTCCGGNTTCNCAAMGACTGGGGGCCTTGNTGTTGGCACCGNAAACCCCCCC 
CCGNTTTCAAGCTCCCGMCNCGCCTGCNGCCCCTTTAATTCCCGGGGTNAAACCTTATT 
NCGTTTCTTTGGAGTTTCCCAAAACCCCCCGGGTTAANAGGGA^.NCAACCGNAAC Mill 
AATTCNGNCCCAACCTTGGGGCCAAGGTCAATGNCCCAACCTTGGGGGTTAAAA 
Sequence 1614 

CCGGGCAGGTACTGGGCTCTGACCACTATTGGTTTTGAGACCACGATGTTGGGAGGGTAT 

GTTTACAGCACTCCAGCCAAAAAATACAGCACTGGCATGATTCACCTTCTCCTGCAGGTG 

ACCATTGATGGCAGGAACTACATTGTCGATGCTGGGTTTGGACCGCTCATACCAAGATGT 

GGCNAGCNCTCTGGMGTTAATTTTCTGGGGNAAAGGGATCAIsIGCCTCAGGTTGCCCTTG 

TGGTCCCTCCCGTTTTGACCGGGAAAAGNANGAANTGGGATTTCCTGGGATATTCCTANN 
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GAACCCAAAAATTCAGNAAAGGGGGNAACCANGGTTNCCCCTTNGNGGACCCCGGCCTCC 

TTTANNANACCTTAAGGTNGGGGAATTNCNCCNCCTCCGGGGGCCCTTGGCCAAGGGGAA 

ATTTTTCGGAATTATTTCAAAAANNCTTTTAANTTCNGMNTTATCNC 

CTCTTNCAAAGGGGGGGGGGGGGCCC 

Sequence 1615 

CGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACATCCAGAAATAGATCCAAGAAA 
TGGGGTGGTTGAGTGGGTCCGCACGAAATGCTTGATTATGTCAGCAACACCCAACACTGT 
CTGTTTTCCATTTG NTGGTTTTMTCATAAAATTGTCAAGTGATNCN NGTTTTTAAAAAN 
AAAAAAAAAAAAAAGTACCTGCCCG 
Sequence 1616 

ATCACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAT 

TTAAGAAAAGAACAAGGTTAACTAACTAAAAGCAGAACTCACTTA I MM IGCTCCCTAG 

CCAATTAAAAATAAGTTCATTAAAAGCACTTGAMTTATATAmAACCTGAMAAAAGT 

TG CTAAAATTCC AATATAMTGTAAATATCTTTAACTTGCTTAACCC AGCTATCC C C AAA 

ACAGTGTAGTGGGGCAAAATGTTCAAAAGAAAAATCATCCAGTGCACGTAGATGGGCACC 

AAGAAGCTAAGCTTCCCTGAGCGCGTACCT 

Sequence 1617 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 
GGGAGGTTAAGGGAGAAGATGGCGGCGCTAGGGGAACCCGTGCGGCTGGAGAGAGATATT 
TGTAGAGCAATTGAATTATTGGAAAAACTACAAAGGAGTGGAGAAGTACCT 
Sequence 1618 

CCGCGGTGGCGGCCGAGGTACAGGTTTGTAGCCTAGGAGCAATAGGCAATATCATGTAGC 
CCAGGTGTGTAGTAGACTGTCTCATCTAGGTTTGCGTTAGTGTGTCTATGATGTTGGCAC 
AAGGACAAAATCAC CTAAC AATGAATTTCTTGG CAG GTGTCTCTGTTAAG AAG ACATGAC 
TCCAGTACCTGCCCG 

Sequence 1619 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTl 
IIIMimiUlllllUM I AGGCTTTTTGGANACCTTGCCCTTGCTGGCTAANTTAT 
CCAAGCAGGTTTTGATATANCTTTTATAGTAATCCACCTGCTCCCCATAAAAGGNGGCCT 
TANAGTTCAAANCANCGTATGTCTGTTGCAGTTTCACTANTTCGGCCTTTCTTTCTNTGT 
CGGTACCTTGCCCGGGCGGCCGCTTN 
Sequence 1620 

TCACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGCCAGTTAT 
TACACTGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTACCAGTGGCCCTGGTGG 
ATGCACCATAGATGAGGAGCCTGGGAGCCTGCCAGGGTTTTCTGCTGGTACCTGCCCGGG 

CGGCCCGCTCTAGAACTAGTG 
Sequence 1621 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I ITTT TTTTT TTT 
rTTTTTTTTTTTn I CTGGGACATCATTCANATGTTACCAAATCAGTCACCTTAG mm 
AATCCTCACTATCATGCTAACATNTAACACTTGGATNGCATTTTGTGACGTGTAAAGNGT 
TTCCNCCTNTCCCAGGATATTTTAATTT 
Sequence 1622 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANGACTCCAAGTCC 

AMGCAMTACCCATGGCATGMCAMTAGGATGCCTMGATTCTAACATTTGCTTCGGC 

ATGGCTTTCCTCCTCACTTCCCTACCTCACAACAAGAAAAAATTTCCTAGATGGGACAAT 

GGCAAGATATTGTGCTAATACATTTCATAGGATTTCTAGATGCTGGGCTATAACATCCAG 

GTTGCAATTTGGTGACGGATTTTTTTATTTCTATTGCAAATGGACTACACT^ 

AAAAGATTCTTGCCATAGCATCCAAACATCCCTATGTAGACCCACACTTCCTCTCAATAC 
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AGAAAGAGAATCACTCAGAAAGGACAAAGCTGGCAGACCAAGAAAGGAGGCAAGGGTAAA 

CTCTGTCACTGGCCAATTNTCTAAAACAAAAAT 

Sequence 1623 

ACTAGTGGATCCCCCGGGCTTGCAGGAATTTNGATATCAAGCTTATCGATACCNnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGCGCT 

TTGGGNGTAATCATGGACATTAGCTTG I ( I I I CTTGTGTGAAAATTGGTTATCCGCTTCA 

CAATTTTNCANACTATCCATAACGAGNCCCNGGGTAGGCATTAAAAGGTGGTAANTGGCC 

CTNGGGGGGTGCNCCTTAAATGTAGTTGGAGCNTAAACCTTCAAACAATTTAAATTTGGC 

CGTTTTGGCCGGCCTTCCANCTTGNCTNCCGTCNTTTTTCTCCNAGGTACNGGGGGAAAA 

AA 

Sequence 1624 

AGGTACTGCTTGTGATCCAGCTACTGAGGTCTGAACTCGAACAGACCTGGGAAGTGNCTC 

AGAGCTGTCTNTCAGGCTCTGAGAGAAAGTAGCCAACATTACAGTAGACAGAGAGAGACA 

GAGATCMGCACAGAGACAGAGACAGCATMGGCAAAAATACCTTTATGAAAATTGCCAA 

GTCCTCAMGCAAATCTTANNTTTAAACTTCTTNTNAGGCNAAACCTGTNTGGAACACAN 

GTC NTTTTAATC CTTC AAGGGTTTATATAAN N ATANTAAAATAATN AAGTTAC N CTGCTC 

CGGGAGTCGGGTCTCGACTTCTAAGAAACCTANGATGNGAATCACCTCNCGGNGCTTGCC 

AGTGGAATTTTCCGT 

Sequence 1625 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCAACATGTCCCGTGGT 

TCCAGCGCCGGTTTTGACCGCCACATTACCATTNTTTCACCCGAGGGTCGGNTCTACCAA 

GTAGAATATGCTTTNAAGGCTATTAACCANGGTGGCNTTACATCAGTATGCTGTCACCAG 

GGAAAGACTGTGCAGATNGTTGTCACACNNTAATTAAAGTACCTGCCCGGNCTNGACGAN 

CTATAANANGTGCAATNCGGAGCACNTTCACTGNNATGTTNCGCCGGAAGGCCTTCCTCC 

ACTGGTACCTGCCCGGGCGGCCGCTCG 

Sequence 1626 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCCATTCCCTTGAT 

GTCTACAATATCACCTTTCTTATAGATGCGCATATATGTGGCCAGAGGAACAACTATCAT 

GTTTTCTAAAAGGCCTANAGMCATATATCGGGTGCCTCTCCTNTTTCCNTNNGTGTNCG 

TCA tin GGCGAATTACTGGAAGATGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnCCCTTATNCGNTACCGGGGNNNTCCNGGGGGGGCCC 

GGTGCCCACTTTTAATGAGGGCTTAATNGNGGCGC7TNGGNGNAACGCTTNGGNCNTAAT 

CANGGTCNTGGGNTGAAATTTNGTTNTCCNATTTACAATTTCCNNACAAACATACCNAAN 

CCCGGGTGCCATTAAAAGTGTAAAAAGCCTGGGGGTGGCCTAATGAAGTGGAGCCTAACT 

TACCATTTAATTTGCGTTGGCGCTCNACTGGCCCCGCTTTTCCAAGTCCGGGAAAACCC 

Sequence 1627 

GGGCCTGAGGTTTTACTCCNNAAMGCANAGGAGGTGGCAACCTTGGCTTTGGGGTT7TG 

GCAGGCCCANGANAGGCAGNGGAGGANAAGCTCAAAGCCGGGTTTCATGTTTCACACCAA 

AGGTCTAATTTGTGGGGNAGAAGGACCAAATCCCAGATNCCCCTTGTTTAGCACANCAAT 

TTAGGNTTCACNAAMTTGTNNTNTTTTGGGCCAAAAAAACCATGTTGGACCACNCTCAT 

CCCATGGANTANATTTTGTACTTTAGATTCTCAAAGGAAAAAGA 

Sequence 1628 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTTGTTTCAAACT 

GTTCACTGTGATCTGTGGGTCTTTGAGTTTCAGTGAGTTTGCTGAAATGTCGAAGAAGTA 

GTTCCAMCTTCAATGTTCMTGAAATTTTTGTTCAAGTTTGAAATGG 

TAAAAGGTACCTGCCCG 

Sequence 1629 

TCTAAAGGAATAAGCTTGCGGCCGCTTAATTAATTTTTAGGCACATGTAGGGTGATCCAA 
ATTGGTCTGGGGTCAGGATTTGGCCCACMGTCTATCAGTTTTGCGTCATCTAGTTTTAG 
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TTTTGACTTTAGGTTTCTTGAAGGAAGGGACCA^ 

TCACGGCTTAAGGTGTTAAGCCCCTGNTNCAAAAGAAGCCTTGGTTACCAGGGNAAAANG 
GCCACCNTTTGTNTTTGTCAAAAAATATGAATTGTGGTTGGGGAATTGGAAAGGGGGGAA 
GNGGGTTAAGGGCCAANAANTGGGGAATGGAAGTTG 

Sequence 1630 

CTACTAAGGGAATAGCTTGCGGCCGCTTAATTAAAGATC! rTTTTTTTTCTTTTTTTAAC 

AAMTTCCTATAGMTATAACMTATTAAAGAGGGMCTTTMGTGMCCCCCACTTTTG 

GCCATAACTTCT1TTTGTGTCAGCCAGCAAGAGCCAAGAGAACATAACCAGGGTGGTAGG 

ATGTAGCTAGATMGCCTCTTTAAAGGCACAMGAGGCCACTTCTTCAACCAGATTTGCC 

AAAGCCCTTGGTTCAATGACCAGATAGGCCAACACGGGTAGTAGTGGGGTCATCAAGCCC 

TAAATGCAGCTTTCTGGGCTACCTTCTAGGATGCTCTGGAGCTTGGAACTGTCCACCTGA 

AAGAGGGTTGAGGGAGGCTGGGATGAATACAATATTTGGGTCCACTGGATTTTCCTCATA 

ACCCTTAGCCCC 

Sequence 1631 

GCCTTTTAAAGGGAATNAGCTTGCGGCCGCTTAATTAAAGATC I M I M I M i i m i i m 

TTMGAAGCTGTMTGATAGTMCAMGMGTCCMCCTTTATATTGCACAMTNTTTAC 

TAACCATGTAGCATCTNTGTGCCTGACCAGCCCTGACAGATGAG 

Sequence 1632 

CCTCCTAAAGGGAATANGCTTGCGGCCGCTTAATTAAAGATCTTN I I I I II 1 1 I II I I II 

TAMCGCACAGMCMCCTTTTGMGGTGCCAMTTTAAAGATGGTAAAGTAACAGAGAA 

GTAMCTGTAACCTMTTGCTATGGCTTCAGAATAACCCAGTCAAAANAC TTTCAA CATN 

N AMGGAAAMGGMTTTTAAMTTMGTTGCCCCTTTG AAGTTTC AAATATT^ 

NGGGTTCTTGNAA 1 M II I I I I I CCCTTACCTTTTTAGCNANAGGGGGNAANCA AAATTN 

TTAAGNTCCCTAAGAMTTTGNNTACCCCAGGGGGGTTGTTATTCNTNCCCCCCTTTTTN 

TTTCNAAAAAGATAAAATTTTTTGNCNAAGGGTTTTGGGAACCNT^ 

Sequence 1633 

CCTCCTAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC1 I I I I I I I I I 1 1 I U I I GG 
AGATGGAGTTTCAAACAAAAAAAAAGAAAGCAGAATCCACCATCTTCCTCCCTGCTGCAC 

ATGCTCTTCTG 
Sequence 1634 

CCGCTTAATTAAAGATC I I 1 1 1 1 II 1 1 1 1 M M 1 1 I GGTTTAATATGAGGTTNCATATTT 

ATTTTGGTCATAAACATATGAGTNTTTNATATATAACCAGGTGTCATGTGCTNTTTGGCC 

CNGTNTTTMNANCAAAAACCTAAAACAANATCAACCCATTCGCNCNAAAAAACCCCAA 

Sequence 1635 _ 
G CTAAATAACCCTC ACTAAAGGGAATAAG CTTGCGGCCGCTTAATTAAAGATC I lllin 
Tl 1 1 I I I I I I ATACTACTTATCACTATTTAAAATTATATTTAGGTACTTATTTGTTATCT 
GCTTCCCCTCTAGGCTTTCTGCTCTGCAAGGCCAGAGACGTTGTCTCTCTTCTTCATAGC 
TCGACCCCCACTGCGGAGAACAAGTGCTCAGCACTTGCTAACATGGCAGAAAC AAAG CTG 
TCTCCCTCAGTTCTCGTGGCAGCCCTCATTCTAGTTTGACTTCCTATGAMTTACTTTTC 
l- CTAGTTGTCTTGAAGGTCATGTTTTGCTTAAGATGTAGCTCCACCAAGCAGACAGCCCTA 
TGTGGGGTTTGGAGGTGAGCAGTGAGAGGTGGTACCCATAAGGAGGAACATCANATCTGC 
GGGGGGCAT 

Sequence 1636 

TACTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC 1 Ml M u u n u m i t u 

ACMCTAGTGTATTGTACACTAGCTCAGCTCCACCAAACTACCTGTTCGTTCTCTTTTAT 

TTGACATTGTTCACAGACTAGTACATATTACAATAAGAGTGCTGGATAAAAACATGAGGT 

ACNGAAAGTGGTTCAAAGGATTATAGGGTCATTGCAGNATCATGCTNGACAGTAAAAGAA 

AATTGTGACTGAGAAAACACTAANTAAAAATACATAAAGAATGTGCATTANTAAAAAAAA 

AAAAAGGACATTNNAGATNAAGNACNNCAAAAAAAAAAACCAAACTCCATTTNCANCAGC 
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CTTGTGGCTTTNTTTGGGTNACCTAMGTAAGGGNTTGGAACCMNTTTATCACCAAGTT 

TTTTTTAATTTCCCCCACCCCCGGAAANAAAAAAA 

Sequence 1637 

TAAAATAACCC TCACTAAAGGGAATAAGCTTGCGGCCGCTTAATTAAAGATC MINIM 

TT M I I M I I rCATTTTAGTCTTTAGTTCCTTTTTCAACTTCTTCTTTACCTCCACGGAG 

GAAGGCAGCTTGGTTTTCTCCTCATACACCTTGTGGAGAGGTTCGGGAGTTGGGAGGAGA 

CACTGNCGGNAGCAANGAGNTTATTAAGGGAAAAGTTTGGGGGGGGGCATTATTTTTGGA 

AGGGGTGGTCTCCCCAAGNTTTTTTGGCTTTTACNGTACCAAACNCTTNATTCANATNGG 

NANGGGCCAATTCCCNATmGGTTCACATTGGAANTTTTMTTTTGGGNAM 

GGAGGNGTTTTNCCNANCCCGNGCAAAAAANNNAAGNGNCCATATAAGNGGAAGTTTNGG 

GAAATNCNCCTTGGAACC 

Sequence 1638 

CTATAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC H 1 I 1 I 1 I 1 1 
I M I I M M 1 1 GTAACAGGATCTTGTTTTGTCATCCAGGCTGGAGTGCAGNGGCACAGTC 
ACAACTCACTGCAGCTTCAACCACATTGGCTCAAGTGATCCTCCCACCATCAGCC 1 UN 
GGTAGGCGGGACTATAAGCACATGCCACCATGCCCAGTCAAG I I I I I AATTTCTTTGTAG 
AGACAGAGATCTCACTATGTTGCCCAAGCTGATCTCAAACTCATGAGCTCAAGTGATCCT 
CCCATCTTGGCCCCCAAAGTGTTGGGATTACAGGCGTGAGCCACTGCTCCTGGCCTTGCT 
ACTCCTTTTATTATTCACAACCTCTAGAAATAGTATAGGAGATCCAAGCACTAGTCTCCC 
GTTGTCATGGATACAGAAAGTGGAGAAGAGAGCAGGATAAGTGGCTTGTTTGGGACTAGA 
CAAGGCTGTCACTAAATGACTCACTACTTCAGCCCTTTGCTCCTCTTGGCACC 
Sequence 1639 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGMTCATTTTGCTTCCAACCCCATTTA 

GCCTGCCATTGAAATGCAAAAGTCTGTTCCAAATAAAGCCTTGGAATTGAAGAATGAACA 

AACATTGAGAGCAGATGAGATACTCCCATCANAATCCAAACAAAAGGACTATGAAGAAAG 

TTCTTGGGATTCTGAGAGTCTCTGTGAGACTGTTTCACAGAAGGATGTGTGTTTACCCAA 

GGCTACACATCAAAAAGAAATAGATAAAATNAATGGAAAATTAGAAGAGTCTCCTGATAA 

TGATAGTTTTCTGAAGGCTCCCTGCAGAATGAAAGTTTC 

Sequence 1640 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAGGAACTGCTC 
AGTTAGGACCCAGACGGAACCATGGAAGCCCCAGCGCAGCTTCTCTTCCTCCTGCTACTC 
TGGCTCCCAGA 
Sequence 1641 

CGAGCATTNACAAAAAATTCNGACGCTCAAAGTCAGGANGGTTGGCCGANAACCCCGACC 

TGGGACCTATTAAMGGATTACCCANGGGCCGTTTTCCCCCCCTTNGGAAMGCCTTNCC 

CTTCGTTGCCGCCTTCTTCCCTTGGTTTCCCGAACCCCTTGGCCCGGCCTTTAACCCGGG 

MTTACCCCTTTGGTCCCCCGCNCCTTTTTCNTTC 

Sequence 1642 

AGGTACGCGGGGGGAAGAGGAGGAAGGAGGGCGAGCGAGGAGGATGGCGGAGTCGGGGCT 

CCTGACGGAACTCTAATGAATCATTGATTGACCAGCACTATTTTACCAGTTGGAATGAAT 

G ATC AG AAATGGGCNTAGCTGCTTTTAAATACN ACC NTG N G N AN ATGG ATGTTACTCC AT 

GCTGATTACTNCTTNNAGCCAGCTCCTNNCTNGGCGGTCCCNCTNGNNGTAGGNANnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCATGGTC 

ATMCTTGTTTCCTTGTGTGAAAATTTGCT^ 

CCNANCCCGGGAAGCCATTAAAAGTTGNTAAAAGCCCTGG 

Sequence 1643 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTCTTCATCTTC 
CCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGTTGAATAAC 
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TTCTATCCCAGAGAGGCCAAAGTACCTGCCCG 
Sequence 1644 

TCACTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACGGAAAACTCTCC 

TCATCTGGACTCGTGACGTCCTTGCAGCCCGAGTTGGCCATATCCCACTACGCCCCTGCA 

CTGGAGCCTGAAGCAGAGTGTAAGGAACGGCCAGAGAGCGCAACACTGGGGCCCACTACC 

CCGGCGCAGTGACCCGCCGCGCCCCCGCGTACCTTGCCCGnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACCCTCGAA 

GGGGGGGGGCCCCGGTACCCAAGCTTTTTTGNT 

Sequence 1645 

AGGTACACTTTNGCACCTGCAMTCTGTTTATTAATTAGACCATAGMTCTGTGTTTCAT 

TGGTGGTCAGCTGGCCAATCTGATGGTCTTGTCATACACAGGTAAGAAGAGACAAAGAGC 

CMGAGTTTGTTTGGAGGCCGGAGTGAGCTTGACAGAACTAGATGAAATTGGCCACTGGG 

CACTGGGACCCAGAAAGGCCTCATTGCCACAGAGTTGCAACAGAGTGTGGGGTCCCATTA 

TCAGAATCCTTTCTATGGTGCATCCAATTGGCTATGACTCTCCTAAGAACAGCGCTGCAG 

GAAAATGCAAGTTCACAGGATAATGAATGCACTGTTGCTTCTCT 

Sequence 1646 

AGGTACGCGGGATGCTCTGGTTTMTGTTTTTCAATAGTGTGATA CAATT GTTCAACTTA 

TTGGCAGCTTTAGTGGTATTTGMCAACTTTGCTATGTTTGGGTAATTTTCCACCCTGTG 

AATCAGCTCCCCAAAGCAGAAGTCATAAAATCTGTTTTCCAGCCAGGGTGTGGAAACCTG 

GGCCCCTGGTATCAGGTCCACCCATGTGCAAGCTATGGCATTTGTGCCCAGTGGCAACAC 

AAAGCGATCTTCACCAGAACATTCCTCTCCTGAGGTTTCAGCACTGCTCTTGGCTACGTA 

GCCACCGCTAAGCTTCTCTCTCTGGCCTTCCTAGGAA 

Sequence 1647 

TACTCACTATAGGGCGAATTGGTAGCTCCACCTGCGGTGGCGGCCGAGGTACACGGGGAC 
CTCANAGCTGAGCTGGGCATGAGNAGATGCTCAGTAAGTGGTAGCACAGGGTTGGTCCCT 
ACGGTGGAGGCCCCCTAACACCG 
Sequence 1648 

CCGGGCAGGTACCAAAAGAATGTTCATANGCAGCATTATTTGTCATAGTCAAGAGCTGGA 

AGCAACCCAGTTGTGCATCAATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAA 

TAAAATACTATAGAGCAATGAAAATAAATAAACTATGGCTATNTGCAACAACATGGATAA 

ATCTTACCAACAATGTTAGACAAATGTAGCCAAACCAAAA AATAC ACATGTTCTATGAAT 

CCATATATGCAAGGTTCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAA 

AGGACTGAGAGAGTATTGCAGAGGGGGCTCCGGGTGTTTGTMTTTCTATTTCCCAGTGC 

AGGTGTGGTTTGCATGANCATGTTCACTTTGTGGAAATTCATTGAGCTGTAA 

Sequence 1649 

CCGCGGTGGCGGCCGAGGTACTGGGGCCTTACTTCCCTAAAGACCCAGGGAGGGGAGAGA 

AAGTGCTCTGCTTGTATCCTAAGCACAAATGCAATACAGCAACATCAGTAATGGTAACTG 

AGTG CTTACTGCATGCCAGACCCGTGCTAAG CTCTCTCCATGCACTATCTTAATGTTCAC 

MCCTCACACAGTAAGCATTATT.AACAACAGCACTTTACAMTAGAAATT TGTTG AGGCT 

CAGAGTTGCCATAGCTGGTGAGCATCCACAGCCTACATTCAMCCCAAGTTTTCATTTCC 

AMGCTTTGCTTTCCTGCTGCTGGATGTCTTTTTGCGTCATCTTCGCATATTTAACCTCC 

TTNGGCTCAACTGGGGAACC 

Sequence 1650 

CCGCGGTGGCGGCCGAGGTACTGGGGCCTTACTTCCCTAAAGACCCAGGGAGGGGAGAGA 

AAGTGCTCTGCTTGTATCCTAAGCACAAATGCAATACAGCAACATCAGTAATGGTAACTG 

AGTGCTTACTGCATGCCAGACCCGTGCTAAGCTCTCTCCATGCACTATCTTAATGTTCAC 

AACCTCACACAGTAAGCATTATTAACAACAGCACTTTACAAATAGAAATTTGTTGAGGCT 

CAGAGTTGCCATAGCTGGTGAGCATCCACAGCCTACATTCAAACCCAAGTTTCATTCCAA 

AGCTTTGCTTTCCTGCTGCTGTATGTCTTTTGCGTCATCTTCGCATATTTAACCTCCTTG 
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GCTCAACTGGGAACTCTTTGMGGCTAGGGCCATCAATTACATTTTTTGTTTCTTCCA 
Sequence 1651 

TCGCGGTGGCGGCCGAGGTACCGTTCCTCCAGTGCCCAGAGATGCTCTCCGCACCAAGCC 

ACAGATGTGGAGGAGGCAGGTCTGGACTTCAGGCAGACAAGGAACTTCAGAAAATCCTGG 

GGAGAGACAGCACANGGTAAGAGGNTAGGTCAGNAGAAANTCTNAAGGCTCCCTTTTTTC 

AACTTTAGNCACTGTACCATGCCCCGGGGCGGGCNCGTNTCTAGTAACCTAAGTGGGAAT 

CCCCCGGGCTTGCAAGGNAANTTCNGATTATTCAAAGCNTTATCTGTATTACCCGTTCGT 

ACACTTACNAAGGGGGGGGTGCCCCTGGNTACCCNCAANCCTTTNTTGTTTTCNCCCTTN 

TTAGATGGAGGGGGTTTAAATTTTGNCTGCNGGCCTTTGGGTCCGTTAAAATTNCAAATG 

NGGTTNCAATTAAGCCTTGGTTTTTCCACTTGNTGGTTGGAAAAA 

Sequence 1652 

CCGGGCAGGTACCAGCCAATCCCTTCATTAMTGTATACAGATTTAGTTAAGTAGCATTA 

MTAGGATTCTTAGAAGTATGTCCTCATAGAACTTTTAATACTTMGGCTTTGTAAAAAC 

TATCCATGAAGGGAAAGCTCCTCAGCATAACTGCTCAGGGAAATAGGGCTAAATAACTGA 

ACATTAAATAATTGGTTAAAGGTGCTGTTAGCCGAGCCTCAATGCTTGCTACAAGGATGT 

ATGTACCT 

Sequence 1653 

CCGGGCAGGTACGGGGGTCTGCTTAAAACTCTTCAGAGGTTCACCATCGCTTCCAGAAAA 

AGATCAATACTCCTGACGTGGCAACCTGTCGAGACTCAGGACCGCTAGGAAATCAGCTCT 

TGTTTACCCCTCTATCCATTACACTACGATCCAGCTGGGTTTAGGCATCACTTCCTCTGA 

GCAATCCTCCCTGATGTCATGAGGCAGTGCTGAGTGCCTTCTCTGGACCCTCTGTTTACT 

TGCCTGTTAAATCCTGAACATCTGGAAGGGAGTCGTGTTCAGCAACATATCTACGCCTTG 

CAGCACGGTGCCTGGCCTGCAGCTGGTGTTCCGTAAACAGCTGCTGACT 

Sequence 1654 

AGGTACGCGGGAGTAMTGCAGTACAAATTGTTCTACTGTrTTAAAMGTmCCGCAGA 

ACAG TGCAT TTATGGCAATGCTATGTTTAATGAGTTAGGGACATCAAATATATAGTAGTT 

CCTTATTTTCAGTTGTGAAAATGAMTGGCTAAAGCAGAAGAGACGTCTATTTTAGTCTT 

TTAAAAATGTGTGTGGGTGGTCTTTTTTCCTCAGAAGCCCAAAGCACATGTATATTTTGT 

TATTTCTCCTTGCTATATTCCTGAGACTATACTAAAAACTTTAAGAAAAGGMCAAGAAA 

AAGGTAAATTCATGTGTTCCCCACTGCTGTGTCTAGAACCAAGATCACATTA 

Sequence 1655 

CCGGGCAGGTACCAGCCAATCCCTTCATTAAATGTATACAGATTTAGTTAAGTAGCATTA 

MTAGGATTCTTAGAAGTATGTCCTCATAGAACTTTTAATACTTMGGCTTTGTAAAAAC 

TATCCATGAAGGGAAAGCTCCTCAGCATAACTGCTCAGGGAAATAGGGCTAAATAACTGA 

ACATTAMTAATTGGTTAAAGGTGCTGTTAGCCGAGCCTCAATGCTTGCTACAAGGATGT 

ATGTACCT 

Sequence 1656 

CCGGGCAGGTACCTGGTAATTTACAGAACGTTCGTTATCAGATTTGATAATTATATATTA 

AGGTTGTAAAGAATGTGTATGCCAGAATTATAGGTAGTAGATCCTAGATTCTTAGGAAAA 

ATAG1TTCTTTATAATCTTTTGAGTAGTGAAATGGTTACTTTTACMTGGTTATGAACTC 

GGTCAAGGCAAAAGGGCCACTATATGTCTTCAGTCATCTTTCTATGCCTGAAATCCAGGA 

AACAGTGAAAATGGATGTTCCCTGGAACAGCCATTGCGATGCCATATGTTGGTCATTGGT 

GTCCTTAAAGTGTATCTCAGATAATTGTGTGTCTCCCT 

Sequence 1657 

TNACCGCGGNGGCGGCCGAGGTACAATTGATATAACAGCTTTGTTGCCACCCCAGGCTTT 
TCTGTGATGATGCCATGGGCCACATTCTGATCAAACTGCACACCCAGAAGGTGAAGTGTT 
GGCTCCAAGCGAGAAAAATTATTAAGTTTGGCACTTGAAACCCTGCTGTCCAAAAATTCT 
G AAAAATC ATC CTG AAGTTC AAACTTGTGTAG AACTTCTCC AAGTAG ATAGC C ACTGG AA 
MTGCCTTTGCAAATGACTTGGGACTCACGGTCCGGGACACCTTCAACTCCTTGTTGAGC 
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CACTGGCACAGGGATCTCCGACATGGTTGAGCCCGGTGCCTCATACCGCANGCGGGGGCT 

TAGAAA 
Sequence 1658 

TCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGTTGAGAGC 

CAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTC 

TACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCATT 

CACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGC TCCA CCCTGCAAA 

CTTTATGCTTTTCTGTTACATMTCAGGCAGGGGCMGACCTAMCTATTTTGAATTGGT 

GGTGTTGAGGCTAAATTCTCTGCTATTGACAGAATTGAGAATGTGATCMTTTCAGAGTA 

GCATGCTACAAATTTTGTCCCMTTTCAATGGGGAGAA 

Sequence 1659 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCCCTNTGGTNGTNGCTCAGGAAGCTTTANA 

NATATATGGCTATTTAATTTGCAMTAATACTTAGAACTGCAAAMCACGTTCACCTGGT 

CTTGTTGAGAAAGAAATCATAAGTCCCCTGACTTAAACTCTCTCAATTGATGAAGTGACC 

AGGATTACTCAGAGTGCAGTTATTGGCACCCCCGAAGCACCAGAGTCCAGGAAAGATGCT 

TATCMTCTTTTGCCATTGGCTTCTACTTGCAGGTGCCTTCTCTACTATTCAATCATATA 

CGATCATTGGAAAGACAAATGAACTCAAGAAAAGCAGCGAAACAGTACCTCGGCCGCTCT 

AGAACTAGGTGGGATCCCCCCGG 

Sequence 1660 

CCGCGGTGGCGGCCGAGGTNCCCGGGGGATGCTGCCACCTAGGTTACTTGTAGGACCCTA 

TACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAGTGAAG 

TAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGT 

CTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAA 

AMCATTAAAAMTGACAAGGMGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGGAGG 

TGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACC 

Sequence 1661 

AGGTACAGTTACAACTGACTTGCATAACACTAAAGAAAACATTGTAATTCCTACGCCAGT 

AGTGGGGGTAMCTGAGAACCMCCTMTTTTCACCACAGGGCCCTCTGGAAATAGAGGA 

ACTAAAATGATGACTGTTTGAAAGTGAAGAGGCAGATTCCTAGAACATCACTCCTACTGC 

ATTCCTTACCTCCACTCCATACCCTTTCCTTCTTCACTCAGGTAGGACTGTCTCTGGGAA 

TGCATGGACAGCATGCCACAGTTCAG NAGCTTGGGATGTGATCCTGGGAAATGG GGC CTT 

AAGATGACCCTTTATGGAAACACCTGGAAT 

Sequence 1662 

CTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCAGGGTCAGGTCCGTCATCGCCA 
CCCCAAGCCCGCGGGCGGCGGCATCCAGTGCCAGTTCGTCGAGGTTGAACGGGATATGCC 
GGTAGTCGTCCAGCGACCGCGCACCATTGGCCGCCAGCCAGGTC/TCCAGTCTTGCTTGT 

0 

Sequence 1663 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGC/vGGTACCTGGAAAGATG 

AACAGGATTGCAGCAGATAGAGACAGGTAGAAGGAGAGAGATTCCAAGTGGAAGGATAAA 

GGAGAGGGAAGAGAAGCAAAGAGTGCMCTTCCAAGAAGATTTTCTGGTTTCTGGCACTG 

GTCATAGATAGCTTGGGAGGAAACAGGCAGAGAGGGTTTTGGAGAAACAGAGCTGCAAAG 

CCAAGTTAGATGTTAGGTGAGTTGAAAAAATAACTTAGGGCAAAAGCAGAGGGAGGGAGG 

TATTTACTCTGAACAGAGATGACAGGGTGGGGGAGGACAGATTGTTTTGAGTAGGCAAAG 

GCCANGAGCATGAAGCAAGCTAGGAATTG 

Sequence 1664 

CCG G GC AGGTACTATTAAATTTAATG AG AGGTAACCC ATGC ACCGGC AGGAAGTTCAAG G 
GGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCTCACCTCCTTCCTTCTAGT 
TTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCAGGCCCTTTTGATTAGGAT 
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GCTGTCCTAAATCCTTCCTCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCTTTATCGATACCCGTCGACCCTCGAGGGGGGGGGCC 

CCCGGTACCCCAGNCTTTTTGTTTCCCTTTTAAGTGAAGGGGTTAAAATTGCCGCCGCTT 

GGGCCGTAAATCAATGGGTCATAAG 

Sequence 1665 

CGCCCGGGCAGGTACACCAGAACTTAAAGTATAATAATAAAAAATAAAGTATTACAGAAC 
ATGGGTCAATTATATTAAAAAGTAGATTAATTAATGTCCATATATACAI I I I 1 1 GGTGGA 
AATCAAAC ATAATGAAAACTATTTC ACATAATTAATATTTTGTTTA I i I I I I ACTTATTA 
ATTTATTGCAGAGATGGGAGTCTTGCTAGGTTGCCCAGGCTAGTCTCAAACTCCTGGCCT 
CAAGTAATCCTCCCACCTCAATCCCCCAAAATGCTGGGGGATTTCAGGGAACATGCCCAC 
TGGCACTTTATTAGGTGTTACCCTGGTNCATAAM^ 
ATTTTTCAGGTATATTATATTATACCAC 
Sequence 1666 

ATACCAGGCCGTTTCCCCCTGGNAAGGCTNCCTCGTGGCGCCTCTCCCTGTTNCCGACCC 
CTTGCCCGNTTTTACCGGGAAANACCCCTGTTCCCGCCC I I I I I 1 1 I I CCCCTTTTCCGG 
GGGA 

Sequence 1667 

AGGTACTCTTGTTTAACCATCAGAGGTGATCCCATCACCTTCACAGCCCCAGCCTCTGCT 

CCAGTCCCTCCCCAGCGAAAAGGGCCGCCCATGCCATCCTGCTGCTGGTGATTTGCTTTG 

TGGTCATGGACTCAGTGGACATCATTATTTTATTAACCGTGTGGTAGATTTTTAACTCAG 

TTATCCTGGATATCCAMGGTTTGTGGTCCATCTTTAGGCTTCCGTTTGTTCTTTGGTAC 

CTGCCCGGGCCGGCCCGCTCTAGAACTAGTGGGATCCCCCCGGGCCTGCAGGGAATTTCG 

ATATCAAGCTTATTCGATACCCGGTCGACCCTCGAGGGGGGGGGGCCCCGGGTACCCCAA 

GC Mill 

Sequence 1668 

GGANCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTG 

CTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGG 

CAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCA 

GAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAMCATTAAA 

AAATGACAAGGAAGTTAATGGAAGTCAGCCAATGTGATGNGTGTTTTGGAGGTGNGAGCC 

CTTCAGNAAGGGTMTTTMTGCNCCTTGTTMGNAAGAAGGCCCANNMGAGCTTTGCG 

CCACCTTTCTTCCCTGCCCATTGTGGAGGGAAGCCCAAGNAAAGCCCGGC 

Sequence 1669 

ACTAC"iTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGCG 

TGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAGTTAGGCTACTCGAGGGTCT 

GGGACCCACTTGAGGAGGCAGTCTGTCCGTTCTCAGATCTCCAGCTGCGTGCTGGGTGCT 

GGGAGAACCACTACTCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnrnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnAGGGTTAAATNTGCGCCGCTTGGCCGTAAATCATGGGTCA 

TTAGCCTGTTTCCCTGTGTGGAAAATTGTTTATTCCCGCTCAACAATTTCCCACACCAAC 

ATTACCGAAG 

Sequence 1670 

CCGCGGTGGCGGCCGAGGTACACACTAGTrTTGTCATACCGTGAGGCAGTGTATTAAAGA 

GGAMTGTTCTGTCTTTGGGACCAGACAGACTTGTTTCCATAACCCTGATTGTCTAGCAC 

TTTATAGCTCTGAGTCCATGGACCACCACTTAGCCATCCCTACCATCAGI MM IGTAAA 

GAGCTATGTCAGGAATGTGGAAGGATTCAAGAGAGGGCTTTTAAAGTGTCCATCACGACC 

TTTAACCCAAGGGGAATTAATGCAGGAACAGAAAACCCAAATACCACATGGTTCTCACTT 

TATCTGTAGGGAGCCTAAAACATTTGGAGGCCACCACCATGGGACCACCGAAGGGGGAGG 

CCAATTAGGACCCTACCTATTTGTAGGG 
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Sequence 1671 

GCGTTGGCGCTCACCTGCCCGCTTTCCAGTGCGGGGAAACCTGGTCGTGCCCAGCCTGCA 
TTTMTGAAATCCGGGCCAACCGCCGCCGGGGGGAGAAGGCCGGTTTTGCCGTTATTGGG 
GCCGCCTCTTCCCGC 
Sequence 1672 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTAT 

TTCTMCAGGCTTGTAGAGAAACGTCGGTTCATGTAAATTAGAMTTATGGGGCCACTTT 

GCCATTCTTCACACCTGCAATGAACAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAA 

CTCCATTTGCATGTTATAGTGGGATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCA 

CTACATCTATATATGCATTCATATTCCAGTGGCTGTGAAGTGTTGGGCTGTTGGTTGACA 

GAAGGGAGCTGCATCCTTCTGGGAGGGAAACTGAACTTCAGCTGGATGAAAAGCCCACCC 

TGGGTCCTTGGGAGGGTGAAAGCATCTTTCCAAAATGGACAGGCCTGGCAGGTCAATGGA 

CTGGAT 

Sequence 1673 

CCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCT 
GACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCT 
TTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACA 
TTTC 

Sequence 1674 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAATATTTA 

TAGATATGAGGCTTCACTTTGTTGAATTCATCATACACACACACTTCTTTTTAGCAGAAA 

TTAATAGAAAGCCAATTCTAAATTAATGTAGGTAATACATAATCTACTCMCTGGATGAA 

TACCTTGGTTGCAAGTCTTGTCCTGAGCGAGCTAAGTGACAACACTCTCAATCTACTGAG 

CTTCCAGCATCTAGGCAGAGCTCCCAGGACAAGCAGAAGTCACAGGGAACAGGAAGCAGG 

AAMGAGCAGGAAMGATCCTTTAGTTTCCATGTGTAGAAACAAATGCTCAACTGNCTTN 

TGTATTGAANCCTTCTGGATGCCTGNCNNTTTTATTTGGTCANTTGGNGGN ATAGGAAA A 

TCCTTTTTTAGGTTGGGCAGNCCNATTTANAAAAATTTGGGAAACCCCNCCCI I I I I I IN 

AAAAAA 

Sequence 1675 

GTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAATAAGGTGTGGACTCACCA 
AAGACAGTTCCAAAGTCAATTTCACTCTGACACACTCTCTGTGATCTTCCACAGTCAGCA 
CAATGCCTGCCCCCTGCTAGGCCTGATGGATGATCTGATAGAAAACTGGCTTCAGC 
Sequence 1676 

CCGCGGTGGCGGCCGAGGTACTGTTCGCTGCTTTTCTTGAGTTCATTTGTCTTTCCAATG 

ATCGTATATGATTGAATAGTAGAGAAGGCACCTGCAAGTAGAAGCCAATGGCAAAAGATT 

GATAAGCATCTTTCCTGGACTCTGGTGCTTCGGGGGTGCCAATAACTGCACTCTGAGTAA 

TCCTGGTCACTTCATCMTTGAGAGAGTTTAAGTCAGGGGACTTATGATTTCTTTCTCAA 

CAAGACCAGGTGAACGTGTTTTTGCAGTTCTAAGTATTATTTGCAAATTAAATAGCCATA 

TATCTCTAAAGCTTCCTGAGCAACAACCAAGAGGTACCTGCCCGGGCGGCCGCTCTAGAA 

CTAG 

Sequence 1677 

AGGTACGCGGGACATTGATGGGAGAGATGTGGAGGAAGAGGGAAGAGGTCATTTTCCGTG 

MTACAGCTmCTTTTTACCCTTTAAACATTTTTGCTCCC 

ACTTCTGTTTGCTTGGGTTCATCTGGCTCTCCATGAATGTCCTGCTTTTCTGGAAAACCT 

TCTTGCTGTATAACCAAGGGCCAGAGTATCACTACCTCCACCAGATGTTGGGG CTAG GAT 

TGTGTCTMGCAGAGCCTCAGCATCTGTTCTTAACCTCAACTGCAGCCTTATCCTTTTAC 

CCATGTGCCCGAACACTCTTGGCTTACCTCCGAGGGATCACAGNAAGGTTCCAAGCAGGG 

AGAACCAGGGAGATTGTTGGATAAAAGCAGA 

Sequence 1678 
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CTACTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTAGGGGCC 

CTTTGCTGCAAGCTTAGGCCAGTGTCAGCTCCTGAGCCACTCCTGACTCAGTTGGAGGAG 

ACTCAGTGTCCTCTGCCCCCTGCCTGGTGGAGCTTGGCAGCAAGTCCCAAGTTTTACAGT 

GTTGAGACCAAGTGGGCAGCCATCACCATTGAGGGGTGCCCTTTTGCTCCCCCTCGAGTC 

CMGTTATAGTATTGCTTACTTCCTACCTGAAATAGTTTATGGGTCATGGGTCTGGCTTA 

CATCAAGCCCCATAGCCAGTCTGGTTGCCCCACCTAAGGTCTTGGTGCTCTTTGATAATA 

GCCACAGCTAAATGTATTAGTCCTGTCCTAGGGAAATGCATTGATCTTCTNCCCAAAAAT 

ATATA 

Sequence 1679 

TTCTCTTACTGATAGTAGGATAT1TCTGCTTTAGTTATTGTCACCTTAAATATATTTTCA 
ATGTTGAAATCCTCACAGCATGTTTGATGAAATCTAGTTTTCAAATTTTC™ 
TTCTGTCACGTTGGCATGATAACAAATGCAATAACCCAAAAGACCCCAAAAGCTAGTGTA 
ATCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCATGTTGACAGTGCGTGAATGTTCG 
GTAG GCTTTG TCAAGCTTGCATACAATAAATTATATATGTCCCTTTTCTTTTAGG GTC 
Sequence 1680 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGGGGACCGCCAGGGGCCTCGAGAATCG 

GTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCATTAAGTGCAAACACATTGTT 

CTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCCGAGACTAAI I I I I CCATTGCTAACT 

CTAGATTCTCGGCCACTGGAGTCTGMGATACTCTCTTTGAGAATGCATATTATTTTGCT 

CACAGCTAAAACATTTAAGTATCATAGCTGATCAGTGGAGTGAGATTAAAMGGTTTCTT 

TTTTGAATCATCAGCTAGAAGATGTACCCTCGGCCGCTCTAGAACTAGGTGGGATC 

Sequence 1681 

CCGCGGTGGCGGCCGAGGTACAGCAATGGCACACTCTATCCTAATAATGACATCCTGTAT 

AATTTAGTATGCTCTGTATCTCACTTTAAATGTAGCATTAAGGATGAAGCAGTAGAAAAT 

GAACGCCGGGCCTTTTAATAATGAGGCTCAGGTTTGCTATCTTAATTTCAGGAATTAGGA 

ATGGGAGGGATGCTAAGGAGACTGTTTAAATTTTTAAAAAGTTCTCTCTATGCCTGATAC 

AGATGCAAATTGCTTGCTCATTACATTCTTTCAGCCT 

ATCATTCTCTGGCATTAAGACAGCTTACCTGCCMGCTTGGACATTTTCTATAACCCAA 
Sequence 1682 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTGATTTTAAG 

AGAATAAATGTCCAGATTCCCAACTCCTATATGCACI 1 1 I I ACTAAAATGATTCTGCTTG 

AGAATGTGCCTGCAGTGTGAGGGAATGCCGTCTCTTTTCTCTTTTCCCATTTTCCATTCA 

CCCTTTAAAAAGGCAAAGCACACCTCTTCCTCATCCTGAGTCTATTCTGGCATATCACCT 

TGTGGTGTAAAAACCTCCAGGCACCAAACAAAGGGACACTTTGAAAATATTAGTGTCATA 

AACTTGTGATAGGGCTCCAAATCTTTCCTTGCTGTTATATATGTATTTCTGTTAAGGAAA 

GAAGTGGGATAGAAATACAMGTTG/iGGAGAAAAAGGAAGGGATGTAAAGCTTTCTGGAT 

TGTCT 

Sequence 1683 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGCTCTTGATA 

TATCCTGGCCTTAAGTCTCAGGTTCTCCAACAAAGCTGGCAGCCTGTGGTATGCCTTCTG 

GGGCCAGAGAGTGTGGCATGATCCGAGGTCCTGAGTCTCTCTCTCAATGAGCCCATCATA 

TTGGCTTAACCAAAGCCAGTGGTCTTCAGGA 

Sequence 1684 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAGATGAAGAGGC 

AAGAAAATTATGCCAAATTACCCTGTTTCCCTTTGTCACACTGCCTCCTCGCTGACACCT 

TCATTCTCCTGCAACCTTACACTTTGATGATGACTTGCTCTGCTTCCATGGAACTACTTC 

ACTTGGCCTATGGCACAAGGCACTTCCTTCCGGCAACATTCGCACCTTGGGAGGGGGCTA 

TTTTTGCATGCACCTAGTCCATTGCAATCCATTTGTTCTTGAAGGGCAAGATAGAAACTT 

TGTTTTATTATCTACACTCACGAAAAACAAATTTTGTGCACCATTCCCAACACTATTCTT 
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TTTTC UH1I T ATTTTAGTAAAAGG 
Sequence 1685 

GTGGGCCGGGCCCGAAGGTTAACGCCGGGGGAAGCCTACCGGGCNA AATCC CTGGAAACT 

TTCCCTGGAMGAATGGTTCCTTTGGANTGGTGGCAGGCTTGGGGCCATTTTCCCT7TCC 

GAACTTTCTTCTTCCCANCCCCTAAGCCTTTCCCCAAGAAACCAATTCACCATTAATCAA 

CTTGCCAAAAAAAATTMGGCCMTTTGCCANTAACCATTGGGNNATTCCAAGGGCCCCC 

AGGTTGGGGAAAAMTTGGTTAAAMGGAAMNGGCCCCCTTGGNAANGCCTTNNATTGG 

GGGGGGTTCCAAAAAATTGAAMGGGGTTGGAMTTTTCCAANANGGCCTTGGANAANGG 

GAAAAAATTATGNNCAAAAAATTTTCCAACCCCTTACCMCCMGGTTTTTNTTTG 

AAGGGAATTGGGGTTTTNGCCAACCCGGAAAAAACCMCCAACNTTTGGGGGGGGGAAAA 

ATTTGNGGGMGGCCMNAAAMNCCMNGTTTCTTTTmGGAAAAAT^ 

NAAAACCAACCCTCCCAAAAGGGGGCCTTTGGNTNGGAAAGNAACCCTTAACCC CCTTA A 

ATTTrGGGTTAAGNAAATTAANTTTTNGGCCCACCCCCCCTTTATTGGGAACCCAA 

TGGGGNTTTNGGGTTTCCCCTTGGNAAATTTCCAAAAGGGA 

TTGGGGGGAANCCGG I I I I I GGGGCCCCCCCTTNGTTTTTTTGGCC I ( I I i I U I IAATT 
TAAAA 

Sequence 1686 

TAMGATACCAGGGCCGTTTTCCCCCTGGAAAGCTCCCTTCGTGCCGCTTCTCCTGGTTC 

CGACCCCTTGCCCGCTTTAACCCGGMTACCTGGTTCCGCCTTTTTCTCCCCTTTCGGGG 

AMGCCGTTGGGCGGCTTTTTCTTNMTAAGNCTTNMCGCCTGNTAGGGTATTCTTCAA 

GTTTNNGGTGGTAAGGGTTCGGTTTCGGCTTCCCAMGCCTTGGGGCTTGGTTGTTGGCA 

ACGGAAAACCCCCCCCGGTTTCAAGACCNCNGAACCCGGCTTTGGNCGGCCCNTTNATTN 

CCCGGGTTAMCCTTATTCCGGTCTTTTTGGMGG7TCCCAAACCCCCGGGGTTNAAGGA 

ACMCCGGMCCTTTAATTCCGGCCCCACCTTGGGGNCNAGGCNAAGGCCCCAAACTTGG 

GGNTAAAACCMGGGGAATTTAAGNCAAGTANNCCGGAAGGGGTTAANTGGTTNAGGGGC 

CCGGGTTNGNCTTANCAANGAMGTTTTCCTTTGGA 

Sequence 1687 

ATTGGAGCTCCACCGCGGTGGCAAAGCAGACTrrTTATTTTMCAAGGAAAGTCATAAAG 

TGMTTTTTTCATGAAAGAAATAGTCTTCATTTMGAMGTGTTGCCGTTAMGGGTTAT 

CTTGGCACTMTGATTGGCTTTGGGTGATTTTAATTCTGAGATGTCTGTCATAAAGGAAA 

TTATGAGACCATAAAATTGAGCTTCTAATTTTCTGTAAGAATACAGAGGAGAAGG AAGG T 

AAMCTGTTMGGGTGAGCTTAAGTMTACACAGATCCTAGATMTCAGTTTGCATTTTG 

TGCTMTGTMGATCACMTGCTTGTGTTTGMTTAATGTTTTTTTACTGATAAGAAAGA 

TTTGAAATGGACAGTTGGMTCAATCAACAATATTTCTTGGGGGTGCATTTCCACTGGAT 

GC 

Sequence 1688 

TGMTCTGCATTTCTTGGATGAGATAGTTAATAACAMCTATTTCTCAATATTTGTATAC 

JAAAAACTAGTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTATTTGAACATGGGTTT 

CTGAAAGGAGCCTATATAATAATATAAATGGTATGTAGTAAATGAGGCACTGTCTTGGCT 

GGGACTGCTATAAAAAAATTACCATAGACCATTGACTAAACCACAAACATATACTTCTCA 

CAGTTCTGGAAGTTGGAAAGCCAAGATGAGGATGCCAGTATGGCTAGCTTCTAGTGAGGG 

GTCCTCTTGCAGGTTGCAGACAGCCGGCTTCTTGGCTCCTCACATGGCAGGAAGAAGGTG 

CCGCAAGCCTCTCTGGCCTCTTCTTACCAAGGGGCATTAATTACCTTC 

Sequence 1689 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCATGGATAAT 

TTAGATGTGACTCTGGACAGAGGGATGGATCAGATGACTTCTTAAGTTATCTTCCAGTTT 

AGGAGTTCGTAAACTATACTTTCTCCTTTCCAGACTATCCTAGTAAGAAMTTCTCTTTT 

MGACAGAGTAGAACTCTGGMTTCATCAGTTTTGATGTTTCTTAAAGTGTAATCTAAGA 

TAGTGCTCCTGTATTAAGTTCTGATGTCTGACCATTGTTCAAATAAAGAGTAAAATGCAA 
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ATGACAGGAAATTGGC TGTG TTCTGMTCCTATTTTTATTTGGGATAACAATAAGCCTGT 
ATGGTCACTGGTGACCTTTTGATTTGCTGTTTCTGCAACCTCACACTTGCGTCAGGATTC 
TTCTTCCACTTCTTGCACTTTATATTG 
Sequence 1690 

GAA7TGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATATAGGCAGACAGGTTTGCCTTCA 

GAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGCTACCATTTTC 

CTGCAGGATGTTTATAAATAGTTCCTGGCAAAGGAGGTTGCCCGTATAGGGTCCTACAAG 

TAACCTAGGTGGCCCCGCGTACCTGCCCG 

Sequence 1691 

GAGMGGCGGGTTTGCNGTTATTTGGGCCGCTCTTTCCGNCTTCCTCGCTCACTTGACTC 

GCNTGCGCCTCGATTCAATTCGGGCNTGATGGCAGAAGCCGGCTATNCAGCNTCACCTTC 

ANAAGGGGCGGNTAAATTACCGGGTNTTATTCCCACCAAGGAAATTCAGNGGGGNATNAA 

ACCCCCAGGGGAAAAGTAACCATTGGTGGAGTCCAATAAAAGGGCNCACNCCAAAAATAG 

GGGCCCATGTNNAANACCCCGGGGNAATAAAAAAGNGGCCCCGGCCNGTTNTNGGCCTTG 

GGGGCCGGTTTTTTTTTT 

Sequence 1692 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGATGAAGAG 
GCAAGAAAATTATGCCAAATTACCCTGTTTCCCTTTGTCACACTGCCTCCTCGCTGACAC 
CTTCATTCTCCTG CAACCTTACACTTTGATGATG ACTTG CTCTGCTTCCATGGAACTACT 
TC ACTTG GCCTATGGCACAAGGCACTTCCTTCCGGCAACATTCGCACCTTGGGAGGGGGC 
TATTTTTGCATGCACCTAGTCCATTGCMTCCATTTGTTCTTGAAGGGCAAGATAGAAAC 
TTTGTTTTATTATCTAC ACTCACGAAAMC AAATTTTGTGCACCATTCC C AACAC NATTC 
TTTmCTTTTTTTATTTTAGTNMGGNGGAGAAA 
Sequence 1693 

GGGGCCATTGAGACTGCCATGGAAGAC7TGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAGGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATG 

Sequence 1694 

CGGCCGCCCGGGCAGGTACAAGGGAGACAGCATGCAGGGTGTGTTCAGAAGAGCTTGCTG 

AGGTGCTCGGCTCTTAGCATTAAAAATGTGATGTTGGTATATCATCCTGATAGAAAACAC 

TGCTTTCCAAATCCTAGTCACTTGGATGGGAGGAAAGTAAGAACAGATTCTTCCAACCAC 

TACTGATTTGTTATAATTCTCCCCATTGAAATTGGGACAAAATTTGTAACATGCTACTCT 

GAAATTGATCACATTCTCAATTCTGTCAATAGCAGAGAATTTAGCCTCAACACCACCAAT 

TCAAAATAGTTTAGGTCTTGCCCCTGCCTGATTATGTAACAGAAAAGCA1' 

Sequence 1695 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTCCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCCTCTGAGCGCTTGTCTATCGCATCTGCCTGGGCAAGT 

TCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCC.TANGATCTACTGGG 

GGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTTGAA 

Sequence 1696 

CCGGGCAGGTACGCGGGACAGTTCTCTTTCCTCCACTAGACTGAGCTCTTCAGAGGAAGA 
CTCACTTGGCTGAAACCATGATTTTACTTTAAACACATTGAAAACCTCTACTGGAGTGCA 
TTGTGTCTGGTGGGCTTCAACCTTAATTCTTAAGTATGTGAAAACACATCACCTATCTGG 
AGGTTTACACTTTCTGCTAATGACTTTATTTTTAAGCCCACCACCCTAACACAACAAATA 
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CTTAAAACTTGTCTTCATTTCCTTTAGGTCTGGCCCTCATGCATGCATATMTTTATAGA 

GTCACTGTTTTGCTCGGTTGTCCTCATGCCTCTATATTATTGGGAGGTT 

Sequence 1697 

CCGGGCAGGTAC I I M I I Ml M M I I I I I I H I I I GAGATGGAGTCCCATTCCGTCACC 

CAGGCTGGAGTGCAGTGGTGTGATTTTGGCCTACTGCAACCTCCGTCTCCTGAGTTCAAG 

CAATTCTCCTGCCTCAGGCTCCCAAATAGCTGGGATTACAGGTGGGCACAACTACACCCA 

GCTAATTTTTGTATCCTTAGTAGAGACGAGGTTATACCATGTTGGTCAGGCTGGTCTCAA 

ACTCCTGATCTCAGGTGATCCACCCGCCTCGGCCTCCCAAAATGCTGGGATTACAGGTGT 

GAGCCACCATGGCCAGCCGGTTGTrTACTTTTGATAAAGGATAGCCAATGTAGGATTAAA 

GAGGATTCCAATCGCAGTCCCCAGGTTGGGAGTTTATCCATCAATAAAGCAAATTACGAT 

MCTCATAGTGATAAATATTAMCCCAAGCAACAGGAAAATATTTTATTTTCCAGTGTTT 

CATTGGGCTTCTGAATCATTCTGGACACATGTATATTTCCCTGTCTATATTAAGTAGNAT 

GTTATTTTCTCACTTTAAAATTATCCCGTTAATTTTTGGAT^ 

Sequence 1698 

CCGCGGTGGCGGCCGTAGCTTGCAGGQGTGCTATGTGAAGATGGCGGAATTCCCAACGTG 

CTTCTGCAAMTTTTATGTATTCATCCCCCTCTGC TCTCA GTAGGGTGTTACCACCTGCG 

GATCCTTTGCTTTCCCCGCTGGATACTMCAGCCTTT7TCATAATTGCTAGCACCTATGG 

MTTGAGTGTTGAACTGAACAMCMTAGTCTTTCTCTGTMTGTCAGAGTTATTTATAA 

GACGAGGCAGTGTTAGACATAAGAAAGAGCACTGGGCAAAGAATCAGACAACCAGGATTT 

GGTTCCATTCTTTTATTATCAMGAATAAAAAAAAAACMTATTATTTTAAGATGCAGTC 

CCMGTCTTATAAGGAATTTGAAGGCTTCTGGGATCTAATCTAGGTCATGCCCATTCTGC 

CTTCCCTTGCTGTGTGCCCCTGACACAAGCATGGAAACATGT ATTCAC TTACCTTTGCCC 

TTGGGAACACTTTGGTGGGGAAAAATGTTGATATAGGTTAGCATTTTTATAAACTGTCAA 

GTTGTACCT 
Sequence 1699 

TCTGAGAACTTGGGCTGCGCAGGGAATGCTGGTGTGGGAGAAGCAAGCCAACTGAGCTGA 
GCTGCGGCCCTGTTTGTCCTTAGCTCTCCAAAACTTGCTGTCTATGTCACTTGGAAGGAA 
TCTGATTCTTTTCTGCAAGGCACTGACAATACCTTACAACAGAGCTGGGAGATAAAATGC 
AGAGCTTGTGAAAGTCATCAGAAAGTTGAACACAGATGGTCCCCGGCTTCCCGCGTACCT 

Sequence 1700 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCG GCCGCC CGGGCAGGTACTGCACTG 

TGGTnTGATTTTCCCTGGTCATTAGTGATGTCGAGCATTTTTTCATATGTTTGT^ 

ATTTGTGTATCTTCTTTTGAGMTTGTCTATTCATATCCTTAGCTCACTTTTGATGAAAT 

TGTTTTTTTCTTTCTAGTTTGTTTGAGTTCATTGTAGATTTTGGA 

ATGCATAGGTTGTGAAGATTTATATCAAAAGTGTAGAGCCTGAGTTTGGTCATGAGGAAA 

TATTAGAGAGACCCTGGTTAGTGTTAGCCTAAAAAGTGAAATGCCTAAATACTTAGGATT 

GTTAAGGAAATAATAAACCAAAAAACATTGAAAAACAGAGAGCAGCTGC ACTAG TGCTGA 

GMTTGAATGMTATATCAGAGTTCTTTTAGTGCCCGTGAGATGCTAACCTTTTGACCTA 

GGCAGAGTAGAGATATTCCACAGAGCTCATTGGACATTCAGTTTAGACCTCAGAAAGGGC 

ATACTGTAGGAAGTAAGGCCACATTCTAATGTAAAGAACTTAACCCTTTGACCAGTGGAC 

AAAAATTGAAACCAGACCTTATAGAGCAAATGGATAAAATCAACCCGGGTGAA 

Sequence 1701 ^„ 
TGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCTCTGGTTGTTGCTCANGAAGCTT 

TAGAGATATATGGNTATTTMTTTGCAAATAATACTTACAACTGCAAAAACACGTTCACC 

TGGTCTTGTTGAGAAAGAAATCATANGTCCCCTGACTTAAACTCTCTCAATTGATGAAGT 

GACCAGGA7TACTCATAGTGCAGTTATTGGCACCCCCGAAGCACCAGAGTCCAGGAAAGA 

TGCTTATCNATCTTTTGCCATTGGCTTCTACTTGCANGTGCCTTCTCTACTATTCAATCA 

TATACGATCATTGGAAAGACAAATGAACTCAAGAAAAGCAGCGAACAGTACCTNGGCCGC 
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TCTANAACTAGTGGATCCC 
Sequence 1702 

NAGGATACCAGGCGTTTTCCCCCTTGGAAGCTCCCTCGTGGCGCATCTCCTGTTCCGACC 

CTGCACGCTTACCCGGGATTACNCTGTNCCGCACTTTCTTCCCATTACGGGGTAAAGCCG 

ATGGGCCGCTNTTCTCAATAAGTCNTCACCGCCTNGTAAGNGGTANTNCTTCAANGTTTC 

NGGNTGGTAAAGGGTTCCGGTTACGGCCTTCCCAAAGACCTGGGGGCCTTGTTGGTTGGC 

CACCGAAACCCCCCCCCCCGTTTTCAANACNCCCGNAACCCGGCTTGNCCGGCCCCTTTA 

ATTCCCNGGGTTTAMCCTTAATTTTCGGGTCCTTTTGGMGGTTCCCCAAACCCCCCGG 

0 

Sequence 1703 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTCACTGCGTA 

TTTAGAAGTTCTTGGGGCACTGTGTAAGTGTAGGAGAAGATAGATGGAGTATCTAATTAA 

TCTAAAGATAAAAATGAATGAGGTAGMTCCATTCAGTAGACTAATAAATAGTTCTTAAG 

ATCTATAGGATTTCTCTTTTGGAGTGTGGAGACATAGTGTGGAGACATTTGCTAATGTAG 

CCATAAAGATAGACCTATTGTCCATTGCAATGAAAGCAACAAATACTATAACATAAAATA 

TTTCCAAATTGGTTTGGGTGAAAATMGCTTGTTTCCCAATAAATGTCAGATGGCACTTG 

TTGTAAGAGGTMCTTTGMGAAATTTAAATTAATATATTATTTCCAGCCAAAAACGTCA 

TGCATTTTTAGAAGTTCACAAAAGTCTGTAAAATATGCACACAAATGTCCATCAATTGTA 

ATTTATTTCAATAAAGAAAGGTAGCTTATACCACTTGTGGCCATCAGATTTCCTAT AGAC 

TATGCCTCAAGAAATTATCTAAATCTAAGG N GTTAG CAAAATAGTGCTTACATGTATTTT 

ATAGTATGAAATCACCCTTCAATATACATATTCATATAAGTTGTTTAAGCTGGGGCACTG 

CTTTACTGAAAA^AAAAATCTTTCACATTCTCTAT^ 

GAATGGCTCANCTGGTNAATTTCTACNGAGGGTTAAAAGGAAGGAAAGAAAAAAA 
Sequence 1704 

GGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTC 

ACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACC 

ACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCA 

AATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCT 

ATCGCAtCTGCCTGGGCMGTTCACCTTCCCTGGGATGTCCCTGGACAAGATTTTACTCC 

CATAACCCAGGCAGAAGTACCTCGGCCGCTCTAAANCTAGNGGGAATCCCCCGGCCCTGN 

GGNAATTTCNATNNANGCTTTTTANTNANNCCCNNCAAACCTTNTNNGGGGGGGGGCCC 

Sequence 1705 

CCGGGCAGGTACAAGCTATCAAAGGCGGCAATTTTCTCCTGCAAGACTACGACAAGGATC 

CGTTTATTGCAATGGAGATCATAAATAGGTGTCTTAAATTGAATGGACTTGACCATCTCC 

CCAGTACTCTGCGTTGTTACCACTGCTTCCCGCGAACTCTGCGTTGTTACCACTGCTTCC 

CGCGAACTCTGCGTTGTTACCACTGCTTCCCGCGAACTCTGCGTTGTTACCACTGCTTCC 

CGCGAACTCTGCGTTGTTACCACTGCTTCCCGCGAACTCTGCGTTGTTACCACTGCTTCC 

CGCGTACCT 

Sequence 1706 

CGCCCGGGCAGGTACAATGGAACAAGGAGATAAGCAGTGAAAGGCCAAGGGAATGTCTGG 

AGTTAGGACTTCAGGTGATTCACAACTTGGCTGCCACTCACCCGAGACTGCCCAAGCCCA 

GATTTCTTCCTTCTATAAGAATATTGATTCTTGCAAATAAGATGAACCTAAATGTGGTCC 

AGGAGTCAGCATCTTCTACATGGTACCT 

Sequence 1707 

GMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCGCTGCTTTTCTTGAGTTCATT 

TGTCTTTCCAATGATCGTATATGATTGAATAGTAGAGAAGGCACCTGCAAGTAGAAGCCA 

ATGGCAAAAGATTGATAAGCATCTTTCCTGGACTCTGGTGCTTCGGGGGTGCCAATAACT 

GCACTCTGAGTAATCCTGGTCACTTCATCAATTGAGAGAGTTTAAGTCAGGGGACTTATG 

ATTTCTTTCTCAACAAGACCAGGTGAACGTGTTTTTGCAGTTCTAAGTATTATTTGCA^ 
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TTAAATAGCCATATATCTCTAAAGCTTCCTGAGCAACAACCAGAGGTACCTGCCCG 
Sequence 1708 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGGCATGAGCCACTGTGCTCGGCCAAG 

ATTCTCTTAAATTCATCTTTTGC I I I ! i ATTTCCGCTTACCCTGCTTCAGACTAACTTCT 

CACCGTCMTCACAGAATTGGTTGGATAGTTTGCTTTCTTGCTAATTTTGCCTTCT^ 

CTTGTTCTCTATAAATTGTCCAGACTATCAGATAAAATCCAAATGTTTCATCCTTTGCAA 

CTGCTCATATACTGTCTAACTTAGTGAAACAMCTGCTTTCATTGTTATTTGTTCTTCAG 

AATACTACCTCCTAAGTAAATGGTACCT 

Sequence 1709 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTTCCTCCAGTG 

CCCAGAGATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGATGGCCTGTGAAAAC 

TGMGGCAATTTGGAAGGCATTTTTAMTTCCGTGGAGCCACTGGAGAAATGTTTACAGG 

AATGATGCCTTAAAATTCCTCAGTGGTAAGATCTGAGTACCTGCCCG 

Sequence 1710 

ACTNCTATAGGGCGAATTGGAGCTCCCCLGCGGTGGCGGCCGAGGTACACCAGAACTTAAA 
GTATAATAATAAAAAATAMGTATTACAGAACATGGTCAATTATATTAAAAAGTAGATTA 
ATTMTGTCCATATATACATTTTTTGGTGGAAA 
AATTMTATTTTGTTTATTTTTTACTTATTAATTTATTGCAGAGAT 

GTTGCCCAGGCTAGTCTCAAACTCCTGGCCTCAAGTAATCCTCCCACCTCAATCCCCCAA 

MTGCTGGGATTTCAGGAACATGCCACTGCACTTTATTAGTGTTACCTGTCATAAAGTTA 

ATTACTCATAATTTTATTTCAGTATATATATATACACACACACACCATACACACCATAGT 

AGMTAAATTAATTATATTTTCTACAAAAATTATTTTCMTCTTTT 

Sequence 1711 

CCGCGGTGGCGGTGCTGGCGGTCATTCTCGGCTCATTGCTGCAGGTGCTGATCCCCCGAG 

ACTGGCTGTTGCGCCTGTTTGGTCGCGCCGGCCTGGGTTCCACCCTGCGTGGCGGGCTAT 

TCGCCTTGCCGGGGATGATGTGCAGTTGCTGCGCCGCTCCGGTGGCGGCCGGTATGCGTC 

GGCAGAAGGTCTCGGTGGGCGCGGCGTTGGCGTTCTGGATTGCCAACCCGGTACTGAACC 

CGGCGACACTGGTGTTCATGGGCTTTGTGCTGGGCTGGGGCTTTACCGCGCTGCGGTTGG 

TGGCCGGGATCGTGCTGGTAGTGGGTGTTTCGCTAGTGGCGCAACGCATCGC 

Sequence 1712 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGATAT 

TTGCTGAACTCATAAGTTTATTTGGGTAACAAGAGTCAAAATGATATTGGTTAGCAAATA 

AAMTTCAGGATACATATTCCAAAAATGCTGTTTATGTAATTAATTTGGTGAAGTCTGTC 

AAAAAAACCANCTAGNTGCTTGTGGGCCNNGTTGACTGCTGTGGTAAATTAAGTGATGCC 

CTAGCAAACTAAAGACAGTGTCATGCCTTCCAAAAGGCCCAGAGAGAGCTTCAGGAGAGA 

ATAGGTCCATCCTCATGTAGCCGGGAGAATTGCAGAATGAAATTGCTCCCAGTATTACTA 

GTAMCCTTCAMTTTCATTTACTGGGGCAGAGATAGTTCCGGTTTTAAGGAAATAATAG 

AAAATCATGGTCTGAAAATTCTCAAACCTGAAAG 

Sequence 1713 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCAAAA 

GMTGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTACAT 

CAATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATACTATAGAGCAA 

TGAAAATAAATAAACTATGGCTTATATGCAAACAACATGGATAAATCTTACCAACAATGT 

TANGACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGT 

TCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTA 

TTGCAGAGGGGGCTNCGGGTGTTTGTAATTTCTATTTCCCAGTGCAGGTGTGGTTTGCAT 

GAGCATGTTCACCTTTGTGGAAATTCA 

Sequence 1714 

TCGACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGGCGTCACGCCACCCGAG 
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CCGGACTGGCTGCTGATGGATTATGCCGACCTGGTCAGTTCGGAAGCCTTCCGCGGCACC 

GCCTTTGCCGCCGCCGTGCGCCAGCAGCTCGACACGCTCGAGCGCGAGCACCCGGGGCGC 

GGCGTGGAGCAAGCGATCGACATGCTGCTGNCCGTGTTTGGCdCGGCGCTGCAAAACCAG 

GGCGGCACGGCCCTGTTCCTCGATGGACGCCTGCTGACGGAATTCCTGGAAACACCGACC 

ACCA 

Sequence 1715 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACACCCTAGCTCCAGCTTCCCCTG 

GGAGACTGTGCATCTCCTGGCTCCACTAACACCACCTTCTTCTGACCTTCCAGCCTAGAG 

ATGATGACTCTGCCAGCCTAGATGGGCTCTGGGTTGTCTCCCTATTCCTGNTTGCTTTGN 

AGAATNTCCCATTATGCTGNCACCMCTCCCCAGCCTAAGCCCTCTCTATTTTAAATTCT 

CAAGTGGATTATGTTCCTGATTAGTCCCTGACTGATATACCACTCTCCTCATGATCTCTG 

ATTAGTTTTCCTGTTAGGTTGTTGCANTCNAAAAAAAAAAAAAAAAAAAAGTGC 

Sequence 1716 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTCTCCC 

CTGACCTCCTTTGGCTAMGTGCCmCAGATTTTATTACTATTGTTAACAmCCAAC 

GCCATTTTAACATTATTTTTACAAATTTGATGTGAATATTCCATATTAGAAATATG 

CAGAAGACAAAGMTTTGTTTGTGAAAATGCTAAGGACTGAAGCTGACAGAAATAACATT 

CCTGCTTACAATGAACTTTTGCACGTATAGTAGAAAACCAGGAGCGTCCTCAATGTGCTC 

CTTCACCCTATTGGCTGCATCTCCATGAGCCCCTGAGAAGGGAAATTAACACTACCTGCA 

GAGGAGACAAGAGGCACCTGAGGACCCAGGAGGACCCACAATAAGTTGCTAAAATGAAAC 

ATCTGTTACATTTGTGTTTAATTAGTAATATGTCTT 

Sequence 1717 

NATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAGAGGGGTG 
GTATCGGTTATTCAGGAC I I I I I i I I I I i I I I CTTAAATATCCTGTGCTTCTTTCAATC A 
TTTGAAGGTAAAACCAGGTCCTGTGAGTGGTAAACTGA I I I I I GGTTCTTATGAGGGTAT 
TATTTTCTGTGTAGATATTTGTTMCTTGGTGTCCTTGCAGGGTAAACGATCATCNAANC 
TTTNTGTTCCACCATNTTGCTCCTCTCATTTTAGACCTAATATATTG ACTATAAN N GTTT 
CCCTCTGGAAGTATTGACTATTNTTTTGCATCMTACTAAAGAAAATAAAATATAATNAT 
AG AGACTN AAAC ATATTAATATATATATAATGGGTGTTAAACTAAATG G NTAATTTG AA 
Sequence 1718 

CTACTTAGGGCGAATtGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGGG 

GACCCGCCAGGGGCCTCGAGAATCGGTATCCTGAGTCCTCCTGAAGAGCAGTAGAGGTTG 

TTTCATTMGTGCAAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCC 

GAGACTAATTTTTCCATTGCTAACTCTAGATTCTCGGCCACTGGAGTCTGAAGATACTCT 

CTTTGAGAATGCATATTATTTTGCTCACAGCTAAMCATTTAAGTATCATAGCTGATCAG 

TGGAGTGAGATTAAAAGGTTTCTTTTTTGAATCATCAGCTAGAGATGTACCT 

Sequence 1719 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACTCTGGATAATTAACGGCAGAAAGATAAA 

CTGTATTTCTGCACATCTTCACTGCACTGCGAGCCCCAACCACAGCACACACCTGCACCC 

TCCCTGCCTTCTCACATAAAGCCCAGAGGGAACACCGGGCACCTTGGTGTCTTCTCAGAT 

GCCCCTGGAGCGTTTCACACCAATGCAATMCAGMGTTTTGATTTTCCAGATAAACAAA 

TGGATTGTTTACAGCTACAGAAGCCCCCTTCAGAGACTGCCAAATTTCTGTCCTCATCCA 

CCAACCTGCCTAGAAGAGAGAAGCTAGTGCCCTCTGCAAAACCTCCCACACTAGGGGGCT 

GGTATMGGGATGAAMCTAGGGGGAGTCTCTTTAAGGGTGATCTTTCTGCCAAAATGAC 

TTTACTTTTCCCTGAACCATAATC 

Sequence 1720 

GAGCTCCCCGCGGNGGCGGCCCGAGGTACACTTTCAAGACCAAAATGTAAATTTACAAAT 
GTGTCTCAGTAAGAAAAGTTACAGAAGCTGGGCCGGGCGCAGTGGCTCATGCCTGTAATC 
CCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACTTGAGGTCAGGAGTTCGAAACCAGCC 
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TGGCCAACGTGGNGAAAACCCTGTCTCTACTAAAAAATACAAAAACTAGCCCGGGAGTGG 

TGGCGCACACCTCTAATCCCAGCTACTCAAGAGGCTGAGGTAGGAGAATCGCTGGAACCC 

GGGAGGCGGAGGCTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGTGACAGA 

GCAAGACTCTGGTAATACAGTCAAGCTGCTCCTGTCAGATGTATACTTCCTGAATGAGCA 

CTATAATGGATGCTGCTCAGGTTCATCTATTCAGTAAGGTTGGAGCCTACTATATGCCAG 

GCAC 

Sequence 1721 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGAACTTAAA 
GTATAATAATAAAAAATAAAGTATTACAGAACATGGTCAATTATATTAAAAAGTAGATTA 
ATTMTGTCCATATATACATTTTTTGGTGGAMTCAMCATMTGAAMCTATTTCACAT 
AATTAAT ATTTTG TTTA I I I I I I ACTTATTAATTNATTGCAGAGATGGGAGGTCTTGCTA 
TGTTGCCCAGGCTAGTCTCAAACTCCTGGCCTCAAGTAATCCTCCCACCTCAATCCCCCA 
AAATGCTGGGATTTCAGGAACATGCCACTGCACTTTATTAGTGTTACCTGTCATAAAGTT 
AATTACTCATAATTTTATTTC AG N ATATATATATAC AC AC AC AC AC C ATACAC ACC ATAG 
TAGMTAAATTMTTATATTTTCTACAAAAATTATTTTCAATC I II I I I 
Sequence 1722 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGATTAGTTNAA 
TATAAAGACTCCGTAA I I i 1 1 ACMTTTTAACMTMTTTTATTTCTTCAAGCTTGTTAG 
NTNGGGATTGNATTAAAACTACAGTGTGTGACTTANAAAATGATAATGCTGCTTTATGGA 
AMTGGATTATAGGTGGGTMGACTTTCNTTTGCAAAMTTtGGGNAATTACCCATCAGT 
NGTTAGGAACCCAGNTGAAGTCTANANGACAGATGATAGTATCTTATACTAGGTTGGGCT 

Sequence 1723 

CACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTCAGCCATTCGGCGAACTG 

CGCTTCGGTCTCTTGCGAAATGGCATCGTAGCGCACGCACTGGAAATACGTGCCGTCGGC 

CGGCAGCAGGCTGAAGCGGCTGCCCGCCAGGCCGTCGCGGAACAGGTCGCGCTTGCGCTG 

GTAAAACGCGGGCAAAGTCCAAGGTAGGGCTTGCGGCTTGGCCATGTAGCCCGGCAATGC 

CGTGCTGCATCGGCGTGTTGACGGTAAACACGTTGTACTGGTGCACCTTGCGAAATTCCG 

CCGTCAGGGCTGCGGGCGCCGCTACATAGCCAACCTTCCAGCCCGTCTTTGCGCAATGGT 

GAGTAGTTGACCCATGGACGTATGCTAGAAGCCGCATGGATGCAGGCTTTCAGATGTTTG 

ATCAATGTCCATGGGAAAGTTCCCGATTTGACGCTCTT 

Sequence 1724 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACATTTTCTTGATTCC 

TACAATCTATTCTTCCTACMCAGCAACAGTMTTGTTTGAAATTATACATAAGACTTCA 

TCACTATTTTGTTCAAAATTCTTAGGATTCTTCCCMTACTTTTAGGATAGGGCTCAATT 

GCTTTATCATTGACTGCAAGGGCCCTGTGTCCTCTGAACCTGCATACCTATCCAAGTTTA 

TCTCCAACACTATCCCACTGTATTACCCTACTCTGTCTACAMGTCTTTTCTGTTCTTTG 

AACAAACAAAATTTATTCTTACCTGAGGGTCTTGCATCCACACTCAGACCTTACTTGGTT 

TTAGTTTTCAGAAAGGCCACCAAGGGACCATCCTATTTAAAACAGTCCCCATGGTTAGAT 

CCCACCTCCGTCACCTnTCCTAGTGAGTTTTCACCA 

Sequence 1725 

GTNGGAGCGGCCGCCCGGGCAGGTTCTGAGTCAAGGACTTCTTTAACGTCATNNACGGCT 
CCTAGGACAGCCACAGAAAAAAATGGGGTAAAGGGGTAGGAAGAAAAAAGGAAGGGAAAG 
AGAGCCAGGGAATTGGGGAAGGAGAGGTAGGGGAAGGGGAAGGGG 
Sequence 1726 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATGCCAGGCACAAGCTGGAGTCACAGATGCCA 
CACTGACTATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTTT 
CTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTTCCCAG 
TTGGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACA 
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CGGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTAC 

AGAGAGTTACGGAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAA 

TGACTGATAGATCCTTTTACAATATTCTAGGGGTAGTCTGTCTGTGGGAGGGGGCAGGAG 

TCTATGGGGAACTCTCT 

Sequence 1727 

CCGCGGTGGCGGCCGAGGTACGCGGGGGATTTTTAGTAMGTTTTGCCATGTTGGCCAGG 

CTGATCTTGAACTCCTGCCCTCAAGTGATCTGCCCACCTTGACCTCCCAAAGTGCTGGGA 

TTACAGGTGTGAGCTACTGTGCTCAGCTCAGCCATGGAACAATCTTGATGGATAAATTTA 

ATCTGCAGGTATCTCCATCTCTATTTGCCACTCTTAGT,\AATGTTTTAGTTTGACAGTAA 

ACGAGTTGAGACATTAATGATTGAGAGAGCACCTAAGCAGCTCGAAGTGCTGGGAGTCTC 

TCCTAAAGAATTAGATGCACACAGCATTTTTCCCTTACAGACACTTCACTGAAGCAGTCA 

TGTTGGACTTGGGTGGCGTCTGCTATTAGACACACTATTCACCAATAAAACCCTGAAAAT 

TGCATTGCACAG 

Sequence 1728 

CCGCGGTGGCGGCCGAGGTACTATTAAATTTAATGAGAGGTAACCCATGCACCGGCAGGA 

AGTTCAAGGGGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCTCACCTCCTT 

CCTTCTAGTTTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCAGGCCCCTTT 

GGATTAGGATGCTGTCCTAAATCCTTCCTAAAGAGAGGTTTTGGAGCCACCAACACTGCT 

TCACTGTGGAATAAACTTTCCTTATTAAGATTCTTCATTCACTCATAATCATCTCCTTTT 

TCCTCACTGCTCATTCATTTATGGAGATAGCACCCATATCATAGCTACTTGGTCAAATCA 

TGGTGATACTCCCCACTTAGGAGAGGAAGATGACTAGGTTATAAGCTCTTGCCTTTAGGA 

AATTAAAAACCCTGTGGGT 

Sequence 1729 

CCGCGGTGGCGGCTGGATTCTTGACATTCATTAATATCTTGACAACTCAGCCCATCAGAG 

GTTCTTCGAAAGCCACTTCCACAACGGACCACACAGCGATAAGATCCAATCGTATTGTCA 

CAGTCCTGACCAGCGTGGCAGGTATGCCTACCCAAAGCACACTCATCAATATCTTNGACA 

AGTTCTTCCATCTGCAGCTATGGTGAGGCCTTTAGGGCAGGAGCAGTAGTAAGTCCCCAT 

GGCATTGTGGCAGCTATGGGAGCAGGGATTCCCTGCTGCACATTCATCCTCATCAGCACA 

AAAAGGTCCAACTGAGTCTAAGGTAAACCCGGAGGGGCACTGATTCTGGCGATCTCCTTT 

GG ATATTG AAG C ATTG AATTTT 

Sequence 1730 

TTCACTCTTAGGGCGAA7TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGTTCAAATA 

GTCAGCAGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATG 

GTGAGCAAGGAAAGCAAGTTATTGGAGGCACATGTTAACACATAAAATATAAAATTAATA 

TGATCACACTGGAAAGGCTTGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGAT 

GCACAACANAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGGTGCTCTCATC 

AGTG GGGGG ACCC AAATGCTTATGTG G GACTCACCAGGTATCGAATTTANCCATTGATTA 

GGGAGTGGTTTTGGTTGGTGGNTGGGCNAGGAAAACTTTTNTTAATTCCAAATTGGAAAT 

ACCAATTNGAAAACCTTTTAAAAAAMTTAATTTG 

GGCCCAAATTNGTCAATNTTTTGGCCGTTACCCTTGCCC 

Sequence 1731 

CTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTTCTTTAA 

AAAGATTTGTATCTTGCTAGGTAATTAAACAATGTTTACAATTAATATATTT 

ATAGAGACAGGGTCTTGCTGTATAGCCCAGGCATTCTTGAACTCCTGGCCTCAAGCAATC 

TTCTTGCCTTGGCCTTTCCAAAGTGCTAGGATTACAGGGTGTGAGCCAGTGTGCCCGGCC 

CAATTMTACCATCTAAGTCATTGTCTAAGGAACTTTTCGTTTCTGTTTGTTGTGGG^ 

AGACTTTGGGAAGAAAGTATGTTTGATAAGAATTTATCAGATAACCC I nTTTTTTTTTG 

TAATATCTCTTTGGGATAGCATTGTCTTCCTCTGCTTTGGGGMGTCTTTTTGGTTACTC 

ATTACTCAAAATTTGCCAAATTTAGGGGTGGGTGGTGAATTTTTGGC 
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ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAl I I ITA 

GTAAAGTTTTGCCATGTTGGCCAGGCTGATCTTGAACTCCTGCCCTCAAGTGATCTGCCC 

ACCTTGACCTCCCAAAGTGCTGGGATTACAGGTGTGAGCTACTGTGCTCAGCTCAGCCAT 

GG AAC AATCTTG ATG G N TTAATTAATCTGC AGGTATCTCCATCTCTATTTGCCACTCTTA 

GTAMTGTTTTAGTTTGACAGTAMCGAGTTGAGACATTAATGATTGAGAGAGCACCTAA 

GCAGCTCGAAGTGCTGGGAGTCTCTCCTAMGMTTMGATGCACACAGCATTTTTCCCT 

TACAGACACTTCACTGAGCAGTCATGTTGGACTTGGGTGGCCGTCTGCTATTAGACACAC 

TATTCACCAATAAAAACCCCTGAAAATTGCATTGCACAGGACTG 

Sequence 1733 

CCGGGCAGGTACATGAAGGAGGGCACTANGGGGAGCTAANCAATTTGTGAGTTAGGTAGC 
CAAGATTCGTTGAGGTTGGAAGAATGAGGAAAGAAAGGGGATCTGTGGCAAAAATATGCT 

Sequence 1734 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCATACTACTTTAAGT 

TACCTCCTATTGGGGGCATTATAAATGGGTAAGCAGAGAACATCATGGAAAGACATGGAG 

CTTAGTATATGCAMTGTTGAGTTACTCAGTGTMTGTGTGAAAAAGGAGTTTCATAAGT 

TTCGGTCAGGGAMGAAAGGCAGGGTCAAAMT7TCTGCTTGAGMGTTTTGGGGAGCTT 

GGGGAGACTTTAAACAGGGAGCAACACAGCGCCTCTGTACCTGCCCG 

Sequence 1735 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCATACTACTTTAA 

GTTACCTCCTATTGGGGGCATTATAAATGGGTAAGCAGAGAACATCATGGAAAGACATGG 

AGCTTAGTATATGCAAATGTTGAGTTACTCAGTGTAATGTGTGAAAAAGGAGTTTCATAA 

GTTTCGGTCAGGGAAGAAGGCAGGGGTCAAAMTTTTCTTGCTTGAGMGTTTTNGGGAG 

CTTGGGGAAGACTTTAAACAGGGAGCAACACAGCCGCCTCTGTACCTGCCCG 

Sequence 1736 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACAGTTTTGAGCACCTGCCTT 

GTGCCTCMTGMTACTTGTTGGGTGATGAATGATATAAAATTCATCTTGACATTCAGCT 

TAGGTTTGGAAGACAGTGTATGATTTCTTTACCATGGCCTTGACTCATCTTCTGAATGGT 

CAGGCTAAGTGCTGCCTCCTTCTCTGTGTAGCCTATTCAGGCACCTGCTCACCTACCACA 

ACCTGCCAGGCCCCTGCAGCTCTGGAAGCATGCTATGTGAGAAATAATCCTAATATCCAA 

GTCACAGACAGTGGGTGGGCTTGGAAGTGCAGGGCCTGTGATCCCTTTGTGGAGTAGCTG 

AATTTAAGAAGTTGACAGCAATTGTGTTCTATAATGAGAATGGTCCTGATCATCATGGAG 

GAATAATAGCTGCCTTTTATTGAGAAACTGTCAGGTACCTGCCCGG 

Sequence 1737 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATCAAACATCTTGT 

TATACTTTGTAAACATTTTC I MM) I AAACCCCAGTTCCAGCCGGACGCCCCCAGACCT 

CTGAGGTTCGAGGAGGTGGTGGTTTTCATTTGGGGCTTTGCATATTTGGTTGTTAGGTTT 

TGCGAGANCCTTTTTNATTTNGCCCAGACGTCTTNATGCCGGNGGTGAAANTTNCACTTC 

GGGCNCCCCTCCCCTGGAGNTCTTTTGTGTGCCGCNGGAANTTCAGTGGAAGATCCGGTT 

TACTNAGCGATATAGGAGGGATNTATANCTNNNNANTTCNNNNNTTAANTNTATTTGTTC 

CTTGCCCGGNGNCGNGCACNCNTNNACTGCCCGCTTT 

Sequence 1738 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATCAAACATCTTGTTATACTTT 
GTAAACATTTTCTT7TTTTAAACCCCAGTTCCAGCCGGACGCCCCCAGACCTCTGAGGTT 
CGAGGAGGT.GGTGGTTTTCATTTGGGGCTTTGCATATTTGGTTGTTAGGTTTTG 
CCTTCTTTA T I I I I GCCAGACGTCTCAATGCGGGGTGAAGTCCACTCGGCCCCCTCCCCT 
GAGTCTTCCGTGTGCGCGGAATTTCGAGGAGATCCGGTTACTAAGGATATAGAGGAAAAA 
AAAAAAN TTAAAAATTAAAAATTAAAGGTACCTGCC 
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Sequence 1739 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTATGATGGAAG 

GAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GCAGTGAAGTAGAAGAAGACAGGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAG 

ATGAGAGGAGTCTATATGTCAGAATACACATTTCCCAC 

Sequence 1740 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAACATACAGTTAAGCTA 

CTTAAGTTGTAGTGACTTAGGACTTAATTATCTTATAGGATAAAATAAAAACTTTAATCC 

TAAACATTAGCCATTGCATTAAATGCTTTAMGGMCAACTGGCTTTGAGCTCTGTTCAT 

CCATGCAAGGATTATACMTCTCAGTGCTCTAGCTAMTTGCTAATGTTTTATTACTAAT 

GCACTGGTMGCCTCTCATAGMGATGGGTCTCTTGGATTTACACTGAAATATAACTTTT 

CTCTTCCATATTATTGGAAGACCTTATTCTTCAGGCAGGAGCTTAGAAAGTCTAAATTAT 

GCACCAGAGCCTTAATTATCTGATGGCAAAAACCACGATACTCTGGTCACTACTACCAAT 

TGACAATNAGAGGACCCCAATTAAAAACTCTAGAAATATTGAGTCCTGAATACCAG 

Sequence 1741 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAAAGATGATATACATGAAAATGA 

CTTGAAAATTCGTTTCTCTAC i I I I I I I I I AATGTAACACAAAGTAAATGAGAATCTGAG 

TATTTTATCAAGTTGTTTCAATGMGACAGAGTTGATTCAGAAGGTAATTCCCAGATTGG 

ATGGTATGAGTTTCCGGCAAGATTAGAGAMGAGTTTTTCTATGTAATAACTGATATTTG 

TTCACATGCTTCCTCTTGATGGCTCATCAGGATTTTAGAGATAGAAAGGACATNTTATCC 

AACCGCTTTCATTTTCTTAAAGAAATTGGGGACCGGGCGCAGTGACTCATGCCTGTATTT 

CCAGCACTTCGGGAGGCCGAGGCAGGTGGATCCTAGGTCAGGAGTTCAAGACCAGCCTGG 

GTCAAGAATGATGAAACCCCCGTCTCTACCAAAAAAA 

Sequence 1742 

GGCGGCCGCCCGGGCAGGTACAGTTACAACTGACTTGCATAACACTAAAGAAAACATTGT 

AATTCCTACGCCAGTAGTGGGGGTAAACTGAGAACCAACCTAATTTTCACCACAGGGCCC 

TCTGGAAATAGAGGAACTAAAATGATGACTGNTTGAAAGTGAAAGAGGGCAGATTCCTAA 

GAACATCACTCCTACTGCATTCCTTACCTCCACTCCATACCCTTTCCTTCTTCACTCAGG 

TAGGACTGTCTCTGGAATGCATGACAGCATGCACAGTTCAGAGCTTGGATGTGATCCTGG 

AAATGGGCTTAGATGACCTTATGAACACTGAATGCTGATTGCAGACACTCTCAACCTTCT 

TCCTCTAGTTGACTTCTATGGCTGTGCCAGCCAGTCCTAGACCCTTCTAGAAGAACACTA 

GGATGATACATTTC 

Sequence 1743 

AGGTACTT/\CAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACGCAAACAGCAGA 

ATCATATGGCATANACACTTAGCAMTCCMTGCCTTCCAGGCATCTTGTTTCCTGTTGA 

TGAAATCCrCCCTATGGAGAGCAAACTGGTTCATATCTTCANATANTGTCATTCAACCCC 

TTGGGCAoCCTTTTCTTGGGGCTMCTAAAATATTNCGGCACCCCGGTTTCTTGGGTTGG 

GGACCA1 TTTTCAACCCCCNAAGGT7TG NTTTC ANTTTTTATTTTMGATTTTTTTCTTG 

GGGCCTTTCCNGTTGGGGAAGNAAAAATTTAATTCCTGGCCNCTTTTCCTTTCTTGGAAG 

GTNATTTTGGANTCTTCTNCMTTTTGGGNGGCCNGCCCAATTGGGNTTGGAAAANAAAA 

TAATTTCCGGCCCCCTTGGGTTATNCATTMCCCCCCCCCGGCCGGTTTAACCCCCTTGG 

CCCCCCCGNGGGGNCCGGGGTCCCCGGGCTTTCTTAAGNAAAAACNTTAATGNTTNGNGN 

AATTCCCCCCCCCCGGGGGNCNTTGGCNANGGGAAAATTTTCNNGAANTNATTCCAAAAA 

GGCCTTTTATTTCNGAATANANCCCNGTTCCGNACCCCCTTTCGGNAGGGGGGGGGGGGG 

0 

Sequence 1744 

ACAAGGTCTCACTATGTTGCTCAGGCTGATCTTGAACTCCTGGGCTCAAGGGATCCTCCT 
CGCTCGGCCCCCCCAGAGTGCTGGGATTACAGGTGTAAGCCACCACACCCAGCCAACATT 
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CTTATTTTTATGCTTCCTrrGTAAATATAGTGGAGAAAAAAMTCACTGTATTAG 

CTCACCAAGGGCAAGATAGTCATATGAMTAGTCCTTAGCTTCTTTAGTTGCTTGAATAT 

TACCTCCCTGGTCTGTGAGGTATCTTTTTGCTTCANATCATTTCCTCAAAGAAACACATT 

ATTGGTTAATATNATTGGTCACCTTTTGAAGTTGGGTCCTATATGCTGATTCCGGGTAGC 

CCCAGACTTTCAA 

Sequence 1745 

AGGTACAGTGGCGCCATCTCTGCTCACTGCAAGCTCCNTTCTCCCAGGTTCATGCCATTC 

TCCTGCCTCAGCCTCCCAAGTAACTGGGACTACAGCCGCCAGCCACCATGCCTGGCTAAT 

TTTTTGTATCTTTGGTAGGGACGGGGTTTCTCCATGTTAGCCAGGATGGTCTAGAGCTCC 

CAACTTCAGGTGAGCCACCATGCCTGGCAGGACTTTTTTTGATGTMGCAGAGCCCCTTT 

TGCCAGCTTCCTTGGGCTTATTCCCCCATAGATGGCAGCTGGGATCCCAGCCTCTTGCCG 

GTTGATATTGTCATTCTTGGAGGGGGCCCAGGTTGGGGCTAATGCTCGTGTGACTAGAGC 

TTGCTGGTGTTGAGGGTGGTTCATCTGAAGTTCATGACCACTAGGCTTTGATCTAAAAAA 

A 

Sequence 1746 

ATACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGGCTTTGCTCCAGGCCTCAGGAG 

TGTMG TTTM GGGGCGCGAACCTGGGTGGGACGGGGCAGTTTTTCCAGCGGGCCATACG 

GGAAAATTTTTGGTTCGAGGMGACMCCACCCTTTTTMGGAGAAMCTGCATCTTGCC 

CTGCGTTATTCCTACGCGGTGCCCAGGTGGGGTGTGTGTGGACCAAGTCAATGACCGCCC 

AAGCTCTCCGAGTAGAAAACCCAAACATGGTTTTGTGGGGTGTGTGCCTTTGACCCCGGA 

CTTAAGCAAAAAGCGTGGTCTTGGGCGTAGCTACNAGGTGGGTTGGTGGGGCTTCCAAGG 

CCCTGCGTGCCCCTCGACGTTGAACCGATGATTTGGGCGGGTGCCAAGGATAAACAAGAT 

TCGCCCCTTCTTTNGCCCACTTCAATTGGGCAAGGGGGGACCTTTGCTTACTTTTCCCG 

Sequence 1747 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGCTGGACAGGGATATAAATCAGTGAACTCTG 
AGMCACTTTTAATCCAAGCACCAAATAAGGCAGCATTTCCTGTGACTGCTGGACGCCAC 
GGTTTAAGTGAGATGCCAAACATTCCTCATTTGGGAAAATGCGCAATAGTCCACAGAGAG 
AACAAAATCAAAATGTGAAAAGTTGMTAAAAAACACT 

CAGCATMGTTAGCTGACCTCATCTTTGGGACTAGAAAAATAACAGTAAATAGTAATAAT 

AGCTCCCGCGTACCT 

Sequence 1748 

AGGTACATCTCTAGC TGATG ATTCAAAAMGAMCCTTTTAATCTCACTCCACTGATCAG 

CTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATTCTCAAAGAGAGTATC 

TTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAATTAGTCTCGGGCTTCT 

GTTTCTGCCCACAGTTTTCAMTTMGMCMTGTGTTTGCACTTAATGAAACAACCTCT 

ACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGGCGGTCCCCTGTAA 

GTACCTGCCCG 

Sequence 1749 ^ 

A GGTA C ATCTNTAG CTGATG ATTCAAAAMN AAACCNTTTAANTTTANTTCN NTN N NCN A 

NTTTT/7\NCCTAAAATGGTTTMCCTGTGMCCAAAATAATATGCATTCTNAAAGAA 

TATCTTCA(3ACTCCAGTGGGCCCGAGAATCTAGAGTTAAGCAATGGGAAAAATTAAGTCT 

CGGGCTTCTGTTTCTGCCCACAGTTTTCAAATTMGMCMTGTGTTTGCACTTMTGM 

ACAACCTCTACTGCTCTTCAAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGGCGGT 

CCCCTGTAAGTACCTGCCCGG 

Sequence 1750 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCAGATAACTTATC 
CAAGGACAAATAGAAAGGGGACTGTGTCTAGAGGGTAATGCTAATGCCTGGCATGCTATA 
AAAAT AMTG AAATCTGATTAACACTATTGGAGGGAAGAATMGTCATCGTTTCTATGGA 
ATATTATTTTCCTGATTGGAACCATTTCCATTTTCGTCAGCTTAAAACATTGCATTA 
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ATGAGATGACTTTAAAACGGTGACAGCTATCTCAAGCCCTGTGATTATAAATGGTGTTAT 

AGGCTTGTAATCCTATGGGTTTAGACTAAAGTATTTTCTTCATAAATTGAAAGAGAAACA 

CACTAGGAATTCAGAAGATTTTAATTAAGTTGTTCAAAAAATTTAATG 

G CTGG CTC ACTAAGTAGTTTTG ATAATG AATTTAATTTATAGMG AMGTTCTTGGC 

Sequence 1751 

CACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAAG 

AATGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTGCATC 

AATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATACTATAGAGCAAT 

G AAAATAAATAAACTATGGC TATATTCC AC C AAC ATG G AT AAATCTTACC AAC AATGTTA 

GACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCA 

CAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTATTG 

CAGAGGGGGCTCCGGGTGTTTGTMTTTCTATTTCCCAGTGCAGGTGTGGTTTGCATGAG 

CATGTTCACTTTGTGGAAATTCATTGAGCTGTAACTAAATATACAGMTTATTTATGAAT 

GAAATG 

Sequence 1752 

ACCACTGCTTTCCCGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTAC 

CACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGCGTCCT 

Sequence 1753 

CAATCTGTTCTGCTCACCAGCGAGTTGTTCGAGTTCATCGAAGACGGACACGGCGGCCAC 
CTGCTGAGAGTGGGGACCTCTAAATTTCCGGTCGGCACGCTGCGGTTCACCGCCCAT 
Sequence 1754 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACCACTGTTCCCCAG 

CCTGCCAGATAAAAATAAACTGTCCACTCCAGGAGGGGAGAGGCACAAGCTGCTCTGATA 

GAAACCGTTCCTGAGCATTATCTTGACTGTTTGCAGAGGAGACATGTGTTCCCAGCCAGC 

ACAGAGCTGCAGGATTGCCTTTTGCGATTTGTGAGAATTATGTCATAGTGAAATATAGTT 

GAGAATGCTGGGCAGTACCTGCCCG 

Sequence 1755 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCGCTGCTTTT 

CTTGAGTTCATTTGTCTTTCCAATGATCGTATATGATTGAATAGTAGAGAAGGCACCTGC 

AAGTAGAAGCCAATGGCAAAAGATTGATAAGCATCTTTCCTGGACTCTGGTGCTTCGGGG 

GTGCCAATAACTGCACTCTGAGTMTCCTGGGTCACTTTCATCAATTGAGAGAGTTTAAG 

TCAGGGGACTTATGATTTCTTTCTCAACMGACCAGGTGAACGTGTTTTTGCAGTTCTAA 

GTATTATTTGCAAATTAAATAGCCATATATCI'CTAAAGCTTCCTGAGCAACAACCAGAGG 

TACCTGCCCG 

Sequence 1756 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTATGGCTGAGTGTTT 

CAATTAAAAAATACATTGCATGACAATTTTC^'GTTTCACAAAGAGAAACAGCATT^^ 

AAAATTGAA7TCTCCTGTGTAACCTGGTAG/\GTTTTCTATTCCTGAATATACATCCTTTC 

ATAGATATTACTAGCAGGCTAAACATCTGGCATTTAAATGTCATTACTATAAATAAACCA 

CAAAATTGGCTGCCTGATTTGAAGGGTCAATCTTATGGAGTAACTGAAGMGAATCATTT 

TTAMGGAGATGAGCAGTTTTGGATTTATGATACCAATAATTTCTCTCTTTAAGGAACTC 

CMCTCACTTMTTTAATGTTAAGAATAAAATCAGACTAATTAGTTCTCAGGACAGGTCA 

ATGGCACACCATTTTCCTTTTCACTTTGGCAGAAACCTAGGAGTCACTT 

Sequence 1757 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 
G G GTG AG ATTCTG CTGGGTG AAG CCTAAGG G C ATAG ATG AAAAGTTAGG ATTTG G AAG GG 
AGCCAGAAGCCAAAAGCAGTTCCCAGAGGCCAGGCAAAAACAGAAGCTGAGTGTGGGACT 
GAAGCCAAAAAGCGGAAGATGATGAATTCCACAGCGAACCGTGGGAGCCGTCCTGGAACA 
ATGCCTGAGATGCGCGCTGGCTTTCTGGGAGCAGTTAGGGCCCCTTTAGGTATGTGAACC 
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CGCCTCACTAAATGGCCATGAGCAGAACTGAACTGCCTACCTGTTTCTCCACCTGTGCAA 

GACCAAGTmCTTAAGMNCCCAGACGMNGCTTCCTTTGAAAATNATACCGTTGCCAT 

TTGGCTTGGCCCCTTTGGCCTTAMGCCTTMCAGGTTTANGGACTTTTGGGCCCCTGGG 
TTAAATTT 

Sequence 1758 

TACGACTCCTATAGGGCNNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGT 

TGAGAGCCAAGTCTCCCTTATNATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAG 

GGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTT 

ACTCATTCACACGTTAA TGGA AGMGMGGGMGTTTCCTAGAAAAATCCTCCCGCTCCA 

CCCTGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGACAAGACCTAAACTATTT 

TGMTTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACAGAATTGAGAATGTGATCAAT 

TTCAGAGTAGCATGTTACAMTTTTGTCCCAATTTCMTGGGGAGAATTATMCAAATCA 

GTAGTGGTTGGGAAGAATCTGTTCTTACTTTCCTCCCATCCAGTTGACTAGGGATTTGGG 
AAAG 

Sequence 1759 

TGGGCCGCTTTTTCCGCTTTCTCGCTTMCTGACTTCGCTGGCGGCTTCGGGTCGTTCNG 

GCTTGCNGGCGAGCGGGTATCAGCTTCACTCAAAAGGGCGGGTAATACCGGTTATCCACA 
GAATCAN 

Sequence 1760 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGATGAAATGAGT 

GATTCTGTCCTTAACAAACTCCTAGTCTCGTGGGGTGGACAGCTGGGAACTCATTTAGAG 

AGATTACAAGGATGAATACCCTTGGGCTCGATAGAGGTAGAAGCAAAGTGCTTAGTGTTA 

GAGAGCGTTTCTTGAGGAGGGAAAATCAGAATCAGTCTGGGTTTTAGGAAGATAATTCTG 

GAGGTTGAGTGAAAGATGAATGGAGAGGTGATAGACTAGCTTGGGCATTATGGAGATGGA 

AGGAGAACGGAAGGGGCAGATAGGACAACTAATTGGAGGCCGGAGTGAGAGGAAGGGGCC 

ATAGATGACTTCATGATTTCTGTCCTGGTTGCAATAAAGGGATCATGATCGTAAAGATCA 

TTCCCATTTGACAAAGGGAAGA 

Sequence 1761 

GACTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGGGGCGGTATCGAGAGTGTG 

CAGCGTCAGACGGGACATGGGGTACCTTGCTAAGTTGGTAAGTCACCCGGCCATTATATC 

GGCCATATCGCGCACGCAACAGACGTCAGGGTCGTCCTTTTGTGCTCGTTCGTCTGTGAT 

TGTGCGATCCTTGATTCCGGCAGGCTCTTAGCTGGCCAGCTGGCGCAATTTCTGGTCGCG 

CCGGTAGGCCCCTGGTTGCTCGCCCGTGACCCGCTTGAAGGCGCGTGTAAAGGCCGGCGT 

GCGAGCGGTAACCCACCAGTT 

Sequence 1762 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 
GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 
AATTGAGAAGGAGAGAATTCTACTGGTC 
Sequence 1763 

NACTACTATAGGTTNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGG 

CNTGTGAGGTGTCAGTCTGCCCCTACTGGGGGTGCCTCCCAGTTAGGCTACTCGAGGGT 

CTGGGACCCACTTGAGGAGGCAGTTTGTCCGTTCTCAGATCTCCAGCTGCGTGCTGGGTG 

CTGGGAGAACCACTACTCTCCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTCGAACCCTCGAAGGGGGGGGCC 

CCGGGTACCCMCTTTTTTGTTCCCTTTTAAGTGGAGGGGTTMTTGGCGCCGCTT^ 

CGTAAATCATGGGTCAATAAGCCTTGTTTTCCTTGTGTGNAAAATTTGTTTATTCCCGCT 

TCAACAAATTTCCCACAACAAAACCATTAACGAAGCCCCGGGGAAGCATTAAAAAGTTGG 

TAAAAAGCCCTGGGGGGGTGCCCTA 

Sequence 1764 
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CAGAGGGGTCTGTCACATAGATGCCTTTCACAGCTGGAAGTGGCAGAGCAGTTGGGTGAC 

CCTACCCAAAGGAAAGATAGATGTGTTCCTGGCCAGTGCAGGGGTGAGATGTGAGCACAG 

CTGCATGGCCATGCTGCTGTCTTGGGTCTTTGTCGGAGGGGATTCAGAAGTTTAGACAGA 

GGTCATTGCCCAGATCAGTTCCAGTGCTACATGATGCTGGTGGTCATGACTTGCAAGGGT 

GGTTCCTGGGCCATCAGTGAGAATAACACAGGGAGTTCCCTCTCTAGGTCTCAGTGAGAG 

^CAGTCTCTGAGGCCCTGGCTTCTGAGCCGGATATTCCTGATGGGTACCT 

Sequence 1765 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGAGTAAACTACTTGAAGGCAATTCCTGTGCT 

TAAAATGGCCACGTGAAAGGTGTCCCTGACATGGAAAGGACAAACCATGAAATGACCCTT 

GAAGCTCTTAAGGTGTCTGTCAAAAAAACAAAGCCAACTGTCAAACTGAAMGCCAAAAT 

CATCACAAAAACACCTATTCTTCAGAGGCCTCAAATCTCAAACTATCTCCCACCATCCTT 

TGTAAGTATGTATAAGGAGGCCAGGAAACTAGCATTTGCTGAGACTTTATTCTCAGCTTT 

GTATGCTTTGCATAAATGATGTCATAGAACACATTGACAGTGAGAAAACTAACGTATTAA 

CATTTTAATTACCTTGCCCTATCACAAGATTAGAAAGATTCAAGATATGAGACATCCAAA 

GTCTGTGATCTGAACACCAGAGTCAGAGGCTGGCAACCTATGGCCCACGGGCTGAATCTG 

GCCCACTGCCTGTTCCTATAAAATAAAGTTTTATTGGA 

Sequence 1766 

CCGCGGTGGCGGCCGAGGTACTAGGATTGCAGGCATGAGCCACCATGCCCAGACTAATTC 
CTGTTTTCAATTTTAGGTAGTCATTTGATTATTGAGTTGTAGAMTCTTACTT^ 
AATATATTAAAAACTATCACCAGTTAGCCATGTGTTAACTGCTTAG Mil l CAACCAAAC 
TTCAATCATCAGCCAGCCCTCACTTTCAGCCCAGATCGACGGACTCAACGCTTGGTGATC 
GGTTAACTCCTTGTTTTTTAGCTAATGCTTTAA 

TAAACATCATACTTCTCAATGCTTTGGTTATCAGCTAATGTCTTC I I I I ICAGCTAACAC 
CTTAGTGGACATTAATACCCTTGTTAACGTATGTTACTTCAACCAGATTAGACCTGGCTG 
CATTGGGAAAGGAAAGTGGGGGGGCTTGACAAGG 
Sequence 1767 

AGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACGCAAACAGCAGA 

ATCATATGGCATAGACACTTAGCAAATAGTTTGACTTTGAGTCAGCTCAGCATTCTAAAT 

CAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATAGGCAAGCGTCTCTGC 

ATCCCAGAGCAAATCCTCTGAGATCCTCAGTCTCTCCTGCTGAAACTTCATCTTCCACTC 

CTGCCTTTGGGTCATTTCCAAACGGCAGGGCACCAGACACGCTCTACCTTATGTGGAGCC 

ATACTGTTCATTTATTAGTTTTCTGGCTTCGTGGAGAATTATCTGCCTTCTCTGAGTATT 

GATTTCATTGTGCCCCCGCGTACCTGCCCG 

Sequence 1768 

GCTTGAACAAATCCACCTTCTGTGGACCAAGCACCACCCTGGGCATTTCTAGCATGAGCA 

AAATCCAAGGTCCTGGCTGGACTCCAGAGATGCTATTTACCTCAGAAGCATGACAATAGG 

AGGCAGAAGGAGCAGGCAMTCCAAGTCCTTTCTTGTAGTTTCCTTGTTTGGGGAGGAAA 

AGTTGAGTTTTACTATTATGGAAAAGAAACAGGAAATAGAGACAGACAAAGAGATATGAC 

AATACAGTCCTGCCACCCAGATACTCATTTCCACCCACCATTCCATGCATTTGTTTTGAA 

TATATAAGTATGTACCTGCCG 

Sequence 1769 

CCGGGCAGGTACGTGCTAAGTTGAAAAAGGAGCCTCAGATTTCTCTGACCrACCTCTTAC 
CCTGTGCTGTCTCTCCCCAGGATTTTCTGAAGTTCCTTGTCTGCCTGAAG TCCAGACCTG 
CCTCCTCCACATCTGTGGCTTGGTGCGGAGAGCATCTCTGGGCACTGGAGGAACGGTACC 
T 

Sequence 1770 

CCGCGGTGGCGGCCGAGGTACATGTGAATGGGAAACATTCATAAGTTCCTATTTGCCAAG 
GGCATGTAAATGTGATAAAGCGGATCCGAAGACACATGGGAGTGAGACTAGGAAAGTATT 
AAGAACATGGGCCTTGACCGGGCGCAGTGGCTTACGCCTGTAATCCTACCACTTTGGGAG 
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GCCGAGGCTGGTGGATCACAAGGTCAAGAGTTCGAGACTAGCCTAGCTAAGATGGTAAAA 

CCCCGTTTCTAAAAATTAGCTGGGCACAGTGGCGGGCACCTGTAATCCCAGCTACTTGGG 

AGGCTGGGGTCACTTGAACCCTGGAGGCGGAGGTTGCAGTAAGCTGAGATCACGCCACTG 

CACTCTAGCCTAGGTGACAGAGCAAGACTCTGTCAAAAAAAAAATAATAATAAAAAAGAA 

TGTGGGC 

Sequence 1771 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGTTTTTAAATCTGTCACT 

GTTCTTCCCCCACTTCAAAGCTGACTTGACAACTGCATTTATTTCACATTGCAATTGCTG 

CAGGTTAATAATATTGTTTTATCAGGAMCATTTATCTTGAGGAAGAAAGTTATCCTAAT 

ACNGCAAGATTACCTNATCCCANGGNCAAANTAGAAAGGGGACTGTGTCTAGAGGGTAAT 

GCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACACTATTGGAGGGAAGA 

ATMGTCATCGTTTCTATGAAATATTATGTTTCCTG 

TC AGCTTAAAAAC ATTGC ATTTAAAG GATTG AG ATG ACTTTAA 

Sequence 1772 

TNNNCCNTCNGTCCTTTTNGGTTTGCNGCCGAGGCGGTNTTCAAGCTCACNTCAAAGGGC 

CGGGTTMTACCGGTTTATTCCACAAGAMTCAGGGGGATAACCGCAGGAAAGAACATTG 

T 

Sequence 1773 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTGACAGTTTATAAAAAT 

G CTAACCTATATCAACATTTTTC CCCACC AMGTGTTCCC AAG GGC AAAGGTAAGTG AAT 

ACATGTTTCCATGCTTGTGTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACCT 

ANATTNNANNCCNNAAAGCCTTCAAANTTCCTTTATTAAGMCTTGGGGACTGCATCTTT 

AAMTMTATGGGTTTTNTmATTCTTTGATMTAAAMGAATGGAAC 

TTGTCTGATTCTTTGCCCANTGCNTCTTTCTNATGTCTAACACTGCCTCGTCTTATGAAT 

AACNTCTGNACATNTACAGAGGAAAGGACTTATTTGNNTT 

Sequence 1774 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACACTTATAGT 

TGAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAG 

GGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTMCTGTCCTT 

ACTCATTCACACCNTNATTGGAAAAANAAGGGMGGTTTCCTAGAAAAAATCCTCCCGCT 

CCACCCTGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAAACTA 

TTTTGAATTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACAGAATTGAGAATGGTGAT 

CAATTTCCAGAGTAGCCATGTTACAAATTTTGTCCCCAATTTTCAATG 

Sequence 1775 

CCGCGGTGGCGGCCGAGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCA 

AACGCAAACAGCAGAATCATATGGCATCGACACTTAGCAAATCCAATGCCTTCCAGGCAT 

CTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCAGATAG 

TGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCACACCGTCTGGTGGGACA 

TTTCACCCMGTTGTTCATTTATTAGTTTTCTC 

TGAGTATTGATTTCATTGTGCGCATGGTGAAGATATCGCCTGT 

Sequence 1776 

CGAGGTACGCACTTGTGTGCNTTTCTGTTGGGTATATAACAAGTATGATATTGCTGGGTC 
TTAAAATATTTATATATTAACTTCATTAMTGCTC^ 

CTATTTTACCTTCACCAGCNCTAGNTGTTCCATATCCTTGCTGTNACTTGGCATNATCTG 
TA l I I n i ATTTTGACCAGACTGGTCTGTGTGTCATAGTATCCCATTGTCATTTTAAGTT 
GCATGCCTTTGATGAAGACATTTTCTGATGCTTAATTGGCTCTTNGAATATATTCTGTAA 
CTG ATTTGTAGG AGTCCC NTTATATATTCTG G ATATAAG NTTTTTTATTG GAC ATAAATA 
TTGCANATATCCCTCTCCCACTTTAACTTGCCTTTT 
Sequence 1777 
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ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTA 

1TCTGAAAAATCTATCTCTAGAATGAAGTCATGACAGTTCACCCTACTCTACTAGTTTTG 

TCCMGCTAATTTTCATGTATGATTTTGTMTCTGATTGAACMTACTTGAAAAAAAAAA 

AGTTAGATATATAAATGAATTTGTTGGGGGATAAAACATAAATTAAGCCTATCCCTATAA 

AATCTACTCTTTCCTGACTTCCAGGAGTGGTCCTTTATTTGAGAGGCAAAACATATACAT 

TTCACATTACCATTTTTGAGACTTGATGAAGCCCAGTCCTAAACTGTGGCTGAGGCTAAG 

GCTGAATGGTTGGGATTTACTGTGAGAATGGACATTCTCCCGGGTCCTCACCTGAGGTCA 

TTAMTTGATTTAGTCTAGGTCCACTGGTTGTATTTT 

Sequence 1778 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGA 

CAGCATGCAGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAAT 

GTGATGTTGGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATG 

GGAGGAAAGTAAGAACAGATTCTTTCCAACCCTACTGATTTGTTATAATTCTCCCCATTG 

AAATTGGGACAAAATTTGTAACATGCTACTCTGAAATTGATCACATTCTCAGTTCTGTCA 

ATAGCAGAGAATTTAGCCTCMCACCACCAATTCAAAATAGTTTAGGTCTTGCCCCTGCC 

TGATTATGTAACAGAAMGCATTAMGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGA 

AACTTCCTTCTCTTCCATTAACGTGTGAATGAGTA 

Sequence 1779 

CTACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGGGTCCGGCGCATCCATGGCGA 

ACAGGCAGACGAACGCGCCTTGCTGCGCGCTGCTGGCGGTGGCGCCCTGCGGCGCCAGCA 

CCGTGAATTGTGGCGCCGTCACCAGTTGCAGCAGCGGCTTGGCATCGGTGCGCAGCACGC 

GGCGCGGCTGGCGGCGCGGGCCCNGCGCCGCANCAATGGCGGGCCTGNAGTTCCAATNGG 

CAAGGGTGCGCGCGCGGCNN 

Sequence 1780 

CCGGGCAGGTACAGCACCTTGTGTCCTGCCACCAGCCGGCTGTGAAACTGGTACGCGGGG 

TAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTA 

GGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCT 

GCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATC 

AAGAGAAGATGAGAGGTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTGGTGTTCT 

TGCCCAAGCTTTGAAGAAGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTG 

CCCAA 

Sequence 1781 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGTTTTTAAATC 

TGTCACTGTTCTTCCCCCACTTCAMGCTGACTTGACAACTGCATTTATTTCACATTGCA 

ATTGCTGCAGGTTAATAATATTGTrrTATCAGGAAACATTTATCTTGAGGAAGAAAGTTA 

TCCTAATACAGCAGATAACTTATCCAAGGGCCAAATAGAAAGGGGACTGTGTCTAGAGGG 

TMTGCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACACTATT GGAG GG 

AAGMTMGTCATCGTTTCTATGAMTATTATTTTCCTGATTGGMCCATTTCCATTTTC 

GTCAGCTTAAAACATTGCATTAAAGATGAGATGACTTTAAAACGGTGACAGCTATCTCAA 

GCCCTGTGATTATAAATGGTGTTATAGGCTTG 

Sequence 1782 

ATAGGGCGAATTGGAGCTCCNCGCGGTGGCGGCCGAGGTACCAAAGGTCTTCAAGGGTAA 

CGATGCTGCTCCATTTAAATAAAAGATGAAGGATGTCGAAACACAAGCAGCCTTTAACCA 

GTAGGCCCATAAATAAGACAGTAGCTAGTGCTATGCAATTCTAACACTGGCACTTGTGCT 

GGGATTGGCCTCTTCTAGGCTAGGAATTCTTAACCTGGAGGTCAAGGACTTGGGGGGAGG 

GTCCTATGGACCCCTTGCAATGTAGGCAAATTTTGCATAAACATTTTTCTGCACACGTTT 

CAAAGGAACTTGGAATATCCGTAAGTTGAGACCTCCAAATTACTAGAGCTAATTGCCCCA 

TTTTATTCCTTTGTATCATCTCAAATGCCMTATAATATCTAGTGTATAATAAATGTTTG 

CTGGCTGCTTAGGAGAGAACTAGATGTGTTAATGACGCCATCTCACAGTGAAAAGGGGCA 
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Sequence 1783 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGGGGACC 

GCCAGGGGCCTCGAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGTTGTTTCA 

TTAAGTGCAAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGCCCGAGAC 

TAATTTTTCCATTGCTAACTCTAGATCTCGGCCACTGGAGTCTGMGATACTCTCTTTGA 

GAATGCATATTATTTTGCTCACAGCTAAAACATTTAAGTATCATAGCTGATCAGTGGAGT 

GAGATTAAAAGGTTTC I Nil IGAATCATCAGCTAGAAGATGTACCTCGGCCCGCTCTAG 

AACTAGTG 

Sequence 1784 

NGCGGCCGAGGTACACTTATAGTTGAGAGCCAAGTCTNCCTTATCATTGGTTAATGAGAA 

TGAGCTACTGAAAACAAAAAGAGGGTCTTNTACTCAGCCTCTACCCCTAATATTTATATC 

AGMGCAGAGATTAACTGTCCTTACTCATTCACACGTTAATGGAAGAGAAGGAAGTTTCC 

TAGAAAAATCCTCCCGCTCCAC 

Sequence 1785 

CCGCGGTGGCGGCCGCCCGGGCAGG1 ACCATTGACTACTTCATACTGCTACAAAATGCAT 

ttgaaai nrnrm i ctggccmggaacctattcatccactctctccaaattaaatactt 

ACTAGAGAAATGTTGGCACTTATTAACCTTGTAATATGMTTAGAMCAMCTTGACCTT 

CATATATTCTTTTACACTTATGMTCTATTTAACTGCATGATTTATCTGCATGATTCAGA 

ATTTTCTTGATACCATGTAACTGMGTMGATTTAGAMTATATTTTCTTGATACCCTGT 

AACTGAAGTAAGATTTAAAAAGCTGAGGTGACAGTTGTCACCCATAATGCTGATAAATAT 

AGG AATG AGTCG ATTATAAATTTAAACCTTTTG GG GG C ATTATGCTACC AAAATNTCTCT 

TTTTTAAAA 

Sequence 1786 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTATTTCTTTGCAAGATAGTTTTGTAAAGCGT 

CCATAAGCAGAGACTGTGTCAAGGCGTCCTACGCAAATTACAGGACCTTGCACAGACAGT 

TGCATAGAGCTGCATAGGGGAAAGGCGGGCAAAAGGTCTGACCTAAGTCTTCAAACACAN 

GCCTTCATACACCGTGCCACCT 

Sequence 1787 

GGTGGCGGCCGCCCGGGCAGGTACAGATTGNGTCCACTGGAAAGGTAMTGATTGCTTTT 

TTATNTTGCATCAMCTTGGAACATCAAGGCATCCAAAACACTAAGAATTCTATCATCAC 

AAAAATAATTCNTCTTTCTAGGTTATGAANAGATAATTATTTGTCTGGTMGCNTTTNTA 

TAMCCCACTCATTTTATATTTAGAAAAATCCTAMTGTGTGGGTGACTGCTTTGTAGNG 

AACTTTCATATACTATAAAACTAGTTGTGAGATANCCATTCTGGTAGCTCACTTAATAAA 

MCAATTTCAGAATTAAAGGAMTTTTCTATGCAAGGTTTACTTCTCANATGAACAGTAG 

GACTTTGTACTTTTATTTCCACTMGTGAAAAMGAACTGTGTTT7TAAACT 

ATTTAATAAATCAGCANGGGTATTTTAGCTAATAGA 

Sequence 1788 

AACCTTTAGGGTTNTNCCCAAAGTCCCGGCCGNCCGGNTNGGGACTNGGGGGTTGAACAC 

TTCGAAANTTTCANCTTNCGCACNACGAAANCAACACNNNGAGAAGGCAGATAATTCTCC 

ACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGGNGNGA 

TATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAAAGACACCTATCTGAAGATATGAACC 

AGTTTTGCTCTCCATAGGGGAGGATTTCATCAACAGGAAACAAGATGCC 

Sequence 1789 

AGGTACGCGGGAGTAAATACAGTACAMTTGTTCTACTGTTTTAAAAAGTTTTCCGCAGA 
ACAGTGCATTTATGGCAATGCTATGTTTAATGAGTTAGGGACATCAAATATATAGTAGTT 
CCTTATTTTCAGTTGTGAAMTGAAATGGCTAAAGCAGAAGAGACGTCTATTTTAGTCTT 
TTAAAAATGTGTGTGGGTGGTC I N i l I CCTCAGAAGCCCAAAGCACATGTATATTTTGT 
TATTTCTCCTTGCTATATTCCTGAGACTATACTAAAAACTTTAAGAAAAGGAACAAGAAA 
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AAGGTAAATTCATGTGTTCCCCACTGCTGTGTCTAGAACCAAGATCACATTATATCATTG 
TTAAAATTGTGTTATCTAGAAAAGGTG CAATATAG GGG AAAAC ACTCTAAGAATCTTTTA 
AAAGCCTAGT 
Sequence 1790 

AGGTACGCGGGGGACTCCTTGAGAAGAAACGGCGGAGACCTGAGACCGGGAGGCTGAGGC 

TGTAGGTGGGCTCCGCTGGGTAAAAGTTGCCGCAGCAGCTGTCCCTTGGCCCCATCGCGA 

TTTATTTTTCCCCCTTGCTTTCCGGGTCCCGGGATCCCAAGTTTGTAACTAACGGGAAGC 

GAMTCCCACCCGAAGCAAAATGTTTGCGAAGTTTCAGGCNCCCTTAATTGAAAGGCTGT 

AATTAACAAGTCCGTTGTTTGCCAGCCAGGCNCCGTTGCAGGCNCTTTCTGNNGATTGNC 

ATTTATTTCTCACMGCAACCNTAGGGGGNTTGTTANTNNTTGAAMTNTGGAAAANACN 

TNTTAAAMTTTGNGGGAAAACCNCNNTTTTTGNNNNNANCCCCNI I II I I I II I ICGGG 

GGGGGAAAAAAAA 

Sequence 1791 

CCGCGGTGGCGGCCGAGGTACAACTTGACAGTTTATAAAAATGCTAACCTATATCAACAT 

TTTTCCCCACCAAAGTGTTCCCMGGGCAAAAGTAAGTGAATACATGTTTCCATGCTTGT 

GTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACTTAGATTAGATTCCAGAAGC 

CCTCAAATTTCCTTATAANACTTGGGGACTGCATCTTAAAATAATATTGI I I M I I I I AT 

TCTTTGATAATAAMGAATGGMCCAMTCCTGGTTGTCTGATTCTTTGCCCAGTGCTCT 

TTCTTATGTCTAACACTGCCTCGTCTTATGAATAACTCTGACATTACAGAGAAAGACTAT 

TGTTTGTTCAGTTCAACACTCMTTCCATAGGTGCTAGCAATTATGAAAAAGGCTGTTAG 

TATCCAGCGGGGAAAGCAAAGGATCCGCAGGTGGTAACACCCTA 

Sequence 1792 

CCGCGGTGGCGGCCGAGGTACATGCCAGGCACAAGCTGGAGTCACAGATGCCACACTGAC 

TATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTTTCTTATGT 

TTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCAGTTGGAAGT 

CAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGTCCACCTGTAACACACGGAAAGGC 

AGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTACAGAGAGTTA 

CGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAATGACTGAT 

AGATCCTTTTACAATATTCTAGGGTAGTCTGTCTGTGGGAGGGGGCAGGAGTCTATGGGA 

ACTCT 

Sequence 1793 

AGGTACAACTTGACAGTTTATAAAMTGCTMCCTATATCMCATTTTTCCCCACCAAAG 
TGTTCCCAAGGGCAAAGGTAAGTGAATACATGTTTCCATGCTTGTGTCAGGGGCACACAG 
CAAGGGAAGGCAGAATGGGCATGACCTAGATTAGATCCCAGAAGCCTTCAAATTCCTTAT 
AAGACTTGGGACTGCATCTTAAAATAATATTG I I I I 1 1 I I I ATTCTTTG ATAATAAAAGA 
ATGGAACCAMTCCTGGTTGTCTGATTCTTTGCCCAGTGCTCTTTCTTATGTCTAACACT 
GCCTCGTCTTATGAATMCTCTGACATTACAGAGAAAGACTATTGTTTGTTCAGTTCAAC 
ACTCAATTCCATAGGTGCTAGCAATTATGAAAAAGGCTGTTAGTATCCAGCGGGGGA 
Sequence 1794 

GTAATACACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTCTGCG 

TTGTTACCACTGCCTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTT 

GTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGT 

TACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTA 

CCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACC 

ACTGCTTCCCGGGACTCTGCGTTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCGGGCTGCAGGAAATTCGAATATCAAAGCTTTATCGATACCGTCGACCTTCGAAGG 

GGGG 

Sequence 1795 

CCGCGGTGGCGGCCGAGGTACTCTGCGTTGTTACCACTGCCTCCCGGGACTCTGCGTTGT 
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TACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTA 
CCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACC 
ACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCAC 
TGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTACCTGCCC 
G 

Sequence 1796 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAACATTTCGACATACGCCAGCA 

TGTGATGGCCGAAGGTGATCGGCTGGGCCACCTGCATGTGGGTAAAGCCCGGCATGATGG 

TGTCGGCATGCTGTTCGGCCAGGTCCGTCANCGCGCTGCGGAACGTCGCCAGCAGGGCGG 

TGATGTCGTCGATGGCGGAACGCACGTTACAGGCGGGATGTCCGGTGGGCCACCTGGTCC 

TTGC GCGAACGGCCGGTGTGCAATCGCTTGCCCGCATCGCCTACCAGTTCGGTCAGGCGT 

TTTTC G AT ATTAAGG TG G AC ATCTTCC AAG ATCC AG C AG C C ATTC TAAC 

Sequence 1797 

NTTTTTTTTTTTTTTTT7TT7 I GGGCAATACCAGCTTTCACCCTGTCTTCTGCACTGCTT 

AGCCTTGGCTCTGCTGAGTTGTGCAACACATTCTGAGAGATGCCATACAATGCTCCAGGC 

AAACTTATAG AAAAACAG GAATG AGTG ATTTTATACGG G ATGTGTTTCAGCTGTCC AATT 

CAAAATAAATCACGTCCAGGTGCATTCTTTCCTAATTTGTGACCCAAACTGCTTGCACTT 

TATACATTMTGMTTATTTTTTTAAAAGAAATTAGCTNTATCATGAAGACAGTGCTATC 

TMCCCTTTGTGGTCTTTCTCCCTTAAAMGCMTTGTTACCATTTTATAGGTGTTAGGT 

TTAATCTACATATTTGAATGAAGCTGAAATGAACTAACGTATGTCAACCAGTAAAAGCAG 

GACACTTMTAATAATGCTTCCATGAAAATATACATTGTGCATATGTAGGCTCTTTTTGG 

CATTACCAMTTTGCCAAGMCTTTCTTCTATAAATTAAATTCATTATCAAM 

GTGAGCCAGCCTTTAAAAATTNCCAT 

Sequence 1798 

CGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGATCTTACCACTGAGGAATTTT 

AAGTTATCATTC CTGT AAACATTTCTCCAGTGGCTCCACGGAATTTAAAAATGCCTTCCA 

MTTGCCTTCAGTTTTCACAGGCCATCTCTTACCCTGTGCTGTCTCTCCCCAGGATTTTC 

TGAAGTTCCTTGTCTGCCTGAAGTNCAGACCTGCCTCCTCCACATCTGTGGTTTGGTGCG 

GAGAGCATCTTNTGGGCACTGGAGGAACGGTACCTGCCCG 

Sequence 1799 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACANGATCT 

GTTTCCAGAAACCACTACAAG TGTCATGAACCTTGGGTTGCTCTCTGATTG AATTC TGGG 

ATCCAAGCTGGGTA AGGAG AGAGATGAGTCCAGGCTGCTACAGAAGATTATTTTCCATAA 

TTTCAATATGAGCAGTTTTNAMCACANAAGGATTTNNTNGTCNATGTTAGCTAGGCCAG 

CCTTAACACTTCCTCTACCTCATTTCTTAGAAACCAACAAGGGCCTTTCCAGAGAGGGCC 

AAGGACGGTGGCATCCAGAAGGGATGAAAACACTTGAGTACCTCGGCCGCTCTAGAACTA 

GT 

Sequence 1800 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTGACAGTTT 

ATAAAAATGCTMCCTATATCMCATTTTTCCCCACCAAAGTGTTCCCAAGGGCAAAGGT 

AAGTGAATACATGTTTCCATGCTTGTGTCAGGGGCACACAGCAAGGGAAGGCAGAATGGG 

CATGACCTANATTAGATCCCANMGCCTTTCAAATTCNCTTATAAGACTTGGGACTGCAT 

CTTAAAATAATATTGTN 'I I II I I I ! I I ATTCTTTGATAATAAAAGAATGGAACCAAATCC 

TGGTTGTCTGATTCTTTGCCCAGTGCTCTTTCTTATGTCTAACACTGCCTCGTCTTATGA 

ATAACTCTGACATTACAGAGAAAGACTATTGTTTGTTCAGTTCAACACTCAATTCCATTA 

GGTG CTAG C AAATTTATTG AAAA 

Sequence 1801 

CCGGGCAGGTACTCACAGATCTGTTTCCAGAAACCACTACAAGTGTCATGAACCTTGGGT 
TGCTCTCTGATTGAATTCTGGGATCCAAGCTGGGTAAGGAGAGAGATGAGTCCAGGCTGC 
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TAC AG AAG ATTATTTTCC ATAATTTCAATATG N GCAGTTTTAAAACAC ATAAN GG ATTTA 

TAGGTCATGT TTAGC TAGGCTCAGCCTTAACACTTNCTCTACCTCATTTATTAGAAACCA 

ACAAAGGGCCTT7TCCAGATGAGGGCCAAGGACGGTGGCATNCAGAAGGGATGAAAACAC 

TGAGTACCTNGGCCGCTCTANAACTAGTGNGATCCNCCGGGCTGCAGGNAATTCGATATC 

AAGCTTATCNGATACCNGTACGNCCTNTGAGGGGGGGGGCCCG 

Sequence 1802 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAATCACTGGGTGTCCCAGT 

GCCCGCTGTGAGCTACGCTACCCGCCTAGGAGAGTCTGAGAGGCCAGTCTCTCTTGGCAG 

TGAGGCCGTCTGTCCGGAAATAGAGGCGCACATTGGGAGGGAAATGCTCTCTGTGTACCT 

GCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCCTGNAAGGAATTTCTAANAT 

CAANGCTTATCGATACCGTCTGAC 

Sequence 1803 

TGGCCGCCCGGGCAGGTACGCGGGGATAATAAAAAATGTATTTACTGAGCCAGTTGTGGT 

GGCTCGCGCCTGTGGTCCCAGCGCCTTGGAAGGCCAATGAGAGTGGATCGGTTGAGGCCA 

GGAGTTTGAGACCAGCCTGGCCAACATGGTGGGATGCCGTCTCTACTGAGAATACAGAGA 

TGGGCCGGGCGCGGTGGCGCGTGCCTGTAGTCCTCAGCCTCCCAAAGTGCTGGGATTACA 

GGCGT GAGC CACTGCACTTGGCCTGGACTCATCTTCATTGTCCACCTCCTAGGCTAAATT 

TATGTCTTTTATGTGCTTGTAATCTCTGCATACCACTTTAGTGACACTTGCCCTGGAGTG 

TGATTGGTAATCC 

Sequence 1804 

AGGTACTTCTGCCTGGGTTATGGGAGTAAAATCTTGTCCAGGGACATCCCAGGGAAGGTG 

AACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTACACACAA 

CTCAGCATGATGAAGTCGTATTTGCAGATCAAGAGAGTCTTGTCTGTGACCAGTAGAATT 

CTCTCCTTCTCATTGTTCCAGTGGTCTATCTTTGTCAAGAGCCAGAAGCCTTGAATGGTC 

TCTCCAGAAGTCTCAGCTACGTGACCTTTCAAGTCTTCCATGGCAGTCTCAATGGCCCC 

Sequence 1805 

CCGCGGTGGCGGCCGAGGTACTTCCGTGAGATATAAGGCTGATGATAAGGATTGGGGTAA 

AGAGTTGATTCGGTCACACTTGACTGCCTTGAAATCTGTCTTGGCATCGGGTCCTCTTCC 

TCCATCGATAGGTGGTCAGTCTCAGAGGGTTGGTCATCTCCCTCTTGCTTCAAAGACACA 

GNCCTGGAAATCNTTCAAAACTCCCAATTGTTGACATTTCTTCAGCCTGACTAGGCTGGT 

CCTCAAGGCTCTTCrTTCTCTTTCTGCACAAAGGGGGCTAGACCACAGATAGTGGCCAGG 

AGAGGTGTGACTGGCTCAGATAAAGCCAGGCAAGACAGTGCCCTGTGCTGTGACCCCACT 

CAGTCTCCCGCGTACCTGCCCG 

Sequence 1806 

CCGCGGTGGCGGCCGAGGTACATACATACTTGTTATTCTTGTTACATCTTCC7TATTGAA 
TAAATNGTCATCACTTAAMTGTCCCTCTTCATCTCTAATAAGTTTTTGTCTTACAGACT 
ATTTTATCTAGTATTAAGTATAGCTGCCCC AACTCTGTATG ATTG CTGTTTTCATGG N AT 
ATCTTTTTCTATCCTTTTACTTTTCTTTTAAAAATAATTT 

CCCTAAAATGCAGAAGATTTTTAAGTGCTCAGTTCAATGACTGGTGACMTTATACACAG 
CTATATATCCCACCCCCAMCAAGATATTAMCCATATATAATATTTCATACAGGNTTTC 
TCATGCCCCTTTCCAGGCCAATCCCCAT 
Sequence 1807 

CCGGGCAGGTACTACAAATACATTTTTATAAATGGATGATTGGAAGTGCCCTAAATGTCT 

TATAAGAAAATACATTATGGTACGATATATATACTAAAATTTATATTTCAGAAGAATATT 

AAAAGAGGAAAATGCTAGTGGTAAAATGTTAAGTAACAAAAACATATATTATCCTCAACA 

AAATATTATCAAACCAAATTCAGCAACATCTAAAAAGANTTATACACCATGACTAAGTGA 

GATTTATCCCTGGAATMGCTTAGTTTAACATCTGAAAATTCATCAGTGAAATACACCAT 

AGTAAGAATAAAGGACAAAAACCACAGGATCATCTCAACAGAAACAGAGAAGGCATTTGA 

CAAACTCCAACACCCTTTCATGATAAAAATACTCAACAAACTAGGAATAGATGAGAATTT 
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CCTCAACCTAATAAAGGGCTTCTGTGAAAATCTCAAGGCTAATTTCGTATTAATGG 
Sequence 1808 

CCGCGGTGGCGGCCGAGGTACTTCTGCCTGGGTTATGGGAGTAAAATCTTGTCCAGGGAC 

ATCCCAGGGAAGGTGAACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGC 

TGCAGCTGCACACAACTCAGCATGATGAAGTCGTATTTGCAGATCAAGAGAGTCTTGTCT 

GTGACCAGTAGMTTCTCTCCTTCTCATTGTTCCAAGTGGTCTATCTTTGNCAAGAGCCA 

GAAGCCTTGAATGGTCTCTCCAGAAGTCTCAAGCTACGTGACCTTTCAAGTCTTCCATGG 

CAGTCTCAATGGCCCC 

Sequence 1809 

CCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATGTGAAGATGGCGGAATTCCCAACGTG 

CTTCTGCAAAATTTTATGTATTCATCCCCCTCTGCTCTCAGTAGGGTGTTACCACCTGCG 

GATCCTTTGCTTTCCCCGCTGGATACTAACAGCC INN CATAATTGCTAGCACCTATGG 

AATTGAAGTGGTGGMCTGMCAMCAATAGTCTTTCTCTGTAATGTCAGAGTTATTCAT 

AAGACGAGGCAGTGTTAGACATAAGAAAGAGCACTGGGCAAAGAATCAGACAACCAGGAT 

TTGGTTCCATTCTTTTATTATCAAAGAATAAAAAAAMCMTATTATm 

CCCAAGTCTTATMGGMTTTGAAGGCTTCTGGGATCTAATCTAGGTCATGCCCATTCTG 

CCTTCCCTTGCTGTGTGCCCCTGACACAAGCATGGAAACATGTATTCACTTACCTTTGCC 

CTTGG G AACACTTT 

Sequence 1810 

GCGGTGGCGGCCGAGGTAC I I I I I I I I I M M II I I I ! I I GCAAGATGGAATTTCATCAT 

TCCATCGGTTTTGCCCAGACTGGTGGCAAACTCCTGGGCTCAGGATATCCGCCAGCCTCA 

GCC TCCC AAAGTGCTGGGAATAC AG GTATG AGCC ACTGCACTCG ACC ATTTATG CATTTC 

TTCATAATCCTTTTACTGACMCCGTAAATTAACCTCTGGGTTTCTGTCAATCACTGGAT 

TTTAAGTCACTAAGTAGCAGAACTATATCTAATTTATTAACATTCATCATCCCCCTTTGC 

C AGC ATTG AAC ATG ATGTTTTGC ATATANTAGGC ATTAAATT N GCTAAAN GG AACTAAAT 

MTGTTTGATAMTMCATCATGGCACATGACCATATTAGGCCTAACTAATTATTTATTC 

TTCCATTTAGTGAGTGCAAAACTTTAGTCAAAAACATTAAACATAC 

Sequence 1811 

CCGGGCAGGTACTATTAAATTTAATGAGAGGTAACCCATGCACCGGCAGGAAGTTCAAGG 

GGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCTCACCTCCTTCCTTCTAGT 

TTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCAGGCCCNTTTGAATAAGAT 

GCTGNCCTAAATCCTTCCTCCCGCGTACCTN 

Sequence 1812 

CCGCGGTGGCGGCCGAGGTACGCGGGGGGGACAGGCCATCTCGCTATAGGAAAGGAAAGT 

GGMCAGCATTCATCCTCMCATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACT 

GATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCA 

CAAGTGAAANCAAGATACAGGAGATAGGGAANGGAAAGCTGGAAGCANAGGTCACTGGAG 

GGAGAAGGGAGATGGACACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCT 

CTCAATTGTGGGACCCCTCAAAATGTGTACCTGCCCG 

Sequence 1813 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGACAGGCCATCT 

CGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCTCAACATTTTTACGAAGACAAAATG 

AAGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGAT 

GTGGTGTGAGAAGGTAGCACAAGTGAAGCAAAAGATACANGGAANATAGGGAAAGGAAAG 

CTGGAAGCAGAGGTCACTGGAAGGGAGAGGGAGATGGACACATTCAGGGCTACAAAGCAA 

GTTCTATGTGATTTGCTCACCTCTCAATTGTGGGACCCCTCAAAATGTGTACCTGCCCG 

Sequence 1814 

AGGTACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCG 
GGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGG 
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ACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCGGGACT 
CTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCT 
GCGTTGTTACCACTACTTCCCGGGACTCTGCGTTGTACCTGCCCG 
Sequence 1815 

CCGCGGTGGCGGCCGCGTCCTTGCAGGGTTTTATCGGCGCGCAGGTGAAGGCGGACCGCA 

ACCGCCTGGCGCTGGCCATGATCTTGTGCTGGCTGCTGGCCGACGACTGGTTCATCGGCC 

AGCGCTTGCCGCAGCAGGATCTGCTGCAGGTGCTGGGCGAGGCGGCGCGCGAACTGGCCG 

CCAGTACGCCGGCCCACCAGTTCACGCAAGACCCGGAGCGGCGCGAGGMCTGGCGCGCA 

TCGTGCTGGCGCGCCTGGGCTTTCGGCCCAGCGACGAAAGCGTAGCGCAGGCCACGGACA 

GATTGTCTGCCATCAGCGGCACGGAGCGCCGCCGCCTGCTCGAACGAAGCCGCTGGCCGA 

ACAGCGCGCGCGGGAAATCCGCGAGGCGCTGGCGAAAAA 

Sequence 1816 

CCGCCCGGGCAGGTACAGTAGAATCCAGTCGTTGGTTTGGAGGCGAGGGCTCTCTGCGGT 

CCTCTCCTACTGTCAGGAGCTTCCCGTGCTCA 1 1 1 I [ CATTTACTGATGCACTAAAGGAG 

AGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGC 

ATAGACACTTAGCAAATCCAATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCC 

CTATGGAGAGCAMCTGGTTCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTT 

CTGGGCTACTAMTATCACACCGTCTGGTGGGACATTTCACCCAAGTTGTTCATTTATTA 

AGTTTTCTGGCTTCGTGGAGAATTATCTGCCTTCTCTGAGTATTGATTTCATTGTGCGCA 

TGGTGAAGATATCGNCTGTTCTACAAGGGAGTAAGAATGAGTCCCCGCGTACCTCGGCCG 

CTCTAGAACTAAGTGGATC 

Sequence 1817 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCTGGTAATTTACAGAACGTTCGTTATCAGAT 

TTGATAATTATATATTAAGGTTGTAAAGAATGTGTATGCCAGAATTATAGGTAGTAGATC 

CTAGATTCTTAGGAAAAATAGTTTCTTTATAATCTTT^ 

ACAATGGTTATGAACTGGGTCAAGGCAAAAGGGCCACTATATGTCTTCAGNCATCTTTCT 

ATGCCTGAAATCCANGAAACAGTGAAAATGGATGTTCCCTGGAACAGCCATTGCGATGCC 

ATATGTTGGTCATTGGTGTCCTTAAAGTGTATCTCAAATAATTGNGTGTCTCCCTTACTT 

GTGAACTGAGGGGTCACAATATTGATGTGCCTGATGTGTGCCCCTCTGCATTTCGTGGGT 

AGTGTCTCAAATTTGGCTATCATATTGATGAGAAGTCCTAATTTGGTTTAGCTGCCAGAG 

GTTTAGG N GGTTTTC ACTTCC AG N GACTTACCTG ATTGCACMTAGTTCCAACCGTC ACT 

TGGTTGCACTCCTTCAANGCTAGCATC I Nil I CTAGNAAAATTNCCAATACCATTCTTT 

TGGGNTTATCC'rTTTAAAATANTTGCTTGAG 

TATATTTGGTGGGATGCCTNAATNCAATGGGGCCTCT 

Sequence 1818 

TACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCGGGGGG 

ATTCAGAGTG/vGAAGGCATAAAGGAGAATCCCCAGCTGACTTGTGCAGTGGTTAATTGAA 

ATTATTCAGGOAAGAGATGATGGTGTCTTGGACCAGGGGATGAGGAAGGCTACAAAATGT 

GTCTACCTG 1 ATTCTGTGAGGAGAACGTGTTCCCTGGTTTTAGATACTGTGAAGATGGAT 

CAGGAGAGAGTTTATCTAGACTGTTGGGGAAAGGTGTTGCGATTCCTTCAGCTACACAGG 

ATTGAAAGGAGACATTTCTGAAGGGGAAAAAGGAMTGAAAGAAAAGATGTTTCAGATTG 

AGGATATGCTGTGTGGTGAACTTGTTCTTCACTCTGTAGGGTTCACAAATGACTCTTCAC 

TGCCCTCTTGGATGAAATAAACTGGTTCCCATAGAAATGGACCGTCTCTGATTTCACAGT 

CTAGAGCACACAAATAACCCAACTTTCTGGAAGCAAGGGTGATGGCTCGTGAGAGCTCAA 

GGACAAGGAGGAGTCTGTAAAAAGCACACCTGGACTCGGGGAAATCATTGCCTCTCATCC 

TGAATTAAGTAGCGCTTGAAMGACTCTCTTTGAATCAGAAAAATAATGCGAGCCTCCTA 

TCAAACTCCTTACACAGCCCACCTGAGTGATCAAANGCTTGGGTGGAAA 

Sequence 1819 

CCGCGGTGGCGGCCCGAGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATC 
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AAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATCCAATGCCTTCCAGGCA 

TCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCAGATA 

GTGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCACACCGTCTGGTGGGAC 

ATTTCACCCAAGTAGGCAAATGCATGACAGAACTCACTGGATCAATATCTCAAAATGCTG 

GGAAGAAAAAAACGGTAACCAGAAATTCTACACAGAGTGAAACTACAAAACAAAAATCAA 

AGCAAAGCAACAATTTCTGGATATGCATAAAAGATGAAATTATATGTTGCTAGAAGATCC 

ATAGCACAAGAAATGCTGATATCCNTCAGGCTGAGGGCTATTAAACAAGATGGAAATCCA 

(3AAATTATAGAMGGACAAAAAGMCAAAAATGAGAAGCATACAGGTCAATACAAAGACT 

GCATATTCTCCTT 
Sequence 1820 

TTGNCCTCCCTGGCTATMCTACTCTMTCTTCCACCTTTNTTTCTCCTTACCCTNACTG 

CTTTATCTCAAGAAGATCTTGTGTGGTATATAGATGCGAGAAAGTTACCTGCATGATTCC 

GACAGGTGCCCTGCCATCACTCTAATGCTGTTCTCATTGCTNGGCCTTATGGCACATTCT 

CACTCACTCACTACCCCTGCCCCGCGTACCTGCCCG 

Sequence 1821 

CGAGGTACATGTAGATCCAMCACATGTAMCATTTAGTGGAATTTCCAATCAGTATATA 

ATGATTCArrAAmAAAAAAAAATGGTrCCTGGACMGTGATTGATAACATGGAGAATG 

AAAAAGGTTAGATTTATACTTTACAMCCATACACATAAATAMTTTCAGAMGGTTGM 

AATCTAMTATTTAGAATGAMTGATCAAAGAACTAGMGATMTTTAGATGMTATTAC 

TGTAATCTTAGGTTTAGAGCTTTTAAAACATGATAAGGAAGAAACCATAAAGGAAAATAG 

AGGATTTGAATATATGAAMTTTTACACTTC^ 

ATMGAAGTTCMTAACTATTGCCACATAAATCACTCTCTTMGATTGATTTTCATATAT 

AAAAAGCAGTAAAAGCAAAATCCCAATAGGAAATAGCCTMGTGCATAATTACCAAGNNA 

AAANATGTNNNTAAAANNNAAAAAAAAAGTNCCTGCCCGGGCGGCCGGC 

Sequence 1822 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAATAGGAA 

TAATGTAAATATTTTTATATTTAA \ WWW' rAATTACAGAGGTTATTGAGTTTGTGTTT 

TTCAGCTGCAGAGAGAGAAATTTGTGGGGATAAAGTCACAAGAATACCAGTTGGCAGAAA 

TTGAATTACCTTCCTTTGTATAAGTAATATCCAGTCTAGTCTTAAATGCTATAGGATGAG 

ATMGAGAAGGATAGCTCACATTCCAAAATAAAAGCAAAMTATAAAATATTGAGAATTG 

AATTTGACAACATGCGAACAAAACTGAAMTAGAAAMTATAAACCTTTATTAGAGGACA 

TAAMGAAGCCTGGAATAAGTGGAGAGACAGGTCAGTTTTCTAGAAAGA CATGAT ATTGT 

TAAGCTGCTGATTATGTTCAMTTCATACATTTAATTTCAGTC 

TTGTTTGAATCTGAAAGGCTGATACTGTAGGCCATTTAAAAGAATAATATATGGGCTGGG 

CACTATGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAGGGAGGGCCAGATCATGG 

GGTCAAGGAGATTGAGACCCATCCCTGGCCCAACATAGTGGAAACCCCGTCTNTCTTAAG 

AAATCCAAAA/XATTANCTGGGTGCAGTGGCAAGGTGCCTAAAGGTCCCACTTGCTCANGA 

NGCTTGAGGCAAGGAGAATACCTTGAACCTGGGANGCGGANGGTGCAGTGATCCGAGATC 

CACCCTTTGCACTCTTGCCTGGCCGACCNGAACGAGACTCCGNTTCAAAAAAAAAAAAAA 

A 

Sequence 1023 

GCGGTGGCGGCCGAGGTACGCGGGCATGACAACTAATGGGAAAGACATGGCTCAGATGTG 
CAGCCTCAGAGAGCTTCTGAACATTTCTTCTCAGACTAAGCTCTTACACACAGTTGCAGT 

TGAAAGAAAGAATTGCTTGACATGG 
CTCCTATAGGGCGM 

ATTTATCATTTTATGATGCCAGAGATTTACACGAATTTGTGAAAGG I Mil iCTTACCAT 

AGATTCAGAGGCGTGTTTGAGGCAAGGTTTTCATTGTGCAGTGAGGAAATTGGCACATAG 

AAATCAATTGATGTGCAGGACTCTTGGACTGACTATTCTGGTTATGTTTCCTTATAACAC 



Page 254 of 63 5 



WO 01/51628 



PCIYUS01/00798 



Table 1 



ATTAAGAAGAAATTTGAGATAMGTTAGTAATTGTN 
AC AAATGTTG C AGTATAATTTT 
Sequence 1825 

CACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCGA 
TTTATTATATGTTAACTACTTCTCAATAAATCT 

CCCTCAAGAGMTCCCTAAAATAAAMTTTAAAGAGGGGGAAAAAAGGAAGGGAAGCTAC . 

TGAAAGTTGTAAGMGAGACAAAMGTTACCAATMTGAAGCCAGTGTTTCAAGGAGTGT 

GCAGTTAAAATAACATGACACACAGATATCAAAACTGGCAACTGAGAGAAGTCCAAGGTC 

CTTGGCAAAGAGAAAGCCACTGACCATCCGTGGAAGTGTAGTAGGGTGACAGCCAGAATG 

TCAGAGGCTGAAGGGAAGCTATGTGAAACAGAGAGGATGGCAGTGAGCTTAGACCACTTG 

TTTAAGAAGCTCAGTAAATGAAAATGAG 

Sequence 1826 

AGGTNCACTTATAGTTGANAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGANCTAC 

TGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCANAAGCAG 

AGATTAACTGTCCTTACTCATTCACACGTTAATGGAAGANAAGGAAGTTTCCTAGAAAAA 

TCCTCCCGCTCCACCCTGCAMCTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAN 

ACCTAAACTATTTTGMTTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACANAATTGA 

NAATGTGATCAATTTCANAGTANCATGTTACAAATTTTGTCCCAATTTCAATGGGGAGAA 

TTNTAACAAATCATTAGTTGGTTGGCAAAANCTGGTTCTTAC 

Sequence 1827 

GGAGCTCCCCGCGGTGGCGGCCGAGGTCGCGGGGGGCTTANACCTGACGCTGGGAGGAGA 

TGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATT 

TATAAACATCCTGCAGGAAAATGAGTNTATATGTCANAATACACATTTCCCACCTTGCCC 

AACAGTGGAAAANCATAAGMGAGAAAMCNTTAAAAAATGACANGGAAGTTAATGGAAG 

TCAGCANTGTGATGGTGTTTGGAGGTGGAGCCTTCANAAGGTAATTAATGCCCTTGTAAN 

AANAGGCCANANAGCTTGCNCACCTTNTTCCTGCCATGT 

Sequence 1828 

CCGGGCAGGTACGCGGGGAGACCTGACGCGGGGGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GCANTTGAAGTNGAANAGACANGGNATNTTTCCMGAAAGGTTTATGCCANAAACATTNA 

TAGTAGAAAG TATTN NAN N AAGGG AGGTN CTATATTGGTC AAG N AATAACC AACATTTTT 

TCCCCA 

Sequence 1829 

CTATAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCCGAGGTACTTACAACTTTTCATAT 
AGTATTGGTAGTGTTTTGMTAATTTAAAAAATCATCTT^ 

TTATATAGTGCTTGTATTAATGATTTTGTGAATG: II If I AAAGATGGATACCTTGAT7T 
GGCTTGGCCTNCCAAAGTGCTGANATTACAGGCGTGAACCACCGCGCC 
Sequence 1830 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCGTTTGTAGATGGAAGGAAGAACTTG 

TGTGCTtAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTAT 

ACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGTAAGCCCT 

TGGTTAAATTTATTGGCTTCATTTATGTATCTC 

AGGAATAGAAACTTACTTTTGGAAATACAGTACCT 

Sequence 1831 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGTGACTAAATTTGTAGGAATGATAT 

TTTCATGGGATTACTCAATCTCACCCACCATTAGTTGCAGGTGACAAGAAAGCTAAGTTG 

GCAGATGTTTGTGCTAGAAGCTGTGGGTTTACGTCTCCTTTGTGCATGTGTTCCAGACAT 

ACCAGTGGCTTGGTATTTAAACATCATGCTCAGGTGTGCAGGGTAGTTTTTGAGTTATAA 

TAGGTATGCAGGCGCTGTGGGATTACTTGGTTGTTTATGTAAAAATTATTTTGCACTCAC 
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TTCTGAAATGAGTGTTAGTMGMTCATCTTTANAGGAGGTTCCMGGCATTGAACTGAG 
ATACCTGCACTGTTTGCTGTAAATTTMGCTTAAAATTGAMCCAGGTTATCAGCATTTC 
ATGCCAGGAGAGAGTGGGCATGMTGATTTNAGGAAATGAANAGCTAGATTTCACCTTGA 
ATTTGCTTCCACCCTTCTGTGGCAAATTAGTGTGGGCTCACTGAGCACTTTATCTGCCCG 

TGGTAATTTA 
Sequence 1832 

CCGCGGTGGCGGCCCGCCCGGGCAGGCACCTCCTCCCTATGCATCTGCTGTGGGAAGTGT 

GGGTAACTCACAGATGATGCCACAGGGCATGTATTCAGGCACCACAGGCAGCAGTGAATG 

MTGAAGTGAMTGGTATTTATCCATTTCCTGGAAAGGTCCAGGGTTTGGCTCTGCAGGG 

CCAAGAGAACAGCTTTAGCTGTGCGTTAACCCAGTCCTGGAGAAGCCAGCAGGCCGTAAT 

CACGGGGAGGAAACCCATCTTTTAAGGGCTCCTCGCTCAGGTGGTGACAAGGTGAGGTGG 

TCATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGTG CTCTGA CATTGATGCTAA 

TATTCCATATTATCAGGGCTTCTGTGGTGTCTAGGCCTCTMTTTTTCTCTTCAGTAAGG 

GGAATAACTTTGCTGTGTCAGCGCAGTGCCTCAGGAGGGTTGCAATCTANAAACAGTGAT 

TGTTAAAATTGCCTATAAACACACAATATAAACAATATGCCACAGACACAGAGGCCCCCA 

GCATAGTTTCTGGAANGTGAAGTCTCACCTCTGCCANAGTANAATCACAAGNGGGAATTG 

AGGCTTGGCATCTGGGTGCCTCTTGAAACTCTTNCTGNGGCTCCTGGTAGACAATTGGCC 

CCC N AAAC NC AGAAAGATTGAGATTGC CCATTGCGTTCAAN AAATTGG N GGN N AATTAAT 

GTMTTCCNGAAAAAAAMNACACACANGCTTGCTTATNmAAACCNCAAAAAAAAAAA 

AAAAAAA 
Sequence 1833 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGATTACAGGAATGAGCCACTGTGCCTGGC 

CTGATTGCGGATTTTATTGGAATAACCTATAAATATTGAACAATGCACCTGCGAAAGAGT 

ATGTCTTMfCTTCTCTTGGAATACTGTCMCAAAGATCT CTGAG TTTGMGGGTAGAAG 

TMCTCACTACTTGGCTGCTGACTGTGAMTTTAGTCATGTTTTTATTGTGGGAGTGGAC 

TGATTAMCAGACMGTGTTATMGGTTCAAATTACAACATTTCTCCTAATATGCAGTGC 

TTTTAACTATGATAAACACTGCACCCCCCTTCACAAAAAAAAAGAACCCACCCACCAATT 

TTTAACTTTCTTCTACAAACACCTCTTATCTCTTCTCTATCAMGCATGACTGGCCTTTT 

TGAAATATTCAGGACAAAAAGTTAGGAAGAGGGCAGGCTCGCTAGTTCMTGTCACTACA 

TATAGGGGAMTGGAACACTTCCATANAAAGCTCTTGATTTCTCTCAGCGTGAGTCTNTG 

TCTGANGCATTCCCCTNTNTNTNTGCTACCAGACACCATCACAGCATGTCTTTNCAATCT 

GAGTCCTCAMNGGCATTCTGAAAGTGTTCAGGTTATTAATGNCATACCCAAAAGCCTTA 

TTAAMCTGCACAGTTATTCTGCTTMGGTTTACTGGCCATTAAAAATTC 

Sequence 1834 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTAGA 

TGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTT 

ACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAG 

GAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGA 

GAAGATGAGAGGAGTCTATATTGTCAAGAATACACATTTCCCACCTTGCCCAACAGTNGA 

ANAAAAAAAMCTANATTAAAAAAAAMNGG T ;\CCTNGGCCGNTCTAAAACTAGTG 

Sequence 1835 

CCGCGGTGGCGGCCGAGGTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAAT 

MGGTGTGGACTCACCAAAGACAGTTCCA/\AGTCAATTTCACTCTGACACACTCTCTGTG 

ATCTTCCACAGTCAGCACMTGCCTGCCCCCTGTGGGTGTTGTATAAATATTTGTTGAAT 

GAATGAATCAATCATTCAACAGACCAAGGCCAAATCAGAACCCCAAACCCTAAGGTCTTT 

ATACTCTCACTGTCCATCCATCGATCTTCCTGTCAGAAATCAGAATATACCTTTGCAATA 

CCCTTTGCTAGCCTTTCAGTTATCTTTTGAATAGAGGCTCTGAGCCTTGAAAATATTGCC 

TGGGAAATATTAACACCCATTTGAGTATCTCCCAAACACCTCAATTAACTATATGGTGCT 

GTCTAGCCAGGACCTTATTTCAGTATAATGTGAACCTGA 
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Sequence 1836 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACTCAATCT 

TACTCCCTTGTAGAACAGGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAG 

AAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGT 

CCCACCAGACGGTGTGATATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTA 

TCTGAAGATATGAMCCAGTTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGA 

TGCCTGGAAGGCATTGGATTTGCTAAGTGTCTATGCCATATGATTCTGCTGTTTGCGTTT 

GATTTCGAATGCTGAGCTGACTCAAAGTCAAACTGTAAGTACCTGCCCG 

Sequence 1837 

CCGCGGTGGCGGCCGAGGTACAAGCTAACTGTGCTAGGCAGGGCAGCCCTGTGAGTTCTA 

CTGCTGTCTTGGTTTTACAGAGGGGGAAGTGAGGCACAGAGAAGTTAATTAACCTCTGAA 

GTGTTGCAGTCTAAGGCACAGAGGCACAGTTCCAGGCAAGGTTCATCTGAATCTTAAGTC 

CTCACTCTTTGCCACCATCCTCCACTGCTGANACCATCCCTGTGAAGTCCTGCCGCTCTC 

CTCCCCTGGTCCATATTCACTGCTACTCAATGAGGCCAAGGAAGCCAATGGTCGTGTCCC 

CAAGAGGATATCTCTCCCCTCCTGAGAATCTTTCTTCATACATCTCAATTCTGAGATACA 

GATTGAGAAGCACCTNAGCAAATCCACTGCATGGAAGGCAAAACAACCTTGATCCACAGC 

AATTGACCACAAGCAAGGGAAGCCTGGC 

Sequence 1838 

CATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAATTG 

TGCTGATTTTTCTCTCATTTAAAACGGTTCCNCAAGTAGTTAGTCCAGGGCTGGTGTCAT 

GGCTCCATGGTCACTGGGGCATTGTGTTCCTCCACGATTTCTGCTTCACCAGGCTTAGCC 

TTTGCTTCCATCCTCTAGTTTGCCTCATGGCTCAAGATGGCTGCTAGAGCTCCAGCTGTC 

TCTTATGCATTCCTGGCTGTAGAATGGAGGAAGGAGGAAAGGGTAACGGGACGCTTTACT 

GGCTATTCTTGTTTAAAGGAGCATCCTTGATCATACTTAATGAGMTCTGGAGTAATTTT 

CAGGAGATCTCAGAGAGGGTACCT 

Sequence 1839 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGT 

GTCCTTCTTCAACTGGACACCTCTAGATAAGAAGGCMGAMTTCTTAAMCTTTTGCTA 

ACCTAATTGTGTATGTAATTAGAATAACCAAGTTCAGTTAATTGAAACAATGAGTATTGG 

AGTAAGAGTTTGGGATTTGATTCCTTAGGGATAAACTACTAAATCCAGGACAGTCATTTA 

ACTGCTGCAGATTTGAGTTCCCAGGGGAACCAAACAGAAGTCACACAATTTCTAGGATGA 

AAACATCTCAAAAAAAATCAGTCCTCCCAGCACTTCGGGAGGCTGAGGCANGGAGAGCAC 

TTGAAGTCAGGAGTTGAAGACCAGACCAGCCAACATGGTGAAATCCCCGTCTCTAT.^AAA 

AATACCGAAATTTAGTTGGGCCGTGGTGACCCCGTGAC 

Sequence 1840 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTTATCAGTATCTG 

TCCTAAATACTGTGCTCTGACATTGATGCTAATATTCCATATTATCAGGGCTTCTGT3GT 

GTTTAGGCCTCTAATTTTTCTCTTCAGTAAGGGGAATAACTTTGCTGTGTCAGCGCAGTG 

CCTCAGGAGGGTTGCAATCTAGAAACAGTGATTGTTAAAATTGCCTATAAACACACAATA 

TAAACCAATATGCCACAGACACAGAGGCCCCCAGCATAGTTTCTGGAAGGTGMGTCTCA 

CCTCTGCCAGAGTAGAGTCACAGTGGGAATTGAGGCTTGGCATCTGGGTGCCTCTGAAGC 

TCTTCCTGTGGCTCCTGTTAGACAATTGACCCCGAAACACAGAAGATAGAGATTGCACAT 

TG CG TTC AGTAATTGTTGTTTATTTATGTTATTTCTG AAAATAAAAG AC AGC AC AGG 

Sequence 1841 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGGTTTTTAAATCTGT 

CACTGTTCTTCCCCCACTTCAMGCTGACTTGACAACTGCATTTATTTCATATTGCAATT 

GCTGCAGGTTAATMTATTGTTTTATCAGGAAACATTTATCTTGAGGAAGAAAGTTATCC 

TAATACAGCAGATAACTTATCCAAGGACAAATAGAAAGGGGACTGTGTCTAGAGGGTAAT 

GCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACACTATTGGAGGGAAGA 
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ATAAGTCATCGTTTCTATGAAATATTATTTTCCTC 

GCTTAAAACATTGCATTAAAGATGAGATGACTTTAAAACGGTGACMCTATCT CMGC CC 

TGTGATTATAMTGGTGTTATAGGCTTGTAATCCTATGGGTTTAGACTAAAGTATTTTCT 

TCATAAATTGAAAGAGAAACACACTAGGAATTCAGAAGATTTTAATTAMGTTGTTCAAA 

AAATTrAATGGGMTTTTAAAGGCTGGCTCACTAAGTAGTmGATMTGMTTTMm 

ATAGAAAGAMGTTCTTTGGGCAAMTTTTGGTAATGCC 

Sequence 1842 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGANCTTAA 

AGTATAATMTAAAAMTAAAGTATTACAGAACATGGTCMTTATATTAAAMGTAGATT 

AATTAATGTCC ATATATAC ATTTTTTG GTG G AAATC AAAC ATAATG AAAACTATTTC AC A 

TMTTAATATmGmATTTTTTACTTATTAATTTATTGCAGAGATGGGAGTCTTGCTA 

TGTTGCCCAGGCTAGTCTCAAACTCCTGGCCTCAAGTAATCCTCCCACCTCAATCCCCCA 

AAATGCTGGGATTTCAGGAACATGCCACTGCACTTTATTAGTGTTACCTGTCATAAAGTT 

AATTACTCATAATTTTATTTCAGTATATATATATACACACAC ACACC ACACACACCATAG 

TAGAATAMTTMTTATATTTTCTACAAAAATTATTTTCAA^ 

Sequence 1843 

ATGGAGTCAGCTGCGATAGCGGCCTGACGTCCGCCGTGGTCGATATCGCGCGCGCGCAGG 
AGCAATTTACGGTTGCCCTGAATGGCCGCTTCAAGGGCGGCCATATCACGCGCCATTACG 
GCCTGCCGGACAAGCGCGTGCATGCGATCCANCTGGAGATGTGCCAGTGCGTGTACATGG 
ATGAAGCGGCGCCGTTCGGCTACCGGCCTGACCTGGCCGCGCAAGTGCAGCCGCTGCTGC 

NCCAGATGATGGAC 

Sequence 1844 

CTAAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACl nTTTTTTTTTTTTTTl 

I l M M M I C H I t I I M I I U \\ GGGCAACTTTCTGGTATTACTTGTAAACNCTGGTTC 

CTTCAACmCTGATATTACTTGTAMCACTGGTTCCTTNTCAACCACCGNATTCTGATT 

GGGTCTATAAGTAGCACCCAGTCCACACCACANCACNCTTCTGGAGGAGGAATGCCAATG 

GGGACCCTGTCTGCAATGCCTGNGGGCTNTACTACAAGCTTCACAATNTTAACAGACCCC 

TGACTTTNAAAAAGGAAGGCATNCANACCANAAACCGAAAMTGTNTAGCAMTCCAAAA 

AGTGCAAAAAAGTGCATGACTCACTTGGAGGACT 

Sequence 1845 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 
TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 
TTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGTMGCCCTTGGTTAAATTTA 
TTGGCTTCATTTATGTATCTCACMCACTTCTTTTCTGTTTTAGTTCTAGGMTAGAAAC 

TTACTTTTGGAAATACAGTACCTGCCCG 

Sequence 1846 . _ _ ^' 

CCGGGCAGGTACTTGCCAAGATCTCACTCCTCAGCAGCTTTTTATTTCTAMTTTAAGCT 

TTTGGTTACTGAAAGTTTACAGCAGTTCTAGACTCCTATTCTTTAGCATTCCGTTACGTC 

CAGACCATGTGACAACCTCCATTCTACAAATATGTAACCACAAAGAAGGTGAGTGGGGAG 

GCAATAAGAACCCAGACTAACAGAACATCAGAAAAGCTGGCATACGCACACATA/VAGAAG 

GGGGAGMTATAGGACTATATGGGAGGGAGGMGGTCACAGAAMTGATTACGGTTTTGA 

TGTTGTATTTGTGAAGGTGAGGMTACTAACTATATTTCTTTAGTCTTTGAAGATCACCA 

TATTTAGCCATTATTCTATCAACTGTTAAGCCTTGG 

Sequence 1847 

CATTTTTGTTANCANNGAAGTGAGNGGCATCTATGTATTTTTTAAGGTATATAATGAAAT 

TGTGCCTAGGGGAGTNATAATTTACTCTATGTATTTNTATATNTACTTAAATCAATACAT 

TCTTACAGACTANCTTCTTTAGTGGTAACATACANGCCGATTTTCTCACTCCATGAATGA 

TTACNGACATGTATCCANNANCGGAGGTCCCTCATCCACCATTTCACCCAGGTGTCTTGC 

TCTTATCTTTCANAAGGGAAMTTGGCTAGCNGGTTTCTCTCCACCATGTGCTGTTCTCC 
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AGGGACTTTGGGTGAATCCAGGTGTGGGAAAGAAGGTAGCATCAGCTGTAAGATTCAAT 
Sequence 1848 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATCTGTAATTTTGA 

CATATATAATATTTTAGCAGTGGGTATTTATTTCTAGATAAATTAAAATAATAATAATGA 

TACAGCGCAAAACGATAAAAGCTGAAAGGAAAATATGGTGAGGTATAAATGTAGAAAGGA 

CAGAGATTTCTCCTCCAAATACTGTTTCTCTTGGCCCATAATTTAATCATTAAAATATCA 

AAGATAAAATTGTAGGGCAMTAAAATATCCCACTTGGACAAAAATCAGTAGTATTTCTA 

TACACCAAAAACGTTCAAGATCAGCATCAAATCAAGAACATGATGTCATTCCAAATAGCC 

ACAAAAAGTTGAAATACCTAGGAATATGGATAACCATGGAGGTGTGAAAGATCCGCTACA 

AAGAGTGCTACAAAATACTGCTGAAAGGAAATTC 

Sequence 1849 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGCCAGGCACAAACTGGAGTCACAGATGC 

CACACTGACTATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATT 

TTCTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCA 

GTTGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACA 

CGGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTAC 

AGAGAGTTACGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAA 

ATGACTGATAN ATCCTTTTACAATATTCTAGG GGTAG TCTGTCT 

Sequence 1850 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTTCCTGCGT 

TTGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCAC 

CTAGGTTACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACAT 

CCTGCAGGAAAATGTCAGAGATGGGMGAAACMGAACTTTGACATGCTTGGTGTTCTTG 

CCCAAGCTTTGAAGMGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTGCC 

CAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAA 

GTCAGCAATGTGATGGTGTTTGGGAGGGTGGGAGCCTTCAGNAAGGTAATTAAATGCCCT 

TGTAAG 

Sequence 1851 

CCGCGGTGGCGGCCGAGGTACAGAGAGTTCCCATAGACTCCTGCCCCCTCCCACAGACAG 

ACTACCCTAGAATATTGTAAMGGATCTATCAGTCATTTTCCTCACCTCTCTTATAGATG 

TCTCTCTTCAAC AGTGAGA I 1 1 1 I ICCGTAACTCTCTGTAGCAGGACTGGGCTTGTTTGC 

ACTCAAATCCATGACTCATTGTTTCTCTGCCTTTCCGTGTGTTACAGGTGGGCTGATCCC 

CCTGCAGCCAGTTTCCCATAAGCAACTGACTTCCAACTGGGAATGTCTCGGGGGATAATG 

GGGGTGGGGATATGNGAAGTATAGAGAAAACATAAGAAAATA 

Sequence 1852 

TTCCTAGTCACATCTGTTGTTGCAGCATGTTATCAGTAGAGCCTCCTTTTACTCTTAAAA 
ATGCCCTGGTTTGGATGATCGATCATACCTGCCCCCATCCCCACCCCCCCGCGTACCT 
Sequence 1853 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAT 

TGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCC 

TMTATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCATTCACACGTTA ATGGA AGA 

GMGGAAGTTTCCTAAAAAAAATCCTCCCGCTCCACCCTGCAAACTTTATGTTTTTCTGT 

TACATAATCAGGTAGGGGCAAGACCTAAACTATTTTGAATTGGTGGTGTTGAGGCTAAAT 

TCTCTGCTATTGACAGAATTGAGAATGTGATCMTTTCAGAGTAGCATGTTACAAATTTT 

GTCCCAATTTCAATGGGGAGAATTATAACAAATCAGTAGGTGGNTGGCAGAATCTGTTCT 

TACTTTCCTCCCATCCAGT 

Sequence 1854 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACCTCCTCCCTATGCATCTGCTGTGGGAAGTGT 
GGGTAACTCACAGATGATGCCACAGGGCATGTATTCAGGCACCACAGGCAGCAGTGAATG 
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AATGAAGTGAMTGGTATTTATCCATTTCCTGGAMGGTCC AGGGTTTGG CTCTG C AG GG 

CCAAGAGAACAGCTTTAGTTGTGCCTTAACCCAGTCCTGGAGAAGCCAGCAGGCCGTAAT 

CACGGGGAGGAAACCCATCTTTTMGGGCTCCTCGCTCAGGTGGTGACAAGGTGAGGTGG 

TCATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGTG CTCTG ACATTGATGCTAA 

TATTCCATATTATCAGGGCTTCTGTGGTGTTTAGGCCTCTMTTTTTCTCTTCAGTAAGG 

GGAAT 
Sequence 1855 

CAGGTACATGTCAAAGGAAAAACACGTGAAAGATGAATTCAGCCAMCCCACCAGTGTTC 

AACCTCAGCCTAATCAATCTCATACTCCTAGAGGCTTAAGTATCAGCAGGTAAGATCGTG 

ATGACCTGTCTNTGAGGCTCCAGACAATAATTTCTAACTGCCAACTGGAAATCCTTATAT 

GGTTAGGCTGCCAACATCCCANGGAACAGGACCAAAATAAAAAGCATCACTCATTATCCT 

ACTGCMTTTTCCTCTTCCCTTTGTCAMTGGGMTGATCTTTACNATCATGATCCTTNA 

TTGCANCCAGNACAGAAATCATGAATGTCATCTATGGCCCCTTNCTATCACTCCGGGCTC 

CAATTAGTTGTCCTAT 
Sequence 1856 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGA 
GCTCCCATTTCCCCACATCCTGGATGATACATGTTATTTAGACATTCTAATTTCTTTTCC 
AATCCAGTMGAACGACATGGCAACTCATTATATTCATTTGCATCCACTCTATTGCTATT 
GAGATTMGGGCTTTTACTGTTTTTCTTATATTT^ 

GAATTGCCTTTCTTATGGTCCTTTCATGGTTAAMCAGCCCTTTAGMGACTCGGATT^ 

MGGTTGGATTAGGGGTTGGGATTTGAGAGTTTTACACCATGGCCATGCCTCGTTTCCCT 

AAGTTATCATTAAAATGCTCAGCCTCCACAGCCCATATCCATGCATATCACCCAGGTTGT 

CCCAGAACATTCAAGAACTTACTGCTCTTGCTTTGAAT 

Sequence 1857 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGT 

GACTAMTTTGTAGGMTGATATTTTCATGGGATTACTCAATCTCACCCACCATTAGTTG 

CAGGTGACMGAMGCTMGTTGGCAGATGTTTGTGCTAGAAGCTGTGGGTTTACGTCTC 

CTTTGTGC ATGTGTTC CAGACATACC AGTGG CTTGGTATTTAAACATC ATGCTC AG GTGT 

GCAGGGTAGTTTTTGAGTTATAATANGTATGCAGGCGCTGTGGGATTACTTGGTTGTTTA 

TGTAAAMTTATTTTGCACTCACTTCTGAAATGAGTGTTAGTAGMTCATCTTTAGAGGA 

GGTTCCMGGCATTGMCTGAGATACCTGCACTGTTTGCTGTAAATTTMGCTTAAAATT 

GAAACCAGGTTATCAGCATTTCATGCCAGGAGAGAGTGGGCATGMTGATTTCAGGAAAT 

GAAGAG 
Sequence 1856 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTAAGTTTAAAATCAAAAGCAAGANAT 

ATGATACTTGCAGCTCANAAAGCAGCAGGACTTCGGTGGGTTGGAACGCAACTCTTCTTG 

CTGTTGTCCCAGGTGATTTTTCATCAACCAAGTCTTAGATCAATCAATTACACAAATATG 

ACACTATTACATCTTTCTTCTGGAGTTATCTTTCTGTTGGTCTAATACTCAGATGCTTCT 

GAGCCACGCAAAGTAAAAGCCCCAGGTNGAATATCCATGAGATGGGACTGATTGTGCTTC 

CATGGTTTACCTATCTCAGTGGCCTGCCCTCCTGGCCCTGTGAMTCAGTTTGGAAACTT 

GCCCTTCTTCTTCCTCCTGGTGGTCTTATTAA 

Sequence 1859 

GACTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGAGTTTTTC 

AGAATCAGGCCTGCTTTGTCCCTAAATGCTTTCTTATTTCCCCAGAGCCCTTATTGGATT 

TAAGCCCCATATCATATCTCACATACCTGAGACAGACATACCACAGTGAACCCTGCATGT 

CCTTGTMGTTCTTTTTCCGTGCCTAGAAGTTGCAGAAGACCAAGGAACCTTGGAAGTGA 

CTTACTCTTTTGCTTCATAAAGTTGCTTTAAMCAGCCTTTATTTTAAT^ 

TATGTCACACATTTTTAMTTTTAAGGTTMTTTAGTCATTAATT^ 

TCTTTG G AAG CT AGTTGTCTTC AATGTCTTTTACTTGG AGTT ACTTG AAGTTTG CTTAAA 
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CTTTATTCACTTGGAGTAGAACTTCC 
Sequence 1860 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGTTCAAATAGTCAGCAGCTCATCATAATC 

AATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATGGTGGGCAAGGAAAGCAAGTT 

ATTGGAGGCACATGTT/^CACATAAAATATAAAATTAATATGATCACCACTGGAAAGGCT 

TGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGATGCACAACAAAAAGCAAGAG 

AACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATCAGTGGGGACCCAAATGCTTA 

TGTGGACTCACCAGGTATCGAATTAGACATGAATAGGGAGTGTTTGTTGTGATGGCAAGA 

AACTATATMTCAAATGAATACMTGAAACTTTAAAAATAATTGTMGGAATCTTT 

CAGC 

Sequence 1861 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTAAAACAI I I I ! ICC 

TTGAGAAATCATTGCCTAAATCTCATACATATGTGCTTCTCTGCAGAAAAAAAAAATGAA 

ATAAMTAAAAATCTTTATTTCCAATGAGCTAATGAGGAAAAAGGGGATGATGAAAAAAC 

AGGGAGGTGGGAAGCAATTTGGAAATGGAAAGAATGCTGAATACCTACATGATGCCTAAT 

TTAGMGAACATTCTGGTATGTAGGACATTATTTTCAACTCTATCACTTACCGGCAGATT 

TTTTTTTATATGCAGATGTATCTCTACTCTCTAAAGATGTACATGTGTACACTTATAGTT 

GAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGGAG 

GGGTCTTCTACTCAGCCTCTACCCCT 

Sequence 1862 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACT 

GCACACCACAGGGTGGCTGCACTTTTTGGGTGGTGGGGTGGGGATGGGGGCAGGTATGAT 

CAATCATCCAAACCANGGCATTTTTAAGAGTAAAAGGAGGCTCTACTGATAACATGCTGC 

AACAACAGATTTNANNTAGGAAACTGGCCCGCTCTANNAACTANT 

Sequence 1863 

CCGCGGTGGCGGCCGAGGTACGCGGGGTGAAGGGCTCTCTGTATACTGCTGAAGATTCTA 

TAGTTTCTCCTGAAATCCATGAGGGAGCCCCCTGAAGGATTTTGGCCAGNAAAGTGTCAT 

GAATCAAATTTCTATTTTAAGAATAGTATCGAGGCAGCAATGTCAMGTTTGGTTTANAA 

AATANGGATGGTTCCTGTAATCCCAGAACTTTTGGAGACCAAGGCGGGAGGATTGCTTGA 

GTCCAGGAATTTGAGTCCAGCCTGGGCAACATGGTGAGACCTCATCTCTGCCAAAAAAAA 

ANNNNAAAANAAAGGTCCCTGCCCG 

Sequence 1864 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCAT 

GGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCT 

CTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAA 

GACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGAT 

TCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCT 

GGACAAGATTTTACTCCCATAACCCAGGCAGAAGTACCT 

Sequence 1865 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGG 

AAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAA 

AATGCAGTGAAGTAGAAGAGACAGGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGA 

AGATGAGAGGTCAGAGATGGGMGAAACAAGAACTTTGACATGCTTGGTGTTCTTGCCCA 

AGCTTTGMGAAGTTTACAMGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAAC 

AGTAGAAAAAAAAAAAAAN N AANAAAAAAAAAAGGTTCCTGCCCCG 

Sequence 1866 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTATTTAAAGGAGGAAAGAGCGATG 
AATCTACATGGAAAATTAGGTGAAGGAGTGGAAAATTGCATCGGCCATATAAGTCCAACT 
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ACATATGGATCTATTCCTCCATGAAAAACTGGAAATTTCACTTTMC ATGGTA TTTGCAC 

AGTCAGCAGCAATTCTTAACAATGTCTTTCTTGCTGCTGACATCATA I MM iCACCCCA 

ATMTAGAGGATMTTTGGACCCTTTCTGCTTTTCATTTAATTCTGTCTATTTCTGATTG 

CACTTCTCCTACTTCTCTTCCCACATGAGAGATGATTGTATTACTTGTGGAATATGATGA 

ATCACTTCMCMGAAATATGTGGCATCTGGTTAAGTTTCTTTCTAGCATATAAATTAAG 

CTTGMTCACAGCTGGGTCAMCATTTTCTTTGAAACGGG 

Sequence 1867 

ATCGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGT 

TGAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAG 

GGTCTTCTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTT 

ACTCATTCACACGTTMTGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGC TCCAC C 

CTGCAAACTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAAGACCTAMCTATTTTG 

AATTGGTGGTGTTGAGGCTAAATTCTCTGCTATTGACAGMTTGAGAATGTGATCAATTT 

CAGAGTAGCATGTTACAMTTTTGTCCCAATTTCAATGGGGAGAATTATAACAAATCAGT 

AGTGGTTGGCAGMTCTGTTCTTACTTTCCTCCCATCCAGTGACTAGGATTTGGAAAGCA 

GTGTTTT 
Sequence 1868 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTATAGTTGAGAG 

CCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTT 

CTACTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCAT 

TCACACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAA 

ACTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCMGACCTAMCTATTTTGAATTGG 

TGGTGTTGAGGCTAMTTCTCTGCTATTGACAGAATTGAGAATGTGATCAATTTCAGAGT 

AGCATGTTACAMTTTTGTCCCMTTTCAATGGGGAGAATTATAACAAATCANTAGTGGT 

TGGCAGAATCTGTTCTTACTTTCCTCCCTCCAGTGACTAGGGATTTGGAAAGC 

Sequence 1869 

CGTATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGT 

AC M II i l l I I I I I I I I I H i I I GCAGAGATGAGGTCTCACCATGTTGCCCAGGCTGGAC 

TCAMTTCCTGGACTCAAGCAATCCTCCCGCCTTGGTCTCCAAAAGTTCTGGGATTACAG 

GAACCATCCTATTTTCTAAACAMCTTTGACATTGCTGCCTCGATACTATTCTTAAAATA 

NAAATTTGATTCATGACACTTTCCTGGCCAAMTCCTTCAGGGGGCTCCCTCATGGATTT 

CAGGAGAAACTATAGAATCTTCAGCAGTATACAGAGAGCCCTTCACCCCGCGTACCT 

Sequence 1870 

CCGGGCAGGTACACAAAAAAATGCCCCATTATCCCTTTGCTGTGGGAGCAGTGCTGATGC 

TAGTGGCTCGCTCTTTGGTCCTGGCTCTGACTCCACCAGCAGTAACAACCAGCAGTGGTG 

GAGGCTGTGGGTGCMTAGACTTTTATCATGGCCACTCTTCTGGAAGATGTCACTGCCAG 

CAACTCCTGACTCAGCTTTGGCAGCAGCTGCCGCATCGAAGACTGTTGTGAGTGGGGAAA 

GGTAATGGGGCTGCAGGAATGTGAGGATGTAGGGACTTTGGGACCTCCAAGGCAGGATGC 

AGTCTAGACCTCCAAGGCTTGGCTTTCAAAATGGCACC 

Sequence 1871 

CCGGGCAGGTACTTGCCAAGATCTCACTCCTCAGCAGCTTTTTATTTCTAMTTTAAGCT 

TTTGGTTACTGAMGTTTACAGCAGTTCTAGACTCCTATTCTTTAGCATTCCGTTACGTC 

CAGACCATGTGACAACCTCCATTCTACAAATATGTAACCACAAAGAAGGTGAGTGGGGAG 

GCAATAAGAACCCAGACTAACAGAACATCAGAAAAGCTGGCATACGCACACATAA AGAA G 

GGGGAGMTATAGGACTATATGGGAGGGAGGMGGTCACAGAAAATGATTACGGTTTTGA 

TGTTGTATTTGTGMGGTGAGGAATACTAACTATATTTCTTTAGTCTTTG 

Sequence 1872 

AGGTACGCGGGGGTATTTAAAGGAGGAAAGAGCGATGAATCTACATGGAAAATTAGGTGA 
AGGAGTGGAAAATTGCATCGGCCATATAAGTCCAACTACATATGGATCTATTCCTCCATG 
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AAAAACTGGAAATTTCACTTTAACATGGTATTTGCACAGTCAGCAGCMTTCTTAACMT 
GTCTTTCTTGCTGCTGACATCATAI I I I I I CACCCCAATAATAGAGGATAAT7TGGACCC 
TTTCTGCTTTTCATTTAATTCTGTCTATTTC 
CATGAGAGATGATTGTATTACTTGTGGAATATGATGAATCACTTC 
Sequence 1873 

AGGTACGCGGGGTCCAGTGGACGCCAGGGATCTGAAGGGCAAAGGCAAGGGCTGCTGGAG 
CCTGCATCATGTCGAGTCCGCAGAGGAGGAAGGCTATGCCCTGGGCACTGTCACTGCTTC 
TCATGGGCTTCCAGCTCCTGGTGACTTAT 
Sequence 1674 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAMCTCATTTCAGTTMGGGGAAAAAAA 

GAGTAAGTGAGCCAGGATTACAGTCCAACTCCAGTCAAAGTTAGTATAAGAAATAAGAGA 

ATAAATAGATCAACATCCTCACAGACAATGAAAGCACACCGGAAGGACATGACCACAAGG 

C AG ATAAAAACCTAATATTTGCC AAAAAATAAAATG CAAATATG AC AG G AAATTTATATT 

AATTAGTACCT 

Sequence 1875 

CCGGGCAGGTACCAAAAGMTGTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAA 

GCAACCCAGTTGTGCATCAATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAAT 

AAAATACTATAGAGCAATGAAAATAAATAAACTATGGCTATATGCAACAACATGGATAAA 

TCTTACCAACAATGTTAGACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGAATC 

CATATATGCAAGGTTCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAA 

GGACTGAGAGAGTATTGCAGAGGGGGCTCCGGGTGTTTGTAA 

Sequence 1876 

CCGGGCAGGTACTTTCTTGGCCAGACATCATACATCTGCCTCTATAGAATCCCTCATTCA 

GGGGGTCACTGATGAAGTGTAAGGCCCTAGGGTTGAGCTTCAGCTCCACTGTGGGGTAGG 

TTTTTACCTTCTCCAAAGCTTCATTTTCTCA 

CCAAACTCTAAGGGCTGTTGTAAGAGGTTTAAATGGCATAACAGATAGAAATGATTCAAG 
CTATTMGTCCTTTATAAATGTGTGGAATGTTGTTTCTCTGTATAMTCTTAGCTTTTCT 
ATTATCTC AAC AAAGGTG ATTTTCTTGTTMG AAGGTTTAGTAGCCAGG CATTG GAC AAC 
T 

Sequence 1877 

CCGGGCAGGTACTTAGGACTATGGAGAAACAATGGAAGACATTAATTCTGCTTGTCTTGA 

TGGTAAGGGTGGGGATGAAMTACAGATCGAATTTGGGGGAGCATGTTAAGGTTAGGGGA 

ATAGAGATAGCTTATATTTAGMCAAGTTAGTAGAGACTGTATGATTTTCCAAATAGAGA 

AGGGTAAAAAGACATTCCATAGAGGAAGAAAATTACATGCAAAACCACAGAGACCTTAGT 

TTATAAAGAGTAACTGGGTATGGCCGGGTGCGGTGGCTCACGCCTGTAATCCCGGCACTT 

TGAGAGGCTGAGGAGGGTTGATCATGAGGTCAGGAGA 

Sequence 1878 

AGGTACTTCTGCCTGGGTTATGGGAGTAAAATCTTGTCCAGGGACATCCCAGGGAAGGTG 

AACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTGCACACAA 

CTCAGCATGATGAAGTCGTATTTGCAGATCAAGAGAGTCTTGTCTGTGACCAGTAGAATT 

CTCTCCTTCTCATTGTTCCAGTGGTCTATCTTTGTCAAGAGCCAGAAGCCTTGAATGGTC 

TCTCCAGAAGTCTCAGCTACGTGACCTTTCAAGTCTTCCATGGCAGTCTCAATGGCCCC 

Sequence 1879 

AGGTACCCAAGCCAGAAAGTGAGCCCACTCTGACTCTCATGMCTGGGTCCTGATTCCTT 
TAAGGTTATTTTGCCTAAGTATATTTCCACTCTGGCCTCTCCTTTCCATGCCCACTGCAA 
CCACCAAATGTGCCACATCCCACCCAGATAAGTACCTGCCCG 
Sequence 1880 

CCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 
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TGCAGTGAAGTAGAAGAGACAGGGATATCCCANAAGGTTATGCAAAACATCAAGAGAAGA 
TGAGAGGTCAGAGATGGGMGAAACAAGMCTTTGACATGCTTGGTGTTCTTGCCCAAGC 
TTTGAAGAAGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGT 
AGAAAAACATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGC 
Sequence 1881 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCACGTGCAGAAAAGAAAGA 

GTCTCCGCTTGTCTTTGTCTGATTCTCCTGTCCTCTCCATGGMGTTACATTTTCTGTAA 

AGGATGAGCTGAAAATTCTCCTGGTCGTTGCCAGTTGAACTTCTGCTGTGCTC.TGGGAAG 

GCATTCTCACTCTGTTTATGTTGTCTAAGTGCANACATGGATGT GCAG GTTTGCTAGAAC 

CTCCTGAGGATGTGCMTGGTTCTGTTCATGCCTGMTCAGTTCTTTTGGGAGTGGACAT 

TCTTTCTCTCCCTCATGCACAGCCTCAGGCACATGGCTTGAGCTATGGCGGCACGCAGNA 

TGGCCATCACCCAGGTACCTGC 

Sequence 1882 

AGGTACACTTATAGTTGAGAGCCAAGTCTCCCTTATCATTGGTGAATGAGAATGAGCTAC 

TGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCCTAATATrTATATCAGAAGCAG 

AGATTAACTGTCCTTACTCATTCACACG1TAATGGGAAGAGAAGGAAGTTTCCTAGAAAA 

ATCCTCCCGCTCCACCCTGCAMCTTTATGCTTTTCTGTTACATAATCAGGCAGGGGCAA 

GACCTAMCTATTTTGAATTGGTGGTGTTGAGGCTAMTTCTCTGCTATTGACAGAATTG 

AGMTGTGATCAATTTCAGAGTAGCATGTTACAAATTTTGTCCCAATTTCAATGGGGAGA 

ATTATAACAAATCAGTAGTGGTTGGCAGAATCTGTTCTTACTTTCCTCCCATCCAGTGAC 

TAGGATTTG 

Sequence 1883 

GGCGGCGGCCGAGGTACAGTGGTTTACGCCTGTTAATACTATGCACCTTGGGANNNCGAG 

GATTACAGGCGTGACCCACCGCGCCAGACCACTTGCAAATATTTCACAGAGCCAATGTGT 

MCTTGAGTGGTGCAAGGACATGAAGTTTACCCAAGGCTTAACAGTTGATAGATAATGGC 

TAMTATGGTGATCTTCAAAGACTAAAGAAATATAGTTAGTATTCCTCACCTTCACAAAT 

ACAACATCAAAACCGTAATCATTTTCTGTGACCTTCCTCCCTCCCATATAGTCCTATATT 

CTCCCCCTTCTTTATGTGTGCGTATGCCAGCTTTTCTGATGTTCTGTTAGTCTGGGTTCT 

TATTGCCTCCCCACTCACCTTCTTTGTGGTTACATATTTGTAGAATGGAGGTTGTCACAT 

GGTCTGGGACGTAACGGAA 

Sequence 1884 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGG 

TAGAACAGGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATA 

ATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGAC 

GGTGTGATATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATA 

TGAACCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGC 

ATTGGATTTGCTAAGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGC 

TGAGCTGACTCAAAGTCAAACTGTAAGTACCT 

Sequence 1885 

ANGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACTGTTACTAT 

CCTTAAGATGAGGGAACTGAGGAACCAAGAGGTTMGCMTTTGCCTTTGGTTCACATAG 

CTAATGATGTGGAGATTTGMCTTAGGCTGTTTGTCTCCCMGCCTATGTTCTTAAATTT 

GGGGAAATAGTAAAGATAATTTCCACAATGTGAAGACAGTTAGCAGCCTTAAGGATGAAA 

GGATGGTGCAAATACCATGCCCAGTGAGTGACAGAGTATCAAGGCTGGTAGAGCCTGATG 

AMGCACAAGTTTTCAGAAMGAGGGAAACAAACMTTCCTATAAAGTTAAGAAAATCAC 

ATTGACACCAGACTTCTCATTGGTAGAGACCACAMGCCCTTTTCAAAAGTGGGTCAAAC 

TGGCAGGCTGAGCACATATCCTTCCCTCCCTTTCTG 

Sequence 1886 

CTCCTATAGGGCGTTTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAAGA 
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ATGTTC ATAGC AG CATTATTTGTCATAGTCAAGAG CTGGAAGCAACCCAGTTGTGCATCA 

ATGGTAGAATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATACTATAGAGCAATG 

AAAATAAATAAACTATGGCTATATGCAACAACATGGATAAATCTTACCAACAATGTTAGA 

CAMTGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCACA 

ATAGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTATTGCA 

GAGGGGGCTCCGGGTGTTTGTAATTTCTATTTCCCAGTGCAGGTGTGGTTTGCATGAGCA 

TGTTCACTTTGTGGAAATTCATTGAGCTGTAACTAAATATACAGAATTATTTN 

Sequence 1887 

CCGCGGTGGCGGCCCGAGGTAC M I I I I I I I I I I I I ! I I I I I I GTTTGTTTG I I I I I I I I 

TTGAGACAGAGTCTCGTTCTGTCACCCAGACTGGAGTGCAGTGGTGCGATCTCGGCTCAC 

TGCAACCTCTGGGCTGGGCAACGGTGTGAGACTTGGACTCAAAACACACACACACACACA 

CACACGAAGCGGGAAAAATTCTGGTGCTTCGCAAAGTCATCACAATACAAATCAAAATAA 

ATTTGTCTTAAGCCACTTAGACAATACTAACCAACTTTCACCAGGTGTAATCCATGCACA 

ACCAAAACCTTTTACTGTGAGAGCGAAGCTCTTATGGAGTTCTACCCTCATCAGNGATCT 

CCATTATAAG GATATAAAAGACTAGTAGATTCTATTCATGAAAC CATTTTTCTTGGCTTG 

GTATTAATTTTGGTAAGAGATGAGTTACCAAGTTAGAAATTAGTGTGGTCCCAMTTAAA 

0 

Sequence 1888 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTNI I I I I I I I I M M I I I il I I I I I I I AT 

TTTTNGAGACAGCTTTCTATTCTGTCACCCAGGCTGGAGGCCATTANAAMTAATTTGTG 

CTGTTCTTATAAATATGAAGMCTTTGGAMTATCTNTGTGACATCGGATGCAGTTTGTG 

TTTGGGAAGTATCACTGATAAGAAATAATACTAAAAATTAGCAACCCCATTGAAAAGTGG 

GCAAAGGACATGAACAGACACTTTTCAAAAGAAAAAATTCATGCANCCAACAAATATATG 

ACAAACAGCTCAACATCACTGATCATTAAAGAAATGCNAATCAAAANCCNCATTGNGATC 

CATCTCACTCCAGTCAGAACCACCATT 

Sequence 1889 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTGTATTTCCAAAAG 

TMGTTTCTATTCCTAGAACTAAAACAGAAAAGAAGTGTTGTGAGATACATAAATGAAGC 

CAATAAATTTAACCAAGGGCTTACCATTTTCCTGCAGGATGTTTATAAATAGTT^ 

AAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 

TCAGGTCTAAGCACACAAGTTCTCCCGCGTACCTGCCCG 

Sequence 1890 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTTTAAATG 

GGAATAAATCTTTTATAAGCCTTTCCATTTACTTTCCTGCTTATTAAAGGTGAATTGT^ 

TGGAAAACCCTAA>\TGTCACAGAGTCTGGAAAATTGAAATAATAAGGGGCAGCATTCTCC 

TTAAGTGGTATAACCTAGTAAAAAGTGGGCCTAATGGAAAGTCCACCGTGGAATCATCTG 

TGmCTATGCMCAAMTTATTTTCCTTCmCTCTMCMGGGMGTAGTTCTGACCA 

ACAAGGTTGGCTGTCTTAANCATTGNGGTGGAGAGCAATGGAAAGATGAGTGTGTAGTGG 

GAG 

Sequence 1891 

CGAGGTACGTAGGGGCCCTTTGCTGCAAGCTTAGGCCAGTGTCAGCTCCTGAGCCACTCC 

TGACTCAGTTGGAGGAGACTCAGTGTCCACTGCCCCCTGCCTGGTGGAGCTTGGCAGCAA 

GTCCCAAGTT(TACAGTGTTGAGACCAAGTGGGCAGCCATCACCATTGAGGGGTGCCCTT 

TTGCTCCCCCTCGAGTCCMGTTATAGTATTGCTTACTTCCTACCTGAAATAGTTTATGG 

GTCATGGGTCCGGCTTACATCAAGCCCCATAGCCAGTCTGGTTGCCCCACCTAAGGTCTT 

GGTGCTCTTTGATAATAGCCACAAGCTTAAATGTATTAAGTCCTGTCCTAGGAAATGCAT 

GATCTTCTGCTCTAGAACTAAGTG 

Sequence 1892 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGNCCGGGCAGGTACAAGGG 
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GAGACAGCATGCATGGTGTGNTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTANCGTTAA 
AAATGTGATGTTGGTATATCATCCTGATAGAAANCACTGCTTCCCAAATNCTAGTCACTG 
GATGGGAGGAAAGTNANNAACAGATTCTTCCGACTCACTACTGANTTGTTTATANTACTT 
C CCCATTTGAAATTTG GGACAATTANTTTG NGG G C ATNGCTACTCTG 
Sequence 1893 

GNCCGGCCAGGTACTGCCACAGGACTTTCAAAAAGGAGGGGGAAAATTAATGAAAGTGAC 

ATGCATCAAACAAATCAAGGGGCAGTGTTGAGGTCATCTCCACGGAGCTGTAAACTCAGA 

AGTGTTTCCTGGTCATATATGGTCAATTANGGTCAAGTCTGAAATAAATNTANAANAATG 

ACCTMTTTTCCAGCTTMCTCAGNAGCTAAMTCCATMCACTATCANACTTTCCTTTT 

AATTTATGAGATGGAGTCTTGCTCTTGTCGCCCAGGTTGGAGTACCTCGGCCGCTCTAGA 

ACTAGTG 

Sequence 1894 

AACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAC 

CACCAGGGCCCTTGCTTCTGAGCAGGAAGCAGCTGGGGMTAGGCTCTTTCTCTTAATGA 

CTTCCAACATAGTTCTCTCAAACCTTACTCCTCCAGAAGGCCACCCTCACCTGGCTATGG 

CTACTTCAGAAAAAACTTGGCCTCTGGTATAATAGAGCAGAATCATCACCTCACATTCTA 

TTTCAAGCCAAAGTCAATATCTCAAAGGCTGGTTCTGTGATTTATTTGGCCCTTGGGAGC 

TCCTACTGAAAGTGCTGAAATGTCGTACCT 

Sequence 1895 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGCCCGCCCGGCNGGGTCCTTTAAAAC 

TNGGGGGCCNTAGGATTCATTTTAACCATTTCTTGCCTCACTCTCACTCTGTCACTATGT 

GGAGTGCTCTAATCCATTAGGGAATATAGCTCCTATATCACTGGCTACCCAAATGGACCT 

GAAAAAGG C AGACAGAGCCC ATCC CCAGAATATCCTTGGTTCTTGCTG GTCAC AAGGACC 

AATAACATCATTMCATTGATGTTTCAGAGTCMTACAGGTATAAGMCCTTAGGTTTAG 

GACTGTGAATAGGGGAGCGGTGAAATGAAAATACGGCCTTGMTTAGTTTCACTGTCAAC 

ATATCTCATATTTATCTAGCTAGGGTATTTTAATTCTAGGGTATGTTATGTCCAGAAGAG 

AGGAATGATTTT 

Sequence 1896 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCTCGAGCGGCCGCCCGGGCAGGTACTC 

TTTATGCACATCAGTGTGGGGCCTTAGTAACCACTGTGCAAGTTGAAACTGGGCAAAGTT 

ATCTTMTAATTAATGGGAAAACATAAACTTCTTCCTGCAATCTTTAAAGACACTTGTCA 

GTCAGGTGCGGTGGCTCACGCCGGTAATTCCAGCACTTTGAGAGGCCAAGGTGGGCGGAT 

CACTTGAGCCCAGGGGTTTGAGAACAGTTTGGGCAGCATGACGGAACCCTGTCTCTACAA 

AGACTACGAAGGGTTGGCCGGGAGTGGTGACATGTGCCTATAGTCCCG 

Sequence 1897 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTCTAGC 

TGATGATTCAAAA^AGAAGCCTTTTAATCTCACTCCACTGATCAGCTATGATACTTAAAT 

GTTTTAGCTGTGA3CAAAATAATATGCATTCTCAAAGAGAGTATCTTCAGACTCCAGTGG 

CCGAGAATCTAGAGTTAGCAATGGAAAAAATTAGTCTCGGGGCNTTCTTGTTTCTGCGCA 

CMGTTTTCAM1TAAGAACAATGTGTTTGCACTTMTGAAACAACCTCTACTGCTCTTC 

AAGAGGACTCAGGATACCGATTCTCGAGGCCCCTGGCGGTCCCCTGTAAGTACCT 

Sequence 1898 

CCGCGGTGGCGGCCGAGGTACAGATAGAGTCTCACTATATTGCCTAGGCTGACAGAGTAT 

{ Mil I GCTTMTTATTCTTTATTCTCAAATGCAGTGAACAGTCACAGTAGCCCACCATA 

GGGGGTTCATTTTCAGACATTACAGTAACTAGGGGMTTTTATTAAGGAACCAGCNGCTT 

MCCCNTANTNGGTANTAATTCTTCTTTTTTAMCCCAACCTCCAGAANTTNATTCAGAA 

TTCATCAGTCCTTCCCNATCAGGTGGCTTGTCTTTTCTNGGTCCCATGGATCCAATGTAN 

GGAATTCCACAATCACAAGTTAAGTNTTTTTAATCACTTCTTANGNNCCTNCTCCCCACT 

TCTGGGGATGGGTTTCCCTCANNTAATCCTCTTTTGGTCTTTCCCAGGGGAACCCTGGAC 
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CACCTTTTTNGAAAAGGAAGGTTANTTNGGGAACCAGGCCI ITTTTTTT 
Sequence 1899 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATTAA 

ATTTAATGAGAGGTAACCCATGCACCGGCAGAAAGTTCAAGGGGACAGAAGGGGCTTGCT 

CCACTCTGGAGCCCTATCTCCTCTCACCTCCTTCCTTCTAGTTTGACATGCGATCTTGTC 

TCTCTCTTTTATAAAGTGCTTCCAGGCCCNTTTTGATTAAGGATGCTGTCCTAAAATCCT 

TCCTCCCGCGTACCTCGGGC 

Sequence 1900 

TGGTGGCGGCCGCCCGGGCAGGTACTACAGGGGCAGGCTACCGTCCCCGGCTAATTATTT 

TAATTTTAAI I I I I AGTAGAGATGAGGTCATGCTGTGTTGCCCAGGCTGGTCTCGAACTT 

CTTGGGCTCAAGAGATCCTCCTGCCTTGGCCTCTCAGATTGCTGGGATTATAGGNGTGAG 

CCACCNACACCTGGCCTATGATCTTAACTTTATTTACCACCTTTTTGACATTTGCCAAAA 

ACTAAGTTTTACAAGATCTCTCACCATATCTCTGGGGATGGGAGGAGGAAGGGAAGGATA 

ATGGTCTCATATTCATAATAGTATTTGCAGTGCCTATTCTCACAGTTGATAAATGACTAT 

MCCTTTTCAAGAGGATGAGTGAAAGAAAGAGGGAGGCAAGAANAATGAAAAAGGAGGTA 

A 

Sequence 1901 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I 1 1 I GTTGTTGTTGT 

TTTAGAGAGACAGGGCCTTGCTCTTTTGCCCATGCTGGTCTCAAACTCCTGAGCTCAAGC 

AGTCCTCTGCCTCAGCATCCCAAGTAGCTGGGACTACAGGCCTMTACTTTTTGTGCCTG 

CTCCTCTCTTCCTGTTTCTACCCCATCATTATCTGCTCAGATCTTCTTAATCAACTACTG 

CAGCAACCTGCTAAAATGTCTCTCAATCTTCTGGGATAAGTCAATCAACTGCTAAATGAA 

TCTTCCCTAAATTTTACTTTTATTGAGTTCTTC 

CTAAGTCCTCAGCATGGTATTTCTTTCTTCTATGTTAATTGACAGTCTGTCCTTAAATAT 

GAACTGTCTATTCCTGCTCTTCCTTCCCTCAAACCCAAACTCAC 

Sequence 1902 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACATTATCTAAGATGA 

TACCTATAAGGCAGATAGTATTATCCCCATTTTACAGATGCAGAAATTGAGTATCAAAGT 

GGTTAACTTGAATAATGTCAAGCAGCTAGTCAGTGGTGGCAGTGGAATGTGAATCCAGGA 

AGTCTGACACAAAAACAAGTGGTCTTAATGCTATAACATGCTTAATGCTAAGAAAAAGTG 

GACAGTATAATGCTATACTGTTCCTGAATAAATATCCAGTAAATAAGGAAGGCTAGAAGA 

ACACAAAGTTTGTAACAACATG1 1 1 1 I CTAGTAGAAGTCTTCTTTGCTGACTTTCTGATT 

CTCAGGTATAGCATCAATCGGTGAGGGTTGCTATATGGTAAACATTATACTGACAGGCCA 

AAGCGATCAACTGTGAAAACTAGGCAAAGCCAAAGYA 

Sequence 1903 

CCGCGGTGGCGGCCGAGGTACAGCTTGAACAAATCCACCTTCTGTGGACCAAGCACCACC 

CTGGGCATTTCTAGCATGAGCAAAATCCAAGGTCCTGGCTGGACTCCAGAGATGCTATTT 

ACCTCAGAAGCATGACAATAGGAGGCAGAAGGAGCAGGCAAATCCAAGTCCTTTCTTGTA 

GTTTCCTTGTTTGGGGAGGAAMGTTGAGTTTTAOTATTATGGAAAAGAAACAGGAAATA 

GAGACAGACAAAGAGATATGACAATACAGTCCTGCCACCCAGATACTCATTTCCACCTAC 

CATTCCATGCATTTGTTTTGAATATATAAGTATGTACCTGCCCG 

Sequence 1904 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGCTTTATTTTTTCATG 

AGACATAATCATGTCTTATTATAAATTTGCCTAAGCAGGGGATAATATCTGACATACATG 

CTACCTGTTATTGTGTAAATTGAAMTATTCTATCTTACCAGTTGTTACAATTAAATATA 

TTATACTGATAATTTA I MM I CAGACATTGACATMTAAAGTTTTATTTGAAATATATC 

TTAATCAATAAATATAAAACAGAAACCACTCTAATAGAGGACAATTGTTCTAGTCAATAC 

TCATATTTGTTCTGACTATTGAGAAAAATTATGTCTGCATTTTAAMGTTTCAM 

AAG CTTC AG ATTTTAG AATAGGG ATTT 
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Sequence 1905 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGACGGCATCTGAACCTTGCCGAGTAGGAATGT 

TCCAGAAGTAAGAGAGATGACTAGGGTTTCAAGAACAAGGACCAGCACCTATGAAGGCTC 

TGAGGCACAGACCACTGTGACCTGTGGAGAAGTGACAGCAAAGTGAAAGGCACGTTTTGA 

TTACTACCGACATTCACAGTCTGGACACGGGTGACAAATCTAAAAGATATTGCCCATTCG 

TCCAACTGCCGCTGTCCTGCCAGTATATCGTCTGAGATTACCATCAGCTAAATATTGCAT 

TGTCTTTGGACAATTTTCCTACATTGTGACTCCTCACTACATTTTCAATGCTTTTCTAGT 

CCCAGACAAGCTACAGACTGTTAATTTACACTTGCTTTTGCCTTGCCTGAGGAAATTGAA 

AATAAATT 

Sequence 1906 

AGGTACTTACMCTTTTCATATAGTATTGGTAGTGTTTTGAATMTTTAAAAAATCAT^ 
TAGGTTTGTTATTTAACTTTATTrATATAGTGCTTGTATTMTGATTTTGTGM 
TTAMGATGGATACCTTGATTTGGCTTGGCCTCCCAAAGTGCTGAGATTACAGGCGTGAA 
CCACCGCGCCCGGCCGACTTCAGGAGATCTTTAGGCATCA1TGGTTTGTGTTCTTCAGTT 
AAAAATAAAAAGCCAAACTCAAGACAAAGAATACAGTTCAGTTCCAGGTTAGGAAGAGGG 
AG AAGCTC AG CTTCTG AAGTAGTGCTATATTTACTGTTTAC CTTG ACTGTTATCTTTAG A 
AGTTGATGATNATTTTAAAAATTGNGTTAACTGCTGTTTGAACTGTTAAAGTTC 
Sequence 1907 

CCGGGCAGGTACTGGTTTTACTGATCTTTGTGCMTTTGCTGCTCAGCAGATATTTTCAA 

TGCTGATTGGTAATATGTCAGTGGATAACCAGTGGAAATGCATGTCTATTAATTACTATT 

GCACATAATAmATCTTGTTTTTATATTTTGCACAGGATmCMGGTGCTO 

TCTGAGCACTGTAAAGTTTTCTTTGAGAAATAAGTGTTTA I ! I I I AAATTGCGTTGTCTC 

ATGGGAGGGTAACATTTGTAMTATACTTCAAATTAGTGTTGTTCCCTCTGCTTTCTTTT 

TCCTGACTCAGAAACAGAGTGCTTTGACTGCTCTATGTTGGGGCCATG 

Sequence 1908 

AGGTACMTATAGGCAGACAGTTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAG 

GTCAGCATCATTGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 

AAAGGAGGTTGCCGCATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 

TCCCCCGCGTACCTGCCCG 

Sequence 1909 

CCGGGCAGGTACGCGGGATGGGTTTAGACTAAAGTATTTTCTTCATAMTTGAAAGAGAA 

ACACACTAGGAATTCAGAAGATTTTAATTAAAGTTGTTCAAAAAATTTM 

AAAGGCTGGCTCACTAAGTAGTTTTGATAATGM1TTAATTTATAGAAGAMGTTCTTGG 

CAAATTTGGTAATGCCAAAAAGAGCCTACATATGCACAATGTATATTTTCATGGAAGCAT 

TATTATTAAGTGTCCTGCTTTTACTGGTTGACATACGTTAGTTCATTTCAGCTTCATTCA 

AATATGTAGATTAAACCTAACACCTATAAAATGGTAACAATT 

Sequence 1910 

CCGGGCAGGTACCATACACCATGGAAGTJAGCCTTCCACCTCAGGAGAAAGCAAGCTGCT 
CTCATAAGCCTCTCACCAACTACCCAGTAGTCTTTGCCTCCAAACTGAACAACAGGAAAA 
AGTG CCATAGTTTTTAATTAAGCTGTTTTAAAAGTC C ATGTTCCGGG AG AAAAC AAAAAA 
AAAAAAAAAAAAAAAAAAAANGTACCT 
Sequence 1911 

TCAGCATTCTAAATCAAACGCAAACAGCAGMTCATATGGCATAGACACTTAGCANATCC 

AATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGN 

TCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCAC 

ACCGTCTGGNGGGACATTTNACCCAAGTAGGCAAATGCATGACAGAACTCACTGGATCAA 

TATCTCAAAATGCTGGGAAGAAAAAAACGGTAACCAGAAATTCTACACAGAGTGAAACTA 

C 

Sequence 1912 
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AGGTACCATACTTTGTTTTTATTCTTGAGAGGAAACTTCGATCATTTAAAGCAGATAT^ 

GCCTTCATCCCCTACCCCCACCCAATAAAACAMCCTGGAAATGCCCATTTACATATTTT 

AGCTTAATCACGGTCAACTGTCTTCTATCGCCACTTCACTGCTCACAAAATACACATATT 

TAAAAGTGAAAATACTGAAATATGTCTTCACAATACAGCTAATACCACCAGTAGAAAA7T 

CACGTCACCATGTTCAGTAGACAAACGTGAGAAGCAATGTGGTGTCAGATTGGAAACTTG 

GGAGAGATTCTCAGAAATAAAGTAACTTCAATAGGTTGCTTTTTGCATAGTGCAAAATTA 

MCTTCACTTTATTTAGTGGAAGTGACCCAATTCAATCTTATCTGGGGTCAACACAGCTG 

ATTTCATTTGTGTTTGACCACTGCACATTTTGGCATCCAAAAAGTTAAGGAGAGGAAGTC 

AAGGGTATTATGGAGAACTACGAAAGACAGATGCCTCCAATAAAAGAC MINI! GGTGG 

TATTTGGTTTTGGGTTTGGGNGGATGATTTGGATCCCAAAAGAGAMCCACTTMG 

GGGAATTCCCCCCCAAAATTTCTACCTTNAGAAAGACCCAACAATTGTTGNTTNTGGNGA 

CAGCTTAATCACTGGAAGACTTTTGCATTN I I I I I I ATTAANAGAGAGGATTGGGTNGAT 

TTAAGATCCTCTTNTNCCTGCAGGTNTCCATTTAAATTNCACCNCAGATAATNNGGAAAA 

NNNTCCANATNATGGCATCCTATTTGCTNGCTTTTTTCCAATTGGAAATTNGATTTNAA^ 

NAAAAAAATNNNNNNAGNTCi II I I I I I I I I I 

Sequence 1913 

CGCGGTGGGCGGCCCGCCCGGGCAGGTACTATTAAAATTTAATGAGAGGTAACCCATGCA 

CCGGCAGGAAGTTCAAGGGGACAGAAGGGGCTTGCTCCACTCTGGAGCCCTATCTCCTCT 

CACCTCCTTCCTTCTAGTTTGACATGCGATCTTGTCTCTCTCTTTTATAAAGTGCTTCCA 

GGCCCTTTTGATTAGGATGCTGTCCTAAATCCTTCCTCCCACGTACCT 

Sequence 1914 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATGATTAATCTTCCCTTATCCACAAAC 

TTAACTGTGGAGAAACAGGAGAAAGTGTGTGCCTAGAAGCTGGATTCAGGGACCCATGGT 

TACCCACTTCTACTTATGATTCCGTCATTACTGTTGTTTTAGGAAAATAAACAGATCTTG 

ATTCTTTCATAAAAGTCGATTCTTCAACAAGCAAATGGGAAAATCGGCAGGCCCAGATAT 

GTGTGTATAGCAGCTACTGGTTGGAAATTTGGACACAAAAGTCTTTATCTACCCAGCCTG 

TGAGCCACAAATCTGGACTGAGTATAAAAAAGAATAAAACTAAATACCACAATGTTCTCA 

CTTATAAGTGGGAACTAAACATTTGAGTATGCATGTACCTCGGCCGCTCTAGAAC 

Sequence 1915 

ATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGAC 

TGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTT 

CTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCAC 

AGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCA 

GCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGAT 

GTCCCTGGACAAGATTTTACTCCCATAACCCAGGCAGAAGTACCT . 

Sequence 1916 

CGCCCGGGCAGGTACTTGAAGCTGTCATGGATCAGGTCATCTAGGGCAAGCAGGTAGAGT 
GAGAAAAACTCTGGAGAGCTTCAGGACTTAAAATGTGATCAGAAAAAAGGATCCTTATAG 
AAGAGACTGAGAGGTAGGAAGGGAACAGAAGTGATATTGCAGAGGCCTGAGGCAGACACT 
GTTAAAGAGGGAGGCACCTGGACACTTTTGCTGTAGACAAGTAGCCCATCTAATCCT.^AA 
GAGCCCCTTTGATATGGCCACTTAATGACGTATGAAATACTTCAAAAATGATTTTAAAAG 
TGTTTAAAGTAATGGGCTTCTGTGTCATTTCCATAATATTATTCTGGGTGCCTGCCATGC 
GC AAG AC ACTAACATAGGTGTTGG ATGTTTCAAACAAATAACATTTATTTGC AAACT CTC 
GTAACTG AT ACTTTTCTTC AAAAN AAAAN ATN N N N N N N AN N AGGTCCCTCGGC 
Sequence 1917 

GCTCCCCGCGGTGGCGGCCGNCCGGGCAGGTACI I I I M I I I M I I I I I II I I IGGAGAC 
GAGGTCTTGCTATGTTGTTCAGGCTAGACTTGGACTCCCAGCCTCAAGCAGTCCTACTGC 
TTCAGCCTTCCAAGTAGCTGGMCCACAGGCACACACCACTATAACCAGTTTATTTTACA 
TTGTTCTTGTAATTCTGGTATCAGTGATTCTTATMCAATATGCTCTCAATTTATTTTAA 
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GCATGTTG AGACC TTAATGATMCTATTCAGAAACCATACTAAACATTTGATCTTGAAAT 
CTTCCATCGTTTTCCATCCTCTGCTTATGCCTCAGTTATTCCATTCGCATGATATGGGCA 
CTTAATTTAGCCAACTCACTGACCCATTTC 
Sequence 1918 

CCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTCCCAAGCCAGGCGAATGT 

GGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGG 

TGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATT 

GATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCA 

AGCTGCAGTMGTTTGTCCTAAGAAAGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGC 

GAAATGATCCTGTGTGGGTTAAGTTCTCCTCTCTGGGTTGCTGTTTCCTCATCTCCCAGT 

TGGGGTGTATCTTCCCTGCGGCTTAGGTGAAGCGCCCGAGGCTT 

Sequence 1919 

AGGTA CAAGC ATATAATGCCAACTGGTAAAGTCAGAACTCCAGATGACAAGGTACAGCAT 

GTMTCTTTTGCAGTCTATTGACAGCAATCTAAATAAAAGCAAAGGAAGATGCTTCCAGT 

ACTACAAACTTTATGGGAACTTAGGGTCTCTCAMGCCACACTGTTTTGGGTCACTAGCA 

GGAGGTACAGTATATTAAAGCAAGCAGCTCANCCTCTCACTAGTGGTGAGATGTATGAAA 

ATCCTGGCAATGCCAGGGGGAAAAGAATCTATGAGTCTTCCCTTCGGTCATCTCTCCTAT 

TTCTTGCATTAGTTTTTTCAGGGTTAAATACACATCTCACAAGAAGAAGCTAGCTATATA 

TGGGCTGGGTTGGTCTTCAATNGAATCTNCAAATGTATCA 

Sequence 1920 

CCGCGGTGGCG GCCGA GGTACGTGGCGTAGCATTCCATGTTCAGCTTTGACATTTATTTT 

CTCATATCAACCCTTTn"ACACGTGAAACACMTCTCGCTCTTGAAGTCTTAACTGCATG 

ATCTGTG AAAC CTGTATTTATATTTTCTTC AGTGTATTCTTGTCGTGTGTGTGTC CTAAA 

CAAACCAAAAGAAAACTTTCCAAATCTAAAGTATTCATTCTCCAATTGGAGCAAGAGGAG 

TCAGTTAGATA CTATC ACGGCATTCATTTGTGGCTGGCTTGTCATATTTACTTATGATTG 

ATMTAMTCTAT7TTGCTTTTTAGAGCCTCCCGAGMGCCATCTGCCTTCGAGGTATTT 

AGTTTCATGATTTTCATTTTGMTGACTTATTMCTATGTATTTTGGNGAAGTATACA 

Sequence 1921 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTCTGTTGT 

CTTCATCATGAGATCAGAAAACTCTATGCTGCTCCCTAACTATGTGACCTTGAGCAAGTC 

TCTTAAAGGAGMCTTCAGTTTCTGTATTTGGAGAAGAAAGAAGATCATGGTAGATGATC 

CTCTTGAATTGGCTTCAGGCCTCTAAAATTCTGTAGTTTGGTAAAAATGCCTAATGACCA 

AATGGAGGCAAGATCTCGTTACCAAGATCATGGTGGCTTTGGAAGTGACAGAGATGGCTG 

AGACCAGGAGGCTGAAATGAAATTAAAGTCCTATGATGTTGCTTTAAGTTAATGCTATCT 

AAGAAAAGGTGCTGGTAGAGGMGTGGGGCCCCAAACTTGGCTTCATAGTGATTGGAGTT 

TCCAAATCAATGGAATAGG 

Sequence 1922 

AGGTACGTGAMCTGCATCTCTCACTTTTTCTCTCTCTTCGGGCAGTGCTAGGAAGATTA 
CTCACAAAMGTCAAATCACATCCCCCATTCTMTGTTGGTATTTCTGGGGAAGCCTTAT 
GGAAAGCTGCGTTTTTTCATGGGGGMCACTTTTGTC 

ATTTCMTAGTCAMCTGTTGCCATAAAAAAATCAAACTTAGGAATAGAACTAAACAl'GT 
TATTAACAGAATAGGCTTGAGAGAGCMGAGMTTCTTTTGGAGTAAAGTTATCCAAAGT 
CTATGTGTTCTCCTTTGACACATAAATCTACTATCGACCTAGTCTTGGAC I (TTTTTTTG 
G AC AG AAATTG G GTAAGTTTTG C AAAAGG AC AGGC AC AAAGC C 
Sequence 1923 

ACTATAGGGCGAATTGGAGCTCACCGCGGTGGTGGCGCGCCAGCGCCGCCTCGATGCGCG 

GGATCTTGCAAATGTCGGTGCCGATGCCGTAGATCATGGGCTAGTCCTATTCCTTATGCT 

TTGTTGCCCAGGCGCGCAGCCACCATGATGGCCTTCAmCGCGCACGGCGTTTTCCCAG 



Page 270 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



CCCGACAAACACCGCATGCGCGACGATGGCGTGGCCGATGTTCAGCTCAGTAATATCGGG 

AATGGCGGCGATCGCCTGCACGTTGGTGTAATGCAAGCCGTGGCCCGCATTGACCTTCAA 

GCCACGGCCCACGCCGAACAGTACGCCTGCCTTGATGCGCTCCAGCTCCGCAACTGCTCG 

GCGCCTGCCGTATCGGCGTAGCGGCCCGTGTGCAACTCGATCACGGGCGCACCGAGTTCG 

GCAAGCGGCGGCGATCTTGTTGCTCGTCGGCATCGATGAACAGGCTCACGCGGATGCCCT 

CGCCCTGCA 

Sequence 1924 

AGGTACAATGGGAACAAGGAGATAAGCAGTGAAAGGCCAAGGGAATGTCTGGAGTTAGGA 
CTTCAGGTGATTCACAACTTGGCTGCCACTCACCCGAGACTGCCCAAGCCCAGATTTCTT 
CCTTCTATAAGMTATTGATTCTTGCAAATAAGATGAACCTAAATGTGGTCCAGGAGTCA 
G C ATCTTCTAC ATG GTAC CTGCCC G 
Sequence 1925 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGTTCAAAT 

AGTCAGCAGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCAT 

GGTGAGCAAGGAMGCMGTTATTGGAGGCACATGTTAACACATAAAATATAAMTTAAT 

ATG ATC AC ACTG GAAAGGCTTGCCTG AGCCC AC AGTTTG AATG CCTAC AATAAG ATGAGA 

TGCACAACAAAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATC 

AGTGG GGACCC AAATGCTTATGTG G ACTC ACC AGGTATCG AATTAG AC ATG AATAGG AGT 

GTTTGTTGTGATCCAAGGAMCTATATAATCAAAATGAATACAATGAAACTTTAAAAAT^ 

ATTGTAAGGATCTTTACACCAGCCCA 

Sequence 1926 

CCGCGGTGGCGGCCCGAGGGTACGCGGGGAATCTTACTCCCTCGTAGAACAGGCGATATC 

TTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCA 

GAAAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGGTGTGATATTTAGTA 

GCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTGCTCT 

CCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGATTTGCTAAGT 

GTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGT 

CAAACTAAAAAAAAAAATGAAATAAAATAAAAATCTTTATTTCCAATGAGCTAATGAG 

AAAAGGGGATGATGAAAAAACAGGGAGTGGGAGCAATTTGGAAATGGAAAGAATGCTGAA 

TACCTACATGATGCCTAATTCAGAAGAACATTCTGGTATGTAGGGACATTAT 

Sequence 1927 

CAGGTACTTTTNl HUM I ANNTNNTTTTANTTCGCAATACCAGCTTTNACCCTGNCCT 

CTGCACTGCTTAGCCTNGGCTCTGCTGAGTNGTGCAACACATTCTGANAGATGCCATACA 

ATGCTCCAGGCAAACTTATAGAAAMCAGGAATGAGTGATTTTATACGGGATGTGTTTCA 

GCTGTCCAATNCAAAATAMTCACGTCCAGGTGCATTCTTTCCTAATTTGTGACCCAAAC 

TGCTTGCACTTTATACATTAATGAATTA I II I I I I AAAAG AAATTAGCTCTATCATG AAG 

ACAGGGCTATCTAACCCTTTGGTGGTCTTTNTCCCTTAAAAAGCAATTGTTCCATTTTAT 

AGGTGTTAGGTTTAATCTACATATTTGGAATGAAGCTNAAATGAACTAACNTATGTNAAC 

CAGTAAAAGCAGGACACTTAATANTAATGCTTCCCTGAAAATNTACATTGNGCATATGTA 

GGCTCl I I I I GGCATTACCAAATTTTCCAAGAACC I Sill I GTTTT AAATTAA 

Sequence 1928 

TCACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGGAACTATTGG 

AGCACCTAAGAGGAGCACCTACCTTGAATTTAGGGGTTAGCAGAGGCATCCTGAAAAAAG 

TCAAAGCTAAGCCACAATCTATAAGCAGTTTAGGAATTAGCAGAACGTGCATGGTGAGGA 

GATGCCAAAGGCAAGAAGAGAAGAGTATTCCAAACAGGAGGGATTCCAAAGAGAGAAGAG 

TATCCCAAACAACATTTGCACAAACCTGATGGGGAGAGAGAATGTGGGGTGGGGATGGAT 

GATGAGACTGAAGAAGAANGCCAGGTCTAGATAATCA 

Sequence 1929 

AG GTAC C ATG ATTAGTTAAATATAAG ACTCC GTAATTTTTAC AATTTTAACAATAATTTT 
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ATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAMCTCAGTGTGTGACTTAGAAAATG 

ATAATG CTGCTTTATGG AAAATGG ATTATAGGTG GGTAAGACTTCATTGCAAAAATTGTG 

TAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGTTAGTATCTTAGA 

CTAGGTTGGTATTTGAATAGATATTGGTAATATCAGTAGMTTTAATAATACATTAGAAA 

GAAAGAAATCAGAGAAGATCTTTTATTTCACTTGATACTTGNGGTGGTACTTTCAATGAG 

ATAAGAAGGGCAGGCCAAGGAGAAAGTTCAGGGGCAGGGGATGAGAAAAAAACAAGAATT 

TTATGTTGGACATGCTAAGTTAAACACCTGCTTAACTCANATTGGCTTCTGTGGAAGCAG 

AGGCTGACATGAGGGTGCTTATGAAAATGCTTTTTTGAGGAAGCCATTGCAGGAGGGAAT 

CTCTTAGGAAGCCAGGGGAAGAAAGGAAAGAGAA 

Sequence 1930 

AGGTACTTTTCCATATTTCCACATACTACCAAGGATTTTCAAGTGTAGGAGTAATTCAAT 

CAGACCCTTTCTTTCATATGAGAATTCTCAAAAGAGTAGAATAAATGGAACACAGAAAGA 

AAAGACTTTAAGCAAGGGGCTTCTTGGAAGCCACTGCAGCAGTCCAGGCAAGTGATGCTG 

AAGGTCTGCAACAGAACAGGGGCCCGAGAGTGGAGAAGAGTGGGTAAACTATCATATTAA 

AGAGGCCAAACTCAGACTGGGGGCAGTGGCTCACGCCTGTAATCCCAACACTTTGGGAGG 

CTTAGGCAGGCGGATCATGAGGTCAGGAGTTTGAGACCAACCTGACCAACATGGTGAAAC 

CCCATCTCTACCAAAAATACAAAAAATTAGCCAGGTGTGGTGGTGCGCACCTGTAATCCC 

AGCTACTCAGGAGGCTGAGGCAGGAGAATCCCTTGAACCTGGGAAGCAGAGGTTGCAGTA 

AGCTGAGATGGTGCCACTGCACTCCAGCGTGGGTGACAGAGAGAGACTCTGTCTCAAAAA 

AAAAAAAAAGTACCTGCCCGGGCGGCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnn 

nnnnnnATATCAAGCTTATCGATCCGCGACCTCGANGGGGGGCCCCGGTACCCACTTTTG 

0 

Sequence 1931 

AGGTACI I I I J I M I I I I I I i I I I H M I M I I NAAAGGTGAAGTTTCACTTTGTCACCC 

AGGCTCAGTCTCAAACAAACAAAAAAAAAAATTGGGGGATCTTTATTTCATGCTTTCTCT 

CATTAAGAAAAGAGAGGACAATAAAGTGTTAGAAGATAATAAAGCACAGAAGTCTAGCCA 

GTGTCTTTGGAAAATGAAACGGCTGTGTGAAl lilt I ATCTTTTGGCACCTCTTTTAAAA 

ACTCCACTTMTGACCTACCTGCATTACTGAAGTGTCTCTTGTTGGGGTGATTGTAGTTA 

TNTCTTGGGTGTCCATGCATCTGTGGGCCATGGGAGGCAGGCAAATGAGGCTTCTGAGGG 

TNAAGGTTGTGTNAGGNATGGGACTGAGATATCAGANAGTCCCGGCNGGAACCTGAGGCC 

TTNGGACCAAATGGTTTNTTACCCCCAANNCANGNCGNNTNTCTTNNTTTTGCTCCNAAA 

GAGGNAAAAGGAAAACCAAANGAGCTTAAAGANCCCAGGCNTTAAATTCCCCCAGAAAAA 

GAAAAAAAGGNTTTNANCCTCTAMCNGCTNCTTTTTGGCCNTTNTTAA 

Sequence 1932 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCTCTCATAGGGAAI I I I 1 1 1 1 IAAI 1 1 1 1 

AAATTTTAATTATTTAAGAGACAGGGTCTCACTCTGTCACCCAAGCTGGAGTGCAGTGGT 

GTGATCATGGCTCACTGCAGCCTCAAACTCTTGGGCTAATGTGATCCTCCCACCTCAGCC 

TCCCAMGTGCTGGGATTACAAGCATGAGCCACCACACCCGGCCATCCTAGATGATTTTT 

TAMCATCACTTTTCTCCTGGCAGAATTTTCTGACTCTGTAGTTCTTGTTAACTTCCAGA 

GAAAGCCCTGAGTAAGCCAGACCCTAATTACAAACATGCCTCAGTGGATGAGGCTACAAT 

TTGAGTCCAAACTGCTCAAGGTTCACACTGACAAAGTCTCACACCGTAAACCTGAAACAT 

CATGTTACAGTCTGAGGCCTGCTCCGAGCTGGGTAAGAAAGGCCTCGTGAATGCAGCCCC 

CTGGGCAGATGAGGTTTGGGCCTTCACCCCTTTGGGACTGCCTCCTCATTGCTCCTTGGG 

AAGCAGCCGCGCTTGCTGCCTGTGTCTTTGGGAAAGTGAAAG 

Sequence 1933 

AGGtAC AATATAG GCAG AC AGTTTGCCTTCAGAMTTCAG AAATGC AGCTTTTG AG GGAG 
GTCAGCATCATTGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 
AAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 
TCAGGACTAAGCACACAAGTTCTTCCTTCCATCTACCCCCGCGTACCTGCCCG 
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Sequence 1934 

CCGGGCAGGTACCTTCACAGCTGCACCACACCAGTTCTGGGCACATGGACAACAGATCAA 
GCTCCCCCATCATCACATAAGTTAGGCACCAAAGAGTTAGCAAGGGTATTCAGTTCCCAC ~ 
TTAATTGATTTGCATATCATGCAGGCTCTGATTTCAGTACCT 
Sequence 1935 

GANATGCTGCCACCTAGGTTACTNGTAGGACCCTATACGGCAACCTCNTTTGCCAGGAAC 

TATNTATAAACATCCTGCANGAAAATGCAGNGAAGTNGAANAGACAGGGATATCCCATAA 

GGTTATGCANGAACATCAAGAGMGATGAGAGGTCANAGATGGGMGAAACAAGAACTTT 

GACATGCTTGGTGTTCTTGCCCMGCTTTGAAGMGTTTACAAAGTCTATATGTNAGAAT 

ACANATTTNCCACCTTGCCCAACAGTAGAAAAACATAANAAGANAAAAACATTAAAAAAT 

GACAAGGAAGTTAAT 

Sequence 1936 

CCGGGCAGGTACCCTTCACTCTTTAGGAGAAGGTTTGAAAAAAAACTGTGTTTCCTGTTG 

AAGGCCAGTGATACCTGGCAGCTTACAGTTAGAAAACTCTATAACTTTATTATTTTCATT 

TGCACTTAAAACTGCTATGAGTTTmMTG^ 

TGMCATATTTATATTCTTCCCCCTCAGAGAGAAACGCTGTTAACATTTTGTGTATTTCG 
GTCCAGGTTCTTTTCTGTGTGTGTMGCACACATATGTGTTTATATTGTATAAAATAAGA 
GATTCATGCTGGACATACATTGmGTMCTTCCTTCCTTTCCTTCCTTTCm 
TGAGCTAGGGTCTCGCTCTGTCACCCAGGCTAGAGTGCAGTGGTGGGATCTCAGCTCACT 
GCAACCTCCGCTGTCTGGGCTCAAGCGATCCTCCCACCCAAGCCTNCCAAGTAGCTGGGA 
CCACAGGCACATGCCAGGATGCCTGGCTAATTTTTGTTAl I I I I I GANAGACAAAGTCTT 
ACTATGTTGCCCACNTTGGTCTNNAACTCCTGAGCTCAAGCNATCTACTCACCTTGGC 
Sequence 1937 

CCGGGCAGGTACTCCATGCCATGCTTGTTTCTCAGTCTTGGTTGTGTTAGCCCTCTCCCT 

GCTCATGCCCCAGTTCCGTGAGCAGCAGCTTGAGTTTCCTTAATGCCTGGGATTGGCAAC 

CCTGGTAACAAGGACCATGAACACTCTATTAAGAGCTAGGATTAACAATGGCATCTTGCT 

ATAGCTGCTTAGGCTTCAGGAAGGGTGTGGAACTAAAAGCATTTTCCCTCCCTTGAGCAG 

TTCCTTGTCTCAGTCCCCTGACTGCTCCCTATGCTAGATTCAGGGATTGAGAGGGTCAAA 

ATGCTCTCCCAAGATCTAGATTGTATGATTCCCCAGTGAGAAAATGGACCACAGAAAAGC 

CATTGCTTACTTTCTCCTGTATTATAGAGTCACTCCTGATTTCCAGCTGGTCTTGGTCAC 

AC AGG CTGGCTGTTT7TCTTTTCCTTTCTAGTTT 

Sequence 1938 

GATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGG 

TTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGC 

AGGAAAATGCAGTGAAANTATAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCA 

AGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTTT 

GAAAAAAANAAAAAAAANANAATAAAANGTTCCTTNGGCCGCTNTANAACTAGTNGGATC 

CCCCGGNG 

Sequence 1939 

CCGCGGTGGCGGCCGAGGTACGCGGGGACAATGAAATCAATACTCAGAGAAGGCAGATAA 

TTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGCGAAATGTCCCACCAGACG 

GTGTGATATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATAT 

GAACCAGT7TGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCA 

TTGGATTTGCTAAGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCT 

GAGCTGACTCAAAGTCAAACTGTMGTACCTGCCCG 

Sequence 1940 

AGGTACGCGGGGAGCTMTTTCTTTTTAAAAAATAATTCATTAATGTATAMGTGCAAGC 
AGTTTGGGTCACAAATTAGGAAAGAATGCACCTGGACGTGATTTATTTTGAATTGGACAG 
CTGAAACACATCCCGTATAAAATCACTCATTCCTG' l I I I "1 CTATAAGTTTGCCTGGAGCA 
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TTGTATGGCATCTCTCACAATGTGTTTGCACAACTCAGCAGAGCCAAGGCTAAGCAGTGC 

ANAGGACAGGGTGA 

Sequence 1941 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGCCCTGTGGCCCCCCATCTGAC 

ATACCACCAGAGATCTGGTTATCACCTGGCCCATTGACTACCACAGCCAGCCTTGCTTGA 

CATGCACCACCAGGGTCCCTGAGGACAGGTCTGCCCAATTAAGCTCCACCCTACCACAGT 

GAAAAACAAAAAGAMGAAAGAAAGGAACAMGGATATTCAACACAACCAGAAATCAATA 

AATAAAATGACAGGMTAAGCCTTCACATATCAATMTAACCTTCAATGTAAATGGATTA 

MGmCCACTTAGMGATAGAGACTGGCTTGMTAGATTTTAAAAATGACCTAATTACA 

TCCTGCATACAAGAAACTCATGAGGGAATTCAAAGGAATT 

Sequence 1942 

CCGGGCAGGTACTTGCAAGAAACCTCAGGACTTGAGTMCAGCMCATGGTMGTTTTCT 

AAGTTTTCTTTTCGTCTCCCATATACGCTGGGCTGNGCTGGAATCACCAACAGGCACAGA 

AAAAATGACAACAAAACANCMCANMCCCCCAAGMTATCCTGTTCTCTTTGGCCAAAG 

TCAGGAAAGGGGAGCCCCAACAGAGACCCAGTAGGAAGCTCTAGCCCCTGTTTTGTACCT 

CGGCC 

Sequence 1943 

AGGTACAACTTGACAGTTTATAAAAATGCTMCCTATATCAACATTTTTCCCCACCAAAG 
TGTTCN N AAGG GCAANGGTAAGN GAATACATGTTTN C NTGCTTGTGTCAGGGGCACACAG 
CAAGGGMGGCANAAATGGGCATGACCTAGTATTNAGAATCCCCAANAAGCCCTTTCATA 
ATTCCCTTAATAAAN G AACTTTG N G NGACCTTGC ATTN NTTTAAAN AN ATN AAATTAATT 
TGG GTTCATNTTTTCTTTAATNTCTNTG CGAATTAAACTATATAAAGTAAATTGGGGAAA 
ACCNCANAAATTGCNCATGNGNCTTTGGTTACTTTGNAATCTTCC I I I I I I GGACACCCC 
MGNTNTGGCCTTCCTTTTCTNCCTNTTMNTTGATNCCNTTAAAACCAATCCTTGGNCC 
CCTTNCNGGTTTCTTTTAAATTGGNAAAATT^ 

CCAATNTATGNAANATAGGNMCCTNAATTTTGGGTTNTTTGGTNTTNCAAGTNTTTTCA 

AAAACNATCTTTTCCAAAATTTTCCCCAATT 

Sequence 1944 

CCGGGCAGGTACTTATTTTTATATTAATGTCAGACTTCAGGATTTATTTTAGCCT^ 
TTTGAGMGCTTTCTAAGCCCTCAGCATTMGCAGCATTTTTCCTCATCACACACGGATC 
TGAGCAGGTGACCGCGGCTGGGATAGGTGCTGTTTGTGTTAGTGTGCAGGTGGGCTGGCC 
AGCCATTGGGTGTGTG l"l I I ICCTAGAGTAAGAAGCACTCCAGGCTGAGATGAGAGCTGT 
TGAAATGAGTAACATTTCCGTCTCCTGTGATACGCGCTGCTGTTGCNTTCTTTCAAA 
Sequence 1945 

NCTTTNCAGTCGGNAAAACCTNGTTNGANGCCAAGCTNGCATTTAATGAAATTCCGGCCC 

MCGCCGCCGGGGNGGAGMGGGCCGGTTTTTNGNCGTTATNTTGGGGGGCGGCNTTCTT 

TTCCCGCCTTTCNCCTNCGGCNTNCAACTTTGNAACCTTCGGTCTNGNAGGCCTANCGGG 

TTNCAGNTTCNNGGCCTTGNCCGGGNCNGTANNCCGNGGTTATTTCNAGGTCT7TCCATC 

CTTTCAAAAAAGGGGCCGGGGTTANATTAACCGGGGGTTTNAATTCCCCAACCAAGNAAA 

ATTCCAANGGGGGGGGGAATTAAAACCGGNCCAAGGGGNAAAAAAGNAAAACNAATTTGG 

TTNGGAAGGCCANAANAAAAAGGGGCCGCAAGGNCAAAATAAAGGGGGCCNCAAGTGGNA 

AACNCCCGGTTAAAAAAAAAAAAGG 

Sequence 1946 

AGGTACCTTTGTTCTTCCTCCTCCTCCATCCTCCCAGTGCCTGTGACTGGAATGGTGAAT 

GGCACCGCTGGATGGGTTTCTGCAGATGTGGAACAGCCAGGAAGCCAGCCCTGTCCAAAT 

TGAAGTGCTTTCTGTAGGACCAAGAAGGGTCAATTACTATTAGTGGTAGACTTCACAAGC 

CAGGCGGTAGGAAGGCAGGGAGAGCCCTATGAGAACTGAGACCGACTCACCTGGCATGTG 

CTGGATCCATGCATAAAAGCTACTGTATCMTGTTTCAGTTGAGTGAATGAAAAAAAAAA 

AN AN N N N AAN AN N NGGTCCCTGCCCG 
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Sequence 1947 

TACTATAGGGCGAATTNNAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGA 

CAGCATGCAGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAAT 

GTGATGTTGGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATG 

GGAGGAAAGTAAGAACAGATTCTGCCAACCACTACTGATTTGTTATAATTCTCCCCATTG 

AMTTGGGACAAAATTTGTAACATGCTACTCTGAAATTGATCACATTCTCAATTCTGTCA 

ATAGCAGAGAATTTAGCCTCMCACCACCMTTCAAAATAGTTTAGGTCTTGCCCCTGCC 

TGATTATGTAACAGAAMGCATAMGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGAA 

ACTTCCTTCTCTTCCATTAACGTGTG 

Sequence 1948 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCCGCCCGGGCAGGTACTAATTTG 

TAAAATTAAGAAGTTGTAGAACTGTGAACATCAACTATTGCATTATAGGGAAAAAGTTAA 

TTTAAAAGAATAGGACMGGTAAACTTTTTAAAAAATTTTGCG 

AAAAGCATTTCCAGAAGTTTGACATTCACTAGAAGCTCTCTCTGTGCCTCTCTTTTCCAC 

CATTCATACTGTTAAAAAAAGACMTATTGAAATATTACCATCAGTATTATTGACTGTTG 

AAGACTGGGCCCTTGCTGTAAGTCAGGCACCATGCTGGATGCAGTAAATGTATGTCCTAT 

TTCAAAGGTAAAGCCACTGCATCTTGCACACACAGTrTGGGTTCAAAAGCTTTAAAGAGT 

AACAAGAAGGTTTGCCCAAGCTTAGTCACATCACTGAGCAAAATCCACCGGATAGAATGA 

GGTTGAGATCTCTGAGAMGCAAMTACTMTTGACAGTAGCAGTTGGCGGGTCATGTTT 

TT 

Sequence 1949 

GCGGCCGAGGTACTGATCTAACCAAGATATTTTG I I I I I CTCACCCACCAGTNACTTTCT 

CAGTCCTTTCTGTATCCCTTGCAATTTGAACAAAGC7TGGTGAATAGTGTGCACACAAAA 

AGCACACTAGGTGAAAGACAGATACATAAAAAGGGTAAAGNCAGGATATTTTAACAANCC 

TATCAAGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTCTTTAACCCTCTCTCCACTGTT 

GGATGAAATTTGCTGCATTCMTTCCAGNTCCCACCCCAACTTCCTTCTTAACACAAGGN 

CANGGTTAAGCCTTCGGTGCTTTMTCCAGAGGAAAATTACTTATTTTAAAAAGCAGTGA 

AAAACACCGCATTCCTTTGGCACAGG 

Sequence 1950 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAAGCAGTGGTAA 

CAACGCAAGAC I rTTTTTTTTTTTTTTTTTTI I ACTGCAACAACCTAACAGGAAAACTAA 

TCAGAGATCATGAGGAGAGTGGTATATCAGTCAGGGACTAATCAGGAACATAATCCACTT 

GAGATTTTAAAATAGAGAGGGCTTAGGCTGGGGAGTTGGTGACAGCATAATGGGAAATCT 

ACAAAGCAAACAGGAATAGGGAGACAACCCAGAGCCCATCTAGGCTGGCAGAGTCATCAT 

CTCTAGGCTGGAAGGTCANAANAAGGTGGTGTTAGTGGAGCCAGGAGATGCACAGTCTCC 

CAGGGGAAGCTGGAGCTAGGGTGTACCTGCCCG 

Sequence 1951 

A7TTAGGGCGAATTGGAGCTCNTCGCGGTGGCGGCCCAGCACATAATGCAGTATACGCCC 

MTTAAATTTGAGACTCACAGAGTTTATTAGTGGCCACCTTGTCCAACCTCTCCTATTAG 

AGATGAAGAGACTTCCAGAGGTCATACCGCAAGTTGACAGCAGAGATGCAGTAGGAACTG 

MGTGAGGCAGAATTTGAGGATTCAGGTCCATGGGGAAGATAGTTTGCTGAATGAAGGAG 

CAGTGCTTTAGGGMCMTTTTMTGGAATTTAAAGACAGAATAATTCAGAACTGGATAC 

ATAGTAGATACCAACTGCACCTTGTGGGAATAATTCTCTGGATAGTATCTGGCAGTCTTC 

AGATACTTTGCTGTTATCCTGGCTAAACCACATAAGTTGTTGTGTGTTGGTTCACAATCT 

GATCTTGCAAGATGTAAGTTAGAATCTTGACAG 1 1 1 I II II G 

Sequence 1952 

AGGTACG C AAC ATG AC ATTGGCTG GTGTAAAG ATCTTAC AATTATTTTTAAAGTTTCATT 

GTATTCATTTGATTATATAGTTTCTTGCCATCACMCAAACACTCCTATTCATGTCTAAT 

TCGATACCTGGTGAGTCCACATAAGCATTTGGGTCCCCACTGATGAAGAGCACCATGGCC 
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AGGTCACCCACTTGATCAGGTTCTCTTGCTTTTTGTTGTGCATCTCATCTTATTGTAGGC 
ATTCAAACTGTGGGCTCAGGCAAGCCTTTCCAGTGTGATC^ 

GTGTTAACATGTGCCTCCAATAACTTGCTTTCCTTGCTCACCATGGTGATGCATTTTTCC 
TACTTTATGTCCTCGCTCATTGATTATGATGAGCTGCTGACTATTTGAACCTGGTA 
Sequence 1953 

ATCACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCTTCACGGCCGAAGCC 

CGATTCGCGGTAACCGCCGAAGCCGCAAGCCGCGTCGAACTGATTGGCGCTGTTGATCCA 

CACCACGCCAGCCTTGATGGCGGGAGCGATGTCCAGCGCGAGGCTGATCGACTCGGTCCA 

CACGCTGGCGGCCAGGCCGTACACCGTGTTGTTCGCCAGCTGCACCGCTTCAGGCGGCGT 

GCGGAAGCTCATGGCCACCAGCACGGGGCCGAAGA7TTCAGCCTGTGCCACGGCGGCCGA 

GGTCGACGCGCCCGTGATCAGGGTCGGCGGGAACCACGAGCCGTCAGCCGGCAGTTCGCA 

CGCCGGCTGCCATACGTCGCAACCTTCGGCACGGGCAGAGTCGACCAGCGCGGCGATGCG 

CTGGCGCTGCACGGGGTCAACCA 

Sequence 1954 

CCGGGCAGGTACTTGGTTTTNTTTTTTTNTTTTT 

TCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATCC 

AATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGT 

TCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTTC TGGG CTACTAAATATCAC 

ACCGTCTGGTGGGACATTTCACCCMGTTGTTCATTTATTAGTTTTCTGGCTTCGNGGAG 

AATTATCTGCCTTCTCTGAGTATTGATTTCATTGNGCCCCGCGTACCTNGGCCGCGACCA 

Sequence 1955 

CGCGGTGGCGGCCGCATACACGCGGGCGATGACTTCGTCGCGCCGGTTCTGGTCGATGGC 
GCGCGTGACGATGGGGCTGCGTCCGGGCGGCAGCTCGTCGATCACCGACACTTCCAGGTC 
GGCGTAAT ACGTCATGGCCAGGGTGCGCGGGATCGGCGTGGCCGACATCATCAGCTGGTG 

CGGCACGGC 

Sequence 1956 

GAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACTGATCTAACCAAGATATTTTGTTTTT 

CTCATCCACCAGTCACTTTCTCAGTCCTTTCTGTATCCCTTGCAATTTGAACAAAGCTTG 

GTGAATAGTGTGCACACAAAAAGCACACTAGGTGAAAGACAGATACATAAAA AGGG TAAA 

GTCAGGATATTTTAACAAACCTATCAAGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTC 

TTTMCCCTCTCTCCACTGTTGGATGAAATTTGCTGCATTCAATTCCAGTTCCCACCCCA 

ACTTCCTTCTTMCACMGGTCAGGGTTMGCCTTCGGTGCTTTAATCCAGAGGAAAATT 

ACTTATTTTAAAMGCAGTGAAAAACACCGCATTCCTTTGGC 

Sequence 1957 

CCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCCTCAACTTCTGAGAGCTTAAGGTATTTG 

TTCATGTTAGGCAGGTAATGTCTATAAACCAAAAGCTGCCACATACT ATCTCTC TGCTCA 

ACACTTTACTGGATACTGGAGGATTTAAGGGAAGGAAAAAAGTGACC l t l I i i GCCT TTT 

TAAGAGCTTGTMTCGGTAAAGGAGATAAMCTCACACAAAGGTCAGGGTATATTTTGTG 

CTAAGCATATGACCGTAGACATTTAAAGTGCTGCCCGTAA TGCAG AAAGAAGAAGAAAGT 

GGTGTGATCTAAMGTMCATAGGTTTTGGTTTTGGmGNTTmGNT 

TCI I l I ICI I I I ICT 

Sequence 1958 

CCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGAC 

CTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTC 

CTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAG 

GGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGTCAGAGATGGGAAGA 

MCAAGAACTTTGACATGCTTGGTGTTCTTGCCCAAGCTTTGMGAAGTTTACAAAGTCT 

ATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAA 
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ACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAACAATGTGATGGTGTTTGGANGTGG 

AGCCTTCAAAAAGGTATTAATGCCCTTGTAAGAAGAAGGCCAGAAAAGCTTGCGCCCCTT 

CTTTCTGCCCTGTGGANGGAGCCCAAGANNCCGGCTGGTCTTGCNACCTTGCAAGAAGGA 

CCCCTCACTTAGAAGCTAGGCCNTACTTGGGCTTNCTTAATCTTGGGCTTTTCCNACCTT 

NCAGAACTTGTGANNAAGTTNTNTTGTTTGGGGGTTAATCCAATGGGCTATNGGAAATTT 

TTTTATNACCNNNCCCNNGCCMGANAGGGCCCTTATTTACTTCCTTCCCTT 

Sequence 1959 

GGCGAATGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTG 

AAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAG 

ATAGACCACTGGAACAATGAGAAGGAGAGAATTCTCTGGTCACAGACAAGACTCTCTTGA 

TCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCG 

CTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGAC 

AAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCNGAGGAGCAGTCTCTTCTGTCCCGCT 

GGMCCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACA 

CCNAGTGAGAAAATTCCTTGAMTTTGCAAGTNGTCTGGGTTCATTGTCTAAAGCTTGGT 

TCCAACTTTTCCAGAATGCCCCCCAAAGMTTCACTNGGATCTTGGAAAAAGGAAAAAAA 

CCTGATGGNGGTTAACTGAANCCATTTTGGTTGGNACCCTANCCAGGGCTNGATG TCATT 

TATTTGGGAAACCCNNACAAACTTGGCTTTTTCCTTTGCCCCNGGGANGNNTTGGGTTTT 

GAGAAGC I Nil I GGNNCCTCGGCCCCTTTTANAACTTAGNGGATNCCCCCCGGCCTNNG 

GGAATTCNAATNCAANCTTTTNGATNCCCGNNACCCTTGNGGGGGGGGGC 

Sequence 1960 

TATAGGGCAAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGG 

AAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTA 

GGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATG 

GTAAAGCCCTTGGTTAAATTTATTGGCTTCATTTATGTATCTCACMCACTTCTTTTC 

TTTTAATTCTAGGAATAGAAACTTACTTTTGGAAATACAGNACCTGCCGGCGGCCGTCTA 

GAACTAGTGGATCCCCCCGG 

Sequence 1961 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCAAGGAGGCAAGGAGATT 

AC Al I I i I CTTATTTCTACTACTTTTTGTTGCCTAAC ACCGTTTAGCTGGTGGG AC AGG N 

TCTMGTATTNGCTAMTATTGNTCTCATTATTTTGAACATGTAAAAGATGACTGCATTC 

TTATATATTTCCCTTTTAAGTTTGAAMGTGAACTACTTTCTTTATATAAA 

GCCTATTGTCTCAGAATATCACATATAACTGGTGCACTGG 

Sequence 1962 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGAAGAA 

CTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACC 

CTATACGGCAACCTCC'iTTGCCAGGAACTATTTATAAACATCCTGCAGGAAA ATGGT AAG 

CCCTTGGTTAAATTTA^TGGCTTCATTTATGTATC 

TTCTAGGAATAGAAACTTACTTTTGGAAATACAGTACCT 

Sequence 1963 

CTACTATAGGGCGAATTNGAGCTCCCCGCGGTGGCGGCCGAGGTACCCGGCTTGGTGAGG 

AGTCCCAGATTGTGGAACAAAGTATTCCTCTGTTCAATACCATCTTCCAAAAAGAAACAA 

TGACCTAGTGTGTCAAACCCAATGGTGTCCTTTATTAGCAAGCCATTTGTCCCATGCACA 

GTGATGCACCTTGAGAAGCTGTGATGAAATAGACAGGCCGTCCACAAAATGTTGCAATGT 

CTGTATCCTCCTTTATAATCCACGTCACCACACAGGTGAAAATGAACAGGGTATCGCCCC 

ATCTGCTGCTGACCCATGTTGTTTCAATTCCACATAAGAAAGGATGGACTGGAAGTAAAA 

ATTTTTCATTATCATAATGGTGTTCCCCCAAAAGG 

TGGGCACCTGGATTTTTTTCTTGCGTTTAGCCATGGAGTTCCCTCCAACTTTCTTCC 
Sequence 1964 
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ATNGGAGCTCCCCGNGGTGGCGGCCGCCCGGGCAGGTACTCCACAAGCCAGAAGATTAGT 

CTCCTTCCCTGTAGCCTCATCTAGCTTACTTATTCAGACTTTCTGTTCACAAGGGGTTTG 

TTATCTTCATGAAATTTTAATATGTGTMGTTAGAAATGAAATTAGAAAAAGAGGTTC^ 

GCTGTAAACACTGGGCANGAGTAGCTCANAAGAAGACTGTGGTTCTTGGATGCCCTCATG 

CCTTTGTTCCAGTGCAGGTGCCTAGTGGAGGGTGCAGTGCTCTTGCAACATAGAAGGACC 

CATTCTTGCAGGGAGCTTGATGGCTTAACTTTCCTNGGAACCGGGAAGGGAGAAGGAAAG 

GGCCTTTGTTNGGACCCCCMCCTTTTGTTTNGNAATTTGGACTTGGNGCI I I I I 1 I IGN 

ANAAANMCCCANCCCGGAAAAAAANTTCCCCCTTTTTC 

Sequence 1965 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTCAGAT 

GAAGCTGAGCTATCAAGTGACTTGGTTCAAGACCTATCTTTCAGACTGTCTTTCCTTCTA 

CTATGCAGATCAACATCCTATGTAAGCTGTGACCCCAGGCAGTAATTAGTTTTCACAATC 

CTTCTTCAAGATTGAGGGACCCTTTTCCCAGACTATGTTTTCATGAAGTAAGCTGGCTGG 

CCTTTGTCTCCCAACATTTATGGAAAGCTTTTGGTGCCTGATGTATACTTGATGTCAAAC 

TCTTTCCCAi I I I I GTGTCCCTCTGGACACAGAAGTCAGGTAGGTCCAAGACTTCGGTGT 

CCTTTTCCTTCTCTGAGCTCAG I I i I I I I I I I GTTCACAGNAMTGTTTATTTTTGTTCA 

TATTATGCCCAGGGGAATTTTACCCTACAAAAAA 

Sequence 1966 

CGCTTTCTCATAGCTCACGCCTGTAGGTATCTCAAGTTTCGGTGTAAGGTCCGTTTCGCT 
CCAAAGCTGGGCCTGTGTTGCACAGAACCCCCCGTTTCAAGCCCCGGACCCGGCTT 
Sequence 1967 

CCGGGCAGGTACGCCTACTCAACCCGGCTGTTCACCATTGATGGCATCAGCATCCCATAC 

ACATGGMCCACACCGTTTTCTATGATCAGGCACAGGGAAGAATGCCTTTCTTGGTTGAA 

ACACTTCATGCATCCTCTGTGGAATCTGACTATAACCAGATAGAAAGAGACACTGGGTTT 

TAAAATTCATGCTTCAATATCCAAAGGAGATCGCAGTAATCAGTGCCCCTCCGGGTTTAC 

CTTAGACTCAGTTGGACCTTTTTGTGCTGATGAGGATGAATGTGCAGCAGGGAATCCCTG 

CTCCCATAGCTGCCACAATGCCATGGGGACTTACTACTGCTCCTGCCCTAAAGGCCTCAC 

CATAGCTGCAGATGGGAAGAACTTGTCAAGGATATTGATGGAGTGTGCTTTG 

Sequence 1968 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATG1CCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTTA 

TGCTACTTTCACTGAGCATCCTATTGAAATACACCAAGTTGAGAAATTCCTTGA 

Sequence 1969 

AGGTACTGATCTAACCAAGATATTTTGTTTTTCTCATCCACCAGTCACTTTCCCAGTCCT 

TTCTGTATCCCTTGCAATTTGAACAAAGCTTGGTGAATAGTGTGCACACAAAAAGCACAC 

TAGGTGAAAGACAGATACATAAAAAGGGfAAAGTCAGGATATTTTAACCAAACCTATCAA 

GCTCTAMTATMNCCTCCTTGGTAGTTTTCTCTTTAACCCTCTCTCCACTGTTGGATGA 

AATTTGCTGCATTCAATTCCAGTTCCCACCCCAACTTCCTTCTTAACACAAGGTCAGGGT 

TMGCCTTCGGTGCTTTAATCCAGAGGGAAAATTACTTATTTTAAAMGCAGTGAAAAAA 

CACCCGCATTCCTTTGGC 

Sequence 1970 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTG 
CCACCTAGGTTACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAA 
ACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCA 
AAAC ATC AAG AG AAG ATG AG AG G AG ATGG G AAG AAAC AAG AACTTTG AC ATGCTTG GTGT 
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TCTTGCCCAAGCTTTGAAGMGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACC 
TTGCCC AACAGTGGAAAAAAAAAN AAAN N AAN N AATAGGTACCT 
Sequence 1971 

CCGCGGTGGCGGCCGAGGTACAACCACCCCCCGGGTATCCATAACCTAACAATAACAATT 

GTTAAGCATTCGGCCACTCATGTTTCTTCAGCTCCCCCACCTCCTCCCTATCCTCCCAGG 

TTCCCACTGAATTATATATCATTTCATTCATAAATAATTCTGTATATTTAGTTACAGCTC 

AATGAATTTCC AC AAAGTG AAC ATG CTC ATG CAAACCAC ACCTGC ACTGGGAAATAG AAA 

TTACAAACACCCGGAGCCCCCTCTGCAATACTCTCTCAGTCCTTTCCTCACTCCTTCCCA 

AAAGAGTAGCTACTATCACCTATTGTGAACCTTGCATATATGGATTCATAGAACATGTGT 

ATTTTTTGGTTTGGCTACAmGTCTMCATTGTTGGTAAGATTTA 

TATAGC C ATAGTTTATTTA 

Sequence 1972 

ACGGTGAGCCGTTGCTTGTTCTCAMGAGCCATTTCTTGAACGACCCGTTCCATTGGCGC 

ACCATCTCGGTGTGTGAAGTCGACCGACGGAACAAGACCAAGCCTGTCGACCCCGCCATG 

TGCCAGCTCCACTCTGGAGCAAGCTCAACTTGCCACCCGATGAGCTTGGCCAAGTTCTTG 

TGCTTCAATAAAGCTNGGNACACTACCATCTTCGCTCAGGTAGGGCATTGACACCCGCGC 

GGTGCAATATTCGATGCGCTTCCCAGACTCCAGTGGCAGATAATCATAGTAGTACTGCAT 

GTGGTCTTGGCCTGGAAACTCGATATTGGCTGCTGAGCCCCAAGCCGGCAGGCACGCTTT 

GTAATCGACACCCATTTGCAGCTGCGCAACGACTNCTTTTCCGCTCCCCGGGGCCTCGAT 

ATGGCCCAAC 

Sequence 1973 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTTAGC 

AAATAGATAGTTTATTCACCMCTATMTCACCTTAACAATAAMGCTCTCTTTTCGAGA 

ACACTGCGTGTCAGGCACTTCATGAAAATTATACTAAACACTTTATGAAAATTATCTCTA 

GCTGCAAATTCCCGCGGATGCTATGCAAATTAGGTATAGCTACACTTTGCTGATGGGGAA 

AGTATAGCTTAGAGAAGGTAGACGAACTGCCCAAGGCCAGAGCTATGTGGCAGAGATGGA 

ATTCAAATGCAAGTATAAATAATAGMTMTTTTCAGTTGGAMTTTCAAAMGCTGTTC 

TAGTTACATTGATGTTAAAATCTGTTATTCATTTAAGATGAAGATGGCAAGGGGAGATAA 

AATTACAGGGAAGTGGTGAGGATAGAGCTGAGGAAAGAMCTTGGAATGAGATTTTAAAA 

AGAAATCTGGAAA 

Sequence 1973 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTTAGC 

AMTAGATAGTrrATTCACCMCTATMTCACCTTAACAATAAAAGCTCTCTTTTCGAGA 

ACACTGCGTGTCAGGCACTTCATGAAAATTATACTAAAC/^TTTATGAAAATTATCTCTA 

GCTGCAAATTCCCGCGGATGCTATGCAAATTAGGTATAGCTACACTTTGCTGATGGGGAA 

AGTATAGCTTAGAGAAGGTAGACGAACTGCCCAAGGCCAGAGCTATGTGGCAGAGATGGA 

ATTCAAATGCMGTATAAATMTAGMTAATTTTCAGTTGGAMTTTCAAAAAGCTGTTC 

TAGTTACATTGATGTTAAAATCTGTTATTCATTTAAGATGAAGATGGCAAGGG GAGAT AA 

AATTAC AG GGAAGTGGTGAGGATAG AGCTGAG GAAAG AAACTTGGAATGAGATTTTAAAA 

AGAAATCTGGAAA 

Sequence 1975 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGACAGCATGC 

AGGGTGTGTTCAGMGAGCTTGCTGAGGTGCTCGOCTCTTAGCATTAAAAATGTGATGTT 

GGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATGGGAGGAAA 

GTAAGMCAGATTCTTCCMCCACTACTGATTTGTTATAATTCTCCCCATTGAAATTGGG 

ACAAAATTTGTAACATGCTACTCCGAAATTGATCACATTCTCAATTCTGTCAATAGCAGA 

GAATTTAG C CTC AAC ACCACC AATTC AAAATAAG TTTAGGTCTTGCCCCTGCCTG ATTAT 

GTAACAGAAAAGCATAAAGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGAAACTTCCT 

TCTCTTCCATTAACCGTGTTGAATGAGTAAGGACAGTTAAT 
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Sequence 1975 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGAGACAGCATGC 

AGGGTGTGTTCAGAAGAGCTTGCTGAGGTGCTCGGCTCTTAGCATTAAAAATGTGATGTT 

GGTATATCATCCTGATAGAAAACACTGCTTTCCAAATCCTAGTCACTGGATGGGAGGAAA 

GTMGAACAGATTCTTCCMCCACTACTGATTTGTTATAATTCTCCCCATTGAAATTGGG 

ACAAMTTTGTAACATGCTACTCCGAAATTGATCACA7TCTCAATTCTGTCAATAGCAGA 

GMTTTAGCCTCAACACCACCMTTCAAAATAAGTTTAGGTCTTGCCCCTGCCTGATTAT 

GTMCAGAAAAGCATAAAGTTTGCAGGGTGGAGCGGGAGGATTTTTCTAGGAAACTTCCT 

TCTCTTCCATTAACCGTGTTGAATGAGTAAGGACAGTTAAT 

Sequence 1977 

CCGCGGTGGCCCGCGCGGGCAGGTACATGACATTATTGTACGCTGTTGTI TACATTACAG 

GMTACTTCGGATATACCAATGCATGTGAAGAATGGGTTTCCCCTCAGTTTCACATAAAC 

TTAGAAATGTCTCAACTGACATTTCTTTTTC I I I I I H I AATNTTTAATTN U 1 1 1 I H I 

TTGAGACAGTTTCGCTCTGTCCNCCCANGCTANAGTGCAGTGGTGTGATCTCAGCTCACT 

GCAAGCTCCGCCTCCAGTGTTCAGGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CTACAGGTGCCCGCCACCACGCTCAGCTMTTTTTTGTATTTTTAGTAGAGACGGGGTTT 

CACCATGTTAGCCAGGATGGTCTCCATCTCCTGACCCTGTGAACCACCCGCCTTGACCTT 

CCAAAGTGCTGGGATTACAGGGGTGAGCCGCCNCGCCCGGCTCA 

Sequence 1977 

CCGCGGTGGCCCGCCCGGGCAGGTACATGACATTATTGTACGCTGTTGTTTACATTACAG 

GMTACTTCGGATATACCMTGCATGTGAAGMTGGGTTTCCCCTCAGTTTCACATAAAC 

TTAGAAATGTCTCAACTGACATTTCTTTTTC I I M U I I A ATNTTTAATTN I I 1 1 I I I I I 

TTGAGACAGTTTCGCTCTGTCCNCCCANGCTANAGTGCAGTGGTGTGATCTCAGCTCACT 

GCAAGCTCCGCCTCCAGTGTTCAGGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CTACAGGTGCCCGCCACCACGCTCAGCTMTTTTTTGTATTnTAGTAGAGACGGGGTTT 

CACCATGTTAGCCAGGATGGTCTCCATCTCCTGACCCTGTGAACCACCCGCCTTGACCTT 

CCAAAGTGCTGGGATTACAGGGGTGAGCCGCCNCGCCCGGCTCA 

Sequence 1979 

CCGG GCAGG TAC GCGGG AAAGAGAAGGAGAATATACAGTCTTTGTATTGACCTGTATGCT 
TCTCATTlTTGTTCTTnTGTTCCTTTCTA 

CCTCAGCCTGAGGGATATCAGCATTTCTTGTGCTATGGATCTTCTAGCAACATATAATTT 

CATCTTTTATGCATATCCAGAMTTGTTGCTTTGCTTTGATTTTTGTTTT 

TCTGTGTAGAAmCTGGTTACCGTTTTTTTCTTCCCAGCATm 

TGAGTTCTGTCATGCATTTGCCTACTTGGGTGAMTGTCCCACCAGACGGTGTGATATTT 

AGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTG 

CTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGATTTGCT 

AAGTGTCTATGCCATATGATTCTGCTGTTTGCGTrTGATrTAGAATGCTGAGCTGACTCA 

AAGTCAAACTGNAAGTACCT 

Sequence 1980 

CCGCGGNGGCGGCCGCCCGGGCAGGNACGNNGNTGGAGGGGGTGTNAAAACCCCCNGACG 

CCGGGAGGAGANGCTGCCACCTAGG N NACTTGNAGGACCCNATACGGCAACCCCCTTNGC 

CAGGAACNATNNATAAACATCCCGCAGGAAAANGAGNCNATATGNCAGMNACACANNTC 

CN ACCN NGCCCAACAGTAGAAAAAC N N AAG AAGAG AAAAAC AAN AAAAAANG ACAAGGAA 

GNCAANGGAAGNCAGCAANGNGANGGNGNTTGGAGGGGGAGCCCNCAGAAGGGAANNAAN 

GCCCCNGGAAGAAGAGGCCAGAGAGCNNGCGCACCCNCNNCCCGCCANGGGAGGAGCCAA 

GAAGCCGGCNGGCNGCAACCCGCAAGAGGACCCCCACCAGAAGCCAGCCANACCGGCANC 

CCCAACNNGGCCNNCCAACCNCCAGAACCGGGAGAAGGANANGGNNGGGGGNNAGNCAAN 

GGGCCAAGGGAANNNNNNAAAGCAGNCCCAGCCAAGACAGGGCCCCAANNACCACANACC 

AANNAAANNANNANANAGGCCCCCNNCANAAACCCANGGNCAAANAAGAGAAAAGAA 
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Sequence 1981 

CCGGGCAGGTACAATCATTACATMTCATGACATTTTAATTCTTAATTACAATGCCTTTC 
C AACTATATAAAAATATGTCTAGTGAC ACATATTTTG GAAAG CTTTCTTGG AAAAACATC 
CAGAGATTGTGTTCATCACTATCTATATAAATGAGCTTATACTGAACCTTAAATATGTTT 
ATTCAGCCTAACAATCAGAAAMGAAMCGTTTTTTAACAGTATAAAGTTCAGATGAG 
CACTTGTCTTGGAATTTTATTTTGATTTTMTTAGAAGAAAGACTGCCATTAG 
TTCTMTGAAATCTTTATAAATTCTCTGG GC AGTGC CCTGTATCTTTATTAGAAAGACTT 
CTTAI I I I I MCAATTTTATTTGGAAGAAGTNCTACTCATTT 
Sequence 1982 

AGGTACTCAGTAACTTCCTGGGACCCCTTTTGCCTATCTTCTGAATAGAAACATAGAGCT 
TCTTGC AGAG TATTAGCTC CTGGC7TTATTG CTTGC TCATTTCTAC ATAACTGG G TCAAC 
CCTGGGGCACCACAGTGACAGAAAAAAGAAAGAAAAATAATGGGGTCCCCACCCTGTCAC 
TCTTTGGAGTTTATGGTTCCCTTTTCTTAGTTCTTCTTGTCAAAAAGATAGG I I \ I I CCT 
TCAGGATTTTAGATCAGCTATCACCGCCACATTGCAGCTGTATGATTGGGGTGGCCCTTG 
ACACAGGACCAAAAGAGAGCAACAACAACAGCATCAACAACAACCATAGTGATCCTACCC 
CCTACTCCTGGTGACGGCTTCCTGGGGCTCCTTATCCCCATTCTTGGGACAGCTTATCTC 
AGATTTTCTCTGCTCACCCCTGCTATGCAGTTCCAAATTTGATATTTGGAACTGGTAGTA 
TCTTC AGATC AAGTTGAGAGACAN AN N AN AAAAN N N NAN N N N N N AAGTACCTGCCCG 
Sequence 1983 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGTATAACAACATGAATTCATCTCAAAGACATT 

ATGTTGAAGCCAGATTTGACATAACTCATTCTTTGTCATTCCATTTATGTGMCTTTGGG 

AATAGTCAAAACTTAGCTATGGTGACAGCAGTCAGGGCAGTAATTATCTATCGAGTGTGG 

AGA7TGACTGAAAAGGGCACAGGAGAACTTTCTGAAATAATAAAAATGTCCTATATCTTG 

TTTTGCATCGTAGTTACACAGGTGTAGACAGTTGCTAAAACGCTAAAAACTAAATGTGTA 

AGATTTATGCATTTTAGTGTATTTAAATTATMCCCCC^ 

CTGGAGGGTAAATATGCTCGTTAGGAGCATAAGGGGATGATTTTGAGAAACCCTTGCTCT 
GTCATTCTGTCAGTAAACCGCCACTAACTTTTCTGGTTTCAAAAAAGCAAGGTCCATTGA 
TGAAACTCCTTTGTGCCTTAATGACAAATAACATTGATAAGGCCTACTGGCAACAAGAGT 
GN GGTTGGTATATCTTrrGGTTTTAG NG ACTAAATTTGG AN GAATG ATATTTTC ATGG 
Sequence 1984 

CACTATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTGGTGTT 

ATTTATCTATTGCCATCATTTCAGGGACACCTAGGAAAATAAATTTTCAATTGCATAAAA 

CCTMTTATTTCATACTGCAGACATTTGATCAMGGCTTACTTGATCAAAGGCTTGTCCC 

AATTGCTCCTCCAGGGATTCGTTCTGCTCAGATAAGATATGGTTCCGCTTTAAGAGAGCT 

TGTCCAATTCGAGCAGCGAGTTCCAGATCACGATCCCTCTCTGCCAGGAGATGTGTAACC 

ATGTCGATGTCATTGTAAGTTTTGGTCATCTGCTCCACCCTGTCTGGTGCCTAAGAATCA 

TGTAACCGGAAAGTCTCTTCAGCAAGGAC 

Sequence 1985 

CCGCGGTGGCGGCCCGAGGTACAACTTGACAGTTTATAAAAATGCTAACCTATATCAACA 

TTTTTCCCCACCAAAGTGTTCCCAAGGGCAAAAGTMGTGAATACATGTTTCCATGCTTG 

TGTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACCTAGATTAGATTCCAGAAG 

CCTTCAAATTCCTTATAAGACTTGGGACTGCATCTTAAAATAATATTG I l l M I I M ATT 

CTTTGATMTAAAAGMTGGMCCAAATCCTGGTTGTCTGATTCTTTGCCCAGTGCTCTT 

TCTTATGTCTAACACTGCCTCGTCTTATGAATAACTCTGACATTACAGAGAAAGACTATT 

GTTTGTTCAGTTCAACACTCAATTCCATAGGTGCTAGCAATTATGAAAAAGGCTGTTAGT 

ATCCAGCGGGGAAAGCAAANGATCCCGCAGGTGGTAACACCCTACTGAGAGCAGAGGGGG 

ATGAATACA 

Sequence 1986 

CCGCGGTGGCGGCCGAGGTACACTTGGTGCAATAAAGTGCATCTTTAAAGTGTTTTGAGT 
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TTG G ACACAG ACC CGTG AAACCAACACCATAATCAAGATTCATGATGAATATGTTCCACG 
TAGTTTTCATTTAA^TCCCTCTCAGTAGCATATTTC 

TTTCCATAAATTGCTTGTTCCTCTCCCCCTGCCCGCCCCCGCGTACCTGCCCG 
Sequence 1987 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACTTATAGTTGAGAGCCA 

AGTCTCCCTTATCATTGGTGAATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTA 

CTCAGCCTCTACCCCTAATATTTATATCAGAAGCAGAGATTAACTGTCCTTACTCATTCA 

CACGTTAATGGAAGAGAAGGAAGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAAACT 

TTATGCCTTTCTGTTACATMTCAGGCAGGGGCMGACCTAAACTATTTTGAATTGGTGG 

TGTTGAGGCTAAATTCTCTGCTATTGACAGAATTGAGAATGTGATCAATTTCAGAGTAGC 

ATGTTACAAAT1TTGTCCCMTTTCAATGGGGAGMTTATMCAAATCAGTAGTGGTTGG 

CAGAATCTGTTCTTACTTTCCTCCCATCCAGTGACTAGGATTTGGAAAGCAGTGTTTTCT 

ATCAG G ATG ATATACC AACATCACATTTTTA 

Sequence 1988 

NACTCACTATAGGGGCGAATTGGGAGCTCCCCCGCGGTGGCGGCCCGCCCGGGCAGGTAC 

GCGGGTATCCAGTTATTTCATTGGTTGAAATAGCTGACCGCTTATAATAAAGTAGTCTGT 

GGACTG CTTAAATTTTGTAGTG AATTCCTGTGTGTGC ATTTTTG TAATGGTC ATTCTCAG 

TCGTAAATGTCTTTAGAAAGTCATCTAACACAAATACATTGAGAGAAGTTTTCCTAGCAT 

AAGCCTGGTCATAAACTCCACCTGATAACTTGGGGCTTGATTTAAATCAAGTGGAAGCTC 

TGTGCAAAAAGATGGATATGTGACCAATATAATMTGGCTCCTTTTGCAAGAACAATACT 

TCTGAAATTAAGGGGAATGCATTAAGATGTGTATGTAATAMTAATTGAAAATAATTTTG 

GAGGCAGAAGTTATTGTAAGTGAAATTCTGTCATAAGGTGAAACAATCCT 

Sequence 1989 

CCGCGGTGGCGGCCGAGGTACTGGGGCCTTACTTCCCTAAAGACCCAGGGAGGGGAGAGA 

AAGTGCTCTGCTTGTATCCTAAGCACAAATGCAATACAGCAACATCAGTAATGGTAACTG 

AGTGCTTACTGCATGCCAGACCCGTGCTAAGCTCTCTCCATGCACTATCTTAATGTTCAC 

MCCTCACACAGTAAGCATTATTAACAACAGCACTTTACAAATAGAAATTTGTTGAGGCT 

CAGAGTTGCCATAGCTGGTGAGCATCCACAGCCTACATTCAAACCCAAGTTTCATTCCAA 

AGCTTTGCTTTCCTGCTGCTGTATGTCTTTTGCGTCATCTTCGCATATTTAACCTCCTTG 

GCTCAACTGGGAACTCTTTGAAGGCTAGGGCCATCAATTACATTTTTGTTTCTTCAA 

Sequence 1990 

CGCCCGGGCAGGTACAGTGTCAGGGCTTATGCTACCAGACTAGGTGGAGGAGATGACATG 

CAGATTCAGGAGAGGATCCAGGGCTCATGAGGACG1GGAGGTTTTGTTCTAAGCATTGGG 

TTTTTGTTTGGCTTTTTCTGTGTAAACTTCTCTTTCACCTTCCATTCA^ 

ATTCTTTCTTCAATAGTGGAAGTATTATTGTTAGTCAGGTTTTTCAATACTTAATAGTGG 

TCTGTATTTGTGGGTGCATGTGGTATTTTGGTGCATGCATACCATGTGGAATTATCAGAC 

CAGGGCNGCCGGGACATCAGTCACCCCAGACATCCACCCCTTCTTTATGTTGGGAACATT 

TCACTTNCTCTCTTTCAGCAATTTTGAACAATGCMCAAACTACTGTTAGCTTTAAACAC 

CCAACTGTGCTATCAAACAGTACATCTTACTTCACT/XAATGNGGNTTTTATGCCCACGCC 

ANNCTTCCTGGATAAAACTTTGTCTAMGTGTTTC^AAGAGGCTGCTCAAAGGAAATGTG 

GCTTGTTTTGGGANGTCTGAAATGTTTTCAAAATT"rCTGGCTTTGAAAAM 

Sequence 1991 

CGGCCCCTCGTGGTGGCCAGGCGCGATGCGCGCGGCGTGGCAGAACTGCTGCGTCCGTAC 

ACCAAAGAGGCGCTGGCGCCCGGCGAATTCCTGCTCGAACTGACGCCGAAGGATGGCAAT 

TGGGTGCTGGTCAGCGATGCCTGGTTCTTCAAGGAAGGCGAGGCGACGCGCTGGGAAAAG 

GCCCGCTATGGCGAGTTCCGCGTGCTGCCCGATGGCCGCGCCTTGCTGGTGGGCATGCGC 

GGGGAAGATTTACAGGCACTGTAACCGCTTGATTGAACAAACCCAATG 

Sequence 1992 

CGGGCAGGTACTTTGGCACATGCTGGTAGCCAGGAGTCTGGGCCTGAAATTTGGTCCTGA 
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CTCCACCCTCATCTCTCTGCATGACTTCTCTGGGAGATACAGCCTCTCCATTTTACACTG 
AGAGAACTAAATGAGCTCTAAAGCTGCCCTGACAGCTGACAGTCAAGGTTAGCATATTTC 
TGTGTGGCTCTGGCAAAAGTACCT 
Sequence 1993 

CCGCGGTGGCGGCCGAGGTACACTTATAGTTGAGAGCCAAGTCTCCCTTATCATTGGTGA 

ATGAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTACTCAGCCTCTACCCCTAATAT 

TTATATCAGAAGCAGAGATTMCTGTCCTTACTCATTCACACGTTAATGGAAGAGAAGGA 

AGTTTCCTAGAAAAATCCTCCCGCTCCACCCTGCAMC7TTATGCTTTTCTGTTACATAA 

TCAGGCAGGGGCAAGACCTAMCTATTTTGAATTGGTGGTGTTGAGGCTAAATTCTCTGC 

TATTGACAGAATTGAGMTGTGATCAATTTCAGAGTAGCATGTTACAAATTTTGTCCCAA 

TTTCAATGGGGAGMTTATAACAMTCAGTAGTGGTTGGCAGAATCTGTTCTTACTTTCC 

TCCCATCCAGTGACTAGGATTTGGAAAGCAGTGTTTTCTATCAGGATGATATACCAACAT 

CGCATTTT 

Sequence 1994 

CGGTGGCGGCCGAGGTACCGGTTGATGTAAGCACAGGGATGGTGGGGACAGGGTGACCAA 

AGTGAACTGGGCACGAGATGAGTAGCTGGTATATAGTGACTTGGAAGGCAAGGTGCTTGA 

AATTTCCTGGAAAATCTCTGCMGTGCACAGTAGAAACTATTATCATTGGTTACGTGCTT 

CAAGAGGACTGGGCAGATGGGGGGGCAGGAATAAGAGAGCCACTATTCACTTGACAAATT 

CTTGGACATTTTGATTTCTGAGCCATATGGATGTGTGTCCTATCAAAAAGAATAAGTGAA 

AATGTTCMTAGTMGAGMCACTTTGTAAATCTCTGGCTGCTGCTCTTTGTGATTAGCC 

TCTCAGCACTCTTATTTGGAATAATCAGAAAAATACAACTATCTAAATTTTGGGAGGAGA 

GTATACTTGTGGAGATTTGGGMGAAAGAAAATAAGTATGGAAAGTTTCCCCCAATAATA 

AAATGAMTTCATTGGAAAGCCATTTCAAACATTTAGAAAGTTAACCCAGAAAAATAGAA 

AGG 

Sequence 1995 

GGCGGCCGCCCGGGCAGGTACTCGNNAAGCAGTGGTAACAACCCAGAGTACTCGGGAAGC 

AGTGGTAACAACGCAGAGTCCCGGGAAGCAGTGGTAACAACGCAGAGTCCCGGGAAGCAG 

TGGTAACAACGCAGAGTCCCGGGAAGCAGTGGTAACAACGCAGAGTCCCGGGAAGCAGTG 

GTACCT 

Sequence 1996 

CGGCCGCCCGGGCAGGTACCNNNNACTNATTNTGCTGTGATCCTAAAACACTCTAAAAGA 

TAAAGCCTATTAAMCCATATTTTTAAAAAACTGACAGGTGTTCTTAAGATTAACTANTG 

AGTTCAGAGAATAAATNGAGGATGAGATGAATTTCTAAAAACAGCAGCACCATACATCAA 

AAATACCCAGCTAGAMTTATAAATTTTTAAAAGATCACAGAAGATAAACAAAAACAAAC 

CCCTCCCCATCTTATCTTTATTATTTNTTCTTATTTGTATAAAAATNAGGAGATATAAAT 

AAATAGAAAACACTACTGAMTATTTACAGGGATTCATGAAAACAGATTTTAAGAAATTA 

AGATACATATTGTCAGGTGTGGTGGGTCACGCCTGTANTCCCAGCACTNTGGAAGGCCAA 

GGTGGGTGGATCACTTGAGGTCAGGAGTTTGAGGACCAGCCTGGCCAACATGGCAAAACC 

CCATCTCTACTAAAAATACAAAA 

Sequence 1997 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGMCTATTTATAMCATCCTGCAGGAAAATGGCCTCTTTAAATATACACTTC 

TCTGTAGTGTATGCTAGAAATGGAGTGGCTGGAATAAAAGTGGCTGMTCATCTTCAACT 

CTAGTAGCTGAGACCAATGATGCTGACCTCCCTCAAAAGCTGCATTTCTGAATTTCTGAA 

GGCAAACTGTCTGCCTATATTGTACCTGCCCG 

Sequence 1998 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTCTAATCCCAGT 
TTACATTTAAAACACTTCAAATGGAGCCACTAGGACTCAAAGTTGCTAAATACACCCATG 
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CCCTCTTATCGCCTTCCACTGGGGGTGGTTCTTCGGTATTTTAAGTCTTGGAAGTTAATT 

TTTG AGGAAAAAAAAG TAATAG ACATG TG ACAAAAACAGGAAG AAGGG ACACAG TTC AAC 

TGGGTTGTAGCTCTAATCTCACTTTATCAGCTGCTTGAAAGAGGGAAATGAAGGACAGAG 

CGGTAGGAAGTCTGCACACACCAACCTCATGGAGACCAGCTCAGCCCCCAATGCAGTCAA 

GCTACCTAACCTCCTGGAAACATAAAGATCGGCGTGTTACTTG 

Sequence 1999 

NCCGGGCAGGTACTTGCTTAGTCAGTTCTGTGGGCCAGAGGACACTCATGGGGAGAGATC 

AAGGTTACTTTTTGCAGGTGGGGCATTTTGTAACTGAATAAAACCCAGTCAGTAAAAGTA 

ANCNACATGCNGAATCTANGCAAGTGTTGCAATGTATGTNTGCTCTTGATCTGAGGAAGA 

CCATGCATTTGACTCAGACTAGAAACCTACACCACAGAAATCCCACCCACAAGCAATGCT 

CTATTACTGTAGGTACCTCGGCC 

Sequence 2000 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGGGCTACAGGCAGG 

CACCACCTCACCCAGCTAA I Nil I GTATTTTTAGTAAAGACAGGGTTTCGCCATGTTGC 

CCAGGCTGGTCTCGAACTACTGGCCTCAAGTGATCAGCTCATCTTGGCTGCCCAAAGTGC 

TGGGGTTACAGGCGTGAGCCACCATGCCCGGCCCCCGCAGGACTTTCTGCTGTTTCTCTC 

AATTTTATGTAAGTAAAACAACAGCAACAATAAAATCCCTGTCACTTGAACACAGAATGA 

ATTCTGCT1TATATATTTTGTATAGTTTGTATAGATATTTTGTATAGTTTATGAGG 

GAACTTTCCAGAACAGCAACTACTGTGTATACTGATGGGGCAA 

Sequence 2001 

AGGTACATTCTAATCCCAGTTTACATTTAAAACACTTCAAATGGAGCCACTAGGACTCAA 

AGTTGCTAAATACACCCATGCCCTCTTATCGCCTTCCACTGGGGGTGGTTCTTCGGTATT 

TTAAGTCTTGGMGTTMTTTTTGAGGAAAAAAAAGTAATAGACATGTGACAAAAACAGG 

AAGMGGGACACAGTTCAACTGGGTTGTAGCTCTAATCTCACTTTATCAGCTGCTTGAAA 

GAGGGAAATGAAGGACAGAGCGGTAGGAAGTCTGCACACACCAACCTCATGGAGACCAGC 

TCAGCCCCCAATGCAGTCAAGCTACCTAACCTCCTGGAA 

Sequence 2002 

CCGGGCAGGTACTTTATATTACTTACTTGTTCACT 

ATTATTCTAATTTGGATTAATAAATATGTCAATTTATTTTATMGTCMCCGATA 
TTTTCAGCTTATMTTTTGTCTACTTAGTATCCTCTAAGTAGCATATCCTAATCTATTCT 
TAAGATTCACATTGTTATATGCTTGACATAACTTTACAACCAT CACCA TGGAGAAAGAGA 
TCTATAATGTAGACAGATTTTAATTTMTCATAATATCATCACA^ 

ACCTACACAGGTGCCATTTAMTAGGTGACAAGCTMTATTAAAATTATGATGGGAGTTG 
GGGA 

Sequence 2003 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGTCCCCTCACACCTTCTGTTGATGCC 

TTATTACCACCAGGCAGATTAGAAGTTCTGGTTTCTCTAAATTTCAGTTGACTCTGTAAA 

CTTATAATACTTATCTTATAATTCTGTTGAGAATATAAAATGAAATGATGCATGTAATAT 

ATCAAGCCACTGTCTGATGTAGAATMTATTACTMCATTTATGATTTTGAAAATTAATA 

ACAAAGTTGGAGGAAATACACCACCAACTTCAAATGCTAATATGAAGTTACAACGATC.VV 

GATAGTGTGGCATTGGTGTAAAAACAGACATGCTTATCAATGGCGCAGAATTAAAAG.AAT 

AGAAATTAACTTCTGCTTTG 

Sequence 2004 

CCGCGGTGGCGGCCGAGGTACCTGCACTTCAGACCCTGGCTCCATGTACACCTGTGCTCC 

AAACCTGGCTCTGTGGCTACTGCACAGGTGCCAGCAGCTCAAACAGTCAAACAGTGCCAC 

CCTTGCAATAAGGAAACCTACAAGCTAGACCGGATATCAAGAAAGATTCCTTCAGACATG 

ACATCCCCTGTGGAAGAAAMGACATTAGGGAGTCCCTAGCAGCCTTTGTCTTTGAAGAT 

CCCTTACTGCTGCTGTGGATGCCCACAGCCTTGTTTGTTGAAAACCCTTGCAATCTTCAC 

CAATTCTGATCTTCAGCTGACAAAAGATGCACAGGGGGAGGTTTGCCA 
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Sequence 2005 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGAATGCTGTGTGGAGTCC 1 1 I FT CTACTT 

TTGTTCCTGAGAAAGAGAGATTCACAAACTTCCCACATTCTTCCAGTGGCATCTAAGGAT 

CCCTTCCTAAATGGCCTTTCTCCCAGCCAATAGACTGAAGCAAACCATGTCCCTGATGCT 

TCCTGCCCACCACCATCCATTCTTTCAACCAACAGCTTCTCCGGGCTTTCACGTGCCATG 

CACTTTGCTAAGAGCTTGAGGGAAATACAGCAATGAAGAAGAGATGATTTCTGTTTGGAC 

CCAGTGCTCAGGGATTCCTCCTATTAGGAAATCAAAGGAGGCTTCCAAAAGCCCTACAAT 

TTTTTCCCTGNTAACTAAAGA 

Sequence 2006 

AGGTACGCGGGAGGAAAACACAACATGAACAGGCAGAGTGCACACGCCGTGGCCTCGGGA 

AGCCACAATCATGTGGCTGACATGGTTCCAGAGTGTGGCATTAGGGAGACCAACTTCTGA 

GCTGCAGATTCACAGCATGGCCATGCAAGCAGGGAAGATGTGCTCTACTCCAGGATCAAG 

GAGCAAGCCCATATTTCCTCTTGATTCTGGGACTCCCTCTTGAGACTGATTTGCTCTAAT 

GTTAGCTCTTGAGAGAACTGGGGTCCCTGCCTTCTTGGGGAGCCTGAAAAGAAGTTTTGC 

T 

Sequence 2007 

GAGCTTCTGAACATTTCTTCTCAGACTAAGCTCTTACACACAGTTGCAGTTGAAAGAAAG 

AATTGCTTGACATGGCCACAGGAGCAGGCAGCTTCCTGCAGACATGACAGTCAACGCAAA 

CTCATGTCACTGTGGGCAGACACATGTTTGCAAAGAGACTCAGAGCCAAACAAGCACACT 

CMTGTGCTTTGCCCAAATTTACCCATTAGGTAAATCTTCCCTCCTCCCAAGAAGAAAGT 

GGAGAGAGCATGAGTCCTCACATGGAAACTTGAAGTCAGGGAAATGAAG 

Sequence 2008 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGGAAAA 
ATCCAAGACAAGAAAAGACTGTAATAAAGCTAAATATAACAGTATAATGATACCAGGTTG 
CAGGCAGCAAATAGCAAACTTGAAACCATGGAAAAGGGAATTCATATAGTAGAGGACAAA 
TTTAAGGAGGGTTTCAAAAGTGTAAGMCAAAAGACT^ 

ATGACAGGTCTTGGAGATCCCAATTCTTTGGGAATAACAGTTTCTGAAGGAAAAAACAAC 
AAAGAGAACAACCATGTTCAAAGGGAACAAAGGAAAAATAMTCTTTCAATTAMGGAAA 
GTTTTCTGCTTGNAAAATGAATGACCATTGGGTTAAGAGGGTAAAACTTATGAA 
Sequence 2009 

TCACAAAAAATCGACGCCTCAAG7TCANNAGGTGGCNGAAACCNCGTACAGGAACTTATT 

AANAGCATACNCANGGCGGTTTTCCNNCCCTGGNAAAGCTTCCCCTNCGTGGCGCCTCTT 

CCCTGTTATCACGACNCCTGGCCCGCNTTTACNCGGNATACCTTGTNCCGGCNCTTTTCC 

TCCCCTTTCCGGGGNAAAGNCCGGT 

Sequence 2010 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACGATGCAC . 

ACAGATGGGAGGAGCTGCCTTGAGCGAGAGGACACTGTCCTGGAGGTGACAGAGAGCAAC 

ACCACATCAGTGGTGGATGGGGATAAACGGGTGAAACGGCGGCTGCTCATGGAAACGTGT 

GCTGTCAACAATGGAGGCTGTGACCGCACCTGTAAGGATACTTCGACAGGTGTCCACTGC 

AGTTGTCCTGTTGGATTCACTCTCCAGTTGGATGGGAAGACATGTAAAGATATTGATGAG 

TGCCAGACCCGCAATGGAGGTTGTGATCATTTCTGCAAAMCATCTGTGGGCANGTTTTT 

GACTGCGGCTTGCAAAGAAAGGGATTTT 

Sequence 2011 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCCAGCACTTTAGCCC 

TGTGTGGCCAGCGCCCCATGCGCAAGCGTCTTTCTGCCCCAGAGTTGCGGCTGAGTCTGA 

CTAAGGGGCCTGGAAATGATGGAGCTTCACCCACCCAGTCTGCACCTTCCTCTCCTGATG 

GCAGTTCTGACCTGGAGATAGACGAATTGGAGACACCTTCAGACTCGGAGCAGCTGGACA 

GTGGACATGAATTTGAATGGGAAGATGAACTACCCCGGGCAGAGGGTCTGGGCACCAGTG 

AGACAGCTGAAAGGCTGGGCCGAGGTTGTATGTGGGGATGTGACTGGAGAAGATGGACAT 
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CACTGGGAGGGGGTGTTCCGAATGGGGACCC 
Sequence 2012 

GGGGACCGCCAGGGGCCTCGAGAATCGGTATCCTGAGTCCTCTTGAAGAGCAGTAGAGGT 

TGTTTCATTAAGTGCAAACACATTGTTCTTAATTTGAAAACTGTGGGCAGAAACAGAAGC 

CCGAGACTAATTTTTCCATTGCTAACTCTAGATTCTCGGCCACTGGAGTCTGAAGATACT 

CTCTTTGAGMTGCATATTATTTTGCTCACAGCTAAMCATTTAAGTATCATAGCTGATC 

AGTGGAGTGAGATTAAAAGGTTTC I 1 1 I I I GAATCATCAGCTAGAGAT 

Sequence 2013 

CCGGGCAGGTACGCGGGATGATTAATCTTCCCTTATCCACAAACTTAACTGTGGAGAAAC 

AGGAGAAAGTGTGTGCCTAGAAGCTGGATTCAGGGACCCATGGTTACCCACTTCTACTTA 

TGATTCCGTCATTACTGTTGTnTAGGAAAATAMCAGATCTTGATTCTTTCATAAAAGT 

CGATTCTTCAACAAGCAAATGGGAAAATCGGCAGGCCCAGATATGTGTGTATAGCAGCTA 

CTGGTTGGAAATTTGGACACAAAAGTCTTTATCTACCCAGCCTGTGAGCCACAAATCTGG 

ACTG AG TATAAAAAAG AATAAAACTAAATACC AC C ATG GTTCTC ACTTATTAAGTGGGG A 

ACTAAA 

Sequence 2014 

CCGGGCAGGTACCCTGGATCGTTTCAGAGATCTAACGTGTTGCCACACCATANCTCTAAG 

AAGCTGTGGGGCACAGTTCAMTTCTTANTGGAGTGTATGATTGCACAANGMATACTTT 

AAATATGTTTACTTTTTMTTTCTAATNGACTAAGAAAATCAGCNAGCATAAAAACTAAC 

TTTTTTTANAAGGACTCCCCAAAMTrrTGCATGTTTTTTTCCA^ 

ATTTANNTNTGNTANGGGGCATTAMTTAMTNTNGTTGGCCANMTTTTAAMCNTTAN 

TTGNNTTACCCNTTCGNGCCCCGCCTTTCTTATGNAAAACCTTNGATTNGGGAMTTCAC 

CCCCCCNGGGGGGGNNTTGGGCMNGGGGTAAAAATTTTCCNGMTTTATTTNCAAAAGG 

ACCCTTTTAAATTCCGGTNAATNMCCACNGGTTCNNNAATCCCCTTTCTATGAAANGNG 

NGGGGGGNGGGGNCCNCCCCNGGTGTTAACCCCCCAAAAACC 

Sequence 2015 

CCGCGGTGGCGGCCGCTTTCTCTTACTGATAGTAGGATATTTCTGCTTTAGTTATTGTCA 
CCTTAAATATATTTTCAATGTTGAAATCCTCACAGCA 

AATTTTCTTAGGTATATTTCTGTCACGTTGGCATGATAACAAATGCAATAACCCAAAAGA 

CCCCAAMGCTAGTGTMTCCCTTTTGCMTCCAAGCATGAGGATTCATCTTCATGTTGA 

CAGTGCGTGAATGTTCGGTAGGCTTTGTCAAGCTTGCATACAATAAATTATATTATGTCC 

CTTTTTCTTTTAGGGGTCTCCTGTTTAAAGGATGGGTCTTCTGGAAGGGCTAACCTGCGG 

GAATGGAAAAGTTT 

Sequence 2016 

AGGTACMTTCCTTTATTCCMGGCCTGCTAGAATCTTACATGTTTATAMTTGCCTTTA 

TMGCTGGCATGCCTCTTGGAGTGGGGACAGACATTTGTTTTTCATGTGGTATGTCGGGA 

GTATGAACTGGAAAGACCGAAATCAGTTATAATATGTCAGCATGGAATTGATCGTCGGTT 

CTGTGAATCCMGTTGAGTTGTATTCCTGGGCCTTGTCCMCTTCAGATGATTTGAAATA 

TACTATGATTCGTTTAGCAGTAGATGGAGCCCGATATTTACATTGTTTGAGCATGGGTTG 

TGCCTACAAATATAAAGATGGGTGCAATTCGAGTTGCCAAAACTGTAATCTC 

Sequence 2017 

CCGCGGTGGCGGCCGAGGTACGCGGGCGGTCTGAATGGAGGAAGTTGGTAATACTTGGCT 

TTTTTTTTTCTTCTCTCATGCAAGTTTCC AGGATATAG CTTCTTTTTTTG CTGTATGTAA 

TTGAGACCAAACTCTAAGCTTGGGGCCCTGGAGAGAACAGTTGTGGCCTGCAGGTCATTA 

GTGAGGTGAGCAGAAGGGAATCCTGGACTTCCCAGAGAAAGAGGAGACCATGCCAGCCTC 

GTGGAACACAGCTACCAGGAAATACAGAAGCCAGGTTCTATTCCTTGCCGAACTCCCGGA 

AGTCCATGGAAAAAAACCGGGACCCACAGTGGGGGTGGGCCTAAAAATGTCCCAGCCGCC 

AA 

Sequence 2018 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTG 

GATAAATAAGGTGTGGACTCACCAAAGACAGTTCCAAAGTCAATTTCACTCTGACACACT 

CTCTGTGATCTTCCACAGTCAGCACAATGCCTGCCCCCTGTGGGTGTTGTATAAATATTT 

GTTGAATGAATGAATCAATCATTCAACAGACCAAGGCCAAATCAGAACCCCAAACCCTAA 

GGTC7TTATACTCTCACTGTCCATCCATCGATCTTCCTGTCAGAAATCAGAATATACCTT 

TGCAATACCCTTTGCTAGCCTTTCAGTTATCTTTTGAATAGAGGCTCTGAGCCTTGAAAA 

TATTGCCTG GG AAAATATTAACACCC ATT 

Sequence 2019 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGCATACTCAATGTTTAGTTCC 

CACTTATMGTGAGMCATGTGGTATTTAGTTTTATTCf MM I ATACTCAGTCCAGATT 

TGTGGCTCACAGGCTGGGTAGATAMGACTTTTGTGTCCAAATTTCCAACCAGTAGCTGC 

TATACACACATATCTGGGCCTGCCGATTTTCCCATTTGCTTGTTGAAGMTCGACTTTTA 

TGAAAGAATCAAGATCTGTTTATTTTCCTAAAACAACAGTAATGACGGAATCATAAGTAG 

AAGTGGGTAACCATGGGTCCCTGMTCCAGCTTCTAGGCACACACTTTCTCCTGTTTCTC 

CACAGTTAAAGNTTTGTGGGATAAGGGG 

Sequence 2020 

NTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGTAGTCCCAGCTACTC 

AGGGGGCTGAGGTGGGAGGATTGCTTGAGTCTGGCAGGTCGTAGCTGCAGTGAGCCCAGA 

TGGCACCACTTCATTCCACCCTGNGTAACANGAGTGAGACCCTGTCTCAAAAAAAAGAAA 

AAAACATACACACAGACATATATGCATACATACAAGGAGCAGCCACTACCCTTAGGGCTA 

AGACAGTGTGTCCAAGAATGAGTCCCCCATTTCCCTACCACCACCCCAGGGCTTGATNAT 

TCATACCCTGGTTTAGNAAAGGGGCCTCAGTTGTGGGCTCACTGGAATGGCCAGAGNTAA 

TTGCNTGNATGTGCCTTCTCTGGGGGAAAGCGTTG 

Sequence 2021 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAAAAATTTCAATTAACCCAAA 
AMGGCAGGAATGGAGGAACAAAGGAACAAAAAAATGACAMTAAAAAACAAATGACAAG 
ATAAGACACTTAGAAAACAAGTAGCAAGATGGGGGATCTACAGCAACCTCATCAATCATT 
ACTG G AATTG CAAATGG AGAAACTCCATGTGCAAAGCAG GGATTGTCACCAAGAC CC AAC 
TCCAT N TAC AG AC ACCC ACGTAAGATACAAAG AC AGGTTGG GTGTCTAAAGGATAGAAAA 
AGATCCACAGTGGTCCAGGAGTGGTCCCAGCATTGTGGTAGGCACAAGAGTGAGGATCAC 
TTGAGTCTTAGAQTTTTGACCTGTAATCCA 
Sequence 2022 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCNNGTNCTTCTN 

TmNCATGCTTGAPTGTGAGGCTTAmGAAAACMTGGCATTTGACGmCAmCT 

GACTCCTTGATTTCCA7TGTTTTCAGGAAGTTTTCATCTGCTGAAGGCAACTAATCCACA 

ATGAGCAAATTATCCTGCCCTGNAGGGTAAGCTATAATATCGGCCCCATGCTTCAGGAAG 

AGAAAATCATGTGTTCAGGGGTGACTTTGAGTCAGCTCAGCATTCTAAATCAAACCGCAA 

AC AGTAN CAATC ATt'ATTGGCATAGAAC ACCTTAAGC AAAATTG GGTC ATTTTACNTAAN 

GTTTTCTGGCTTTCGTGGGAGAATTAATTCTGCCC 

Sequence 2023 

CTATAGGGCGAA1TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC1 MINIMI 

M I M M I M IGGAGACGAGGTCTTGCTATGTTGTTCAGGCTAGACTTGGACTCCCAGCC 

TCAAGCAGTCC7ACTGCTTCAGCCTTCCAAGTAGCTGGAACCACAGGCACACACCACTAT 

AACCAGTTTATTTTACATTGNTCTTGTAATTCTGGTATCAGTGATTCTTATAACAATATG 

CTCTCAATTTATTTTMGCATGTTGAGACCTTAATGATAACTATTCAGAAACCATACTAA 

ACATTTTGATCTTGAMTCTTCCATCGTTTTCCATCCTCTGCTTATGCCTCAGTTTAm 

CCATTCGCATG 

Sequence 2024 

CCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTATTTCTAACAGGCTTGTAGAGAAAC 
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GTCGGTTCATGTAAATTAGAAATTATGGGGCCACTTTGCCATTCTTCACACCTGCAATGA 

ACAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAACTCCATTTGCATGTTATAGTGGG 

ATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCACTACATCTATATATGCATTCATA 

TTCCAGTGGCTGTGAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTCCTGGGAGGG 

AAACTGGAACTCAGCCTGGATGAAAAGCCACCCTGGTCCTGGGAGGTGAAAGNCATCTTC 

CAAATGACAGC 

Sequence 2025 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGACCTGACGC 

TGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCC 

AGGAACTATTTATAAACATCCTGCAGGAAAATGTGAAGTAGAAGAGACAGGGATATCCCA 

GAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTT 

CCCACCTTGCCCAACAGTAGAAAMCATAAMNAAAAAAAAAAAAAAAAAAAM 

GCCNGGGCGGCCGCTCGA 

Sequence 2026 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGAGTTTTTCAGAAT 

CAGGCCTGCTTTGTCCCTAMTGCTTTCTTATTTCCCCAGAGCCCTTATTGGATTTAAGC 

CCCATATCATATCTCACATACCTGAGACAGACATACCACAGTGAACCCTGCATGTCCTTG 

TMGTTCTTTTCCGTGCCTAGAAGTTGCAGAAGACCAAGGAACCTTGGAAGTGACTTACT 

CTTTTGCCTCATAAAGTTGCTTTAAAACAGCCTTTATTTTAATTATT 

ACACATTTTTAMTTTTMGGTTAATTTAGTCATTAATTCTGTCAAACM 

GMGCTAGTTGTCTTCMTGTCTTTTACTTGGAGTTACTTGGMGGTTTGCTTAA 

Sequence 2027 

GAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAGATGTAACTGGGAGATACCTGAA 

TGTGCAGTCAGTGCCAAGTGGAGGCCTAGGTAGGTTTATTTGGTTATTGGGAAAAGGAGT 

GGCACCAGAAMTTGGAGGACTAGAGGACAGTTGGGTGAGAGCAGTTTAGTTTCTTTGAT 

TCTGTGCTAACTTTTTTG G ATATTTG CTGGAAAATGC AATTTATAGAG G ATATTTGCTCT 

TGGCTATGGAATGCATTTGCTGTTTCTTCTCTTTTATACGTAAGATACATCTGTGAGACC 

CTCTACAGGAGATGAATTCCTGGTGTAAAAGAGTCATGTGAATATTGTGGAGTAATTATT 

CTGAGCCAGGGGAGCAGGCTAATTAGCCTTCTGGGGAATG 

Sequence 2028 

AGGTACAGAGAAAATAAAACTGACATATTTGAGTAGCTTTAAAAAAAAAACl rTTTTTTA 

AAAAAAGGCTGCATGATGAATCAGTTATGTGGCTTTGGACCAGTAAGGAGTCAATTTCAA 

CCCAGACTGTTCTGACTTGGAAGGCCATTGTGATTCCATGTCTAACTCCAGAATATTGTT 

TTCACAATACCATMCATCTTCTAMCTGMCAAMTACTTGCACGAAATAGTTGATTCT 

CTTCTTCCAAmCTTTTCATTCATGGAMGGGAMGAAGACACATTGMTTAAGCATCC 

AACAAATATAAATACAGCATTAACATTTAMTGCTATCTCAGCAMTATTTTAAAGGGAT 

AAACCAAAAG 

Sequence 2029 

CCNCNGCGGTGGCGGGCGCNGNCNCGGGGATCTCACTCAATCTTACTCCCTTGTAGAACA 

GGCGATATCTTCACCATGCGCACAATGAAATCAATACTCANAGAAGGCAGATNATTCTCC 

ACGAANCCNGANAACTAATAAATGAACAACTTGGGTGAAATGTCCCACCAGACGGTGTGA 

TATTTAGTAGCCCAGAAAGCTGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCA 

GTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGAT 

TTGCTMGTGTCTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTG 

ACTCAAAGTC 

Sequence 2030 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTTCCT 

CCAGTGCCCAGAGATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGATGTGATAC 

TCTAACAATGTCTGAAGCTCTGTCACTTGGGTGCTGAATCCTTTTCTATACTTTCTCAAT 
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ACTGGTCAGGATTTTACGGTGTTGTCTAGGAGAGAGCAACCATGATGGGCATGCAATGCT 
CAAGTTCACAGAAAAACATGAATAGCAAGACTCAAATCTGGCTGCCAGATTTCTGCAAGG 
TATTn ICTCTCCAMCGAGGTAATGTGTTTCTTATAACAGTCTTTATAATAAGGAGATG 
TGTGTG GAAAAAAATC AAGTGTTAAACGG AG TCATGCAAATTAGCCATCTGG ATAAAAC 
Sequence 2031 

CCGCGGTGGCGGCCGAGGTACGCGGGGGCCCTCAACTTCTGAGAGCTTAAGGTATTTGTT 
CATGTTAGGCAGGTAATGTCTATAAACCAAAAGCTGCCACATACTATCTCTCTGCTCAAC 
ACTTTACTGGATACTGGAGGATTTAAGGGAAGGAAAAAAGTGACCI INI fGCCTTTUA 
AGAGCTTGTAATCGGTAAAGGAGATAAMCTCACACAAAGGTCAGGGTATATTTTGTGCT 
AAGCATATGACGTAGACATTTAAGTGCTGCCGTAATGCAGAAGAGGAAGMGTGGTGTGA 
TCTAGAGTAACATAGGTTTTGTTTTGTTTTGTTTTTG I I II I i TCTTTTTCTTTTTCTTT 
TTCI I I I ICTTTATCTAGA 
Sequence 2032 

CCGGGCAGGTACCCTTCACTCTTTAGGAGAAGGTTTGAAAAAAAACTGTGTTTCCTGTTG 

AAGGCCAGTGATACCTGGCAGCTTACAGTTAGAAAACTCTATAACTTTATTATTTTCATT 

TGCACTTAAAACTGCTATGAGTTTTTrMTGCAAAAAATAGTGATTGAGAMGAAAAAAG 

TGAACATATTTATATTCTTCCCCCTCAGAGAGAAACACTGTTAACATTTTGTGTATTTCG 

GTCCAGGTTCTTTTCTGTGTGTGTAAGCACACATATGTGTTTATATTGTATAAAATAAGA 

GATTCATGCTGGACATACATTGTTTGTAACTrCCTTCCTTTCCTTCCTTTCTTTCCTGAG 

CTAGGGTCTCGCTCTGT 

Sequence 2033 

CCGCGGTGGCGGCCGAGGTACGCGGGAAGGTTAAGCATTTGTTTATTAAATAGCACAGAT 

TTCCACCCCCATTTGGTTTATCTAGTTTCCATTTGTAGTAACTAATAGAAAAAACTCTAA 

AATGCATCGGGAGGGAAACACATTTAAAGCTCTGGAGGCATAAAAACCTGCATATAAATT 

CAGGCTGCACCGATTGATAAGTGTGTTACTTTCTCAATTTACTTATTCTTTCTTAACCTT 

AGTTTTCTCATCAGCAAACTATCTTAATTACATGACTTTGTMTCCATAAAGCTTGCTTC 

AAGCTCTTAGCATGCACACAGCACATCTGCAAAAGTAAGCATTTGCCACTTTCGGTTCTC 

AGGATAACATATTCACCCTTCCCTACCTACTAGCAAG 

Sequence 2034 

CGAATACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAAT 

GAAGAGTGGCTACTTTTATTAGATCATAAACAGTTTCCATGTTACTAACCAGACTTTACG 

ACTATTAAGATAGCATCTTAACCTACGACTGTATAAATATTTTATGATCAATAAATCTTA 

ATTG GTAG ACTCTG AG AATAAAG ATGGTGTTTTTCCAA CCCCTCAATATTTGCATACGTA 

AAAGTATTTAGCTACTATTTTAAAGCACCTAACCGGTATCTGATGTTAAGTGCATGCAAT 

CATGAMCATCTTTATGAAACACCTAACAGGATATGGCATTACATG^GCTTTTATAAAA 

GCAATTAAACCAAAATAGCCTTTACCTCCTGGCAACTrAAACTCCAGCCAAATATTGATG 

CAGACTA 

Sequence 2035 

GCTTTGCTCCAGGCCTCAGGAGTGTAAGTTTAAGGGCGCGAACCTGGGTGGGACGGGGCA 

GTTTTTCCAGCGGGCCATACGGGAAMTTTTTGG1TCGAGGAAGACAACCACCC I I I I I A 

AGGAGAAAACTGCATCTTGCCCTGCGTTATTCCT^.CGCGGTGCCCAGGTGGGGTGTGTGT 

GGACCAGTCMTGACCGCCCAAGCTCTCCGAGTAGAAAACCCAAACATGGTTTTGTGGGG 

TGTGTGCCTTTGACCCCGGACTTAAGCAAAAAGCGTGGTCTTGGGCGTAGCTACGAGGTG 

GTTGGTGGGCTCCAGGGCCTGCGTGCCCCTCGACGTGAGCGATGATTGG 

Sequence 2036 

AGAAGCATTCAAGTCAGGTAACGGGACAAACAAGAAAGAATGTAGGCAGGTAATTCTGGA 
GACTCTGAGAAGAACAGTTGAGTGAACTCACTTCAGATGCATGTAGATTATTTGCACTCC 
ANAATGTAGACTGTGGGCACTCCAGAGACTACCAGAATTGGGGGAACCACAGTGACTCAT 
TAGGTTTCTCTGTTACMTCAGGCATGAGAAATATATATATTTTGTTGATGTTAGCTATT 
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GAAATGAGATATATTGGAAACTGAGMCATTGGCTTTATGTTTMGGAAGTGTGTTGTTT 
TG GTAAAATTGCCA7TCCAC AAAAATTT 
Sequence 2037 

TCACTNCTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGGCGCCAGGAAGCCGCCAT 

CGCGGATATTCAAAATGGACCAGGGAGCCGCGAAGTATGGTTCGTGCCAGCGCAGCACGA 

ACGCCAGGCGCGCCGCCAGCAGGCCGGCGATCAGCAGGTCGGAAACGAGGGGGCCGGCGT 

CGGGCAGCTGGCGGTAGCGCCGCAGCCACCACGCCACGCCGTAGGCGACGCACAGGGAAA 

GGAACAGCAGTACCAGGCCCGTGGGCAAAGCCAGGGGGCCGATTTGCAGGGACAGCATGG 

GCGCTCTCTATCACGTCGAATGGACGGCCATTGTGGCCCATGCG 

Sequence 2038 

CCGCGGTGGCGGCCGAGGTACAACTTGACAGTTTATAAAAATGCTAACCTATATCAACAT 

TTTTCCCCACCAMGTGTTCCCMGGGCAAAGGTAAGTGAATACATGTTTCCATGCTTGT 

GTCAGGGGCACACAGCAAGGGAAGGCAGAATGGGCATGACCTAGATTAGATCCCAGAAGC 

CTTCAAATTCCTTATAAGACTTGGGACTGCATCTTAAAATAATATTG 1 1 1 1 1 1 1 1 IATTC 

TTTGATMTAAMGAATGGMCCAMTCCTGGTTGTCTGATTCTTTGCCCAGTGCTCTTT 

CTTATGTCTAACACTGCCTCGTCTTATGAATAACTCTGACATTACAGAGAAAGACTATTG 

TTTGTTCAAGTTCAACACT 

Sequence 2039 

CGACTCCTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTTTTTTTT 

iTTTTTTTTTTTTTTn I A AGCATATCCTCACTTATCTGATGTAAATGACCAAATACCTA 

ATTCATGCAAGGCTTAAAACCTAGGNGACGGGTTGGCTTAAAACCTAGGTGACGGGTTGA 

TAGGNGCAGCAAACCACCATGACACATGAATGCTTATGTAACAAATCTGCACGTTCAACA 

CATGTNTTTAAN AACTTAAAGTAAAATAA 1 I M i I I I AAAAATGC AAAGAAACCATAGAT 

7AMGAGTCMCAAAATAAAAAGAAAACAAAAAATNTTTGATTMCCCAAAAGACAGCAG 

GAAAGAAGGAACAGAGATTAAAAAATNTGGGATAAATNCAAAACTAATAGAAAATTGATN 

TAATTTTTT 
Sequence 2040 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCNTNGAGACTGNCNTGGN 

AGACTGGNNAGGTCACGTNTCTNAGACTTCTGGNGAGACCATNCANGGCTTTTGGCTCTT 

GACANAGATNGACCACTGGAACACTGACNAANGAGAGAATTCTACTGGTCACANACAAGA 

CTCTCTTGATCTGCAAATACCACTTCATCATGCTGAGTTGTGTGCAGCTGCTGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTC 

TGTCCCGCTGGAACCCATGGTCCACTTGAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAA 
Sequence 2041 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTG CAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCTGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCCTGAAGTTCCTTA 

TGCT 

Sequence 2042 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATG 

TGMGATGGCGGAATTCCCMCGTGCTTCTGCAAAATTTTATGTATTCATCCCCCTCTGC 

TCTCAGTAGGGTGTTACCACCTGCGGATCCTTTGCTTTCCCCGCTGGATACTAACAGCCT 

TTTTCATAATTGCTAGCACCTATGGAATTGAGTGTTGAACTGAACAAACAATAGTCTTTC 

TCTGTAATGTCAGAGTTATTCATAAGACGAGGCAGTGTTAGACATAAGAAAGAGCACTGG 



Page 290 of 63 5 



WO 01/51628 



PCT/US01/00798 



Table 1 

GCAAAGAA TCAG ACAACCAGGATTTGGTTCCATTCTTTTATTATCAAAGAATAAAAA 
ACAATATTATTTTAAG ATG CAGTCCCAAGTCTTATAAG GAATTTG AAGG CTTCTGGG G AT 
CT 

Sequence 2043 

CACTACTTAGGGCGAA7TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGG 

GCACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAAT 

AAATGAACAGTATGGCTCCACATAAGGTAGAGCGTGTCTGGTGCCCTGCCGTTTGGAAAT 

GACCCAAAGGCAGGAGTGGAAGATGGAGTTTCAGCAGGAGAGACTGAGGATCTCAGAGGA 

TTTGCTCTGGGATGCAGAGACGCTTGCCTATTTGCTAAGTGTCTATGCCATATGATTCTG 

CTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAMGTCAAACTATTTGCTAAGTGTC 

TATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCAA 

ACTGTAAGTA 

Sequence 2044 

ACTACTTAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 I 1 1 1 11 

inwnuunmi igggacagctttctattctgtcacccaggctggagtgcagtgg 

CACAATCTCAGCTCACTGCAGCCTCCACCTGTTGGGCTAAATCCATCTTCCCACTTCAGC 

CTCCTGAGTAGCTGGGACCACAGGCACNCATCACCACTCCCGGCTAATTTTGTTTTTCAT 

A GCAC ATTACATTACCGATTTCCCAGTTTATAAGTAACCAAGATGCTCAAACAAGCAAAT 

ATTTT CCTTGA GTATGACTTACTACAAAAMGAGAMGMCAGAGAMTTAGTTCTGAM 

GTGCATTTTTNTAAAGCCCATTTCAGTATTTGAT 

TCAAAATGCTAAAG 

Sequence 2045 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCCTCCCTATG 

CATCTGCTGTGGGAAGTGTGGGTAACTCACAGATGATGCCACAGGGCATGTATTCAGGCA 

CCACAGGCAGCAGTGAATGMTGMGTGAAATGGTATTTATCCATTTCCTGGAAAGGTCC 

AGGGTTTGGCTCTGCAGGGCCAAGAGAACAGCTTTAGCTGTGCCTTAACCCAGTCCTGGA 

GAAGCCAGCAGGCCGTAATCACGGGGAGGAAACCCATCTTTTAAGGGCTCCTCGCTCAGG 

TGGTGACAAGGTGAGGTGGTCATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGT 

GCTCTGACATTTGATGCTAATATTCCATATTATCAGGGCTTCTGTGGTGTTTAGGCCTCT 

AA I I I 1 ICTCTTC 

Sequence 2046 

CGACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTACCGGCGCCGCTGGCG 

CTCAGGACCGCACCCGCGTTCAGCGTGACGTCGGCGCCGATGCCGCGATAACCGAGGfCG 

AAGCCGCAGCAGTTGGCGGCCAGGTTCGGCAGGTCTCCCACCGTGCCGGCGCCATTGCCC 

TGATTGATCAGTTTGATGGTGGCGCCGGTGATGCTGTTGACGCTGATGGAGCCGGCCA7A 

TGGTTGGTCTTGTCGCTGCCGTCCGCCATCAGCAGCGACACGTTGCCGGTGCCGGCGTCG 

ATCTTCGCCGCCACGCCCTGCTTGTCGACGCCCATGCTGAAGGGTGGGCGATGCCCGGCG 

TCGCGGTAGGCCGTCNTTGMCNCCCGTTTGGCCAAANCTTGTNCGTTTGGGCGGGT 

Sequence 2047 

ANGTACCGTTCCTNCAGTGCCCAGAGATGCTNTCCGCACCAAGCCACAGATGTGGAGGAG 

GCAGGTCTGGACTTNAGGCAGACAAGGAACTTCAGAAAATCCTGGGGAGAGACAGC^.CAG 

GGTAAGAGATGGCCTGTGAAAACTGAAGGCAATTTGGAAGGCA 1 1 1 1 1 AAATTCCGTGGA 

GCCACTGGAGAAATGTTTACAGGAATGATGCCTTAAAATTCCTCAGTGGTAAGATCT3AN 

TACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGGCTGCAAGGNAATTTCG 

ATATCAAGCTTATTCGATACACGTGCGACCTCGAGGGGGGGGGCCCCGGGNACCCANGCT 

TTTGG 

Sequence 2048 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTG 
CCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAA 
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ACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCA 

AMCATCMGAGAAGATGAGAGGTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTG 

GTGTTCTTG C CCMGCTTTG AAGMGTTTACAAAGTCTATATGTCAGAATACAC ATTTCC 

CACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGGAA 

GTTTAATGGGAA 

Sequence 2049 

CCGCGGTGGCGGCCGAGGTACTTACAGTTTGACTTTGAGTCAGCTCAGCATTCTAAATCA 

AACGCAAACAGCAGAATCATATGGCATAGACAC7TAACAAATCCAATGCCTTCCAGGCAT 

CTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGTTCATATCTTCAGATAG 

TGTCATTCAACCCCTTGGCAGCTTTCTGGGCTACTAAATATCACACCGTCTGGNGGGACA 

TTTCACCCAAGTTGTTCATTTATTAGTTTTCTGGCTTCGTGGAGMTTATCTGCCTTCTC 

TGAGTATTGATTTCATTGTGCCCCCCGCGTACCTGCCCGGGCCGGCCCGCTCGAAATTGT 

TATC 

Sequence 2050 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAATAAATACTTTC 
MCTATTTTGTTGCTAGCTTTACATA^^ 

AATAAAAACTAAMCAAAAAATTCAMCATCCAAAAGTTMCATCCAAAAGTGTTGGm 

TTCTTCTCAGAAATAGCAATTTGTTCTGCTTCTCATCAGTCCCACTTAGGCTCTCTCTAG 

GACTTAACTGAGCAACTCCTAAGATCCAAGTTCCCTTAAATTCCTACAGGCAGTGGTCCC 

ACCMCAMGACTTCTCTACTCTTCCTTTGCCTCCAGATGCTCAGACTTGGGCTTCTGGT 

ATTCTGAAAGGCATCCTTGTTTATTTTCCTACCAGAGATTGTTGGATTTGCCC 

Sequence 2051 

AGGTGGCACGGTGTATGAAGGCCTGTGTTTGAAGACTTAGGTCAGACCTTTTGCCCGCCT 

TTCCCCTATGCAGCTCTATGCAACTGTCTGTGCAAGGTCCTGTAATTTGCGTAGGACGCC 

TTGACACAGTCTCTGCTTATGGACGCTTTACAAAACTATCTTGCAAAGAAATAAGTACCT 

GCCCG 
Sequence 2052 

ATATATTCCACCCTCATGGCAAATATGAAAGATTTCCATTGGATGAGACACTGAACTCAG 
TAACCACMCAATGNGGGGCATMTGCMTGAAACTAAAAACATATTCTTAGGAAAGAAA 
AATG AAAG N G AGG AAGTAAG AAMTTTTTTT 
Sequence 2053 

gtccgaaaattttggggtaaggctnctcaactccgtcgngttggtgtccgggtctccgta 

aggggttacctccattttggtgmggtnactttantgtgatccgtcnattnctatcnggt 

ggtctcttnaccggtgtgcnmttcaangggtctcctttnanaagncnctcttgcaatat 

aggnnaaaaccaaatc<:agtctanttcnagnggggggncctn cctgg ctg ggcaa ggnaa 

gncnccgtctccaccnccttgnmgccccttcaaccccn ggctttt tamgtttttncaa 

ttmnanaamncaanogcnnccctttgggtggntgctccttttttgctttacccaaaan 

angtccccmgggnt^tccggttggnttattttntnctttaactccccatccaaagnttg 

tttcgmtgngggaaaccttnccccaaggnccggntttatg gtancccaag gtcctanan 

aanccnaaccaagaaaataaccattttggnttatctancccc1 i i i i i i i i i i ggnaana 

anatccnaaaanaa.catcccaanggttcgtnttcaagggtggatntaaaaaggnccaaaa 

ataanaaccaanaaaaaccaaaaanccccaancccaanaatttc ccttcaaa aaaac cct 

caaagncctcccggnccnggtaaggggggaaataacccccccl mini inccccttttc 

ccccggcrigggtnnaaccccctttgggccccccccggggggggncccgggggggctcccg 

gttcntttcntttaaagggnaamncctttatggntttgggggggaaattt 

NC 

Sequence 2054 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGA 
CAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATAAA 
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TGAACAACTTGGGTGAATTGTCCCACCAGACGGTGTGATATTTAGTAGCCCAGAAAGCTG 
CCAAGG G GTTG AATG ACACTATCTGAAG ATATG AACCAGTTTGCTCTCC ATAGGG AGGAT 
TTC ATC AACAGG AAAC AG ATGCCTGG AAGGCATTG GATTTGCTAAGTGTCTATG CC ATAT 
GATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCAAACTAGATGGA 
AGCATTCGACTGCCGGCACAAAGATGTTTGGTG 
Sequence 2055 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTAGCTTGCAGGGGTGCTATGT 

GMGATGGCGGAATTCCCAACGTGCTTCTGCAAAATTTTATGTATTCATCCCCCTCTGCT 

CTCAGTAGGGTGTTACCACCTGCGGATCCTTTGCTTTCCCCGCTGGATACTAACAGCCTT 

TTTCATAATTGCTAGCACCTATGGMTTGAGTGTTGMCTGAACAAACAATAGTCTTTCT 

CTGTAATGTCAGAGTTATTCATAAGACGAGGCAGTGTTAGACATAAGAAAGAGCACTGGG 

CAAAGAA TCAGA CAACCAGGATTTGGTTCCATTCTTTTATTATCAAAGAATA^ 

CMTATTATTTTAAGATGCAGTCCCAAGTCTTATAAGGAATTTGAAGG 

Sequence 2056 

CCGCGGTGGCGGCCGAGGTACTTCTGTGACATGTATCACCAACTCACACAGTGATTCTTA 

CGTTCTTAGGTCACTGCTTGGATTATGCAGCAAGAACAAGGCCATGTAAGGGGTAGGGGT 

GGGGCCAGGATTAGACCTGATCATTGAGAAATGGCAGATGGTAAGGGAAGGTCAGTCGCA 

GATACCTACACTGGTAGGAAATAAAMGCATATGAGACAGAACANAGTATTACAAATGAA 

GTGTAACAGACCACAGGTCCTGGGGTGTACCTGCCCGGGC 

Sequence 2057 

CCGCGGTGGCGGCCGAGGTACTTACATGGGGACCGCCAGGGGCCTCGAGAATCGGTATCC 
TG AGTC CTCTTG AAG AGC AGTAG AGGTTGTTTC ATTAAGTG C AAAC AC ATTGTTCTTAAT 

TTGAAAACTGTGGGCAGAAACAGMGCCCGAGACTAATTTTTCCATTGCTAACTCTAGAT 
TCT 

Sequence 2058 

ATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCCGCCGGGCAGGTACAATCTCTGGCCC 

TACATTTTCTAAACGTTATGCCACCCTGACCAAGGGGCAACTCCTACAAAGCCAGGCAAA 

ATMTAAAATCATATTTGTCTCTAGTGGAATGGATAACTATGCCTAAAACTGCCCCTTTG 

AAMGCAACTAGAGAGATAATTTCTGAAGTGTTTGTCCCTACCTGAATGTGTGGCAAAAT 

TCTAAACTCCCTGAAGTGTGAAAGTGGTTTNCAAGCCACATGCACATCCAGTTGTGGTAA 

AGGGTGAAAATCTAACTGGCTAAGAGGGCTTCATAGCAACATTAACCAAAAAGTGGTTAT 

GTAGACTTTGC CTG CTTC ATAATTCCCTAGGGCATTCTATGCTATTCTGTACCTTNGG CC 

GCTCTAGAACTAGTG 

Sequence 2059 

TCTCCGGCTTCTATTTTGCCCACCCGGAGTCGAAATACTTCGTCGTCGGCAAGATCGGCA 

TGGACCAGGTGGAAGACATGGCCAAGCGCCGCGGCGCCAGCATCGAGGACGTGGAACGCT 

GGCT 

Sequence 2060 

TCTCCGGCTTCTATTTTGCCCACCCGGAGTCGAAATACTTCGTCGTCGGCAAGATCGGCA 

TGGACCAGGTGGAAGACATGGCCAAGCGCCGCGGCGCCAGCATCGAGGACGTGGAACGCT 

GGCTGGCACCGAATCTGTCGTAGTGCCATAATATAAGCACAAGGCGCGCGGAACTTATCG 

CGCGCCGNAGCGCACTAACGCAGTATGCCGGTCGGNATGCTGNATGCCCGNTCCGGCCCG 

TTCCCTGACTGGCTAAGGAGGATATATGGCAAGCAATTTCAAACTGAAACGCTGGCAAGA 

TCAAGTGATCCTGTTGCTGGGCCTGTGG 

Sequence 2061 

CTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTT 
TGTCATACTTTAACTTTTGTCCCTTTCCCAGGTGGCCCATGCTTCTAGGCCAGCCTCCCA 
ATTCCTAGCTTGCTTCATGCTCCTGGTCTTTGCCTACTCAAAACAATCTGTCCTCCCCAC 
CCTGTCATCTCTGTTCAGAGTAAATACCTCCCTCCCTCTGCTTTTGCCCTAAGTTATTTT 
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TTCMCTCACCTMCATCTMCTTGGCTTTGCAGCTCTGTTTCTCCAAAACCCTCTCTGC 

CTGTTTCCTCCCAAGCTATCTATGACCAGTGCCAGAAACCAGAAAATCTTCTTGGAAGTT 

GCACTCTTTGCTTCTCTTCCCTCTCCTTTATCCTTCCACTTGGGAATCTCTCTCCTTCTA 

CCTGTCTCTATCTGCTGCAATCCTGTTCATCTTTTCCAGGTACCTnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnGCTTTTGTTCCCTTTAANTGAGGGGTTMTTGGCGCCCTTG 

. GCGTAAAT 

Sequence 2062 

CGAGGTACAGCACTGGGCTTACAGCGGCGCTCAACAGA! I I I I I I CTTTTTAAGAAAATG 
AATTCACTGAGGTTTAGAGATGTTAAMTACTTGCCCAAGTTCCTGCTTGCCCAGATGTG 
CCTTTGGGAATGCATC7TCTTTCTCAAGCCTCCTTACAGGGGTTGTAGTGAAGATTGGAA 

NGGGATAANGCAGGTG 
Sequence 2063 

GAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGAGATGCTGCCA 

CCTAGGTTACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACA 

TCCTGCAGGAAAATGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCNAAACAT 

CAAGAGAAGATGAGAGGAGTCTATATGTCAANAATACACATTTCCCACCTTGCCCAACAG 

TNGAAAMCATMGAAGAGAAAMCATTAAAAAATGACMGGAAGTTAATGGAAGTCAGC 

AATGTGATGGTGTTTGGA 
Sequence 2064 

CGAATTGGAGCTCCACCCGCGGTGGCGGCCGAGGTACCTCTCCTGGGAAGCTCTTCTTCA 

MCATTGGTGACTCAGGGATTTGGGCTGCTTTCATCTTATAGCTCTGTCATCTCTATCAT 

GGAGCTTTCAAGGCCTTGTTTGCGCCAATATCCTCTCCTGGATGCTTGGGAAACACATCG 

ATGCATCCCGTCAGTCAGCAGCTGTTTACGGAACACCAGCTGCAGGCCAGGCACCGTGCT 

GCAAGGCGTAGATATGTTGCTGAACACGACTCCCTTCCAGATGTTCAGGATTTAACAGGC 

AAGTAAACAGAGGGTCCAGAGAAGGCACTCAGCACTGCCTCATGACATCAGGGAGGATTG 

CTCAAAGGAAGTGATGCCTAAACCCAGCTGGATCGNAGTGTAATGGATAGAGGGGTAAAC 

AAGAGCTGATTTCCTAGCGGTCCTGAGTCTCGACAGGTTGCCACCGTCAGGAGTATTGAT 

CTTTTTCTCCCCGC 

Sequence 2065 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I rTTTTTTTTTTTTTTTTTTI I NAGGNNATAT 

GTATACATAGTAAGTGTATATATTTTTGGGGAGAAGATCATGCATTTCCTAGGACAGGAC 

TMTACATTTAGCTGTGGCTATTATCAAAGAGCACCAAGACCTTAGGTGGGGCAACCAGA 

CTGGCTATGGGGCTTGATGTAAGCCAGACCCATGACCCATAAACTATTTCAGGTAGGAAG 

TAAGCAATACTATAACTTGGACTCGAGGGGGAGCAAAAGGGCACCCCTCAATGGTGATGG 

CTGCCCACTTGGTCTCAACACTGTAAAACTTGGGACTTGCTGCCAAGCTCCACCAGGCAG 

GGGGCAGAGGACACTGAGTCTCCTCCAACTGAGTCAGGAGTGGCTCAGGAGCTGACACTG 

GCCTAAGCTTGCAGCAAAGGGCCCCTACGTACCTCGGC 

Sequence 2066 

AGGTACGCGGGGGGGACAGGCCATCTCGCTATAGGMAGGAAAGTGGAACAGCATTCATC 

CTCAACATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTGT 

AGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCACAAGTGAAGCAAAGA 

TACAGGAGATAGGGAAGGAAAGCTGGAAGCAGAGGTCACTGGAGGGAGAGGGAGATGGAC 

ACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCTCTTCAATTGTGGGACCC 

CTCAAAATGTGTACCTGCCCG 
Sequence 2067 

GTACCCGGGACGCTGGGAGGAGATGCTACCACCTAGGTTACTTGTAGGACCCTATACGGC 

AACCTNCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGTGAAGTNGAAGAGA 

CAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTC 
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AGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAANAAGAGAAAAACATTAA 
AAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTC 
ATANGGTAATTAATGC 
Sequence 2068 

AGGTACTGACTAGACAATTGAGTCTCAAAGACGATAATTTATCCAACATAATTGTGTGAA 

TMTGACAAAAAAACACCCAGTTTTGGGAAAATTACAAATGCAATTTCTAGCAGACACTG 

CTTGGTTATCTAGTGTTTTGAMTAAACMCTCCAAMCTTCGTGGCTTGAAATAAGCAT 

TATTTATCTTATTACAGTTGTTTGGGTTGATAATTTAGACAGAGCTTAGCTGGGATGGCT 

TGTCTTTGCTCCATGTTATGCTGACTTGACTCACTCATCTAGTTGCAGTTAACTGGCAGG 

GCAGCTGGAGCCTGGCTGGTCCCAGATGGCCACACACTTTGTGGCTGGTAGTTGCCTGGC 

ACTCATAAGGTGCCTTCATTTTCCTCCACCTAGCCTATCCAGCAGACTATCTTGGGCATC 

CTGGGCTTCAAGAAGAAAACCGCGATGTACCTGCCCG 

Sequence 2069 

AGGTACGCGGGGGTTTCCTGCGTTTGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTG 

ACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTNCTT 

TGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGTCAGAGATGGGAAGAAACAAGAA 

CTTTGACATGCTTGGTGTTCTTGCCCAAGCTTTGMGAAGTTTACAAAGTCTATATGTCA 

GAATACACATTTCCCACCTTGCCCMCANTTNAAAMCATMGAAGAGAAAAACATTAAA 

AAATGACANGGAAGTTAATGGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTN 

ACAAGGTAATTAATGCCCTTGT 

Sequence 2070 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGTGACTAAATTTGTAGGAATGATATT 

TTCATGGGATTACTCAATCTCACCCACCATTAGTTGCGGGTGACAAGAAAGCTAAGTTGG 

CAGATGTTTGTGCTAGAAGCTGTGGGTTTACGTCTCCTTTGTACATGTGTTCCAGACATA 

CCAGTGGCTTGGTATTTAAACATCATGCTCAGGTGTGCAGGGTAG 1 ! I I I G AGTTATAAT 

AGGTATGCAGGCGCTGTGGGATTACTTGGTTGTTTATGTAAAAATTATTTTGCACTCACT 

TCTGAMTGAGTGTTAGTAGAATCATCTTTAGAGGAGGTTCCAAGGCATTGAACTGAGAT 

ACCTGCACTGTTTGCTGTAAATTTMGC7TA 

GCCAAGGANAGAGTGGGCATGAATGATTTC 

Sequence 2071 

CCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTTAAAGTTTGAGTCTCACTTTCCTCT 

TATATATAAGGCTTCCTGAAAGCATTAACAGTGATGTATATGAACTGGCCAATACAAAGT 

MGACCTCAATAAATGGTGGGCACCTTCTTTTATATGCCAGCTTGTATTCTTATTGGACA 

AGGTATGCAACCTCTTTGGGCCCCAATTTCTTTCTGAAAGAATCAATTCTAGTGCTCAAA 

GATTCAGAACTTAGTGCTTACACTTTGGGAAGGAGGACTGATGTAAAGMTCAGGATTTT 

CTGGACAAGTTGATGAATCTCCTCCCTCTCTTTGGTGTCCCTGAATGAGTGATAGATGGA 

TAAAAGTCCAGTGTCCACATATGCACAGATGCTAATAAAATGAGGTGGAATGATGGTAAG 

GCCTTAAAGTAGAAAGTATGAAACCTCCACTCTTGTCTTGGCTTTCTCTCTTTCCCTTGC 

TTACTCACTCTATCTCCTTCTCTCTCTNCCTTTCTNCAAATGTCAAGATGTAATCTCTTG 

CCTTAAAATTTC N N NAAATCATCAACACCCTTGTN AAACACCTN CC AACCTTAGTTN 

Sequence 2072 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACTCAATCTTACTCCCTTGTAGAACA 

GGCGATATCTTCACCATGCGCACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCC 

ACGAAGCCAGAAAACTAATAAATGAACAGATGGCTCCACATAAGGTAGAGCGTGTCTGGT 

GCCCTGCCGTTTGGAAATGACCCAAAGGCAGGAGTGGAAGATGAAGTTTCAGCAGGAGAG 

ACTGAGGATCTCAGAGGATTTGCTCTGGGATGCAGAGACGCTTGCCTATTTGCTAAGTGT 

CTATGCCATATGATTCTGCTGTTTGCGTTTGATTTAGAATGCTGAGCTGACTCAAAGTCA 

AACTGTAAGTACCT 

Sequence 2073 
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CCGGGCAGGTCAGCAGCTTTGCATGTTGACAAATCCGCTCTCTGCTGCAGATGCCTAGGG 
GAAGTTGCAGACTTAAATTTTCTTTTGTAAAATC 

CACTGGTTACTCCAGAAAAAAAAAAAGAAAAAGMGGAAGGAAGAAAAGAAAAAGAAGGA 
AGGAAGAAAAGAAAAAGAAGGMGGAAGAAAAAACAGMGGAAGCNNAAAAAAAAAAA 
AAAAAAAAAAGGTTCCT 
Sequence 2074 

CGAGGTACTTCCGTGAGATATAAGGCTGATGATAAGGATTGGGGTAAAGAGTTGATTCGG 

TCACACTTGACTGCCTTGAAATCTGTCTTGGCATCGGGTCCTCTTCCTCCATCGATAGGT 

GGTCAGTCTCAGAGGGTTGGTCATCTCCCTCTTGCTTCAGAGACACAGCCTGGAATCCTT 

CAAMCTCCCAATTGTTGACAT7TCTTCAGCCTGACTAGGCTGGTCCTCAAGGCTCTTCT 

TTCTCTTTCTGCACAAAGGGGGCTAGACCACAGATAGTGGCCAGGAGAGG I GTGACTGGC 

TCAGATAAAGCCAGGCAAGACAGTGCCCTGTGCTGTGACCCCACTCAGTCTGAATCTCAC 

TTCACCAAGTAGGCTGAGAGTTACACTGCCACCCTCAAATAGTCAAAAAAAAAAGTACCT 

GCCCGGGCGGCCGCTCTAGAACTA 

Sequence 2075 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGG 

GGATTATAATGTATGTTGGMTGTCCTCCTTTGAGTAMTTTTACTTMGGGTCTTTGGG 

CTGCATAAATTTGTATGTGCATATCCCTATCMCATTTGGGMGTTCTCAGCCAGTATTT 

CTTTAAGTAAGATGTTTATCTTTTCTCTCTCTCTTCTCCTACTGGAMTTCCATAAATTA 

TATGTTCATATGCTTGATAGTGTGCCATAGGTCCCTCATACTTTGTTCACTTATTTCATT 

C 1U 1 M 1 CTTTTGTTTCCCCMGTGGCTMTTTTAAATAACATGTCTTAAAGTTGACTA 

ATTCCTTCTTCTGCTGAATGMCTTTAAAAGATTTTTAATAGAAG ! I I I 1 I GGTAATCAA 

AAAGAAAAACATATAGTAGAMTGCTTTTTAAAGTTCAGGCATGGTGGCTCATGCCTGTA 

ATCCCAGCACTTTGGCCAGCCTAAAGTAGGMGATCATTTGAGGCCAGGAGTTTGATCCC 

AGTCTGAGGAACATGGCAAAACCCCATCTCTACTAAAAAAACGAAA 

Sequence 2076 

GCCCNGCAGGTNCCCTNGGGACACCCTTTTTTACCATGATCCATTTTTTGAATA CAAAGA 
CACAGTACCCCGCTTTTGGGAAACAACTGAAAAATAGGGGAATAGATAAGAAAGl Mill 
TGCTTACTTGTTTTTGTGn Mill t ATATTAAACTAAAATTTCATTC ACTAN T AAAG TT 
TTTTTTAMGMTAAATTGCTTGCTGGATGTATTGAGGTAATAAMGTTTTCCT^ 
TTATATGAAGCAATATAGGATATTAGCTTTATGAGGCACTTGCCAGTAAA 
Sequence 2077 

CCGCGGTGGCGGCCGAGGTACAAACCAGGGCTGTTACTCCTCAACAAGGAGGAATAACGC 

CTCTTAGAAGCATGGCAAGCAGGTGCCATAAAACATGTCTACTACAAT TTCTC ACTGCCT 

GGMTAATTGGAAAAGTTTCATAAATGCTCTGGCTTTCAGGTTCTTCACTTTTGCAATAA 

AGCTGCTAAGTCTGGTTGCAMGCTTTTATGATGATTAGAAATAATATATGTAAAATGCC 

TGATGCACAAGAGTCACTCAATAAGTGACAGCAGTGTGATTAATCTATGACATANTAA/\A 

AGTTCTAACTTGAAACACGAGAGAAAAAAATAACTTCCAGTGCGNAAAGAGATAATTTCA 

TGATCTCAGGGACACTAA 

Sequence 2078 

CGAGGTACTCAAATCTCTGCCTCAGGAAAACATCCMCTTTCAAAGTTTCTGGTAAGGGT 

TGTAAAGGAAGCAATTAGTGTCATTAGAAAGAACTATCGATGTCACTATGCCAAGGGAGA 

AAAGGGGTAGCAAAAATGAGAAAGAAGGCATGAGGCACAGACATT GAAA ACTAACCTGTA 

TTTAAGCTTAGTGTATTTTCGCTTTGCCTCTACTCCMGTGGGGAGTTTTCATTGCAGGC 

ACTACAAACCCCAGGTATGACAATGACATAAGAATCAGGGAAGTGGCTGGAAAGGGAAAG 

CTATATAGTTAGACTGGCCTGCCTTTGAGTCCTGCCCTGCCATTTGTACCTGCCCG 

Sequence 2079 

CCGCGGTGGCGGCCCGAGGTACTGCAGGAAACCATGTTGCCAGCACTCTCCTAGCCCAAA 
AAAATTACATCTAGGAAGCAGGGGCCATTAACCTACTGGAATCATTGACTATGTCATAGA 
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AGTCCAGTCACCCCGCAGGAACCAGAATCTTCCCATGGATATTTAAACCCAATTAGAAAT 
GGCCACCAGGTGGTGCCAAACTGCAGCTAAGGTCCAGGTGAATTTGAACCATGCAATCTT 
GGATCTTTTGGCTCTCCTGGTGA I I I I IATAGAG1 Mill I CATCGTTAATCCATTTATG 
AATAAATGCTGTCAGTTTTGATAACTCTTTCTTATTGTCTATAGGTAAAACAGAAAACCT 
TTAGCATTGG ATAAAAGTCTTCCTTA I I I I I I ACTGNGACCTCTTTCTTTCTTTTGATCT 
CTMTATTTAGTCGCATGGGMCACCMCTGTTAACTGMTTTCCATTTTTCCAT 
Sequence 2080 

GGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGA 

AGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAG 

GACCCTATACGGCNNCCTCCTTTGCCAGGMCTTTTATAAACATCCTGCAGGAAAATGCA 

GTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAG 

AGGTCAGAGATGGGAAGAAACMGAACTTTGACATGCTTGGTGTTCTTGCCCAAGCTTTG 

AAGMGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAA 

AAACATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGT 

GATGGTGTTTGGAGGTGGAGCCTTCAG-AAGGTAATTNATGCCCTTGTAAGAAGAGGCCAN 

AGAGCTTGCGCACC I I I I ICCTTGCCATGTGANGAGCCANGAAGCCGGCTGGCTGCAACC 

TGCA 

Sequence 2081 

GGAGCTCCCCCGCGGTGGCGGCCGAGGTACACTTGCCAATTGAGATAATATGAAATTGTA 
TTCCATTTTAGTTTTMTTACATTTCCATGATTACTGC^ 

TATTACGGAATTTAGTTTACTCCTACGTGCATTACATTTTCATAATTTTGCCCAG I I Ml 

CCTTGGGTTG I I I I I ATTTTCTTACTGATTCCAAAGGTATCTGTAAATTATTGAAATGAA 

TTATTTTAGTATTGTGCTTTTCAGCTTTTCTmCACTT^ 

AAMGTTTTCATTTTAATGTAATCTTAG7TGNAMTATTTTTAMTTTGG 

CTGNTAI ITTTTA 

Sequence 2082 

CCGGGCAGGTACAGGAGAGTTCCCATAGACTNCTGCCCCCTNCCACAGACAGACTACCCT 

AGAATATTGTAAAAGGATCTATCACTCATTTTCCTCACCTCTCTTATAGATGTCTCTCTT 

CAACAGTGAGATTNTTCCGTAACTCTCTGTNNCAGGACTGGGCTTGTTTGCACTCAAATC 

CATGACTCATTGTTTCTCTGCCTTTCCGTGTGTTACAGGTGGGCTGATCCCCCTGCAGCC 

AGTTTCCCATAAGCAACTGACTTCCAACTGGGAATGTCTCGGGGGATAATGGGGGTGGG 

Sequence 2083 

AGGTACTTGCAAACCAAATTGGCAGCATATTAAAAGAATTACACACCAGGGAAAATAACC 

CTAGACCTTGCATTCTAGTGCTCTGCTTATCTGTGACTTCAGACAAAGATGGAGAGAGCT 

GACCATCAGTATMTTTAGCTGAATGAAATMTGAGATATTTTAGGAAGAAGAACATTGA 

ATGTCAGAAGTTAAATGTGAGATCCAAGAAGTAACAGTGAACAGACAAGTTTGTGGGTAT 

GTGGATATGTCTTAAATAAGTCAGGTTGTGTCAAATTATGATGGATGATGATGACTGCGA 

GGCAGGAAGGAAGATGAAAAGGAGGAGGCATTGAACACTAATGGTCACAGGGAGCACCCG 

GGAGCGTGGTGACCCCAGGGTGAGCTGCTTGTTCTGTGAGGGTGGAGGGAA 

Sequence 2084 

CGAGGTACTCTGACATCATCTGACATGGCAGCTCCCCAGCCATGGTGGTGGTGATAGAAA 

TATAAATGGTGACAGCAGCAGCAGTGTGATAGTCACGGTGGAGGGAGATTGCTTTTCTGC 

AGCCAGGCTTCTGGATCTCCCAACCTGGGTGGGACCCAGCTGAAGATCTGCAGAGAAACC 

AGCCATTTCTGAGAGCAAGGCAATCAGTGGGCATTTACTGGAGGGTGTCTAGGTTCTTAA 

CATCTCACTGTTATTTGGGCTCATGGGATCATAAGAATGACAAAATAAAGAGAAGTCCTN 

ATTTTNTTACCACAANAAGCTTGACACAATTTCCAGTAAANTCTTGGATTGAAAGGAGAG 

ACATGTTTTCATGTGTTTTATATTCCCTTAACAACCCAATA 

Sequence 2085 

CCGCGGTGGCGGCCCGAGGTACTCCACTTGTAACTGCTTCTCCCCTGATGCCACCACTGA 
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CCTCTGTAGCCATGGTTCTGCMGTTCTCTAATTTTCTGAGCCTCTGAGTCACTGACGAA 

GTCATGGTAGAGAGCAATGTAGGGCTCCAGGTGGATGACCTCCTTCCGGATGGGCTGGAG 

CAGCAGGTAGGCGTTGGAATTGGTCTCATAGGAACAGTAGAGGCTAGGGATCTGGTAGAG 

AGTGGGCTGGGAACCCAGGGTCTGACATAGCCCCTCGTAGGTGTCTCTGGTCTGCAGGTG 

GGGTATATTGGGCCTCTGGATGACGGCCTCAGCTACCACGTGGTTGGGGCTCTCTGCCAA 

GAGCCTTTCATATTTCAAGACATTCCTGGCCATCCTCTTATTATCTGGGCTGTAGAGAAG 

AAACTCCCGCCGCGTACCTGCCCG 

Sequence 2086 ^ 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTTACAGTTTGACTTTGAGTCAGC 

TCAGCATTCTAAATCAAACGCAAACAGCAGAATCATATGGCATAGACACTTAGCAAATCC 

AATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATGGAGAGCAAACTGGT 

TCATATCTTCAGATAGTGTCATTCAACCCCTTGGCAGCTTTC TGGG CTACTAAATATCAC 

ACCGTCTGGTGGGACATTTCACCCAAGTTGTTCATTTATTAGTTTTCTGGCTTCGTGGAG 

MTTATCTGCCTTCTCTGAGTATTGATTTCATTGTGCGCATGGTGAAGATATCGCCTGTT 

CTACCCCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTA 

Sequence 2087 

GCTTTTAGAGGGATGGGGCACGGCATCTCAATACCATTTCCTATTTTTGAGATGGTTGCT 

TCATTCACAMGMGCTACATTCCACAAGCAGGTGTCTCTTCCTACTTTATCAACTCATC 

CATTCTTTTCCCACAAATACTTACTGAACACTCTCTATGCCTGTCGTATTGCAGACATGC 

GGAATATATCGACAAAACTGACTGMCTTNCTGCCCTTGGGGTGCTTCCTTCTTTATCTG 

GCTCAAMCTCTGTTCGCTGCAGGTGCTAGCTGCAGGTTGTTGCTCATTAAAAAGGAAAA 

TCTATTCTACMTTAGAAAAATACCTTGGGAGAACATTGCAAAACAGTATTTGAATNAGT 

NAAAGCTGGTGTCACCCCATAGGAGAAGNTTGTACCTGCCCGGGCGGACGCNCTAGACTA 

NTGGANCCCCCGGGCTNNAGNGAATTCCATATCAAANTAANNGAACNCGTCTACCTANGN 

GGGGGGGCCC 
Sequence 2088 

CCGGGCAGGTACGCGGGGTCTTACCTCAGAGCTGAGCTGGGCATGAGTAGATGCTCAGTA 

AGTGGTGCACAGGGTTGGTCCCTATGGTGGAGGCCCCCTAACACCGCCCAACCCCCCTCC 

ATGTTCTCACAGCTCCACGCACTGAGCACGGGCATGAAGGCCATGATGTCAGAATTCTGC 

ACCCAGGGAGCTGAGATGTGCCGCAGGGCCTGTGGCGGACATGGCTACTCAAAGCTGAGT 

GGCCTGCCATCACTGGTCACCAAATTGTCGGCCTTCTGTACCT 

Sequence 2089 

GATCAGCTTGGATATACACTTAGGTCATAATACTAAATATGAATATTCGl nil iNCCAT 

AACCATAGGGTGTAATCATATGCCTATGANTCTGATCCTCTACTTTATCCATCTGAATGC 

TTCTCGATCCACTCATGCAAGATGTGTAAAGCTCATCATGGAAATAATCTATACTAAGCT 

TTCAATATTGACATATTTATATAAAAATTTATNGTAATANGATTGAATAAACANTAAGTA 

ACTTTTTTTATATTGATMGNCACCACAATNT 

AGTATAAATTCAGTATTTTTTCAGATCCTACAATTCGCCTGGTTNGGTGTATCATTAGNC 
TATAATAAATNNNNGTGGAATGGCAATTTTACCAAAACAACACACTTGCTTAAACATAAA 
GCCAATGTTCTCAGTTTCCMTATATCTCATTTCMTAGCTAACATCAACAAAATATATA 

TAT 

Sequence 2090 

AGGTACAATATAGGCAGACAGTTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAG 

GTCAACATCATTGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGC 

AAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCG 

TCAGGTCTAAGCACACAAGTTCTTCCTTCCATCTACCCCCGCGTACCTGCCCG 

Sequence 2091 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCGGGAGGAAGGATTTAGGACA 
GCATCCTAATCAAAAGGGCCTGGAAGCACTTTATAAAAGAGAGAGACAAGATCGCATGTC 
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AAACTAGAAGGAAGGAGGTGAGAGGAGATAGGGCTCCAGAGTGGAGCAAGCCCCTTCTGT 

CCCCGTGAACTTCCTGCCGGTGCATGGGTTACCTCTCATTAAATTTAATAGTACCTGCCC 

G 

Sequence 2092 

AGGTACTGTATATAAATACTGAAACAAAACAATGGAAAAAGCAAATAGGAAATAGAGTGA 

GMTCAGGGGAACTGAGGGGAAGCACCMGTTTATCTTCTTTAGATAGTTCTTATTTTGG 

AGAC AGGATATTTATTTCTTGCGAGCTCTAAGGACTGTGCATTTAGCAAGGACCGTTCAT 

TTTTCATCTTCCTCACTAATGTCTTAAGGGAG GG AAAGATAAATCTGAGTGTCATCAGGA 

TACATATGGAGCTGAGGGCCACAGTTGCTMTTAAGTCCTCTTCAAAGACCCATTTGGAT 

GCCTCAGCATACGTGATAAAAAAGATTACCCCCAATGATGTCATG I ) 1 I 1 CCAGGCATCA 

AMGMCCCCAMTTACCTTGTGTTAACAAGAGTGGrTGGAGAAAGCTAAGATACCACTA 

CMTCTCCCCTTACCCTCTGCATAGTGCTGGCTAAGTGGTATGGTCCTGATTTTCTGATA 

ATTACTACTTTTMGGAGATGGTATTTTGTTGCAGTGTCACTGAGGAAACGGGATTCCAA 

TCATAACAGCCCCGAACAATTTTAAGTGCTGCAAACCCGACCCAAATANAATCACTTGAA 

CCTTG N C N G AAATTGGAAT NTACTTTCTTG AACCTC AAGGGG AG AG AAAAAGTAATTTGG 

GAACAAACAGTTCCCTTGCCCGGC 

Sequence 2093 

TTTCAAAACCCGCTTATTTCACGCGAGGACTCCTCTGCGAAGACAGGAAGGATGGGGTCC 

CACACCTGAGCCAGGCGCGGTCCCCACTGGGTGCTCCGCAGCCCGGAGTGGCTCCGGACG 

ACAGCCCGCTCAGCGGACCTTTCGCCAGGCTCGGTGGACCTTCTCCGCTGTTCTCCTCAG 

GGCGCAGTCAGGTTTCTTCGACACACACCAACCGAAGTTGAAGCGCGTGTCCCGCGTACC 

TCGGCCGNTCTAGAACTAN 

Sequence 2094 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACATGAACTAATAATGCATTTCTAAAGGTGAAA 
MNGAATAGGTATTTTTCTGTTTAATATTCMTT7TATAGAGAGTAGTACGTT Mill 
TTAAACCCCAGAAGCTCAGGATCTTATCATTTTAAMGAAATTATCACCAGTTCTGTGTG 
AGTAAATAAAGTATTATAACACTTTGTG I I I I I CATCCATGATACCTTGTATTTACTTAC 
CTGAGCTTTTTTTCTAGGGAAAGAAAAATGCTCAGGTAATAACAGAGCCTTGAAAAAT^ 
GGATTTTCAAAACTACCTATTTATGTATAGGCCTTTAGATCATCTGATGTTGAATACTCT 
TTAAGTGATCTAAAGGCCTACATATAAAAAGGTA I I I I I ATTAAATTCTGGAATTAAACA 
TT 

Sequence 2095 

AGGAGCTCCACCGCGGNGGCGGCCCGAGGTACTTN N 1 1 I 1 1 1 1! M I 1 1 1 1 1 1 1 1 IGTG 

CMTACCAGCTTTCACCCTGTCCTCTGCACTGCTTAGCCTTGGCTCTGCTGAGTTGTGCA 

ACACATTCTGAGAGATGCCATACAATGCTCCAGGCAAACTTATAGAAAAACAGGAATGAG 

TGATTTTATACGGGATGTGTTTCAGCTGTCCAATTCAAAATAAATCACGTCCAGGTGCAT 

TCTTTCCTAATTTGTG^XXCAAACTGCTTGCACTTTATACATTAATGAATTAI 1 1 I I IAA 

AAAGAAATTAGCTCTATCATGAAGACAGTGCTATCTAACCCTTTGTGGTCTTTCTCCCTT 

AAAAA 

Sequence 2096 

CGACTACTATAGGGGCGAAATTGGAGCTCACCGCGGTGGCGGCCCGTCACGGTGATGTTG 

TTGTCGCGCCGGGGCACGCTCAGCGAGCGCACGCCGGCGTAGCCGGCATTGACATTCACG 

CTGGTGCGCGCACrGGCTTGCCAGTTCCAGCCCACGTTGGCGCCGATCTGGCGCGTGCGG 

TCTTCGCTGCCGAACTCCATGCCGGGCAGCAGGGCGCTGTCGATGCCGCTGGCCGTTTGC 

GCCGTACGGTCGA 

Sequence 2097 

CCGCGGTGGCGGCCCGAGGTACGCGGGTCCCTGAGTTCAGAACATAGGAATTAGATTGAT 

AGACATCAACATACCCGCTTTATTGCTGACTCATGACAACTAATGGGAAGACATGGCTCA 

GATGTGCAGCCACAGTGAGCTTCTGAACATTTCTTCTCAGACTAAGCTCTTACACACAGT 
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TGCAGTTGAAAGAAAGAATTGCTTGACATGGCCACAGGAGCAGGCAGCTTCCTGCAGACA 

TG ACAGTC AACGCAAACTC ATGTC ACTGTGGG C AG ACACATGTTTG C AAAG AG ACTC AG A 

GCCAAACMGCACACTCMTGTGCTTTGCCCAAATTTACCCATTAGGTAAATCTTCCTCC 

TCCAAGAAGAAAGTGGAGAGAGCATGAGTCCTCACATGGAAACTTGAAGTCAGGGAAATG 

AAGGCT 
Sequence 2098 

CCGCGGTGGCGGCCGAGGTACATCTCTAGCTGATGATTCAAAAAAGAAACCTTTTAATCT 
CACTCCACTGATCAGCTATGATACTTAAATGTTTTAGCTGTGAGCAAAATAATATGCATT 
CTCAAAGAGAGTATCTTCAGACTCCAGTGGCCGAGAATCTAGAGTTAGCAATGGAAAAAT 
TAGTCTCGGGCTTCTGTTTCTGCCCACANTTTTCAMTTMGMCMTGTGTTTGCACTT 

AATGAAACAA 
Sequence 2099 

CCGGGCAGGTACATGTCAAAGGAAAAACACGTGAAAGATGAATTCAGCCAAACCCACCAG 

TGTTCAACCTCAGTCTAATCAATCTCATACTCCTAGAGGCTTAAGTATCAGCAGGTAAGA 

TCGTGATGACCTGTCTCTGAGGCTCCAGACAATAATTTCTAACTGCCAACTGGAAATCCT 

TATATGGTTAGGCTGCCAACATCCCAGGGAACAGGACCAAAATAAAAAGCATCACTCATT 

ATCCTACTGCMTTTTCCTCTTCCCTTTGTCAMTGGGAATGATCTTTACGATCATGATC 

CTTTATTGCAACCAGGACAGAAATCATGAAGTCATCTATGGCCCCTTCCTCTCACTCCGC 

CTCCAATTAGTTGTCCTATCTGCCCCTTCCGTTCTCCTTCCATCTCCATAATGCCCAAGC 

TAGTCTATCACCTCTCCATTCATCTTTCACTCAACCTCCAGAATTATCTTCCTAAAACCC 

AGACTGATTCTGATTTTTCCTCCTCMGAAACGCTCTCTMCACTAAAGCACTTTGCTTC 

TACCTCTATCGAGCCCAANGGTATTCATCCTTGNAATCTCTCTAAATGAGTTCCCAGCTG 

NCCACCCCACGAGACTAGGAGTTTGGTAAGGACAGAATCACTCATTTCATCCTTGNACCT 

0 

Sequence 2100 

CACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGGCAGGTACTGTGTGGTTTAAAGA 

TAAGCAGCCTGTGCTGTCTTTTATTTTCAGAAATAACATAAATAAACAACAATTACTGAA 

CGCAATGTGCAATCTCTATCTTCTGTGTTTCGGGGTCAATTGTCTAACAGGAGCCACAGG 

AAGAGCTTCAGAGGCACCCAGATGCCAAGCCTCAATTCCCACTGTGACTCTACTCTGGCA 

GAGGTGAGACTTCACCTTCCAGAAACTATGCTGGGGGCCTCTGTGTCTGTGGCATATTGT 

TTATATTGTGTGTTTATAGGCAATTTTAACAATCACTGTTTCTAGATTGCAACCCTCCTG 

AGGCACTGCGCTGACACAGCAAAGTTATTCCCCTTACTGAAGAGAAAAATTAGAGGCCTA 

AACACCACAGAAGCCCTGATAATATGGAATATTAGCATCAATGTCAGAGCACAGTATTTA 

GGACAGATACTGATAAAGACAGCATAGATGACCACCTCACCTTGTCACCACCTGAGCGAG 

GAGCCCTTAAAAGATGGGTTTCC 

Sequence 2101 a ^. a 
CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGTTCAAATAGT 

CAGCAGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATGGT 

GAGCAAGGAMGCMGTTATTGGAGGCACATGTTAACACATAAAATATAAAATTATATGA 

TCACACTGGAAAGGCTTGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGATGCA 

CAACAAAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATCAGTG 

GGGACCCAAATGCTTATGTGGACTCACCAGGTATCGAATTAGACATGAATAGGAGTGTTT 

GTTGTGATCCAAGAAACTATATAATCAAATGAATACAATGAAACTTTAAAA 

Sequence 2102 ^^ A ^ 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTACTTACTATTTTATGAGMTTGATTTTTAC 

TAAGCCTGATTTGAATGGGTTGCATTTACACCMCTGCTATTTATTAATGAAAGTGGTAG 

CCTGCTTTTTMGTAATATGTGTTTTGCMCAGGAGAGGGTTT TCTGA GCTTTTAA^ 

AGTAACCCTGAACTTCTTCCAAAATCCACACCTTTTATTCCCCCTTTTCCTTGTCTCCTT 

CTAGGTGCCTCAGTCCTGGCATCTCGGGAGGACTCTGCCATTACTCATCTGTTACTGATT 
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CTGAATCTTGTGCTCTAGCTCTACAGCCCAGTGAAGAAGCAATATATCTTTAATTCAGGG 

CTTGCCTCTTGCTTTCCAACTCTAACCTGAGAGCAGTGCTACTTTTGTAATTGCAGGTCT 

AAATAACTTAAGCGTATATACAGAAGCCAATCACTGATCACTTCCATTGCAATGAGGCAT 

CTGTTGTCAAAGAAGGAAACCATTTGTCMGTCCCTTATAGATATTGAATCTTCAAACCC 

AAATCTGGGGATGTTTACATTTCT 

Sequence 2103 

GATGGCGGCAAGCGATGCGCCCTCCTCGCACACTGNNCCGGCCAGCATGNNCGTGCGCAG 

TTCGGCCAGCACGGCGCGCGCGCAGGCCAGGGTTTGCACGCGGCCGTANTGTGCAATCAG 

CTCAAGGCAAGCCGCGTCGCCCAGGATGCGGTCGACCGACGGCAGGCGCACGGTTTGCCC 

CGTCGTCATGCTGGCTGGCCCTCGCCGCTGTCGTTCTTCCCCAGCCACAGCAGCGGATTG 

CCGCTGGCGCGCTGGTAGCCCGCTTCGCCACCAACAGGTCGAGCATAAGCTGGCCAGATC 

GTCGGCAGCGGTTCGACGTTGTAATCGNGTTCCTGGTTGAAGACTTTGCGATAGGTATGG 

CAATCGTCGCANGTTTCCGCGCGCGCCACCTTCTTCGGATCATTCGCCTTGTTGGGCAAT 

TTGTCGTCCTTCGCCGCACGGGGTCGAACGGGTTTGCGTNGGCGCTCTAAAACTAGTGGA 

TCCCCGG 

Sequence 2104 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAAG 

GAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GGTAAGCCCTTGGTTAAATTTATTGACTTCATTTATGTATCTCACMCACTTCTTTTCTG 

TTTTAGTTCTAGG AATAG AAACTTACTTTTGGAAATAC AGTACCTG CCC G 

Sequence 2105 

CACCGCGGTGGCGGCCGAGGTACAGGGCTCTTGATATATCCTGGCCTTAAGTCTCAGGTT 

CTCCAACAAAGCTGGCAGCCTGTGGTATGCCTTCTGGGGCCAGAGAGTGTGGCATGATCC 

GAGGTCCTGAGTCTCTCTCTCAATGAGCCCATCATATTGGCTTAACCAAAGCCAGTGGTC 

TTCAGGACACGGAAAAGGGAGTCCCTTAGGCTCCAGGTTTTTTTCCCTTCAGTTCACACT 

GGGAGAAGGGTCACAGAGCTCCAGTGAAATCTGGTCCTGTTAGATCATTTGTGGTTGGCT 

GCAGATGCACATGTGGGCATGTGATAGTTTTGATTTAGATAAAAAGCCCACAACTGAGTG 

GAGTGTTGAAACCTCAGGGTTATAGACTGGTGTATCTGTGAGAAAACCACAACAAGTATG 

GTATTTGCCCGCGTACCTGCCCG 

Sequence 2106 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATGCCAGGCACAAACTGGAGTCACAGATGCC 

ACACTGACTATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTT 

TCTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCAG 

TTGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACAC 

GGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCP/vACAAGCCCAGTCCTGCTACA 

GAGAGTTACGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAA 

TGACTGATAGATCCTTTTACAATATTCTAGGGTAGTCTGTCTGTGGGAGGGGGCAAGGAG 

TCTATGGGAACTCTCTGTACCT 

Sequence 2107 

CCGCGGTGGCGGCCGCCGGGCAGGTACCATGGCCA/vAGCTGAGGCAGCCAGGGTGGGACA 

GGGGCCTCCAAGCAACACAGCAGTGTTAGCAGTTCAOCTGGACCTGTTGATGAGCTGTCT 

TAGATGCACGAAGGCAAGAGATGTCTCATCTCTCTC'AAGATCTTAGATCTTAAGTGCAG 

AGGGAAGGCAGCTGGGAGTGTGGACGGCAGCCACAAGGGAGAAAAGTGCAAGAGCATTTA 

CAACCAGAAGAGAAGCCGAGGGGACATGATGGCCCATGGACAATGTAGCCATGGACAGTG 

CCCTGGGAGGCTGGTAAGGGGGGTGCTGGGACTGGTGGTGAAGAGATCATTTTTAAGAAA 

AAGAATCCCGCGTACCT 

Sequence 2108 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATGCCAGGCACAAACTGGAGTCACAGATGCC 
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ACACTGACTATCACATTGTCATCATACTCAGAGAGAGAAAGGTGTATACACCCAGTATTT 

TCTTATGTTTTCTCTATACTTCCATATCCCCACCCCCATTATCCCCCGAGACATTCCCAG 

TTGGAAGTCAGTTGCTTATGGGAAACTGGCTGCAGGGGGATCAGCCCACCTGTAACACAC 

GGAAAGGCAGAGAAACAATGAGTCATGGATTTGAGTGCAAACAAGCCCAGTCCTGCTACA 

GAGAGTTACGGAAAAAATCTCACTGTTGAAGAGAGACATCTATAAGAGAGGTGAGGAAAA 

TGACTGATAGATCCTTTTACAATATTCTAGGGTAAGTCTGTCTGTGGGAGGGGGCANGAA 

GTCTATTGGGAACTCTNTTGTCCTNGGCCGCTTTTAAACTAATGGATCCCCCNGGCTTGC 

AG 

Sequence 2109 

rTTTTTTTTTTTTlTTTTTI 1 GGGGAACAAGGTGAGGTTCTCTGAGGTAACATTCCCTAA 

GACAGGAACCCCAGGACTTTCCAACTCTMGGATTTCCCACATTCAGGATCCAGAAGTTT 

TCAAMTTACCTCTTAAGTTTTCCTACTAAGTTTATGGCCCC AGAG GCTTCTACTCCAGG 

TAAGCAGTTCTCTGCAACTCTGAATTTGCTTGTATTTCTAGATTTTAAGGTGACAGTTTG 

CCCTGTAATATCAGTTCTCTGACGGGTCCAAGACMGCCATCAATTTTAAGGTTCTTTAA 

CTATTTCTTATTGNMGGGATGGGAATTGGAAATGCTAAAGCTCTTTAAAGATTTGGGC 

Sequence 2110 

AGGTACACACAGACCCCTGAAAAGCACCATGGCAGTAAGACCTTCCCCACATGAGTTGGT 

MCCTGTAAATTATCTTAGCCTTTCCTTATTCAACTTATCAMCCCTCCTCAGAACTCCA 

CACTATTACTTCCCCTTTGGTGACGTGAAGGGTTGGAACCCCAGCATCTCCCATTCCCGA 

TGAGTCATAACTCACACACTGCACAAAACGCAGTCCACATAACTGA 

Sequence 21 11 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGGTTTGTAGATG 

GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTAT1TATAAACATCCTGCAGGA 

AAATGGTAAGCTGAGACCAATGATGCTGACCTCCCTCAAAAG CTGCA TTTCTGAATTTCT 

GMGGCAAACTGTCTGCCTATATTGTACCTTGCCCNNATCAAATTTT 

Sequence 2112 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTGAAGGGCTCTCTGTAT 
ACTGCTGAAGATTCTATAGTTTCTCCTGAAATCCATGAGGGAGCCCCCTGMGGATTTTG 
GCCAGGAAAGTGTCATGMTCAAATTTCTATTTTAAGAATAGTATCGAGGCAGCAATGTC 
AAAGTTTGTTTAGAAAATAGGATGGTTCCTGTAATCCCAGMCTTTTGGAGACCAAGGCG 
GGAGGATTGCTTGAGTCCAGGAATTTGAGTCCAGCCTGGGCAACATGGGAGAGGACCTCA 
, TCTNTNCTAAAATTTNANTNNTTNTAATAAAAGGTTCCCT 
Sequence 2113 

CCACTCCACTGAAATTTGCATTGGCTTATTGAAAGTCAATCACAAGTCAAAACCACAGAC 
TCACATAGGAGMCACTGTTGTTTGGGCCCAGCAGCTCTGAAATCAATACCTTTAATGGT 
AGAACCTAAAACTAAAATAAATAGGAAGGAAGGAAGGAGAAAGAAAGAAAGAGAANGAAG 
AAAGAAAGAGAAAGGGAGAGAGTGAANNANAAAAAATGGAAAAATAAGTACCT 
Sequence 2114 

GGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAATAAGGTGTGGACTCACCAAAGACA 

GTTCCAAAGTCAATTTCACTCTGACACACTCTCTGTGATCTTCCACAGTCAGCACAATGC 

CTGCCCCCTGCTAGGCCTGATGGATGATCTGATAGAAAACTGGCTTCAGCAAGGAGCTTC 

C ACAGGTAACCTTGGTTCTG C AGTG ATAAC AG CTCCTATG AAGG AG<\CCCC ATCAGACTA 

ACAGTGGATTTCTCAGCAGAAATCATAMGGCCAGAAAACAGTGGAATGGCATTTTCAAA 

ATGCTCTTTCCATTGCTTTTCCATACCACTGCTTT 

Sequence 2115 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCCGCCCGGGCAGGTACCAAAAGAAT 
GTTCATAGCAGCATTATTTGTCATAGTCAAGAGCTGGAAGCAACCCAGTTGTGCATCAAT 
G GTAGAATG GATG ACTAAATTGTAGTGCATTCTTAC AATAAAATACTATAG AG C AATG AA 
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AATAAATAAACTATGGCTATATGCAACAACATGGATAAATCTTACCAACAATGTTAGACA 

AATGTAGCCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCACAAT 

AGGTGATAGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGCCAGAGTATTGCAGA 

GGGGGCTCCGGGTGTTTGTMTTTCTATTTCCCAGTGCANGTGTGGTTTGCATGAACATG 

TTCCTTTGNGGAAATTCATTGNNGCTGTACCTAAATATA 

Sequence 2116 

CCGGGCAGGTACGCGGGGGTAGAACAGGCGATATCTTCACCATGCGCACAATGAAATCAA 

TACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGG 

GTGAAATGTCCCACCAGACGGTGTGATATTTAGTAGTCCAGAAAGCTGCCAAGGGGTTGA 

ATGACACTATCTGAAGATATGMCCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGG 

AMCAGATGCCTGGAAGGCATTGGATTTGCTAAGTGTCTATGCCATATGATTCTGCTGTT 

TG CGTTTG ATTTAAAAT 

Sequence 2117 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAATTTGGGCATAGTCAGAAACTCTGCTGACT 

CTCTCTTCCCATTTAACCTAATCTTACCATAGCACTCCTCACCCTATGACTTGCCCATTG 

CAGTAAACCAGAMGCAAAAAMGGTCMGCACGTGTGTTTGAGATCCAACTCTAGTGTT 

TGTAGGAGCTCTGCAGAACCACAGTTTGCACATCTGTAAATTGAGCAATAACAACTGCCC 

TGCCTGTTTCACAGGGTTGCTTAAAGAGTTATATGAGCTATTGCATGTGAAAGTGTTCCA 

TAAGCTGCCATTTGCTCTGAAGCTCACGGGTATAATTATTCTAGTCTAACCTACTTATGT 

TTGCCCTCTTTTCTGCAGTTGCATCTTTGTGCATAAAGATTCTCTCTCCAACCTGTGCAA 

TTGATTCATCTTTAMGACCMCTCAMTTCGACCTTCTAAAAGAAAATCTTCTCTGACC 

TTCAACTCTAAACTAGAGTAATTTACTAACTGGTATTTATCACTTTCAAATTCTTCATT^ 

GGTCTTGAATCATAGAATATTGGTTACCCATTG 

Sequence 21 18 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGCG 

GTTTTTAMTCTGTCACTGTTCTTCCCCCACTTCAAAGCTGACTTGACAACTGCATTTAT 

TTC AC ATTG C AATTGCTGC AGGTTMTMTATTGTTTTATCAGGAAACATTTATCTTGAG 

GAAGAAAGTTATCCTAATACAGCAGATAACTTATCCAAGGACAGATAGAAAGGGGACTGT 

GTCTAGAGGGTAATGCTAATGCCTGGCATGCTATAAAAATAAATGAAATCTGATTAACAC 

TATTGGAGGGGAAGAATMGTCATCGTTTCTATGAAATATTATTTTCCTGATTGGAACCA 

TTTCCATTTTCGCAGCTTAAMCATTGCATTAAAGATGAGATGACTTTAAAACCCGGGGA 

CAGCTATCTCAAGCCCTGTGGATTATNAATGGGGTATAGGCTTGTAATCCTATGGGTTTA 

GACTAAAGTATTTCTTCATAAATTGAAAAGAAAACNCACCTTGGAATTCAGAAGATTTTA 

ATTAAAGGTGGTCAAAAAATTTAATGGGGGAATTTNAANGGCTGACTCCCTAAGTAGTTT 

TGGTTAATGATTTAATTTATAGAAGAMGGTCTTNGGCNAATTTGGTATGCCCAAAAGAG 

CCTACTATGNCCCAANNNTTTTTCTNGGAGCNTTATTATTAAGNGGCCCGCTTTTCCNGG 

TGGCNTCCCTNAGTCATTTCNGCTTCATCCAAATTTGTGGAATTAANCCTACCCCNTTrA 

AATGGGGACCATTGGCTTTTTAAGGGGNNAAGACCCCAAAGGGTANATANCCTGGCNTTA 

TGATAANTATTTTTTTT 

Sequence 2119 

GGGCGAATTGGAGCTCCCCNTGGTGGCGGCCGCCCGGGCAGGTACCAAAAGAATGTTCAT 

AGCAGCATTATTTGTCATAGTCAAGAGCTGNAAGCAACCCAGTTGTGCATCAATGGTAG^. 

ATGGATGACTAAATTGTAGTGTATTCTTACAATAAAATCTATAGAGCAATGAAAATAAAT 

AAACTATGGCTATATGCAACAACATGGATAAATCTTACCAACAATGTTAGACAAATGTAG 

CCAAACCAAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCACAATAGGTGAT 

AGTAGCTACTCTTTTGGGAAGGAGTGAGGAAAGGACTGAGAGAGTATTGCAGAGGGGGCT 

CCGGGTGTTTGTAATTTCTATTTCCCAGTGCAGGTGTGGTTTGCATGAGCATGTTCACTT 

TGTGGAAATTCATTGAGCTGTMCTAAATATACAGAATTATTTATGAATGAAATGATATA 

TAATTCAATGGGAACCTGGGAGGATAGGGAGGAGGTGGGGGGANCTNAAAAAACATGAGT 
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GGCCCGAATGCTTAACAATTGGTATTGTTAGGTTATGGGATCCCGGGGGGTGGTTGTCCT 

0 

Sequence 2120 

AGGGCGAATTGGAGCTCCACCCGCNGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAG 

ACTTGAAAGGTCACGTAGCTGATACTTCTGGAGAGACCA1TCAAGGCTTCTGGCTCTTGA 

CAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTC 

TCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTC 

TGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACC7TCCCTGGGATGTCCCTGGACA 

AGATTTTACTCCCATAACCCAGGCAGAAGTACCT 

Sequence 2121 

CGAGGTACTTCATACTACTTTAAGTTACCTCCTATTGGGGGCATTATAAATGGGTAAGCA 

GAGAACATCATGGAAAGACATGGAGCTTAGTATATGCAAATGTTGAGTTACTCAGTGTAA 

TGTGTGAAAAAGGAGTTTCATMGTTTCGGTCAGGGAAGMGGCAGGGTCAAAATTTCTG 

CTTGAGAGTTTGGGAGCTTGGGGAGACTTTAAACAGGGAGCAACACAGCGCCTCTGTACC 

TGCCCG 

Sequence 2122 

CGCCCGGGCAGGTACTAAMCTATTCCAGGAAGACTTTAACTTTTG TTGATT CAAAAGGA 

ACATGGTGCATCATTTAGCACCGTAGGGTCATCGTTATGATMTCCATTTTTATTACAGT 

CCTGTGGTTCTCATCTACCCTAATATCCAGCAAGTGGTTCATTGCAGATTCTGATTTATT 

CCATCCTACCTCAACCCTATTACCAACTATCACCACCCACCACCCGCCACCCAAACTGGA 

AGACAACCANAGATAAATCCTCAAGAAA 

Sequence 2123 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGTCCATGTGATGAAT 

TAGMCATGCTmCCACCTATAMCTACMCTTMTTTGCATAACTTAATATAAGTAGT 

ATTAATGTAAGTGTTATACATATTTCAGGTTATAATTTTGATTACAATTTGAAGACATAA 

ATACAATTGGTTTAGTTATAAAATTACACTACTTACTTGCTACTTTATAACTGCTCACTA 

CCTCCTTAGTGGAACCCAGATTTTCTCTATGATTTGAGAAAAATTACTCCACATGTTGCC 

TAGTTATCAMCTATATTGGGMGCAGAAAAAGCTATGAAAGGCTTCAAAGCTTCTGAGA 

ACCTTAGCACGAAAMCTCTTCCAAAMGCTTT^ 

ATCACATTTATMTCACTCACCAGCMTCTClTrTCCATGCTTGATTTG TG AGG C TTATT 

GAAAACMTGGCATTGACCGTTCATTCTGACTCCTTGATTTCC 

Sequence 2124 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAACTATTC 

CAGGAAGACTTTAACTTTTGTTGATTCAAMGGMCATGGTGCATCATTTAGCACCGTAG 

GGTCATCGTTATGATAATCCATTTTTATTACAGTCCTGTGGTTCTCATCTACCCTAATAT 

CCAGCMGTGGTTCATTGCAGATTCTGATTTATTCCATCCTACCTCAACCCTATTACCAA 

CTATCACCACCCACCACCCGCCACCCAAACTGGAAGACAACCAGAGATAAATCCTCAAGA 

MCTTCTTTTTAAGAGGGTGGTTMCAMGTTAAATTGGGTAGAAAACAGATGAAGCAAG 

TTTTAATCTCTTTGCTGAGTAAGAATAGTGCCTACATGGTGTGGGTGATCATGTGTAAGA 

CTCTGATGGGCAAACCTGGGGCCCATTAGGAG 

Sequence 2125 

AGGGCGAATTGGAGTTCCCCGCGGJGGCGGCCGCCCGGGCAGGTACGCGGGAATGAGAGC 

CTACAGTTGAAAGCACACTTGGATGAGGCTCGGACCCTGCTTCATGGCACCAGAGGAACC 

CACCAGCACCAGGTTGAGCTTATTGAGCGAGATGAGGTTAGTCTTCATAAGAAGCTGAGG 

ACTGAAGTAATTCAGCTAGAAGATACATTGGCCCAGGTCCGCAAGGAGTATGAAATGCTG 

AGGATAGAATTTGAGCAGACCCTTGCTGCCAATGAACAAGCAGGCCCTATAAACAGGGAG 

ATGCGCCACCTCATCAGTAGCCTCCAGAATCACAATCACCAGCTGAAA 

Sequence 2126 

CCGGGCAGGTACGCGGGAATGAGAGCCTACAGTTGAAAGCACACTTGGATGAGGCTCGGA 
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CCCTGCTTCATGGCACCAGAGGAACCCACCAGCACCAGGTTGAGCTTATTGAGCGAGATG 

AGGTTAGTCTTCATAAGAAGCTGAGGACTGAAGTAATTCAGCTAGAAGATACATTGGCCC 

AGGTNCGCAAGGAGTATGAAATGCTGAGGATAGAATTTGAGGAGACCCTTGCTGCCAATG 

AACAAGCAGGCCCTATAAACAGGGAGATGCGCCACCTCATCAGTAGCCTNCAGAATCACA 

ATCACCANCTGAAAGGGGAGGTCCTGAGATATAAGCGGAAATTGAGAGAAGCCCANTCTG 

ACCTGAACAAGNACACGCCTGCGTTANT 

Sequence 2127 

GCCCCGCGTGACGAGCGGACTTGCGTCCGGCGTATCGGCGATTTCGCGGTTCTCGTTCAG 
CAGGGAAAACAGGCGTTCCATGTCGGCCAGGCTTTGCTTGATTTCGCGGTAAATGACGCC 
CAGGAAATTGAGAGGGATGTACAGCTGGATCATGAAGGCGTTCACCAGCACCAGGTCGCC 
CAGGGTCATGCTGCCGTTGATCACGCCCACGGTGGCGCGCCACAGGATCAA 
Sequence 2128 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTC 

Sequence 2129 

CCGGGCAGGTACGAGATGCCACGGCACGAAGTCTACGTTCTCCTGATCCGAAACATCTTT 

TTGAAAATATCAATCATTGGCATTCTTTGTTACTATTGGCTCAACACCGTGGCCCTGTCT 

GGTGAAGAGTGTTGGGAAACCCTCATTGGCCAGGACATCTACCGGCTCCTTCTGATGGAT 

TTTGTGTTCTCTTTAGTCAATTCCTTCCTGGGGGAGTTTTTGAGGAGAATCATTGGGATG 

CAACTGATCACAAGTCTTGGCCTTCAGGAGTTTGACATTGCCAGGAACGTTCTAGAACTG 

ATTTATGCACAAACTCTGGTGTNGGATTGGCATCTTCTTCTGCC 

Sequence 2130 

AGGTACAAGGACTGCATCGAGTCCACTGGAGACTATTTTCTTCTCTGTGACGCCGAGGGG 

CCATGGGGCATCATTCTGGAGTCCCTGGCCATACTTGGCATCGTGGTCACAATTCTGCTA 

CTCTTAGCATTTCTCTTCCTCATGCGAAAGATCCAAGACTGCAGCCAGTGGAATGTCCTC 

CCCACCCAGCTCCTCTTCCTCCTGAGTGTCCTGGGGCTCTTCGGACTCGCTTTTGCCTTC 

ATCATCGAGCTCAATCAACAAACTGCCCCCGTACCTGCCCG 

Sequence 2131 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTTCCTCCAGT 

GCCCAGAGATGCTCTCCGCACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAG 

AGGGGTGGTATCGGTTATTNAGGAC I rTTTTTTTi I CTTAAATATCCTGTGCTTCTTTCA 

ATCATTTGAAGGTAAAMCCAGCNTCCCTGTGAGTGGTMNCTGATTTTTAGGTTNTTAT 

GMGGTATTATTTTCTGNGTAGATAGTNGTTAACTTGGTGTCCTTGCANGGTAAGACGAT 

CAGCGAAGCTTTCTGTTCCACCATCTTGCTCCTCTCATTTTANACCTAATA 

Sequence 2132 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAAGAACAAACGGAAGCCTAAAGATGGACC 

ACAAACCTTTGGATATCCAGGATAACTGAGTTAAAAACCTACCACACGGTTAATAAAATA 

ATGATGTCCACTGAGTCCATGACCACAAAACAAATCACCAGCAGCAGGATGGCATGGGCG 

GCCCTTTTCGCTGGGGAGGGACTGGAGCAGAGGCTGGGGCTGTGAAGGTGATGGGATCAC 

CTCTGATGGTTAAACAAGAGTACCT 

Sequence 2133 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCGGTTGCAAGCACAAGC 
AAATGTGCCAGGGTGGTTGATGCAGCTGTGGTCACAGGTCCTATCCAAAGAGCACTCATC 
CACATCTTGGCMGACTTCTCATCTGTTAATAATTTAAATCCTTTCTTGCAGCCCGCAGT 
CAAAACTGCCCACCGATG I I 1 1 IGCAGAAATGATCACAACCTCCATTGCGGGTCCCCGCG 
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TACCTGCCCG 
Sequence 2134 

GGTACGCGGGGACCCGCAATGGAGGTTGTGATCATTTCTGCAAAAACATCGTGGGCAGTT 

TTGACTGCGGCTGCAAGAAAGGATTTAAATTATTAACAGATGAGAAGTCTTGCCAAGATG 

TGGATGAGTGCTCTTTGGATAGGACCTGTGACCACAGCTGCATCAACCACCCTGGCACAT 

TTGCTTGTGCTTGCAACCGAGGGTACCT 

Sequence 2135 

CCGCGGTGGCGGCCGAGGTACGCGGGATGGAAAAATCACACTTCACAAGAGATCCATCCC 

AGCTTAAAGGTGTCCTTGTTCGAGCATCACTGAAAAAAAGCACMTGGGATTTGGTTTTA 

CTATTATTGGTGGAGATAGACCTGATGAGTTCCTACAAGTGAAAAATGTGCTGAAAGATG 

GTCCCGCAGCTCAGGATGGGAAAATTGCACCAGGCGATGTTATTGTAGACATCAATGGCA 

ACTGTGTCCTCGGTCACACTCATGCAGATGTTGTCCAAGATGTTTCAATTGGTACCTGCC 

CGGGCGGCCGCTCTAGAACTAGTTGGATCC 

Sequence 2136 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGAC7TCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTGAAGTTCCTTATGCTACTTTCAC 

Sequence 2137 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGCAGAGGAGTTCACTCTTCTTCCCTGTTC 

TGMTGCAGCCATTTTCAGGTCAAAGTCAAAGAAACCCTTCAGTTCCTGCACATGGGTGA 

GATCATAGACGGTGTAGACATGAGAGCTGAGGTTGGAATTCTTACCCGGAATATTGTGAT 

CCMGGAGAAGTGGAGGACTCATGCTACGCAGAAMTCAGTGCCMTTTTTTGATTATGA 

TACCTTTGGGGGACACATTATGATMTGAAAMTTTTACTTCAGTCCATCTTTCTTATGT 

GGMTTGAAACACATGGGTCAGNAGCAGATGGGGCGATACCCTGTTTCATTTTCACCTGT 

GTGGGGGGACCTTGGGATTATAAAGGGAGG 

Sequence 2138 

CCGCGGTGGCGGCCGAGGTACAACAGGCTTGTAGTTCTTTGTGAAGGATGCAGGCAAGAG 

GGATGTCCAGTTTAGACGGGCCCTGACAGCAGCAGCCTCTTCCACATCAAGTGAAGTGTG 

ATTGTTAACGATCATTGCCTGCACAAAGTATAGTTGCCTTTGAAGTAGTCCACCCAATAC 

TTCACCATGACATAGCGGGGCAGGATTGGGGTGTTCACATTTrGCTGATAAG ATGGTG TG 

TTAAAATCTAAAGCTTGAAGAACTGGATAAATT^ 

AGGCTGGCATCTTGAGTAAACTGTTGTCCAAGGGCAGCAGCCAGATTTCCACCAGCACTG 
TCACCAGAAATGCAAATTCTGCCTGGATCAACCATATACTTCTGTAAGACTTCTGGCTTC 
AGGAAATACTTTGTGGCCCGTC 
Sequence 2139 

ATAGGGCGAATTGNTNCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACAGGCCATCTC, 

GCTATAGGAAAGGAMGTGGAACAGCATTCATCCTCMCATTTTTACGAAGACAAAATGA 

AGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATG 

TGGTGTGAGAAGCATCATCAGCATCATGCTATTACAATCCCAAACCATGGGGGTTTCTCA 

CAGCTTTACACCAAAGGGCATCACTATCCCTCAAAGAGAGAAACCTGGACACATGTACCT 

GCCCG 

Sequence 2140 

GTAATTTAATCTTCACATCACTTTTACTTAGACCTTGATGAATGCTCCCAGTCCCCGAAA 

CCATGCAACTACATCTGCAAGAACACTGAGGGGAGTTATCAGTGTTCATGTCCGAGGGGG 

TATGTCCTGCAAGAGGATGGAAAGACATGCAAAGGTGAGTGAAGAGTCTCTGACAATCGT 
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GACCCTAGTCTGTATCTTTTCTTTCAMTTTAAATATTAG ATTATCAG GAATTATG CC AT 
GTGCATCACCTTCATTAAAAGGTCCCATCCTAGTGGCCATGTTATTGCTCATAAGAAAAA 
CTCTACCAATATGCTCATTGCAGGAAGCAGAGAGGATTAAAAC 
Sequence 2141 

GTAATTTAATCTTCACATCACTTTTACTTAGACCTTGATGAATGCTCCCAGTCCCCGAAA 

CCATGCAACTACATCTGCAAGAACACTGAGGGGAGTTATCAGTGTTCATGTCCGAGGGGG 

TATGTCCTGCAAGAGGATGGAAAGACATGCAAAGGTGAGTGAAGAGTCTCTGACAATCGT 

GACCCTAGTCTGTATCTTTTCTTTCAAATTTAAATATTAGATTATCAGGAATTATGCCAT 

GTGCATCACCTTCATTAAAAGGTCCCATCCTAGTGGCCATGTTATTGCTCATAGAAAAAC 

TCTACCMTATGCTCATTGCAGGAAGCAGAGAGGATTAAMCCCAAGTTAAAACAGTTTG 

C TTTCTTATAG AAAACTGCCATAAATTT 

Sequence 2142 

TATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGTCAAGGACGTCTTTAA 
CGTCATGGACGGCTCCTAGGACAGCCACAGAAAAAAATGGGGTAAAGGGGTAGGAAGAAA 
AAAGGAAGGGAAAGAGAGCCAGGGAATTGGGGAAGGAGAGGTAGGGGAAG 
Sequence 2143 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGACGGGGCTGTAATCTAGTAACTGT 

ATGTATTTTAGTTCTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGAGGGAGGMGGG 

GCCAGCTGAAATGGAAACAGATCCCTGATCCGGGGCGGTCAGTGGAACCCTTTCTTGGTG 

TGCCGAGAGCCTGTGTATTTCAGAGGCAGCAAAAAAGTAAAAAAAAAAAAAAATTGATCT 

TTGTTTAGATTAACAGACCCCTGACTATGAAGAAGGAAGGCATCCATACCAGAAACCGAN 

AAATGTCTAGCAAATCCAAAAAGTGCAAAAAAGTGCATGACTCACTGGAGGACTTCCCCA 

AGAACAGCTCGTTTTAACCCGGCCGCTCTAAGAACTAGTGGATCCCCCGGGC 

Sequence 2144 

CGAATTGTTCTCCACCGCGGCGGCGGCCGCCCGGGCAGGTACTTTTCTTGAACTCAGCAT 

TGCTGCCCTNAACCCTCCCCCAGCTGCCTTCCCTACTTTGGGAGCCTCTATTTCTAGAAC 

AGGAAAACGCTAGGCTAAATATATGTCCTAATGGTCTCTTCCATCGGTGTGAAGGGAAAN 

NAAAAATGGGCCCAATGTTGTTGCAAGGAATTAAGGGAAGGTCCNTGCCTTGCCACCCAG 

TGGGGNTTAGGAAATCCATCTTTTACCGGGNCTCCAGTAGTAAGCCTGGTTGAACANGGG 

TTCCCAGGACCATCAAGACAMTTCCTGGTTTTCCCTGGAGGGAGGCNTTTGNTGGGGGG 

GCTCAAATATTGGNGGGGAATGAATNCTTGGGGTTCCCTTAGGACAAAAAGGGAGGCCCC 

NCCTAAACCTTGGAATGGGGGGGGNGGTNCACNTGGGNGNATTANGCCAAGGGGGGGGGC 

CCCAATGGGGCTTGGAACCTA 1 1 I I I I I GNTTCCCTNGGGGGA 

Sequence 2145 

GCGGCCGCTCCGGGCAGGTACGCGGGATACTTGGGTTGACCCTAACCAAGGATTGCAAAT 
TGGATGCTATCAAGGTATTCTGNAATATGGAAACTGGGGAAACATGCATAAGTGCCAATC 
CTTTGAATGTTCCACGGAAACACTGGTGNGACAGATTTCTAAGTGCTGAAGAAAGAAAAA 
CAACGTTTTTGGGTTTTGGGAGNAGTCCCATTGNNATTO 

AGNCTTACCGNGCCAAATTCCTTGTAAACTTTNCTGNAATNCATNGTNNCTNTGGAATTG 

TTGCTANNCTTGGGNCAATTTTNCTNTTCGGAACTNTCCTNCTTNCCANGCTCNGNAGAC 

NTTTTNTCN N N AAGG AAANCCAATTNTACC AATTANTTCN AACCTTGGCCAAAAAAAAAA 

TTATGGTCMTTTTNGCCCATTMCCMTTNGGGNAATTCCANNGGGGCTCCAAGGTTGG 

GGGAAAAAATTGGTTTANANAAGAAAMGGGGCCACNCTTGGGAAAGGCCTTTGGAATTG 

GGGGGGGNTTACAAAAAAATGGTAAMGNGGGTNGNAAAATTTTNNAACAGGGGGCNTTN 

NNAAAGGGGGAA 

Sequence 2146 

AGGTACGCGGGMGCACATATTTGGTAGCCTTTGCATACAAGAATGTGAAATTTGTTCTC 

AAGCACAAAGTAGCACAGAAGAGGGAGGATGCTGTTTCCAAAGAAGTGACTCGAAAACTT 

TCTGAAGCTGATAATAGAAAGGATGTCCTCGGTAAGGGGAGAAAGATTGAAAAGNAATCC 
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TTGTTGTNAAGAMGAMTGAMGTTGCTTGATTATNGAAGCTACAAACATTTTT 

TTTNCTATAMCMNCACTCTTGTTTCCCTGGGTCCGATGGGCTCATNTGATTGCTTTCC 

TTCTTTCATTANTTGNMGGMCCTTTNAACTCCCACMGTTNNAMCCTMCANNArTT 

GGTTCCATTAAAGGTNGNCTTTTCAATTCCAGGGGAACCTTCAATNCGGNCCCCTTCCCT 

NGATNCTAACNTTGGGGCCTTCCCCAAAAANTTAGGTMCTCMTTGGTTCAAGANCTTT 

TCNACCCCCCCCCTNGNGCCTNTTTTGATTGGTCCTTAATTGGGG 

Sequence 2147 

AGGTACAGAGGCGCTGTCTTGGCTCACTGCAACCTNTCCCTACCAGCTTCAAGTGGCCCT 
CTCAT CT CAGCCT CCCAAGTAGATTGGGATTACAGACACGTGCCATTATGACACGGCTAA 
TTTTTGTAl mm IGTAGAGACCGGTGGTTTTAATACATGTTGACCCAATGCTGGGTTCT 
GAGNCCACACATN ATGCCCG GCCCTTGATAGTCCTTATTTTCTATTCAMGTCTTTCTTG 
GGNGTATTNGCCN illllll NTCTTTAATTAAAAAAAAAANAMCTTTTCCMGGTTTTG 
GAAGN GTGCCT TCNCAN AAAAAANGGTNGNAAATTTACCGAANAATAAGNTTACANTAAT 
MM I AAAAAAAAI I I I I I AATTMTTTCTTAATTAANTGGCAAGGGTAAAANAATTTAA 
TTAAANANGTGN NCTTT GNTNTTNGGGGAAAAGGAMTTNTCCCTCGGTTNATTTTTGGG 
CCAAAAAAATTMTTTTNTTNCCTTGGCCTNMTTAMNTNTNGGTTATNGGGGTTGGNC 
CTTTN NG TAAAAAATTTTTATTTACC N CC AATTAAT ATAN AAATAAATTAACCCCTTTCT 
t 1 l M 1 1 GGGAACCTTANAANAAAGNGGAACCNTTTGG' I Mill 
Sequence 2148 

CGGCAGGTACCACACACACACAGGCACACACAGGCACACGCAGGCACACACGCAGACACA 

CACACATAMCACACAGAGTTCACTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAAAG 

TGACAACACCGGAAACAAGGTAATTCATGTTAAAACACAAGGGTTATATCAGTAAGAGAT 

GGGATCCCCGAAGTAMCCGTGGMTTTGAATCAAGCTTCGAAGAGCTAAAAAAAGAAAT 

TGGAGTTTCAACATTCACCTTCTTGAATCCTTAAGAAATACAGAAGTTCAAAATAGAAAA 

CA7TACAGTTTCAGGATACAAAAGTAGAAACATCTGAGATTA 

Sequence 2149 

CGTGGTCGCGGCCGAGGTAC I I I I I I I I I I I I I I i I I i I I I GAACCAAAATCATAGATTT 

AGAGTTGGMGTGACCTTTGTGATCATTATTTCMTCCTTTCAGAGAATTCTCTCTGCAG 

GAATATCTATGTAGGACTTGGATTTGACCCAAGCACATACAATGTCTCAGAGGGCTCTCT 

GCTCCTGGCCAAGAGCATCAGGTCTTGTTTTTMTATAATCAGTGTTGTAAATTATGGCA 

CAGAGAACAGGCTCATTCAGCTGTGGAATGAACTGAGTTAAATGGACTTGTAAACATCCC 

TAAATGAAAGAATTAAAAAGTCAAATGAGGAATTATATAGCCGTCCCTTGG 

Sequence 2150 

CCGGGCAGGTAC I I I I I I I I I 1 1 I I I I I I I I I I I I I C ACAACACC ACTGTAAGTCTTTG C 

CATTCCCTTGTTCTTAAAAACATTGCTGTTTCTTTTAATGCATGGAAATTCTA/^ 

CAGATTGGGAATTTAAAATAGTAAATAGGGGAAATA A I I I H I I I I H I 1 1 1 f I GANACA 

GTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCNCAATCTCGGCTCACTGTAACCTNT 

GCCTCCTGGGTTCAAGTGATTCTCCTGCCTNAGCCTCCCGAGTAGCTGGGATTACAGGCG 

TGTGCCACCAGGCCTGGCTAATTT 

Sequence 21 51 

AGGTACI I I I I II I I I I I I I I I 1 1 I I I I M I ! I 1 1 ANAAATTGCCTCAN ACAGGGATTGA 

TGTCAGGAGGCTTCAAGGAGAGAAATATACATGAATGGGAGACCCCAGGTTGCAGGCAAC 

AAGAGTCACAGGTATGGGAGAAACCAGGAAGTCTGTGCCATGTTAACTTTGCTCATCTCA 

TCGTGCTTTGTACCTGCCCGGA 

Sequence 2152 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCG 
GTCCCAGATTTGGTGGGACTCTCCTCTGGTATCAGTTTTCAACGCCACTTCTTGTTTGGA 
GTCATTCCCAGTGCAGTCATGGACTCTATGAACAAATACGGTTATTTAAATGTATTTGTC 
CAGTATGAGAATCAGAATGAACTAGTAGAGGCTTTACAGATCAGTTGGTCTGACCCATTT 
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TACAGATGAGGAAACAGGCCTGAGAAGATGGAGGGAGTTACCCACAATCTGAAGGGGCTC 

TACCATGACTAGACCCAGGTCACCTGCCTCCCAGGCCTGGCTCTTTCCACTCCGAGGTGC 

TGGCTCACCACAGACTTATTCTTTAATGGAATTTTGGAAAGGCCTCACTCCAGTGGACTC 

TTTGGGATCT 

Sequence 2153 

CTNCTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTT 

ACTAAGGCAGCAGAGTAGAAAGATGAAGTGCAAGGACCTGGAAATCAGACACACCTGGGT 

TCCTGGTAAGCTTCACTGCTTCCTGGTCTCTGATATCAGCTCTGCCCAAATTTCAGCTGT 

CACAGAATGAGTGATCCTCATAGTATAATGGAAATGCATCATCTCAGTTTCCCCATGGCC 

TGMTATCCATACAAMTGAGAGCCCGGTAGAGAAAMCAGAGGACGTGAGGAGTTTTTA 

AAMTTGAGGAAAAAGTGGGAAGGGAAAAAGTAAGGGATGAAAGGTGAAAGGATGAAAGG 

GAATATGTTCAGGAGATGCCTAGGCACTTGGCATCCCCCACCCACACAG 

Sequence 2154 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAGAA 

AAATGGTTATTTCTAAGCCTCAGTGAAACGTGTCCAATGAAAGAAAAGCCTGAATTACCA 

ACATGGGTTAGCATCACTCACACGCAGGCAGAGATAACAGCAAGTGTCATTTAAGACTTT 

TCTAAAACTTCATGCCTTAAGGAGTAAAGATCCAGTTCATATAGGGTTAGCAACTGCCTT 

CAGCGAAGAGGAAAACAGTAAATAATTAACAAAAAGATCAAGTCAGGAGATAAGGCCTAA 

GAGAGACAGGACTGAAAAGGTTAGAGGGAGGATAGGTGAGGGCTTGGGAGACCTTGGAAA 

AAACAGNAGGGAGCCGGTGGAAGAATTAAGGTGGGGAGGACAAAACGGGCACCAC 

Sequence 2155 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCGCGGGGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 
TGAGTCAAGGAAGCTTTTCTTTTGAGCTATTTACAGCTTTTAGCAATTGAGTAM 
CTCCTGTGAACAAAATTTGGAACATATTTGTTTCTCTC^ 

GMCTAAGTTGTAGCTGAGACCAATGATGCTGGACCTCCCTCAAAAGCTGCATTTTCTGG 
AATTTCTGGAAGGGCAAAACTGTCTGCCCTATTATTTGTACCCTCGGCCCGCTCTAAGAA 
CTAGGTGGGGATCCCCCCCGGG 
Sequence 2156 

CCGCGGTGGCGGCCGCCCGGGCAGGACGACCAGTGTGAATGAATGGAGCAACCAAGAGGT 

TAGGCAGTAGGAGATGAGGCCAGAGAAGAAAGAGGGTGTGGTAGCCCCCTAATGCTGGCA 

CTTACTTCCACTGGATCTAAAATCCAGCACGGAATTATGCTGACCACATCAGCCTAACTA 

ATATTGATCTAAGTAACATTTATGTTCTAGCACCTAAATCAGGAGCCTGCAAAGGATAGC 

CCACTGCCAAAATCTGGCCCACTCTCCAGTTTTTGTAACTAAAAATGTTATCAGGAACTT 

CMGCCCAAGGTCCATTTCATTTTACCAAANCTGGTCTTACGGGTGGGCAGGAGACCAAT 

TACCGGCTCACAAAAGGNCTTAAAAATATTTTACCTCTCTG 

Sequence 2157 

CCCGCGGTGGCCGGCCGCCCGGGCAGGTACCTGTTTAAAAAATTCTAGAGAGAAAACTAT 

AGAGAGTGTAGTGTGTGATTCAGACATACTAACCATCCTCACATGGACTCACTCTCTAAA 

AACTCCAGACATAGCCAGAAAGAGGCAGAACAGCCACAGCAGGAGAGATCACTGATGCTC 

ACTGAGGCAGAGGACTTTGTCATMCCTGTAGAAMCATTTTCCCAATGTCTCTTTACCA 

C ATTGGG ATTATAMTTTTCTAC ATTCTTCTTCTTCTTTTACTGGGG C ATTNTATAN C AA 

NCTTGGAAATACANMNATATGGNGAANAATTAATCNTTAAAAATNTTAMTTTTCACCC 

ATCCCTTGTNCAGGACAAAATTAACATTCATTTATGGAATC 

Sequence 2158 

TACACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCAGCTCTGTCTCTCCTCGGAG 
CAGGATTCCATCTTTTCCATGGCTGGGGCCCTTGACACATGTCACTGTGCACTGCTCTTT 
TCGCTGGGATTTCCAAAAGACCTTGGCTCAGGTGTCACTTTTGCAGAAAAGCATTTACTG 
ACTGTTGACTTGGCCAAATCCCTGTGCTGTGTTTCCTAGCACGGTGAGTTGCATTTTCAC 
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ATGAACTCATGCCTACCTTCTCTTGTTTTATAAGCTTCATGAGGCAGGTACCTCGGNCCG 
CTCTAGAACTAAGTGGGATCCCCCGGGCTGCAGGNAATTCNATATTNAAGCNTATTGAAT 
NCCCNCNAACCTTAAGGGGGGGG 
Sequence 2159 

CCGCGGTGGCGGCCGAGGTACATTTTACAAAATTAAAGTCACTTTGTAGGTATATAAAAA 

GTTTTAAAAGTTGGTTTATTAATGAGAGTAGTGTACTACATTTCTTTTC 

TAAATATTAACCAGTTTAC I MINI I AAAGTCAATTTACTTTTCCAGCCTGGACAACAT 

GGTGAAACCTCGTATCTACAAAAAATAGAAAAATTAGCTGGGCGTTGGTGGCTCACGCCG 

GTAGCCGTGTTTCTTTTGGTGGTGATACAAATGGTTCTAAACTGACTGTGGCACTGGGTT 

TATTACAACCTCTGTGGACTATTACCTAAAAMTTCACCTGGMTTTGTACCTGGCCCCG 

GGGCCGGCCCGCTCTAGGAACTAGTGGGATCCCCCCCG 

Sequence 2160 

CCGCGGTGGCGGCCGAGGTACGCGGGTTTCCTCAACATGGCTGCGCCCTTGTCAGTGGAG 

GCGGAGTTCGGGTGAGTCACAGAGCTGGGGCGCCGTGGGGATGGATTGAAGTCGTCGGGC 

CCAGAATTCCTTTCCTTCTGCCGTGGGGCCTGACAnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTAAATTGCCGCCGCTTGGGCCGTAAT 

CMTGGTCATAGGCTGTTTTCCTGTGGTGAAMTTGTTATCCCGCTTCAACCAATTTCCC 

ACCACCAACATTACCGAAGCCCGGGAA 

Sequence 2161 

CCGCGGTGGCGGCCGAGGTACGCGGGGGAAAATGTCAGAGATGGGAAGAAACAAGAACTT 

TGACATGCTTGGTGTTCTTGCCCAAGCTTTGMGAAGTTTACAAAGTCTATATGTCAGAA 

TACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGMGAGAAAMCATTAAAAAA 

TGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAA 

GGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGT 

GAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCA 

TACTGGCATCCTCATCTTGGCTTTCCAACTTTCCAGAACTGTGAAGAAGTATATGTTT 

Sequence 2162 

CCGCGGTGGCGGCCGAGGTAC I I I M I I i M I I II I II I I M II I GAACCAAAATCATAG 

ATTTAGAGTTGGAAGTGACCTTTGTGATCATTATTTCAATCCTTTCANANMTTCTNTNT 

GCAGGAATATCTATGTAGGACTTGGATTTGACCCAAGCACATACAATGTCTCANAGGGCT 

NTNTGCTCCTGGCCAANAGCATCAGGTCTTGTTTTTAATATAATCAGNGTTGTAAATTAT 

GG C AC AN AN AACAGGCTCATTC AG CTGTGG AATG AACTG AGTTAAATGG ACTTGTAAACA 

TCCCTAAATGAAAGAATTAAAAAGTCAAATGAGGGAATTNTATAGCCGNCCCTTGGTATC 

CTCAAGGGATTGGGTTCTAGGGACCCCTGGGGGATACC 

Sequence 2163 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGTGGGAAGCTCGCAGCAGCTGGG 

GAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATG 

AGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCAA 

GGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCAA 

C 

Sequence 2164 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGAGTATTGCTTAT 
ATG TCCTTATGGGG ATGGGTGCCACAAATAG AAAATATGACCAG ATCAG G G ACTTG AATG 
CACTTTTGCTCATGGTGAATATAGATGMCAGAGAGGAAMTGTATTTAAAAGAAATACG 
AGAAAAGAAMTGTGAAAGTTTTACAAGTTAGAGGGATGGAANGGTATTGTTNMTNNTT 
GAATGCCATGGANNGGCCNAAATNGN I I Mil 1 1 NGGCCNCCAAAAACCCCCCCCTTTAA 
ANNTTNTTTTTGGGGACNTTTTGGNAAANGGTTTTTAAM 

CC TTTTTTTTT!Ti I CCNCNCCNCCAAAAGGGGGGGTNTTNTTTTNNAAAAAAAAAAAAA 
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AAAANANGGGGGGGGCCCCCNCCCTNTTTTTNGGGGGGGGGGGGGGNGGNAGNTTTTTTT 
i 1 I II 1AA 
Sequence 2165 

GGTTAAACGAGCTGTTCTTGGGGAAGTCCTCCAGTGAGTCATGCAC n I I I i 'GCACTTTT 

TGGATTTGCTAGACATTTTTCGGTTTCTGGTCTGGATGCCTTCCTTCTTCATAGTCAGGG 

GTCTGTTAATCTAAACAAAGATCAA I" I I I I I I I I I I [ I M I I I AC I I I I I IG CTGCCTCT 

GAAATGCACAGGCTCTCGCACACCAAGAAGGGTTCCACTGACCGCCCCGGATCAGGGATC 

TGTTTCCATTTCAGCTGGCCCCTTCCTCCCTCCCACCTTCTCCTGTCTCCTTCCCAGGGA 

CTGAAGMCTAAAATACATACAGTTACCAGATTACAGCCCCGTCTAGTACCTTCGGCCGC 

TCTAGAACTAGTGGGATCCCCC 

Sequence 2166 

AGGTACGCGGGAAGGAGATGTTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCT 

CCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACA 

GGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAG 

AATACACATTTCCCACCTTGCCCMCAGTAGAAAAACATAAGMGAGAAAMCATTAAAA 

AATGACAAGGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGGAGGTGGAGCCTTC 

AGNAAGGGTAATTAATGCCCTTGTAAGGAAGAGGCCAGAGAGCCTTGTGGCACCCTTCT 

Sequence 2167 

CCGGGCAGGTAC I I I M I I I I I I I I I II I I I 1 1 I I II I I GGG CAGCTAAAATCTAGGATG 
GNGTTAAGTTTCTTCATTTTGTCAATTATAT^ 

ATTTCTTGGATGAGATAGTTAATMCAAACTATTTCTCAATATTTGTATACTAAAAACTA 
GTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTATTTGAACATGGGTTTCTGAAAGGA 
G CCTATATAATAATATAAATGGTATGTANTAAATG AGG C ACTGTCTTG GCTG GGACTGCT 
ATAAAAAAATTACCATAGACCATTGGACTA 
Sequence 2168 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCCTG 

CTGAAAGATTATTTCTMCAGGCTTGTAGAGAAACGTCGGTTCATGTAMTTAGAAATTA 

TGGGGCCACTTTGCCATTCTTCACACCTGCAATGAACAGGTGTTTATCTGCAGTTCTGAC 

TTATCTCTTGAACTCCATTTGCATGTTATAGTGGGATGCAGCTGATGCCCTGTCCAGATC 

TTCTTCAGGCCACTACATCTATATATGCATTCATATTCCAGTGGCTGTGAGTGTTGGCTG 

TTGGTTGACAGAGGAGCTGCATCCTTCTGGAGGAAACTGAACTCAGCTGATGAAAGCCAC 

CTGGTCCTGGAGGTGAAGC 

Sequence 2169 

CGAATACGACTNCTATAGG(9CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCAG 

CCATTGTTGCTCCCTTTGCCCTTCAGGGTGTCTTTCCCCAGCATCTGTCCCTGCAGCACC 

CCCACCCCCTCACTCTAATTCTGAGCACTCAAATTGCCCATCAGTCCAAGCCTTCTCCTA 

ATCACAGCCCTCCTACCTCTGAGCCCCTTCATGGAGCCCTGTTCATGGTGCTTGGCATTT 

TATGACAACTAAACCAAA/v^GCAGTGAGGTTTGAAAAAAAAGAGTCAAAAGTCAATCCTG 

GCAGGGGAGGATTTGCTCTTTGTCTTAAAAAACCAAGAGTACCTGCCCGGGGCGGCCGCT 

CTMGMCTAGNTGGGATCCCCCGGGCTGCAGGAAT 

Sequence 2170 

GGTGGAATCCGGTNGTrCTTTCCTCACATTTGGGATGAATTGGTCATAAGGG I MM I AA 
GCATGGTTTCCCTCCCTTTTCTTTCACCCCTCCCCCri MM I CTTTCTAATTAAATCCA 
AGAAGAAAACCTTTCAAMGGTTTAATGGGGGAATGGGTCCGGGGATTCCTCACCAGGGC 
CTTGGAGGAAACCTTCGGTTTCCACCCTCCCAANGCCANTTTTCATGGAAAAAAAANGCC 
TNGCTTTCCmATTTAMTTCMTTACCAAAAACCTTCTTCAACCCCATGGAATNGTGG 
AAAAGAAGGTTTTTCAACCAAAAAATTC NTTTTTCAAAAAAATAAAAAAAAAGGTTAAAT 
TGG AACCTTTAAN N AAAAACCTTGGCCN NGN NTTN N N ATN ANTG AN AATAN AN NN N NCCC 
AAAAAGGTTACCCCT 
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Sequence 2171 

GTGGGCCGGGCCCGCCCCGGGGGCCAGGGGTTACGGCCGGGGGCCGGCCTCTTGGCAACA 
AGCAAGTCAAAAAGCCCAATCNTCTTTCCGGNCGGGTTGGCCGCCAANGGGCCGGGGGCC 
ATTTCGGCCGCCACCTTCTTAATGGGGCMTTTGGCTTGGGTTTCTTACCCAAACCGTTG 
GMCCAGGGGTNGMTCCAAAGGTTTTAAAAGAAAMGCCTTGGGMCCGTTCNCTTCTT 
N C C AAAC C G AAC C C C TTG GG N TTTAATTG AAAC C AN TG TTTAAAAAAG G TTC AATTC CC T 
I I I I I GCCCCACCCGTTGGTTGGTTTCCTCCGTTGGTTCCAAGGAAAAGGAAAAGGAATA 
AAAAACCMCCGGCCCCAATTCCMTTAAGGTTTGGGGAAAACCCCCGGGGAAGGAAAAA 
AAAGGGGGGGGGTTAAAAAATTAATTGGTTGGGGGTCCTTTGG I I I I I I I ! I GGAATTCC 
CCCCCCTTTTGGMTTGGGGGGMTTCCTTTTTCCC 

GCCCCTTTTTGGTTNGTTTNCCCCGGNTTTTGGGGGAAAAACCCCAAATTTTj I I I I I I I 

GGGGGCCMTTTCCTTNAATTAAGGGAAAAAAANG 

GGAATTGGGAAGGNCCCCCTTTTTTCCTTTGGN 

TCCCCCGGCCAAAAAACCCCCAAGGGGGNCCCCCGGGGG 

Sequence 2172 

CCGGGCAGGTACTTGTTGCTTTGGTGTAGTGAAATGAATGCCTTGATGAAATTGCATTGC 

ACCATTTTTGAMGAGAGMTACTCAAACGTGTCACTTCTGTTTCTTGCAAGCAACTGTG 

ATCCTGANCTGTGCCACACTTTCTTGGTTGGGATTATTTCTCGGTTTCTACTTNCCTNGT 

TCTGANAGTATGTTGGCCANNGTGTAGTAGTTGANACATAGTCNTGNATGCTCCACNCAC 

TCTAGGCMTCMCNAGCACCNCAGTAATCTTCAMCGTMTTTTTCCCATTGGGGTACN 

CACCAAGNCATTCTTCCMTTCTTNTTGGNGACTTGGAATNCTCACGNAANATAGAAACC 

CCCNCTTTACCTTGGANTAGNTTCNCNANNNAGGGCCACCCI MINI I NTTTGAAAAAA 

GGGGCCTTCCTTGGCC I INI I GGAACCACTTGNANAGGNTTAATTTTTT 

Sequence 2173 

AGGTACTTTA I ITTITTTTTTTTTTTTTn I GGCAGTGGGAGAGTTTTTTATTCAATTTG 

TGMGGACAGTTTTAAGMCAAAATGTTANTANACACCTCTTAGAAACCACTGGGTTNGT 

TCAT7TGGACCATTTTTATCTGCCACCCCAA 

AAAAAAGGTTAAAAAAATTTNCATTTCCCAAATTMTT^ 

NACCCCACNTTAAGTTNTNGCCCNTTM^ 

TTAACCAAAAAAAATTTCCANAAAAAANACCTTTTAAMNTNGGCCAAGGm mT TTA 
MTmAAMGAAMGGMGGTTCCANAAAAMTTTTTCCTTC 

CCTTG GGG AATTTATTAACC AATTAAGGGTGGGGG NTAATTANTTAATTCCTTTTAAAAA 
AGGCCANGGAAAAAAAACCCCCCCCAANAAAAAANAAANCCAAAAAANNAACCCAGANGG 
GGGAAAAAAMNAAAAGGGGAAAAAAAATTTACCCATTTGGTTTNCAAAACCNAGG^ 
C AAGG NTT ATAAAAAATTAA 1 MUM IGGAA 
Sequence 2174 

AGGTACGCGGGGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATAC 

GGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGT 

CAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTA 

AAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTT 

CAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGC 

CATGTGAGGAGCCAAGAAGCC 

Sequence 2175 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTGGCGTAGCATTCCATGT 

TCAGCTTTGACATTTATTTTCTCATATCMCCCTTTTTACAC GTGA AACACAATCCCGCT 

CTTGAATTCTTMCTGCATGATCTGTGAAACCTGTATTTATATTTTCTTCAGTGGTATTC 

TTTGTCGGTGTGTGTGTCCTAMCAAACCAAMGAAAACTTTCCAAATCTAAAGTATTCA 

TTCTCCAATTGGAGCAAGAGGAGTCAGTTAGATACTATCACGGCATTCATTTGTGGCTGG 

CTTGTCATATTTACTTATGATTGATAATAAATCTATTTTGCTTTTTAGAGCCTCCCGAGA 
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AGCCATCTGCCTTCGAGCCTGCCATTGAAATGCAAAAGTCTG 
Sequence 2176 

AGGTACGTGGCGTAGCATTCCATGTTCAGCTTTGACATTTATTTTCTCATATCAACCCTT 

TTTACACGTGAAACACAATCCCGCTCTTGAATTCTTAACTGCATGATCTGTGAAACCTGT 

ATTTATATTTTCTTCAGTGTATTCTTGTCGTGTGTGTGTCCTAAACAAACCAAAAGAAAA 

CTTTCCAAATCTAMGTATTCATTCTCCAATTGGAGCAAGAGGAGTCAGTTAGATACTAT 

C ACGGC ATTC ATTTGTG G CTGGCTTGTC ATATTTACTTATGATTG ATAATAAATCTATTT 

TGCTTTTTAGAGCCTCCCGAGAAGCCATCTGCCTTCGAGCCTGCCATT 

Sequence 2177 

AGGTAC I I I I I I I M I l I I I I I I I I I I I I I M J NACTTGTTAAAGTTC I I I I I ACTGNGG 

CTGTTATAAAATTTAAAATTATATGTGTGACTTGCATTTCA I I I I I ATTGGCCAGGGATG 

CTCCANAAGGTTGATGCCTGAAACTAGANATCCATTACCTTTCCTCCTCTCTAGGGCTTC 

AAATTAG GTTTTAATTTAAN AAC ACCTTTGCTATATATAATCAATAC AAAAAACTTTAC C 

CACTTATTGTACTTTGCCCTGTTGACACTGACCTCCCATTCTCCACCANAANAAATAATC 

AAACCCATCACCAACAAATCACCCATCCATTATTCCCCGCGTACCTGCCC 

Sequence 2178 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCGCATTCCTACTTCATTGCCCCTGATGTAACT 

GGACTCCCAACAATACCCGAGAGTAGAAATCTTACAGAATATTTTGTTGCCGTGGATGTG 

AACAACATGCTGCAGCTGTATGCCAGTATGCTGCATGAAAGGCGCATCGTGATTATCTCG 

AGCAAATTAAGCACTTTAACTGCCTGTATCCATGGATCAGCTGCTCTTCTATACCCAATG 

TATTGGCAACACATATACATCCCAGTGCTTCCTCCACACCTGCTGGACTACTGCTGTGCC 

CCAATGCCATACCTGATTGGAATACACTCCAGCCTCATAGAGAGAGTGAAAAACAAATCA 

TTGGAAGATGTTGTTATGTTAMTGTTGATACAAACACATTAGAATCACCATTTAGTGAC 

TTGAACAACCTACCAAGTGATGTGGTCTCGGCCTTGAAAAATAAACTGAAGAAGCAGTCT 

ACAGCTACGGGTGATGGAGTAGCTAGGGCCTTTCTTAGAGCACAGGCTGCTTTGTTTGGA 

TCCTACAGAGATGCACTGAGATACAAACCTGGTGAGCCCATCACTTTCTGTGAGGAGAGT 

TTTGTAAAGCACCGCTCAAGCCGTGATGAAACAGTTCCTGGAAACTGCCATTAACCTNCA 

CTTTTTAAGCAGTTTATCGATGGTCGACTGGCAAAACTAAATGCAGGA 

Sequence 2179 

CGAATNGGAGCTCCCCGCNGTGGCGGCCGACGTACNGTGATCATTACTCCCCAGCGCAGC 

TCAATCCACTTNCAGACTGNTCTAATTGTNGGAATAATTNTCCTTATGTAATTCAATNTC 

CCATCTATTAATTNATGAGATATTTATTTAAAACTGACCGTATTCCAGGAACTGGGCTAA 

ACGAAGAAGAGTAGTGAATTAATACAGAAATCACCCTNGATTIGGCAGAATATTNTCCCT 

GGTAGTGTAGGTCACCCCTATTGTGGTGATAATCCCCCANAT.CATACAGCANAAGAAACA 

ACACACAGACAAAMCCMGTCATNCTTCTAGGATGTTTTCATCCMGGATTTGAAAAGA 

TTTAGGTGATCTTCCATTNTTCATCCTATGCAATCTGGGCTATGCTACTACTTTACAGAT 

TGAAAATATATTTAAAAATCACTGAACTTTCAATAGTCACCMTGATCACTGTTCTAGTT 

GCAGTTCAATTAGGTGTTCAAGGAAACACTTATAACTTAAATT"rAATGTCTAAATCTTAC 

ACCTTTAATTG GAAAAATAATGTAATAAGTGAGC AATTAGATT ^TCTAGTATTAGAAAAA 

TAACTGAGGATTGGCTGGGCATGGGGGCTTATGTTTGGTATCCCAGCACTTTGGGANGCA 

GAGGCAAAANGGATTGGTTTAAGTCANGAGTTCCNAAAAC/vACCTGGGCAAAACCGGCAA 

GAACTNCATCTNTTNAAAAANNANNNNAANNNNNNANGGTCCTTGCCGGGCGGGCCNTTT 

ANAACTTGTNGGATCCCCCGGCTNGAAGAATTCCANTTT^NCNTATNGATACCGNGNAC 

CTGANGGGGGGCCCN 

Sequence 2180 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTGAACTGAGGAATTATAGATAAACCTTAG 
GTCAAATCATTTCACAATTGCATTGGTGGTATTGAAAAATGATGAGATTTCTCTGACAGA 
GAGCTTTGTCCTAGTTTTTGTCTTCATAGGTCAAAACTGGCAATATTCTCTTGTCTGCAA 
GATAAAGTGTTTGTGCTTCTATCACCATATGCATGAACATGTAAGAATCAGATACAATTT 
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CTGCTTCATCAGTTTCACATGTTCATGTTGTCACTGAAAAAATGCATCTACTGTTTATAG 

CTCCCAAGGAGACCCCAAATCC T I I 1 I 1 I 1 I CTTTTGAGATGGAGTCTTGCTCTTGTTGC 

CCAGGCTGGAGAGCAGTAGCGCGATCTCAGCTCACTGCAACCCCCACCTCCTGGGTTCAA 

GTGATTCTCCTGCCTCAGCCTCCCCAGTAGCTGGGATTTACAGGTGCCCGCTACCATGCC 

TGGCTAATTTTTGTATTTTTAGTAGAGACAGGCTTTCACCATGTTGGCCAGGCTGATCTC 

AAACTCCTGACCTCAGGTGATCCACCCACCTCGGCCTCCGAAAGTGCTGG GATTACAG GA 

GGTGAGCCACCCGCGCCTGGCCCCCGTATTCATTTTTTATACTGGAAAACAl mTTTTN 

GAATTTT 

Sequence 2181 

TATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAACTTATCCC 

CAGAGGAAACAGGGTTTATGAGCACTGACAGATGTCTTCCCTGGGCAAA AAAA AAAAAAA 

TAGTATATGTATACACACACACATACACATTTATATTTATATTTCTTAAAGCTT TTMT 

CCTTTCATTCCCTGATATCTCAGAGATTTCAAATCATTGAACACTGAAGTATATTTTTCA 

GGCCAGATGAAAAATTGTATTAAAACCCTATTCCTGGTCAGGCGCAGTGGCTCACGCCTA 

TAATCCCAGCACTTTGGGAGGCCGAAGTAAGCNGATCACCTGAGGTCGGGAGTTCAAGAC 

AAACCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAACTACAAAAAAATTANCCTGAT 

GTGGTGTTGTGTGCCTGTAGTCCCAGCTACTTANGAGGCTGAGGTAGGAGAATTGCTTGA 

ACCTG 
Sequence 2182 

ACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACGCGGGGCTTGCCT 

GGAGTCTGATCTGTCCCGGCCCAGTGTCCTCCAGGACCCTGGCCCCTCATGCCTCCGTGC 

TTGCGCGTGTGCCATTTCCTCTCTCCAGAGGACCTTTCCTGCTAGGACTCATGATTGTCC 

CCTCCCTGGCATTTTTTACACCTGGAGCAGCCAGAGGACGCATGCATGGCTCTTCGGAAG 

CCTTCTCCTGCCACGGCATGCACCCACACATGCGAGCCTCCCGGGTACCTGCCCG 

Sequence 2183 

CCGCGGTGGCGGCCCGAGGTACTGATCTAACCAAGATATTTTG7 I I I I CTCATCCACCAG 

TCACTTTCTCAGTCCTTTCTGTATCCCTTGCAATTTGAACAAAGCTTGGTGAATAGTGTG 

CACACAAAAAGCACACTAGGTGAAAGACAGGTACATAAAAAGGGTAAAGTCATGATATTT 

TAACAAACCTATCMGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTCTTTAACCCTCTC 

TCCACTGTTGGATGAAATTTGCTGCATTCMTTCCAGTTCCCACCCCAACTTC CTTCT TA 

ACACMGGTCAGGGTTAAGCCTTCGGTGCTTTMTCCAGAGGAAAATTACTTATTTTAAA 

MGCAGTGAAAAACACCGCATTCCTTTGGCACAGGATGAAGACCAGCTCAGCTCTTCAGT 

TGTTGATCATTGTCTATTGTTCTCCAAACAGTAAACCAGTATTTCACACTGAGATTGTCG 

GCTGCGGGTATATTCCAATTCCCCGTCTCCTCATGAATATGAAGTGAAGGGCTCTGACCC 

TGGAAGTGGTAGGTAATTGCTTGGCG 

Sequence 2184 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGAGATGCTGCCACCTAGGTTACTTG 

TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 

TGAGTCTATATGTCAGAATACACATTTCCACCTTGCCCAACAGTAGAAAAACATAAGAAG 

AGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGCAATGTGATGGTGTTTGGAGGT 

GGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTT 

CTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCAC 

TAGAAGCTAGCCATACTGGCATCCTCATCTTGGCT TTCCAA CTTCAGAACCGTGAGAAGT 

ATATGTTTGTGGTTTAAGTCAATGGTCTATGGTAATTTTTTTATAGCAAGTCCCAG^ 

AGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAGGCTNCTTTCAGAAACC 

CATGTTCAAAATAAGAGATAAGATCCTGGAAA 

Sequence 2185 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCTTGGAAAGCCAAGATGAGGATGCCAGTATGG 
CTAGCTTCTAGTGAGGGTCCTCTTGCAGGTTGCAGACAGCCGGCTTCTTGGCTCCTCACA 
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TGGCAGGAAGAAGGTGCGCAAGCTCTCTGGCCTCTTCTTACAAGGGCATTAATTACCTTC 

TGAAGGCTCCACCTCCAAACACCATCACATTGCTGACATTTTCCTGCAGGATGTTTATAA 

ATAGTTCCTGGCAAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCAT 

CCCCTCCCAGCGTCAGGTCTAAGCACACAAGTTCTTCCTTCCATCTACAAACCCCCGCGT 

ACCTCGGCCGCTCTAGAACTAGTTGGATCCCCCGGGCCTG 

Sequence 2186 

CGAGGACGCGGGAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGAC 

CCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCGGGAAAATGCAGT 

GAAGTAGAAGAGACAGGGATATCCANAAGGTTATGCAAAACATCAAGAGAAGATGAGAGG 

AGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAGCAGTAGAAAAACATAAGAAGA 

GAAAAACATTAAAAMTGACMGGMGTTMTGGMGCAATGTGATGGTGTTTGGAGGTG 

GAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTC 

TTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACT 

AGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGT 

ATATGTTTGTGGTTTAAGTCAATGGGTCTATGGTAA ! I I I I I I \ ATAGCAGTCCCAGCCA 

MGACAGTGCCTCATTTACTACATACCATTTATATTTATTATATAGGCTCCTTTCAGAAA 

CCCATGTTCAAATAAGAGATAGATACTGAAACACATTACACCCTTCACTAGTTTTTTAGT 

ATACAAATATTGAGAAAAT 

Sequence 2187 

CGAGGTACATTATCAAGTGTGAGATGTTTGGGATATACGCAAGCTGTGCCTGGAAATCAT 

GACAGTGCCTCTATGCTGTTGATGTACCAGCCATGCCGTTTCCTGCCCTATATCCTTTCT 

TMGGGTAAGTAATCTTAGTGCCTGGTGAAGCTAATTTTGTCTCACGAGCCCAACTGTTA 

ACCTGAATCCAATACCTATGCTGCTGGTGAGACAGAGTGGGTATGAAAAAACATATAACC 

TGAAAAATCTACTTATATATGCTAAACTAATAAACCTGAGGGATATGCAGGGGGAAAAAA 

TCAAATTTTAAGGAAATCTMCACAGAAAATAAATTCACAATGAAATAAAMC 

TGTGCAAGCAAAATAAAAATACTACACTTAMCATAAAATATACTTGTTAAAATATGGTA 

ACTCTTTACCCCACMTTAAAAAAGCTTTAATCACCAGTC 

CTGAGATTCTTTTCTATGAGAMTACATCACAGGTTGATTTACTTTAAAAGCAAGCCCAT 

TTAGTGGTTGAAANTAAAAAAAAAAAAAAAAA 

Sequence 2188 

CCGCGGTGGCGGCCGAGGTACGCGGGGTTCATGCTCTCTCTCTCCATGCACAAGAAAGAG 

CACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCTGACCATGCTGGCACCGTGATTGC 

AGACTTCCAGCCTCCAGGATTGTGAGAAGATAAATATCTGTGGTTTAAGTCACCAAGTCT 

GTGGTGTGTTTTATGGCAGCACAAGGTAACCCAGATGATGCCTTTCACCACTGGGTGGTC 

CAGCCTTTCATGGMGCTGTCAGCTTCTCAGCCTTGCAAAAAATGCTTGCTGGTCACTCT 

CTTTGGTCCTTTCAGATGCCTTTCTTAGTGGTGCCTCTACCAGGCCCTTGACTTCATCCT 

TTTCTGTCAAGAGGAAGGGTGAGTAAAGTACCTGCCCG 

Sequence 2189 

CCGCGGTGGCGGCCCGCCCGGGCAGGNACTTTN M I I I I I I I I I I I I I I I I I 1 1 I I I I I I 

AACNGCTGTTCACTGCTACCCATTTAGTCAACTTGGACCACCCTNTAGAGGGGCTTCAAC 

TGTTATTTTCAATACATTGTCACAGGGACAGAGGAGGGTGTTAGGGAACTNTGTGACTAA 

ANAGTTTTCCATAMTGGGTGGACCTCAGTTTACAAGCATTAAGTTGTTGGGCCAGGAGG 

GGGAACAGATCATGATTAGGAGAGTGGAAAANAAAGATGACAAATCACTTTAAGTCANAC 

AAACCTCTTTCTGTACCT 

Sequence 2190 

CGCCCGGGCAGGTACGCCTACTCAACCCGGCTGTTCACCATTGGTGGCATCAGCATCCCA 
TACACATGGAACCACACCGTTTTCTATGATCAGGCACAGGGAAGAATGCCTTTCTTGGTT 
GAAACACTTCATGCATCCTCTGTGAATCTGACTATAACCAGATAGAAGAGACACTGGGTT 
TTAAAATTCATGCTTCAATATCCAAAGGAGATCGCAGTAATCAGTGCCCCTCCGGGTTTA 
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CCTTAGACTCAGTTGGACCTTTTTGTGCTGATGAGGATGMTGTGCAGCAGGGAATCCCT 

GCTCCCATAGCTGCCACAATGCCATGGGGACTTACTACTGCTCCTGCCCTAAAGGCCTCA 

CCATAGCTGCAGATGGAAGAACTTGTCAAGATATTGATGAGTGTGCTTTGGGTAGGCATA 

CCTGCCACGCTGGTCAGGACTGTGACAATACGATTGGATCTTATCGCTGTGTGGTCCGTT 

GTGGAAAGTG 

Sequence 2191 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGG 

AAMCTTAAACCCATGCACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCMGTTTCC 

ATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGG 

GTAMTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACAT 

GTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGC 

TCCTGTGGCCATGTCAAGCMTTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTC 

TGAGAAGA 

Sequence 2192 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I II I II I I I I I I I M ill I I II 1 1 I CAGGATN 

ATTAGGAATCATTTCCMCACCAGGACAMTATGCAMGTAMGNGGGTATTGTTATTTN 

TTTGTCCCTAGTTCACATGATTTANATTGATNAAAATGNNAATTTTTACATATGTGAATA 

AAACGTAAAGGAGCAAACTGGANATTGCTGCCATGTGCTANCAATCATTGNGGGNGGGTA 

AACTGACAACATGTAACTGCATATTAANACAAATGTAGAATTTANACAAAGTNTTC CATC 

TTTTTTNAGATGTATACTTTTAAACCTCAGGA^ 

AACTTGAA 

Sequence 2193 

GGNGGCGGCCGAGGTACGCGGGGGTAAGATGGANGGAAGAACTTGTGTGCTTAGACCTGA 

CGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTT 

GCCAGGAACTATTTATAAACATCCTGCANGAAAATGAGTCMGGMGC7TTTCTTTTGAG 

CTATTTACAGCTTTTAGCAATTGNGTAAAGTNTACTCCTGTGMCAAAATTTGGAACATA 

TTTGGTTCTCTCTAACTGATTNCTCCAGMTTTGGAACTAGTTGTAGCTGAGACCAATGA 

TGCTGACCTNCCTCAAAAGCTGCATTTCTGAATTTCTGAAGGCAAACTGTCTGCCTATAT 

TGTACCTGCCCG 

Sequence 2194 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCATCCCCAAGGACA 

CGGAAGTATTTCTCATCCTGAGCACTGCTCTCCGTGACCCACACTACTTTGAAAAACCAG 

ACGCCTTCAATCCTGACCACTTrCTGGATGCCAATGGGGCACTGAAAAAGAATGAAGCTT 

TTATCCCCTTCTCCTTAGGTMGCTGGACCCACAGTTTCTTTCCCAGACACCAGAGGGCA 

GGTCCTATCCTCAACTTGAGAAAAAAAATGACAGGTCCTTATTAATTGAGCACTTAATAT 

ATTCCAATTGCTTCACCTGCCTTATCCCCTTCCATCTTCACTACAACCCTGTAAGGAGGC 

TTGAGAAAGAAGATGACATTCCCAAAGGCACATCTGGGCAAGCAGGAACTTGGGCAAGTA 

TmAACATCTCTAMCCTCAGTGM 

Sequence 2195 

TCCCCGCGGTGGCGGCCGCTTTCTCTTACTGATAGTAGGATATTTCTGCTTTAGTTATTG 
TCACCTTAAATATATTTTCAArc 

TCAMTTTTCTTAGGTATATrTCTGTCACGTTGGCATGATAACAAATGCAATAACCCAAA 

AGACCCCAAMGCTAGTGTAATCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCATGT 

TGACAGTGCGTGAATGTTCGGTAGGCTTTGTCAAGCTTGCATACAATAAATTATATATGT 

CCCTTTTCTTTTAGGGTCTCCTGTTGAAGATGGTCTTCTGAAGGCTAACTGCGGAATGAA 

AGTTTCTATTCCAACTAAAGCCTTAGAATTGATGGACATG 

Sequence 2196 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAACTCACTCC 
TTTTAAAGAAAAAAATTGAAATGTAAGGCTGTGTAAGATTTG I I I I" I AAACTGTACTCAC 
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ATTGGANCCACTAGGAATCCTGAGAAAGAGGAGTGGACAT ACTC AGAGGAGTATAGGCCA 
TTTGACTCGGCATTGGGAAACCTGGAGCCACACCTGGTCATTTTCTGTGAGATCGATGAT 
GGCACTCCCTGAAGCCTGATCCAGGTAGCCTTTGGTGTATTCATCATANGTGTACCT 
Sequence 2197 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAATATAGGCAC 

GACAGTTTGCCTTCAGAAATTCAGAMTGCAGCTTTTGAGGGAGGTCAGCATCATTGGTC 

TCAGCTACAACTAGTTCCAAATTCTGGAGAAATCAGTTAGAGAGAAACAAATATGTTCCA 

AATTTTGTTCACAGGAGTATACTTTTACTCAAATTTGCTAAAAGCCTGTAAATAGCTCAA 

MGAAAAGCTTCCTTGACTCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCAAAGG 

AGGTTGCCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCGTCAG 

GTCTCCCCGCGTACCT 

Sequence 2198 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAATATAG 

GCAGACAGTTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTG 

GTCTCAGCTACAACTAGTTCCAAATTCTGGAGAAATCAGTTAGAGAGAAACAAATATGTT 

CCAAATTTTGTTCACAGGAGTATACTTTACTCAATTGCTAAAAGCTGTAAATAGCTCAAA 

AGAAAAGCTTCCTTGACTCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCAAAGGA 

GGTTGCCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATCTCCTCCCAGCGTCAGGT 

CTCCCCGCGTACCT 

Sequence 2199 

CCGCGGTGGCGGCCGAGGTACAGAGGTGATAGATCCCTTCTTGGTAGTGGTAATTCTTTC 
CTGCATAGTACGCGGGGGCTGTAGTGGCTTCGTCTTCGGTTTTTCTCTTCCTTCGCTAAC 
GCCTCCCGGCTCTCGTCAGCCTCCCGCCGGCCGCTCTAGAACTAAGTGGATC 
Sequence 2200 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGAGGTGATAGATCCCT 
TCTTGGTAGTGGTMTTCTTTCCTGCATAGTACGCGGGGGCTGTAGTGGCTTCGTCTTCG 
GTTTTTCTCTTCCTTCGCTAACGCCTCCCGGCTCTCGTCAGCCTCCCGC 
Sequence 2201 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I M 

I I 1 1 I I I I I I I I I I IGNTTTCAAAACCCGCTTATTTCACGCGAGGACTCCTCTGCGAAGA 

CAGGAAGGATGGGGTCCCACACCTGAGCCAGGCGCGGTCTCCACTGGGTGCTCCGCAGCC 

CGGAGTGGNTCCGGACGACAGCCCCGCTTAACGGAACCTTTCNCCCAGGCTNGGGTGGAC 

CTTNTCCGCTTGTTTTCCTNAGGGCGCAAGTCAGGTTTNTTTGACACACACCAACCGAAG 

TTNAAGCGCGTGTCCCCGCGTACCTNGGCCGCTCTAGAACTAGNGGA 

Sequence 2202 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACNATGTN 

TAGTGATGAGTTTGCTAATACAATGCCAGTCAGGCCACCTACGGTGAAAAGAAAGATGAA 

TCCTAGGGCTCAGAGCACTGNANCANATCATTTCACCCNCGTACAGTTTAGGGGATCCTT 

TNTAATGACAGGAAGGCACTGCTTTCCTCAACACTGTNATCTGACCTGTGACAAGTCTGN 

ACCT 

Sequence 2203 

TCGCACTCATTTACCCGGGGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGC 

CTCATGCATCACGGAGCATGAGAAGACGTTCCCCTGCTGCCACCTGCTCTTGTCCACGGT 

GAGCTTGCTATAGAGGAAGAAGGANCCGTCGGGAAGTCCCAGCCACNGGGAAGGGCCGTT 

NNTTCTTTGTAAGNTTGGTTTCNTCCCCGGCCTGGCCCTACCCCCGCGTAACNCTCCGGG 

CCNGCTCTTAAGAAACCTANGTTGGGGATCCCCCCCCGGGGACNTGCTAAGGGNAAATTT 

CNNAATATCANAGCCTTTATNCGNATTNCNCGTTCTGTACTCTTCGAAGGGGNGGGGGGC 

CCCCCGGNTTAACCCCCAAGCNTTTNTTTGGTTTNCCCCCTTTTTTAAGGTTGG 

GGTTTTAAAANTTTGGGCCGGCCCGNCCTTATNGGGGCCGGTTANNANTTCCAATTGGGG 
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NTTCCANTTAMGGGCNTTGGTCNTTNTNCCCCTTGGNTGGGTTNGNANANAAATTTTGG 

GTTTTAATTCCCCGGCCTTTCAACCAAAA 

Sequence 2204 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTTGTAGA 

TGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTT 

ACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAG 

GAAAATGCAGTGAAAGTAGAAAGAAGACAGGGGATATCCCAGAAGGTTATGCAAAACATC 

AAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTA 

GAAAAACATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAA 

TGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAAGGTAATTAATGCCCTTGTAAGAAGAGG 

CCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGC 

Sequence 2205 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGGTTTGTAGATGGAAGGAAGAACTT 

GTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTA 

TACGGCMCCTCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAATGAAGTCTAT 

ATGTCAGMTACACATTTCCCACCTTGCCCAACAGTTNGAAAAACATAAGAAGAGAAAAA 

CATTAAAAAATGACAAGGAAGTTAATGGAAGTCACCAATGTGATGGTGTTTGGAGGTGGA 

GCCTTCANAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTT 

CCTGCCATGTGAGGAGCCAAGAAGCCGGCTTGTCTGCAACCTGCAAGAGGACCCTC 

Sequence 2206 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATG 

GAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCAPTCAAGGCTTCTGGCTC 

TTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAG 

ACTCTCTTGATCTGGAAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGAT 

TCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTG'GGATGTCCCT 

GGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGGAGTCCGGAGGGAGCAGTCT 

CTTCTGTCCCGCTGGAACCCATGGTCCACTTGAAGTTCCTTATGCTACTTTCACTGAGCA 

TCCTATG AAATACAC CAGTG AG AAATTCCTTG AAATTTGC AA 

Sequence 2207 

CCGCGGTGGCGGCCGAGGTACTTTTATAAATACGCCAGAGATTGAGTCGAGGCTCTGAAG 

AAAGTCCACCATGTGAAATTCmCCGGTGACATTATCCAGTTTGGAG TAGAC GTGACAG 

AGAATACACGGAAAGTTACCCATGGGTGTATTGTTTCCACMTAAAACTTTTTCTACCCA 

GATNGGTATNGGAAGCCCGGCTCCGCTCAGATGTCATCCATGCACCATTACCAAGTCCTG 

TTGACAAAGTTGCTGCTAACACTCCAAGTATGTACCTG 

Sequence 2208 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAG 

GAAMCTTAMCCCATGCACAMTMTTGGTGAGCCTTCAmCCCTGACTTCMGTTTC 

CATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGAATTACCCTAAT 

GGGGTAAAATTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAAGTCTCTTTGC 

AAACATGTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCT 

GCCTGCTCCTGTGGCCATGTCAAGCMTTCTTTTCTTTCAACTGCAACTGTGTGTAAGAG 

CTTANTCTGAGAAGAAATGTTCAGAAGCTCACTTGTGGGCTGCACATCTGAGCCATGTCT 

TTCCCA 
Sequence 2209 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAA 

GACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTG 

ACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACT 

CTCTTGATCTGCAAATACGACTTCATCATGCTGAAGTTGGTGTGCAAGCTTGCAGCGGAT 

TCCTCTGAGCCGCTGTCTATCGCATCTGCCTGGGGCAAGTTCACCTTCCCTGGGATGTCC 
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CTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCT 
CTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCAT 
CCTATGAAATACACCAGTGAGAAATTCCTTGAAATTTGCAA 
Sequence 2210 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGGTCT 
TTTGAACTCCTTCGGCCACCGTCGCCGCGTTCTCGCTGTGCACTCTTATTCTGCGCCTGC 
GCGCGGCTACAGCACGGTTCG I I I I I CCTTTAGTCAGGAAGGACGTTGGTGTTGAGCATA 
CCGTATCAAGGACAAGTAACTACCATGGCTCCCGAAGTTTTGCCAAAACCTCGGATGCGT 
GGCCTTCTGGCCAGGCTGTCTGCNAAATCATATGGCTGNANCATTCGNNCTATCCCT 
Sequence 2211 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACAAGCAAAGTCT 

CCCATAACTGAAACAGAGGATTCTGTTTTAGAAAAAGGCCCAGCTGAGCTTANGAGCAGA 

GMGGAAAAGAAGAAAATAGAGAGCTTTGTGCATCTTCTACGATGCCTGCAATTTCAGAG 

CTTGCATCATTGCTTATGGGAAGGAATCTTCAGATTGAAGAAATTAAACCTTTCTCTCCC 

AAGATCATCAGCCTAGAGTCGAAAGAACCACCTGCCTCTGTAGCTGAAGGAGGCAACCCA 

GCAAGAATTTCAGCCATTTAC I I I I I CTTTAAAAGGATTATCANAGGAGGTTAGCCATCC 

AGCCCGACTTTAAAAAGGGAGGAAATCAAGAAATAGGCCCATTACCACCAACTGGAAATT 

TGAAGGCACAAGTCATGGGGAGATATTTTTANAATAAGCTAAGTG 

Sequence 2212 

CCGCGGTGGCGGCCGAGGTACGCGGGGACAGCGATGTGAGCTGAGGTGCAGGCACCAGAC 

CTAGGAATTCCTAGAAAAATAGTCAGGAAGCATTTAGACACATCAAATGTTAAACGAGTC 

CTG ATTATGATG ATAATG ATGATGATTTTGGTGGTTGC AATAG CAAAG CCTTAAGTATG A 

AGGAGACGTGCCAGCTGGAAATACAGGTAGACAATGAACAACTGAATTTAGAGGACGAAG 

ACATTGAAAGCATTGATGCCACCAAATTGAGCCGTTTCATTGAGATCAACAGCCTCCACA 

TGGTGACAGAGTACCTGCCCG 

Sequence 2213 

CACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGGACCCGGGGGCTTGN 

CCCCTNATGCCTCCGTGCTTGCGCGTGTGCCATTTCCTCTCTCCAGAGGACCT1TCCTGC 

CTAGGACTCATCATTGTCCCCTCCCTGGCATTTTTTACACCTGGAGCAGTCAGAGGACGC 

ATGCATGGCTCTTCGGAAGCCTTCTCCTGCCACGGCATGCACCCACACATGCGAGCCTCC 

CGGGTACCTGCCCG 

Sequence 2214 

AGGTACAGGCATGAGCCACTGCGCCTGGCCCCATGTTTGGTTATTATTAGTGCTTAGGAA 

GAGGCACTTGCTTACATAGTAGGAGTTGAGMGCTTGGTTTGTTCTTTCCTACCCCTAGA 

TCTATTCTCACCTCCTGACCATGCTCTTTCTGCCACATCTATTATCATTACAAGTTGCCT 

TATCTGAAATTAGTGAATCAGAAAATAMGCAGGGGATACTTTGTGTAGTTTCAACGTTA 

GGGAAAGTTCAGAATACTGTCTGTCTAAACTATCTCTCTAGAAGGCCTGATGGGCCACAA 

CCTGGGCCAGAAGCATTCAGTTCAGATATGAGAATGGTGGGGTGTAAGGGGCAATGGCCA 

ATGGGCCATGGCCGGAAGGAAATTGTTACAGAGTAGTGGGAAGCCTGCAAAGACTGGCTT 

CTGTCCGTTTTTGCCTT 

Sequence 2215 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAGGGCCTT 

AGGATCTACTGGGGGGAGTCCCGGAGGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGG 

TCCCTGGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAA 

TTCCTTGAAATTTGCAAGTTGTCTGGGTTCATGTC 



Page 3)9 of 635 



WO 01/51628 



PCT/US01/00798 



Table I 



Sequence 2216 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTGACATCCCACACTACC 

TGATTTGATACACTAAGAAGGGCTTATCATTTCTGTGGTACTCTTGGCCAACAATGCACG 

TGTMCATTTATTCATTGAAACACATTAGGCAGAATTGCAGGACATTCTTCAAAATAGCT 

GTCCCAATACTCTTCAAAAGTGTCAGGTCCTGGAAGACAAAGAGATACTGAGGAACCATC 

ACAGAGTGGGAGAGGACATAGAGTGATAAAAACTAACTGTGATGTGGAATCCTACATTGG 

ATCATGGACCAGAAAGACAGCACTGATGGGAAGACTGATGAAATCTGAATAAGTCTGTAG 

NTTTGGTTTAAAGMGAAGMTMTMCMTAATGGTTTAGCTGCTGGCTCCTTAATAAA 

ATTCCCCTAGTTACTGTAATGTCTGAAAATGAACCCC 

Sequence 22 1 7 

GACTNCTATAGGGCNNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACl I i I I I I I I I i i 

ITTTTTm I GAGCCTCANTTGAACTATTCATTCTTGTCTTCCANAACTCACTTCAAATG 

GGTTTrCCATCTATCATGAGTCTAGTGTTCTCACCTTTGTCTTACMTTAATTCTTCTGC 

ANATCCCCTCACACTGATCTCCTCCCCCTGAGCTTTGGACAATTATATTCTCTCATCTCT 

TTCACATAAATCGCACTTGGAAGACAAACTCCTCAGGTTAAGCTGCCATCTGGTGTGTGT 

ATTTTTCATCAAGAACTAGCAACACCCACATCAATAGCATCCACCTTTACCATCACCTAT 

ATMGCMCTTACTTTGGGMCCTATTTCCTTATTTCAGCATTCTCATTTGMTCMTCT 

AGCCTAGGTTAGGGACACTGGAACCAAACTTTGGGCTGTCCCTGGACTGGCCTGGG 

Sequence 2218 

TACGACTACTATAGGGTGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACl i l I l l u I 

rTTTTTTTTTTTTTTTTTTTI CGNCAATTCAAAGTAAACCAAATCATCACATCAAGAAAA 

TACATTGNGCAACTNTNATGTCCCCTTNTGACAAAAAAGGCACATAGCCAGGNGTATGTT 

CCCTGGAAGCTTAAAATCTANTTGGGGACAATCAGGCAAGAGCACAAACGGCCTNTTGNT 

TGGGCTGNGGCAAAAAGCTGGATTTTATCCTAATTGCNACAGGATGACATCAGGAGAGCA 

ACATGATCTCATAGANATTCTTTTAAATATATGAATTTTTAACTT 

Sequence 2219 

ACGACTNICTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTMGC 

CGCAGGGAGATACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAAC 

CCACACAGGATCATTTCGCGAAGGAGCMGGCTGMGAACCAGACCTGGACTTTCTTAGG 

ACAAACTTACTGCAGCTTGMGGAGCCAACCATGGATTTGAGG CGTG TGAAGGAATATTT 

CTCCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGA 

GGCGATCTATGTTTAACACCATCTTACTAACCATTATTGCTATGGGTGGTATACACTGCC 

TATGTCTTTATTCCAATCCAC 
Sequence 2220 

GGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAG 

ATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAG 

CAAGGCTGAAGMCCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGC 

CMCCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAA 

TCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTTAACACCATCTTAC 

TAACCATTATTGCTATG 

Sequence 2221 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACTTAl l ! l I AT ATT A 

AGTCAGACTTCAGGATTTATTTrAGCCTTCCTTTTTGAGMGCTTTCTMGCCCTCAGC^ 

TTAAGCAGCATTTTTCCTCATCGCACACGGATCTGAGCAGGTGACCGCGGCTGGGATAGG 

TCTGTTGTGTTGTGTGCAGGTGGGCTGGCCAGCCGTTGGGTGTGTGTTTTTCCTAGAGTA 

AGAAGCACTCCAGGCTGAGATGAGAGCTGTTGAAATGAGTAACATTTCCGTCTCCTGTGA 

TACGCGCTGCTGTTGCTTCTTTCAAATGATCAGATTTA CATTC TTTTAATGGGTCCTT^ 

AAMTGTAATCGAGNGAGAAGGACTTCTAMGATTTTCTTTTGGCCTGGAATCACACNAA 
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GGGGAATTGGAAMTTACTAATTCATGAAMTG 

TTCCTTTGAACATTGACAGAAGTGGGCCGTAGGGAAGGGGMGMTTTGCCTGGATTCTT 
CACTTAACTCTTGTCACCACTGGAGCATNCTGACTCCCTAGACACAAAGTGATGGAAGTT 
ATTTTGTTTCCACTCTTAAACACTGTCTCAAGGGGGACATTGATGGGATGGGNGG 
Sequence 2222 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCAACAGAATGAGAAGC 

TACAAACAAGAAATGGGAAAACTCGAMCAGATTTTAAAAGGTCACGGATCGCCTACAGT 

GACGAAGTACGGAATGAGCTCCTGGGGGATGATGGGAATTCCTCAGAGAACCAGAGGGCA 

CATCTGCTCGATAACACAGAGAGGCTGGAAAGGTCATCTCGGAGACTAGAGGCTGGATAC 

CAMTAGCAGTGGAAACCGGTMGAATTCTGAGAGTGAGCAAATTGTCTTGCTTATGCAC 

AGCAGTCTTCACAACACATGACATTTCAGGGAAACTTCAAAGGAGTAGCANAGACAGCAN 

CCCGAGATGTGGTTTACATATTGGGGAGACAATTGGGAGCTTATTTGCGCTTATC UN! 

TCAAGGT 

Sequence 2223 

AGGTACACACAMGACAMCCTGAACTTAATTTCAAGGAAAACTTAAACCCATGCACAAA 

TMTTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGTGAGGACTCATGCTCTCTC 

CACTTTCTTCTTGGGAGGAGGGMGATTTACCTAATGGGTAMTTTGGGCAAAGCACATT 

GAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGTCTGCCCACAGTAACATGA 

GTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCTGTGGCCATGTCAAGCAAT 

TCTTTCTTTCAACTGCAACTGTGTGGTAAGAGCTTAGTCTGAGAAGAAATGTTCAGAAGC 

TCACTGTGGCTGTACCTGCCCGGGCGGC 

Sequence 2224 

CTACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACAAAGACAAAC 

CTGMCTTAATTTCAAGGAAAACTTAMCCCATGCACAAATAATTGGTGAGCCTTCATTT 

CCCTGACTTCAAGTTTCCATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAG 

GGAAGATTTACCTAATGGGTAAATTTGGGCAMGCACATTGAGTGTGCTTGTTTGGCTCT 

GAGTCTCTTTGCAAACATGTGTCTGCCCACAGTAACATGAGTTTGCGTTGACTGTCATGT 

CTGCAGGAAGCTGCCTGCTCCTGTGGCCATGTCAAGCAATTCTTTCTTTCAACTGCAACT 

GTGTGTAAGAGCTTAGTCTGAGAAGAAATGTTCAGAAGCTCACTGTGGCTGTACCTGCCC 

G 

Sequence 2225 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATGATTAGTTAAATAT 
AAGACTCCGTAATTTTTACAATTTTMCAATAATTTTATTTCTT 

3GATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATAATGCTGCTTTATGGAAAAT 

GGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTAGGAAC 

CCAGTTGAAGTCTAGAAGACAGATGATAGTATCTTAGACTAGGTTGGTATTTGAATAGAT 

ATTGGTAATATCAGTAGAATTTAATMTACATTAGAAAGAAAGAAATCAGAGAAGATTCT 

TTTATTTTCACTTGATACTTGTGTTGTTACTTTCMTGAGATMGAAAGACAGGCAAAGG 

AGAACGTTCAGGGGCAGGGGATGAGAAGAAAACAAGAATTTTATGTTGGA 

Sequence 2226 

AGGTACCCAGCCCCTCCCCTTCTGAGATCTTTCCTGTCACACACTCTTTCCCCAGCTCTG 

CACATTCTGACACTCACCACACCAGCACAGCAGAATCTCAAAAAAAAGCCACAGCAGAGG 

GCTCCGCTGGTAGAGTTGAAAATTTTGGCAAGAGGAAGCCACTCCTCCAGGCCTGGGTCT 

CACCCTCGGAGACACATCCAGTCTCAGCTCAGCCAGGCGCTGGAACTGGGTCAGCCAAGC 

ACAGACTTCATCCTGTGAAGCCAATGAATGCCACGGCCACCAAGGTTGCTAACTGCAGCT 

TGGGAACTGCCACCATCATCAGTGAGAACTTGAACAATGAGGTCATGATGAAGAAATACA 

GCCCCTCGGACCCTGCATTTGCA 

Sequence 2227 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAG 
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AGATACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACA 

GGATCATTTTNNCGNAAGGANCAAGGGCTNNANAACCCAGACCTGGACTTTNTTAGGACA 

AACTTTACTGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCT 

CCTGGCTCTACTATCAATACCAMTCATTAGCTGCTGTGCTTGTTTTAGAAGCCCTGGGA 

GCGATCTATGTTTAACACCATCTTTACTAACCANTTATTGCTATTGGGTGGGTATTACCA 

CTGCCTATGT 

Sequence 2228 

GACCTCCGCNTGGCNGCTTGGGTTCCNNNTTCCGGCTTGCCGGGCCGAAGCCGGGTTATT 

TCAGCCTTCAACCTTCAAAMGGGGCCGGGTTAAATTACCGGGTTTTANTTNCCACCAAA 

GGANNATTCCAAGGGGGGGNAATTAAACCNNNCCAAGGGGAAAANAGGNAANACCCAATT 

GGTTTGGGAAGGCCAAAAAAAAAGGGGGCCCCAANNCCAAAAAAAAGGGGGGCCCCCATG 

GGGTAAAAACCNCCGGTTTNANAAANANANGGGGCCCCCCGGCCGGTTNTTGGGCCTTGG 

GGGCCCGG iTTTTTTTTTTI NCCCCAATTAAGGGGGCCTTTCCCCGGGCCCCCCCCCNCC 

mGGAACCCGGAAGGGCCMNTTTCAAACCAAAAAAAAAATTCCGGGAAACCGGCCNTT 

CAAAAAGGNTTCCANANNAAGGGGGTTGGGGGCCGNAAAAAAANCCCCCCGGAACCAGGG 

GGGAACCCTNATTTAAAAAGGGATTTAACCCCAANGGG 

Sequence 2229 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAGTGATCC 

TCCATCTTAAAGAAGATCAGACAGAGTACGCGGGGTGTTTTTAAAAAAGMTTTAATTAG 

GAAATTTTGATACTCTTCATTACTAGGATTTATCCATTGAM 

TTTAACAGAGTCTCTGTGAGACTGTTTCACAGAAGGATGTGTGTTTACCCAAGGCTGCGC 
ATCAAAMGAAATAGATAAMTAAATGGAAAATTAGAAGGGTCTCCTGTTAAAGATGGTC 
CTCTGMGGCTMCTGCGGMTGAAAGTTTCTATTCCAACTAAAGCCTTAGAATTGATGG 
ACATGCAAACTTTCAAAGCAGNAGCCTNCCGAGAAGCCATCTGCCTTCGAGCCTGCCATT 

GGAAATGCAAAAGTCTGTTCC 
Sequence 2230 

TACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACGCGGGGTCCCGGCCC 

AGTGTCCTCCAGGACCCTGGCCCCTCATGCCTCCGAGCTTGCGCGTGTG TCATTT CCTCT 

CTCCAGAGGACCTTTCCTGCCTAGGACTCATGATTGTCCCCTCCCTGGCATTTTTTACAC 

CTGGAGCAGCCAGAGGACGCATGCATGGCTCTTCGGAAGCCTTCTCCTGCCACGGCATGC 

ACCCACACATGCGAGCCTCCCGGGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnGTTCCCTTTAGTGGAGGGGTTAATTGCGCGCTTTGGCGTAAATC 

ATGGT 
Sequence 2231 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAAATCCACTTTCATCATACTGAGTTGTGTGCAGCTGCAGCGGATT 

CCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTG 

GACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTT 

CTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAATACACCAGTGAGAAATTCCTTGAAA 

Sequence 2232 

TCGACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGTCAGGA 
GTCCCCAAGCCCACTCCCAAAATAGAAAATTCACTAAAAGGAGTCAAAGGATTCAGCATA 
CAGTAATATTAACAACTAAGATTTATTCCAGCAGGATACAMGCAAAATAAGAAAGGAAA 
AGTCGCATGGGGCAAAATCAGGTGAAAGCTTTCCAAGAGTCCTCTCCCAATGGAGTTACA 
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CAGGATGTGCTTAATACCTCCAGCAACCAGTTGTGACAATACATGCAAAGAGTGCAAAGT 
CTTGTCCACGACGGATGTTC M I I I 1 11 I T I I I 1 I GAGAAAGCCTTGCTCTGTCACCCAG 
GCTGGAGTGTGGCACGATNCTGGGTGAAGAGCAAGACTCCGTNTCAAAAAAACAAAAGAA 
AAGGAAAAGTCCAGTCCTTGTCTTCAAAATTACTTCCAG 
Sequence 2233 

NCCGGCCGCCCGGGCAGGTACNTTATGACCCAACATTTACCTCAAAAGCTNTNAATGACC 

TTTGCGGGGAACTGTCCCCMGACAACCTNAMGAGACAGCACMCATTTGCAGCTGTTT 

CTCTTTCAGCCCAAGGTCGCAAAGGATATAAGAAGCCAATGCAGCTTGGANGGAGAAAGG 

GATTCCTGTAAATCACTCACTATGATTTCCCACAGCAGGCAAGTGGATCCCCTAAAGAAA 

GAAGGCTTATAGAGTTCCCTCAAAAGAGCTCAGAACTTCAACAATGATTCTATTCACATA 

GGCATTTCTGTCTTCCAMTTTTAGCTTTAAGAGTCTGAGGAAGTACCTCGAGCGGCCGC 

CCGGGCAGGTACTGGAAAACCTCCATCTTGGCTCCCAGAGCTCTAGGAACTCTTCATCAC 

AACTAGGATT 

Sequence 2234 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACCTCTTAGGGA 
TCTMGGAAAAGATTCTGGACCAGAATGGGCCTAGGCTTGAAGTGGATCTCTTCCAAAAG 
MTAGTGMTAGTTTTTATCTmCTAAAAGATGGATTGAATGGMTTCTTCC 
AGTAAGTTCTTGTAAGTTTAATTAATTTTGTAAC^ 

CTC CTTG AG CTGGGTGCTTACTATGTGTTTATTATTTTAACCCTCAC AACAACCTC AAC A 

CCCTCATGAAGTAGGTCCTATTAGGATCCCTGTTTTGAAGATGAACTGAGGCTCATGTTT 

AGCCCAAGTTCACTGAGCTAGTAACTGGCAAAGTTAGAATCTGAAATGAAATCTGCCTGA 

TGTCAGAATGATGTTGTTCATTCATAGGCTGATATTCTT 

Sequence 2235 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI irTTTTTTTTTTTTI 

fTTTTTl GAGACAGGATCTTACTTTGTTGCCCAGGCTGGAGTGCAGTGTCACGATCACAG 

CTTGCTGCAGTCTCAAGCTTCCANACTCAAGCGATCCTCCCACTTCAGCCTCCAGNGAGT 

TGTCTTCTCA7TTTTCTGNTCCTTNAACCCTTTCCCTCTCCACTAGCTGGTTTCCCTCAC 

AGGACCMCACNCATGTGCAGTATTTGCTTCCACCTCTTANAGATTTNTCAACTACAAGG 

ACTTAGGCAAGGTTTCCCTTTTGACTGGAAGGCTTGTAGAAACGTGACAACCCGCCTTCC 

CTAACACTCATGTCACAGCGGTAGCGGACCGTATTCTGGGACAAATGGGAACTCACGACA 

TTCCCCCCGCGTTACCTGCCCGGGNCGGC 

Sequence 2236 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGATATC 

CACATATTTTTGAGAAAAATTCCC;AAGCCAGGCGAATGTGGATTGGAATAAAGACATAGG 

CAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCC 

AGGGGCTCTAAAACAGCACAGC/KGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAA 

TATTCCTTCACACGCCTCAMTCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCT 

AAGAAAGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTT 

AGTTCTCCTCTCTGGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCT 

TAGGTGAGCGCTGAGGCTTTGGCTCCTCC 

Sequence 2237 

AGGTACGCGGGGAGTCTTGACACCCTAGATCCCAAGATCTCCAAGGATTTTGGTGGGCAT 

ACCCACTCCAGCAC/.CAGGAMGCCATGGAGGGTTTCAATGGMCTTCTTCCCTTCTTTC 

CTTGAACAGCTTCTTGGGCAGGGMGCCCCCTGGGTTCTTGTNNCCTTATTGAATCCCAA 

GAAGGGCCCCGCCCTTNTGGCCCCCAAGGGAATCGGGGGGGAAACCCCCTTTGCCCATTG 

AAAGCCATTCAAGCCANGCTTTTAAAAAAGGGAAAAAAATTGNCCANGGTTGGAAAAGM 

CCCCCANGGGGGTTTTANGCCCCAGAACAGGGGCCCACCCAAAAAGCCCAAANGGGAAAG 

GCCAGGAANGAATTCCCCAAGCCCCNTTTNTTGGGGAAAAAAAAAGGGCCCTTAAGGAAC 

CGGGGAAGNCAAAAAAAAAAAAAGGCNTTGTTNGGGGGGGGGGGAACCTTCGGGGAAAAA 
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AAACCTTAGGGGAAAAAAAAGAATTGGCCAAGGTTTCNGGAAAAAGAATTTCTT ANGGA A 
AAAAGGCCGNTGGGGGTTAAAAANGGGAAGGCCCCGTTCCCATTGGAACCGGTTTTTAAA 
AAGMCCGGTTTCCCTTTTGACCTTTCAAGTTANCCCTTGGCCCCNGGG 
Sequence 2238 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCG 

GGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGOCTCGGCGCTCACCTAAGC 

CGCAGGGAGATACACCCAACCGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAAC 

CCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGG 

ACAMCTTACTGCAGC7TGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTT 

CTCCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGA 

GCGATCTATGTTTAACACCATCTTACTAACCATTATTGCTATGGGTGGTATACACTGCCT 

ATGTCTTTATTCCAATCCACATTCGC CTGGCTTGGGA 

Sequence 2239 

CCGCGGTGGCCGCCTACCTCGGCCTCCCAAAGTGCCGGGACTACAGGCATGAGCCACTGC 

GCGCAGCCCCACTTGCCCMTTCTTGACCATCATTTATTGAGCTAAATC CAAAAA GGAAA 

CTCTACCACTGACTCTTTTCTTTCTCATGGCCTCCAAAATGAATCCTAGGTTTTTCCAGC 

AGGGTAMTTCAAAAGGACTCAGGTTGGAAATTGACTTCTTGGTTAAAAAAAAAAAAGTA 

TAAACAGMGGTCCTTTTTAGAMTGAGGTTTATTTTCMGGCTACTCAGTCCTAAMGC 

CTGCATATTTACTTTTTTTCCTTCCTCCAGCCCGCGTACCT 

Sequence 2240 

NTGGCGGCCGCCCGGGCAGGTACGCGGGGGTTTGTAGATGGAAGGMGAACTTNTGTGCT 

TAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCA 

ACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGA 

ATACACATTTTCCCACCTTGCCCMCAGTAGAAAMCATAAGAAGAGAAAAACATTAAAA 

AATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAG 

AAGGTAATTAATGCCCTTGTAAGAAGAGGCCANAGAGCTTGCGCACCTTCTTCCTACCAT 

GTGAGGAGCCAAGAAGCCGGCTGTNTGCAACCTGNAAGAGGACCCTNACTAGAAGCTAGC 

CATACTGGCATCCTCATCTTGGCTTTCC 

Sequence 2241 

GGCCGCCCGGGCAGGTCCTCGGGAGATCAGGACTCTTACAGCCAGTCATTGAAGTCTCCT 
TTAAATGAGATGTCTGATGATGATGACNATGATGATAGCAGTGACTAAGGACACATTTGG 
GAGTATTTAATCAGGTGTGGCTATCCGAGAAATCAACTnGGGGGAAATGTAANATTCTG 
AG CTCTCTGTTTN GTTCTAGCC ATG AATTTGC CTG ACAAACTTGTAACCTATGTGCCTC A 
ATATATTCCATAGAAAGTAGGTCCCCCTGCCTTCTCCCACTCCTC ACACT CTTCTACAGG 

GATAGGCTTTTGCAAATATATCAGATAMTTTTT^ 
TTCTTGGAAGGTTGGGAAMGATGTTTGTTTTAACAGATCATGTACCTC 

Sequence 2242 

CCGGGCAGGTAC I U i 1 1 HI I 1 1 1 1 1 H U fl i \ \\ 1 I CTTAAGTATCAATGAGTTAAT 

TTTAAAGGTAATAAACTGTMCTTGTTTTNTTCAAAACTGCTTTTAM 

CTATTCCTGCACACTAACTGAACAAGGATGCTGGCAATGATAACTGTAAGACAGAGAAAT 

GACATGATTCTGAGCTGCATTTTTANAMGATTACTTTGGCAGCCAGGCNTTATACTGAT 

TAGATGACAGAAAGACCAGAGAANCNCAAAAGATAACGGATGGGTCTTCGGNTTAATACC 

GGGGTGACNAAATMTNTGTACAACAAACCCCCGTGACACAAGTTTACTTATGTAATGAA 

CCTTCACACNCNCCCCGGMCCTAAMTAAAAGTTAAAAAAAAAAAAGAAGAAGATGGAA 

AGACCAGAGGAGGGAGGC 
Sequence 2243 

GGCGAATTGTTTTCCNCCGNGGTNGCGGCCGCCCGGGCAGGTACNNTGATTAGTTAAATN 

TAAGACTCCGTANTTTTTACAATTTTMCAATAATTTTATTTCTTCAAGC 

GGGATTGTATTAAAACTACAGTGTGTGACTTAGAAMTGATAATGCTGCTTTATGGAAAA 
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TGGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTAGGAA 
■CCCAGTTGAAGTCTAGAAGACAGATGATAGTATCTTAGACTAGGTTGGTATTTGAATAGA 
TATTGGTMTATCAGTAGAATTTAATAATACATTAGTAAGAAAGAAATCAGAGAAGATTC 
TTTTATTTTCACTTGATACTTGTGTTGmCTTTCAATGAGATMGAMGACAGG 
GAGAACGTTCAGGGGCAGGGGATGAGAAGAAAACAAGAATTTTATGT 
Sequence 2244 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I 11 1 I II 

TTTTTTTTTTGAGGCAGGAGGATCACTTGAACTTGGTAGGCCAAGGCTGTAGGGACCTGA 

GATTATACCACTGCACTCCAGCCTGAGCAACAGAGGGAGACTCTGTCTCAAAAAAAGAAA 

AGAAAACAAAACAAMCAGAAMCCTAGAGTTTAAATCACACTTAATAAAGACTAAATAC 

AmCTCTCAAAATCAGGAAAAATATATGGTTGTTACTTCTTTATTCMCmGTAmG 

TGGTCATGGCCAATGCAATATGACATGAAAGGAAATAATATATATATAAGAAAGTAGTAA 

ACAATATTT ATTT ATAG ACT AC ATG AGC ATGTAC CT 

Sequence 2245 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATATTTGTGA 

AGTATCTCCTGGACCCCACTTGAATATGATCGTTGGAGCCAATGGAACAGGGAAGTCGAG 

CATTGTGTGTGCCATTTGCCTTGGTTTAGCTGGAAAACCTGCTTTCATGGGACGAGCAGA 

TAAGGTTGGG 1 Mil IGTGAAGAGAGGATGTTCTAGAAGGCATGGTTGAAATTGAATTGT 

TCAGGGCTTCTGGAAATCTTGTAATCACCCGTGAGATTGATGTGGCAAAAAATCAGTCCT 

TTTGGTTCATCAACAAAAAATCTACAACCCAGAAAATAGTGGAAGAGAAAGTTGCAGCCT 

TAMTATTCMGTGGGGAATC7TTGCCAGT7TCTCCCTCAGGACAMGTTGGAGMTTTG 

CTAAACTCAGCAAAATTGAACTCCTCGAAGCCACTGAAAAGTCAATTGGTCCCCCAGAAA 

T 

Sequence 2246 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAACAAAACCCACACTATCTCTGAGG 

TGTGCTTGTAGG I I I I IGTTTGTAACATTTGCCTTACTGCACTCCTGCCTTGCGACAGAG 

CMGACTCCATCAAAAAAAAAAATGTMTTTAAAAGATTAAAAATTAAGATAGAI UNA 

CAAATGTATATATCTACCATTATAGTATCATACAGMTGGTTTTACTGCCGTGAAGTTTC 

TCTGTGCTCTGTGTATCCATCTCTTCCTTCCCCTTGGTCCATGGCAACCACTGATCTTTC 

TACTGATCCCATAGTTTCACCGTTTCCAGAACGTCATATAATTGGAATCGAACAGTATAT 

AGACTTTTCGGATTGGCTTCTTCTmAGCM 

TTTTGTGGCTTGATAGTTCATCTCCTGTTAGCAGTGAA 

Sequence 2247 

NGCCGAGGTACTTCCGAAGATGGGCTTGTATCTGGTTTCGGACGGACTGTTAATGACAAT 
TTGATCGACGGGAATTGCACACCCCAGAATCCACCACAAMGAAA/\AGGTTACAAATTTA 
ACMTTTATAGTCCTTTTMTAGATTTCNTTTNTTTN7TCATAAATACTAC 
Sequence 2248 

CGACTNCTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCTAACCCTGGGTCTTCACCTG 

CATTTTCCTTTTGAGCTGCTGATGCTTCATGGCAAGGGTTCCCCGATCCTGGGGCAGAGG 

CGGCCTCTGGATCATAGGCACAGACCAGGGCTCCTGCCAGAGCTGTCAGGAAGAGGAGAG 

TCATGAACCTCATGCTTCTGTGTGCTGGAGTGGGTATGCCACC/\AATCCTTGGAGATCTT 

GGGATCTAGGGTGTCCCGCGTACGCGGGGACCTCATTCATTTCTACCGGTCTCTAGTAGT 

GCAGCTTCGGCTGGTGTCATCGGTGTCCTTCCTCCGCTGCCGCCCCCGCAAGGCTTCGCC 

GTCATCGAGGCCATTTCCAGCGACTTGTCGCACGCTTTTCTATATACTTCGTTCCCCGCC 

AACCGCAACCATTGACGCCATGTCGGGTTATTCGAGTGACCGAGACCGCGGCCGCTCTAG 

AAC 

Sequence 2249 

GTCCCGGCCGCCCGGGCAGGTACNTATTGTGTCCACTGTAAAGGTAMTGATTTNTTTTT 
TATATTGCATCAAACTTGGAACATCAAGGCATCCAAAACACTAAGAATTCTATCATCACA 
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AAAATMTTCGTCTTTCTAGGTTATGAAGAGATAATTATTTGTCTGGTAAGCA1 1 I I TAT 

AAACCCACTCATTTTATATTTAGAAAMTCCTAAATGTGTGGTGACTGCTTTGTAGTGAA 

CTTTCATATACTATAAACTAGTTGTGAGATAACATTCTGGTAGCTCAGTTAATAAAACAA 

TTTCAGAATTAAAGAAATTTTCTATGCAAGGTTTACTTCTCAGATGMCAGTAGGACTTT 

GTAGTmATTTCCACTAAGTGAAAAMGMCTGTGTTTUAMCTGTAG 

TAAATC AGC AAGG GTATTTTAGCTAATAG AATAAA 

Sequence 2250 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGACCTCAGAG 

CTGAGCTGGGCATGAGTAGATGCTCAGTAAGTGGTGCACAGGGTTGGTCCCTATGGTGGA 

GGCCCCCTAACACCGCCCAACCCCCCTCCATGTTCTCACAGCTCCACGCACTGAGCACGG 

GCATGAAGGCCATGATGTCAGAATTCTGGCACCCAGGGAGCTGAGATGTGCCGCAGGGCC 

TGTGGCGGACATGGCTACTCAAAGCTGAGTGGCCTGCCATCACTGGTCACCAAATTGTCG 

GCCTCCTGTACCTGCCCG 

Sequence 2251 

CCGGGCAGGTAC H I I I I M 1 I I I M I I M I I I I M r GCCTTCTNGGGAAGTGCTGGGAA 

CGCMCTTMGGTGTTTGTAGACACTGCCCCTCTTAGAGCCTGAMTGCCACA ATTTA AG 

CTCTAAGCCTTTTTGTTCTCTGAGTGTCCTCTANAAMGATMCATCTTCMCCCTTT^ 

GTTTTCAGGGATTCTNTGCCAGTAGCCCTGGGGGTGAAAGGAGGTCAGCAAAGGGACTTG 

CCGGATGTCCGAGTAGGAANAANATTGCTNTAAGGAGTCCAATCCAAACCTCACTCATTT 

CACAATTTTACCGGAGGCTCCTNTTGCTNTCCCCAGG 

Sequence 2252 

TNGGGCGAATTGTTNCTCCACCGCGGTGGCCCGCCCGGGCAGGTACACACAAAGACAAAC 

CTGAACTTMTTTCANGGAAAACTTAAACCCATGCACAAATMTTGGTGAGCCTTCATTT 

CCCTGACTTCMGTTTCCATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAG 

GGAAGATTTACCTAATGGGTAAATTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCT 

GAGTCTCTTTGCAAACATGTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGT 

CTGCAGGAAGCTGCCTGCTCCTGTGGCCATCCCGCGTACCTN 

Sequence 2253 

GGAGGCTATGCAGATATATTCTTTTCTCTTTAAAGTTTTGCCCACCAGTmAGM 

TCAGTAAATCTTGTCTACAGTMTTATTACTATTTGCTATTCTMTGGGAATnTGTATA 

TCTCCATCCCTTCTAATATATTTMCAACTGAATTCTATAMCAAGATTTGTTTCTTCTC 

CACATTATTTATTrTATCAATACAGACACATACTTTGGGGGTTATAA TCCA ATACTATTG 

TTATrTATTATGTNGTTCATATTGTTCTAGCTTTGGCTACTGGGATCTTTTATAAGTGTN 

TTCTGTG N CCTTTTG AC ATGCCCC ATTTTTTTCTTTTTAAGCAC ATCTT^ 

ACTACAAGATTGTCCAGGTTCTCATCTTGTGTTTTCCCTTCCCTAGCCTTGGAATGAGCT 

GN'fmCCMTGATCTCTAGTTCCTTTATTAGAAMTGGTGGCCAACATGGTGMACCCC 

ATCTACTAAAAATACCAAAA 

Sequence 2254 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTCTTCTAATTGATC 

ATATCTGCTTATTTTCCTCTGGATTAAGGATCMGGAGATAGTATATTAGATGATTTGAT 

AAATTTCCMTGCTTTGCAAAGTAGTTGMCMCTTTTTCmGTGA 

TACTAGTCMGACTTGGTATTTCATAGCTGGCTTTCTTATTCTGAAGGTTGTAAAAAGAC 

ATATGAAGTMTATITMCATTGAAAACATGATAATCAATTTGATTATCTATGAATTGTC 

CTAAACGTTCAGAGTAAAGTTTCCTTTAMGTTAAMTCTTTCAAGTGAAGGAM 

AAGTCACATGTAMCACCAAMTAAGAGGMTAGAGCMTAGGATATTTTTGGCTTTATA 

ATTCAATTTAAAAATTMGGTGCATTTATTTTTTTGGCAGCTGGC 

TCCAGTAAGCAGTTGCTATCTATGATTCATGAGATCATGGGGTT 

Sequence 2255 

CCGCGGTGGCGGCCGAGGTACGCGGGGGACANGGCCATCTCGCTATAGGAAAGGAAAGTG 
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GAACAGCATTCATCCTCMCATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTG 
ATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAAGCATTAT 
CANCAATNATGCNTNTTNCCAATCCCAAACCATGGGGGGTTTCTCACAGCTTTACACCAA 
AGGGCATCACTATCCCTCAAAGAGAGAAACCTGGACACATGTACCTTGCCCG 
Sequence 2256 

CCGCGGTGGCGGCCGAGGTACACACTTGTGTGCATTTCTGTTGGGTATATAACAAGTATG 
AAATTGCTG G GTCTTAAAATATTTATATATTMCTTC ATTAMTGCTG CCTMG AATTTT 
TCMGCGGTTGTGCTATTTTACCTTCACCAGTTCTAGTTGTTCCATATCCTTGCTGTCAC 
TTGGCATTAATCTGAATTTTTTATTTTGACCAGACTGGTCTGTGTGTCATAGTATCCCAT 
TGTCATTTTAAGTTGCATGCCTTTGATGAAGACATT^ 

AATATATCCTGTMCTGATTTGTAGGGAGNTTTTATATATATTCTGGATATAANTNTTTT 
TATNTGGACATAMTATTTCGCAGATTATCTNCTCCCACTTTTAACTTCCCI ITTTTC 
Sequence 2257 

GATCCTATATGGCGNNTAGGGGCTCCACCCGCGGTGGNGGCCGCCCGGGCAGGTACGCGG 

GGGGAGCGGCCAACATGGCGGAACGCAGGAGACACAAGAAGCGGATCCAGGAAGTTGGTT 

GAACCACCTAAAGAAGAGAAGGCTGTGGCCAAGTATNTTCGATTCAACTGTCCAACAAAG 

TCCACC AATATG ATGGG N C ACCNGGTNTG ATTATTTTATTTGCTTTC AAAAAAGC AG NTG 

NGACTTGTCTTTTTGGMTTCAAAAAGATGGGGNCAAAAGGCTCAAAATATAATGGTAGT 

AGNGAAGCCTTTTATTTTTACCAMCCMGGGGGAGGTCCTGTTGGGTTTGGACTTACCT 

GGCAAACCAGGCCTTTTTAAAAGGAAAGGCAG I I I I I 1 i I I 1 I CACCNGGAGGCNCCTTA 

AAAAAGTTAAAANTGNAAAAAATTGGAANATTAATTGGTTATNTATGGACCATATATNGA 

GNACATATANTNCATNTAATTNAANAAANAAAGGGTACCCCTTTCGGGTCNCCGCNTTNC 

TAANGAAAACCTANNTTNGGGGGATTNCCCCCCN 

Sequence 2258 

CCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGTGGCAGGAGAAGGCTTC 

CGAAGAGCCATGCATGCGTCCTCTGGCTGCTCCAGGTGTAAAAAATGCCAGGGAGGGGAC 

AATGATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAATGGCACACGCGCAAGCA 

CGGAGGCATGAGGGGCCAGGGTCCTGGAGGACACTGGGCCGGGACAGATCAGACTCCCCC 

GCGTACCT 

Sequence 2259 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCTTTAGAGAGTAGAGATACATCTGCATATA 

AAAAAAAAATCTGCCGTAAGTGATAGAGTTGAAAATAATGTCCTACATACCAGAATGTTC 

TTCTGAATTAGGCATCATGTAGGTATTCAGCATTCTTTCCATTTCCAAATTGCTCCCACT 

CCCTGI I I 1 I ICATCATCCCCCI I I I 1 1 CCTCATTAGCTCATTTGGAAATAAAAGATTTT 

TATTTTATTTTCAI WUiWW ICTGCAGAAGAAGCCACATATGTATGGAGATTTAGGC 

MTGATTTCTCMGGAAAAAATGTTnAAMGTACAGCATGGCTGGTGGTTTCAAAAAAC 

GTAGTCATTCTTCTCACTGCAACCAATGTAAGATAAGCAGGGTAGATCTGTTATTTCCAA 

ATTAAAGGTGGATTAAGATATATGGAGAGAGAACATGGCATGTGAGGTTTATAGGGCTAG 

AAACTG 

Sequence 2260 

GCGAATTGGAGCTCCCCGCGGTGGCTNAAACATC I I I I I CTTCTTAGTCCCAGGTTGTTC 

TCATTCGTGGCTGGATCACACCAGCAAGMGGATTTTAAAGGACAACTCCTTGGGGCCGT 

CCAACACTGAAACCGCCTGACCGCTGCTGTCTTTGTGCTTGCAGAAAATCACGTGGCCCC 

GGAGAGGGTCAGCGANGGGCCTCCACTGTCTGAGTCCGCTCCTGGCAACGCCTGCTCTCT 

GCACTCAGAGGCCCGGTCCCGGCAAGCTGCCACGTTGCTCAGAAGTTCTGGCAGGGGGAG 

CCTGAAACCCCTGTCTGCTGGCGACACTCCCGGAGCAGCAGGAAATCTCTACATTTTATT 

rrm icttaacagacaaaagtatttgactccagaatgatttccaggaattacagtatga 

GAACTTAGTCTTTGAGA I I ! I TATTTTCTTCTTAAAAAAATAAAAATGAAAT 
Sequence 2261 
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CCGCGGTGGCGGCCGAGGTACGCTTTGACGACACTCTCAGACATGCTGTGCAACTTAACG 

TCACTGCCACCCGGCAGCTCTTGCTTATGGCTAGTCAGATGCCAAAGCTGGAAGCCTTTA 

TACATATCTCTACTGCCTATCCAAATTGTAACCTGAAGCACATCGATGAAGTTATCTATT 

CCNGCCCTGGNGGACCCAAAAAAAATCATNGATTCCCTTGAGTGGTTAGACGATGCTATT 

ATTGACGAGATTACACCCAAGCTGATCAGAGATTGGCCCAATATTTATACCTACACCAAG 

GCCTTGGGAGAAATGGTGGTGCAGCAAGAGAGCAGGAACCTGAACATTGCCATCATAAGG 

CCCTCCATTGTGGGAGCAACTTGGCAGGAGCCTTTCCCAGGTTGGGTTGATAATATAAAT 

GGACCTAATGGAATCATTATTGCGACTGGGAMGGGTTTCTTCGGGCCATAAAAAGCTAC 

TCCAATG 

Sequence 2262 

CGAGGTACGCGGGGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCT 

CCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATAC 

ACATTTCCCACCTTGCCCAACAGTAGAAAMCATAAGAAGAGAAAAACATTAAAAAATGA 

CAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGT 

MTTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAG 

GAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATAC 

TGGCATCCTCATCTTGGCTTTCCAACTTCCAGMCTGTGAGAAGTATATGTTTGTGGTTT 

AGTCAATGGTCTATGGTAA I Hill I ATAGCAGTCCCAGCCAAGACAGTGCCTCATTTAC 

TACATACCATTTATATTATTATATAGGCTCCTTTCAGAAACCCATGTTCAAATAAGAGAT 

MGATACTGAAACACATAACACCTTCACTAGTTTTTAGTATACAAATATTGAGAAATAGT 

TTGGTATTAACTATCTCATNCAAGAAATGCAGATTCATGTTGGTTCTAATTTTTATTATA 

TAMTGACAAMTGNAGAAACTTAACACCATCCTAGATTTTAGCTGCCCCNG 

Sequence 2263 

CCGCGGTGGCGGCCGAGGTACCTTAGATTTCTATGGGACATCGTTTAAAMCTATTGTTT 

ACGCGAGAGCCTTGCTMTTTCCTAAAMTTGTGGATACATTTTTTCTCCCATGTATAAT 

TTTCTCACCTTCTATTTAAAAAGAAAAAAAAAAGTCA 

CTATAAGGAGCTAACATMCTmGAmAGTGTTATTCATAAMTTAGGTTAGCAGm 
ATTAACCTTTTGTATTTGCTCTGGCMTGTTTAATATCTCATAAGCTATACACACCTCGA 
AGCCATCAATGACAACCTTTTCTTGCTGAATAGAACAGTGATTGATGTCATGAAGACAAT 
TTTATCTCCTTTTGCCTTCCATAATTTGTACCTGCCCG 

Sequence 2264 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNNNTTTTTTT 

I i I I I 1 I 1 I 11 I I I i IGGGAATCTAAACCAAACAAAACATGATAATTTCAGAAGAGATTA 

ACTNG Ai Mill I CCTTCATCGGCAGCAGACTCTGACTTACCCACCTTTCCCACTATTTC 

TTTGAGTAAGGATTTGAATAATTTAAGCCTTCACA7TNGGAATTTT 

TTGNAAATATTATGAGTAAAMTCAGGAMTGCCTCACTGGTNCCCTTGGTTTCTCAGAC 

AACTCACTGATTTATGGTCTTGAGA.CCATAAACTCA 

Sequence 2265 

CCGCGGTGGCGGCCCGAGGTACAGGAGGGAACCAGGCGCTACAGACTGTGTAATGAGTGT 

CTTGCAGAATTTGGCATAGACAGCCTCCCCATTGACTTGGAAGCTGAACAACATCTTATG 

TCCCCATCAGATGGAGATAAGGATTCCAGATGGCACTTGAGTGAAGATGGGAATAGATCC 

TATGTGGAGATTGTAGAAGATGGGTCTGCTGATCTGGTCATCCAACAGGTTGATGATAGT 

GAAGAAAAAAAAAAAAA^AAAAAAAGGTACCTGCCCG 

Sequence 2266 

CGCCCGGGCAGGTACGCGGGATTGACATTGCAAGAAGCAAATGTTACCTCTAATAGCGTG 

GTCCAAAATAATGTCTATTTACATTCTCAGAGCACTTTGTCAAGGTTATTCAGTTGAAGG 

CTGTAACACTCTCAGGAGTTAATCAAGAGCAATGGCATCTATGTTTATTAGTGAAGGGAA 

GACTTACTTGCACCTACTGCATATCTGAGTTTACTGAGCTCAAACTGGAATTTAGGAAGC 

CAAAAAAAAAAAAAAAGAAAAAGAAAGACAGTGTAGACAGAGGCAAGAAAGTAGAGTGAT 
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TGCATTTGGGCATCTGAATGACCCATCTTGGAATCACAGCTCAGTCACTGGCTGTATGCC 

TTGAGAATGTTTCTTCTCTGTTCTGAGCCGCTGGTTCCTTATTTATGAGATAAATGAGGA 

ACTGTGCTGCACGCAGTTGGTTGGAGCCTGAAATGAGATAATGGATGTTGTCCTAGTTAG 

AAGATGAGTMTTTTAGGGCCGGATGTGTTTGGCTCACGCCTGTAATCCCACCAGTTTGG 

GAGGCCAAGGGTGGGCTGATCACAAGGTCAGGAGATCAAGACCCGTCCTGTTTAACACGG 

GGAMCCCATTCTTTACCAAAAMTACAAAAATTAGCCANGNCTTGGTGGCGGGTTGCCT 

GTAGTCCCAACTACTTAAGGAGCTGAGGCAGGAAAATGGGTGTGAACCCTGGGANGCCGG 

ACCTTGC'CAGTGGNGCTGANATTGGCACCAANTGGCCTTCAACCCTGGGCGANAGANACG 

AGGACTNTGTCTTCCAAAAAAAAAAAAAAA 

Sequence 2267 

ACTACTTAGGGGGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGAGGGAACCAGG 

CGCTACAGACTGTGTAATGAGTGTCTTGCAGAATTTGGCATAGACAGCCTCCCCATTGAC 

TTGGAAGCTGAACAACATCTTATGTCCCCATCAGATGGAGATAAGGATTCCAGATGGCAC 

TTGAGTGAAGATGGGAATAGATCCTATGTGGGAGATTGTAGAAGATGGGTCTGCTGATCT 

GGTCATCCMCAGGTTGATGATAGTGMGAAAAAAAAAAAAAAAAAAAAANGTNCCTGCC 

CG 

Sequence 2268 

CGAGGTACGCGGGGGAGCGGTGAGTGATCCT7TCAGCAAAGTCAGTCCTGGGCAGGAGAC 

GGCTTCAGGAATACTGTCAGCTTTACTGGATTCCACCATCGCTTTCCAGGACTGTTTAGG 

CCCTGGGCCCTTGAAGGGTTTGCGTGCCTCTTGTCTCCATTCATACCTCAAGAACTTTGT 

TCATGTTAA I (Mill ICACTCTATCATATGGAATTGAGCAAAAAAAAAAAAAAAAAAAA 

AAGGTACCTGCCCG 

Sequence 2269 

CCGCGGTGGCGGCCCGAGGTACCCTGCTGAAAGATTATTTCTAACAGGCTTGTAGAGAAA 

CGTCGGTTCATGTAMTTAGAAATTATGGGGCCACTTTGCCATTCTTCACACCTGCAATG 

MCAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGMCTCCATTTGCATGTTATAGTGG 

GATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCACTACATCTATATATGCATTCAT 

ATTCCAGTGGCTGTGAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTCCTGGAGGA 

AACTGAACTCAGCTGATGAAAGCCACCTGGTCCTGGAGGTGAAGCATCTTCCAAATGACA 

GCCTGCAGTCAATGACTGATGAATATGACTTCATTGCCTCATGACAGGACCTACTCTGGG 

GTATAGATCATGCTTCTAAGCTCCTCCTGGGGTCCTGCTGAGGCTCAATGCCAGCTGAAA 

CCATACCCTTGCTCACATACTTTCCCTTCTCTTTCCTTCTTCCCTTGCTTCCTTAGAGAT 

TCCTCCTGCAATGGCTTCCTCAAAAAAGCATTTTCATAAGCAACCTCATGTCAGCCTCTG 

CTTTCACAGAAGCCAATCTGANGTTMCMGCTTGTTTAACTTAGCATGTCCAACATAAA 

ATTCTTGGTTTCTTCTCATCCCCTGCCC 

Sequence 2270 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATG 

GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 

AAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGA 

AGATGAGAGGTCAGAGATGGGMGAAACAAGAACTTTGACATGCTTGGTGTTCTTGCCCA 

AGCTTTGAAGAAGTTTAC AAAGTCTATATGTCAGAATACACATTTC C CACCTTGCC CAAC 

AGTAGAAAAACATATGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCA 

GCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAG 

AGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAG 

Sequence 2271 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATGATTTT 
TTATTTTAAAATAATCTG G AAG TAATG G G AAC TTAG I I I I I CCTGAACTCCAACCAGAAT 
CCAAATTGGTTAGATGAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCGGCACTTTGGG 
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AGGCCGAGGTGGGTGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGT 

GAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGCGCCTGGTTGAGG 

CATGAGAATCGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCAAGATCATGCCTACTG 

CACTCCAGCCTGGGCAACAAAGTGGGACTNTGTCTTAAAAAAAAAAAANAAATAATCGGT 

TAGATGANGAAAGCATGTATATTTTCTATATACCAAAAAC 

Sequence 2272 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGTGCCCAGGTCTG 

GCATCCTGCACTTGCTGCCCTCTGACACCTGGGAAGATGGCCGGCCCGTGGACCTTCACC 

CTTCTCTGTGGTTTGCTGGCAGCCACCTTGATCCAAGCCACCCTCAGTCCCACTGCAGTT 

CTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGACACAGGAGCTGMGGACCACAAC 

GCCACCAGCATCCTGCAGCAACTGCCGCTGCTCAGTGCCATGCGGGAAAAGCCAGCCGGA 

GGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACGTCATCTGGCTGAAG 

GTCATCACAGCTAACATCCTCCAGCTGCAGGTTGAAGCCCTCGGCCAATGACCAGGAGCT 

GCTAGTCAANATCCCCCTGGACATGGGTGGCTGGATTCAAAC 

Sequence 2273 

CCGCGGTGGCGGCCGAGGTACGCGGGGGCATTGTTGGAAGTTTACTCTGGTTAAAGGATA 

GAGAAAAAAATGGAAAGAGGAGGGTAATCACAACAGTCTAGGCAAGAGATGATGGTAGTT 

TGGAGACAGAAGGTGTGGCAGTGGACATGAAGATAAGTGGATGGATTTGAGAGAAACTTG 

GAGAGTGAAACTGCATGGATAGTGATGGATCAATGACAGTAGACTGGGATGAATGGAAGT 

ACCTGCCCG 

Sequence 2274 

CTCCTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 I I 1 I 1 I 1 1 I I I i I 
n I I I I I I I II I I I 1 G AAAN GTTAAAAATTTTAAAATTTAATAACTTG AATATAC ATCTG 
CTTTGAMTAGGACATAMGTCATACAGTATTTTTACATMTTTTCT^ 
TAGTATGCAACTCATAAAGAAATAAAGATCTAAGGAGCNCAAAGATNCATGTATGTGATT 
TAAAAG AAG C ATTTCAAATAAACAG GAAAAAAATTACTCTCTTATTTCATACCGTATCTA 
TTGNCTTCATCATAACAGAAATCTATAATAATAAGAACTAGAGCTNTCACAGCACCTNTC 
ATNTGCTGACACTGNGTTAAACACTNTACATCCATCACCTCCTNTAATC 
Sequence 2275 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 

TCATTTCGCGMGGAGCAAGGCTGAAGMCCAGACCTGGACTTTCTT/\GGACAAACTTAC 

TGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TAC TATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCG ATCTATG 

TTTAACACCATCTTACTAACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTATT 

CCAATCCACATTCGCCTGGCTTGGGA A I I I I I CTCAAA 

Sequence 2276 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC1AGACGGGGCTGTAAT 

CTGGTAACTGTATGTATTTTAGTTCTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGA 

GGGAGGAAGGGGCCAGCTGAAATGGAAACAGATCCCTGATCCGGGGCGGTCAGTGGAACC 

CTTCTTGGTGTGCGAGAGCCTGTGCATTTCAGAGGCAGCAAAAAAGTTAAAAAAAAAAAA 

AAAAATTGATCTTTGTTTAGATTAACAGACCCCTGACTATGAANAAGGAAGGCATCCAGA 

CCAGAAACCGAAAAATGTCTAGCAAATCCAAAAAGTGCAAAAAAGTGCATGACTCACTTG 

GAGGACTTCCCCAAGAACAGCTCGTTTAACC 

Sequence 2277 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTGC 
AGTGAGCTCAAGTGTTGGGTGTATCAGCTCAAAACACCATGTGATGCCAATCATCTCCAC 
AG G AGC AATTTGTTT AC CTTTTTTTTCTG ATG CTTTACTAACTTC ATCTTTTAG ATTTAA 
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ATCATTAGTAGATCCTAGAGGAGCCAGTTTCAGAAAATATAGATTCTAGTTCAGCACCAC 

CCGTAGTTGTGCATTGAAATMTTATCATTATGATTATGTATCAGAGCTTCTGGTTTTCT 

CATTCTTTATTCATTTATTCAACAACCACGTGACAAACACTGGAATTACAGGATGAAGAT 

G AGATAATCC G CTCCTTG GC AGTGTTATACTATTATATAACCTG AAAAAAC AAACAGGTN 

ATTTTCACACCNAAGTAATAGATATCATGACACATTTAAAATAGG 

Sequence 2278 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCATGAGAGACAAAGCTGAGAAGTGTGGTG 

GAAAAGGAAGAGAGAGCTGAGGCAGACGATGTAAGATCTGCTGTTACTTCTTGCTGTGTG 

ACCTTGGGGTGAAGCAGCAAGAANGCCCTTGCTAGATGAGGCTNCTCAACTGTAGACTNC 

CCAGCCTCCANAACTGTGAAGNGAAGTGGGTGTGCCTGGGATCNNACACAGCTGCTAAAA 

GAAGGNCTNAAGAGCCACAGATGTCTT 

Sequence 2279 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAG 

GAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GCAGTGAAGTANAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGAT 

GAGAGGAGTCTATATGTCAGMTACACATTTCCCACCTTGCCCAACAGTAGAAAAAAAAA 

AAAAAAAAAAAAAAAAGTACCTGCCCG 

Sequence 2280 

CCGCGGTGGCGGCCGAGGTACGCGGGGGCATTGTTGGAAGTTTACTCTGGTTAAAGGATA 

GAGAAAAAAATGGAAAGGGGAGGGTAATCACAACAGTCTAGGCAAGAGATGATGGTAGTT 

TGGAGACAGAAGGTGTGGCAGTGGACATGAAGATAAGTGGATGGATTTGAGAGAAACTTG 

GAGAGTGAAACTGCATGGATAGTGATGGATCAATGACAGTAGACTGGGATGAATGGAAGT 

ACCTGCCCG 

Sequence 2281 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGATT 

AGTTAAATATAAGACTCCGTMTTTTTACAATTTTAACAATAATTT^ 

TGTTAGTTTGGGATTGTATTAAAACTACAGTGTGTGACTTAGAAAATGATAATGCTGCTT 

TATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAG 

TGTTAGGAACCCAGTTGAAGTCTAAGAAGACAGATGTTAGTATCTTAGACTAGGTTGGTA 

TTTGAATAGATATTGGTAATATCAGTAGAATTTAATAATACATTAGAAAGAAAGAAATCA 

GAGAAGATTCTTTTATTTTCACTTGATACTTGTGTTGTTACTTTCMTGAGA TAA 

ACAGGCAAAGGAGAAAGTTCGGGGGCAGGGGATGAGAAGAAAACAAGMTTTTATGTTGG 

ACATGCTAAN GTTAAACACCTGCTTAACTCAG 

Sequence 2282 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTGA 

CATCCCACACTACCTGATTTGATACACTAAGAAGGGCTTATCATTTCTGTGGTACTCTTG 

GCCAACAATGCACGTGTAACATTTATTCATTGAAACACATTAGGCAGAATTGCAGGACAT 

TCTTCAAAATAGCTGTCCAATACTCTTCAAAAGTGTCAGGTCCTGGAAGACAAAGAGATA 

CTGAGGAACCATCACAGAGTGGGAGAGGACATAGAGTGATAAAAACTAACTGTGATGTGG 

AATCCTACATTGGATCATGGACCAGAAAGACAGCACTGATGGGAAGACTGATGAAATCTG 

MTAAGTCTGTAGTTTGGTTTAAAGMGAAGMTMTMCAATAATGGTTTAGCTGCTGG 

CTCCTTAATAAAATTCCCCTANTTACTGTAATGTCTGAAAATGAACCCCCTATGGTGGGC 

TACTGTGACTGTTCACTGCATTTGAGAATAAAGAATAATTAAGGAAAAAAATACTCTGTC 

AGCCTAGGCAATATAGTGAGACTCTATCTGTACCT 

Sequence 2283 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACCAGGCAGGATGGAG 

CAGGATGGTGTGAGATTTCATCACACTACTCAGGATAGTGCACAATCTAACACTTATGAA 

TTGTTTATTTCTAAAGTTTTCCATTTAATATATTTGAAATGAAGTAGACCCTGTGTGTAG 



Page 331 of 635 



WO 01/51628 PCT/US01/00798 



Table 1 



CTGAAACTTTGGAAAGGAAAACTATGGATAAGGGGGACTACTAGAATACACTTCAATAAA 

ATAGCATTTTAACAAAATCCTAGTGTCCGTAGAGTAGTTCTCCATCTAGCTGTCCATCTT 

TCAAACGTGGAGAGAGGTGAGATGAGGGAGAAGTATGGTCAGAGAGGGGAGACAGGTGTG 

TGTGGTGTGGGTGTGCAGGTAGATACATAGCTCAGTGTTCACTGTGAGAGGTGAACAGAG 

AACATTCCAAGCAATGGGAGTGGCATGATACAAGGCACAAAGAAGTCAAACTGAAAGGCA 

CCTTCAGGAAAAAGAAAGGAGTCTCTTGCAATTCATACACAGAGATCGGCACTGGCAAGA 

AACAGGACTCTGGGGGAAAAATCAGGGCTTCCAAAAAAA 

Sequence 2284 

TATAGGGCGAATTGGAGCCTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAACAGT 

TATAGTCACCATCACCTGCTTCAGMTGGTCTTTTAGATTTGTGTTTGTTTTGTTAAAGT 

TGTTGGCACCAGGATGCAGAGAATCAGACTGGCCTGAGGTGAAGGAGCACACAGCCCTGA 

GGGCTTGGAACCCTGGGTCCAGTTCCTCTTCACACCCCCTTCCACTCTGAGTAGCACATC 

TCCCCAGGTGCCCATGGAACACCTGCTTTCATCCCAAATATCCGTCCACCTAGGCGGGGT 

GGTATGTTCTTACGTCTCTCTGACTTTGATGCCACTCATTCTATAGTTTAGCTGGTTTTC 

GT TCAAGATATTC TTGGTAGTAACTGACAAGTATGTTGCACATGTATTGGGGGAGGCGCT 

TCATTTTTATTTTAATACACATGTATTTCCTCCTTGCACAGGATTTTGATGGTGTG 

TATCCTAAGTGGTAGCCTTCCAMGTAGCAGTGAGTTGACATTCAGCTGCTTTTAACTAT 

TCAGGCTACCTT TTATA CTAAACCTTGAAAACTAGAATCTAATGTCTACCCCAAAAAAGT 

AAGTTCTTTGATATTTTATACTTTTTATGTACCTTGGCCCGGTCTAGAACTA 

Sequence 2285 

AGGTACCCAGACTCCAGTATGAAAACTCTCTGGGCTGTGTCCTATGATCTTCCCATGAGT 

MCTCATAGTCTTGATCCAGTGGMTCTGGCCTTCATTAGTCTCAGTGGCAAGTTGGTTA 

TGTGGAAAGTCTCTGTTCACTCACTTGGGTGAATMCAGTAMGACCTTTCTATTGTTTT 

CACTTTACATTAGGCCATGAGTATTTGTGCCTGTAGCTGCAGTTTGTGTTAGTTTCCTAC 

CCCAGGTATCTCCTGCAGCATGCAGCTTCAGTCCTACCAGACCCTCAAAACTTAAAAGCG 

AAC ACTATTTCTAGG GAGGATTTTGC AGG AAAATGG AG AAAGG GTTAC ACACAAAAAAGG 

TTAAACTACTCTATGCATGTTTCTGCAATGTGTTATCTCAAGAATTCATCTCTGTAGCCC 

ATCAGGGCAGGAGCTGGTCTCTCACCTGTTGATAATATTCCATAAGGGAGGTTCTTCCCC 

ACAANTGTTTAGTCTTCCGACGCTGGTATAGCCTGACATGATGACATTCTACTTTCATGT 

TGGTCGTGCTGCAGGGAGAATTCTGTGAGTGTCCCAATANGCTGGGAATCACTTGCTAAG 

GGTGAACCCCA 

Sequence 2286 

CCGCGGTGGCGGCNCGAGGTACTTTTAGTAGAGATGGAGTTTCACCATGTAGTCCAGGCT 

GGTTTCAAACTTCTGACCTCAGGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTAGGATT 

ACAGGTGTGAGCCACTGTGCCCAGCCAGAATATATCATTTCACTGGACTCTGCAGGTGCT 

TTGGATGATCAAGGAATAGGACATGGCTGTAGAAGTACGCGGGTGTTGAGGAGGCATATT 

TTTAAAAGAATCAAATATAATTTAGATAGGAAATGACAAACTCAGCAGACATATTATGAA 

GGAAATGAGACATACAGGAGAAGAGMTTAATGAATTGGTTGGAGGCTAGAACTTAATAC 

CCTCATTGTAGCCAAGAAGAGAGAGAGAGAAAAAAAAGGAAAACATGAAGGGAATAAAGA 

AACATGTTAGTGGGATGAGAGTCTCACATATGTATTCTGTAAAGTAGAAACTGAAGGAAG 

GGACAATATTCAAAGTGAGATGACCAAGTACCTGCCCG 

Sequence 2287 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCATCCCCAAGG 

ACACGGAAGTATTTCTCATCCTGAGCACTGCTCTCCGTGACCCACACTACTTTGAAAAAC 

C AGAC GCCTTCAATCCTGACCACTTTCTGGATGCCAATGGGGCACTGAAAAAGAATGAAG 

CTTTTATCCCCTTCTCCTTAGGTMGCTGGACCCACAGTTTCTTTCCCAGACACCAGAGG 

GGCAGGTCCTATCCTCAACTTTGAGAAAAAAAATGACAGGTCCTTATTAATTGAGCACTT 

AATATATTCCAATTGCTTCACCTGCCTTATCCCCTTCCATCTTCACTACAACCCTGTAAG 

GAGGCTTGAGAAAGAAGATGACATTCCCAAAGGCACATCTGGGCAAGCAGGAACTTGGGC 
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AAGTATTTTAACATCTTCTAAACCTC 
GT 

Sequence 2288 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I II I I J I II I M I II I I 

I I 1 1 I I CTGCTTTGATTTCA I II I IAI I I I 1 1 I AAGAAGAAAATAAAAATCTCAAAG ACT 

AAGTTCTCATACTGTMTTCCTGGAAATCATTCTGGAGTCAAATACTTTTGTCTGTTAAG 

AAAAAAAATAAAATGTAGANATTTCCTGCTGCTCCGGGAGTGTCGCCAGCAGACAGGGGT 

TTTCAGGGCTCCCCCNTGCCANAMCTTTNTTGAAGCAAACGTGGGCAAGCTGCCCGGGA 

CCCGGGCCTNTGAGTTGCAGAGAAGCAGGCTGTTGCCAGGAAGCGGACTCANACAGTGGA 

GGCCCCTCGCTTGACCCTTTCCGGGGCCACGTGATTTTNTGCAAAGCACAAAGACAGCAG 

CGGTCAGGCGGTTTTCAGTTGTTTGGACGGCCCCAAGGAGTTTGTC 

0 

Sequence 2289 

GNGGCGGCCCGAGGCACTNTACTN I I I 1 1 I I I I I I! I I I I II I I GGGATTTTTAGCTGAT 

GAMTATGGTCCCATTTATCATCAGTTCTAACTTCGTGTTACTACTGTTGTTCTTAAGGT 

CCTTAAAGTAMTTTTAAGACTCMTCTAAGTGTGTCAGTGTTGATTACCATTCTATTTA 

NAAAGGC AG AATTTGTCTTAAAAG ATATTTG N CTAAAGTGTGTGTTAGTATTTGTC AAAA 

TAAATGGAGGGAGGGAGTGGGTGAGCTGGGATAGAGGCAGGCAGAAAGCAACAGGACAGA 

GAGTTNGANANAGAGAAAGAAAAAAAA 

Sequence 2290 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCTAAAGAACTCTAGAACATACAGGGTGTAG 

ACGGCAGTCTTCTTGGGGAAAAAGAGGCTTCAGGATTGCAAGATTTCAGCTACTGTCTCT 

CCTCGGAGCAGGATTCCATCTTTTCCATGGCTGGGGCCCTTGACACATGTCACTGTACAC 

TGCTC7TCTCGCTGGGATTTCCAAAAGACCTTGGCTCAAGGTGTCACTTTTGCAGAAAAG 

CATTTACTGACTGTTGACTTGGCCAAATCCCTTGTGCTGTGTTTCCTAGCACGGNGAGTT 

GCATTTTTCACATGAACTCATGCCTACCTTCTCTTGTTTTATAAGCTTCATGAAGGCAGG 

TACCT 

Sequence 2291 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGT 

GGCAGGAGAAGGCTTCCGAAGAGCCATGCATGCGTCCTCTGGCTGCTCCAGGTGTAAAAA 

ATGCCAGGGAGGGGACAATGATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAAT 

GGCACACGCGCAAGCACGGAGGCATGAGGGGCCAGGGTCCTGGAGGACACTGGGGCCGGG 

ACAGATCAGACTCCCCCCGCGTACCT 

Sec;uence 2292 

NCGCCTGGGCAGGTACACACTTGTGTGCATTTCTGTTGGGTATATAACAAGTATGAAATT 

GCTGGGTCTTAAAATATTTATATATTAACTTCATTAAATGCTGCCTAAGAAI I I I ICAAG 

TG3TTGTGCTATTTTACCTTCACCAGTTCTAGTTGTTCCATATCCTTGCTGTCACTTGGC 

A T TATCTGTATTTTTTATTTTGACCAGACTGGTCTGTGTGTC ATAGTATCCC ATTGTCAT 

TlTAAGTTGCATGCCTTTGATGMGACATTTTCTGATGCTTAATTG GCTCTTT GAATATA 

ITCTGTAACTGATTTGTAGGAAGTTTTTTATATATTCTGGATATAAG Mill I ATTGGAC 

ATAAATATTGCAGATATCTTCTCCCACTTTAACTTCCCTTTTCTTTCTCTTAATCATGAC 

TTTCTTAATACAAAGTTCTTMTTTTAATGTAGTCCAACTTAGCAGAC ) 1 1 1 I CTTTATA 

GTTAATGCTTATTTAAGTCTTGCTCAAGAAATTTTTGTCAACTCCAGTGTCATAAAGAAA 

CTCTCTTATGTTATAGAAGCTTGGTTTTTTTTTTTT 

Sequence 2293 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 
TGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAA 
GAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGGTGTTT 
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GGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCG 

CACCTTCTTCCTGCCATGTGAGGAGCCAAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGA 

CCCTCACTAGAAGCTAGCCATACTGGCATCCTNATCTTGGCTTTCCAACTTCCAGAACTG 

TGAGAAGTATATTGTTTGTGGTTTAGTCAATGGGTCTATG 

Sequence 2294 

CCGCGG1 GGCGGCCGAGGTACACTGATTTCCGATCAAAAGAATCATCATCTTTACCTTGA 

CTTTTCAGGGAATTACTGMCTTTCTTCTCAGAAGATAGGGCACAGCCATTGCCTTGGCC 

TCACTTGAAGGGTCTGCATTTGGGTCCTCTGGTCTCTTGCCAAGTTTCCCAGCCACTCGA 

GGGAGAMTATCGGGAGGTTTGACTTCCCCGCGTACTTTGGAGTCCCCTGGTTTCTCAAG 

AATTGCCGTTGACTCTTTCTTTGGCTTCTGCTGGCACGGTAACCAGACTCCCTACAACTG 

CACTCTTTGTCTTTGTCATGGAAGCCGCGAGCCGTAGAGGTTCCGCGTGCTCTGCCGGAC 

TGTGAGCAGGNTCACTGGGTCCTTTACACTTGTGAATTCGAAGCTTGCCAGA 

Sequence 2295 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGATTTCCGATCAAA 

AGMTCATCATCTTTACCTTGACTTTTCAGGGAATTACTGAACTTTCTTCTCAGAAGATA 

GGGCACAGCCATTGCCTTGGCCTCACTTGAAGGGTCTGCATTTGGGTCCTCTGGTCTCTT 

GCCAAGTTTCCCAGCCACTCGAGGGAGAAATATCGGGAGGTTTGACTTCCCCGCGTACTT 

TGGGAGTCCCCTGGTTTCTCMGAATTGCCGTTGACTCTTTCTTTGGCTTCTGCTGGCAC 

GGTAACCAGACTCCCTACAACTGCACTCTTTGTCTTTGTCATGGAAGCCGCGAGCGTAGA 

GGTTCCGCGTGCTCTGCCGGACTGTGAGCAGGTCACTGGGTCCTTTACACTTGTGAATTC 

GAAGCTTGCCAAGATGTATCCCCAATGCATTGCCACTTCTGCCCCGGTTGT 

Sequence 2296 

CCCCGCGGTGGCGGCCGAGGTACNCGGGGGTGGACAGACATGGGAGTGCTGAATCCCCAG 

AGTTAGAGTCCACATCATGTCTCTCTGAAGAACAGNTAAAGTGTCTTCTGGATGAATGCA 

TACTTAMCAAAMTCCATCATTAAACTTTCTTCAGAAAGAAAAMGGAAGA 

ACGTAACACCTGTGTTCCCCCAGCTTTCCAGGTCCATCATCTCTAAATTGCTAAATGAAT 

CAGAAACAAAGGTCCAGAAAAACTGAGGTAGAAGATGCCAGATATTGCTTTGAGAGTGAA 

GAMTGTTGAAGCTTTCTTAAAAGGGCTACTATCTCACTAAAAGCCCTTTGACCTGGGGA 

CCATTAACCATGGTTCAAAMGCTTCTTTGTTCCACTTGAMGCCAGGMGAAAATATTG 

AAAANTGCCCTTTTCAAANTTT 

Sequence 2297 

AGCGGCCGCCCGGGCAGGTACACACACTCACGCATGCACAGGCACACTCACCGGGACACG 

CACACAGTTGTGCTGTGCACACAGGCATACACACACACACCCTCACACGCACACACACTC 

GCTGGATCACTCACACACACACATCCACGAGAGGCGCGGCGCCTCTTTGGTCGTGCAGCC 

AGCCCCTGGCTTGCTTTTGAAGGCTCTGCTAAGTCGATGGTCAGGTTGCCAAAGGGCCGG 

TCATCTCGCATCCTCCCTCCCTTCGGTGCCTCTGGTCTGATGTCCAGTGAGGATCAAAGC 

CAGGAGAGCGAGATGGAAGTTTTCCTGCTGCAACGCTTAAGCTCCGGTTGAACACAAGGC 

TTGGACTCCCCGCGTCCTCGGCCGCTCTTAGAAACTAGTGGAATCCCCCGGGGCTGNAGG 

Sequence 2298 

CCGGGCAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGG 

GAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGG 

AACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCAC 

CTTGCCCAACAGTAGAAAMCATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTA 

ATGGMGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCC 

TTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGGAA 

GCCGGCNTGTCTGCAACCTGCAAGAGGGACCC 

Sequence 2299 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTAATAGTATCA 
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GGCATTGGCTTAGACACTGAGGATAAGAGCTCTATATACTTGTGAACTTGGTGAAAGTTG 
AGAAGGATGGAGAAAGAGATAGAATGAATATAAAAATATGAGTAGAGAATTGCTTTAATT 
TG AATAG C CTCATTGTTTT AGTGCTCTGAATAG AAATAGAAAAAAATGGTTAGTATTTAC 
TGGAATTGCTATTTTTAGGTCATTCAGAGTT^ 

AGGTCTTATTTACATAAACAATTTTACAGGGCTTGTACAAATACAAGAAACCAAAAGCCC 
TAACTGCTGTTTCTCATGGGGTTAMTCTGTCTCTTCAATCTATCAAATATTTGGCTTTA 
GAATCCTAACAGTCTTTAGCTATCATTCAAAG 
Sequence 2300 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGAGGGCTGATTTATCACATACG 

TGGGTTCTGTAGGGCCCACTGTGGGACTTGAGTATGCATTGGGTTTTTGTATACTCAGGC 

GTTCTGGAACTATCCTCCATGTATACCAMGGATGGTTATTTCTACAAAAGCAGGAGGTA 

AAGCCCAATGCACAGCTTGCAGATTCCCCTCAGGCGAGAGACAGAGGTTAAGGGTAGATG 

GCAGATGACCTAAGTGGCTTGTTTTAGGAGGCCACTCCCAGGGCCACAGCTTTCATGTGT 

TTGCCACCAGGGTAGAAGGTCTTGCTGACAAGGGCAATGACTACTAACAGCCCCAGCTTG 

CATGGAAACAGAAAAGGGCATATTGCTTGTGGCCACCAGGGCTCAGGCTCTATCCCTCAG 

CAAGCTTTGGGATC 

Sequence 2301 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGAG I I I I I CAG 

AATCAGGCCTGCTTTGTCCCTAMTGCTTTCTTATTTCCCCAGAGCCCTTATTGGATTTA 

AGCCCCATATCATATCTCACATACCTGAGACAGACATACCACAGTGAACCCTGCATGTCC 

TTGTA AGTT CTTTTTCCGTGCCTAGAAGTTGCAGAAGACCAAGGAACCTTGGAAGTGACT 

TACTCTTTTGCTTCATAAAGTTGCTTTAAMCAGCCT^ 

TGTCACACATTTTTAAATTTTAAGGTTAATTTAGTCATTAATTCTGT 

TTTGGMGCTAGTTGTCTTCMTGTCTTTTACTTGGAGTTACTTGGMGmGCTTAAAC 

TTTATTCAC TTG GAG TAG AACTTCCTAG 

Sequence 2302 

TACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAC 

TTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCC 

TATACGGCAACCTCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAATGCAGTGA 

AGTAGAAGAGACAGGGAATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGG 

AGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGA 

GAAAAACATTAAAAMTGACMGGMGTTMTGGMGTCAGCAATGTGATTGGTGTTTGG 

GAGGTGGAGCCTTCAGNAAGGTAATTAATGCCCTTGTMGAAGAGGCCAGAGAGCTTGCG 

CACCTTCTTCCTGCCATGTGAGGAGCCAAGA 

Sequence 2303 

TAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGATTAGTTAA 

ATATAAGACTCCGTAATTTTTACAATTTTMCAATAATTTrATTTCTTCM 

TTTGGGATTGTATTAAAACTACAGTGTGTGACTTAGA/jVATGATMTGCTGCTTTATGGA 

AAATGGANTAATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTA 

GGAACCCAGTTGAAGTCTAGMGACAGATGTTAGT/JCTTAGACTAGGTTGGTATTTGAA 

TAGATATTGGTAATATCAGTAGMTTTAATAATACATTAGAAAGAAAGAAATCAGAGAAG 

ATTCTTTTATTTTCACTTGATACTTGTGTTGTTACTT1CMTGAGATAAAGAAGGACAG 

CAAAGGANAAAGTTCAGGGGC 

Sequence 2304 

CTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATTATCTTA 
GTGTAAATGATTTTTTGTAAAATAATTTAGTTGTTAGAT^ 

G AAAAC AAATTGC ATGG C ACTGTATCCTCTC AG G AC ATTTTGAGATAAGATGTGAAAATG 
ATACTCATTGGAACCAAAAAAAGATTGAAAATAA! I I I I GTAG AAAATATAATTAATTTA 
TTCTACTATGGTGTGTATGGTGTGTGTGTGTGTATATATATATATATATATATACTGAAA 
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TAAMTTATGAGT MTTM CTTTATGACAGGTAACACTAATAAAGTGCAGTGGCATGTTC 

CTGAAATCCCAGCATTTTGGGGGATTGAGGTGGGGMGGGATTACTTTGGAGGCCAGGGA 
GTTTG GGACTAG 

Sequence 2305 

ACTACT TAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGAGATGATTAT 

tccctci rmn i aaaatttgmgtttcctttgattccccttttcttttacaaaatggag 

TCATGACACGATCCCCGTGTCCCGTGTTGAGMCTGAAAGGTTTCTGTAGCTTTCCTTTG 

ACATGAAATTTCAAAMTATACTACTCTGTTTCACTCTGGCCTGTAATTGTTGCACTCTG 

GCCTTTAATTGTTGCAACTGATTAGACATTTTTCCTTCCAGCATTATGGTAGACTCGATG 

ccc 

Sequence 2306 

AGGTACTTTTTCAC i I i II I 1 I I I I I I I I I i I I GCATTTGAGCTAAGATTTTAACAATCC 

TGTGAAAATAGGTAATATAGTCACGTATTTGTTGAATACATTATTTCAAAGAACCACAAA 

CACACAGAAAAMTA7TTAAAAAGTCCATGGTGGGGGGAAATGATCCAGAAACAACAGCT 

CTTCTACTTCCAGCCCTAACCGTCTGGGATGCTAGATGGTTGTGGGCAAACCGTGGAAAA 

AGATMG CTTCT GTTTTTCMGATTTTATTTTGTTAGAGAGTG 

GTCTCTGTTTTTGCCTTCTAGCMTTMTATATGAC^ 

CTTTCTGCAAGACCTTTGGCTCACAGAACTGCAGGGTATGGTG 

Sequence 2307 

AG GTACTCTTG GTTTTTTMGACAAAGAGCAAATCCTCCCCTGCCAGGATTGACTTTTG^ 

CTCl MINI iCAAACCTCACTGCTTTTTGGTTTAGTTGTCATAAAATGCCAAGCACCAT 

GAACAGGGCTCCATGAAGGGGCTCAGAGGTAGGAGGGCTGTGATTAGGAGAAGGCTTGGA 

CTGATGGGCAATTTGAGTGCTCAGAATTAGAGTGAGGGGGTGGGGGTGCTGCAGGGACAG 

ATGCTGGGGAAAGACACCCTGAAGGGCAAAGGGGAGCAACAAATGGGCTGCAAGTACCTT 

GCCCCGGGCCGGCCGCTTCTTAGAAACTAGTGGGATCCCCCGGGCTTGCAAGGAAATTCG 

ATATCAAGCTTATCGAATACCCGTCCGAACCTTCNGAGGGGGGGGGCNCCGGGTACCCAA 

GC 11(11 I GTTTCCCTTT 

Sequence 2308 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGATGCTGCCA 

CCTANGTTACTTGTAGGACCCTATACGGNAACCTCCTTTGCCAGGAACTATNTATAAACA 

TCCTGCAGGAAAATGAGTCAAGGAAGCTTTTNTTTTGAGCT 

TGAGTAAAGTATACTCCTGTGMCAAMTTTGGMCATATTTGNTTCTNTCTAACTGATT 
TCTNCAGAATTTGGAACTAGTTCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAQGTTA 
TGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCC 
Sequence 2309 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGGCAGTGG 

GAAGCTCGCNGCAGCTGGGGAGGAGCCAAAGCCTCGGCAGCTCACCTAANCCGCANGGAG 

ATACACCCAACTGGGAGATGAGGAAACAGCNACCCAGAGAGGAGAACTAACCCACACAGG 

ATCATTTNCCGAAGGAGCACGGCTGAAGAACCANACCTGGACTTTCTTAGGACAAACTTA 

CTGCAGCTTGAANGANCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCT 

CTACTATCAATACCAAATCATTAN CTGCTGTGCTG NTTTAG AGCCCTG GGAGCG ATCTAT 

GTTTAACACCATCTTACTAACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTAT 

TCCAATCCACATTCGCCTGGCTTGGGAATTTTTCTCAAAAA 

Sequence 2310 

AGGTACGCGGGAGAATTTGTTGCATTGAGTTATTAAAGTAGATTATACACATTTAGCATG 

GTTAGAAACTAGATGTTACCACTGAGTGTTTTCTCCTTMTAAGTTAGTTTAAGAGGCTG 

GGGA T AATTTC ACGGTAAAATGGCAGCTTAACCCCAGTTTCAAACATTGAAACTTGAAAA 

TTTTCCTTTTTGAAAAATTACATATACACCTGGCACTCTCTATCTGTGGGTTTCA 

CAGGAGTCAACCAACCTCAGAAACAACCAATCGTGGATTGAAAATACTTTGGGGGGGAAA 
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AAATN N AANAN AAAAANAAAAANN N N ANGGTACCTGCCCGGGCGG 
Sequence 2311 

CCGGGCAGGTACGCGGGAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCC 

ACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAAC 

ATCCTGCAGGAAMTGAGTCMGGAAGCTTTTCTTTTGAGCTATTTACAGCTT^ 

TTGAGTAAAGTATACTCCTGTGAACAAMTTTGGAACATATTTGTTTCTCTCTAACTGAT 

TTCTCCAGAATTTGGAACTAGTTAGTCTATATGTCAGAATACACATTTCCCACCTTGCCC 

AACAGTAGAAAAACATAAGMGAGAAAMCATTAAAAAATGACAAGGAAGTTAA 

Sequence 2312 

AGGTACGCGGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTC 

ACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGA 

GAACTAACCCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGACCTGGACT 

TTCTTAGTGCTCATTCCTGGGCCCTCGATCACTGCATGTTTGTTACATGGTCCAGTTTGC 

TGCTTCCTGGAAGAGCATCTCCTGACTGTGATGCCTTTGAGAATAAAGTCTTTCCGAAGC 

C ATTAATATTCAGAAGGAG CCACATGG ACA 

Sequence 2313 

CCGGGCAGGTAC I M I H I M 1 1 I I 1 1 l II U I I I I I I M I I I NAANGACTGACTGTGCT 

CCANANAGTGGANAAAAATGMTTGGGAGCTGAGGAGGAATGGTANATGGAGGCTAATTA 

CAGGATATTCCAAGATGCTTGAAAATCAACATCCACATGATGAGATGACCTGGGTATCTT 

GGGTTTCTTGTGTGACTGGCANAAATTCATATNTAGAATATAAGGGGGTTAATATTGGCA 

TTCTCTAGGGTGGTGGAAGTGTCTGTTGTATCGTGTGGGGAANAAATCACCTTGACAGGA 

AAACATGGAGCTCAACACATCACTTATTCAGTCACAAA 

Sequence 2314 

CCGCCCGGGCAGGTACTATTCTCTCTCCCAGATCAAATTTTCTCCGGTGGCTAAAAAGTT 

GTTTGTGGTMCTGCAGTGAGTGCTATACCTGTAATTTTTCTGGCTCATCACTTTAAAAG 

AAMCGTGGAAAGAAGAAAGGAAAAATATTACCATGGGAACCAGAGCACCTCATACTTGA 

ATACACTAAAAGAGCAGCATCAGACAAAGGTTCAAGTTGTTCCAGTAGCAGACAGAATTT 

GACAT TATCT TTAAGTTCTACCAAAGACAAAGGATCTCAAGTTTGTAACTATGCTAATGG 

AGGACTTTTCAGTAAATATTCAGGTTCTGCACAGAGTTTGGCCTCTGTCCAG 

Sequence 2315 

ACTATAGGGCGAATTGGNNCTCCCCGCGGTGGCCGGCCGCCCGGGCAGCTACGCGGGATA 

AGAGGCCAGAGGAAGAGCGCTTCGAGCTTCTTCAAGGAACTTAGAGAAGAGCGGCATTGT 

GCTCCTTCTGGGACCCCCACAGGACCAGAGATCCTTGCTGCTGCAGTTCCTCCCTCTTCC: 

CTAAAGAACAATAGGGAACMGTAGMGTGGTAGAATTTCACAGCAATAAAAAAAGAAAA 

TTGACGCCAGATCATAACAAGAACACAAAGGCTAATCCTAGTGTTTTGGAGAGAGATGTG 

GATACACAAGAATTTAACCTAGAAAMGCTCGTlTAGAAGTGCACCGGTTTGGTATCACG 

GGTTATGGAAAAGGAAAGGAGAGAATCCTGGAACAGGGAACGTGCCATTATGCTGGGCGC 

TAAGCCTCCTAAAAAGAGTTATGTGAATTAC 

Sequence 2316 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGTTCTTTTTTTNT 

mrm nttnttggagcagctaaaatctaggatggngttaagtttcttcattttgtcaay 

TATATATAAAAAATTAGAAACAACATGAATCTGCATTTCTTGGATGAGATAGTTAATAAC 

AAAC TATTTCTCAATATTTGTATACTAAAAACTAGTG AAGGTGTTATG n gttcc agtatc 

TTATCTCTTATTTGAACATGG GTTTNTGAAAGG AG C CTATATAATAATATAAATGGTATG 

TAGTAAATGAGGCNCTGTCTTGGCTGGGACTGCTATAAAAAAATTACCATAGACCATTGA 

CTAAACCNCCAACATNTNCTTTTCACAGTTCTGGAAGTTGGAAAGCCAAGATGAGGATGC 

CAGTATGGCTAGCTTCTAGNGAGGGTCCTNTTGC 

Sequence 2317 

CCAGACGGAAGCTGACATTCGCTATTGATGTTCCAGCATTGATTGAGCTCCTGCCTCGTC 
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TGCATCAAGCATG GTCCT CGCAGCTTCACGCAGTTAGAGTCCACATCACGTCTCTCTGAA 

GMCAGTTAAAGTGTTTTCTGGATGMTGCATACTTAAACAAAAATCCATCATTAAACTT 

TCTTCACAAAGAAAAAAGGMGACATTGAGGACGTAACACCTGTGTTCCCCCAGCTTTCC 

AGGNCCATCATCTCTAAATTGCTNAATGAATCCCAAAACAAAATGGTCCANAAAAACTGG 

AGGTANGAAGATGCANATAGTGCTTGAGAAGTGAAGAATGTGAAGCTTTCTAAAGGCTAC 

TTTCTCACTAAAGCCTTGACTGG 

Sequence 2318 

GCGGCCCGAGGTACAATTGTGTCATATTCATGCACMCTTCATTTTAGTCTCTATTTGAC 

TTGATTTCTTTAGTGGATTTGGGCTTTTTGGCCAATAAGATAMTACCTGCMGGGACAT 

CACATAAAATTCGATCATAGAAGAGGATCTCTTTCCTGCCGTCCACATCTATCTGGAGCC 

TGGGTATGCTGGAGGCATCATGGTTGACCACCCCCGCGTACAATGGAACAAGGAGATAAG 

CAGTGAAAGGCCAAGGGAATGTCTGGAGTTAGGACTTCAGGTGATTCACAACTTGGCTGC 

CACTCACCCGAGACTGCCCAAGCCCAGATTTCTTNCTTCTATAAGAATATTGATTCTTGC 

AAATAAGATGAACCTAAATGTGGTCCAGGAGCCAGCATCTTCTACATGGTACCTGCCCG 

Sequence 2319 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGGCCNGGGTCCGCGGGNAGGGAGAA 

TGCTG CCACCTAG G CTACTTGTAGGACCCTATACGGC AACCTCCTTTGCCAG G AACTATT 

TATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACAT1TCCCACCTTGCCC 

AACAGTAGAAAAACATAAGMGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAG 

TCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAG 

AAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCMGAAGC 

Sequence 2320 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACGCGGGGGGACAA 

GGGGCTTGAGCGTTCTGTGGAGAGAGTGCGAGGTCAGGCCATGAACTTGGGAGATGGTTT 

AAAGCTTGAAACTGAATTACTGGATGGAAAAACCAAGCTAATA7TGTCTCCATATGAACA 

TAAATCAAAMTTTCTGTGMGATGGGAAATAAGGCCAAGATTGCAAAATGTCCTTTAAG 

AACAAAAACTGGGCACATTCTAAAATCAACACAAGATACTTGTATTGGGAGTGAAAAACT 

TTTGCAAAAGAAGCCAGTTGGTTCAGAMCATCACAGGCAAMGAMGAGATTTCACCTG 

CAACTCCTAATATGCAGAAGACTAGAAACACCGTAAATACATCTCTAGTAGGTAAACAGA 

AGCCTCACAAAAAACACATNACAGCTGAAAACATGAAGAGCAGTTTG 

Sequence 2321 

ATCATACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGNCGGGTNCCCGGG 

GGANATGCTGCCNCCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAA 

CTATTTATAAACATCCTGCAGGAAAATGTCAGAGATGGGAAGAAACAAGAACTTTGACAT 

GCTTGGTGTTCTTGCCCAAGCTTTGAAGAAGTTTACAAAGTCTATATGTCAGAATACACA 

TTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAA 

GGAAGTTAATGGAAGTCAGCAATGTGGTGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAAT 

TAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAG 

CCAAGAAGCCGGCTGTCTGCAACCTGCAAGAG 

Sequence 2322 

GGGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGAGTCCAAAGACGTAAACGT 

GCCTTCAGCAGGG AAGTT CGGAGGTCTTGGATTCTGATGGTMTATTCTTTTCATAAAAA 

GGAMTMGTMTATTTTTGTCGGCAAGATCACAGTCCCTAGCTATGCCTTAAATATCAT 

CTCAGCTGAATTCTGTGTTCCACTCCACAGCCTTCTTTGCAACACTGGAAGGTTAACGGA 

TTTGMGTAMTGMTTGTMCTGTMTTTCAAC^ 

GTTTGGACCGTGAACAGGCTAGCCAAGAACAAGTATTGGGAGGTCCATGTAATATNGAAG 
AGAGTTGATATTTGGCCCAACTTCCTGATACTTTAAAAACAGAGAAGTNAAACAAAATAT 
GAGGATNTCTTTTGGCCAGCAAGTCC 
Sequence 2323 
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CNCTTAGGGCGAATTGGAGCTCCCCCGNGGGGNCGNCCNGGGACNACCCTTGGNTTTCCA 
GNGGGCNTCAACANA I rTTTTTTTTTl I AANAAAATGAATTCACTGAGGTTTAGAAGATG 
TTAAAATACTTGCCCMGTTCCTGCTTGCCCAGATGTGCCTTTGGGAATGTCATCTTCTT 
TCTCAAGCCTCCTTACAGGGTTGTAGTGAAGATGGAAGGGGATAAGGCAGGTGAAGCAAT 
TGGAATATATTAAGTGCTCAATTAATAAGGACCTGTCAI I 1 I I I I I CTCAAGTTGAGGAT 
AGGACCTGCCCTCTGGTGTCTGGGAAAGAAACTGTGGGTCCAGCTTACCTAAGGAGAAGG 
GGATAAAAGCTTCATTC I III ICAGTGCCCCATTGGCATCCAGAAAGTGGTCAGGATTGA 
AGGCGTCTGG I I I I I CAAAGTAGTGTGGGTCACGGA 
Sequence 2324 

GGNGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGGAAAA 

CTTAAACCCATGCACAMTAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGT 

GAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGGGTAA 

ATTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGT 

CTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCT 

GTGGCCATGTCAAGCAATTCTTTCTTTCAACTGCAACTGTGT 

Sequence 2325 

GGCGGCCGCCCGGNCAGGTACNCGGGGTAATTGACATTGTGGACTTCCTGGGGTAGTCCT 

GAGTAAAAAGCACTTMGCACCTGTGGTACATCAAAGAGTCTCTTTGAGAGTTCTGCAAA 

GATTCAAGTGTGTATACCTGAGTCTATATATCAAAAAGTAATGGAGATNAAATNGAAGAA 

NGTANGAANGAGCCTNCTAAGAAGCCATCTGCCTTCAAGCCTGCCATTGAAATGCAAAAC 

TCTTGTTCCAAATAAAGCCTTTGAATTGAAGAATGAACAAACATTGAGAGCANATCCGAT 

GTTCCCNCCAGNAATCCAAACAAMGGACTATTGMGAAAATTCTTGGGATTCTGAGAGT 

CTCTGTCGANGACTGTTTCACAGGMGGGATGGTGTGTTTACCCAAGGCTACACATCAAA 

AAGAAAAAAAAAAA 

Sequence 2326 

CCGCGGTGGCGGCCGAGGTAC 1 1 1 i 1 1 I 1 i I I I 1 II 1 ! TGTTTTTCACTGTGGTAGGGTG 

GAGCTTAATTGGGCANACCTGTCCTCAGGGACCCTGGTGGTGCATGTCAAGCAAGGCTGG 

CTGTGGTAGTCAATGGGCCAGGTGATAACCAGGTCTCTGGTGGTATGTCAAANGGGGGGC 

CANANGNGCTTGGACTTGNGAACCATGTTACTGGAGGAGGTGAAANTACCTTTTCAGTGG 

AAAGCCTCCATAGGCAGGAAGCTAGGGACACAAACTTTGCTAATGCTTTAGCCCCAAAGG 

CTACAACCCAGCACAGCAGCAGTTATGAGCAGNGAAATTTACCCTTGAGGTACCTTGCCC 

GGGCGGCCCGGNTNTAAAACTAGTGGGATCCCCCGGGCTTGC 

Sequence 2327 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGAATTGGAAACT 

ATAAAAAAGTCACTGCAGATTTGTTAAAAAAAAAAAGCCAAATATAAATTTGAAAACTTG 

AAAAACCAAACAGTGAAAATTAAGAACTCCAAATGTGTTAGTTATCTATTGCTGTGTAAC 

ATACTGCTCAAACTTGACTTAACATNACCTTTNATTATCTCAAG III I I I GGTTGGGTCA 

GGAAATCCAAAGCCTTGGGCCCTMGNTGGGGTTCTCTGGCTTTTAGGCTTGCAAATCAA 

AAGGCATTNGGTCGGGGGGTANGGGTCTCCTNTGGGAGGCTGAAGCTGGG 

Sequence 2328 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACACACACACAGGC 
ACACGCAGGCACACACGCAGACACACACACACATAAACACACAGAGTTCACTAGTCCGAG 
TTACTGATTTTCTTAGGATTCTC AAAGTGACAACACC GGAAACAAG GTAATTC ATGTTAA 
AACACAAGGGGTTATATTCAGTTAAGAGAATGGGGATCCCCGAAAGTAAACCCGNGGGAA 
TTTTGGAATCCMGTTTTCGAAAGMGCTTAAAAAA 

CATTTCAACCCmCTTGGMTTCCCTTAMGAAAAATACCAGGANGGmCAAAAATT 
NGGAAAAACCATTTTNCCGGGTTTTTTNNGGGGATTNCCCAAAMGNTTTGG^^ 
im t GNGGNAATNAANGGCCCTTTANN M 11 1 I I I I AAAA 
Sequence 2329 



Page 339 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCGAAAATAACGTCAATTTTGT 

CAGGCTACTGAATTAGGTGACAGAATTTAAATTTAMTACAAAGTCACGTGGCTTTAGCA 

CCCAAATTCTTAACTAAAGTTCTGTGCAGGATAAGACTTATGACACTGGGATTATTTGTG 

CTCCCTTTATGCATCTGTGTTAACCCACACATATTACCAGCCCTGTAGTTAATAGGTTAA 

TAAAGAATTAACTGCTGCATAAAATATTCAAGAGAGGAGGGGCTTACTCCACAACTGGAT 

CCTTGAGTCTCACCTTCTCTACATCTCCCAGACCCTCGGTGTCCAGCAGAACTAGGATGT 

GGCCTGGCTTCTTGGGGTGGGGCACACACCACATCCAGATTCCTTTAAGTGTGAAGACTG 

CACCGTGGAGCCCAGAGAGAAGCCTGCAAG 

Sequence 2330 

AGGTACGCGGGGAAACACTTCTTACCTCAGAGCTGAGCTGGGCATGAGTAGATGCTCAGT 

AAGTGGTGCACAGGGTTGGTCCCTATGGTGGAGGCCCCCTAACACCGCCCAACCCCCCTC 

CATGTTCTCACAGCTCCACGCACTGAGCACGGGCATGAAGGCCATGATGTCANAAATTCT 

GCACCCAGGGAGCTGAGATGTGCCGCAGGGCCTGTGGCGGACATGGCTACTCAAAGCTGA 

GTGGCCTGCCATCACTGGTCACCAAATTGTCGGCCTCCTGTACCTGCCCGGGCGGCCGCT 

CTAGAACTAGNTGGATCCCCCGGGCTGCAGGGAATTCGATATCAAAGCTTATCGATACCC 

GTCGACCTCGAGGGGGGGGCCCGGTACC 

Sequence 2331 

NCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTTT 

I l | | | | U I I I I I I II I I I M GACATG AACTAATTTAATCCTNTTAACAAATTTACAAGG 

MCATATTCCCCTCACTTCATTATGTAMTAATAAMCAAAAGCCTAGAAGGGTTAGTAA 

CTTGCANATCATATAGTCACTTTGCAGCAGGTCTGANATTTGGGTCCCANANAGGCAGNT 

CCAGCCACCATGCAACCGAATCCTGCGCAGCACAGTTCGGNGTCATT CAACAA GACGGNA 

TCCATGTTG ATTAAGG AATCTTAGCT NTTAAAAG G NTCTAC N AATTACA l I I I I iCCACC 

CAAGC 

Sequence 2332 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAAAAAAAAAGAAG 

TGTTTATTTGAAATTCAAATATAATTTGACATCATCCCTTATTATCATTTGCTTM 

GTAATCCTAACTAGGAAGAMGCAAAAAAAAAGGCTTCTGTATATTCATATTAGATGCTA 

AAATGTATTCAAATTAGATGCTAAAATTCATGCACTTACCTTGCAGAAGG CTCAC TCTCT 

CAGGACCTACAAACTGGCAACTCTCACGCCAMTTTACCTCACTGATATATTTTATTTGC 

TMGCAAAATATTTTAATTTCMTTAGTTACCATCATTAAAATGAATAGAGTAGACATAA 

AAAAATGATTTCATATTCTGGCTTAAAAAAATTTGCC AGCCTG GGTAACATAGCAAG GAT 

CCTGTCTCTACAAAGAAAAAAAT 

Sequence 2333 

CCGCGGTGGCGGCCGAGGTACAGTAGGAGTGCCCAGACTCGGGGAGAGGCAAGCTGGCGC 

GTCTCCAAGGTGCTTGTCACTCACCACTAGAACGTGGTCCTTTACCACTAGACAAGATCT 

CCTGGTTTGACAATGCAGGTGACACAGCTGAAACTTTATCTCAACACGGCCTGAAGAATA 

CCACTGTCTAAATATGAGGTGCTAAAATACTATGACCTACTCTAATATTCTCTCCCAACT 

CTGTCCATCCTCGTGACTGGCACCCATGCTGGCCCAAATGATGACAACCTCCTGTTCTAA 

GTGCACAAGCCGCACATTTMTAMCCTTTCACCCAGTTCCTTCACTGGTGTTCATCTCT 

CGGCTTCACTTACAACCACTCCCTGTGGCTTTTGCAGTGAAGCCTCTTAGCCCAAAACTC 

TTCCACTCCCCA 
Sequence 2334 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGAAGGAGG 

GCACTAGGGGGAGTTAAGM7TTGTGAGTTAGGTAGCCAAGATTGTTGAGGTTGGAAGAA 

TGAGGAAAGAAAGGGGATCTGTGGCAAAAATATGCTATTGGATCCACAGTTCTAGTTTAA 

GGACATGTCTAGTCCCTTTTCCAGCCCCCAGTTAATTACAATCCCCTGCTGCCCAAATGG 

AGAGGGAGGTTGTTCACCAGTAGAGGGAGCCAGGGGCTGGGTGTCTCAATAGAAACTGAT 

CTGGTCCCAGAAGGGGCCTTTAGATTrAAGTTTTTCGGATCGAAGGCAGAGCAAAGTGTC 
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CCTTACAACCAATGGCTAAAATGTATGCCATTGTAGTAGATCACATTGTTAGTTTTCTTT 

0 

Sequence 2335 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACAGGCCA 

TCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCTCMCATTTTTACGAAGACAAA 

ATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAA 

GATGTGGTGTGAGAAGCATCATCAGCATCATGCTATTACAATCCCAAACCATGGGGGTTT 

CTCACAGCTTTACACCAAAGGGCATCACTATCCCTCAAAGAGAGAAACCTGGACACATGT 

ACCTGCCCG 

Sequence 2336 

AGCTCCCCCGCGGTGGCGGCCCGCCCGGGCAGGTAC I I 1 I 1 I II 1 I i 1 I 11 I I H I GGAC 

AAAGAMCGTGTTTTAAAAMTMCACAMGTCCATAATTTCAATGAMGTMGTGTCM 

GCTCTTACCAGGCTGGTCCTCTGGTCCAGTTATAGCTAGTATATCTGCTTCAATACTATC 

ATCTTCTGGTAAAGGTCTAGMGACACAGGCATACTTTTTAATCCAGAAATTAAGAAAGA 

ATTTTTm f GAAGTTCAATCTATTAAACATACAGCTTATAGAAATMTTATTTGGAAAA 

TGGAATAAAGCAGTGTTGTTG I 1 1 I IGAGACAGGGTCTTGCTGTCTGTCACCTAGGCTGG 

AATACAATGGCACAATCATGGCTCACTGCAGCCTTAACTCCTGGGCTCAAGAGATCCTCC 

CGAGTATCTAAGACTACCAGGTGCCGTGCCCGACGCTTGGCTAATTTTAGTACCTCGGCC 

GCTCTAAAACTA 

Sequence 2337 

CCGGGCAGGTAC I I I I I J II I 1 1 I II I I I I I I GGACAAAGAAACGTGTTTTAAAAAATAA 

CACAAAGTCCATAATTTCAATGAAAGTAAGTGTCAAGCTCTTACCAGGCTGGTCCTCTGG 

TCCAGTTATAGCTAGTATATCTGCTTCAATACTATCATCTTCTGGTAAAGGTCTAGAAGA 

CACAGGCATACTTTTTAATCCAGAAATTAAGAAAGAA I 1 I II I II I GAAGTTCAATCTAT 

TAAACATACAGCTTATAGAAATMTTATTTGGAAMTGGAATAAAGCAGTGTTGTTGTTT 

TTGAGACAGGGTCTTGCTGTCTGTCACCTAGGCTGGAATACAATGGCACAATCATGGCTC 

ACTGCAGCCTTAACTCCTGGGCTCAAGAGATCCTCCCGAGTATCTAAGACTACAGGTGCG 

TGCCACGACGCTTGGCTAATTTTTAGTACCT 

Sequence 2338 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCCACCTAG 

GTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTG 

CAGGAAAATGAGTCTATATGTCAGMTACACATTTCCCACCTTGCCCAACAGTAGAAAAA 

CATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGAT 

GGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGA 

GCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTTCTGCAACCTGC 

AAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGG 

Sequence 2339 

CCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTATTTCTAACAGGCTTGTAGAGAAAC 

GTCGGTTCATGTAAATTAGAMTTATGGGGCCACTTTGCCATTCTTCACACCTGCAATGA 

ACAGGTGTTTATCTGCAGTTCTGACTTATCTCTTGAACTCCATTTGCATGTTATAGTGGG 

ATGCAGCTGATGCCCTGTCCAGATCTTCTTCAGGCCACTACATCTATATATGCATTCATA 

TTCCAGTGGCTGTGAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTTCTGGAGGAA 

ACTGAACTCAGCTGATGAAAGCCACCCTGGTCCTGGGAGGTGAAGCATCTTCCAAATGAC 

AGCCTGCAGTCAATGACTGATGAATATGACTTCATTGCCTCATGACAGGGA 

Sequence 2340 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGGCGGCNCGCCCGGGCAGGTACTTACAGTTT 
G ACTTTG AGTC AGCTCAG C ATTCTAAATC AAACGC AAAC AGC AG AATCATATGG C ATAG A 
CACTTAGCAAATCCAATGCCTTCCAGGCATCTGTTTCCTGTTGATGAAATCCTCCCTATG 
GAGAGCAAACTGGTTCATATCTTCAGATAGTGTCATTCAACCCCTTGGTAGCTTTCTGGG 
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CTACTAAATATCACACCGTCTGGTGGGACATTTCACCCAAGTCGTGAAGGTCCGCAGCTT 

CACTCCTGAGCCAGCGAGACCACGAACCCACCAGAAGGAAGAAACTCCGAACACATCCGA 

ACATCAGAAGGAAGAAACTCCGGACACACCGCCGCCTTTAAGAACTGTAACACTCACCGC 

GAGGGTCCGCGG 

Sequence 2341 

TAGGGCGAATTGGAGCCTCCACCGCGGTGGCGGCCCGAGGTACCATGATTAGTTAAATAT 

AAGACTCCGTAA I I I I f ACAATTTTAACAATAATTTTATTTCTTCAAGCTTGTTAGTTTG 

G GATTGTATTAAAACTAC AGTGTGTG ACTTAGAAAATG ATAATGCTGCTTTATG GAAAAT 

GGATTATAGGTGGGTAAGACTTCATTGCAAAAATTGTGTAATACCATCAGTGTTAGGAAC 

CCAGTTGAAGTCTAGMGACAGATGTTAGTATCTTAGACTAGGTTGGTATTTGAATAGAT 

ATTGGTAATATCAGTAGAATTTAATAATACATTAGAAAGAAAGAAATCAGAGAAGATTCT 

TTTATTTTCACTTGATACTTGTGTTGTTCTTNCAATGAGATAAGAAGGCAGGCAAAGGAG 

AAAAGTTCAGGGGCNGGGGATGAGAAGAAAACAANAATTNTATGTTGG 

Sequence 2342 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCCTCTCTGTTCCCCTCCCTCCACCCTTCCCC 

TCCTTTTCTG I I I I IMGATAMGTGTTGATTTGGAATTCAGCAACTGCAAGTCCATTTG 

AMGAGGTCAGTTTTGTTAGAGGCCTGGATAGAGCAATATGTTGTAACCAAATCACACGT 

GMCATTCTGTGGCTGAACATMGAAATTGTGCATTATTAACGTTGTGCTTCAAGGTGGA 

ATGCTGAAAGCACTTGGAAAGTAGTTGCCAGAGAAGAGCTCACTGAAAACATTATTAGCT 

TTATATATCCACTTGTAGAATCTCTGTACAGTGATACATCCCTCCCTTTCATCTTGATCA 

TAAATATAAGACTTAAAGGAGGCAGAGTGAACGTGCTTAGGAAGCCCACTGAGAGCTCTC 

AGAAAGGCGCTTAATACCTTTCTTAGTAAGTTGAAAGTAGGTTGATCTTCAAGT 

Sequence 2343 

CGGCCCGCCCGGGCAGGTACTATAGCTGTAAGGAGAAGCTGAGAAATGATACCCAGGAGC 
AGTAGGCTTTACGTCTTCAGCCTAAAACCTAAAAAAAAAAM 

AGCTATTAAACTGAMGCATCTGTAAAAAAAAAAAAATAAAAAAAAAAANGGTCCTGAGC 
ACCCAAGTCAGCTGTGCACTGAGGATTATGAGGACTGCAGGCTGTAGTCATTCTTCCCCC 
TGAGTCGTCAGCCTTGTCCCACACAGCAAGGTACCT 
Sequence 2344 

AGGTACGCGGGGGACGCTGGGAGGAGATGCTGCCACCTAGGCTACTTGTAGGACCCTATA 

CGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCAAGGAA 

GCTTTTCTTTTGAGCTATTTACAGCTTTTAGCAATTGAGTAMGTATACTCCTGTGAA 

AAATTTGGAACATATTTGTTTCTCTCTMCTGATTTCTCCAGAATTTGGAACTAGT^ 

GCTGAGACCMTGATGCTGACCTCCCTCAAMGCTGCATTTCTGAATTTCTGAAGGCAAA 

CTGTCTGCCTATATTGNACCVGCCCG 

Sequence 2345 

AGGTACGCGGGGGACAGGCCATCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCT 

CAACATnTTACGAAGACA/w'XATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTGTAG 

CATAAAAGTGTAATCCTGGMGATGTGGTGTGAGAAGCATCATCAGCATCATGCTATTAC 

AATCCCAAACCATGGGGG^TTCTCACAGCTTTACACCAAAGGGCATCACTATCCCTCAAA 

GAGAGAMCCTGGACACATGTACCTGCCCG 

Sequence 2346 

AGGTACGCGGGGAGTGTCCTCCAGGACCCTGGCCCCTCATGCCTCCGTGCTTGCGCGTGT 
GCCATTTCCTCTCTCCAGAGGACCTTTCCTGCCTAGGACTCATGATTGTCCCCTCCCTGT 
GTTGCCTAGTTTCCTGGTATTAAGGAGAATCAACTCTCTGGATAAACGTGCCTTCTCCTG 
CCACGGCATGCACCCACACATACGAGCCTCCCGGGTACCTGCCCG 

Sequence 2347 

CGCCCGGGCAGGTACCATGATTAGTTAAATATAAGACTCCGTAAl I I I lACAATTTTAAC 
AATAATTTTATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAAACTACAGTGTGTGAC 
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TTAGAAAATGATAATGCTGCTTTATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGC 

AAAAATTGTGTAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGTTA 

GTATCTTAGACTAGGTTGGTATTTGMTAGATATTGGTAATATCAGTAGAATTTAATAAT 

ACATTAGAAAGAAAGAAATCAGAGAAGATTCTTTTATTTTCACTTGATACTTGTGTTGTT 

ACTTTCAATGAGATAAGAAGGACAGGCAAAGGAGAAAGTTCAGGGGCAGGGGATGAGAAG 

AAMCAAGAATTTTATG7TGGACATGCTAAAGTTAMCA 

TCTGTGGAAGCAGAGGCTGCATGAGGTTGCTTATGAAAATGCTTTTTTGANGAAGCCATT 

GCAGGAGGAATCTCTTANGAANCCAGGGAAGAAGGAAG 

Sequence 2348 

CGAGGTACGCGGGGGAGAAGAAACGGCGGAGACCTGAGACCGGGAGGCTGAGGCTGTAGG 

CTGTTATCCTTGACATCTGCAGCAGCCCTTCCAAGCTGTGGAGACCAGGTCATCTGGAAT 

GCCCATTTATGTCAATGGAAGAAAGAAAAAGGGGTCTCCTCCCATCCTCACCACTGCATT 

CTCCCACCAACCCTGCTCCTGTCCCACTTCCCACAACTCAGTTGTTGACAATTTAACAGT 

GGGTTTCTGGAAACAGTGCTGCCTCCTTGATGGTCTAACTAACTACCAGAGTTATTCTAA 

CAGCGGGAATTTCAAGCCAGCAAACCAGACATACATAGGAGGCACAGCAAAAGACCCAAG 

GTGAGGGGCTTCCCTCACCTTCACCCTTGTTCTCACGTGGAGGTGGCGCATGTCTATGAG 

ACACAGTACCTGCCCGGGCGGC 

Sequence 2349 

CGAGGTACCATGATTAGTTAAATATAAGACTCCGTAAI I I I I ACAATTTTAACAATAATT 

TTATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAAACTACAGTGTGTGACTTAGAAA 

ATGATAATGCTGCTTTATGGAAAATGGATTATAGGTGGGTAAGACTTCATTGTAAAAATT 

GTGTAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGATAGTATCTT 

AGACTAGGTTGGTATTTGMTAGATATTGGTAATATCAGTAGAATTTAATAATACATTAG 

AAAGAAAGAAATCAGAGAAGATTCTTTTATTTTCACTTGATACTTGTGTTGT^ 

ATGAGATAAGAAAGACAGGCAAAGGAGAACGTTCAGGGGCAGGGGATGAGAAGAAAACAA 

GAATTTTATGTTGGACATGCTAAGTTAAACAGCTGCTTAACTCAGATTGGCTTCTGTGGA 

AGCAGAGGCTGACATGAGGTTGCTTATGAAMTGCTTTTTTGAGGAAGCCATTGCAGGAG 

GAATCTCTAAAGGAAGCCAGGGAAGAANGAAAGAGAAGGGAA 

Sequence 2350 

AGGTACGTGGGGAGGCCATCTGGCTGCTGCTGCTGCTGCCAGCTTCCAGACGGAAGCTGA 

GATTTGCTATTGATGTTCCAGCATTGATTGAGCTCCTGCCTCGTCTGCATCAGGCATGGT 

CCTCGCAGCTTCACGCAGTTAGAGTCCACATCACGTCTCTCTGAAGAACAGTTAAAGTGT 

TTTCTGGATGAATGCATACTTAAACAAAAATCCATCATTAAACTTTCTTCAGAAAGAAAA 

AAGGAAGACATTGAGGACGTAACACCTGTGTTCCCCCAGCTTTCCAGGTCCATCATCTCT 

AAATTGCTAAATGAATCAGAAACAAAGGTCCAGAAAACTGAGGTAGAAGATGCAGATATG 

CTTGAGAGTGAAGAATGTGAAGCTTCTAAAGGCTACTATCTCACTAAAGCCTTGACTGGA 

CATAACATGTCAGAAGCTCTTGTCACTGAAGCAGANAATATGAAATGCCTTCAA 

Sequence 2351 

AGGTACI I I I 1 1 I II I I I M I I I I I I I I I I GGTTTCTTCCTTCCTTCTGTCTTTTCTTCC 

TTCCTTC lilll CTTTTCTTCCTTCCTTC Mill CTTTTCTTCCTTCGTTC Mill CTTT 

I I I I I ICTGGAGTAACCAGTGTGACATMGATCTGTTTGTGTTCCCCCATTTTACAAAAG 

AAAATTTAAGTCTGCAACTTCCCCTAGGCATCTGCAGCAGAGAGTGGATTTGTCAACATG 

CAAAGCTGCTGACCTGCCCGGGCGGCCGCTCGAAGCGTGGTCGCGGCCGAGGTACGCGGG 

AGTCTCGGTCTCCAAAAAGGGATACTACTAGGGAAAGCAGAAGATCTGAATCACTGTCCC 

CAANAAGAGAAACTTCTAGAGAGAACAAAAGATCTCAGCCAAGAGTGAAAGATTCTTCCC 

CAGGGAGAAAAATCCCAGGTCCCAGAGCAGAGAACGAGAAAGTGATAGAGATGGGCANAG 

GAGAGAGAGAGAAAG 

Sequence 2352 

CCGGGCAGGTACCGCGGGGGCAGTGGGAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCT 
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CGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATGAGGAAACAGCAACCC 

AGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCAAGGCTGAAGAACCAGA 

CCTGGACTTTCTTAGGACAMCTTACTGCAGCTTGAAGGAGCCAACCATGGATTTGAGGC 

GTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATCATTAGCTGCTGTGCTG 

TTTTAGAGCCCTG G GAGCGATC TATG TTTAACACCATCTTACTAACCATTATT GC TATGG 

TGGTATACACTGCCTATGTCTTTATTCCAATCCACATTCGCCTGGCTTGGGAAl I I i M I 

CAAAAATATNTTGGATATNACAGTACCTTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnAGGAATTCNATATCAAGCTTATTCGATACCCGTCNACCTTCGANGGGGGGGCCCN 

GTACCCAGCTTTTTGTTCCCTTTAG 

Sequence 2353 

CCGGGCAGGTACGCGGGGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCA 

CCTAGGTTACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACA 

TCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAA 

CATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAAC 

AGTAGAAAMCATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTAATGGAAGTCA 

GCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAG 

AGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCT 

GCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCCTT 

CCACTTCCAGMCTGTGANAAGTATTTTGTTTGTGGTTTAGTCAATGGTCTATGGTAATT 

TTTTTA 

Sequence 2354 

AGGTACGCGGGCAAAGCCCATAAAMTAAAAAATTATAACAAACCCTGAGAACCAAAATG 
AACGAAAATCTGTTCGCTTCATTCATTGCCCCCACAATCCTAGGCCTACCCGCCGCAGTA 

CCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTAGNGAATTCGAATAT 

CAAGCCTTATCGATACCCGTTCGACCTTCGAGGGGGGGGGGCNCCGGTACNCCAGCT7TT 

TGTTCCCTTTTAGTTGAGNGGTTAAATTGCGCGCCTTGGGNCGNTAATTCATTGGNTCAA 

TANGCTTGGTTTACCTGGTGGTGNAAMTTTGGTTATTCCCGCTTCACANATTTNCCAAC 

AACCAACCAATAACTGAGGCCCGGGGGMGNCATTAAAAAGTTGGTTAA AAAG ANCCTNG 

GGGNGTTGGCCCTTAMTTGNAAGGTTGGMGCCCTAAACCTTCACCMTTTTAAATTTG 

GCGGTTTTGCCGGCCTTCAACCTTGGCGCCCGGNCTTTTTTCCCAAGTTTCCGGGGGGAA 

AAACCCCTTGTTCCGNATGGCCCCAAGGCCTTGGCCAATTTNAMTTGGAAAATTCCCGG 

GCCCC 
Sequence 2355 

AGGTACGCGGGGCCCGCCGATATCTCTGCCGGGTGACTAGCTGCTTCCTTTCTCTCTCGC 

GCGCGGNGTGGTGGCAGCAGGCGCANCCNANCCTNTAAATGCANAACGACGCCGGCGAGT 

TCGTGGACCTGTACGTGCCGCGGAAATTGCTCCGNTTAGCAAATTTCNCAATCATCGGGT 

GCCAAAGGAACNCACCGCTATTCTCATTNNCANNATTGNAANCGTTGGNCACGAAGTGTT 

TGTACAAAGGGTTCAACCAAGGGCCAGGGTTTNTAMTNGGGNCACAAGGTTTNTAAAAA 

AACC NTTTAATTGGCCTAATNCCTAACN G NGGGGGCTCAAT fAT TCNG GTC AAG NG N AAT 

TGNGGGGGTTGGAAANTTCNAATNGMTTGGAATTTTCCCCCMTTTTNCTTNCNCCGGN 

MTTTGGGGNNCCCANAGNGGACCCCTNMTTGGGCCCAWTTCNNGTACCTTCAAAAAAG 

T AAAN CCTTTTTTTTTTGT ACCTT NGGGGAAGT AAG N AAAATTCC AACCC AAGGT AATTG 

GGTTGGGGAA 

Sequence 2356 

CCGCGGTGGCGGCCCGAGGTACTGGAGATTCATATATGCAAATATTCTCATGCAAGAAGT 
TCCACAGTAACAACAGCAAAAAGAAAAAATTAGTTGTCCAGCCAGTGCTGGAGGAAAATG 
TTTCTGGGGMGATGACTCAGTCATTTTGTGGCGAGACACCCTTTGGT AACTCC CACTGA 
CCAGTCTTGGGAGCCTTCCTGGAATGATCGTGGGCTGAGCGGAGATGI I I II iGCAAAAT 
GAAACTGAAGCTGAAAGAAAGGAGAATTCGAGTGAACCAAGAAGAAATCCAAAGACCTGG 
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GGAAGGAGGACTTAAGATGAAAGTGAAGCAAGAGAAAAATAATTAAANAANANANNTNNN 

NNGTACCTGCCCGGGC 

Sequence 2357 

GACTACTATAGGGCTTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACCACCCTCA 
TGCCTCCTACCCCCGAGGTTCCTAGAGCTAGGCTCTCCTGAGGCAATGCTTTCCTTCTCA 
ATTCATATTCTTCCAGGAGGGGCACCAACG I Nil I AAAATGATGTTGGCGACGAGGACG 
GTAAATTTTCTAGATGACTGMGGCTGACTTTCCCCTTTCTGTGACTCTCTAGGCAACAC 
CGTGACAGGAGAGACGACTCCCTTTTCTCCTCTTGTGACCACTTCTGAATCTGTGACCCG 
AAATCACAGCCAGTAGCTTTGTGGTCTCCTGGGTCTCAGCTTCCGACACCGTGTCGGGAT 
TCCGGGTGGAATATGAGCTGAGTGAGGAGGGAGATGAACCACAGTACCTGCCCG 
Sequence 2358 

CCGGGCAGGTACCATATAGGTCCAMCCCAAAACTGATTTTTGTTTTATGAAGTAGAAAT 

GGTTGCATGGGCTCTTGGCTAGMCCATCTTTGGCCCATTGAAAATCAAGTTTCTGAGAC 

TGAAAAGGAAACACACAGTGACACCATTTTGATGAGGAAGAGAAGAATTCTGAAGGAGCC 

AGAATATATTTGCTGMTTTTGACACTGAAGMGAAATAAACATTGAAAAAACAAAGCAA 

MCAACAGCMCAACAAAAMCTAMCCCATTTAGGTTTCCACTACAGATGCAAGAAAAA 

AGTGCCCTTGCACG 1 1 I I I ICTGTCTGAGTGTGTCAGCCAAGGTTGCATGGGGGAATTCA 

GGATGTATAAAAACATAGACAGAGAGAAGMGATNATTTTGCACCTCAGAAATAACCTTG 

TGAACAGCTGAGTAGCCTGT 

Sequence 2359 

TCATATAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGATGTAC I I I M II I I I I I I II 
Mill IGTACGGGATGGGTTTTTGCCATTGTTGTGCCAGCNTGGGTCTTGAACTCCGGGG 
GCNTGNAGGCCNGCCTCCCACCAACANAATTAACTTNNTNAGAGCAGCAAGTNCCCATAT 
GGNMCACAGTCACCACTCCCAGGAATTAAGAATGTGGGACAGTNGGACATCTTTGGGGG 
GCCATTATTCAGCCTGACCCTGACTCAAGGTCTGGACCAACAAAGAAAAMTGGGCAGCC 
CAGCACI I I I INCCTCCTTTCCTCTCCTTTCATCCCAl I I I I I I ACATTCAGTN AGAAGA 
TTGTCAGAACACCAATTAAAAAAAGAAAGCNAAGATTGGGGGCT 
Sequence 2360 

AGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAGATTGTGTCCACTGGAAAGG 

TAAATGATTGCTTTTTTATATTGCATCAAACTTGGAACATCAAGGCATCCAAAACACTAA 

GAATTCTATCATCACAAAAATMTTCGTCTTTCTAGGTTATGMGAGATMTTATTTGTC 

TGGTMGCATTTTTATMACCCACTCATTTTATATTTAGAAAAATCCTAAATGTGTGGTG 

ACTGCTTTGTAAGTGAACTTTCATATACTATAAACTAGTTGTGAGATAACATTCTGGTAG 

CTCAGTTMTAAMCMT7TCAGMTTAMGAAATTTTCTATGCMGGTTTACTTCTCAA 

GATGAACAGTANGACTTTGTAGTTTTATTTCCACTAAGTGAAAAAAGAACTGTGTTTTTA 

AACTGTAGGAGAATTTA/xTAMTCAGCMGGGTATTTTAGCTAATAGAATAAAAGTGCAA 

CAGMGAATTTGATTAGTCTATGAMGGNTCTCTTAAAATTCTATCGGAAATAATCTTCA 

TGCNAGAGATTTCANGGTTTGGATTAGCCAGTGGGAATAAAGAAATGGGCATTGGTTCCC 

TATAATTGGGCTGGTTTrATAACTTTTGTAATATTACC I I 1 1 I CTGGCTGNGGTTTTATA 

CTTATCCCATATGCNT GG ATGGGGAAAAAATTT 

Sequence 2361 

ACTNCTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I I I I I I 
TTTTTTTTTTT TTTTT (TTGGAGTAACATCCTC AG AC AAGTAG ATAG ACAAGCCTCCAGC 
AAACAATGGGAGGAAGAATGCCATGATTCCTCCTTAAATTGTANAAGCTCTCTGGCCTCT 
GCTGTGCATAATCTGCCTACCCATCAAATCACAAAATTTGTTTGTTTATATAATATCATC 
CTTTCTCACAGCTTTTATCAGCAGAGTTCTTAAGTTCTCAAGGGAACTGTCTATTTGAGG 
AACTATAAGGATTGGTCAAATATGCTGGAATACTGCAAAGGCTAAAAMTGAGATTCTTT 
ATAAGACTATGAATCCAGTGAACTAAATATTAGGCAAGACAACATTGTTTTATCCANATG 
G CTAATTTGCATG ACTCC GTTTAAC ACTTG ATTTTTTT 
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Sequence 2362 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGMCTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGAACTA7TTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACAC 

ATTTCCCACCTTGCCCAACAGTNGAAAMCATANGAAGAGAAAMCATTAAAAAATGACA 

AGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAA 

TTAATGCCCTTGTAAGAAGAGGCCANAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGA 

GCCAAG 
Sequence 2363 

AGGTACCACACACACACAGGCACACGCAGGCACACACGCAGACACACACACATATAAACA 

CACAGAGTTCACTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAMGTGACAACACCGG 

AAACAAGGTAATTCATGTTAAAACACAAGGGTTATATCAGTAAAGAAGATGGGATCCCCG 

AAGTAAAC C GTGGAATTTGAATC AAG CTTCGAAG AGC TAAAAAAAGAAATTGG AG TTTC A 

ACATTCACCTTCTTGAATCCTTAAGAAATACAGAAGTTCAAAATAGAAAACATTACAGTT 

TCAGGATACAAAAGTAGAMCATCTGAGATTAAGCCTACATTTTTAAAAAAAGATATTGA 

AAATTACTGTGCTTGTMTACCAGCTTTTCAAACATCAATACCAGTATTGGCATTACCTA 

GATC 

Sequence 2364 

GGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACCGCGGGGGCCGCCGTGTG 
CCATTTCCTCTCTCCAGAGGACCTTTCCTGCCTAGGACTCATGATTGTCCCCTCCCTGCT 
GTTGCCTAGTTTCCTGGTATTAAGGAGAATCAACTCTCTGGATAAACGTGCCTTCTCCTG 
CCACGGCATGCACCCACACATGCGAGCCTCCCGGGTACCTGCCCGGGCGGCCGCTCGA 

Sequence 2365 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGTCCCT 

GMTTCCATCTGACTCTMTTTTATGAGAATTGCAGAACTCTGATGGCAATAAATATATG 

TATTATGAAAAAATAAAGTTGTAAmCTGATGACTCTMGTCCCTTTCTTTGGTTAATA 

ATAAAATGCCTTTGTATATATTGGATGTTGGAAGAGTTCAMTTATTTGATGTCCGCCAA 

CAAAATTCTCAGAGGGCAAAMTCTGGAAGACTTTTGGAMCACACTCTGATCAACTCTT 

CTCTGCCGACAGTCATTTTGCTTGAATTTCAGCCAAAMTATTATGGCATT TTGG ATGCT 

TTATTCMGGGCTATTACCTCAMCTTTTTTCTTCTCAGAMTCCCAGGGATTTTCACM 

GGGATACCTTGTTATATATGGGGAAAACAAGCAAGGTTTAT 

Sequence 2366 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTACTCA 

CCCTTCCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGG 

CATCTGAMGGACCAMGAGAGTGACCAGCMGCATTTTTTGCAAGGCTGAGGAGCTGAC 

AGCTTCCATGAAAGGCTGGACCACCCAGTGGTTGAAAAGCATCATCTGGGTTACCTTGTG 

CTGCCATAAMCACACCACAGACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAA 

TCCTGGAGGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCC1 

CTTTCCTTCTCACTGTGTGCTCTTTCTTGTGCATGGGAGAGAGAAGAGAGCCATTGAACC 

AAGNCCCTCTTACTGTCCCTCTTAAGAAGGGGCACTAAAT 

Sequence 2367 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGGGAGGAGTTA 

GGGCTTAGGAAGAAGGAAATGCACAACTTGGGACCTGAGTTAGGCCTTCATCGTCGCGGA 

CTCAGTAGACAGAGGGCATAGGTGCAGCTGTGATGGTAGAGGTGGAAGACTGATTTGCAT 

TTTCACCAGGTGAGACCTGGTTACTCAATTGGTTGGATGAGGGAAAGAGATGATAGCCAG 

GGGCGGCATAAAACGGCAATGAGCTCAGTAAACAACCCCCTTCCCGTGATCTTCGTGGCT 

TCCCGCCCAGACNTGTGACCAATATAGCAAGGAGCAGTCTTTMGGCGACACCGGCAGAG 

AGGCAGAAAAAGATGGGCCCTTACNCGGTTAGTTAAGAAAAAAAAGAGGACCTGTTGCCT 

GTCCCTCTAGCTTTGAACCTACAAAGTGGGAGGGGG 
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Sequence 2368 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACACGTCCCTAGAA 

GCACATAGCTCCAGCAACAAAGCTAATCTGAGAGTAATACCCAAGATGCCTCCTACCTTG 

CAAATTTCAAGGMTTTCTCACTGGTGTATTTCATAGGATGCTCAGTGAAAGTAGCATAA 

GGAACTTCAGTGGACCATGGGGTTCCAGCGGGGACAGAAGAGACTGCTCCTCCGGACTCC 

CCCAGTAGATCCTAAGGCCTTCTCCTTGTCTCTTGTCCAGGGACATCCCAGGGAAGGTGA 

ACTTGCCCAGGCAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTGCACACAAC 

TCAGCATGATGGAAGTCGTATTTGCAGGATCAAGGAGAGTCTTGTCTGTGGACCAGTAAG 

NAAATTCTCTCCTTCTCATTG 

Sequence 2369 

GACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGACGGGGCTGT 

AATCTGGTAACTGTATGTATTTTAGTTCTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTG 

GGAGGGAGGAAGGGGCCAGCTGAAATGGAAACAGATCCCTGATCCGGGGCGGTCAGTGGA 

ACCCTTCTTGGTGTGCGAGAGCCTGTGCATTTCAAGAGGCAGCAAAAAAGTAAAAAAAAA 

AAAAAAAATTGATCTTTGTTTAGATTAACAGGACCCCTGACTATGAANAAGGAAGGCATC 

CAGACCAGAAACCGAAAAAATGTCTAGCCAMTCCAAAAAGTGCAAAAAAAGTTGCATTG 

ACTCACTGGAGGGACTTTCCCCAAGAAACAGCTNGTTTNAACCCGGCCCGCTCTTANAAC 

TAGTTGGGATCCCCCGGGCTGG 

Sequence 2370 

CCGCGGTGGCGGCCGAGGTACCATTTAATCCTCACCCCTGACTGTCAGTAGACTGCCATG 

GAAACCCCAAGGATGCCTGTCCTGTTGGACATGTGGTCAGAGACTGGCTCCAAAGATATC 

TCCCAAAGAGCAAAAGCCACCATCTTCCTGTTGGTGCCCAGACACAGGTGCATGAGCTCC 

AAGGACTCCTTCCTAGGCCACCTCTNTGCTGCCTGATCTAAACTCACCTCATTTCTGGGA 

GCTGGTCTGAGAGTCTGCCTTGGAGATGTGTTGGATGGGAACTAGGGGTTTGTAGGCTCC 

ATTGTGAGCAATACTGATGGGAAAATGACCCCAACTGGTATGGATATTTTTTGCANTAAA 

AGCCCCCTGNAAATNCTGGACATGCTCAGGGGTCAAGAGCCATTGAA 

Sequence 2371 

TTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATCAACCAACAAGT 

CATTATTACCCTCACTGTCMCCCAACACAGGCATGCTCATAAGGAAAGGTTAAAAAAAA 

GTAAAAGGAACTCGGCAAATCTTACCCCGCCTGTTTACCAAAAACATCACCTCTAGCATC 

ACCAGTATTAGAGGCACCGCCTGCCCAGGTGGACAACATGTTTAACGGCCCGCTCTAGAn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTGG 

CGTAATCATGGGTCATAAGCTGTTTCCTGTGTGAAATTGGTrATCCCGCTCACCAATTCC 

CACAACAACATTACGGAGCNCGGGNAGGCATAAAAGTGTAAAAGCCTGGGGGT 

Sequence 2372 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAAATTTGGAGC 

ATCATTTGAAATCATTTTCTGATTCTMGCTATAACTGAAA.TACAAATATAACATTAAGA 

CACCCCAAAGAAGACATTAAGGATACACTTGTCGTTCTTAAGTGTGTTTCACTAAAAATA 

ATAACAAAAAACTTTT AATTTTTAAAAATGGTTCTAG CTTT FAAAAAC CTGC AGTAAATA 

CTTTACAAGTGGCTAGACACCTTCCCTGGTGGTTTAGT^TATACAAGCAGGAMCTTTCT 

CTCCTACTCTTTCTCTCTCTCTTTCTTGTTTFCTTTAAACCAACTGACGTM 

TTTGCTTTAAAAGMCAGACAGGAAGTAATTGGTAG 

GGATTGGGTTTTAGTGGCCAAAATGCCATTTAAAAAAAAAAAAAGTT 

Sequence 2373 

AGGTACCATGATTAGrTAMTATAAGACTCCGTM 

ATTTCTTCAAGCTTGTTAGTTTGGGATTGTATTAAAACTACAGTGTGTGACTTAGAAAAT 

GATAATGCTGCTTTATGGAAMTGGATTATAGGTGGGTMGACTTCATTGCCAAAAAATT 

GTGTAATACCATCAGTGTTAGGAACCCAGTTGAAGTCTAGAAGACAGATGATAGTATCTT 
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AGACTAGGTTGGTATTTGAATAGATATTGGTAATATCAGTAGAATCTTAATAATACATTA 
GAAAGAAAGAAATCAGAGMGATTCTTTTATTTTC 

AATGAGATAAGAAAGACAGGCCAAAGGAGGAACGTTCAGGGGCAGGGGATGAGGAAGAAA 

ACAAGGA 
Sequence 2374 

AGGTACGCGGGCATCCTTAGGAGACCTGAGTCCTCAAGAAAACCCTCTTCTGGAAGTAGT 

TGCTCCTTCAGMCGTTTTACAGAAAACACTAATGTAAAAGACACAACTAATGTNAAAGA 

CACAAAAGAGATGTGTTCAMGACACATTTTCTGAAMCACAAAANCTNACAAATTCAAT 

CCTCCCTGGAGNGGCAAGTTTTTCNGGCCTGGGGNMCCTTGGCMTTTCNAAAACTTNT 

NANGAAACCCCATMCCTTGGTTTNAAAAACCAAAAMCCTTGGMGG MCCA AATAATA 

TTTGNGNGGAANATTMCCAAANCCAANAATTGGTTGGGGGGG 

TTTGNGTTCNCCCCCCTTGGGAAGNCCCCCCAAAAAANAGGGCCr 1 I I I CAANANTTTTA 

ANCCCCCCCAATTTTGGGCCTTCNTTCCNGGTTTCCCCCCCCAAGGGGGTTTNGNAATTC 

CAAGGNCCTTTTGGTAAMNMTTmCNMGGNCCTTAAAACCCCGGAAGGNCAANGCC 

TTAACCNGGGGTTCTCCCCCTTNNATTTCCCCCCCCNAAACCGGAAGGGGGATTGGTTGG 

AAGGAAAAAAGGGTTTNCAATTGGGTCCTTNCAATTGGGTTTAATTCCNTGGGGGAACC 

CTTTGGGAA 
Sequence 2375 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCATTTA 

ATTAGGGTTGCAGAGGAGATGGAAAMCAAGCTGTTTGATTATGAAGGCAACACTTCTGT 

CAGTAGAATTGCTTTGGATTGTAMTATGTTTACTTCAGAGGAAATCAGTCAGAGTGGAT 

GGCTCAGCAAAACCCATCACAACTGCGGAAAAAAGGAAGTGACTGGCTAGAGGTAAGGAA 

ATAGATTTTCTCTAAGTTAACTGGCCTACGTGTGAATCGAACCCTGCACCCTGGCCTCAT 

TAGCACCAACACTGTGAAGAACAGCTGTGGCAGAGAAAGTGCCACGGCCCCACCCCATCA 

CCAGATTCAGTAGAAAGGCCCTTGGMTTTCCTACAAGGGTTCAGTTCTGAACAGCTCCA 

CTTCAAATCAGTATAGAATCCTCACAGTGACTCTGTGAGGGGAGG 

Sequence 2376 ^ 
GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGTGTCCAGGTTTC 

TCTCTTTGAGGGATACGTGATGCCCTTTGGTGTAAAGCTGTGAGAAACCCCCATGGTTTG 

GGATTGTAATAGCATGATGCTGATGATGCTGGAAGAGGCATCAAAACTCAGTTACTTCCC 

MGGTTTGACTTACAGACATAAACTTCCCAGGGACCATACTTTGTAGGAACTTAAGAAGG 

TATTTGCCATTATTCCAAGGCTGGTTGTCCTTCANAAACAAAACATGGGGATCAGTGGAG 

CATTGGAAAGTGGGCATGCTGGGCACAGTTCATCCATTGTGGTCAAGAAGCATGTCACTG 

GGAGAGTACCT 
Sequence 2376 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGTGTCCAGGTTTC 

TCTCTTTGAGGGATACGTGATGCCCTTTGGTGTAAAGCTGTGAGAAACCCCCATGGTTTG 

GGATTGTAATAGCATGATGCTGATGATGCTGGAAGAGGCATCAAAACTCAGTTACTTCCC 

MGGTTTGACTTACAGACATAMCTTCCCAGGGACCATACTTTGTAGGAACTTAAGAAGG 

TATTTGCCATTATTCCAAGGCTGGTTGTCCTTCANAAACAAAACATGGGGATCAGTGGAG 

CATTGGAAAGTGGGCATGCTGGGCACAGTTCATCCATTGTGGTCAAGAAGCATGTCACTG 

GGAGAGTACCT 

Sequence 2378 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCATCTCTCTGACCTCATTCGCATGGCATTC 

ATGGCTGCAACTGATCATAGCAACCAGCTGCGAATGGCTGGGCTCCAGGCGCTTGAAGAC 

ATTATCAAGAAGTTTGCGTCTGTGCCTGAGCCAGAATTTCCAGGTCATGCGATACTGGAG 

CAGTATCAGGCTAATGTGGGAGCTGCTCTAAGACCAGCCTTTTCACAAGATACACCATCA 

GATATAATAGCGAAAGCTTGCCAGGTATGTAGTACCT 

Sequence 2378 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCATCTCTCTGACCTCATTCGCATGGCATTC 

ATGGCTGCAACTGATCATAGCAACCAGCTGCGAATGGCTGGGCTCCAGGCGCTTGAAGAC 

ATTATCAAGAAGTTTGCGTCTGTGCCTGAGCCAGAATTTCCAGGTCATGCGATACTGGAG 

CAGTATCAGGCTAATGTGGGAGCTGCTCTAAGACCAGCCTTTTCACAAGATACACCATCA 

GATATAATAGCGAAAGCTTGCCAGGTATGTAGTACCT 

Sequence 2379 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGGCCTGGCTACCTGTGGGTTG 

TTTGGATCTATCGGAACCTCATTGGAAGTGTGCACTTCI I I I I CATCCTCACCCTC ATTG 

TGCTAATCATCACCTATCTTTACTGGCAGATCACAGAGGGAAGGAAGATTATGATAAGGC 

TGCTCCATGAGCAGATCATTAATGAGGGCAAAGATAAAATGTTCCTGATAGAAAAATTGA 

TCAAGCTGCAGGATATGGAGAAGAAAGCAAACCCCAGCTCACTTGTTCTGGAAAGGAGAG 

AGGTGGAGCMCAAGGCTTTTTGCATTTGGGGGAACATGATGGCAGTCTTGACTTGCGAT 

CTAGMNATCAGTTCAAGMGGTAATCCMGGGCCTGATGACTCTTTTGGTAACCAGACA 

CCAATCAAATAAGGGGAGGAGATGAAAATGGAATGATTTCTTCCATGCCACCTGTGCCTT 

TAAGAACTGCCCANAAANAAA 

Sequence 2380 

AGGTACGCGGGGGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAA 

AATGAGTCTATATGTCAGMTACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAG 

AAGAGAAAAACATTAAAAMTGACMGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTT 

TGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGC 

GCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGA 

CCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTG 

TGAGAAGTATATGT7TGTGGTTTAGTCAATGGTCTATGGTAAI II If ( I ATAGCAGTCCC 

AGCCAAGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAGGCTCCTTTCAG 

AAACCCATGTTCAMTMGAGATMGATACTGAAACACATAACACCTTTCGCTAGTTTTT 

AGTATACAAATATTG 

Sequence 2381 

CCGCGGTGGCGGCCGAGGTACAACATGGGGCTGTTCATCCTTCGTCTTGCTGAAGATGGT 

CCTGCCACCAAAGATGGCAGAATTCATGTTGGTGACCAGATTGTTGAAATCAATGGGGAA 

CCTACACAAGGAATCACACATCTCGAGCAATTGAGCTCATTCAGGCTGGTGGAAATAAAG 

TTCTTCTTCTmGAGGCCAGGMCTGGCTTGATACCTGACCATGGTGATTGGGATATTA 

ATAATCCTTCGTCTTCAAATGTGATTTATGATGAACAGTCACCATTACCCCCATCTTCAC 

ATTTTGCTTCCATATTTGMGAGTCTCACGTGCCAGTAATTGAAGAATCTTTGAGAGTTC 

AGATATGTGAAAAGGCAGAAGMTTAMGGACATTGTGCCTGAAAAGAAAAGCACTTTAA 

ATGAAAATCAGCCTGAGATAAAGCATCAGTCTCTTCTCCAGAAAAATGTGAGTAAAGAGG 

GATCCACCCAGCAGTCATGGGCACAGTAACAAGAAAAATCTATTAAAANTAGAAAAATGG 

TGTTACACNAAGAGGTAGATCGGNTAGTCCCAAAAAGCCAGCCAGTCACATTCAGAGGGA 

ACATTTGGATAAGATTNCTAGTCCTNTNAAAAATAACCCCAAA 

Sequence 2382 

gggcgaattggagctccccgcggnngcggccgatggcctcttctaggcagatccgtct;x; 
aactggtaacagcaaccatatcgagtttctctttatcacaataccacgaggcttatcatc 

ATTGCCATGCCTTTAGTAACATCAC I I II I I AAATGGGCATTTTCACAATGTAATCTGCA 

ACCTACATATAACATGCACTCTTTATTGAAGGCCGTACCTGCCCGGGCGGCCGCTCGAGA 

G AC ACATTACCTG AATG AGCAGGTGAAAGCCATC AAAGAATTG GGTGAC CACGTGACCAA 

CTTGCGCAAGATGGGAGCGCCCGAATCTGGCTTGGCGGAATATCTCTTTGACAAGCACAC 

CCTGGGAGACAGTGATAATGAAAGCTAAGCCTCGGGCTAATTTCCCCATAGCCCGTGGGG 

TGACTTCCCTGGTCACCAAGGCAGTGCATGCATGTTGGGGTTTNCTTTACCTTTTTCTAT 

TAAGTTGGACCCTCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNAA 
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TATTCAAGCTTATTCGATTCCGGTCNACCTTCGANGGG 
Sequence 2383 

CCGGGCAGGTACGCGGGGGGGGGGGGATTCTTGATCCATGCACAGCGATGTGAGCTGAGG 

TGCAGGCACCAGACCTAGGAATTCCTAGAAAAATAGTCAGGAAGCATTTAGACACATCAA 

ATGTTAAACGAGTCCTGATTATGATGATMTGATGATGATTTTGGTGGTTGCAATAGCAA 

AGCC7TAAGTATGAAGGAGACGTGCCAGCTGGAAATACAGGTAGACAATGAACAACTGAA 

TTTAGAGG ACGAAG ACATTG AAAG CATTG ATG CC AC C AAATTG AG C CGTTTC ATTG AG AT 

CAACAGCCTCCACATGGTGACAGAGTACCT 

Sequence 2384 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGG 

AAAACTTAMCCCATGCACAMTAATTGGTGAGCCTTCATTTCCCTGACTTCMGTTTCC 

ATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGG 

GTAMTTTGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACAT 

GTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGCCTGCAGGAAGCTGCCTGC 

TCCTGTGACCATGTCAAGCMTTCTTTCTTTCAACTGCAACTGTGTGTMGAGCTTAGTC 

TGAGAAGAAATGTTCAGAAGCTCACTGTGGCTGCACATCTGAGCCATGTCTTCCCATTAG 

TTGTC ATG AGTCAGCAATAAAGCG G GTATGTTGATGTCTATC AATCTAAT 

Sequence 2385 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGG 

AAAACTTAMCCCATGCACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCMGTTTCC 

ATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGG 

GTAAATTTGGGCAMGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACAT 

GTGTCTGCCCACAGTGACATGAGTTTGCGTTGACTGTCATGCCTGCAGGAAGCTGCCTGC 

TCCTGTGACCATGTCMGCMTTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTC 

TGAGAAGAAATGTTCAGAAGCTCACTGTGGCTGCACATCTGANCCATGTCTTCCCATTAG 

TTGTCATGAGTCANCAATAAAGCGGGTATGTTGATGTCTATCAATCTAATTCCTATGTTC 

TGAACTCAGG 

Sequence 2386 

CGCCCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGTGGCAGGAGAAGGC 

TTCCGAAGAGCCATGCATGCGTCCTCTGGCTGCTCCAGGTGTAAAAAATGCCAGGGAGGG 

GACAATGATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAATGGCACACGCGCAA 

GCACGGAGGCATGAGGGGCCAGGGTCCTGGAGGACACTGGGCCGGGACAGATCAGACTCC 

CCCCGCGTACCT 

Sequence 2387 

AGGTACGCGGGTTTCCTCAACATGGCTGCGCCCTTGTCAGTGGAGGTGGAGTTCGGGTGA 
GTCACAGAGCTGGGGCGCCGTGGGGATGGATTGAAGTCGTCGGGCCCAGAATTCCTTTCT 
TTGCCGTGGGGCCTGACA 
Sequence 2388 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACGCGGGGGGGGGG 

GGATTCTTGATCCATGCACAGCGATGTGAGCTGAGGTGCAGGCACCAGACCTAGGAATTC 

CTAGAAAMTAGTCAGGAAGCATTTAGACACATCAAATGTTAAACGAGTCCTGATTATGA 

TGATAATGATGATGATTTTGGTGGTTGCAATAGCAAANCCTTAAGTATGAAGGAGACGTG 

CCAGCTGGAAATACAGGTAGACAATGAACAACTGAATTTAGAGGACGAAGACATTGAAAG 

CATTGATGCCACCAAATTGAGCCGTTTCATTGAGATCAACAGCCTCCACATGGTGACAGA 

GTACCT 

Sequence 2389 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I M I I 1 I ! M 1 1 I I I T 
TTTTGGGGAGMGATCATGCATTTCCTAGGACAGGACTAATACATTTAGCTGTGGCTATT 
ATCAAAGAGCACCAANACCTTAGGTGGGGCAACCAGACTGGCTATGGGGCTCGATGTAAG 
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CCAGACCCATGACCCATAAACTATTTCAGGTAGGAAGTAAGCAATACTATAACTTGGACT 
CGAGGGGGAGCAAAAGGGCACCCCTCAATGGTGATGGCTGCCCACTTGGTCTCAACACTG 
TAAAACTTGGGACTTGCT 
Sequence 2390 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTC 1 iTTTTTTTTTTn 

rnrm iggggggattacacaatnnaaaagccccaaggatattcatcaattctattctt 

TTCTCTTTATGACTATTTCCTTCTTTATCCTAAATTGTCAAAAATCATCMGGGGCATTC 

TGATACANMGCACTCTGATACANAAACACNCANTTAACATGAAAATTAACGGTTAACAA 

AAMGGAAAGTTACAGAGGATAAAATACCTTTTAGCCTTGACCCATTTTACTTCTTTCT^ 

G GGC ATAC AG ATAC ATG CTAATATCC ATACCTTGCTAAATATAAAAATACCTGAGAGTAG 

CTGGATGGAGCCCAATTTTGTATANACAGCCTGTTAGTTTCTCCATCTCTATATGCATTG 

GCATATAAAAAGTAAGAGAAAACTAAACAAGTTCTTTTTAAATGTMCTTCC I i I I IGAA 

AATATGCATATAAGACTATGCATTTTMTTTTCATGAATACTGACCTTCTGTAAATGCAA 

ATACCCATTTTCTACATTACAATATTACCCAAGGGGGAA 

Sequence 2391 

CGAGGTAC I I I I I I I I I I I I I I I I I I I I M I N G ATG CTATTACTG AATAACTCTTTCTG A 

GATGGTTGTTGTATAGTAAACCATGAGTCTGGCATGTCTAAAATTGTCTTCATTTCTTTC 

CATAMTTTGAATGTTTTGCTGTGTAAGTCATTCAAAGTTGAAGACAATGTCCCATCTTT 

TGACAGAMGGGAAGGAAGGATATGTGCTCAGCCTGCCAGTTTGACCCACTTTTGAAAAG 

GGCTTTGTGGTCTCTACCAATGAGAAGTCTGGTGTCAATGTGATTTTCTTAACTTTATAG 

GMTTGTTTGTTTCCCTCTTTTCTGAAMCTTGNGCTTTCATCAGGCTCTACCAGCCTTG 

ATACTCTGTCACTCACTGGGCATGGTACCTGCCCG 

Sequence 2392 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTACTCAC 

CCTTCCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGGC 

ATCTGAAAGGACCAAAGAGAGTGACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACA 

GCTTCCATGAAAGGCTGGACCACCCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCT 

GCCATAAMCACACCACAGACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAATC 

CTGGAGGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCTCT 

TTCCTTCTC ACTGTGTGCTCTTTCTTGTGC ATGGAG AGAG AG AG CATGAAC AAGC CCTCT 

ACTGTCCCTCTTAGAAGGGCACTAATCCCATAATAAGGCATTNCACCCTTCTGACCTCGG 

GCTAACCCTAGTTAAGTACCTCGGCCCGCTCTAANAACTAAGTGGATCCCCCGGGCTTGC 

AANGAATTTCGATATCAAAGCTTATCGATACCCGTCNA 

Sequence 2393 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAACAGGCGATATCTTCACCATGCG 

CACAATGAAATCAATACTCAGAGAAGGCAGATAATTCTCCACGAAGCCAGAAAACTAATA 

AATGAACAACTTGGGTGAAATGTCCCACCAGACGGTGTGATATTTAGTAGCCCAGAAAGC 

TGCCAAGGGGTTGAATGACACTATCTGAAGATATGAACCAGTTTGCTCTCCATAGGGAGG 

ATTTCATCAACAGGAAACAGATGCCTGGAAGGCATTGGATTTGCTAAGTGTCTATGCCAT 

ATGATTCTGCTGTTTGCGTTTGATTTAGAATGGTGAGCTGACTCAMGTCAMCTATTTC 

ATATGTTGAAATCCTGACTCCCAG CCTGAGGGCATTACG AGGGGCTCTGGGAG GTGATTA 

GGTCATGAGGGTGGAGCCCTCATGAATGGGATTAGTGCCCTTATAAAAAGAGGCATCAGA 

GAGATCGTTTGCTCCTTCTGCCATGTGAAGACACANCAATGGATGACTATCANGGTATTA 

AAGAAAGAACA 

Sequence 2394 

CGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGGGATGAGTCAGGGAGAGCTAGTGTG 
GTAGCAGTTTCTAGAGCTGTTTTCAAGGMTGGAMGAGGAGTGGGGAAAGGATTTAGGA 
TCTATGGGGTTGGCTAGGTTTCCCTTTCTGAGTTTATATAATGGTTTCGAGAATAAATGT 
TGMGGAGCANGAGGGTGTCTTGTTGAGAAGATTTAAAGGAGGGGCTACAAAGTAGAAGG 



Page 351 of 635 



WO 01/51628 PCT/US01/00798 

Table 1 

TCATCAATATATTGAATAACGTGAGAAGCACAGGGGTGGAAAGAAAGTAAATCATGAGAA 

AGAGCTTGGCTGAAGGAATGAGGGCTGTCCCTGAAGCCTCGTGGCAGTACCTCGGCCCGC 

TCTAANAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnTC GATAT CAAACTTATCCATACC 

CGTCNACCTCGANGGGGGGCCCCGGTACCCACTTTTTGT 

Sequence 2395 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAMTTC 

CCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAAT 

AATGGTTAATAAGATGGCGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCA 

GCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATC 

CATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTCTGGTTCTT 

CAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAAGTTCTCCTCTCTGGGTTGCTG 

TTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAGCGCCGAGGCTTTGG 

CTCCTCCCCAGCTGCTGCGAGCTTCCCACTGCCCCGCGTACCT 

Sequence 2396 

tggcggccgcccgggcaggtac 1 1 i 1 1 1 1 1 i gnttttttnantcntttttanaacaaaca 
caaaaaagttttamaaaaamgagtttgacamaamgmgaaataatattaaagt 
gacaccctgtttccttggaaacacaatgcanaaccacaaactctgagactrattatagcg 
aaagtattgtcattcagttcaatctaatggtatgaggttagctttgaaggccaaaagaaa 

CTAGTTTCTAATTCTTATTCTTCTCCCCACTAGACTTGTGGCCTAGGTAATTTTGTAATC 

tttctgagtctgttttctcatcaggagcaggataattcctaagtaatagatcatatgtag 

AAGTGAATATGATCTCATGGCAGAGGACATNAACCATAGTTAATTATTAAGAATATATTT 
ACATTGAGCTATCCTTTTATNTACTTTAAAAA 

CC^GGGNTTGGGGCCGGGGTCNCGGGGGGAAGGCCAACCCCCCAAAAGGGGGGTTGGCCC 

GAAGGGGGGCCACCCAAAGNTNNCTTTGGNTGGGGGGCCGGCCNTGGTTGGTTGGGGNCA 

ACCCCAANTTTNCAACCNAGGNCAAAAGGGTTCCCAAATTCCTTNTGGGGGGGTNCATTG 

GTTAAGGNTGGGCCAAAGNCCAAAGGGGAAAGNCCGGNGGCCCGGCCCCCTTTTNCCTTN 

TGGAAAAMTTCCNTTCCCAACCCCTTAAAGTTNGNAACCCAAAACCCCAAACNGGNAAA 

ACCCTTTTGNAAAGGTTGGGTTGGGTTGGGGGGTTCCCCCTTTTGGGCCTCCCAAAGNGG 

GAAATTCCCTTCCCCCTTTCCGGTTTTGGGCCAAATTTAAAAAAAAAAATTGGGGTTNCN 

CCMGGCCCTTNCCGGGTTNTTAAAAAAAAGGGGGGAATTTGGGCCCCCCCGGGCCGGGA 

TTAMCCNCCNTTTCCGGGGGGGGNCNCCCNGNNCGTTTCCTTTTAAGGGAAAAACCCTT 

TAAAGGTTTTGGGGGGGAAATTTCCCCCCCCCCCCCGGGGGGGGGNCCNTTGGGCCNAAA 

GGGGGGGAAAAAANTTTTTNCCNGGAMTTTAANTTTCCAAAAMGGGNCCCTnTAAAT 

rrCCCGGNAATTTAACCCCCCGGTTTTCTGGAAACCCCCCTTTCTNGN AAAGNGGGG GGG 

GGGGGGGGGGCCCCCCCCGGGGGGNTTTTAACCCCCCCAA AGNNC Ci rTTTTTI iGGGTT 

TTTTCCCCCCCTTTTTTTTAAMGNTTGGGGAMGGGGGGGGTTTTTAAAAAMTTN7T 

GGCCCGGNCCCGGNCCTTTTTGNGGGGCCCGGGT 
Sequence 2398 

ATTACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCTGGCAGGG 

CCAGTGGCAGGAAGGGAGGGACAAGTGGACAGTGGTGTGTCTGAATTGTAGAGTGTTAGA 

TTCCAGGTCATTCCCATCTGCTCAATTCCATCAGCCGCAGAGTTCAAGCTGTCATCTAAG 

CCCAGGGAGGTGTAAGGAACAGGCCACATCTGGTTCTTGGAAGTATATGCCCAGGCATTT 

AAAGCGGATCTTTCCTCTTTGGGCCATGGGTGGATTTCCTGCTCAAGCAACCTTCATCTC 

TTAAGCTTGTGTTGCTTGGAAATTCTGGGACT 

Sequence 2399 

AGCGGCCGCCCGGGCAGGTACGCGGGGGTTGGAGACCATTGCTCTATAGCAAGACCAGAC 

TTTGCCCTTCCTCCTCTCAGCCTACTCAACGTGAATATAATGAGGATGAAGACCTTTGTA 

ATGACCTTTTCCCACATAATGAATAGCCATTGGGAGACACACTTCTGAACACCACCACTG 
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GAAAATCACACATGCTGAAATGGGAGAGTTCCCTGACCCCCTTGCAGGATATGTGACAGG 
AGTGTGGCTCATCTGTTCAGCTGGAGTGCATACTCAAACCCCTTATGAGACAAGGAGTAT 
GCAGACAGGAAGGTGCAGGGAACTGGGGAAGCAAAATATAAACTAGTTAA 
Sequence 2400 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I N I I 1 1 I J I I I J 1 I I II 
I I I I I 11 1 IGTAGAGATGGAGTCTTGCTGTTTTGCCCAGGTTGTTTTCGAACTCCTGACC 
JCAAG CAATCCTTCTGTCTCTGCCTCCCAGAG TGC TGGG ATAACAAAC ATAAGCC ACCAT 
G CCC AGCC AATCCTC AGTTATTTTCTAATACTAG ATAATCTAATTG CTC ACTTATTAC AT 
TATTTTTCCMTTAMGGNGNMGATTTAGACATTAMTTTAAGTTATAAGTC 
GAACACCTAATTGNAACTGNMCTAGMCAGGGGATCATTTGGTGACTATTGGAAAGTTT 
CAGGTGGATTTTTAAATNATATTTTCAATC 
Sequence 2401 

CGTGTCCTCAGATCTCAGGCTGCTCAGCTCCATGTAGGCTGTGTCTGTAGATGTGTCCTC 

GGTCATGGTGACTCTGCCCTGGMCTTCTGTGCGTTGATTGTTTCACCATCTTCAGGATC 

AAMCCTCCCTTCCACTCAAGCCCTTTTCCAGGGAGCCTGTCCGCACCCAGCTGNCATGG 

GATAAATTTCANGTGAGGGGTGTTATTCCCGGAAAACCTTTGGCAGNGNAAGACCNTTCA 

ACTTGGAGGGCCCCCAAGGGCTTTNTTTCACCCTTCAAGGCCCCCCAGGAACTTGNTACC 

CTTCGGGGCCCGGCTTCCTTAAGTAAACCTTAAGGTTNGGGGATTCCCCCCCCCCGNGGG 

G NCCCTTGCCAAAGGGGGAAAATTTTTC N GTAMTTAATTTCC AAAAN GCCCTTTTTAAT 

TCCCGGAANTAACCCCCGGTTTCNGGAAACNCCTTNTNTAAGGGGGGGGGGGGGGGGGCC 

CCCCGGGGTTTACCCCCCAAANCCI I I I I I I IG 

Sequence 2402 

TCCACCGCGGTGGCGGCCGAGGTACGCGGGGAGGATTAACAGAAGCAGATTAACCTCAGA 

AATCCTGTCTGGCTGGCAGATTTCMGTAAAAAAAAAAGGTGGGTTGGGGGGACCCTTTT 

CTTTCTAGTTGTCTTTAAGGGAAAATTAATTTTAC I I i I I I I I I GTTCTGGCCCGAAATT 

TTNTATGGAGNATATCCTCNTCACCTTGTCTNTCCACTTTTGAAACNCGGTTAAAAGGCT 

NCATCAGCCTGGTCCAGCTTCTGGAANTGNAGGGAGNGGGGGAAGNAAGCCCCCNTGGGG 

GTCTTTTC I I I I IGGAAAAGGGNAATTCCCGCNTTGGCTTTGNAGGGGGCTTGCCCCTCC 

CTCATTGGGTGGTGGCCGTTGGTTCCGNTTTTCCTTTCTTTTCNCTTGGACCCGCCATT^ 

CNTTTTGGTTGGNATTAATTCCAAGGNAGGGGGTTTAAAACCrrTATT^ 

GGCCCAATTTTCNCAAAGGGGCCCGGG 

Sequence 2403 

AGGTACI I I n I I II I I I I I I I I 1 1 I I I I I GTTTMTTAC ATTTATTTC ATTGAATAATG 

GATTGTCAACAGTAGTTATAAAATAGTGCATGGCTTAATGTGTTAATTATCATAAGTAGA 

ATCCACAATTAMGTTTGATATTTGCTGMTACTTTCTCTGCAAGGATATAAAAATTAAT 

C ATCCAC AACCTAATTMCTTAAATTTCAAATTAAATTC ACATATG N GGGTGTTTTACTA 

TCTTTTGCTTTTCGGTAATGTTGGTGACATTTTAGTAATGCATTATGAACAGTTATTA 

CMTGTGAATTAATTGGGTAATAACTTTAGTGTCACCATGTTTAAGGAACTNTTT^^ 

A 

Sequence 2404 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGATGCTGCC 

ACCTAGGTTACTTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAAC 

ATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTA 

GAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCAGCAA 

TGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGC 

CAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCCTGTCTGGC 

AACCTGGNAAAGAGGACCCTTTACTTAGNAAGCTAGC 

Sequence 2405 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTATTATTCTTTCCTTT 
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TGTGTATGTTTGAMTTTTCCATTATAAAAAGGACAAAAAAAAAAGAGAGAGAAAACTAA 

AAAAGAAAAMTAGTGTTGCGCTTGTTAGTAAAACCAATCTTMGGCCAGTTACTAAAAA 

GTAMCTCCTACMTTTTGAATAGCTATTTGCTAATAC AGTG ATTTGC TGCC TTCTTG G G 

GTTAGTTTAGCCTGTCCATTCCTTATTTAACTTGCTCCCTTCCTTCCCTTTTMTAMGT 

CCTGCCTAAGCTCCAGGCTCTATTTTCCACCTCTCCCCTAAGATTTAGGATTTGTTCCGT 

TTTAAGGTCAGTGGCCCATATGTCTTCCTGTTCTCC 

Sequence 2406 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGGACTCATACCTGAAAGCAGTGTT 

CAACCTTAGCAAAATCTCCAACCAGCGCATGAACAATTTTCTACATCACAACGACCTGGT 

TTTCAAATTCAGCTCTCMGGCCAAATCTTTTCTAAATTTAACCMGAACTTCATC^ 

CACAGAGAAAGTAATCCAGGACCGGAAGGAGTCTCTTAAGGATAAGCTAAAACAAGATAC 

TACTCAGAAAAGGCGCTGGGATT7TCTGGACATACTTTTGAGTGCCAAAAGCGAAAACAC 

CAMGATTTCTCTGMGCAGATCTCCAGGCTTGAAGTGAAMCGTTCATGTTTGCAGGGA 

CATGACACCCACATTCCAGTGCTATCTCCTG 

Sequence 2407 

CGCGGTGGCAGGCCGCCCGGGCAGGTACCCGTTMCTTCCMTTAACTAGTTTTGACAAC 

ATTCAAAAMGAGTNATAMCTNNGGCTTAATTTTAATAATCAANACCCTNCTAGCCTTA 

CTACTAATAATTATTACATTTTTGACTANCCACMCTTCAACCGGCCTACATAGGAAAAA 

ATTCCACCCACTTAACGAGGTGCCGGCTTTTCGTACCCCTTATTATCNCCCNNGCCCNCG 

GNCGGTNCCNCTTTTNCTTCCCAATAAAAMNTNTTCNTTTCTTTMNGGTMGACCTTA 

ATTTAANCCCTTTTCCTTTTAATTAT 

GCNCCNCTTCCCC 1 MINI 1 MCCCCNCCCTTTAACNNCNATTTGNAAGGNCNC NCTTT 
NNCNAAAAAACCAAMTCTTTANAAC^ 
NATTNGTGTCAMTTCCCCCTTTCCTTTTAATTTTTAAAAA 
Sequence 2408 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGCTCTCACT 

GCTCCCAAAAATGGCGGACGCATTCGGAGATGAGCTGTTCAGCGTGTTCGAGGGCGACTC 

GACCACTGCGGCGGGAACCAAAAAGACAAGGAAAAGGACAAGGGGAAATGGAAGGGGCCT 

CCAGGGTCTGCAGACMGGCAGGGTAAGGAAGAAGTTCCAGTTGTTGCTTTTATTTTTTT 

TCTCGGTGGGGAGGAAGAAAACACTACTCTCTTAGGAAGAGCACCGAGAAAGTGAAGTGT 

ACCTGCCCG 
Sequence 2409 

TANCACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACT 

1 11 H U I I I I i H 1 U I I 1 1 1 I I CCTTTNAATGCTGTTCACTGCTACCCATTTAGTCAA 

CTTGGACCACCCTNTANAGGGGCTTCAACTGTTATTTTCAATACATTGTCACAGGGACAN 

AGGAGGGTGTTAGGGAACTNTGTGACTAAAGAGTTTTCCATAAATGGGTGGACCTCANTT 

TACAAGCATTAANTTATTGGGCCAGGAGGGGGAACANATCATGATTAGGAGAGTGGAAAA 

NAMGATGACAAATCACTTTAAGTCANACAAACCTTTTTNTNTACCTCGGC 

Sequence 2410 

GTTTACCCGNCCGGGGGGGGNGGAAAACCATATNGTTGGTTGGTCCTNTTAGGGAACCCC 
TTGGTACCCGNCTTGGGGGGtiAAGGGGGAAAGGAATTGGCCATGGGCCCCAACGCCTTAA 
GGGTGGTTTAACCATTGTNGGTGATGGGGGMCCCCCCTTTAATTANCCGGGGGCAAAAA 
CACNTANACTTTTTTTCTCTCN^^ 

ATTNCTCCTTGCCATGGGNANAAAAANTGGGAAGCTACCTTAAATNATTTGGTTCCANGG 
CAAAANTAACCCAACCMTTNTTTTCCACCCCAACCCC^ 

GGTTAATGNAAAAAAAAAACCAATTTANATNGGAAAAAGGNAAAGGAAAAANAACATANC 
CACTTTTATAANAAANAAANATTTGGNAACCCAAAAN^ 

TTGGGGGGAAAAAGGCCCAAAAATTGNGTTGGAAATTGGGGGTTGGl mil \ AGGGGGA 
AGGNGGGTTGGNGGCAAGNCCNCCTTTTTCAAAGGAAAAAGGGGTTNATAAATT^ 
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AATTGGCCCCNCCCTTTTNGGTTATAAAGGAAAAAGGGAAAGGGGGCCCCCAAGAAAAGN 
AAGGCCCTTTTGGGCCGGCCMCCCCCTTTT^ 

GGNAAGGGGGNAAGGCCCCCAAAAAGGNAAAAAGNCCNCCGGGGGCCTTTGNNTTCCTTT 

GGCCCAAANCCCCTTTGGNCCNTAANGGAAGGGGGGACCCCCC 

Sequence 2411 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATG 

AAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTG 

ACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCAC 

CCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAGACTT 

GGTGACTTAAACCACAGATATTTATCTTCTCACAATCCTGGAGGCTGGGAAGTCTGCAAT 

CACCGGGTGCCCAGCCATGGGTCAGGTTCTGGGTGAGGGCCCTCTTTCCTTCTCACCTGT 

GTGCTCTT 

Sequence 2412 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI M I I M i I M I I I I I 

mrm i attctttgggcagctaamtctaggatggtgttmgtttcttcattttgtca 

ATTATATATAAAAAATTAGAMCMCATGMTCTGCATTTCTTGGATGAGATAGTTAATA 
ACAAACTATTTCTCMTATTTGTATACTAAAAACTAGTGAAGGNGTTATGTGT^ 
TCTTATCTCTTAT7TGAACATGGGTTTCTGAAAGGAGCCTATNTAATAATATAAATGGTA 
TGTAGTAAATG AGG GCACTGTCTTGG CTGGG ACTGGCTATTAANAAAAAN ATTACCATTA 
G 

Sequence 2413 

rrTTTTTTTTTTTTTTTI I GCTTTTCTTCTTCTTTCCCTTTCAGGCATCTCCTCGTTTTA 

TTTTCTGTAACTCACAGAGTGCTGCAAMTAATGTTAGGGCmGCAGmATTCCTTGC 

GATTTAACTGCATAAATATAAAAAGTGCCTGAGCAATAATGCACGTTGGGTTATTCAATA 

AC AC AAG G AATGTAAAGCCG ATTTC ATATCTCCTTGAATC AGG AAC NTTAATT AATC ATA 

TGATGAGAATGCNACGTTTTNTTTACCANGGNCAAGGAGGCTGCCAAGAANCTTTCTGAG 

GTTTAATAATTGCACCCNCTNTGGAGGGGAGGGAAAAAT 

Sequence 2414 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 

rTTnTTTTTTTTTTTI 1 GGGCAATACCAGCTTTCACCCTGTCCTCTGCACTGCTTAGCC 

TTGGCTCTGCTGAGTTGNGCAACACATTCTGAGAGATGCCATACAATGCTCCAGGCAAAC 

TTATAGAAAAACAGGAATGAGTGATTTTATACGGGATGTGTTTCAGCTGTCCAATTCAAA 

ATAMTCACGTCCAGCTGCATTCTTTCCTMTTTGNGACCCAMCTGNTTGCACTTTATA 

CATTAATGAATTATTTTTTTAAAAAAGGAMTTAGCCTCTATCATGGAAANACAGTGGCT 

NTCTAACCCCTTTGNGGGTCCTTTCTCCCTTTAAAAAGGC 

Sequence 2415 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTTCACAT 

TTTTTGGAAGAATTCCACAAMGAAGAAGAAAAGGAAATCTAATCTAAAMTACTATTT^ 

GGAGGGGATAGCAGATTGAAMTGAGMTTTTGAAAGTTTCAATACCAGTGGGGAAAGCA 

TTTGCTAGAGTTTAGAAGGACAGCATATGGTACACCAGATCACGAGACATCGTTTCATAC 

TTCCCAAATAGTTTTATATTTTAGCTTTGMGGTCAGTTACCAGAGCCAMCTTGTTCTT 

MCMGCAGAATTTTATGTCCATTCAAAGAGTCTCTTACACCTTTCTGGGCCTATTCACT 

TGCAGAGAGGAGTCGAAACTGTAACCAG 

Sequence 24 1 6 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTNTTTTNl I I I I I I I I I I I 
TTrmm rACTACTTTCTTCmCTTATTTTCTCATTGGCCTCAGGTCTTGGAATATCT 
TGAATTCAAACTAATAATTCACCTCCTTCTCAGCACTC TCAATAT AGTCTTTTGCCAACA 
CCTGATGAGACATATAACTTATGACCATTGCACTTCTNTTTTTTTCCATGGTTTGCATTT 
ANGAGGTAATTTTCTTCAGCTATTANAGCATATCCTTCCTCATNATTTTANANAAATACT 
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TGGTCAAATGACCTTTTAAATAAAAG 
Sequence 2417 

CTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTTTi 

((Mill NCAAATGTGACAGGCATTTATTTTMCAGAGAGGAAGGGGAAGACAAAATATC 

ATAAAATGTGTGGGTTTTCCCCANATGACTCTGTTGAAAAAACCAnGAAATGCAGAGCGG 

GGGGACAAATGGAGGAGAGATNTTAACTAGAACATTGCTGACTCAAGGCCGTCCAGCTTC 

AGNGGGAGTTCTGATTAGGCCCCATGTNTGCCTGATGCCACCTGCAAC 

Sequence 2418 

CCGCGGTGGCGGCCGAGGTACGCGGGGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAA 

MTGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAG 

AAGAGAAAAACATTAAAAAATGACMGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTT 

TGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGC 

GCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGGAGG 

ACCCTCACTAGAAGCTAAGCCCA 

Sequence 2419 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGTGGGAAGCTCGCAGCAGCTGGG 

GAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATG 

AGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCAA 

GGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCAA 

CCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATCA 

TTAGCTGCTGTGGCTTGT7TTAGAAGCCCTGGGAGCCGATCTATGTTTAACACCCATCTT 

TACTAAACCATTATT 

Sequence 2420 

TGGGGTAAGNCTTTCTCANCTCCGTCCGNGNTTGGGTCAGGGTCTCCGTCTCNCCGGGGG 

TCCANGGGTTTACCCGTCCGGGGGGGGGNNACCNCTTGGTACCCGCCTTGGGGGGTAGGG 

GTAAGTAATGGCCTTGGTCNCCACCCCTTTAAGGGGTNTAACCTTNTGGGTTANGGGGAA. 

CCNCCCCTTAA1TACCCGGGGCNAAGGCTCCTTCNCCTTTTTGGGGCNCCAAGGGGNAAA 

ACCTTAMTTNmATTNANAMCCMTn*CCCCTGGGCCMGGGGGAAAAAAMTTTGG 

CCAAGGGTGGGANAAGGTTANGGTAAAAGGGAAGGTAACCAANGGGGGGGTAATNAATTC 

CTCCCAAAGNAANAAGGGGGTTNTTANTTTGGCCCAAAATANTACCCAMTTTCTAATAN 

GNGAAGNAAAAAGGNMTTTGGAMGGGMGGGGGNAAGGTTCCrrNMTTTMTTGGGT 

TTCCAAGGGAAAAATTAANCCAACCCAAAT I (TTTI I CNNCCCCAAACCCCCCTN7TTGG 

GCCCCCCCCAAAAACCCMGGGTTGMNGNTAAAAANAANAAAMCCCCMTTTTTNTNN 

N N N AC N AC AAN N ACN ATATAATATCATN AN AAGAATAAN ATN ATTAT C ACATN AN N ACAA 

TNANGAGAANAATAATNANAANGANNGGGGN7TTNTCCCCCTTTNGGGGGNCCCCCCGGN 

CI I N ICCNTTTAAANAGAAAAAACCCTTTAAGGGGNGGTGGGGGGGGAATTNNCCCCCC 

CCCCCCCCNGGGGGGGGCCCTTTGGTCCCAMGGGGGGGNAAW^TTTTTTCCCGNNNA 

AmNNTTTCNAAAAMGGGNCCTTTTTTATTTTCCGGGGMTNTM 

NNGNGAAACCCCCCTTTCTTTAAAGGGGGGGGGGGGGGGG 

Sequence 2421 

TGGGGNCCGGGNGTCCGNCTTGCTTTTGCNCCGGCNTGTGCCCTTGCGGGGCTTGCAACC 

NTGGNNACATTCNGGCTTGGNCGTCTTTCNGGGGTNTCTGTTNTACNGAGACCTTGGGCN 

GTGGTCCGTAAGTCCGNGNTTATTNCAANGACATNCAAACCTNCAANATAGGGGGGCCGG 

GGGCTAANMTTMCCGGGGGGTTTTMTTTCTCCAACCAAAGGGAAAATTCCAAAGGGG 

GGGGGNAATTTAAAAACCCGGCCAAAGGGGNAAAAAAAGGAAMNCCMTTGGGTTNGNA 

AAGGCCAANAANAAAAGGNGGGCCCCCAAGGCCNANNATAANAANGNGGGCNCCAATGNN 

GTAAANANCCCCCGGGTTTANNAAAAAAANAAAGNGGNCCCCCGCCNGGGTTTNTGGNCN 
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TTTGGGGTCCCGGTTTTN I (Mil! I CCCCAAATTAAAAGANGGGCCNTTTCTCCCGGTC 

TCNCCGCTCCTCCTTNGGNAACCGGNAAAGGNCCCANTTTNCNAATCCAAAAAAAAAAAN 

AAATTCGGGGAAACCCGGGCCTTTCNAANAAAGGNTTCNAANNNGTGAAAGGGGGNTTGG 

GGGGGCCCGGGNAANAAANAACCCCCCCCGGTAACCCCAAGGGGGGGAAACCTTTAAATT 

TNATAANMAGGGGAAATTAAACCCCAAANGGGGGGCCCGGGI rTTTTTI ICNCCCCCCC 

NCTTTGGGGGGGNAAAAAAGNCCCTTTCNCCCCCGTTTCCGGGTTGGGGCCCGGCCCTTT 

CTTTCNCCCTTTGGTTNTTCCCCGGGAACCCCCCNTTTGNGCNCCGGNC I 1111 IACCNC 

CCGGGGGTAATTNACCCCCTTGGTTTCCCCGGGGNCCl I I I I I NCNTTCCCCCCTTTTTC 

GGGGGGG 

Sequence 2422 

gggcgaattggagctccccgcggtggcggccgaggtactcagagtgcacaatgtaagaag 

tcactcactcttgggcatgcgctagttcagggttggcagcgacagtgagtgccttcttac 

attttgcaccctggataccttccttgcctcaccctagactggcccagatgtctagtctac 

agntcacncttatcacttgcmmctamcaatgactacacttttnacttccaaggtt 

tctatattttmtccctatctat™ 

ATTTGATGATTCAAGAATNC 
Sequence 2423 

CCGCGGTGGCGGCCCGAGGTAC1 1 1 I 1 11 I I I I I I I I 1 I I II I 1 1 1 I AACTGCAACAACC 

TAACAGGAAAACTAATCAGAGATCATGAGGAGAGTGGTATATCAGTCAGGGACTAATCAG 

G AACATAATCCACTTGAG AATTTAAAATAG AG AGG GCTTAGGCTG GGGAGTTG GTG ACAG 

CATAATGGGAAATCTACAAAGCAAACAGGAATAGGGAGACAACCCAGAGCCCATNTAGGC 

TGGCAGAGTCATCATCTNTAGGCTGGAAGGTCAAAANAAGGTGGTGTTAGTGGAGCCAGG 

AGATGCACAGTCTCCCAGGGGAAGCTGGAGCTAGGGTGTACCTGCCCG 

Sequence 2424 

ATTATTTTGATTTTATATCTGAGTATGTTTNTAGCCAGAAGAGGAGAAGAAAGAGGTTAA 

AAAGCAAAAACAGAATTTATAGATGCACGATATTCTTCTGAACAAGAGAAAGAAACTGGT 

AGGAAAATTTATTTTAAAMGTGTTGAAGGGAAAGAATCAAGACCACAGATCCAGATCCG 

GAGATTATTTTGCTAAAGAATAGCAATTGTGAGGCATGAGGTGGGAGGGGGGAAGAAGCT 

ATGAACTTAATTTTGAGGTTTCTGAATAAGGAAACTTGAGTGAATTCACTTCAGATGCAT 

TTGGAATGTTNGCACTCCAGAAGATNANATTGTGTGTGCTCTGGAGAGTATTGGAAGA 

Sequence 2425 

Al I I I I M II I 1 1 1 1 I I I I I I 1 I I 1 NCAAANTTCTTTC I I I I I I AATTT AAATTG AGTTC 

ATTTATTTGAAACANACTGGGCCAATGTCCACAAANAATTCCCGGTCAGCNCCACCGATN 

TCCAAAGGNGCANTNTCAAG3MGGGCAGGCGTGATGGCTTATTTGT1TTGTATTCAATG 

ATTGNCTTTCCCCCCCGNGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCCnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCAATTCACANAACATNCCAACnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGGNCC 

CACCTGCCCCTTTTCCAATTCGGGAAACCTNTNGTGCCCACTGCATTTANTGNAATTGGN 

CCAACCCCCCCGGGAAA/\GGCGGNTTTCCTTTTGGGCCCTTCTTCCGNTTTTNTTCCCCC 

CCACCNCCCTTGCCTTNGNNCTTTTNGGTCGGGGGGNNAGCGGNTTTTNACCNCCCCCAC 

AGGGGGGNTATTTCGGGTTTTTCCCCCNNTTNNNGGGGGATACCCCCGGAAAAAANTTNT 

CCCCNCNCCCCCAAATNGTTTGTACCNTAAAAGGCCCTTTNCCGGGGTTTTTTAAAAGGC 

CCCCCCCCCCCC 

Sequence 2426 

CCGCGGTGGCGGCCGAGGTACGCGGGGGATGTGGGAGGATTGCATTCAGTCTAGTTCCTG 

GTTGCCGGCTGAAATAACCTGAATTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCTGGAT 

TAAAACTGAAGATCAACCTACTTTCMCTTACTAAGAAAGGTATTAAGCGCCTTTCTGAG 
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AGCTCTCAGTGGGCTTCCTAAGCACGTTCACTCTGCCTCCTTTAAGTCTTATATTTATGA 

TC AAG ATG AAAG GG AGGG ATGTATC ACTGCAC AG AGATTCTACAAGTGG ATATATAAAG C 

TAATAATGTTTTCAGTGAGCTCTTCTCTGGCAACTACTTTCCAAGTGCTTTCAGCATTCC 

ACCTTGAAGCACAACGTTAATAATGCACMTTTC7TATGTTCAGCCACAGAATGTTCACG 

TGTGATTTGGTTACMCATATTGCTCTATCCAGGCCTCTMCAAAACTGACCTCTTTCAA 

ATGGACTTGCAGTTGCTGAATTCCAAATCMCACTTTATCTTAAAAACAGAAAANGAAGG 

GAANGGTGGANGGAGGGGAACAAGAAANGGTCCTGNCCG 

Sequence 2427 

CCGCGGTGGCGGCCCGCCCGGGCAGGTAC I I I I I I I II I I I I I M I I I I I I CCTTAATAA 

GTGAGGCCAAGTGCTGTGTGTAACTGTGGGGTGGGCAAAATTAGGATTCCTGAATCACTT 

CACTTTTTGTTGCCTTTCTCTTGGGAGAATGGTTACCATTGCAGAGAAGCATAGTTAAAA 

TTGCTTTTGTGTTTCATAMCGMTTCATACTTAGATGTCATTGACCAACAACTAAATAG 

TATATCAGAGATGAAAGGCATCCTAGGGATGACCAAATCAAGCTCTTCATTACTCAAAGG 

AGAAAAAAATCCAGAAAGTGGATGACATATGTTGAACTAGTGATGGAAGTAGAACAGGGT 

AGTGTATTTTCTGCCTCTTAATTCTTCA I 1111 I ACACTAAACCAN ATATTAGGTATATG 

ACTAGGAAGATCGTGAGGTAAGGAGCAAAGGGTTTGGC 

Sequence 2428 

CGCGGTGGCGGCCGAGGTACGCGGGGGAGTCTGATCTGTCCCGGCCCAGTGTCCTCCAGG 

ACCCTGGCCCCTCATGCCTNCGTGCTTGCGCGTGTGCCATTTCCTCTCTCCAGAGGACCT 

TTCCTGCCTAGGACTCATCATTGTCCCCTCCCTGGCA! Mil I ACACCTGGAGCAGCCAG 

AGGACGCATGCATGGCTCTTCNGAAGCCTTCTCCTGCCACGGCATGCACCCACACATGCG 

AGCCTCCCGGGTACCTGCCCG 

Sequence 2429 

CGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGGAACTTGTGTGCTTAGACCTGACGCTGG 

GAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGG 

AACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCAC 

CTTG CC C AACAGTAG AAAAAC ATMG AAG AG AAAAACATTAAAAAATG ACAAGGAAGTTA 

ATGGAAGCMTGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGT 

AAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGG 

CTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTT 

GGCTTT 

Sequence 2430 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACAC 

ACACACAGGCACACGCAGGCACACACGCAGACACACACACACATAAACACACAGAGTTCA 

CTAGTCCGAGTTACTGATTTTCTTAGGATTCTGAAAGTGACAACACCGGAAACAAGGTAA 

TTCATGTTAAAACACAAGGGTTATATCAGTAAGAGATGGGATCCCCGAAGTAAACCGTGG 

AATTTGMTCAAGTTTCGMGAGCTAAAAAMGAAATTGGAGTTTCAACA 

TGAATCCTTAAGAAATACAGAAGTTCAAAATAGAAAACATTACAGTTTCAGGATACAAAA 

GTAGAAACATCTGAGATTAAGCCTAC A 1 111 1 AAAAAAAGATATTGAAAATTACTGTGCT 

TGTAATACCAGCTTTTCAAACATCAATACCAGTATTGGCATTACCTAGATCAAATCTTTT 

CATCAAAGTTGAAAAAAATAAAGCATTGGGGATAGAAGGACCATCAAAATGTCCAATATT 

GAAATATTGGTCTTTGCATAGTTCCATTAAAAAATTTACCTGCTCTCAATGGTTGGTCAT 

TCTTCAATTTC AAN GGCTTTATTTGGMC AG ACTTTTGC ATTTTCAATGGC AG GCTTAAT 

GGGGGTGGG 

Sequence 2431 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I I I 
I I I I 1 I 1 I I IGATGTGTAGCCTTGGGTAAACACACATCCTTCTGTGAAACAGTCTCACAG 
AGACTCTCAGAATCCCAAGAATTTTCTTCAACGTTCTTTTGTTTTGATTCT 
ATCTGATCTATAAAAGGTTAATCACAGATACATTCATGAGAACATTTCTCCACTATACAT 
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TTTAAAAACATATACAMTCTACTTTCATTCACGAATCTGTGGTTCCTCCTCTGTAGCCC 

GTATATTCTTTTTGTTGATTACAATGACTACTTCCTATAGTCAATCCGTTAGMTTGAAA 

TCTGAAAAGTCTGAAAATTAACATTATTTCATTAGAATGCACAAAAATATAAATTTGACT 

AACTTCTATGAATTATTTCTTCACAAAGCAGTCACATACTCTTCTACTCTGAAACAAGGT 

ATATCCTATATTTCCTGTNCATTCANAAMCTTTAATTACCTACCATCTCTCATTTCTTT 

GG AAA I MM 

Sequence 2432 

AGGTAC I 1 1 1 M 1 11 M 1 1 I I It ) I 1 I 1 U 1 NNCGNAAANACCACAAACCACCTNTGAAG 

CCMGAAAAAGACGNGATTAGAGGGAAACACATCAGNACCTTTANTCCTCCACTGACTTT 

GGAAAAACAACAAAATGCCAGCTTCCACCANATCATTTNTTCTTAATTAAMGTTAGCCA 

GATTCTTTTGGAGCACTCMTTTrCTGGTCAGGTCAACAAAAMGTMTAAACATCAC 

ACACCTCCCTCCCCTGTC7TCCTAGCCCCATTCCTGGAGTTATGCAACCCTTGGGGGCGG 

CATACACTTCAGCTATTNGCTCTGNGTTCATAATTAGATTGNCTAGACAGGANATGTTGC 

TGGAGCAATAGG 

Sequence 2433 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTCTTCTAATTGATC 

ATATCTGCTTATTTTCCTCTGGATTAAGGATCAAGGAGATAGTATATTAGATGATTTGAT 

AMTTTCCAATGCTTTGCAAAGTAGTTGMCAACTTTTTCTTTGTGATGCTAGGTGGCAC 

TACTAGTCAAGACTTGGTATTTCATAGCTGGCTTTCTTATTCTGAAGGTTGTAAAAAGAC 

ATATGAAGTAATATTTMCATTGAAMCATGATMTCMTTTGATTATCTATGAATTGTC 

CTAMCGTTCAGAGTAMGTTTCCTTTA^ 

MGTCACATGTAMCACCAAMTMGAGGMTAGAGCMTAGGATATTTTTGGCTTTATA 
ATTCAATTTAAAAATTAAGGTGCATTTA I Mill 1 GGCNGCTGGCCATAAAGCTTCAATG 
TCCAGTAAGCAGTTGCTATCTATGATTCATGAAGATCATGGGGGGTTCCC 
Sequence 2434 

CGCCCGGGCAGGTAC I I II M I I II I I I I I I I I I I GATGCTATTACTGAATAACTCTTTC 

TGAGATGGTTGTTGTATAGTAAACCATGAGTCTGGCATGTCTAAAATTGTCTTCATTTCT 

TTCCATAAATTTGAATGTTTTGCTGTGTAAGTCATTCAAAGTTGAAGACAATGTC CCAT C 

TTTTGACAGAAAGGGAAGGAAGGATATGTGCTCAGCCTGCCAGTTTGACCCACTTTTGAA 

AAGGGCTTTGTGGTCTCTACCMTGAGAAGTCTGGTGTCAATGTGATTTTCTTAACTTTA 

TAGGAATTGTTTGTTTCCCTCTTTTCTGAAAACTTGTGCTTTCATCAGGCTCTACCAGCC 

TTGATACTCTGTCACTCACTGGGCATGGNATTTGCACCATCCTTTCAT 

Sequence 2435 

CGCGGTGGCGGCCGAGGTACTAGACGGGGCTGTAATCTGGTA^CTGTATGTATTTTAGTT 

CTTCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGAGGGAGGAAGGGGCCAGCTGAAATG 

GAAACAGATCCCTGATCCGGGGCGGCAGTGGAACCCTTCTTGGTGTGCGAGAGCCTGTGC 

ATTTCAGAGGCAGCAAAAMGTAAAAAAAAAAAAMTTGATCTTTGTTTAGATTAACAGA 

CCCCTGACTATGAAGAAGGAAGGCATCCAGACCAGAAACCG/JAAAATGTCTAGCAAATCC 

AAAAAGTGCAAAAAAGTGCATGACTCACTGGAGGACTTCCCC/\AGAACAGCTCGTTTAAC 

CCGGCCG 

Sequence 2436 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTGAGGATCTG 

AAACTTTGGCGACCAAAAGTCCTGATTTGCGTGAGGCACACCAGTTCACACTAGTTGTCT 

AGGTTTMTAATTAATAATGCTTCTTTTTCATTCTCAAAAGAGCCCTGATTT 

MTTATATGGTCACCCTACCTATAGCCAGATATTTCTGCACCAMTGCCTCGATTTTGGT 

GGGTGGCTTTAATTAAGGMGMTGCTTTMGTTTTATGACATAACTCTGTTGTTCTTGT 

CAGGAAGTGATATGTATAAAACTATTAATTCATCAGATTATATATAAGGAGATAGGGTGC 

ACTTAAAATTGCTAGGMGTTGAGATCTCTAGTTGTTTAAG TTGG AAGTMTTGAAGTAA 

CAGTTGGACAAGCTGTCATGATGGACATAGTAGAAGTGCCTTTTAATTAA 
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Sequence 2437 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCATCCCCAAGGACA 

CGGAAGTATTTCTCATCCTGAGCACTGCTCTCCGTGACCCACACTACTTTGAAAAACCAG 

ACGCCTTCAATCCTGACCACTTTCTGGATGCCAATGGGGCACTGAAAAAGAATGAAGCTT 

TTATCCCCTTCTCCTTAGGTAAGCTGGACCCACAGTTTCTTTCCCAGACACCAGAGGGCA 

GGTCCTATCCTCAACTTGAGAAAAAAAATGACAGGTCCTTATTAATTGAGCACTTAATAT 

ATTCCAATTGCTTCACCTGCCTTATCCCCTTCCATCTTCACTACAACCCTGTAAGGAGGC 

TTGAGAAAGAAGATTACATTCCCAAAGGCACATCTGGGCAAGCAGGAACTTGGGCAAGTA 

TTTTAACATCTCTAMCCTCAGTGMTTCATTTTCT^ 

ACCGNTGTAAGCCCAGTGCTGTACCTCGGCCGCTCTAGAACTAAGTGGATCCCC 
Sequence 2438 

CCGCGGTGGCGGCCGA GGTAC IT rTTTTTTTTTTTTTTTTI 1 GCTGATATTGAGCCATAG 

AGCCAAGAAGATTGCTAATTTTGMGTGTGCTTGGTCGGGCTTTCTACTGATGATGTTGC 

TATTG ATTCAAAC AGTCACMGTCTTTAGG ATCATTTTTAGAGCAATG GCAAGAGTAGG G 

CTGAAATGAAAATCAGTCTGAGATAATGAACACTCCTGCTCAGAGATCATTATGGGCTGA 

GCTGTGCTGGGAAGGCACTGAGCCTTGGGGGGCTAACACTTGCCTCAGTGGAGTTAGCAC 

TCAGTAACCAGGCAATCACCTCGATCCAAGGCTTTCAACTCTCGTGGAATATCTGTCAGT 

TTGTCAGTTCAGCAMTMCAAGTGTTTTGTTT^ 

TGGACTTTTAAAACAGCTTAATTAAAAACTATC 

GGCAAAGACCACTGGGTAGTTGGTGAGAAGCTTATNAAAACAGCTTGCTTTCTCCTGAGG 

GGGGAAGGCTAACNTCCATGGGGGTATGGGAC 

Sequence 2439 

CCCGGGCAGGTACGCGGNGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATAC 

GGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGT 

CAGAATACACATTTCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAA 

AAMTGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTC 

AGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCC 

ATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTA 

GCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGTATATGTTT 

GTGGGTTAAGTCMTGGTCTATGGTAATTTTTTTATAGCAGTCCCAGCCAAAGACAGTGC 

CTNATTTACTACATACCATTTATATTATTATATAGGCTCCTTTCAGAAACCCATGTTCAA 

ATAAAGAGATMGGATACTGAAACACATAACACCTTTCACTAGTTTTTAGTATACAAATA 

TT GGAGAAA TAGGTTGGTATTAACTATCTCATNCAAGAAATGCCANATTCATGGTGGTTC 

TAA'i MM l ATATATTAATTGAC AAAATG AAAAAAACTTAACCC ATN CTAG ATTTTAGCT 

GCCCAAGGAATGAAMGMTNAAAAAAAAATTTTT 

Sequence 2440 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGAAGAAGTTCCTTGT 
GCTATTTCTCACMTTTTTCTTT^ 

CATGAATTGTGAGTTAAAATGCAAGCAAGATCTTCATGCCTGTAATCCCAGCACTTTGGG 

AGGCTGAGGCAGGAGGACTGTTTGAGCCTAGGAGTTTGAGAACAGCCTGGGCAACATAGT 

GAGACCCTCATCTCTACAAAAAATTTAAAAAATTAGCCAGGCATGGTGGCTCGTGACTAT 

AGTCCCAGCTACTCAGGAAATGGAGGTAGGATGATCCCTTGAGCCCAAGAGGTGAAGGCT 

GCAGCAAGCTGTGATCACATCACTATATTCCAGCCTGGATGATAGAGTGAAACCCTGTCT 

CAAAMCAAATAMGCCTMGGGMTATGCAGATTAATGGMGAGAAGGCTATTTTGTGC 

AGACAGGAAACAGATGAMGACMTTTGACTTTTAAAACTATGAGTAAGGCTCATGCCTA 

TAATCCCAGCACTTTGGGAGGGCAAGGTGTGAGGACTGCTTGAGCCCAAGAGTTCGAGAC 

CAGCCCTG G C AAMCAGTGAG ACCCTATTTCTATN AAACATTTAAAATATTAACCAGGTG 

TGGTAATGTGCACCTGTGATCCCCAGCGACTCGGAACCCTGANGGGGGGAAGAATGCTTG 

AGCCCTGGGANGGCAAAGGCTTGCAATGGGNCCGAATNATGCCACTTGTANTCTTGGNCT 
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NGGGGACAAAACCAAGACCCCTTGTNTTAAAAGAAAAAAAGAAAAAAAGCC 
Sequence 2441 

ATAGGGCGAATNGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGTCCAA 

GGTCAGGTCCTGGGTGGTAAAGGTAAATACAAATTGGAAGGGCACTGTGTGAGCCAAAAT 

GAGTCAGATTAGTCATGATTCATTTCCAGTTTGGGTTTTGGGTGGTCTTGGAGAATGTTG 

TAAGCACTGCTTCATTGATAGGTTGATTGAGCCAGACTTTACTCAGCAGCCTGGAAAAGG 

AGAGATGGGCTCTGGGTTCTACCTTTGCTCACTGGTAAGTTGCTMGATTTCAGCCTTGC 

CCTCAAACCCTGAAGTAGTCCTTCATTCACACAGTGGGATCACTCGAAAATGTCAGATGG 

GGAAGTCCATAGGTTGTTACmAMGAAMTAGAAAAMTGCTGGAAAAGGmCTTCA 

ATTTTMTACCCATGAAGGCCCATGTTTTAGCTTTCCTCCGATGGGCAAACCATACACTA 

ACTTGGGCCTTGTAATCAACAAGCAAGGCTAAAAGCTCTCTAAGTGCTTGCTGTTTAAAC 

TATTTTGTGTTGGAAGAAGAGTTGGAAAAGAGGCAGTGTGGAGGTGAGGGGAGAAAGTNC 

CTNCCTCNAT 

Sequence 2442 

CCGCGGTGGCGGCCCGAGGTACGCGGGAAGGAGATGCTGCCACCTAGGTTACTTGTAGGA 

CCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAG 

TGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGA 

GGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAA 

GAGAAAAACATTAAAAMTGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTG 

GAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGTGC 

ACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACC 

CTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTG 

AGMGTATATGTTTGTGGTTTAGTCAATGGGTCTATGGTAATTTTTTTATAGCAGTCCCA 

GCCAAAGACAGTGCCTCATTTACTACATACCATTTATATTATTATAT 

Sequence 2443 

CCGCGGTGGCGGCCCGAGGTACTTGCAAGAAACCTCAGGACTTGAGTAACAGCAACATGG 

TTCTTCTGAGCTATGAAGGGGCCCAGATTTMGGGCTATTTTTGACACCCTAAATGTGCT 

GAGACAAGTCATTAAGGTGGTCCTGCCAGGACACAGCCATCTAAAGCAGCAATCTGCTTC 

TTGCCAGAAAATCTCGTGCCTCTGCAGAGCCTTTTCCAGAATGAACCACACCATGCTGAG 

GAAAGGAGAAAGAGACCACCTACTGCATTTCTGTCACTCGCTGAAAAGGACACTCTGTCA 

GAAMTCTTCTAGCAMCTTCAAAGGGCAAMTCACCCCTTGTTACTGATAAAGCCCAGA 

GAGCTTCAGCAGCTAACATTCCCTGGACAGGGCACAGCAAGGATTTGAACCTAGGTCAAG 

TCTGGCCAGAACACCCACAAGCTTTCCTTAACTCAGTGTGCTATCTCCCCACGACTAGGT 

CACTACTGCTTTATAATCACCTTTGTAGCCACCAGTGGATTT 

Sequence 2444 

GGTACATATTCTTACTTCTTGTGTAGGAGGCAAAAGGTATCTGCAGGGCTCTGGGCTTGC 

TAACCAAACCGTTAATCGCTTAGAAACAGC AN G AN NTN N ACTAGTG AG ATGTTTATC ACA 

TACCTGGACACGAGTTGACTTTCTGTGAGACATCATTCTGGAATGAAAGCCAAAATCTCT 

CTGCTATTCAGTTGAGGCTGAAAAAACAATCACATCAGAATTAGTAAGGCCCATNACAAN 

GAGAGAGAAATGAAAACAAGCCAGTATTCAATGGGGCTAGGGGGAAAATTACCTGGGCTT 

CAGGAATTCAGGAGAGTTGGGGTTTTGAGAAGGTAAAAAAGTGGGCCAAGTGCGGTGGCT 

CAAGCTTGTAATCCCAGCACTTTGGAACGCCAAGGCAGGTGGATCACGAGCTCAAGAGAT 

TGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAACACAAAAATTAGCC 

CGGG 

Sequence 2445 

CTNCCCGCGGTGGCGGCCCGAGGTACGCGGGTTTCCTCAACATGGCTGCGCCCTTGTCAG 
TGGAGGTGGAGTTCGGGTGAGTCACAGAGCTGGGGCGCCGTGGGGATGGAT 
Sequence 2446 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACAAAACTAAAAATAAA 
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MGTCAAAAATTGGAGATGAMTGTATGTTTTGAAACACATTATTTTGGATCTTAAAAGA 

TTTCCATGATA! MM I GAAGGTGTAATATTAAAATGATGTCTC I I I I I ICTGTCTCATT 

CCAAAGAACCATAAAAMGCAMTTAAMCAGTGATGAMCAAMCGCTTACAGGTTTGT 

ATCTCAGTGCATCTCTGTAGGATCCAAACAAAGCAGCCTGTGCTCTAAGAAAGGCCCTAG 

CTACTCCATCACCCGTAGCTGTAGACTGCTTCTTCAAGTTTAI I I I I CAAGGCCCCCGCG 

TACCTGCCCG 

Sequence 2447 

AGGGCGGCCGCCCGGGCAGGTACGCGGGAGAAACCGTCTGAACTATCCTACCCGCCATCA 

TCCTAGTCCTCATCGCCCTCCCATCCCTACCGCATCCTTTACATAACAGACTNAGGTCAA 

CTATCCCTCCCTTACCCATTCAAAATCAAATTGGCACACCCAAATGGGTACCTGGAATCT 

AATTTATTTCTAAAAATATTTTCCCCACCTGCCMGCCT1TGGTMTGAAGTG 

CNTGGATTGCCAAMTTANTGGGTTGGAAATMGGTTGGGCCGGMCTCTTCATTNTTAA 

AACCTTGGG GG GCCTTAAAAAATTAATTCCTC I J 1 1 1 GGAACCAAGNCC I 1 1 i H M MA 

ANTACNTGGGNCNCCTNCCTTTCTCNAMGNGTMTTNTNTNTTNTTAAGNNTTCCTTAC 

NTTTTTAANGNCCNTTTNCTTTGNTTGGGGNCNCCAMTTTGTTTCTT^ 

NAAANCCTTTGTAANGGGNANTTGGGGGAA 1 M M I 

Sequence 2448 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCGGCTTCTTAGCTTTAC 

GATGGCAACAAGTATGGCGGCTGCTAGTGGTAGATTTGAAAGTGCGAAGAGTATCNAAGA 

GCGGAAAGAACAGACCCGGAATGCCAGGGCCGAGGTGTTGCGCCAGGCTAAAGCCAATTT 

TGAAAAAANAAGAAAGGCGTAAAGAACTTAANCGACTTCGGGGTGAGGATACATGGATGC 

TACCTGATGTGAATGANAGAATTGAACAGTTCTCACAGGAACACNTCTGTGAANAAANAA 

GAAAGAAAAAAATGACCNAGCTTTTCNAAAAAAAGCNATGGAAANGGAAAAA 

Sequence 2449 

CGAGGTACTTACCTACACMGTTTTACTATCTTGGMTGTTCTAGAATTCCAGTTAATAA 

GGTGAAAGTGATAAACATGGCTCCATTTAGTTGTTATCTCTAAATGATTTAGAGAAGTGT 

TTGCCAAATTATAATGTGCATATGAATCACCCGGAGATGTTGTTAAAATTCAGGTTCTGA 

TTCAAAAGGCCTAGGGTGGGGTCTGAGATTCTGTGTTACTAACAAGTTGCTAACTGAGGC 

TGATGTTGCTGGTTCCCAGATCACACTTTGTTTAGTAAGGATAAGGGCAGGTTTCCAATC 

TTGGCTGCAATTAGAATCACTTGGGAAAATTAAAAACATCTATATCTGGGTCCCATCCTA 

CCCCCCAGAGATTTGTCTTTMTTAAAGTGGAATGTGGCCGGGTATCAGGATTTTAAAGA 

AGCTCTCCAGCTGATTTTGGCCGTTCAGCTCACCGTTTGAACATGACTGAGTAGAGAATG 

GACCCTGGAAGATGAAANNNNNNNNNNNNNNNNNNNNGTACCTGCCGGGCGGCCGCTCTA 

NAACTA 

Sequence 2450 

AGGTACTCTCCCAGTGACATGCTTCTTGACCACAATGGATGAACTGTGCCCAGCATGCCC 

ACTTTCCAATGCTCCACTGATCCCCATGTTTTGTTTCTGAAGGACAACCAGCCTTGGAAT 

MTGGCAAATACCTTCTTAAGTTCCTACAAAGTATGGTCCCTGGGAAGTTTATGTCTGTA 

AGTCAAACCTTGGGAAGTAACTGAGTTTTGATGCCTCTTCCAGCATCATCAGCATCATGC 

TATTACAATCCCAMCCATGGGGGTTTCTCACAGCTTTACACCAAAGGGCATCACTATCC 

CTCAAAGAGAGAAACCT 

Sequence 2451 

CCGCGGTGGCGGCCCGAGGTACTTTN 1 1 1 I 1 1 1 1 1 1 1 1 I 1 1 1 I I H I NACTGCAACAACC 

TAACAGGAAAACTAATCAGAGATCATGAGGAGAGTGGTATATCAGTCAGGGACTAATCAG 

GAACATAATCCACTTGAGMTTTAAAATAGAGAGGGCTTAGGCTGGGGAGTTGGTGACAG 

CATAATGGGAAATCTACAAAGCAAACAGGAATAGGGAGACAACCCANAGCCCATCTAGGC 

TGGCAGAGTCATCATNTNTAGGCTGGAAGGTCAAAAAAAGGTGG 

Sequence 2452 

TCAGTTTGGTTTGTTTGGTTTTGAAAAAGTAGG I M M M i I I AGTATGTGATGAGCTTA 
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TTCTTAAGTGAATTAATAAATAC ( I I I I AGTATCTGTTTTAATTCTAACAGTATG AGTGT 

TGGTAGTTATATTCCAAATAAACAAAAACTCTTAGAGTCCTCAGTAATTTTTAAGACTGT 

AAAGGGGTCCTGAAAATGATTAAATGTTTGAGAATAACTGCCTCAGAACATTGTTCTTCA 

ACC CTAATCAAACCCAGC ACGGCATTTATATAAAAAATG ATTTTATAATGCCCATTTG CT 

AATCCTGAAATAAAMTACATAGGTMTAAGMTGACTTANCCAAmCAAAAATATCAA 

TGGAGGGAATACTAGTGTTTGGAGACCTAACTATACAGGAGAAATAAAAGGAAAGTCATT 

TTTTATAAC ATTATGC ATG CATGTATAAAGCACTCAGATCTGATGACATTG G G AGAC AC G 

GTGGCATAGCCAGATCCTTACACCTGTTTGTGGACTCACCATGAATACAGCAAGTGGCCA 

MGCAMCCCCCNCGCTATMCTGATAGGMTTTCATTGGCTTNGGTGATGGTCATTTCT 

ANAATGGTAAGAAAATCTTGGTAAGGTTAAATGAACCAA 

Sequence 2453 

CCGCGGTGGCGGCCCGAGGTACGCCACAAGGCATTTAATGCCCACAGTAACAGGGCTGTT 
TGACAGTGGCAGAAGAGGACGGGACTAAAGTTACTTTGTGCTGAGAGGGGGAAAGAAGCA 
CAAAGTTTGGTCTGTTGCGTAATTGAATTTTTAACACT^ 

TCGTGTCCTGCTGTGTAAMGACATCAGATATATTACAGATTTTCAAACAGGCAGGTATC 

AACATTTACATTGTMTTCMTAGACGCTACTACTACAAAG GAGCTTT ATTCTTCCAACT 

TAATATGGTTGCTGCGGGAACACTGCAGGATGAAACTGACTTTTTTTGTGGTMTGTTTC 

TCCTGGTTGGGAACAACGCAAATTAGCAAAGGCGTAATTrATAACCGCTGCTACCAAAGA 

GTGAAGTATTTGTGGAAGTTCTGACTGGAATCTAACTCGTACCTGCCCG 

Sequence 2454 

CCGCGGTGGCCGGCCCGCCCGGGCAGGTACTTTN I I I I I II I M I I II I I NTGCTNGGGG 

NCTTTTTCCCCTAGGATTTCTCCAATGAACACAAAATGCTTTTGTAAAAMGAAAACAT^ 

ATAAAACACATTGAAAAAAAAGTAAAAGTAGGTGAAAATGTCMTGAAATTCTGGTTTGT 

AGAACCATCAAAACTTGGAAMGTTTAATAAGACCACCANNAAGAAGAAGAAGGGCAAGT 

TTCCAAACTGATTTCACAGGGCCAGGAGGGCAGGCCACTGANATAGGTAAACCATGGAAG 

CACAATCAGTCCCATCTCATGGATATTCTACCTGGGGCTTTTACTTTGCGTGGCTCAAAA 

GCATCTGAGTNTTANACCAACAGAAAGATAACTCCNGAAGAAAGATGTAATAGTGTCATA 

TTTGTGTAATTGATTGATCTAAMCTTGGTTGATGAAAAATCCCTGGGACAACAGCAAGA 

ANAGTTGCGTTCCAACCCCCCGAAGTNCTGCTGCTTTCTGAACTGCAAGTATCATATCTC 

TTGCTTTTGATTTTA 

Sequence 2455 

AGGTACTCTTTCTAGTTATTAAAAAGAAAAGTTAACTGTAAAACAGCCTCAGGCAGGTCC 

TTCCGGAGCTATCCAGAAGAAGCCATTGTTATCATAGGAGATGGCAGCTCCATGGGTGTT 

ATTGCCCCTGAAGACCTTCCAGTGGGACAAGATGTGGAGGTGGAAGA CAGTG ACACTGAT 

GATCCTCACCCTGTGTAAGCTTAGGCTACTGTGTGTATTTGTGTCTTAGTTTTTAACAAA 

AAGGTTTAAAMTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 2456 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACACCCTAGATCC 
CAAGATCTCCAAGGATTTGGTGGCATACCCACTCCAGCACACAGAAGCATGAGGTTCATG 
ACTCTCCTNTTCCTGACAGCTCTGGCAGGAGCCCTTGTCTGTGCCTATGATCCAAAAGCC 
CGCTTTTGCCCMGATCGGGGAACCCCTTTGCCATTGAAGCATCATCCAGCTTAAAAAGG 

AAA 

Sequence 2457 

AGGTACGCGGGAAGAAGGGTCAGCTCAGGGTGTTGCCATGGGAATGAAATTCATATTGCC 
TAACC GATTTGATATG AATGTGTGTTCTC G ATTTGTG AAGTCCTTAAATG AAG AAG ATAG 
TAAAAATATTCAAGATCAGGTTAACTCTGACCTGGAGGTGGCATCTGTCCTATTTAAAGC 
TG AATG CAATATCCATACATCTCCTTCTCCG GGAATTCAAGTAAGGCATGTCTACACCCC 
CTCTACAACAAAGCATTTCTCACCCATAAAACAGTCAACCACTTTAACCAACAAACACAG 
AGGAAATGAGGTCTCTACCACACCTCTGTTAGCAAATTCTTTG 
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Sequence 2458 

AGGTACGCGGGGAGGCCATCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCATCCTCAA 
CATTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTGATCAGTGCAGCATCATCAGC 
ATC ATGCTATTACAATC CCAAACC ATG GGGGTTTCTC AC AGCTTTAC ACC AAAG G GC ATC 
ACTATCCCTCAAAGAGAGAAACCTGGACACATGTACCTGCCCG 
Sequence 2459 

C P! 3G ? CAGGTAC I 1 1 M 1 1 1 1 1 * 1 1 1 1 1 1 1 ' 1 " G i m i h i n n 1 1 1 1 1 1 m 1 1 n 
nrrrrrrnrTTTTTi iaaaaaaaaaaaaaaatnaaatttntgaaaangnggnaantgga 

ANTGGTMNNTNAAACCAAAANTNTGGAANTAAAAAMNTTNAAAAAAAATTAAAANTTG 

NGAATTGAAAAAAAAAA 

Sequence 2460 

TCGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAG 
CTCTTTGTGGCTTGGGTGGTCAGCMGMnTTTTMGCAGAGATACTCTTGATTGGGTT 
GGATAAATAAAAGAGCAAGGAGGACATTGTAGGAGGGGAATTGCTCCCAGCAGAGCCTAG 
AAGC AGAGAGATTTTCC AGGG TTCCC AG G AG AAG AGATTGAATAAGTG CAGTGGGCCATG 

TGGTGACACATCTTGAGTGTCCAGCTGGAGAACTGGAAGTGCATATCATAACAGGGGTGG 

AAAACTCAAGAGGTTGATAGGGAATGGTTGAGGTTGCAATGAGGGGACCCGGTGGGGGAC 
TTT 

Sequence 2461 

TTMCGCATTTACTAAACGCAGACGAAMTGGAMGATTAATTGGGAGTGGTAGGATGAA 

ACAATTTGGAGMGATAGAAGTTTGAAGTGGAAAACTGGAAGACAGAAGTACCTnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnNAGGAAATTTCGATATTCAAGNCTTATCTGA 

TACNCGTCGACNCTCTGMGGGGGNGGGGCNCCGGTACCCAGACTTTTGTTCCCTTTAGT 

GGAGGGGTTTATATNAGCCGCCGCTTTGGACGTTATATCATGGGTACANTAGCTTGTATT 

ACANTGGTGGTGANMTTGTNTNATCACGNTTCACCAATTTCTCANCACAAACNATTACN 

NAAGTCCCGNTNAGTCNATTAAAAANTTNGTTNAAAGACCTGGGNGGTTGNNCTTAAATG 

GMGATNGAGGCCTTAAACCTTTCAACTATTTTAA 

Sequence 2462 

TCCGGCGAATTTTTCTCNCNANNGCGGCGGCCGTGTGGCGCANGGAGGACCCCTTACAGT 

AACCACAGCGTNTTTNAATCCGAAAGGGAAAAGACCCCGCGTACCTTnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnGCCCGGTTCCCTANNTATTTGTTTCCCTTTAGTGAGGGGTTAAATTGCGCGC 

TTGGCGTAATANATGGTNCATAAGCATGNATTCCTGTCGTGAAATTGTNATCCGCTTAAC 

AATT TNCACACAAACAN ACNGAGCCTGGG AGC ATAAAAAG NTG ATAAAGCCTN GGGG NGC 

ACTMTGAGTTGAGCTAAACTTCACAATTAAAT 

Sequence 2463 

TCCACCGCGGATGGCTGGCCGCCCGGGCAGGTACTCTTTCCTGGATTTACCATTTATACA 
AAGAAAATTCTTACCCAGCTGGAAGAGGAGGAACAAAACTTTTTACTATCAGGGCACAGG 
CAAAAMTNCAmGTG CAAGAG AGACATATAAGCAAACATGCATTTACAGTGTAGAGGA 
ANNGGGGACCNCAGTAl I I I I I I I CTCCTGGGAAN N CTTACATTTTTAATTAAATN NCNC 
CTTG NAAAATTTTGTAG N ANTANTTAAN AN GCCTTCCNTTG ATTTTCAN ATGC AATTCAA 
(4ACCTCNNGCCAATGNAATATNCAAAACAACNTTTCANGGTATGTTGGGGAATNGTCNCN 
ACGATTATGTANAAGTAAAAAAGTGGCTCCANGTAAAAAA 
Sequence 2464 

CGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCTTAAAAAAAAAACATCA 

GTTNTGGGACATAACAAANTAAATACTATGGAGAAATGGTATCTGGACAGGAACAGAAAT 

GTCCACAACTGCNAGGGATTTTCTTTTACACTTGGCCACAGAGTCGTTAATTGANNCCTA 

CCACTCCTTNAAATATTGGGGATTCTCTTAATTAAGGTT 

Sequence 2465 
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CCGCCCGGGCAGGTACCTTTCTTTGGAATGGGTTTGGTTCGACCGTCATACTTNTTCAAG 

GTGATATANACGCTGCCCGACGTTNCGGCACTTTCTGGAAAAGTNTGGTCAGCTCCGTCA 

GGAACATGCTCNNTTCTNCAACAACACCATNCNCGGNNNGACCCNNNGNNTACTCNTGGG 

CCGCTTTNTAGAAAACATACGNTGTGNATTNCCACCGGGGCCTGNCAAGGAAAATTACGT 

ATNATTNTAAAGGCNTNTATTCNGNATTACCCCGATACGTAACNCTNCTGAAGNGNGNGG 

GGGGCCACACGNGGTATCTCCCMNACATTANTTNTGGATTTCNNCCCTNTTTAAGTTCG 

GMGGGGGNTTTTAAMNTTTNGNCNGNCNGACNTTTTGNGACCGTTNAAMTNTCMTT 

GGGGGTTCNCAATNACGNCCTTTGNTATNTACNNCATCGGTGGGNTGGAAAAAAANTATT 

GNTTTTAATNACACNGGNTtTCCNAACNAAMTTATTNCCCMCTAACCAAMCNTATTT 

AACCCNTAAAGGCCTCNGGGGNGGGAAGGACNAATTAAAAAAAGTTTG 

Sequence 2466 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATTGACCCA 

AAACCTGGATGTCCTATCTGTGTGTTCCAGTAGTGCCCTATTTTAAATGTGTCATGATAT 

CTATTACTTTGTGTGAAAATTACCTGTTTGTI X 1 I I CAGGTTATATAATAGTATAACACT 

GCCMGGAGCGGATTATCTCATCTTCATCCTGTMTTCCAGTGTTTGTCACGTGGTTGTT 

GMTAAATGAATAAAGAATGAGAAAACCAGAAGCTCTGATACATAATCATAATGATAATT 

ATTTCAATGCACAACTACGGGTGGTGCTGMCTAGMTCTATATTTTCTGAAACTGGCTC 

CTCTAGGATCTACTMTGATTTAAATCTAAAAGATGAAGTTAGTAAAGCATCAGNAAAAA 

AAAGGTAAAACAAAATTGCTCC 

Sequence 2467 

CGCGGTGGCGGCCGAGGTACGCGGGGGCAAACGCCGGGAGTAGCCGAAGGGGACTGCCGG 

GAACAGGAATTTCTTCACATGGCTCCTGGAGAAGTGACCATCA 

Sequence 2468 

CCGGGCAGGTACCCGGGCGTGTGAGGTGTNAGTCTGCCCCTACTTGGGGGTGCCTCCCAG 

TTAGGCTACTNGAGGGTCTGGGACCCACTTGAGGAGGCAGTTCTGTCCGTTCTCAGATCT 

CCAGCTTGCGTGCTGGGTGCNTGGGGAGAAACCACTACTNTCCCCGCCGTTACCTNAGGC 

CCGCTTCTTAAGAMCCTANTTGGNATTCCNCCTCGGGTCTTGTCATGGTAATTTCCNAT 

TATTCAAAAGGCTATATTCNGTATAACNCCGTGCTGACNCTTNGNAGGGGNGGGGGGGNC 

ACTCCGGGNTACCCCCAAGNC I I ("I rNTGGTTTCCCACTTTTAANTTGNAGNGGGGTTTA 

AAATTAGTCGACCGCCCTTTGGNCNGTTANAANTCAATTGGNGTTCAATAAAGGCCTGGA 

TmCCNTTGTTGGTNGGAAAAAANNTNGTTNTATTNTCCGNCATTCAACCANANTTm 

NTCAACCACCAAAACCNATTANCCGTAAGGCCTCCNGGNGNNAAGTCCAATTAAAAAAAT 

NNTTGGNTAAAANAAGGCCNCNTGGGGGG 

Sequence 2469 

GGCCGCCCGGGCAGGTACGCGGGGANTTNTCTACTGGCGAAACCTGTATCCGGGCCCAAC 

CTGAAAACATCCCAGCCAAGAACTGGTATAGGAGCTCCAAGGACAAGAAACACGTCTGGC 

TAGGAGAAACTATCAATGCTGGCAGCCAGGTTANGAATATAAATGTANNAAGGGGAGTAG 

ACTTTCNCAAGGGGAAAAATGGCTTACNCCAAANCTTGCCTTNCATAGNCGCNCTGCTGG 

GCCAACTTATGCCCTCNTCAGAAAACAATCAACNCTTACTCACATGCAAANGAANCGGGN 

CATTNGGCAATTAACAATGNGGATTGGATGGTAGTACTTGGGCAAAACNCATGAAAAAAA 

AAGGGCCTNGNACAAATATNCTTAACCAAGGGGGNCCTTCNCTTAAATAGGAATTGNTAT 

NGGAAAACCATTNGGNTTGGCNTTGGAAGGGGGGCCAAAACCCAAGGCNAAGTGGGNNTT 

CCACCTTTTTANCCAACCCTNGGTGNTCCNTNTTGGGTAANGNNAAATGGGGGCNNTNGG 

NCTTCCCTNNAAAAAA 

Sequence 2470 

AGGTACl I M 1 I I I I I I I M M I M I I I I I 1 I CTAAAAGCAAGGNGATCTGTGATATTTT 
CGTCCTTTTGT ATG ATTCTCT ACTTCTTATTC AAG ACCTGTG ATTAAGTTG N GTGTATAT 
TGTCTGTAAAATGAAACAAAAGTAGGTCCTAAACATTGTAAGAATATGGTATACTAAAAA 
GCTTTGAAACCAAAAGAAAATTTAAAAAAACAACAACAACAACAATGTAGAGTACCTCAT 
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A CCTAA CAGACACTATAGAAMGCTGATA^ 

ATTTTTCACAAAMTAGATTCCTTAAATAANACATGTTGTCTAACAGGTAmACA 
Sequence 2471 

CCGGGCAGGTACAAACAGGTGCTATCATAATTACCTCTGGGCAACAGGCAAAAATCAAGA 
TTGTCCTTAGTAAAC C AGGATTC ATACC AGGTCATCTG ATGTTTAGTATAG CACTCACCC 
GACTCTGCCTTACATTGCTTATTTAAMTGTCTGCCTCCCCTTCTAGGTTTTATAAAAGT 
TCTTGGCTCACAGTA ACTCTT AGTATMGTTTCTGAMTAAACACTTTGTTATCACTTTT 
GMTTGATGGCTTACATTTTTGTTTTGGCATT^ 

TGCATTTCAACTCATCTGCAAGC7TATATCCCTCAAAGTTTGCTGCTGGGC 
Sequence 2472 

AGGTAC TTTI I I U I M I I II I I I ) I I II I 1 1 ACATGTATCMTTTATTTACTTTTGGTT 

TTGCTC ATTTTT ACTGAAMTATGATTATCTATC 

AACTTCAT7TTTACATAGTGTAGTCCAGCT 

ATGATATTTTCACTGGATACAGAAACCTGTCTTCTGCCTCGCTCACGCTGGGAGCTGTAN 
ACTGGAGCTGTTCCTATTCGGCCATCTTGGCTCCTCCCCACTAATGCTGTTCTTCTAGAC 
ATGTTTGAAATAGGGGCATTGCTATAATGAATGGCTAGCAAGTAGGACAA 
Sequence 2473 

ATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGGCAGGTACTCTACATACCTGTATCTNATT 

TCATCCTAACAATAGCCACACAAGCTAGACATAATTATCCCATTTTCCAGGAGAAACTGA 

AGTTCANAGCATTTAGTAACTTCCTCAAATTTATACACCTNAATAAGTGGTAGAGCCAGA 
ATTTG 

Sequence 2474 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGAGGTGTGCTG 

TGTGTGAGGCGCCTGCCATGGTGATGGCCGTGCACAGCCAGACCATTCAGATCCCACCGT 

GCCCCAGCGGGTGGTCCTCGCTGTGGATCGGCTACTCTTTTGTGATGCACACCAGCGCTG 

GTGCAGAAGGCTCTGGCCAAGCCCTGGCGTCCCCCGGCTCCTGCCCGGAGGAGTTTAGAA 

GTGCGCCATTCATCGAGTGTCACGGCCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnn 

nnnnnnnCGATATCAAGCTTATTCGATACCCGTCCGACCCTCGAGGGGGGGGGCCCCGGT 

ACCCCAGCNTTTTGGNTCCCTTTAANTGAAGGG 

Sequence 2475 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 

AAATGGTAGCTGAGACCAATGATGCTGACCTCCCTCAAMGCTGCATTTCTGAATTTCTG 

AAGGCAAACTGTCTGCCTATATTGTACAGCAAMTMTTTCCTGTATTAAMCGTTTTTA 

TTGCTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 2476 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGTAGGATGGGAAGCCGTG 
GGGCAAGGGAGGTTGCAGGAAGCCCATCCTTCCCCTCCTGCGGCAGGTACCT 
Sequence 2477 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGCAGGT 

AGGCATTTAAATTGGTGCAACCGC1TTGGATTGCTGCTTTGTAGTATCTGGTGCAACTGA 

AGATGAACATGCCCTGTGACACAGCAACCGCACTTCTAGGTCAATACCCTAATTATATTC 

TTACTGT0GTTCACAAGAA3GTATGTAAGAGGTCATTGCCTGAGCACTGTTTAGAATAGG 

GGCAAACTGGAAATCCTCTAAATGTCTGTCAATGAAGGAATAGATAAATTGTAATATGTT 

CATATAAAATACTGCATAAATAAGTGAAATTTATAAATGTACCTCGGCCGCTCTAGAACT 
AGN 

Sequence 2478 

ATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACT 
GCTGGTTCGGGCTGCCATAAAAAGCACCACAGACGACATGGCTTACACCACAGAAACTCA 
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TGTTCTCCCAATCCTGGMGCTGAAAGTCCAAGATCAAAGTGTTGTCTGGGTTGGTTTCT 

TCCMGACCTCTACTGTTGGCTTACAGATGATTGTATTCAGTTCACATGATGTTCTTCTA 

TGAGTGCCTGTGTCGATTTTCTGTTCTTATAAAGATATCAGTCAGATTAAATGATGGTTC 

ACCCAAATGACTTCATTTCACCTTAATTACTTCTCTAAAGGCTCTATCTCCAAATACGGT 

CACATTGTGAGTTTCTCAAGGGTTAGGACTTCAACATGTATTATTTGAGGGAGTGGAACA 

CAATTCAACCCCATT 

Sequence 2479 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I II 1 I I t 1 I I I I I I I 1 Ml II M GI Ml HI I 

1 1 ii 1 1 ii 1 1 n 1 1 1 1 1 n 1 1 1 1 m m 1 1 1 1 m n n i m i ii 1 1 1 1 1 1 1 1 1 1 1 1 n n 

rTTTTTNNTTTTTTTl I NNNTNNTN M I M 1 1 I 1 1 I I 1 CCCCANNANMNCCCNTNNTNNN 

NNAAAAAAAGAACGTGAGGACAGGGCTCCTGCTGCGCAGGCATAAAGCATCCAAGAGTCT 

GCACATACATGCCACACACTAAAAAAAAAAAAAAANTNNAGCNCNANNNNAANTCTGGGG 

GGGGGGGGAAAACTTAAAAAAAAAAAAAANNTTTTNCNAAAANANAAA 

AAAAAAA 

Sequence 2480 

CCGGGCAGGTAC 1 U I I I 1 1 H I 1 11 I 1 1 II 11 11 M GI 1 i 11 1 M 1 1 NTTTNNTTNANT 

NANTNNTTTNNTTAMTNATNAAGGMNAANNAAAAAAANAANTNAAAAAAAAAAA 

AAAANNAAAAAAMTTNNAAAAAAAANTAAAAAAAAAAAAT^ 

AAAAAAAAAAAATAAATAAAAAAAAN NTAAAAAAAAAAAAAAAAAAAN AANTG AAATN N A 
TNAAAAAAANNTAATANAAAAAAAAAAACAAATTNAA 

AATATTTTTN AAAATTAAAAAAANCGAAAAN NCTTTTCCAAAAAANAAG N ATAAAAAAAT 

CCAAANTNTTNAAANNANNAAANNGGTTNTTTAA 

Sequence 2481 

CCGCGGTGGCGGCCGAGGTACATCCTGGATGGATGAGAATGGGCACCATATTTTTTATGA 

ATATATATTTTTC I NUN I GTTTTCTACAGCACCAAAGAAATTCAAATAGGAAAAGGA 

GAGTTGAGAATTGGGAATCAAGAATCAGCCCTGTTTCCATCTTAGCCACACCAACTTATA 

TCTTTATGATTTTCAMGCTTTTGCCATGTGATTCTGCCCCCACAMGGCATCGGTATTT 

CCTAAATGGTACCTGCCCGGGGCGGCNCGCTCGAAATACCGAGCCCGGGGAGCATTAAAA 

GTGTAAAAGCCCTGGGGGGTGCCCTTAATGGAGGTNGAGCCTAACCTCACATTTAMTTT 

GGCGTTGCGGCTCACNTGCCCGCTTTTCCAAGTTCGGGGAAACCT 

Sequence 2482 

AGGTACGCGGGACTGTTATTCTCTCCAAAGCTTACCCAGCAATAGGAACTCCCATACCAT 

iTGATAAMTTTTGTATAACAGGCMCAGCATTATGACCCAAGGACTGGAATCTTTACTT 

CiTCAGATACCAGGAATATACTATTTTTCATACCACGTGCATGTGAAAGGGACTCATGTTT 

GGGTAGGCCTGTATAAGAATGGCACCCCTGTAATGTACACCTATGATGAATACACCAAAG 

^CTACCTGGGATCAGGCCTTCAGGGGAGTGCCCATCATTCGATCTCACAGGAAAAATGAC 

CCAGGGTGTGGGCTCCAGCTTTCCCAATGCCCGAGTCAAAATGGGCCTATTACTTCCTCT 

TGGAGNTATGGTCCCACTTCCTC 

Sequence 2483 

CCCGNGGTGGCGGCCGCCCGGGCAGGTACCGTTCCTCCAGTGCGCAGAGATGCTCTCCGC 
ACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAGAGGGGTGGTNTCGGTTATT 
CAGG ACTTTTTTTTTTTCTTAAATATCCTG NGCTTNTTTC AATCATTTG AAG G TAAAACC 
AGGTCCTGNGANTGGTAAACTGATTTTTGGTTCT 
Sequence 2484 

AGGTACGCGGGTGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCC 

GACGGCTCCTTCTTCCTCTATAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGG 

AACCGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAG 

CCTCTCCCTGTCCCCGGGTAAATGAGTGCCGACGGCnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnCTGCAGGAAATTCGATATCAAGCCTTATTCGATACCCGTCGACCTCGAGGGGGGG 
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GGCCCCCGGGTACCCCAAGCTTTTGTTCCCTTNTAGTGGAGGGGTTAAATTGGCGCCGCT 

TGGGCCGTA 

Sequence 2485 

GAGAGGCGGTTTGCCGTATTGGGGCCGCTCTTCCCGCTTCCTCGGCTCACTTGACTCGCT 

GCGCCTCGGTTCGNTTCGGCTGCCGGGCGAAGGCGGTAATTCAGCTCACTCAAAAGGGCG 

GGT 

Sequence 2486 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATAATTAACAATTTG 
TGGAMTTGAAAMGCATAAACTGTGTTATTTGATTAGTAATATGTTCCCTTAAM 
TTTTGAGGTGTATGTTATACACAC AGTAAA 1 Ml I GTTCAGGAATGACTTGCTCATTCTG 
TGTTTTTAAAMTAGGAAATGAGGCATAGTGAGTCATCATO 

TATTTCATCCCCTCCGTCACTGAAATTATCTACTTCAGCCACCTTTCTTATTCTCCGTGT 
TAGGGAGGGCCACGTTTATGGGACTTNTTTAATTTTCCATGTGCCCATTATTTGTCCCAC 
TTACCCGGGCAGTTAGCCCAAAAGGCTAGGCCTGTTTCAGTTCCCACAGGA 
Sequence 2487 

TATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGG 

TGCGATTCGCATATTCCGCGAGATCATCAAGCCAGCAGAGAAATCCCTCCATGAAAAGTT 

AAAACAAGATAAGCGCTTTAGCACCTTCCTCAGCCTACTTGAAGCTGCAGACTTGAAAGA 

GCTCCTGACACAACCTGGAGACTGGACATTATTTGTGCCMCCMTGATGCTTTTAAGGG 

AATGACTAGTGAAGAAAAAGAAATTCTGATACGGGACAAAAAATGCTCTTCAAAAACATC 

ATTCTTTATCACCCTGACACCAGGGAGTTTTCATTGGGAAMGGGATTTTGGAACCCTGG 

GTGGTTTACCTAACA I I I I I AAAAGGACCCACACMGGGAAGCCAAAMTTCTTTTCTTG 

AAAGGAAGGTAAAA 

Sequence 2488 

CCGCGGTGGCGGCCGAGGTACGAAAGAGAGACAAAAGGGTTCTCTTGGAAACAAGAAGAG 

TGACTCCAGATGTGGCCTGAATAATTGCCATGTTAAGTTAATGCAAAAGATCAGAACAGG 

GCTACATTTGCACAGGCAGTTTCTCTCCGGGCCGTAGTTTTCACTGATGATCACCTTTCA 

CAGCATTTTCCCCAACCAGCATTTCACTTAGTCTTCTCTATACCCAGCACCTCCCCCGGC 

ACCCCCGGCAAGCCCACTTATCACTTCCCGACTTTCCAACGTGGCATTCCCGTGGAGGAT 

CCTGTTCCACATTAGGGCGAAAGCAGGGAGAAACACCTGGNGNAGCCAGCCAGGGATGGG 

GTTTTGGGAAAGGAGCCATGCCCTCTGGG 

Sequence 2489 

CCGGGCAGGTACGCGGGAGCAGAAATGATTGCACTATTGATAAATTCCGAAGGAAAAATT 

GTCCATCTTGTCGTCTTCGGAAATGTTATGAAGCAGGGATGACTCTGGGAGCCCGGAAGC 

TGAAGAAACTTGGTAATCTGAAACTACAGGAGGAAGGAGAGGCTTCCAGCACCACCAGCC 

CCACTGAGGAGACAACCCAGAAGCTGACAGTGTCACACATTGAAGGCT/vTGAATGTCAGC 

CCATCTTTCTGAAATGTCCTGGGAAAGCCCATTTGAAGCCAAGGTGTA/XGATGTGNTGCC 

TGGACACCGACAANCAACCCAGGCCCCGACTTCCTTTGGCAGGCCTTTGGCTTCTCTAGC 

CCTCACTGGAACTGGGGGAGNAGAAGA 

Sequence 2490 

CCGGGCAGGTAC M M M M M M M M 1 M i M M I I GGGTTTCTGGTAACTTCAAAGC 

TCCCTAAGTCACTAACCTTTCTTCGCCCCTCCTCCACTAAGGGGCTGTCAGGAAGCATCA 

ATTTCANAAAATCTTCTTTAGACAGAACATATTGAGTCTNTGCAACAGCATCTCCCACCA 

CTGTCTGATGCTGANTTGCTGTGGACATGACGCCCAACTCACTATACATCCCGGGACATT 

CAGTCACTTNAGGCTCCTCGGGNCGGNGGGGCCTGCCCTCNCGATTTCTTCCAGC 

Sequence 2491 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATAGCTCAAACTCAGAGTCA 

TCGTGCTCCCAATTCCAAAGAGATTCCTAAAAGAGGCAACTT 

Sequence 2492 
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ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTGGATACCCCA 

GAATGACCTTC TAGGTCATCCAAAAAC CAGAGCTTTTATAACTCATGGT GGAGCC AATGG 

CATCTATGAGGCAATCTACCATGGGATCCCTATGGTGGGCATTCCATTGTTTTTTGATCA 

ACCTGATAACATTGCTCACATGAAGGCCAAGGGAGCAGCTGTTAGATTGGACTTCAACAC 

AATGTCGAGTACCTGCCCGGGCGGCGCGCTCTAGAACTAGTGGGGATCCCCCGGGCCTGC 

AGGAAATTCCGATATCAAGGC7TATCGATACCCGTCCGACCCTNCGAGGGGGGGGCCCCG 

GGTACCCCAGC 1 1 1 1 I GTTTCCCTTTTAAGTGGAGGGGTTTAAATTGCCGCCGCCTTGGC 

CGTTAAANGATGGGTCATT 

Sequence 2493 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGTCTTCACTGCTCTGCGTCCTGTGCTGATAAA 

GGCTCGCCGCTGTGACCCTGTTACCTGCAAGAACTTGGAGGTTCACAGCTAAGACGCCAG 

GACCCCCTGGAAGCCTAGAAATGGGACCACTGACATTTAGGGATGTGAAAATAGAATTCT 

CTCTAGAGGAATGGCAATGCTTGGACACTGCGCAGCGGAATTTATATAGAGATGTGATGT 

TAGAGAACTACAGAAACCTGGTCTTCCTTGGTATTGCTGTCTCTAAGCCCAGGACCTGGA 

TCACCCTGGCCTGGGAGGCAAGGGAAAGGGAGCCCCTGGGAATCCTGGAAGNAGACATGG 

AGATGGGTTNGACAAAACCCCCAGT 

Sequence 2494 

CCGCGGTGGCGGCCGAGGTACCCCCAGTGATGGGACATGGTCGCGCTGGCGGTCGGGGCA 

GGGCAGGAATCGGCACCGATCTGGCGGCGGCGGCTTGGGCGCCTGCACTGTGTGTCCCCG 

CGTACGCGGGGAATTCAGAAATCACTACAAGCAGCATTAAGACTGAAGTTGGAATATTCT 

GTTGACCATAAAACCTTGATATCATTCTGTGTATATAGAATGTAAAAGGAATATTACAGT 

GTTAACTGCCATATATGTAATATACACAAACTCAATTAGCATTGTAAATGGCCCAAAATG 

GCATTCCCCCATGCTTTTTCTGGTTTTTC 

TCTTTATCCCCAAACAGGCTGCCCTAATTTTTAAGGGAGTTCTG 

Sequence 2495 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACCAGTCTT 

GAGAGGTGGCCAGCCAGACTGCCTGTCCACATGCGTGTCAGCACATACAGCCGCTTCCTG 

GMGCCGCCTGGAATGTCTTCACGGCAGCGTTTTGCTCACACAGCANGCTTTTGCACGCC 

CCAGGCAGCCCCGACTGCTGAAATCCAACTTGAGCTGGCTGGTGGTCCCTGGATCCTAGA 

GCCCTTCACTTCGGGTTACTCCCTCTTTCTTGCCTCTATTTCTTAGTTGGAAGAAAATAA 

ACTCACAAATTATTGGTGCAGTAATTTTTTCCCGGGGGAAAAGGTAAAAGGCCTCAGGGA 

ATGCCCCACCGCCCTTTTCTTTCCAMGGCCTTTGTCTCTGGAGACCCTCTTAAGGTTC 

Sequence 2496 

CTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

rrrnrm iah i i n 1 1 u 1 1 n in 1 1 n 1 1 1 1 i ctttgggcaacactttattggga 

AANATTTACNCNCGGNGACCTGTCNTAGGCCAAGCGATNAAAAAAGGGCCCCAGGANCNC 

TGGGGTCCCGAGGNGGCTCAAATGGAANCCATGGGA 

Sequence 2497 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTTGAGAAGCCAGCGCTCACCC 
ACCCGGGGTCTCTGTGCATTGACCTTTGGGTGCTGACTTGGAGAAAAGCACAAACACGAC 
CAGTCCCATCCTGGCTCCCGTGGGGCTTCTTCTATCTACGCATTGTATCGACTGCATTAG 
TTGG ACTAAGATGATGACTCAGTTAAAG GAG GAGACAAATGCTGACTGTCTAAGCAAGAA 
TGGCCCAAGCTGGCAAGAA'sAAGCACACTTGCATACATCCCGCGTACCTCGGCCCGCTCT 
AGMCTAGTGGGATCCCCCGGGCCTGCAGGNAATTTCGATATC AAGNCT TATTCGGATTA 
CCCGTCCGACCTCGNAGGGGGGGGGGCCCCGGTACCCCAAGCTTTTTGTTCCC 
Sequence 2498 

CCGGGCAGGTACTGAGTCAAGGACGTCTTTAACGTCATGGACGGCTCCT TTACCCA CGCT 
TTCTAGATCTTCGACTGCATCTTTTCCTAGTTTTCCGAGTCCCCCCACAGCl n i I nGC 
TCCGTCTAGGCCTTTTTCCAGAAGGCTGGATCTCTGCTTCCTTGGCTTTGGTGCCTGTCT 
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GGCTAACCCTGGGTCTTCACCTGCATTTTCCTTTTGAGCTGCTGATGCTTCATGGCAAGG 

GTTCCCCCGATCCTGGGGCAAGCCCCGCGTAACCCTCGGCNCGCTCTAGNAACCTAGGTG 

GGATTCCCCCGGGCCTGCAAGGGAATTTCGGATATCAAAGCTTATCGGATACCCGTCCGA 

CCTTCGGAGGGGGGGGGCCCCCGGT 

Sequence 2499 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGG 

ACTCAGAAGCTTGGACCGCATCCTAGCCGCCGACTCACACAAGGCAGGTGGGTGAGGAAA 

TCCAGAGTTGCCATGGAGAAAATTCCAGTGTCAGCATTCTTGCTCCTTGTGGCCCTCTCC 

TACACTCTGGCCAGAGATACCACAGTCAAACCTGGAGCCAAAAAGGACACAAAGGACTCT 

CGACCCAAACTGCCCCAGACCCTCTCCAGAGGTTGGGGTGACCAACTCATCTGGACTCAG 

GACATATTGAAGGAAGCTCTATATAAAATGCCAAGACAAGGCAACANAACCCTTGGATGA 

ATTATTTCATCACTTTGGAATGAAGTGCCCCACCACAGTCMGCCTTTTAAAGGAAAAGG 

TGGTTTGCT 

Sequence 2500 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 1 I I II I I I I I I I 

n I I 111 I II I 1 I 1 ATNGCAAGGACTGAGGCTGGAGCTCCACTTAAGTGGTTTGATCCAA 

ACTGGAAAAGCAACTTCC AG CAAGGMCATCCAN ATAAACCCTTTG CGGAACAAAANAG N 

GGGCCCCCTCCTGACCACAAAGATGGAACCCGCTCTCCTTGATACAATCCACGAATTTAC 

ATCTTTTAATCGGCTGTCCACTCAGGCCTAGGGCGTAGTCAGAAAATGACACCCTCCACA 

CATTGTGGTAAGGCAGNAAATTAAGCCTCAAACCCACCGGGTGACCCTATGTAGGTTCCT 

GGGGTTTTGACCTGGGGGAGCCACCTTNTCAAACTTCCTTATACCTTCCCGACCGAAAAC 

TCACTGGGAGCT 

Sequence 2501 

ACTGTNNCCCGACCCTGCCCGCTTACCGGGATACCTGGTCNCGCCCTTTCTCCCTTCGGG 
GAAGGCGTGGGCGCCTTTCTTCATTAANCTCACCGCTGTAGGGTATTCTCAGTTCG 
Sequence 2502 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACCATGTGGGACTCCAG 

GCACCATCTGTTCTCCCCAGGGACCTGCTGACTTGAATGCCAGCCCTTACTCCTCTGTGT 

TGCTTTGGGCCACCTGGGGCTGCACCCCCTGCCCTTTCTCTGCCCCATCCCTACCCTAGC 

CTTGCTCTCAGCCACCTTGATAGTCACTGGGCTCCCTGTGACTTCTGACCCTGACACCCC 

TCCCTTGGATTCTGCCTGGGCTGGAGTCTAGGGCCTGGGGCTACATTTGGCTNTCTGTAC 

CTGCCCGGGNCGGCCGCTCGACGGGTTATCCACAGNAATCAGGGGGGATAACCGCCAGGG 

AAAGGAACATGGTGAAGCCAAAAAGGGCCAGGCAAAAAGGGCCAAGGGAACCCGTAAANA 

AG 

Sequence 2503 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATGATCCAAACACCAAAAGCTGTGCAAGATT 

CTGGTATGGAGGTTGTGGTGGAAACGAAAACAAATTTGGATCACAGAAAGAATGTGAAAA 

GGTTTGCGCTCCTGTGCTCGCCAAACCCGGAGTCATCAGTGTGATGGGAACCTAAGCGTG 

GGTGGCCAACATCATATACCTCTTGAAGAAGAAGGAGTCAGCCATCGCCAACTTGTCTCT 

GTAGNAAGCTCCGGGTGTAGATTCCCTTGCACTGTATCATTTTCATGCTTTGGATNTTAC 

ACCTCGGAACTCGGGGAGGGGAACAATCCTGCCTGCATGGACCCTATTCAAGTTATGGGT 

GGCTAATGTGNTCTGGTGGGACC 

Sequence 2504 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC1 II I 1 1 i II I 

rTTTTTTTTTTTTTn I CTGCCCAACGAAATTATTTATTGCTGTCAGGAGGCCCAGGACC 

CTCCANACTGAANAGTCCATGGGCCCCTGTGCATGTGANAGCTACACAGGGAATCGCTGT 

CACCTTCCAAGGATAAGCAGACAGGGCCACACATGCCATGGCCACTCGGNGCACAGGGGC 

CAGGCTGNGCCCAGAGACCCAGGCATCCAGCTTGCATTCCAAGTTCCTTACTGAGAGCCC 

CACTGCTTGATGGCAAGNCTCTAGGCATTGCTTGGGGATTTGGGTGACTTTTGCCTCATT 
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GAAANAAANAATGTTCTGGGGNGGTCTGTAAGGGGACAATTGGATCATTCAAGGGAAGGG 
GCC 

Sequence 2505 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGAAGTGTCCTG 
GAATGGGGCCCATGAGATGGTTGTCTGAGAGAGAGCTTCTTGTCCTACATTCGGCGGGTA 
TGGTCTTGGCCTATGCCTTATGGG 
Sequence 2506 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCTGACACTGCAGTGAAGGGCATG 
CTAAGTCTAGGCACAGGTCCTGGCAGCAGGAAGGAGACAGAGCCTCTCCCAGGCACACAT 

Sequence 2507 

ACTACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCACCAA 

AGACCAATAATAAAAAAGAAGGTTCCAGGGAGGGCATGACCTCTGAAATTCCCCATGATC 

TTTGGGTGTTTAAATCCCATTTTCCCTTCTGAGTCATTGGTCATCCTTGGTCTCAGTAGA 

TCAGGCACGTTAGAACTGTGGCAGAAGAGATAAGAATGAAGCAGCCAAAATTGAGCCTTT 

GCTTCTTAGGTAATGCTGAAMGTTTCACACTTTTACCTCTGCCTGAGTCCCCCGCCGAA 

TCATTGCGTTACCTCGGCCCGCTCTAAGNAACTAGTGGGATCCCCCCGGGCCTGCAGGGA 

ATTTCCGATATTCAMGCTTTATTCGGATACCCGTTCGGACCTTCGTAGGGGGGGGGCCC 

CCGGGTACCC 

Sequence 2508 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGAAACAT 

GCATMGTGCCAATCCTTTGAATGTTCCACGGAAACACTGGTGGACAGATTCTAGTGCTG 

AGMGAAACACGTTTGGTTTGGAGAGTCCATGGATGGTGGTTTTCAGTTTAGCTACGGCA 

ATCCTGAACTTCCTGAAGATGTCCTTGATGTGCANCTGGCATTCCTTCGACTTCTCTCCA 

GCCGAGCTTCCCAGAACATCACATATCACTGCAAAAATAGCCATTGCATACATGGGATCA 

GGCCAGTGGAAATGNTAAAGGAAGGCCCCTGNAAGCTGATGGGGGTCAAAATGGAAGGTG 

AAATTCAAGGGCTGGAAGGGAAAATAG NCAAAATTC ACCCTAC 

Sequence 2509 

CCGCGGTGGCGGCCGAGGTAC 1 1 1 I I I I I I M I 1 1 1 M I I II I I I I i I CTAAACTTCTCT 

TCTCACTTCATTTCATTCATTTGATCTTCAATCACTGATACCCTTTCTTCC AGTT GATCG 

AATCGGCTACTGAGGCTTGTGCATTCGTCACATAGTTCTCGTGCCTTGGTTTTC AGCTC C 

ATCAGGTCCTTTMGGACTTCTCTGCATTGGTTATTCTAGTTAGCCATTCATCTAATTTT 

TTTTCAAGGTTTTTMCTTCTTTOCCATGGGTTCNMCTTCCTCCn 

GTTTG ATN CGTNCTGAAGCCCTTNTTTTTCTCAACTTCACCC AAAAG GTC ATTTCTCCAT 

CAGCNTTTGTTTCTGNATGCCTGGNNGMGGGAAGCCTTGCCGTTrCCTT 

Sequence 2510 

CACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGTGTCCTCAGATCTCAGGC 

TGCTCAGCTCCATGTAGGCTGTGTCTGTAGATGTGTCCTCGGTCATGGTGACTCTGCCCT 

GGAACTTCTGTGCGTTGATTGTTTCACCATCTTCAGGATCAAAACCTCCCTTCCACTCAA 

GCCCTTTTCCAGGAGCCTGTCGCACCCAGTGCATGGATAATTCAGTGAGGGTGTATCCGG 

AAACCTTGCAGGAGACCTTCACTGAGGCCCCAGGCTTCTTCACOTCAGCCCCAGACTGTA 

CCTCGGCCGCTCTAGAACTAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTT 

ATCGATACCCGTCCGACCTTCGAGGGGGGGGCCCGGGTACCCMCTTTTGTTTCCCTTTA 

Sequence 2511 

CCGGGCAGGTACCAAGGCTTAGAGGTCAGACTGAGGAAACAAATGCTTCACAGAGTCCTC 
AACCCATGACCATGAGTTCTTTAATAAGTAAGTGTGTGTGTGTGTGTAT GTAAAA TATGT 
GAGGCCCTCTAACGCCTCTTTTCTACGGTAGAACAAAGAGAAACTTCAGTTTTTGCCATC 
AATTTATTAAAATAAACATGTATAGCAGGTTTCMCAATTGTCTTGTAGTTTGTAGTAAA 
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AAGACATMGAMGAGAAGGTGTGGTTTGCAGCAATCCGTAGTTGGTTTCTCACCATACC 
CTGCAGTTCTGTGAGCCAAAGGTCTTGCAGAAAGTTAAAATAAATCACAAAGACTGCTTG 
TCATTATATTAATTGCTAGGAAAG 
Sequence 2512 

GTGGGGCGGGCCCGCCCGGGGGCCAGGGTTANCCAATTGGGGTGGGGCCCAAATTGGAAT 

TTTGGAATGGGGTTAAMGGGGGMGGGGGMTCCNGTTTTGMCCCTCCGGTTCTGGTT 

TAATTGGTAAAAAAAGGGAATTGCCCGNTAAGGGGGAATTGGGGGAAGGGGGCCGGAATT 

GGAAGGGMCCTTAATGGMTTGAATTGGGCCGGGGGTTANGGGGAATTAGGTTTCCAAG 

AAACCNGG I Mil ICTTAAI 111! I CCCTTGGAAGGCCCGGTTCCTTGGAANGNAATTGT 

TTAAGGTTNAT7TTAAGGTTTTNAAG I Mil I NGGTTTGGTGGAAAGGTTGGTTTTAAAN 

GGGAAAAAAAAGGGGGGCCMTTMCCAAAGGGGGAACTTAANGGGGNAAAAGGCCCAAG 

NAATTAAAAGGGGGGAAAAAAAAATTGGAAATTTTAATTTGGAAAGGGGG 

GGGMTTCCAANTTGGAAAAAMGGNGGTTTGGNAATTAAAAAAGGGCCCTTCCl I I t I C 

CTTMTTTGGAAANTAAGGGGGGGGGGGGAAAAMGGNTTAAMGCCCGGTTTCCTTT^ 

NGGTTTAAAGGAAMCCCCCCTTTAACCCTTTTTGGGCCGGGCCCTTTGNCCCM 

TTTNGGCCCCCCAATTTTTTAAAAAAGGGAAATTTMTT^ 

TTTTTTAAAGGGCCCCCTTTAANTTAAAAAN l I I I I I I I AAAAAACCCTTTTTTTN GG AA 
CCCAAAAAAAAAGNTTTTTTMTTTTGGNAAAAAAMTTGGGGGGGG U 1 1 I I I I I i IC 
CCCAAAAATTTAACCCCCTTTTT 
Sequence 2513 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACCTGTCCTAGC 

AGTGGCAATCTTTTGGGGTCCCCTAAAATAAAGAAAGGCTTATCTCCACACTGTAATGGT 

AGTGATTGTGGATATTCATCTAGCATGGAAGGGAGTGAAACAGGTTCTCGGGAGGGTTCG 

GATGTTGCCTGCACTGAAGGCATTTGTAATCATGATGAACACGGTGATGACTCTTGTGTT 

CATCACTGTGAAGACAAAGAGGATGATGGTGATAGTTGTGTTGAATGTTGGGCAAATTCT 

GAAGAGAACGACACAAMGGAAAAAATAAAAAGAAGAAGAAGAAAAACAAGATACTGAAA 

TGTGATGAACATATCCAGAAGCTTGGAAGCTGTATTACAGATCCAGGTAATCGAGAGACC 

TCAGGAAATACCCATG 

Sequence 2514 

CCGTAATACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAG 

GTACTGCGGCGGGTAGGCCTAGGATTGTGGGGGCAATGAATGMGCGAACAGATTTTCGT 

TCATTTTGGTTCTCAGGGTTTGTTATAA I till I ATTTTTATGGGCTTTGGTGAGGGAGG 

TAGGTGGTAAGTTTGTGTTTAATA I 1 1 1 1 AGTTGGGTGATGAGGAATAAGTGTAAGGAGT 

ATGGGGGGTMTTATGGTGGGCCATACCGGTMGTATTTAGTTGGGGCATTTCACTGTAA 

MGAGGNTGTTGGTTCTCTTMTNTTTAACTTTAAAAA 

Sequence 2515 

CTAACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGATC 

CTTAMGAAGGTGTGGGGTCTTTCCCMCCTGAGGATTTCTGAAAGGTTCACAGGTTCAA 

TATTTMTGCTTCAGAMGCATTGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAG 

AAAAACTTAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTG 

ACAAGGGAAAACCTTCAMGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACCT 

GCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCGTCGACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTAG 

TGAGGGGTTAATTGCGCGCTTGGCCGTAA 

Sequence 2516 

TCTGGGTTGGGGATTATTTCTGGGTTTCTACTTCCTGTTNGAAGAATGTTGGCATGGAAN 
AGTGGTAAGTTTGAAGAAGATGAGTGCCGGGGGCCTTCATCTAATCCCTNGGAAAATTGG 
TCTTTTTCCCCACCMTTCCCCTTGGACACCAAGAAATTANTGGAAGCCCANTACAAGGG 
AANTTCTTGNAAAGNAAAATNGGGGGTNCTTCTTTGGCCCACCCTTCCCCAAGTTAAAAA 
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AGGAATTNAI 11 I I I I J I I II I AAAAAAAAAAAAAAAGGGGTAAAAAATNCTTAAAAACC ' 
TTTGGGNGGCCTTTTTNAATCCTTNGNATAATTGGAAAAAAAANTAACCAGGNAACCAGG 
CCCAMTTAAGGAAAAAAAAAAAT7TTTAAAAAAAAAATTTNAAACCTT 
Sequence 2517 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACTGGATGTCAGGTCTGCGAAACTTCTTAGATT 

TTGACCTCAGTCCATAAACCACACTATCACCTCGGCCATCATATGTGTCTACTGTGGGGA 

CAACTGGAGTGAAAATTTCGGTTGCTGGCAGGTCCCGTGGGAAAATCAGTGACCAGTTCA 

TCAGATTCATCAGAATGGTGAGACTCATCAAGACTGGTGAGAAATCATCAGTGGTCATCT 

ACCATCATCAGAAGTCCGTTCCGAAGTCAATTGGGAAGTCCTGGGCTGTCCACATGGGTC 

CGTCATCATCTTCATCATCCCATATCATCCATTGTGGGTCATTGGCTTTTTCGTTGGGAC 

CTTACTTGGMGGGGTCTCTTTGTTTAMGTCAmGGGTTTCTrrCMGAGGACACCAG 

CATTTCTGTGGGGG 

Sequence 2518 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 
rTTTTTTTTTTTn I GGGCTTTTATCCTGTTTGTTGTTCTCTGAGCTTCCTGAATCTGTG 
GTTTCGTGTCACATATTAATTTAGAAAMTCTC^ 

TATTCCTTTCTCTCTTTCCTCTCCTTCTGGGGTTCCACTATGCATACTTACATCTTTTAT 
AGTTGTCCCACTGTTCTTGGATATTCTGTTCCTTTTTTATTCAGTCTTT^ 
TTCATTTTTGGAAGTTTCTCTTGAAATATACTGMGCTCCAGATTCTTTCCTCACCCACA 
TCCAGTCTGCTAATGAGGCTTCMCACCATGCTTCATTTTTGGTMGGTTTCTTGACTTC 
TAGAAI I I I IATTT 

Sequence 2519 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACl limn 

I I I U I n n I I I 1 1 I I I I 'CTCGNATCTTATAGGCCAGGCGNGGNGGCTCACGCCTGTAA 

TCCCAGCACTTTGGGAGGTCGAGACAGGCGGATCACGAGGTCAGGAAATCGAGACCATCC 

TGACCAACATGGTGAAACCCCGTCTNTACTAAAAATACAAAAAAATTAGCCAG GCGTG GC 

GGCANACACCTGTAGTCCCAGCTACGCAACTGNGCCACCTCAGAGAAGTANATTTTTAAA 

CAAACCCA^TTTTNTCTTGCTGTTAACAACACTGAANACTAGTANCAGAGGAGTGAAAGT 

GTGGGTTAANAGCCTTAAAAACTTGNGGGAAAAG 

Sequence 2520 

ATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAGAAGGGCCATGCTGTTATT 

ACTCTTACACAAGGAGGCAGCCCTCGAGCCACAGGGTCCAGCTGTTGGCTATAATAGCCT 

ACCGGTCTCTGATGATCACCATGTTTCTGGAATTCAAGCCAGGAAGAAGCAGCAATCTGT 

CTTCTGGATTAAAACTGAAGATCAACCTACTTTCAACTTACTAAGAAAGGGGATCATGGA 

CATTGAAGCATATCTTGAAAGAATTGGCTATAAGAAGTCTAGGAACAAATTGGACTTGGA 

AACATTAACTGACATTCTTCAACACCAGATCCGAGCTGT TCCCT TTGAGAACCTTAACAT 

CCATTGTGGGGATG CC ATG GACTTAGG CTTAGAGGC C ATTTTTG ATC AAGTTGTG AGAAG 

AAA 

Sequence 2521 

CCGGGCAGGTAC I I I \\ I I I I G 1 1 I I I I I I M GTTTGTTTTTTTGTTTTTTGAGATGGAA 

7TTCGCTCTTGTTGCCCAGGCTGGAGTGCANTGGCGTGATCTCAGCTCACCGTANCCTCC 

TGCTTNAGCCCTCCCAAGTAAGCTGGGATTACACGGTGCCATTGCACACTCATGCCTGGG 

ACTMTNGTATANGTACNNTTGTATTAMTNTTTTTGNTATNTTN'rr CCTAG AAGGACGN 

GTAGTTTTCCTCCATTGTATGAATCAGGGGCCTGGGTTCTCCGAAATTTTCCCAACCCTT 

NAAGGTTGGATTCACNGCCCAACCCTCAAGNCCTTCCCNCAAGTAGNTTGGCNTGGGGGT 

ATNTAACTAGGGNCNTTGNAACCCAACNCGGTANNCCCNCAAACTTNATTGCCNTGGNTG 

NTNTTTTTTTCTTT 

Sequence 2522 

CTTAGGGCGAA7TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGAATTGGACATCTG 
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TTGTTTGCAGATGTAATTAAGATGGGTCATACTTAAGTAGTGTGGGCCTCTAATCCAATG 

TGACTGGAGTCCTACTAAGGGAGCAGGGTCAGAGGCAGACATGGGAGAATGCCATGTGAT 

GACGGAGGCTGAGATTGMGTGTTGCAACTGCAATCCAAGGAATGCCAACATTTGGTGGC 

CACCATGACAAGGTAGAAGGAGGCAAGGAAGGCTCCAACCAGTGTCTCAGAGGAAGCATG 

GCCCTGGTGACATGCTGAGTTTACGCTTCCAGCCTCCAGAGCTATAAGAAGGTNAAATTG 

TCTGTTTTCTNTNGGTTCCCTGTTACAGCAGCCTTAAAAAAAT 

Sequence 2523 

AGGTACCGCATTCCTACTTCATTGCCCCTGATGTAACTGGACTCCCAACAATACCCGAGA 

GTAGAMTCTTACAGMTATTTTGTTGCCGTGGATGTGAACAACATGCTGCAGCTGTATG 

CCAGTATGCTGCATGAAAGGCGCATCGTGATTATCTCGAGCAAATTAAGCACTTTAACTG 

CCTGTATCCATGGATCAGCTGCTCTTCTATACCCAATGTATTGGCAACACATATACATCC 

CAGTGCTTCCTCCACACCTGCTGGACTACTGCTGTGCCCCAATGCCATACCTGATTGGAA 

TACACTCCAGCCTCATAGAGAGAGTGAAAAACAAATCATTGGAAGATGT 

Sequence 2524 

TAGGGCGAATCGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAATGGGG 
AAAGACAATCATTGMTACAAAACAAATAAGCCATCACGCCTGCCCTTCCTTGATATTGC 
ACCTTTGGACATCGGTGGTGCTGACCAGGAATTCTTTGTGGACATTGGCCCAGTCTGTTT 
CAAATAAATGMCTCAATCTAAATTAAAAAAGAAAGAM 

CCATTTCTTCTTCTTC I I I I I I AACTGAAAGCTGAATCCTTCCATTTCTTCTGCACATCT 

ACTTGCTTAMTTGTGGGCAAAAGAGAAAAAGAAGGATTGATCAGAGCATTGTGCAATAC 

AGTTTCATTAACTCCTTCCCTCGCTCCCCCAAAAATTTGAAI 1 1 1 1 I I ICAACACTCTTA 

CACCTGTTATGGAAMTGTCAACCTTTGTMGAAMCCAAAATAAAAATTGAAAAATAAA 

AACCATAAACATTTGCACCACTTGTGGCTTTTGMTATCTTCCACAGAGGGMGTTTAAA 

ACCCAAACTTCCAMNGGTTAAACTACCTCAAAACACTTTCCCATGAGTGTGATCCACAT 

TGGTAGGTGCTGACCTAACAGAGATGAACTGAGGCCCTGGNTTGNTTGGTCATAATACAA 

MGGNGCTMNTAATAGGATTTCAANACTTGAANAATGGTGATNGGGCTANAANAATTGG 

AGMNAAATCCTCTGNATTGNGTGGATCGGGNGGGGGATTTTTAA 

Sequence 2525 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAACTAACCTCCCGCTCTTTATCTCCCCAAC 

CTTCCCACTCCCCAGCCTCTAGTAACCATTATTCTCCTCTACCTGTGAGTTGATTTTGAC 

CTCCCACATAAGAGAATGTGGCAGTATTTGTTTTTCTGTGCCTGGGTTATTTCATTTAAC 

ATACTGTCCTCTGGTTCCATCCTTGTTGTTGCAMCGACAGGATTTTGTTCI I I I I IATG 

ACTGAATAATATTCCATAGCCATATACATACCATGTTTTCTGTATCCATTCATTCGTTGA 

TTGGCAGTAAGGTTGATTCCATATCTTGGCTGTTGTGAATATTACTGCAGTAAACATAGG 

AATACAGATATCTTTTCCATATACTGATTTCCTTTGTTTTGGATATATACCCAGCAGTGG 

GCTTGCTGGATCATATGGCAGCTCTGATTTTAGGTTTTTAAGGATGGGI 1 1 1 1 Ml I IGC 

AATGAGAATAAAAATTTTCCATGTATMGAACAGTGTCTTTGMCTTTGTTTATATCCTT 

TGCCCA 

Sequence 2526 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAATACTTTCTGACTGTCCACTAAAATGTCAA 

ACTATTTCAGACTCCTACTCCTCCACTGTCTTAGTTGATTTTGTGCTGCTGTTATAGAAT 

ACCTGAAACTGGGTMTTTATAGAAAACAGAGATTTATTTCTTATAGTTCTGGAGACTGG 

TCAAGGGGATGGCATCTGGTGAGAACCTTCTTGCTACATCATCTCATGGAAGAAGGTGGA 

AG GGC AAGAGGG CAC ATATGCCCAAG AAAATGTGTGCAC ACACAAGAGAGGTGGAGAGAG 

AAAGAGAGAGATAMTAGAGAGACGAAAAGGGGGCCAAGCTCATCCTTTTATTAGGAATC 

CATTCCCATGATCTCCCATGATAACAGCATTAATCCATTATAAGTGCAGAGCCCTCATAA 

CCTMTCACTTTTTAAAAGTCCCACCTCTCAACACCATTGCATTGAGGATTAAGTTTCCA 

ACACATAAACTTTGGGAAATGCATACAAACCATAGCACTACCTCCACTCACATCTTGGCT 

CTTCAATGTCAAGGTATATTGGTTGGTAAGGGAATATAAACACATGAAGACATGTCTCTC 
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CTTTCAATCCAAAGACTTACTGGAATTGNGTCAAGCTTCTTGTGGTCAGAAATGAGGNCT 

TCTCTTTATTTTTGGC 

Sequence 2527 

GCGTTANACGACTCACTNTNAGGGCGAATTGGAANCTCCACCGCGGTGGCGGCCGCCCGG 

GCAGGTACGCGGGCACTGTAATGCTNACTTANCATTAACCTTTTAAGTTAAAGATTAAGA 

GAACCAACACCTNTTTACAGTAGAAATGCCCCAACTAAATACTACACGTATGGCCCACCA 

TTAATTACCCCCATACTNCTTACACTANTCCTGATCACCCAA 

Sequence 2528 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCCTTTTCAGACTAATTGGAACTTGAGTCATG 

GCTGTCACACTGAAATAATTTGTTCATTCAACTGCAGTGAAAATCAATATAGATCAACAT 

GCATAMGTTCAGTAACTATTAAATGGGAGAAGCAAGCAA AAGCA ACAGG AAAATT AAAT 

CAGGATGCTGAGGGGGGAACAGGGATTGCTGGCC ATGAATTTTAATTGAAi l l l iGA CGG 

GGCAAGCTACGTTACATTATGGCAGCCCCCGTAC II I II I I t I I M I M I I I I I M I f l"C 

GAGTTTTATGATTTATTTAACTNGTGGAACAAAAATAAACCAGATTAACCACAACCATG 

CTTACTTTATCAAATGTATMGAAGTAAATATGAATCTTATATGACAAAATGTTTCATTC 

ATTATAACAAATTTCCMTAATCCTGTCMTTATATTrCTAAATTTTCC 

Sequence 2529 

CGAGGTACGCGGGGGGGTGGTGTGGTCCAAAGGACAGGCTGGATGGCGGGTGCATCGGCG 
TGGGCGTGGTCAGCATGTGTCTGCAATGCCTGTGGGCTCTACTACAAGCTTCACAATATT 
AACAGACCCCTGACTATGAAGAAGGAAGGCATCCATACCATAAACCGAAAAATGTCTAGC 
AAATCCAAAAAGTGCAAAAAAAGTGCATGACTCACTGGAGGACTTCCCCAAGAACAGCTC 

GTTTAACC 
Sequence 2530 

CGAGGTACAAAAAACACAAGGAATACAACCCAATAGAAAATAGTCCTGGGAATGTGGTCA 

GAAGCAAAGGCCTGAGTGTCTTTCTCAACCGTGCAAAAGCCGTGTTCTTCCCGGGAAACC 

AGGAAAAGGATCCGCTCCCAAAAACCAAGAATTTAMGGAGTTTCTTAAATTTCGACCTT 

GTTTCTGAAGCTCACTTTTCAGTGCCATTGATGTGAGATGTGCTGGAGTGG CTATTA ACC 

TTTTTTTCCTAAAGATTATTGTTAAATAGATAT^ 

TAGGTTAAATGTCATTTTAGAGATGGGGAGAGGGA TTATA CTGC AGGCAG CTTCAGCCAT 
GTTGTGAAACTGATAAAAGCAACTTAGCMGGCTTCTTTTCATTA I MM lATGT 
Sequence 2531 

AGGTAC l rrrTTTTTTTTTTrrm I GTTTATG AAG ACTTTTATTAAATTACAGTGTATT 

ACAGATTATATCATAATAATMGCCTTTCATCTTTAGGCTAATATGATACAAAAACCTAC 

TTGGCCACATTACTTCTTGAGTTTCTTTTTGGGCAGCTTTCTTNTTTGAC^ 

CCGCTTCATAGCATTGANCCCGTGATTCTTTGTGAMGTTTGGGGCCCTTTAAGGGATGC 

TGAGGGAGAGCTGCTGGATTCTGAAANNAATTTTGCTNGGTAAGAACCTGCCCGGGCCGG 

CCGCTCTTAGAACTTAGTTGGGATCCCCCNGGNGCTGCAGGGAATTCNATATCNAAGCTT 

ATCGAAACCCGCCGACCNTCNAAGGGGGGG 
Sequence 2532 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGTCACCTGCGGTGGGCGCACTAGAG 

GTCTTTTGAACTCCTTCGGCTACCGTCGCCGCGTTCTCGCTGTGCACTCTTATTCTGCGC 

CTGCGCGCGGCTACAGCACGGTTCGTTTTTCCTTTAGTCAGGAAGGACGTTGGTGTTGAG 

GTTGGCATACGTATCMGGACAGTMCTACCATGGCTCCCGMGTTTTGCCAAAACCTCG 

GATGCGTGGCCTTCTGGCCAGGCGTCTGCGAAATCATATGGCTGTAGCATTCGTGCTATC 

CCTGGGGGTTGCAGCTTTGTATAAGTTTCGTGTGGCTGATCAAAGAAAGAAGGCATACGC 

AGATTTCTACAGAAACTATGATGTCATGAAAGATTTTGAGGAGATGAGGAAGGCTGGTAT 

CTTTCANAGTGTAAAGTAATCTTGGAATATAAAGAATTTCTTCAGGTTGAATTACCTAA 

Sequence 2533 

AGGTACAGCTGCTATCTTATTGGACTACAGTAAATATTTTTTAAAAGGACACCAATGAGG 
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GGCACCATCTGGTGTTAACCTTAACCAGAAAGCTGGTTTCCTCCTCCTCCCCCCCCGCGT 
ACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCCAGAT 
TTCAACTGCTCATCAGATGGCGGGAAGATGAAAGACAGATGGTGCAGCCACAGTTCGTTT 
TGATTTCCACCTTGGTCCCTTGGCCGAAACGTCCTAGGCCAAATCGTTGGACCTGCCCGG 
GCGGNCGCTTTTAGAAACTAGTTGGGAATCCCCCGGGCTTGCAAGGGAAATTTCGAATAT 
TCNAAGCCTTAATCCGATAACCCCGTTCNGAACCCTTNGAANGGGGGGGGGGCCCCGGGT 

ANCCCAANC1 11 MTGGTT 

Sequence 2534 , 



AGGNCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACTTTN rTTTTTTTTTI 
M I NIM NTTNNNGGGGNGGGTTGAGTCGGAGTCTCACTCTGTCGCCCAGGCTGGAGTG 
CAGTGGCACGATCTTGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGTGATTTCTGCCT 
CAGCCTCCCAAGTAGCTGGGACTACAGGCAAGCGCCACCACGCCCAGCTAATTTCTGTGT 
TTTTAGTANAGACAGGGTTTCACTATGTTGGCCAGGATGGTCTCGATCTCTTGGCCTCGT 
GATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGA GCTACC ACGCCCGG 
CCTAATTTTTAACTCTTTATTCCGTGTC 

NTTATTCTGNGTCTCTAMGGATTCATTCATTCACCTMTTTTTCAAATACCGTTCTCAA 
GTCCTGGTTATAGACMGGCAGTTATTGGAAAGCACACTATGGCAGGATTTATGGAGTGT 

ACCTT 

Sequence 2535 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGTGAAACTTTTACCAAAGACTGCTGTCTACTC 

TGAGGGCTGCTACCTGTGAGGCTTCAACTGCATAAAAAGACAGCCTTTGTCACCAAGTCT 

TTTTTCCCCATCTCTCACCCATMCCTGTCACCATGCTCMGCCTCMTTTCTTTCTATA 

AAATATAAAMCTTTGTCAmGGCTCTTCmCAGTCTCATATTCATAGACCTCTCGTG 

TCCATGTGCATATTGATAMTCTGTATCCCTTTTCTCCTGTTAATCTGTCTATTGTCAAT 

TCATTTCAGCAGACTCAGACTTCAACATTCGGGGGAAMTTTGAACTTCTCTACAGTTTT 

GGTATCTTTGGCAGGATTGTAAGCAAATCACTCTGCCCACACCCGAGGCTGCAGCCAGAA 

ACC ACGG AAACTG AC AG AC ACC AGC N N N N N AN AAAN NNNNNNNNN N AAAGTACCTGCCC G 

GGCGGCCGCTCGACTTGAGCAGGCTACTGAACCATGAGCCAAATAAACCTTTT 

Sequence 2536 

CCGCGGTGGCGGCCCGAGGTACCACTATAATCTCTAATACTTACTCAGAATTACTGTGTA 
TTTACTTAATTTCTTATTATGTG CCTTATTATGTGCTT AAG ATAC AAT AG GTTA G AGTTT 
MTCTAAATATCTTGAMGCTATATTGTGGGCTTG GTAA GCATTT TGl m i iCTTTCTC 
TGTTTTGGTMGGATTTAAMTTTTTTTCATTGCAATTTTM 

TAGTTT1TATCAMTTTTTGGTGCTTGGTGCAGAGACGGTGTGGGGAAGGGTGAATGGTT 
TTGGG^ATMTTCAGTGCACACCTGTAGGCCTCTTTACATTGTGACTGATAGGGGTTATT 
GCATAICMTTTGGGGCTGTAGAGTGCMTCTCAAGTTTCATCTTTTTCACCCATCANAA 
TTTGTCTCAGGATTACTTGGTTTTTTCTCAGTCCTCMGCCGAGAACTTGCTT 
Sequence 2537 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGGTCCGGAGAGACGC/ 

AGCT/KAGATGTCAGGACATCCTGGAAGCTGGGAAAGGTAAATCTTGCTACTGCTCACTCT 

TTGGGTCCACGCTGCTTTTAAGAGCTGTAACACTCACCGTGAAGATCTGCAGCTTCGCTC 

CTG;\AGCCAGCGACACCACGAGCCCACCAGAAGGAAGAAACTCCGAACACATCTGAACAT 

CAGAAGGGACAGACTCCAGACGCGCCACCTTAAGAGCTGTAACACTCACTGCGAGGGTCC 

GCGGCTTCATTCTTGAAGTCAGTGAGACCAAGAACCCACCAATTCCAGACACA 

Sequence 2538 

C CG G GCAGGTACTC AGGGATAATAAAAATGC AAACAAAGTTTTATG C AC ATAATACC C AT 
C AC AG G GTTACTTATAACAGC AAG AATTAGG AG ACC AAAAAAAAGGTAG ACAATATAAG A 
GG AATG GTTGGGAAGTC ATAATTGG CC ATAC AATGG ATTTTATG AAGCC ATTAATAAGG A 
AGATTTTAATGTAATAGAAAATACATGAGATAAATTAAGTAAAAAAAGCGTGACACAAAA 
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7TGCACGTAAAACCTCACCCATCTTAAMGGATGCTTAATGATACTTTTCACTTTATGCT 
TTCATAATATCTAAATGATCTACAATTAACCTACATf TATTCTCTAATGAGTC AAAAAGG 
CTTTTTTTTTTTTTrrTTTI IGAGACAGAGTCTTGCTCTGTCATCCAGGCTGGAGTGCAG 
TGGTGCGATCTCGGCTCACTGCAAGCTCAGCCTCCCGGGGTCACGCCATTCTCCTACTTC 
AACCTCCCAAGTAGCTGCTTTGAACCCGGGAGGCANAANGTTGCAGTGAGCTGAGANCAT 
GCCACTGCACTGCACTGCANCCTGGGAGACAGATTGGGACCCCCTCTTAAAAAAAAAAGA 
AANANGA 
Sequence 2539 

CCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACC 

TGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCC 

TTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGTCAGCAATGTGATGGTGTTTG 

GAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGC 

ACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCGGCAAGAGGACC 

CTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCT TTCCAAC TTCCAGAACTGTG 

AGAAGTATATGCTTGTGGTTTMGTCAATGGTCTATGGTAATTTTTTTATAGCAGTCCCA 

GCCAAGACAGTGCCTCATTTAC 

Sequence 2540 

CCCGCGGTGGCCGGCCGCCCGGGCAGGTACI I I I I I I I I I I I I I I I I I I I 1 1 M I I M I N 
GATGTTATTTTATTGGGTCGTAAATAAGGAGTAGGTAAAACATTTTGNGCTTTAATAAGT 
GAGGCNCAGCTTAAAAGTTTTAAACAGCAATA 
Sequence 2541 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCTGAAAGATTATTT 
CTAACAGGCTTGTAGAGAMCGTCGGTTCATGTNAATTANAAATTATGGGGCCACTTTGC 
CATTCTTCACACCTGC AATGMCAGGTGTTTATCTGN NGN NCTGACTTATCTCTTGAACT 
CCATTTGCATGGTATNGTGGGATGCAAGCTGATGCCCTGTCCAGATCT 
Sequence 2542 

TCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTCACACTCATTTACCCG 
GGGACAGGGAGAGGCTCTTCTGCGTGTAGTGGTTGTGCAGAGCCTCATGCATCACGGAGC 
ATGAGAAGCCCGCGTACCT 
Sequence 2543 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATTAATTTAACAC 
ATCTTTTGATAATCTCATCCTTGGTGTTGGAAAAGACGGGAAAATCCAAAAGTGTCTATT 
TTGTGCCCAAATGCTCAAGTTAATACTCGAGGGCCTTGATCTCTGGCCTCTTGCCATCCC 
TCTCCCCA7TATTTCTGAAGAAAGTANCCCATTTTTTGCAGATTCCGGAGGGATGGGGAG 

CCACGGC 
Sequence 2544 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNATATACAAAGACTNTGAG 

CTGNNTGCCTCCGATGGTTTCCAGTATTGGCCCGTTGTAMGCTCATTAAGGCCAACTTT 

NACTTTNANTATGTGATTCTGCAGAATTAANTTAAGGAGGCGCTGATCCATGCTGAGAGT 

ATCATNAGAAAANGGCATTMTCCACMGGTGCCAANCAAAAGTTGTMTTTNNTTNCAT 

CNTGGCTCTCANGAAGCAANATGCCAANGCNTTAATNTGGGGNACACCAAAGAATCCGTT 

GAAAGGGNAGGTTTGCTTG 

Sequence 2545 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCACCTTGTGGG 

GCATGTTTGTATTTATTGCTTCATGGCCGACTGGAATCCTGAGTCCTGGGAAGCTGGCAC 

TGCGGGGATCTTGCCCGGTGTCCTGGTCCTCTTGCTTCCGTC 

Sequence 2546 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTTTTTTTl 
MMIir I GGGTTTTGATTATTTGTGGTGCTAANAAGAAGGGAACATGGTTGGAGGCCCA 
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TTCTTGGCAGCAGNATCCAATTTGGAAGTTGAAGGTCTTGTCCTGGAGCCAGATGCTAAC 

CNAAACACAGCAAATGCTTTTCCTMGGCACAATAGTCTTTTCANTGAGCTCAGGAACCC 

TNTNTATGCCANATCCTCGTNTGAACCTTCTTGCTCCTTCTGTGCNTGAAGATGCCCACT 

CCACAGANCCCGCNGTACCTGCCCNGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnCNGG 

CTGCAAGGAANTCTATATTAANCTTAATTCGATACCGTTCGACCTNNGANGNGGGGGGCC 

CC 

Sequence 2547 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCGGGAGGCTGACGAGAGCCGGGA 
GGCGTTAGCGAAGGAAGAGAAAAACCGAAGACGAAGCCACTACAGCCCCCGCGTACCT 
Sequence 2548 

CCGCGGTGGCGGCCG AGGT AC i I I I I i I I I I I I I I I \ I I I I I I II i I I i CGGGATGAAGG 
TAAACCAACGTTT A I II I I I 1 I CCAAAACAGAGTCAGTTTGTGCTGGCCATCCAGGAACT 
CACTMCACTACCCCTCTTAGTTTCATCATTAATCTGTTACTGAACTTCCACCTCCAAAA 
NACTTGNATCCTTTTCTTA I I I I l"GGAGACCGTNTTGCTNTGTCACCCAGGCTGGAGTGC 
AGTGGTGCNATCTTGGCTTACTACAGCCTNAACCTNTCGGGCTCAAGNGATTCTCCTACC 
TNAGTCTCCANAGTAGTTGGGACTACAGGGGTGCACCACCACGCCCAGCTAATTTCTGAA 
TTCTCTGTANAGATGGGATCTTGCTATTGTTTCCCAGG 
Sequence 2549 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTACTTCCACTAAGACTATAATTACTACTGAT 

GGTTATGATGATGTCCCAAACACGCCATCCTAACACCGGCTTCTTACCAGTTTGAAAACT 

TTTMGGCATTGTTTGACTTCTCTTTMTCCTAGATACATTCTGCTTAAMGTCCTTCAT 

TATTTTCTTTAAAAGCGTMCATTATTATTTATCCCACAGTTACTACCCTTGCTCTTTAC 

CAGTTAGCTTGTGACTGMTTACTGAAGCATmCTTAGCGGmCCTACmATACTTT 

CTCl I I I ! GTAATATACTCAGTATTCTTCTGCCAAATGAATCCCTCCCATCTCTTCCCTA 

TGATATCTAACTTAAGCTCTGACAATTTGAGTGTGGGAGTAGTTAGCAGTGATTCTGGTG 

AGGTAAGCAGGGACCAAGATCATAAAGGACCTTAAGT 

Sequence 2550 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACGCGG 

GGGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTA 

TAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAA 

CAGTAGAAAMCATAAGMGAGAAAAACATTTAAAAAATGACAAGGGAAGTTAATGGAAG 

TCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAG 

AAGAGGCCAGAGAGCTTGCGCACCTTCTrCCTGCCATGTGAGGAGCCAAGAAGCCGGCTG 

TCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGC 

TTTCCAACTTCCAGAACTGTGAGAAGCATATGTTTGTGGTTTAGTCAATGGTCTATGGTA 

ATTTTTTTTATAGCA 

Sequence 2551 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCTTCTCATGCTC 
CGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCCCCGGG 
TAAATGAGT 
Sequence 2552 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGATGTCTAGTGATGAGTTTGCTAATACAATGC 

CAGTCAGGCCACCTACGGTGAAAAGAAAGATGAATCCTAGGGCTCANAGCACTGCAGCAG 

ATCATTTCACCCGCGTACAGTTTAGGGGATCCTTTCTAATGACAGGAAGGCACTGCTTTC 

CTCAACACTGTGATCTGACCTGTGACAAGTCTGTACCT 

Sequence 2553 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I I I I I I I I I I I I M I I I I I I GGGCAGCT 
AAAATCTAGGATGGTGTTAAGTTTCTTCATTTTGTCAATTATATATAAAAMTTAGAAAC 
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AACATGAATCTGCATTTCTTGGATGAGATAGTTAATAACAMCTATTTCTCMTATTTGT 

NTACTAAAAAACTAGTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTAT7TGAACATG 

GGTTTCTGAAAGGAGCCTATNTAATAATATAAATGGTATGTAGTAAATGAGGCACTGTCT 

TGGCTGG GACTGCTATAAAAAAATTAC C ATAGACC ATTGACTAAAC C ACAAACATATGCT 

TNTCACAGTTCTGGGAAGTTGGAAAGCCAAGATNAGGATGCCAGTATGGCTAGCTTCTAG 

TGAGGGTCCTC 

Sequence 2554 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGTAGA 

AAATCCAGCAACGAAGAGCCATGGCTCACTGCTAATCACATAACACACACAGGAGTCTAT 

TTATGCATGGAATGTGGCAGATTTTTTMCMGAAGTCACAACTTGTTATA CACCA GAGA 

ACTCATACAGGAGAGAAACCCCTATCAATGGCAGTGAGTGTGGAAAAGCCTTTTCACAGA 

AGTCACTGCTCACGGTTCATCAAAGMCTCACTCAGGAGAAAAACCGCATGGGTGCAGCG 

AATGTCAGAAAGCTTTTAGTAGGAAGTCACTCCTCATTTTACATCAGAGAATTCATACTG 

GAGAGAAGCCGTATGGATGCAGTGGAATGTGGAAAAGCCTTCAGTAGGGAAGTCGCAGCT 

TAAAAGACATCAGATAACGCACACAATAGGAGAAACCCT 

Sequence 2555 

NTTTTTTTTTTTTTNTTN N N N NN N G NG N NC ATATTAATATANG GCG AATG N AGCTCC ACC 

GCGGTGGCAGNGGCCGCCCGGGCAGGTNCAGAGGACACACATTGTANACAGGCCTGTGTC 

ATGTTTCCTTACAGTCGTTTTTTACAGAGAAAAGGGGCATTGTTTTT^ 

AACANTTCCTTGTGAATAAATGAAACATTTCGGAACTCCCTNGNTGNGCAAANAGCCCTT 

CNACTTTTGNTTNNTTTGCCGGGNTAGCCCNGGGGAACCCATTGTTGGTTTGGGTGGAAA 

TTCGNTGNTTTNNCCTTGGTCGNGGGGGGGCCCCAACNTTTNNACNTTTCAAAAAAATNN 

GNACNAANCACCNTGGGGAANGGGGGCACTTAGNTNTTTCGCTTATCCCCCTTTTAANNC 

CAACCTTTCNNCTCTTTTCNACACCNCAAGTCTCCCTCGGNACCTTNTCCNTTATNGNNN 

CCCNTTCCACCTNNNGCGCCCCNAAAACTCCAAGGNNANCCAACCGCCTTGCAAAAA 

Sequence 2556 

TCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAACAATGAGG 

TTTCCCACAGCGGAGTCTCCCTGGGCTCTGTTTGGCTCTCGGTAAGGCAGGCCTACACCT 

TTTCCTCTCCTCTATGGAGAGGGGAATATGCATTAAGGTGAAAAGTCACCT TCCA AAAGT 

GAGAAAGGGATTCGATTGCTGCTTTCAGGACTGTNGGAATTATTTGGMTGTTTTACAAA 

TGCCCGCGTACCTGCCCG 

Sequence 2557 

ATACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAA 

GGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 

TAGGACCCTATACGGCAACCTCCCTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 

TGCAGTGAAGTAAAAAGAAGACAGGGACATCCCAGAAGGTTATGCAAAACATCAAGAGAA 

GAT GAG AGG AGTCTATATGTC AG AATAC AC ATTTCCC ACCTTGCCCAAC AGTNGAAAAAA 

A/JWWVAAAAAAAAAAAGTACCTGCCCG 

Sequence 2558 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGTAGATGGAA 

GGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 

TAGGACCCTATACGGCAACCTCCCTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAA 

TGCAGTGAAATAGAAANANAACAGGGGACATCCCAGAAGGTTATGCAAAACATCAAGAGA 

AGATGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAA 

AAAAAAAAAAAAAAAAAAAGTTCCTGCCCG 

Sequence 2559 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGCCCTGGGCCAATGCGTC 

CTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACC 

TTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGCATTT 
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GGAGCCCTTATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGGAGTATATTGACCAC 
AAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTTCAAGTGCTGC 
CTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGATCGCC 
ATCTA 

Sequence 2560 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I I I II 1 1 I I I I U I I I N 

GG 1 I I I 1 11 1 I I 1 1 11 I I I 1 1 I I I 1 1 I 1 I I 111 1 1 1 I) 1 1 I I 1 11 1 I 1 INGCNCGGNGTT 

TTTTTAANGGAAANANAGCCNCCNACCTNAAAANCNCCCCCCCCTTGCCCNGNACCAAAA 

AAGGAAAAGGGGNAANCNAAACCCCCNCTNGGCCCAGGGGGGGGNAGGGGNTNAAAGGAA 

AACCNGGGCCCGGNACNCNGGCGGGGGNGGGGGGCATCNGCNNGAAAANCNGGGCCTCCA 

NCCCCNAACCTNGTGGGGGANAAAACCNGNCNGGGGGGCCCNCCNANAACCANGGGACCC 

CCNGGGAAAAAAANNAATCCCNAGGGCNAAGNCTNAAAAAAAAAACNGGGGAACCCCCCC 

NAGGGGGGGGGGGCCCCCNGGACCCCCCAANNCTTTTGNGGNTCCCCCCTTAAANGGGGG 

AGGGGGG 

Sequence 2561 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI J I I H HlGlUlWim 

TGAGATGGAGTTTTGCCCTTGTTGTCCAGGCTGGATTGCAATGGCACGATCTTAGCTCAC 

TGCAACCTCTGCCTCCCGGCTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGG 

ATTACAGGTGCCCCGCCACCACCGAACCAGCTMGTTTTTGTATTTTTTAGTAAGAAGAC 

GGGATCTCACTGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGNCT 

CGGCCTCCCAAAGTGCTGGGACTACAGGCGTGATCCACTGCGTCCGGACTTAGTTTTAAT 

ACTATCATTGAATAATTACATACAAAACCTAGATTCTCTGTATATTCCAGATAAAGACAT 

CTAAGACCCGGGAAGTTTAAAGTTCAAAGNTGGTGAACCAGAAAAATT 

Sequence 2562 

CTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGATATATAACAATGA 

GGTGCTCCATCAACCACTTTCTGAAGCTCAAAGGAAATCCAAAAGCCTAAAAATTAATCT 

CAATTATGCAGGAGATGCTCTMGAGAAMTACATTGGTTTCAGAACATGCACAAAGAGA 

CCAACGTGAAACACCAGTGGCCAAATNGAAGGGAAGCTGGAACACATGGTNTCAAAACCG 

AACAAGATAATGTGAACAAACACACTGAACAGCAGGAGTCTCTAGATCAGAAATTATTTC 

AACTACAAAGCAAAAATATGTGGCTTCAACAGCAATTAGTTCATGCACATAAGAAAGCTG 

GCCACNAAMGNCAGGATTACCATTGGTATTTCATTTTTCTTGGGAGAGGGAAAATGCCA 

ACATCATCTTCCTAAAMGGAGAAAAAATGAGGAGGATATTTTTAATTACCAATAACCAT 

TTTAAAA 

Sequence 2563 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCTTTCCCCCAGTGCAAAAGACTG 
TTACTTTATTATTGTATTCAAMTTCATTGTGTATATTACTACAAAGACAACCC-CAAACC 

aa i rnm i cctgcgaagtttaatgatccacaagtgtatatatgaaattctcctccttnc 

TTGNCCCCCCTTTCTTTCTTCCCTCTTTCCCCTCCAGACATTCTAGTTTGTGGAGGGTTA 

TTTAAAAAAMCAAAAMGGMGATGGTCAAGTTTGTAAAATATTTGTTTGT gctttttc 

CCCCTCCTTACCTGACCCCCTACGAGTTTACAGGTCTGTGGCAATACTC'TAACCATAAG 

AATTGAAATGGTGMGAMCMGTATACACTAGAGGCTCTTAAMGTATTGAAAGACAAT 

ACTGCTGTTATATAGCAAGACATAAACA 

Sequence 2564 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTTTTI 

iiiiiiinn F r c iiiiiimiuiiiiiiiiiiniiiinimim miiiii 

n I I I I II 1 1 I I H I I I l I I I I NGGCCCCCCNCCCANNTTNATTNAAAANGGGNAAAAGG 
NNCNTNTNNAAAAAAAGGGGGGGGGGCNAAGNCNAAAAAANCCCCCAAAAACCCAGGGGC 
NTNTNGGNAANANNAGCNGGCNNAGNGNGGCCNNNANNGGCNAAACCCNAAAAAAAACCA 
GGCCCNNNGGGCCTTNGNAMNCNANCNGGTNANATTNNGGNAAAGGGCCCTTTTCCCNC 
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CCAAACCCNAAAA 
Sequence 2565 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTl 
I I II I I I GCTGGGTTATGCCCGCCTCTTCACGGGCAGGTCAATTTCACTGGTTAAAAGTA 
AGAGACAGCTGAACCCTCGTGGAGCCATTCATACAGGTCCCTAAGGAACAAGTGATTATG 
CTACCTTTGCACGGTTAGGGTACCCGCGGCCCGCTCTAGAACTAGTG 
Sequence 2566 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATTCATTGTANG 

ACTTCCTGTGGATGAGCATGAGCTATCATGACCTCTTCTTCTTCCAAGACTGCAGGCTGC 

TGAGAATCAAAGTGGGAGGGCTCTTGTGAGTCTGCTCGTAAATAGCCTTCAGTTCGATCA 

TCTGTATCTACTTTCTCCTCNTTTTGTTGGAAGNTTGAGGATTCATACTTTGGACAACTG 

CTTCTTMTCTCCACATCATCATCATTTTCAGGTTTCTTCTGATTCTTTTTCTTCTTATC 

TTTAAATTGTTTGATCAATGTGMGGACATGTTTCAACAAGGAAACCAT NAAAT ACAGGC 

CTCCTAGAGCTGGTTAGACCCTTCCACGTGGGAANTCAAAATAGGCACTTTTCTTCTATG 

TTTTNGAGAAGACAGAATGACTGNAAAMGGTGGGTCCTCTTTTTTCAT 

Sequence 2567 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTATGCCTGGCTAATA 

TTTTTA1 Mill I ATTTTTGTAGAGACAGGGTCTTGTTATGTTGCCCAGGCTGTTCTCAA 

ACTCCTATCCTCAAGCCATCCTCCCATCTCAGCCACTCAACTTGCTGGAATTACCAGCGG 

AAGCCACTGTGCCCAGCCCCAGACTCCCTTTTACCTAGTATATCATGTCTAATTTCTAAC 

AAAAAATTAAAAAAACATTGTAAGAGGAAAAAAAGAAACCCTAAAACAAACAAGCAAACA 

AAAAACCCAAAACAACAAAMGTTTCAAGAAACAGAGCTAGCTGACAGAACTATATTCAG 

ATATGGCAGGTATATTAGAATTATCAGATTGGTMTTTTAAACAACTATGACTAATATGG 

TAAGAGTGATAATGGGAAAAAATAGACAATTTG 

Sequence 2568 

CCGGGCAAGGTACAGTATATGTATTTTTTTAAGTGACCTCCTCTCCTTCCACAGACCCCA 

CATGCCCAAAGGCCTCGGGACTTCCCACCACCTTGCTCCACAGATCCAGCTAGGCCTGAC 

CTGTGCCTCATCCCGTGCCGCTCGGTCTCTGGCTGATCCCGAAGGGCTTTTGTCTTTCCT 

CTCGTCAAGTTCTTTTGGGTTGTG I M I I I IG I I I I I 1 1 I AATAACTCAAAAAAAAATAA 

AAGACTTGGAGGAAGGGTGCAAGCTCCCAAGTGCATCTGGGGCACATGTTTCTTGGAAGG 

GACTGTCTCGCCGACACTCGGGATTCCCTCTTGCCTGCATGTTGAGGCTATGGGTGACCG 

TGGTTAGTGGGACAGGCAGTGAGCAGTNGAAAGTGCCTGAGTGCCCGAATTGGTGGGGGA 

AGGCTT 

Sequence 2569 

GCGGCCGCCCGGGCAGGTACGCGGGACAGCGGCTTCCTTGATCCTTGCCACCCGCGACTG 

AACACCGACAGCAGCAGCCTCACCATGAAGTTGCTGATGGTCCTCATGCTGGCGGCCCTC 

TCCCAGCACTGCTACGCAGGCTCTGGCTGCCCCTTANTGGAAGAAATGGTGATTTCCCAA 

NGACGANTTCATATTCCCACAATNTTGGNTCTTAANGACCTTGGAAATTAACCAAAAAGG 

AAAC 

Sequence 2570 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACACAACAAGCN 

NNGTCTNNCATAACTGAAACAGANGATTCTGNTTTAGAANAANGCCCATCTGAGCTTANG 

AGCNTAGAAGGAAAAGAAGAAAATATNAGGAGCTTTGTGCATCrrCTACAATGCCTGCAA 

TTTCANAGCTTTCATCATNGCTTTANGGGAGGANTCTCATANTGAATNACNTTAAACCTT 

TCTCGTCCCAAGATCATCAGNCTANAGTCGGAAAAGAACCACCTGCCTCTGTAGCTGGAA 

N 

Sequence 2571 

CATCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGAGCATGAACATCT 
GCAGCGTCTTGCAGAATCACCCCAGAAGGGGACTGAATCATGGTCCTCTTGATAGGTATG 
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TTCAGCAGAGTTTCCAGTCCTGAGGTGTATGAGGCCAGCTGGAGCTCATAATCCTTAATT 

CCCGCGTACCTGCCCG 
Sequence 2572 

TCTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGTGGCCCACCGTGCAAAGCTTCCA 
CTTGGTGCCTGCGTGGGAACGCACCACCTGCCGTGGAACTTCCTCCTGCGGGGCAGAGGA 
GGGGTCCCCCGCGTACCT 
Sequence 2573 

ACTCACTATAGGGCTTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTTGGTCATTT 
CCTTAACCAAGGATCAANGACAAAAGGCCAGACAGCTCACTGCACTGCCAGGCTGCACGT 
CGGAAGCCGGGCTCCGCGCGACTGCCCCCGCGTACCTGCCCG 
Sequence 2574 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGCAGGGGTTGCTTTGGTCATTGCTGTGTTGA 

ACAGTGCACCTCCTTTCTTGACTGTATGCACCCATTGTTGATATGACCTCGGGTTCCCTC 

CACAAAAGCCACCTGAAGTGATCTCTTCTTCAAATACATCAAGAGAAACCCCTTCCTGCA 

TTTAGTTTTTGCCAAGTCGACCATCGATAMCTTGCTTAAAAAGCTGGAGGTTAA 

GTTTCCAGGAACTGTTTCATCACCGCTTGAGNCGGTGCTTTACAAAACTCTCCTCACAGA 

AAGTGATGGGCTCACCAGGTTTGTATCTCAGTGCATCTCTGTAGGATCCAAACAAAGCAG 

CCTGTGCTCTAAGAAAGGCCCTAGCTACTCCATCACCCGTAGCTGTAGACTGCTTCTTCA 

GTTTATTTTTCAAG GCCG AG ACC AC ATC ACTTG GTAGGTT 

Sequence 2575 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGGGATTTCGGC 
CTGAGAGCGGGCCGAGGAGATTGGCGACGGTGTCGCCCGTGTTTTCGTTGGCGGGTGCCT 
GGGCTGGTGGGAACAGCCGCCCGAAGGAAGCACCATGATTT 
Sequence 2576 

CCGGGCAGGGTCCCCGGNANTTCCANMTTTTNCNTGGCNTCNAAGGTCCTTTGAGGNAG 

AAGGCTCTGGGGCTTCTGCTTGTCCCTTTGGAGGGTGTCTTCTGGGTAGAGGGATGGGAA 

GGMGGGACCCTTACCCCCCGGCCCTTCTTCCTGACCTTGCCAATAAAAATTTATGGTCC 

CAAGGGGAAAAAAAAAAAAAAAAAAATMTTNGTTCCTNGGCCGTTTNAAMCTAGTTGG 

ATCCCCCNNGCTTGCAGGAAATTAAANNTNMGCTTATTNATACCNTTCAAACCTTG 

Sequence 2577 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I M I I I I H I Ml I I 1 1 I I I 1 1 I I IGTAGGG 

ATGGGGTCTCGATTTGTCTGCCCAGGCTGGTATCAAACTCCTGGGCT.CCAGCGATCCTCC 

CGACTCANCCTCCTGAGTAGCTTGGGATTACAGGCACGTGCCCACACACGTGGCATCATA 

CACCCATTTTACAGGTGAGGAAATTGTGGTTCACCGACAGTAAGTGACTTGCCTAGGATC 

ACACAGGTAGTAAATGGTGGAGCCAGTAGTTAAAATCANATGAGTCCCTAAGTCTTCTGT 

TCTCCCTTACTTGTGTTCTGTnTTTGMTCCCTMNAGCTTTGTGCAATGCCCCTGTCT 

GCTGGATGMCCAACAMCACTGCTTTAAAACTCtACCAAAAAGTGAATCCCGCGTACCT 

0 

Sequence 2578 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGAGGCTCGCATGTGTGGGTGCATGCCGT 

GGCAGGAGAAGGCACGTTTATCCAGAGAGTTGATTCTCCTTAATACCAGGAAACTAGGCA 

ACAGCAGGGAGGGGACAATCATGAGTCCTAGGCAGGAAAGGTCCTCTGGAGAGAGGAAAT 

GGCACACGCGCCCCCGCGTACCT 

Sequence 2579 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGC CGAGGTAC't ITTTTTTTTTTTTTT1 
TTTTTTNCCTTTTTAAAAAAGTTATTTATTTATTC 1 M I I I I I! I I I I I I I GGNAAGGTT 
GAATGCNCTTTTGGTTTTNGGTCATGTTCGGTTGGNC 

GATGAATGCNAAGGAAAAAAATATTTTTCCAAAGTCCCATNGTGAAAATTGTCNCCCA^ 
TmGGGCTTTTGAGGGGGTNCAGTTGGGGTTGCTTGTCTGmCCCGGGTTTGGG 
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Sequence 2580 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC M i I 1 1 I 11 i I I I 1 I 

TTTTTGCCCCGCAGCTTTTATTAGCCATCGTTACAGTCTCTCGTCCACAAACACTGCCCA 

GCTCCCTTCCCAGGGCACTTTACTGGAGGCAAGTGCAAAACACTGTGGTGCTTTNTATTC 

AAAGAAACAGAAGATGTNGCTGTGGGTGGTANACGGGACAGCCCAGNTTACCCACCAGTT 

TTACAGNAAGGATCCCGCAGTGAGCTTTGATACAGATATGAGGATAGATGACGGTAAAGG 

GGG/W^GAGmAAAATTATCMGmCCTCmGAAGTGTCTmCCCAAGGCACTGT 

GAAGAANACTGCTGCTAGCAATTTTCTAGGGACTAAACACACCGATGGGAATATAAATAT 

TTCAAACACAAACCCCTGTTACATTCTTACAGGTTAGTTTC 

Sequence 2581 

CCGGTGGGGCNGGGCCCCACGGTACCCCGGTNTCCCTTCCANATGGGCCCAAAAAAAATA 

TCTTNNTCCCCAANACAAAANCCCNAAAAATTTTGGGAAAGGAAAGGGGCAAAAGGGNGC 

CCNNNAAAAAACCCNNGAAAGGGCAATTTTTNGGNAAAGGGCNNI M I I I f I AAAAATTN 

CCTNGGGAACCCCNTTGGGMGNAAAAANTGGTTTTTACAGGGGNAAATTGNATTAAACC 

TTNAAAAAATTTTCCTTAAGTNGGGTNAAAGMTCTTGGAAGTTACCCTTG 

NCCGGGCCCGGCTTCCTTAAGAAAACCTTAGGTNGGGATTCCCCCCCGGGGGCCTTGCAA 

GGGGNAMTTTCCGAATTAATTCAAAAGGCCTTTMTTCCGMTTA CCCGGT TCCGAACC 

CTTCAGAAGGGGGGGGGGGGCCCCCCCGGGTTACCCCCAAGCCTTTTTTGGTTTCCCCTT 

TTTTAAGNTGGGAAGGGGGTTTTAAMTTTTGGCCGCCCGC^ 

TTCCAATTGGGGTNCAATTAMGGCCTTGGTTTrTCCCCTTGGTNGGTTGGAAAAMTTT 

NGGTTTMTTCCCCGGCNTTCAACCAAMTTTTTCCCCACCACCCAAAACCAATTACCCG 

AMGNCCCCGGGGGGAAAGNCCANTTAAAAAAAGGGTTGGTTTAAAAAAGGCCCCTTGGG 

GGGGGGTTGG 

Sequence 2582 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTCCAGGTTTCAGAAGGTTTGATGC 

TATACATTGTGTGAATTTTGAGGTTATGATGGTGGCCCAGAGAAGGTTGGGAGGCATTAG 

TGTAGTGTTGGAGGGATTTGTGTCTCACAGAGCATAGATCGTATTGTGGMTTTCATTCT 

TCACAGAAGCTGTCTCCCTCTCTGCCCAGGGTGACATTCCATGGGACGACAAGGATTTCA 

GGATGTTCTTCCTCTGGACTGCTCTGTTCTGGGGTGGAGTCATGTTTTACTTACTGCTCA 

AGAGATCCGGGAGAGAAATCACTTGGAAGGACTTTGTCAATAACTATCTTTCAAAAGGAG 

TAGTAGACAGATTGGGAAGTCGTCAACAAGCGTTTTTGTTCGAGTGACCTTTACACCAGG 

AAAAACTCCTGTTGATGGGGTAAATCATTTTATTAA 

Sequence 2583 

GCCGAATTGGAAGCTCCACCCGCGGTGGCGGCCGANGGACTCTGGCGTTGGTAACAATGG 

TTTCCNGGANCTTNGGNTNGTAACNACTGCTTCCNGGGAACTTCTGCGTTGTAACCACTG 

GCTTCCCGGGACTCTGCGTTGTTACCACTGCTTTCCGGGACTCTGCGVrGTTACCACTGC 

TTCCCGGGACTCTGCGTTGTTAACCACTGCTTCCCGGGACTCTGCGTfGTTACCACTGCT 

TCCCGGGACTCTTGCGTTGTTACCACTGCTTTCCCGGGGACTCTGCGTTGTTACCACTGC 

TTCCCGGGACTCTGCGTTTGTTACCACTGCTTTCCCGCGTACTCTGC<3TTGGTTACCACT 

GCTTTCCCGCGGTAACCTCTGGCGNTTGTTAACCATTTGCTTCCCGCGTTACCTGCCCCG 

GGCGGCCGGCTCTAAGAAACTAGTGGGAATCCCCCGGGGCTTGCA^GGGAATTTCGATAT 

Sequence 2584 

GACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCAA 

ATTGGGCTACGAAAAGTGGCAAGTTTGATTGTAGCATCATGTCTTTGAGTGTCCTATTGG 

ACTACAGATTAGAGGATAATAAAGMCATTCATTrGAGGTTTCATTGTTTGCGGAACTTT 

TCAACGAAATGCTTCAAAAGAGATTTTNGGTGTCCGTATATACAAATCATTACTGTCTCT 

TCCTGAGAAAGAGGACAAAAAAGAAAAGGATAAAAAAAGCAAAAAAGA TGAG AGAAAAGA 

JAAAAAAAAAAAAAAAAAAAAAAAAAAGTCCACCGTTAANAAGTTGAG 

AGGCAGCANAAGGTTTTGGAGCCAAGGAATGAAATGATGAGGCGTCCTTCAGGTAATGAA 
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CTTCAGCTTGCAGTGTGAAAGGGGCAGGGAAGACTGGCCAGCTGTCAAAAACTGGAACAG 
TO 

Sequence 2585 

CTATAGGGCGMTTGGAGCTCC CCGC GGTGGCGGCCGAGGTACATTTTACAAAATTAAAG 
TCAC ^~T TQ ^ QQ^ATATAAAMGTTTTAAMGTTGGTTTATTAATG AG AGTAGTG N ATTA 
CATTTCTTTTCMGCAGTAATTTAAATATTMCCNGTTTACTTTT^ 

ANCTTTNCAGCCTGGGACCAACATNGGNGAAACCNCGTATTCTNCCAAAAATTGAAAAAT 

TAGCNGGGCNTTGGGGGGNTCACNCCCGGNAGCGTGTTTCTTTTGGGNGGGGGAAACCAA 

TGNGTCTAMCNGNNCNGNNGGGCACCNGGGNTATACCNACCTNTGNGGGCCTTTTTCCN 

TAAAAAATCCCNNNAMTTTGGGACCCTNNGCCNNGGGCNGGGCGNTTNTTNAAAACNNN 

NGTGGGGGTCCCCCCCCNGGGCCNGNACGGGAAAATTNTCAATNTTTCCNANCCCNTTTT 

NNGNNTNCCCCCGGGCNNACCCCNTTNANGGGGGGGGG 

Sequence 2586 

T^CTTAGGGC^ 

rTTTTTTTTTI INAI I I N I I I 1 1 I I M J II I II I M i I j I I I I I I CTTTGGGCAACACT 
TTATTGGGAAANATTTACNCNCGGGGACCTGTCNTAGGCCAAGCNANNAAAAAAGGGCCC 

CAGGANCCCTGGGGTCCCNNGGGGGGCTNAAAATGGAACCCATGGGACGGCCNNTNTAAA 
ACTAGNGGATCCCCCG 

Sequence 2587 

GGCGAATCGGACTCCACCTTNGGTGGCGGCCGAGGTACATTTCCTTGTAGACTCTGTTAA 

TCTCCTGCAGCTCCTGNTTTGGTTTCTGGAGCANATGAATCTCAATGAGGAGAGTCCTCG 

TCGGTTCCCAGCCCCTTTATGGAAGCTTTANNCTCAGAANCGTCATACTGAAACAGGCNT 

TTTTCAANAAGGNCCCNAAAAMNCACCCGGTTTTTCCAAGGGTAGGGCCNANAAAAAAA 

GGGCCCNAACTTTNCMNATGCCTTGAMTGCCNAANNGTNTTCCCNTTTTTNAGGGGTC 

CCCCTTCTNCCTNGGGGNTAANNGGCCNAAAAAGGCCAAATTAANTCCCCTGGNACTTCT 

TGGNNGGCCAATTNNGGCCCCCCCNGGCNGGTAANCCCNGGCCCCCGNGGGCCNAAGGNC 

CAGCNTNCNTAANAAAACCNAAAGTNGGGNAATCCCCCCCCGGGGGCCCNTGGCAAGNGN 

AAAATTTATCCNAATTATTCAAAAGGNCNTTTAAANCCNGAATACCCCCGNCCCNAAACC 

CCTTNNGAAGGGGGGGG 

Sequence 2588 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTCCT 

CATCATCACTGTTGCTTGAGCACCGAAGAAGAGTGGCCCAGCCTTTTGGGTGCAGCTGTA 

AGGACGTGATGCAGTTTCCGTGGTCACTCTCACCAAGAACTTCAGGGGTTACGACTTCAA 

GACTATTTCGTGGTGAMCTCCTGMCTTTGMGTAGTTCTTCTTGCTCTTAAAATGGGA 

TAGCTGCCTACTTCTAAAGACTTAGTTAAGTCAAGGTCATGCAAAATTAAGAGATATCCA 

GAGGACGTTGAMTCMCATTTTGGAACAATCTGGTGTTAACCTCATTCGCATGAGAAAA 

CGTGTGTGAAAGMTTTCTTATGTGGACACCCATCTTCTGTATACCTGTTAGTGTCCCAA 

ATAATGACATTTCCATCAAATCCTGATGTTACTAAGAGTCTTGTATTAAGTATCA 

Sequence 2589 

AATTGTTGCTCCNNCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAACATAAATTGATT 

TCTCACTGTTCTGTGGGCTAGATTGGCAAAGATCAAGGCATTGGCATGTTTTATGTCTTA 

TGAGGCCTGCTTTTGACCACTAAGATGGCACTTTGAAAGCTGTGTCCTCACATGGCAGGA 

GGATGGAAGGGGAAGAATGGGTGAATGCAATGTGAAACCTCTTTAATATCATTATTCCCA 

TTCACCAGAGTGG AGCC CTTCTTACTTACCTCCTAAAAATCCTACCTGTTAATACTATAA 

TATTGGTATATMGTTTTAACATATGAATTTTGGAGAACATATTTAGACCATAATAAATA 

TTAATAGTTNCACCAGATGTMTAAATATCAAGCAATATMTTAAATAAATTTCTACCTC 

ATAAGTAAAACTACAGG 

Sequence 2590 

CNGGCGAATTTTTCTCCTNGCNGCGGCGGCCGAGGTACGAACTTTTCTCCANAGGATANT 
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TAGGTTGCACCN7TGTATTTGTAAAACAGGAGCAMTTTGGACCTTGCCGGGCCAAAGTC 

GTGTCACGTGGAACCTCTTAATCTCAGCATCCGGAGCTCCAGGAAGGGAAAATTTCAAGT 

CAGATAGAATTCTATATATAC C ATTTCTTTGG AACCTTC AGCCC TC AAGATTCC AACATC 

ATGACCTCAGTTTCAACACAGTTGTCCTTAGTCCTCAT GTCA CTGCTTTTGGTGCTGCCT 

GTTGTGGMGCAGTAGMGCCGGTGATGCMTCGCCCTTTTGTTAGGTGTGGTTCTCAGC 

ATTACAGGCATTTGTGCCTG CTTG G G GGTATATGC AC GAAAA 

Sequence 2591 

GCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGAGCCAGCACTCCCTGACCTCAGGGTGTG 

TGAGGAGTTGGCACTGTAGAGAGAAACCAAGACTCTTCCTCAATGCCTGCCTTATTCTGA 

GCACCCACCCTTAGCTCTGATAAGGATCCCGCGTACCGTGGCCTACAATGACACACAAAT 

CCCATTAAATAAATTATAAACAAGGGTCAATTCAAATTTGAAGTAATGTTTTAGTM 

GAGATTAGAAGACAACAGGCATAGCAAATGACATAAGCTACCGATTAACTAATCGGAACA 

TGTAAAACAGTTACAAAAATAAACGAACTCTCCTCTTGTCCTACAATGAAAGCCCTCATG 

TGCAGTAGAGATGCAGTTTCATCAAAGAACAAACATCCTTGCAAATGGGGTGTGACCNCG 

0 

Sequence 2592 

GGGCGAANTGGAGCTCCTTTGCGGTGGCGGCCGAGGTACTTGCACATCCTTTTAGTTATC 

TTTTTTTATATAAGTAAAGNTCACACACATAMTTCACTTGAAm 

TTGGCAATGAATACAGTGCCTGTTGAGGCCTGAAAGAGAGAGGAAATCAAGAACAGTGTG 

GGAGMGAAAGGCAATCCTTCGTTGAAAAGCTTTTCTCTTAACAATCCTTAGTGCCTGAA 

AMTMGCCATTTTGATAACAAGAGTATAAAAAGCCTTATGATCTGCAAAGTAAATCGTA 

TCTGGTTTAAATTTCATCCCAAGTCTCAGGTTATGGCTGAG GAGCTAT ACTAAGGATCTT 

TGNAGCNCATTTCTTTGAATTTTTATAACTAAGACAAGAAAA I Mill I ATCAGATCT AG 

GCAGAAATACTGATC 

Sequence 2593 

CCGCGGTGGCGGCCGAGGTAC I I M I i II M l I H I I M I I I H I I I TTTTNCl h um 

1 1 1 i i i i 1 1 m i i i i i i u 1 1 i i m i n i m i i u n m i i i m i m n i i i i i i i i i 

M M II I I NANNGGGNAAAANCCNNCNNNAATANNNTCCCNNNCAAAAAAAAANATTAAN 
AAAAN GGG NAN AN AAAAAAANT AC AAAN N N AG AAAAACT AN AAAAAAAAAAN GGG N AAAA 
ANATTNNTTTTTTTCCCCCAAAAMNNNTNTTTAGGGGCCCTAAATTAAAA 
ACANANNGAAAAATCNAAAAAAAGNANNGNAAAAAAAAAATTAAAAAA 
Sequence 2594 

GTGGCNGGCCCGAAAGTAAC I II M M r NTTNNNAAAANAAAAANAAAAAACANANAGAA 
NAN AAAAN AAAAAAAANAAANAAAACAAANAAAAANAAAACANAAAAAANCAN N ANAAAA 
AN N AAANATAN AAAAAACCAAAAAAAAAN AAAAAAAN AAAN AAN N AN AAAAAN N AAAN AA 
AAANATAAAAAAAAAAAAAAAAAANAGAAAAAAANCAAAAANAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAANATAAAAAAAAAAAAA 

Sequence 2595 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTAC rTTTTTTTTTTTTTTTTI 
TTTTTNNNGACTGMTCCAGCTTTATTAMGATACTTTCCATAAACAATCATGGTATTNC 
AGGCNGGACANGGTGCAGACAANTGTAAACAGTATACAACAACTTTC 
Sequence 2596 

CGAGGTACATGAAAAAATTCATACTGGAGAGAAACCCTATGAGAACCCTAACCCTAACGC 

TTCAGTTGTCCCAGTTCTTTCATGAGCATGAAAGGAGTCACATAGAGAAACCCCATGAAA 

GTMGAAATTTGGGAAAGCCTTCAGTCCTTTCTGTTTCTTTCAACTACGTGAMGGATTC 

ACGGTGGAGAAAGACCCTGTAAGATAATTGGCTTTAAATTACGAGAGACTTGTGATAGGA 

CAGTAAAACCTAGAGTTGGAGTTGGATCTCTGGATTGTGTTATGTCAGTGTTGGTAGGTT 

AGGAACTAGATTTCCCAGAATCCATTCCATTTGTGATTCCATGATACAATTCACCAGTAA 

CCTATCTTTACATGAGATTCNGGMGTAAGTTMGAAAGGCATTAGTCATGGTTTGGAAG 
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CACCATACAGGGAGACAGCTGTGTGAATACAGGCTGTATGGACACTTGCTTCCATCCCAT 

TTTCCTGCTTCTTTGGGTrGGCAATCAAGAGTATCCTCAAAACGACTTGACmMTm 
CT 

Sequence 2597 

CGCGCAGTCCGCCTGACTATACTACCTTTAGTAAATAAACCNCCCTTTCNGGATGCCACT 
ATCTCCGATGGCACTATACCCTGTCTACCTTCTATACCACTAAATTAAACCAACTCCCCT 
CCTCCTCCCAAGCNATAAAAJAAAAATATACAAACCCTGGGCCAAAATGANNAAAATCTG 
TCNTTATTATTGCCCCACAATCCTAGCCTCNCGCCGAGACCTCGGCGNTCNAAACTAAGT 

GATNCCCGGCTGAGNATCGTATAAGCTNATCGATCCGCGCCTCGANGGGGNCCGNACCC 
Sequence 2598 

CCGCGGTGGCGG CCGAGG TAC 1 M I I 1 I 11 H M I M 1 1 11 11 1 11 I U 1 GCTTCCTTGT 
CTTTGGTTTCATTGTTTTTTCTTTATC^ 

ATCCTMTACTGCGTTTGTAATACTTGCTCTTCACATATTTGTATGCAGCCAGTCACAAG 

TGGAATGTGACTAGAATCCTTGTTACTCAAGAGGCTGAGGTTGAAGGATCGCTTGAGCTC 

AANAATTTGAGTCCAACCTGGGCAACATGGNGAAACCCCATNTCTAAAAGAAAATNCNCN 

CAAAACTAAGCTTGATTACTTTTATATTTTCTCCAGAAACAAAAAGAAATCNCNCTNTGC 

TGNCATACTCACCCTNCCCACACTGNTCCTTCAGCAACATATACCATTGTTTTTAG 

Sequence 2599 

NGGCGGCCGCCCGGGCAGGTACTGGCTTCCTTGTTMTCCTTTATCCCCTTCAGGTGTTA 

TTCAMTGTCATCTTCTCAAGMGGCCTTGTGTGATTAGTTTATTGAMTTATTCCACTC 

AACTCCTATGTCTMCACTATTCAAGTCTCCTTTCCTGCTTCMTTTTCCTNCTGGGCTT 

CNTACCACCATCTNAACTTGGGAAACCATGTMTATTGTTGACATTATTTTGGGCTTAGN 

ATTMTTTTCACCCCATTTNAGCCATNGTAAMCCTCCACGAAAGGCCTTTTATTTTACT 

GCA AGTAT TTTNCTGCCACCCTAGMCAGTATTTGGGTTATATMGCMGGTCACTTTM 

AATATTTTGTNTGMTGGAAGTGATGTAAGMGGAAGGGTTCCAAAAAAAGCCCACTTGG 

GAAGAATGGTCATGGGCTTTGACCCG 

Sequence 2600 

CAGGTACAGTGGATTTTTCATTTGCAAAGACGTTAAGCCCTCCAAATGTGCAAATCATGA 

AGTCAGTNGTTGTTCCAGCAAGGTTTGCCCAGCGGTAAAAAACAAGATAAAACTAATGCA 

CTAGCTGAAACCAGGTGGGGAGACCATGTGTGGTAGTGCTTGGGGGTGGAGGGAAACTAT 

TTCTGAAATGAGGACTTAAAGTATAATACCAGCTTCACTGCCTGTTCACATGAGAAACCA 

MGCTTCAATTTAACTGCANGCAATAGGAGTTTCACACTGTCAGCACCAACTGTCTAANA 

TNCAAAACTAGTATCTAAATGTGTAGGATCAAAACCAAAAATCTGGAGGGATCTAGTTAA 

ACTTCAATATGCATGACCCCAGATTCCCN 

Sequence 2601 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCTTTCTGGCAG 

TGAATGGTCl'GTATTCCTCTAGTGATGATGTGATCGAATTAACTCCATCAAATTTCAACC 

GAGAAGTTATTCAGAGTGATAGTTTGTGGCTTGTAGAATTCTATGCTCCATGGTGTGGTC 

ACTGTCAA/ k GANTNACCCCCCNNANTTGGAAAGAAAAGCCAGGCAACCTGCATTTAAAAG 

ATGTTGTCAAAGTTGGTNGCAGTTGATGCAGATAAGCATCATTCCCTAGGAGGTCAGTAT 

GGTGTTCAGGGATTTCCTACCATTMGATTTTTGGATCCAACAAGAAACAGGAGCANGAA 

GNATTACCAAGGTTGGGCAAGAACTGGTGAANCCATTGTNAGATGCCTGCGCCTGAGTGC 

CTCTGCGCCAG 

Sequence 2602 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACAATACAATCTAGATGACGGTGCAGACTAAGT 
CAAGAACTAAAGTTGTGCAGTAACCCGAGTTAAGGCATGAATGCAGACACACACATGCAC 
AC AC AC AG CACCCATGCTATCAAG AC AC AGG ATTTTTTC AGTTG CCTC ATG AG AGGGC AA 
CCTGGGCTTGGCAGTTAATCAGAACTGCTGAGCATTCCAGAAAATGCCCCCCACGACTTT 
ATGCTAACAGCTGTGTGTATGTTTTAATCAAAAAATTAAAGAAGAAAAAAAAAACCTAAA 
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AAACAAAGAAAAAMCAM^ 

AATGGCTCTTGAGCATGGAACTCATGTAGCAGCATCAATGGCTGGCTCTTTAACAATTTG 

GAAATAAAAGGGTTGTTTTACTATGTATTTCTTT 

Sequence 2603 

TGGGAAACCGGAGCTCCCTTTNGGTGGCNGCCNCCCGTACGCGGGATGTGGCAGTGAGTG 

GAGGACTGTGGATGAGTGACCCCNTTAAAAGTAAGATATTGTCACA TAAGCT CTTGAGAC 

AGTTMCTCATTATCTGCAGTTTCTTCCATTCCACAATAGGCTACTTATTTTTAATGCTG 

TAmACATTGAGNAAANACCTCATGmNMNTTTTNATAAAGAAAAAAANTATTNANC 

TNNTTNTANTTCTACCAGAAAGGGTNGGACNATNTTCNAAAAAGGGCACCGGGTGAAATT 

ACCTNATGGTNCANATGNACCNGCNACTGCT7TNGCCCATATGCCCAAGGGANANTACCA 

NTTGGGNTCANAAAANANCCATGAAANTGGGNGGAAAAAAACTNATTCACTTTCANNAAG 

GATACCTTGGGCCCATTCAANTCAAAGGGTACCTTGCCCGG 

Sequence 2604 

GMTTGGAGCTCCCCGCNGNGGCTTTGTNTMTGATAGGAATTATTCCTTTMTTTAAAA 
AATACTTTCAATTTTTTAACAGTTTTTTCCTTACTTGNCCAATGGG 
GAATGGATCCTGCCATTGCTTTGGTGCTGGCCATTATCA 
Sequence 2605 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGAACAATCAACAGTT 

TCTGATGTTCCGAGGGACCTGGAAGTTGTTGCTGCGACCCCCACCAGCCTACTGATCAGC 

TGGGATGCTCCTGCTGTCACAGTGAGATATTACAGGATCACTTACGGAGAAACAGGAGGA 

MTTNGCCCTGGNCAAGGNANTTTANNTTGGGNCCCNGGGAGCAAAGTTCTTACAGCTTN 

CCCANTNAAGCTGGGNCCTTTANNACCTTGGGAGNTTTGAATTTATTACC 

Sequence 2606 

TGGTNAATCCNNTTCACAATTTCCACACCAACCATACCGAAGCCGGGGAAGCATAAAANG 
TGTAAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn nnn 
nnnnnnnnnnAAGGGCGGTTTTGNCGTATTGGGGCCGCTCTTCCCGCTTTCCTCG 

Sequence 2607 

CNAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGGCATGCAGGCGTGCACGTT 

TCCCGGCTCTCCGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnAGGGTTNAATTTGNCCCNCTTGGCCGTTAATTCATGGGTCATMGCTTGTTT 

TCCTGTGTTGAAATTGGTTATCCCGCNTCACAMTTTCCAACACANACATTACCGAAGCC 

CGGNGAAGCATTAAAAGTGTAAAAAGCCCTGGGGGTGCCTTAAATGGAAGTGGAAGCTAA 

CTTCACATTAATTTGCGTTTGCGCTCACTTGCCCGC7TTCCAGTTCGGGGAAAACCCTGT 

NCGTGCCANGCTTGCATTTAA 

Sequence 2608 

CGGCCGCCCGGGCAGGTACACTTATNGTTGAGAGCCANGTCTCCCTTATCATTGGTGAAT 

GAGAATGAGCTACTGAAAACAAAAAGAGGGTCTTCTACTT CAGC CTGTACC CCTA ANATT 

TTATTATCAGNAAAGCAGNAGATTNACCNGTCCTTT^ 

NGGGTAAAAANAAAAANGGTAAATGTTTTCCCCTTATNTAAAANM NATTTC CCTTTCCC 
CNGGTTTTCCTAACCCCCTTGGNNAAAAMCNTTTTTTAATGGGCCTTT^ 
GTTTTNACCCANTAAAANTCCAATGGGGCCAATGGGGGGGGGCCTNA ATGNA AACNCCTT 
AAAAAACCNTCAATTTTTNTTTGCAAAAAT^ 

NGGGGCCTTANTAAMTNTTTTCANTNCNNTGNGCCNTT MTTTTT GGGAACCANNNCAA 

NANTTTTGGANTAAAATAATTGGGNTTGGGNNTTCNCANATTTTTTCTCCATGNAAGGAT 

CANNNACNNNTTGGNANTAAACNCATAACANTATCTTCTNTGGGGTANTCCCCCACNANT 

NTTATTTCCAAAAATTGG 

Sequence 2609 
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CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGCTGC 

ATGCTGTGGAGGAGGGTTTCAGAGGAGAGAGGTCGGAGAGCAGAGGCCTGAGAAGCCAGA 

GGCAGGTGGAGAGAGGGTGGAAAGTGAGCAGCGGGCTGGGCTGGANCCGCACACCGCTNT 

CCTCCCATGNTAAANNNGCCCCTTTAGAAAAATTTCACAAAGTCCCCCTTCCNAANTAAA 

N N NANATATATATTTTATAAGTTACNCTNN GGCCGCTCTAGAAACTATGGNGGGN ANTCC 

CCCGGGGCNTGCAGGGNAAT 

Sequence 2610 

GCAAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAAAGCGATCGGAACCCGGGAATGAAAT 

CGTCCCAGGCGCGGAGTGGAGCGATATTAACGTCCATGTTCTCAGCTTTGCGTTCTTCGC 

CTCCAGAATCTGGCGGCGGCGGCGGCGGCAACCGATCAGCTGAGCTGAGTTGGAGAGTTG 

ACAACCCCCGNGNTNCCNTTTGGCCNGTTCTTAGAAACTAANTNGGATCCCCCCGGGCTG 

CANGGAATTTCNATANTCAAGCTTATCCGATACCCGTnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGTAAATCATTGGT 

CATMGANTGATTTTCCTGTGTGAAAAANTTGNTTAATTCCGCCTTCACCAAATTCCCAC 

CAC 

Sequence 2611 

TCCCCGCGGTGGCGGCCGAGGTACGCGGGTGTTCCTTTTTGTCCAAAGTCTATTTTTATT 
CCTTGATATTTTTC I ITTTTTTTTTTI I GNGGATGGGGACTTGTGAATTTTTCTAAAGGT 
GCTATTTAA 
Sequence 2612 

GAGCTCCCCGCGGTGGCGGCCGTTTGAGAAGCCAGCGCTCACCCACCCGGGGTCTCTGTG 

CATTGACCTTTGGGTGCTGACTTGGAGAAAAGCACAAACACGACCAGTCCCATCCTGGCT 

CCCGTGGGGCTTCTTCTATCTACGCATTGTATCGACTGCATTAGTTGGACTAAGAAGGAN 

GCNTCANNTTAAAAGGGNGGAGACAAAATGCCTGAACTGGTCTAAAGCCAANGAAATGGG 

CCCCAAAGCTGGGCAAGNAAAAAGCACCACTGCATACATAGGGATACAGAAGGGCAGAGC 

TTCTGCCTGCGGATCTGCAACATTACATNTTGTTTTTGCCTGCAAACTATCAAG 

Sequence 2613 

CCGCGGTGGCGGCCGTGGGACCTGCMTNACTACGCAMCNCTTACAGCTTTTGGCTCGC 

CACCATAGAGAGGAGCGAGATGTTCAAGAAGCCTACGCCGTCCACCTTGAAGGCAGGGGA 

GCTGCNCACGCACGTCACNCCGCTGCCAAGTNTGTATGAGAAGAACATNTNGAAANCCCT 

GGANTN NCTANATGTNAACAACA7TGGTGCNTACTTTCTTCTTTC I I I I I 1 1 GTNTAACC 

AGCAAAC NAAACCCTTAAN AAATTATAN NCGNTGTGTTACCCTCGGCTCGACTTCTAG N A 

ACCTANGTGNGA 

Sequence 2614 

GAGCTCACCGCGGTGGCATGCGGCCGCCCGGGCATGGTACTCCTGCCAAGGCANGCTCCC 

ACCGCTATGGGCACAAGGAAGTTGTCGTCATCTGCACTGCANNCTTGAGCTGTAGAATNC 

T 

Sequence 2615 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTGGTGGCCAATT 

GATTTGATGGTAAGGGAGGGATCGNTNACCTGATNTGTTATATNAANGANTCAACGGGCT 

GAACAAGGAACATCAGCATCCTTTATGAAATGGATCCCCAACAATNNCAAGACANCCGTC 

TGTTAC ATN NCACCTN N NTNGGCCTN AA>a AATGGCAGTCACCTTCATTGGCAATAG CACA 

GCCATCCATGAGCTNTTNAAGCGCATCTOGGATCANTTCACTGCCATGTTCCNCCGGAAG 

GCCTTCCTCCACTGGTACCT 

Sequence 2616 

CGACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCCCGCCCGGGCAGGTACGCGGGGT 
CCCAACCAAGCCCTCCAGCAAGGATTCAGAGTGCCCCTCCGGCCTCGCCATGAGGCTCTT 
CCCGTCGCTCCCGGTCCTGGTGGTGGTTCTGTCGATCGTCTTGGAAGGGAGTGGTTTTCA 
NAGACATTTCAGAAAGTGAAGGAGAAACTCAAGATTGACTCATGAGGACCTGAAGGGTGA 
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CATCCCAGGAGGGGCCTCTGAAATTTCCCACACCCCAGCGCCTGTGCTGAGGACTCCCTC 

CATGTGGCCCCAGGTGCCACCAATAAAAATCCTACCANAAAAAAAAAAAAAAAAAAAAAA 

AAANGTACCTCGGCCGCTCTANAACTAGGTG 

Sequence 261 7 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC1 I M J 1 1 I M M I J I f 
iTTTTTTTTTl 1T NGC 1 I I M I I I i U I 1 I M I 1 1 I I I I M I I M 1 H I II I ANCAAAAA 
NAGGGNNGMGTTTNTTTNTTNTANCCCCNTO 
AAAAANAAMCAAAAAGCCTTTGAATTGTTTTTC 

C NTTTTTTTTTN NC AAATTNTACTG N TTCCAMCN TTTTG G AAAN AAAN AACTG GAA7TT 
TGNNGGNCNCTTGCNCTGGTTGNCNANATTNNAACAAGAGGMCCCNTTTTGGGGTTAAA 
TTTTTT 
Sequence 2618 

CCGCGGTGGCGGCCGAGGTACCTGTGTCAGGAGTCCCCAAGCCCACTCCCAAAATAGAAA 

ATTCACTAAAAGGAGTCAMGGATTCAGCATACAGTMTATTAACAACTMGATTTATTC 

CAGCAGGATACAAAGCAAAATAAGAAAGGAAAAGTCGCATGGGGCAAAATCAGGTGAAAG 

CTTCCAAGAGTCCTCTCCCAATGGAGTTACACAGGATGTGCTTAATACCTC CAGCAACCA 

GTTGTGACAATACATGCAAAGAGTGCAAAGTCTTGTCCACGACGGATGTTCl rTTTTTTI 

TTTTTTGAGAAAGCCTTGCTCTGTCACCCAGGCTGGGAGTGTGGCACCGATCTGGGTGAA 

GAGCAAGACTCCCGTCTCAAAAAAACAAAAGAAAAGGAAAAGTTCCAGTCCTTGTC 

Sequence 26 1 9 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCG GGCA GGTACTTTTT 
f I I I M I I 1 I I I I n I I I I I H I I H I A CAGATGCTTTCAGTTTAATAGTTGTTTTAAAT 
rTTTTTTTTTTTTTTTn l AGGNTTTAGGCTGAAAACNTAAAGCCT GCTGCTC CTGGGTN 
TNATTTCTCAGCTTNTCCTTACAGCTNTAGTACATANAATACTCCTTTTTTTAAGTCTAA 
NTAGCTTCANACCAAATCGGTCTGACCCAAAATACCTGCTNNTNTGGANAGNGGGATGGG 
GAGAAGGTAACTAGAAAATGCAAGATCNTNTTTACCCCAACAGCATAAAACCNCCCTAGC 
TGGGGCTNTGGCTGGGAGGGAMTGCTAGTCATTTTTTCC 
Sequence 2620 

ATTGGAGCTCCACCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGAAGAAGGGACAAAA 

CTMGGGTAAGCGAACTTTGGTTCTAGGAATGGCAAGACCAGCAAGAAGATCACCATTGC 

CAACTGTAGCCTTTACACMTGTCATAGTAGCCCAAATTCAGTCAGCTATTGAATTAAGT 

TTATTGGCTTCCTTTNNCNAGNTAAAAGMTGTATGAATGCT TGTC TTTAGAAGATAACA 

TCATATGGAGGTTCTACAACTTCTTAGACACAATTGTCAAGCTTTTAGTAAAATTATACT 

AGACATTCAAGAGATGGGATTGAAAACCAAGAGAAAGGGCAGACAATAGAAATATACTCA 

CAGATGATTCAGGCAGTGGAGTTAAGACCAGGACTTGAAATAACAATGATAATTGTTCAA 

G AAAATA 3 AAG AC AGGGAAAAAAAAAAAA 

Sequence 262 1 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCA GGTAC fTTT TTTn 

I 1 1 II II I I H II > CAGGAACTGTATAATTTTGGATATTGAACTGTTC TAC M M u u i 

TAMl'TTTATTCCCCCTATCTTTGCCTTGTTCTTCTAAGTCTGAAAATT^^ 

TTCTT TTC AAN GTTGN N GTCTAAACTTNTAATN N NAGGTTTAAAGACCTGCTAATATGTA 

TTTAATTTCCAGGTTTTTTNAGTTTTCTATTTCTTTTCACATAACG 

AGTC AT ATAAT ATCCTG N GTTTCTG AATATTAATCTTC ATTNGTTTrG AAC A \ liilin 

NTTACCANATGMGAGGCTGATTGATTNGGAGTTTTCCTGGACAGGGATATNCATGAGNC 

CTTTTT 
Sequence 2622 

AGGTACGCGGGAGAGTTCTGCCTCGCTTCCCGGCGCGGTCGCAGCCCTCAGCCCACTTAG 

GATAATGGCGACAGCTGAGGTACATTGTGATACAACTTCTTCACAGATGAAGGCCAGCAG 

AGACAGCAACAGCTGCAAGCCCTTGAGAACGCGCCGGAGCAATGGGAGCCGGCCCATGAA 
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^G T CTCTACCTGCCCG CGCTCCGGG 
Sequence 2623 

^7 A ^ GCGGGAGAG ^ CTGCCTCGC7TCCCGGC GCGGTCGCAGCCCTCAGCCCACTTAG 

GATAATGGCGACAGCTGAGGTACATTGTGATACAACTTCTTCACAGATGAAGGCCAGCAG 

AGACAGC A ^ CAG ^ 

GTOGgS^ 
Sequence 2624 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGATTCATGTTGTTTCTAATTnTT 

ATATATAATTGACAAMTGMGAMCTTMCACCATCCTAGATTTTAGCTGCCCAAAGAA 

TGAAAAGAATGAAAAAAAMTCTTTGAAAACCCACAAGTGATATGGATCTAATTTATGGT 

TA^TAGATATAGATMCAAACAGAATACGCCTGTTTAAAACTGTTAAAATGACATTGGT 

TCTMnATACTmATTTAAATTGAAAGACAAGGCATTTATATGGTATCTCTAACCATC 

ACAACTTTTGTGTGACAAAAAGAAATTATCACCAAAATACACCTCCTTAAGTAAGTGTCT 

GATTrCACACTTCCAGAAAAAGTGCTCTTTCTGGTCAAGCCAGCAAGAATTGAGAA 
Sequence 2625 

GCGCGG J GGCGGCCGAGGTACGC ^ 

CACTCACGGCCCAMTCTTCTTGATCATATGTTTAAATATATTTTTTAATATTTGGAGCA 

TGAGTTGTCACmTTGmGCCrrTTTTATAAGGAAATGTTGGAGAGTTACATCATTGC 

TAATGTAGAAAT GTTAAGT GGGAAAAATATACAGTTTGGTAAAATAAACTAGATTCTACA 

™TTTGTGGGI I I 1 1 I rCCCCTCCTTTCTTTCCACAGCACTTTTGATATCAAGCAAGT 

GGCTTCCTTTTTGAGATATTAAAAAAAAAMGAAAAGGAAAAMGTAAATGAAGCCCAAC 

TACCTAACCCTTTCTTATTTGTATTTGTTTTAGTATTGTGAAG'n-GTGTTAAATAGTACC 
TGCCCG 

Sequence 2626 
GCG 5^1 TGGCGG ^^ 

TCTTCCGTCAAACGGAAGACACAAGGCACCTGAGGCTGATCCTTCCCAGAAATTAACTCC 

AGAGGCTGCCACATCTGGTATGAGCGTCCAMCCCAGCATCGACMTGTAGTTCCTGCCA 

TCAATGGTCACCTGCAGGAGAAGGTGAATCATGCCAGTGCTGTATTTTTTGGCTGGAGTG 

CTGTAAACATACCCTCCCAACATCGTGGTCTCAAAACCAATAGTGGTCAGAGCCCAGTAC 

AGAAGATGATTGACCTGGAGACACCATCCACCCCGATTTC7TCTCACAACTTGATCAAAA 

ATGGCCTCTAAGCCTAAGTCCATGGCATCCCCACAATGGGATGTTAAGGTTCTCAAAGGG 

AACAGCTTCGGATCTGGTGTTGAAGAATGTCAGTTAATGTT 

Sequence 2627 

ACTAC ^ AGGG C GM ^ GGA ^ 

i • i >• i i i i i i lAAl I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | , , , | | , , j NCACTCAGAAGGGGNCTT 

I A ,!3? ATAC ^ G ^ GACAmCCCATGGGGMTGG ^ NMGGG AGGGGCAGGGA3GGCTN 

ANCGGNAACCCACGGNTTGGTTTTCCCTCCATCCCNNGGGGAAAAAAAACTCTGGTAGGG 

NCATNCTAANNAAANGTGANCAGGGGANCCATCANTATGTGGGGCTGAAATNCACTGGNC 

AGTGACTTNAGGCMCCNGAGTCCTTTTGTCAGGAAATGCAATTTNTAAGNGGAAAAAAA 
ANCAAGNCCCAMGAAA 'vwww 

Sequence 2628 

CCGCGGTGGCGGCCGAGGTACTCCGAAACAAGTAGAAMGTGCTGmGAGGGATTTTAT 
TAAATCi him I AATGGAATGTGGTACAGTTAGCTGTCACTCAGCTGACACCATGATGT 
GGCAGCAGAGAGGGAMCCTACAAGTGGTTTGCCTCATTGCCTTTGCCACATCTGAAGTT 

CTCAGCAGCACTACCTTAGACTTCATGAGCTMTAGGAAACTTTTTATGGTGTAAATGCT 

GTAAGACTTTGTACCTGCCCG 

Sequence 2629 
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GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACi I l I u u m m i m 
I 1 1 1 I 1 1 11 1 1 AGTAGAAGCAGGGTTTTGCCATGTTGCCCCTTGCCAGAATCAATTCTAA 
TTATAGGAAAAAAGTTTCTGAGTAATCAAAATTTTGC^ 

GTTTAGCTTAGGAAACTTGAACTCTTACTGAGTTGGGTCTGCCCATGTANGATGCCTGGT 

GTTGGTGTGTGCGCTCTCTTGGTTATAGAATAATCCACCCGAATTCTTCTACCATCCAGC 

TCCATTCCATTTGCCCTTTCCATAGCCTATAAAGAACAGGCACAAAAAGTCACCGTATAA 

TTCACCAGTCTTGMGTMTCATTACTTAAAATTAAGCNCAAAGTNTATGTAATCCAAAT 

AMCTCCTGATTTTGAAAATGACAGCAATGTAGCTTGTACAGGGCATTTGTAT 

Sequence 2630 

CCGCGGTGGCGGCCGAGGTACTTCCTCACAGTTCTCACATATGGAAAGGATACACACTTT 

GTAGAAACMGMCTTTATGTTATCCAAGTTCTAGGATAGCCATGAGCTCCA^TTATCTC 

AGAGCTCTGAGTCCTCTACTCAATACCCATTGAGATTTATGTGTTCTGAGGCTTTTGTCT 

TCTAGCTACTTACTTCATTCTCCATGGGTAACGGTCATTCATCCACA7TAACTAATTTCC 

TCACTCCAAGCTCTTTTCTAGAGATAATCTCCAGTCCCTGTGCAGAAACTGTCATTGCAC 

TTTCTGCTGAAATGGCAGTTTCTTCTCAGCAAGGTGAGATTATGGAATCCAGAATCTTTT 

TTCAGGGGTCACATGCCCATTTCCCCACTTGCATGAATGTCGACACTGCAGCCACAGTTT 

TGGCCGTAAATGTGAATTTGGCAAGTAACCA 

Sequence 2631 

CCGCGGTGGCGGCCGAGGTACl I I I I I 1 M I I I I I I I M I GTTGCTTAAAACAACAGCCA 

TTTTATTTAATATCTCATGATTTTGTGGTCAGGAATGTGGACAGAGCTCAGCTGGGAGAT 

TGTTCCACTCCATGTAGCATCAGAAGACAGGTCTNTTGGTGATATTCAGCTGGAGAATAA 

GTTMTATGCAGAGTTCAAGATGGTTTTCACTCACATTGACACCTTTGGCAGGGATGGCT 

GGAGGGCTGGGCTCAGCCAGATCAGTTAACTGGCGTGCCTGCATATGCACTCTCCACCAT 

GGTGGGTCTCTAGAATTGACTCAAATTCTAAGAATTTGGCGATTGTATCCCGTCAGTCCA 

AGCAAGCTGGTTTACCGGGAACCACCCNAGNCTGCAAGGGCCAGGGCANCCGTGCCTGCA 

AGGCCGTANCATATTGTTTGCTG 
Sequence 2632 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTCAGTCTCAATTCTG 
GAACTCTAAGAAAAAACGTTCCAGACTTCTACGCAGCTGCCACCTCGGAGGACGGGAGAG 
CGGGGGACGAGGAAAGATCAGGATAAGACCCTAACTCCCACAAATGACTCCAGGAAAGGG 
AGACCACCACTCCCCGAGCTTTGGAGCGCCGCGCGGGACCCGCGTACCTGCCCG 

Sequence 2633 

ACTNCTTAGGGCG AATTGGAGCTCCCCGC GGTGGCGGCCG AGGTAC I rTTTTTTTTTTTl 

| || | | I m 1 1 1 i GGGAGGCTGGAGTGCAGTGGCGCAATCTCTGCTAACTGTAACCTCTG 

CTTCCCAGGCTCAAAGGATCTTCCCACCTNTCCTTTGTTTAGCT GGGA CTACAGTAGGGC 

ACCACCATGTCCAGCTMTTTTCGTTTTT17ANTAAAGATGGGGGTTTTGCCATGTTCCT 

CAGACTGGTATCAAACTCCTGGCCTCAAATGATCTTCCTACTTAGGCTGGCCAAAGTGCT 

GGGATTACAGGCATQAGCCACCATGCCTGAACTTTAAAAAGCATTTCTACTATATGTTTT 

TCTTTTTGATTACCAAAAAACTTNTATTA/^AMTCTTTTAAAGTTCATTCAGCAGAANM 

GGAATTAGTCAACTTTAANACATGTTAT^TAAAATTAGCCACTTGGGGAAACAA 

Sequence 2634 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAAACGGAGTCCCTGAAATAACAGAT 

GCAGCCACAGATCAGGGCCCVGCAGAAAGCCCACCCACTTCCCCTTCATCAGCCTCTCGG 

GGTATGCTGTCTGCCATCACCAATGTGGTTCAAAACACAGGTAAAAGTGTCTTAACTGGA 

GGCCTTGATGCGTTGGAATTCATCGGCAAGAAAACCATGAATGTCCTTGCAGAAAGTGAC 

CCGGGCTTTAAGCGGACCAAGACGCTCATGGAGAGMCTGTTTCCTTGTCTCAGATGTTA 

AGGGAAGCTAAGGAGAAGGAGAAGCAGAGACTGGCACAGCAGCTCACGATGGAGAGAACC 

GCGCACTACGGGATGCTGTTTGATGAATATCAAGGCTTGTCACACCTGGAAGCCCTGGAA 

ATTCTGTCCAATGAAAGCCGAAAGCAAGGT 



Page 391 of 635 



WO 01/51628 PCT/US01/00798 

Table 1 

Sequence 2635 

m?m^ G J A £' " " " 1 " " 1 1 1 " " " I H 1 1 1 1 1 1 H GN I I I II 1 1 1 1 1 1 I GGGA 
^^™^ NTMGGMNATCCCCTCGGGGGGA G C TCCATGGTGTGCATmGGCTTT 
m^?I» TAMGGGGGAAACMTAGG GAAAGACrTGGNGCAAAAGGGAAGGAACANAAAC 
NGACAGATGANAAAAGGAAGATGACANCATTGCAAGTGTTTNTTTAAATCCTNGCCANGC 
CAGGCTTTGTTTAGCTACCCTGGTCTNTTNCTGCATACCCCAACCCCCGTTTCCTGGGGA 

ATGGGGAGCAAAAAACAGGATCANAGCATGATCTGAAAAAAACACTGAACCTTCTCCCTG 

CTGGAACCCCAAACCCACCCCGGGT 

Sequence 2636 

CCGCGGTGGCGGCCGAGGTACGCGGGATATACAGAAATGAAATGACACTTCTGGGAGGCA 

GTAGAAGCAGuAAGTCAATGAATTGAGTAGAGGGTCCCATTCCCTCAGGCTGTCATTGAT 

CAGTGACAATTTATAAAAACAAACTGCAAAGTCTGTGGCAAGTGGCTGCCTGCTTCCTAN 

AAAGGAGCCCATGAAAGGTTAAACTCTGTGGTCGGTATTTGCAAGCGCCGGGCGTGGTGG 

CTCACGCCTGTAATCCTAGCACCTTGGGAGGCCAAGGCAGGCGGATCACCCGAGGTCAGG 

AGTTTGAGGATTTGCAAGCAAMGGTCCTCTCCTGAGTCTTTCCCA 

Sequence 2637 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAACAAATGCAGTAGC 

CAACAAGATTACCATGCAATCATTAAGGAGAACCAAAGTAAGAGAGCCACTCAAACCAGA 

TTTTGAACGCTACTAAAATTAAAGTAGTTCTTTGATGAATATGAATGAGTAGGGAAAGGA 

TTCTTTGTAATAGTGATACCTCTGGGGNTAANNAGAAGGGNGGTATGTGAGTTTTAGTCT 

ACAGATTATGGCAAATTCAGTGACAACAATCAAATGGTCTAAGATTGACAGTGGCACAGT 

TTTACTCTGTGAAGGTAATGTTCAGGACAMTTTCAAGAAAACTAGAAAACCATTCTTTA 

CAGCTGAAATCTTTCCCTAACCATTGTTATTTCCACTTTTAAGTCCTCAAGAGATGAGAA 

MGGGAGGTAAGGCTTCCTTATACATTTCCTGCACAATGAAACATTTTTCCTCCTCCAGG 
CAAAGATTCAAG 

Sequence 2638 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTATG 

AGCGCCGGGACCTGTGACAGGGCTGGTAGCAGCGCAGAGGAAAGGCGGCTTTTAGCCAGG 

TATTTCAGTGTCTGTAGACAAGATGGMTCATCTCCATTTAATAGACGGCAATGGACCTC 

ACTATCATTGAGGGTAACAAGCCAAAAAAACTTTNTCTTGTCAACANGAACAAGTCATCG 

GCTATTGTGGAAATATTCTCCAAGTACCT 

Sequence 2639 

GGCGAA^GANCTCCCCGCGGTGGCGGCCGAGGTACANAGTGCTmCTGTTTAGTTTT 

TAC. i i i 1 1 iGTTTTGTTTTTTTAAAGATGAAATAAAGACCCAGGGGGAGAATGGGTGTT 

GTATGGGGAGGCAAGTGTGGGGGGTCCTTCTCCACACCCACTTTGTCCATTTGCAAATAT 

ATTTTGGAAAACNGCTCNTTTTAATTCTGATrGATCAGCCAAAACNGTCCCTGCCCGGGC 
GGCCGCTCTANAACTA 

Sequence 2640 

ATCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTG 

MGGTGAGAGAAAAGCTGCCMGCTGCATTTTAGTCAG7TGGGCAACAGTGAAAATT7TC 

TTTTAGAGGTGGAAAAAAGCCTCTCTCTAGTGACCTGGGTAGTTTGCACGGTTTGGCTGG 

AAACCACAGTCCCCCCATCTCTGCCAGAACCCCCCATGTGGCCACTGTCCTCAGACAGCT 

CCTGGAGCTTGTGGATAAGCACTGGAATGGCTCCGGCTCCCTCCTCCTCAACAAGAAGTT 

TCTCGGAMGTTTGMGCAAAAACTGGTCAGAGTGCTGGAGGAAAACCTCATTTTGTCAG 

AAAAAATTCAACAGTTGGAGGAAGGTGCTGCCATCTCAATTGTGAGTGGGCAACAGTCAC 

ATACTTATGATGATCTTCTGCACAAAAACCAACAGCTGACCATGCANGTGGCTTGCC 
Sequence 2641 

CGAATTGGTTrTCCCCGNGGTGGCGGCCGCCCGGGCAGGTACCAAAGGTGCAATTTATGA 
CTCCAGAAGGANTTCAAAACACATCGGAATCATGACAGCAAGATAACCCCAAGACAGTCT 
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TTTATCAAGTTTTGACGGTTTATGCTGGGCACTGCCAGACTGAACCAGAGCTGGAACACA 

ATMTGTTCGAGCGGNCCGCCCGGCCAGGTACTNNNNNT7TTTNTTTNTTNTTTGGATAA 

TACTGTTTATTGTCCACGGACCGAGCGATTGCTCAAGAATGATNCAAAGCATCAGAGACA 

TGCNCAGTCTGCTTGTCAACTTTCAACAACTCTTGTGTGTTCCCAGCACCGGAAGGCCTC 

AGACTTCATATTCCAGCATAAACACAGTGCTCCCCTCCCCCAAAANTCCAAACGGAAACA 

AAATCAACAGAGCTT 

Sequence 2642 

CTTACTTCACTATAGGGGGCCGAAATTGGGTAGCTCCACCGCGGGTGGGGCGGGCCGCCC 

GGGGCAGGGGTTACGCGGGGGGGGGGAATTTGGGGTNGGGCCATTACCCCACTTCCAAGC 

AACAACCAAGGAAAGCCATTGAAGGGTTTTCAATGGMCTTCTTCCTTCmCCCTTGGA 

ACANGGCTTCTINGGGCANGGGAAGCCCCTNGGGTTCTTGGTNCCTTANTGGAATCCNAG 

AAANGGGCCCGNCTTTTTTGNCCCCCAAAGGGAANTCGGGGGGGGAAMCCCCTTTNGGC 

CNTTGAAAACCNNTTAAACCMNCCTTTAAAAAAAGNGGGGAAAAAATTTNCCNNG^ 

NAAAAAAANACCCCAMGGGGTTTTTAACCCCCAAAACCACNNGGGNACCCCAAAAAGAC 

CCCCMGGGGAAAAGCCACNAMGAMTTCTCCAAGCCC^ 

AAGGGGCCCNTTATTAAACNGNGGAGGGCCAAAAAAAAAAAAAAGGCCNTTGTTGGNGGG 

GGGGGGGGAACCTTTTNGGGGAAAAAAAACCTTTTTGGGGAAAA 

Sequence 2643 

NACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGAAGCCAG 

GGCAGTCTCCACAGCTCCTGATCTACTTGGCTTCTAATCGGGCCCCCGGGGTCCCTGACG 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnCTGCATGCAAACCTCTTACAAACTTGCGGGATTCGGCGGAG 

GGGACCAAGGTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCA 

TCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTAT 

CCCAGAGAGGCCAAAGTACCTGCCCG 

Sequence 2644 

AACAC1 1 I I I AGGGCGAATTGGAGCTTACCGCGGTGGCGGCCGAGGTACGCGGGGACACC 
CTAGATCCCAAGATCTCCAAGGATTTGGTGGCATACCCACTCCAGCACACAGAAGCATGA 
GGTTCATGACTCTCCTCTT 
Sequence 2645 

AGGTAC 1 1 1 1 1 M 1 11 1 11 1 I I 1 1 1 1 1 I 1 1 1 1 GTTTTATAACCAGTGCTTTTGGTAACTA 

ACAGAATGAGAACTCACTTATTACAGTGAAGATGGCACTGAAACAATACATGAGGGATCT 

GCCTCCATGACCCAATCACTTCCCATTAGGCCTCACCTCCAAAATTGGGACTCAAATTAC 

AACATGAGGTTTGAGGGGACAGACATTCAAATTATAGTAATTACGAAGTAAA/XTGAGCAA 

CTGTTTTCTTTTGCTATCCAATCATGATGGATTGGATATGTGTGATGAAGGAtvlAGGGAGG 

TATGAAGAATGACTTTNANATGTGTMCTTATAAAACTGGATACTATTCACTGAGAGAGG 

AAACACTG AAATN G ACCAGAATTG AAGATG ATG AGTTCC 

Sequence 2646 

CCGCGGTGGCGGCTCGCCCGGGCAGGTACATAGACCCAGCCCAAGGTTGGCCTTTGGGGT 

CTGATAGCATTCACAGAGCAGGAGATTCACCCTTGGAAGCAATTGAATGTGTAAACTCTG 

AGAGCAGACACTGACAATGCAATGCCTGCATACCTGACTCCTGGGCTGCCTTCCTTCTCA 

GCAGAGCCCACTGAAAGTCTCACGTGTGCCTTTAACCAATGAGGATGGCAGGCCTTATCT 

GTGTTTGAAGTGGACAGGTCATTGCTACCCCTGTTGGAGAAGCTATGTAAGAAGGAGTAG 

GCTGGCTTCTGGTCTAGTTCGACTCATTCTTTATAGGGACATTTCANAATGATGAGTTGG 

CTAAAATTTACTTTAMGGGGAMTGTAAMTTACCTCTTCTTTTCCT^ 

CTTTCCCATCCTGAGCCCTCTGGCAAGGCCTGTTGCAGCTCAAGAAG 1 1 I I IGGTCAGAT 

TACAAAGTTACCAMTGGTCAACAGGGAAGGACCCACAAGCAGATTCCAAAMCTCTTTC 

ATTCACTTGAAGTTAAACTGAAAATGGC 

Sequence 2647 
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CACCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGGGCCTCACAGATGATGACTTTCTT 
CATCTTCTTGCTCTTTTTCCCATCCTTCACCGGGGTCTTGTGCACCCTGGCCATCACCAT 
CTGGAGATTGAAGCCTTCANCTGACTGTGGCCCTTTTCGAGGTCTGCCTCTCTTCATTCA 
CTCCATCTACAGCTGGATCGACACCCTAAGTACCTNGGCCCGCTCTAGAACTA 
Sequence 2648 

CGCCCGGGCAGGTACACATGGGGTTTCACCATGTTGGTCAGGATGGTCTCGAACTTCAGA 

CCTCAGGTGATCCGCCCACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACC 

NNACCCAGCCAGTTMTTTTTCTATTAACTACAGACCTANTTAANATTGAGGCAAAANAA 

ATG GGTC CTTGGGATTTG AAAATTACTATNCANTTTG GAAGTTN AATTTGCAAC ATANAT 

TGTCTGTTATTAAATTACTAGATATAATATCNCATAGGTGGAAAGAMGGTTGCTTAATT 

AAAGATCTAAGTTACTAGTCATGGTGTCAGATATNGANAATGATTGAANGTTATNAAGAN 

TCNCACACCAG ATGAGT AAATTGNTGTTTTCNNGAAGAAGTTACATNAANGTNACCGGAG 

TATATTTCAGCATTTTTTGNTANATTAAAA 

AATACTCTTAACNMNTTTACCGGGTAAMTGACCAMTTCCAGGTTTAATTTTCACATG 
TTAACACCCTCCTTGAGCCCCTTTA I I I I I I I AAAGCTCTTNAATNTATTTTGGNCCTGA 
AATTAACCNTTTCTTrTAAGGAAGMTTTNAAAAATTTTTT^ 
CNTGGAANAGGCCCCTGGTTATNGTTTTTCAAATT 
Sequence 2649 

GGCGAATNGGAGCTCCACCCGCGGTGGCGGCCCGCCCGGGCAGGTAC 1 1 I I i I I I I NTNT 
TTTGTTNTNTTTTTTGGAGGACTGGGAMTAGAGGACTGGNMTTTGAGCAAATTNTGAG 
GMTCTTAAATTGCTATTAGGACACCAATATGACGCATCMTNMTCATTATTT 
Sequence 2650 

GNAGCTCCACCGCGGTGGCGGCCCGAGGTACGCGGGGAGTCTTGGTTTTCTGCTAGTGCT 

GCTGCTGGGAGGACGACGGACGGCAGCGGCCAAGCAAGAAGAAAGACGTGGCAGCAAGCG 

GGAGTCGGGGATAGTGTCATCGGTTCGGTAAAGTTTCTGTTCTCTGATTTGGGGATATTC 

CCTGCCCAAAAAAAAAAAAAAAAAAAAAAANGTCCCTGCCCG 

Sequence 2651 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGT 

ATGATTGTATATGTGTTACTCCGATATGTAATCCATTTCACTGGCTGAGTTTGGCCCCTA 

GCCATGTGTTAATATAAAGTAGGCATGGCTTCCCAATGGAAATCTCTGAGAATGACAGTG 

GAGTTGTGCAAGCATTTTACATTGCCACATAAATCGAATTTGGCCATTTTATGGTTAAA^ 

ACGGCACATTTAGGCAGTTGAATATGACCGTTACCTTGCAGACTAAAAGGTTGAAGGCCC 

GAMCTMC7TTTAGCTAACMTMGGGCTGTGCCCCMTGGAMCTGAGTTTCATTTTT 

CTGAGAMGGTTTGGATTGACNTGAAATATTTTCCTCTACAGGTCMGGGACTTTGGCAT 

GTGGTGGCTGAAAACTGGAGC 1 1 i ll! I GTGT 

Sequence 2652 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATG7TT 
GCCACTTTGCAAAGGAGCTCACTGTGGTGTCTGTGTTCCAACCACTGAATCTGGACCCCA 
TCTGCGAATAAGCCATTCTGACTCATATCCCCTATTTAACAGGGTCTCTAGTGCTGTGAA 
AAAAAAAATGCTGAACATTGCATATAACTTATATTTGTAAGAAAATACTGTACCTGCCCG 

Sequence 2653 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCGAAGATGGGCTTGT 
ATCTGGTTTCGGACGGACTGTTAATGACAATTTGATCGACGGGAATTGCACACCCCANAA 
TCCACCACAAMGAAAAAGGTTACAAATTTAACMTTTATAGTCCTTTTAATAGl I I I 1 1 
TTTTTTTCATMTACTACTGAGGGGMTTGGTNAGAATGTATNNATGTAAGGCNTTCTTA 
ATTTAAGTTATTAAAGTTTACATTTTTAATAl I I I I AAACCTTTTGTAAATGCTTGGCTT 
MTTAGAAAATGTTTACAGAAAAGTAAAAAAATTCTAGTAATATGGGAAATCCTTGTAAG 
CAG CATGGTTTCAG AAAAATCTC AAGATGATTTATTTCACCAAATTGAGTN I I M I I lAA 
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AACTAGGNAACTTCCCCAACCAAAAACACANGACNTTGAATAATATTTTTGTGTTATT 
Sequence 2654 

ATTGTTATTCC G NTTTACAAATATN ACN AC ATN N ATATN N AGCCC GGG AGCC ATAAAGGN 

GTAAAAGCCTTGGGGTGCNCTMTGAGTGGAGCTAACTCACATTAATTTGCGGTTGCGCT 

CACTTGCCCGCTTTCCAGTCGGGGAAACCTGATCTGTGCCAGCTTGCATTAAATnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnNGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnNTCGNTTGCGCTCGGTCGTTTCGGCTGCGGNGAGCGGTATCAG 

Sequence 2655 

CCGGGCAGGTACTCTN 1 1 11 III 1 1 NTTTTNTTTAGGCAGNTNAAATCTAGGATGGTGTT 

MCCTTGTCTTCATTTNGCCAATNATATNTNAAAAATNANAAACNACATTGAATCTGCAT 

TTCTTGGAN N AGAN NGTN N ATAACAAACTATTTCTCAATATTNNG ATACTAN AAACTAG N 

GAAGGNGNCATGGGNTCCAGCNTCTTATCTNTNATNTGAACATGGGTTTCTGAANGGAGC 

CTATATNATAATATAAATG GTNTG N N N AAAATGAGGCACTAGCNTTGN CTG N N ACTG CTA 

TAAAAAAATNACCA 

Sequence 2656 

CCGGGCAGGTAC I 1 I ( t I 1 I I I I I I I I I I I I I I I AGGCAGCTAAAATCTAGGATGGTGTT 

MGTTTCTTCATTTTGTCAATTATATATAAAAMTTAGAAACAACATGAATCTGCATTTC 

TTGGATGAGATAGTTAATAACAMCTATTTCTCAATATTTGTATACTAAAAACTAGTGAA 

GGTGTTATAGAGTTTCAGTATCTTATCTCTTATTTGAACATGGGTTTCTGAAAGGAGCCT 

ATATAATAATATAAATGGTATGTAGTAAATGAGGCACTGTCTTGGCTGGGACTGCTATAA 

AAAAATTACCATAGACCATTGACTAAACCACAAACATATACTTCTCACAGTTCTGGAAGT 

0 

Sequence 2657 

CGCMGGAAAGAACATGTGAGCAAAAAGGCCAAGCANAAGGCCANGAACCCGTNAAAAAG 
GCCCCGTTGCTTGGCG I I I I iCCATAAGGCTTCCGGCCCCCCTGGACCGAGCATTCACAA 
AAAAATCGACCGCTCAAAGTCAAGAAGGNNGGCGAAAACCCGGACAGGGACTATAAAAGA 
TACCAGGCCG 1 1 i iCCCCCTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGACCCTGGC 
CGCTTACCCGGATACCTGTCCCGCCTTTCTCCCTTCNGGAAAGCCGTNGGCCGCTTTCTT 
ATAAGCTCAACGCTTGTAGGTATCTTCAAATTCGGTGGTAAGGTCGNTTCGCTCCCAAGC 
TGGGCCTGTGTGCACCGAACCCCCC 
Sequence 2658 

CCGCGGTGGCGTCTACCTCATTGTCAGCCTCCTGCTCCCCATTTTCCTCATTAGCATTCC 

CGTTAGCAGGGGCGTCTCTTCCATTTTCTGCCTCTTCCACAACTTCCTTCTTCTCCTTTA 

AGTCCTTGGTGGTGATTTCGGAGCTGGTGTCTACGGCTGCGTCTGACCCCGCGTACCTCG 

GCCGCTCTAAGAACTAAGTGGATCCCCCCGGGCTGCAGGAAATTCGATATCAAAGCTTAT 

CGATACCGTCGACCCTCGAG 

Sequence 2659 

CCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGAGAGTTAGTTGATTTCAGTAGTTC 

AACAATGTCAGTATAAACCCATATTTTGCTTAGCCATATATTGCTCAATTCTGCCATCCT 

TGGGACTGGCTTCATCCTCAGG CTAGTAGC AAAATGGTTG CGGTG ATTCCAGGC ATC ATA 

TCATATCCATAAATAACAACTTCCAAAAGAAGAGGAAAAGTATCTCTTCCTGTGCCACTG 

TTTTTGGATAGAGGAAACmCTACAAAMCCTCTAGCAAACTTCTTTCC 

GCCAGAATTGGGTCATAGGCTAGTTCCTGAACCAGTCACCANTAAATAACCCTTAGGTTA 

ATCACGTTTCTCAATCTGGAATGGTGTTGGCTTCTACTGAGGTATATGGTTGTATTGGGG 

AGAGATTCTGTTAAAGAGGMGAGGGAGAGAATTGATGCTAGGAAAGTAATTTGGAAAAC 

CATAACTCTCTTCAGTGGTATTGGNTTTNTCAATAAACAGTNTTAGCAAGTTAI ITTTTA 

CTTTTTCAAAA 

Sequence 2660 

TNGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGA T rTTTTTTTTT C TTTTTTT C TTTTTI 
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TTTTTTGAGAGGGACTCTCGCCCTGTCACCCAGGCTGGAGTGCAATGGCACGATATTGGC 

TCACTGCAACCTCTGCCTCCTGGGTTCAAGTGATTTTCCTGCCTCAGCCTCCTGAGTAAC 

AAGGANTAACCAGGCATGNGCCACCACNCCCAGCTAAI I I I I I TGTATTTTTAGTAGAGG 

CAGGATTTCACCATGTTGTTCAGGTTGGTCTCAAGCTCTTGACCTCATGATCCATCTGCC 

TCAGCCTCCCAAAGTGCTCGGATTACAGGCATGAGGCACTGCGCCCAGCTGACTGATTTC 

TTTTGGCTAAAC ACTTTTTAAGTAGATG ATCTA 1 Mil I CCCCCGTTTATGTATGCAATT 

ATCACCAGGAAAATTGTAACTATACTCAATAATATGGGCTTTACAATTGGACAGACACTT 

TATTTACG 

Sequence 2661 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC1 ITTTTTTTTTTTTTTTTT1 

"II I M I I I I ACTCTCCTGTTGCTTTTCTTCTTCTTTCCCTTTCAGGCATCTCCTCGTTTT 

ATTTTCTGTAACTCACANANTGCTGCAAAATAATGTTAGGGCTTTGCAGTTTATTCCTTG 

CNATNTAACTGCNTNAATNTAAAAAAGGGNCCTNACCATTANNNCNCGTTGGGGTTATTC 

AAATMCCCAAGGGAATGTAAAGCGATTTCATATCTCCTTGAATCAGGAACATTAATTAA 

TCATATTGATGAGAAATGCAACCG'I I I I I ATTTACCAGGGCAAGAGGCTGCCAANATCTT 

TCTGAGTTTAANAAATTGCANACTCTGNGGGAGGGGAGGGAAAAATTNATTTAGTTNTTA 

AATTGGAGGGGG 

Sequence 2662 

GCCGAGGCACCI I I I I II I I I II 1 1 I I 1 I I I I I NGGGGGGATGGGTATNTNCCATGTNAT 

ACCAACAGNTCATNAACTCCTGGGCATAANCCAGCCTCCCACCAACATCATTAACTTAAT 

CANAGCACCAAGTGTCACATATGGTAACACANTCACAACTTCCCANGAATTNAMTTTTT 

GGANNAGGTNGTACA I Mill I NTGGGNCCTTTTNTTTAATTCNTTNAACCCTTTACTNN 

NAGGATCTTGGACCCAACAAAANAAAAAAATTGGGCAGCCCMCACTTNTTTTTNCTTCC 

CTTTCCTTTTCCTTTTAA 

Sequence 2663 

CCGCGGTGGCGGCCCGAGGGTACAACAACAAAGTCTGCCCACCTTCTTGTGTGCACCTCT 

GTGATCAGAAGTAGCAATCAGTGATCGGATGCAGATCTCTAATATTTGGAAGACAACATC 

CTTTTTGCCCACCCTGGAGCCCATAGGCTGCATGCAAGTTTCTCCAAGAACTGGGGGATG 

GGTAGCTGCTACCGTTTTAAGAGCTAAAATTAAAATTMTTGTGCTTTTACCTTATACCT 

TCCCCTGGAATTTGCAAGCTTACAGTAGACTCCAGAGTTGCAAAAATAGTTACATTAGGC 

AGATTTTGCCAGTATAATTGTTGTCTAGGAGCACGGCCAGACTCCTGGTGCTTTCTACCC 

TTCCATCTTCCCCAACCGCTAGTTATTGAMCTTACTTTAATACAGGAATGTAAGCAGTT 

TACTTTGTG I I I I II CATTTTCATCACTrATGAATAACTCAGCTAGTGAAAAACTTCTTA 

GAGTAMTCATTTTTCTTTCAAMTMtGTTGACATTGCTCTGTTGAMTCTAGTGTT^ 

TTTCAGAGAAAGTCTGCGGCCAGTTTGATTTTTATTTCTTTTCAGATTACCTGGC 

AGATCTGGGATGCTTTAAATC I I I I I CCTTTTAATCAAATAACCCATGAAAATATCTTGG 

GTGTGCCCGCGTACCTGNCCNGGGCGGGCCGNTCT 

Sequence 2664 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCCGGNCNGGTACGCGGGGGGAGA 
TGCTGNCACCTANGTNACTTGNAG3ACCCTATACNGCAACCTCCTTTGCCAGGAACTATT 
TAT AAAC ATCCTGC ANG AAAATGANTC AAGG AAGCTTTTCTTTTG AG CTATTTACAGCTT 
N TAGCAATTGAGTAAAGTATACTCCTGTGAACAAAATN NTGGAAC ATATN NGTN N NTCTC 
TAACTGATTNCTCCANAATTTGGAACTAGTTCAGNGAANTNNAAGAGACAGGGATATCCC 
AC N AGGTTATGCAAAAC ATCAAGAGAAG ATGAN AGGTC AGANATG G GAAG AAACAAG AAC 
TTTG AC ATGCTTGGTGTNCTTG CC C AAG CTTTG AAGAAGTTT ACAAAGTCTATATGTCAG 
AATACACATTTCCC 
Sequence 2665 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAATGTGT 
GGCAGTCCAAAATTACGAGGAAAATGAGTTCCCTTCATGGGTCACATCAGCAAI M 11 M" 
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TTCCCCTTTTGAGACAGAGTCTTGCTCTGCTGCCCAGGTTGGAGTGCAGTGAGACCCCGT 

CC CAAA AAAAAAAAAAAAATGCAAGTCTGGAAAAAATATAGCTNGGGAGTCAGTAATGCC 

AGTTTTAGCTGAGGGAGTTAAGATCATTTMTCAGTGTTAAGGAAGAAACTCAATAGTAG 

TGACCACCAATACTTAAGGAAAAAGGATAAAGTACCTN 

Sequence 2666 

C CGCGC iT GGCG GCCGCCCGGGCAGGTACTTGAAGTGTGACTAATGTGACCAAGGGAACTG 
AATTTTAAATTTTATTTACTTTAMTTMTTTAAACTC 

GGCTACCACATTGGATACAGCCTTAGATTCCTCCCAGACCAGACAACCTTCTGTGGCCTT 

CCAANGTTGNAACAGCAAACTCACACTGTGGGCAGCCAGGCCTGGAGTGGGGCCCAGCAA 

GCCGCATCCTGGGGTCAGGTCTGGCCAACTAATTATCTACTGTCTATGGTCACATTAGTG 

CTACAATGGCAGCATCGAGCAGTTGCAACAGAGATGGTGTGGCCTGCAAAACTGAAAATA 

TT CATA TTTACTATCTGGCCCTTTACAGAAAAACATGCTGACCCCTGCACTACAGCAATT 

AGTTTTGGAGAGAACATTCCTGATAAAAGCTTACATCTTCTCTCA 

Sequence 2667 

CGACTCACTATAGGGGCGAATTGGGAGCTCCCCGCGGTGGCCCGGCCCGCCCGGGCAGGT 

ACMTCTCTGGCCCTACATTTTCTAAATGTTATGCCACCCCGACCAAGGGGCAACTCCTA 

CAAAGCCAGGCAAAATAATAAAATCATATTTGTCTCTAGTGGAATGGATAACTATGCCTA 

AMCTGTGCCCTTTGAAMGCMCTAGAGAGATAATTTCTGAAGTGTTTGTCCCTACCTG 

AATGTGTGGCAAAATTCTAAACTCCCTGAAGTGTGAAAGTGGTTTCCAAGCCACATGCAC 

ATCCAGTAGTGGTAAAGGGTGAAAATCTAACTGGCTAAGAGGGCTTCATAGCAACATTAA 

CCAAAAAGTGGTTTATGTAGTCTTTGCCTGCTTCATAATTCCCTANGCATTCTATGCTAT 

TCTGCACCT 

Sequence 2668 

CGCCCGGGCAGGTACATCCAAATTGGGCTACGAAAAGTGGCAAGTTTGATTGTAGCATCA 

TG TCTTTG AGTGTCCTATTG G ACTAC AGATTAG AGG ATAATAAAGAAC ATTC ATTTGAG G 

TTTCATTGTTTGCGGAACTTTTCAACGAAATGCTTCAAAGAGATTTTGGTGTC 

ACAMTCATTACTGTCTCTTCCTGAGAAAGAGGACAAAAAAGAAMGGATAAAAAAAGCA 

AAAAAGATGAGAGAAANGATNAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTTAAGAAGTTGAGCTTTTATCTTAGAGGCAGCAGAAGGTTTGGAGCCAAGGAATGAAAT 

GATGAGGCGTCCTTCAGGTAATGAACTTCAGCTGCAGTGTGAAAGGGGCAGGAAGACTGG 

CAGCTGTCAAAACTGGAACAGTCCAGTGAGTATGTNCAGGCCCGGGCTTGGGCAGTGACC 

ANGGCAGGGAGCACCATCAATTTCTGCGGTAGAACCTTCTAGAGGAAGACCANACAGAGG 

GTCAACAAGAGTTGAAAGGAGAAAMTATTGAGGATGANGTCCMGTTTTTAGTTTCAAA 

ANACTAGGCATATGGGTAATNCCGCGTCCTTGGCCGTTNTANAACTAGGGATCCNCGGGC 

CTGAGGAATTCGATATCAAGCTTATTCGATCCCGCCANCCTCAGGGGGG 

Sequence 2669 

TCACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTGAAAGTGACG 

G GGAGAATGGAACCAAGTTGGAAAAC ACTTTGCAG GATATTATCCAGGAG AACTTCCCCA 

ATCTAGCAAGGCAGGCCAACATTCAAATTCAGGAAATACAGAGAACGCTGCAAAGATACT 

CCTTGAGAAGAGCAACTCCAAGACACATAATTGTCAGATTCACCAAAGTTGAAATGAGGA 

AAAAATGTCAAGGGCAGCCAGAGAGAAAGGTCGGGTTACCCACAAAGGGAAGCCCATCAG 

ACTAACAGCTGATGTCTCGGCAGAAACGCTACAAGCCAGAAGAGAGTGGGGGCCAATATT 

CAACATTCTTAAAGAAAAGMTTTTCMCCCAGAATTTCATATCCAGCCAAACTMGCTT 

CATAAGTGAAGGAGAMTAAAATACTTTACAGACAAGCAAATGCTGAGAGATTTTGTCAA 

CAACATTTGAATATTATAACCTGCAGGGGTCTGTCCTGCAGATGGTTGAGACCATAACCA 

GATACTGATATTCAGTGAAAGAGCAGCAAGGATTTCCGAGCTGATTACCAGACCCCAANG 

GAAGATGCTTGTAAAGAGCCCNCCAGCCACAGNCCTGGCTTGCTNGCCCTGGGGGCATTT 

ATTTAGTAA 

Sequence 2670 
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AGGGGCGAAT TGGGAGCCTCCACNCGCGGTGGGCGGCCCGAGGTACI 1 1 M H 1 I M I 1 1 
inn I NTTTTGGG CCTAATTGCTTTGGGCTGAAGACAGAGTGAGGCGATGTACTGTAAG 
GCGAACACCTTNTGCACAGCAAACCCGTNATTCGCTCACCTGTNTGNGGTNAGGCTCTTG 
TAGAGAGTGANCACTCACTANTAGTTTTTAATAATAAACCANGTAACCAAAGTGAGATGA 
AGGAACTA 1 I I TTGGA 
Sequence 2671 

CCGCGGTGGCGGCCGAGGTACAAGGCATTTAGAGAMGCTTTTTGATTCCATTCAATCCT 

GGAMCCMCATGTTGMGGGCCAGATTTTCCACCATTTTATAGGGATGCAMCTTTAGC 

TCAGAGAGGTTAAATGACTTGCCAACAGTCACACAACTAGCACTTAGAGGAACGGGAAGG 

CTAACTCGGAGCTGATGCTGCAGCTTACACTCACTCCCCACTCCCACGCAACACAAACTC 

TTTCTAGGCAGCTTCGAAGGTACCTGCCCGGGCGGCCGCTCGA 

Sequence 2672 

AGGTACAACGCCTGGAACGAGAAGCGCAGGGTCTACGAAGAATAGGGTGAAAAACCTCAG 

AAGGGAAAACTCCAMCCAGTTGGGAGACTTGTGCAAAGGACTTTGCAGATTAAAAAAAA 

AAAAAAAAAAGCCTTTCTTTCTCACAGGCATAAGACACAMT^ 

AC I I M l ACCAACGGTCAGTTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGG 

GAGACCCATCTCTCTTGTGCTCCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGG 

AAMCTCATGCCTTTCCTTmAAAAM^ 

CATCTGAGCTTTATAAGCGCCCGGGAGGAACMTGAGCTTGGTGGACACATTTCATTGCA 

GTGTTGCTCCATTCCTAGCTTGGGAAGCTTCCGCTTAGAGGTCCTGGCGCCTCGGCACAG 

CTGCACGGGCTCTCCTGGGCTTATGGCCGGTCACAGCCTCAGTGGGACTCCCACAGNGGC 

CCCTGTANCCCGGGGCAAGCAGGMCAGGGTCTTTCTGCATCTGTTTTCTGGAGGAACTC 

AAGGTTTGGGTTGCCCAGAAAAAAATGTGCTTNIATTCCCCCCTNGGTT 

Sequence 2673 

CTACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTTT 

u n n m i I I 1 1 I I I ! I IAGGNCGAATGCAGGGTTTTATTGAGTGGTGGAGGTGGTTCT 

CTGAAGGATGGATGCAGAGCTGGAAGCGGGGATGGCACGGGAAGATAATCTTCCTGCTGG 

GTGGCCAAACCTNTCCGACCACCCTCGGCTGAACTCCTCTCGGCGTTCAAATGTTCCTCT 

TCTCTCTTTCTCTGCTGTATCGTTTCACCATTCATCTGCTTGTCTGGGATGGTGAANACA 

ACTGTTTGCTCATTTGCTTGGCGATAGGTGATTGTCTCGCCGCTCAGCTATAGGCGATGG 

TCTCACCACTCGGTGATAGGCGATTGTCCCATCTAGGTCGCCAAAATGTGTCTGGAATTG 

GTGGGTTCTTCGTCTCACTGACTTCAANA 

Sequence 2674 

CCGCGGTGGCGGCCGAGGTACAGGAGACTTTCTGATTTCCAATCTTGGCTCAGGTCAGAA 

GAAAAAGGGGAAAGGTTACATTCCTGGAMGAAMTACAGCCTATTTGA.GGGCATGCCTG 

ACTTTCAGTCCATGCACTAGTCCTCTTGGCTCGTAGAGTTTTCAGTGCCCTGAAACCATG 

ATCCCTCTGCCTCCATGGTCCTCTGAGTGTAGTTATTATGTCTGCAAACCAGTAATGTCA 

CAAGTCAAAGGGTCTGGGGCTTCCCAGGGAACCACTATGACTGAGTATATTCTGATTTGA 

GAAACCTGTGACAAGTCTTCTCAGCACTCGCCTTCTMTTTTTGGAGCCGATGCCTGGGT 

TCCCAGCATCCAGGTCACAGTAGCTGGAAATCATGCAGGAAGGGCAAAAAGCCAGCAGCC 

CAGAATGGACTGTCTCTGCAGCTCTGGCACATTCTCTACACATCGCCACACCACTCAAAT 

GTCCTATAAAATATCAAAACCAATAACCTGGACATGACCCGATTAA^CTGTACCTGCCCG 

GGCCGGCCCGCTCTAGAACTAGTGGGATCCCCCGGGGCTGCAGGAAATTCGATAT 

Sequence 2675 

CCGCGGTGGCGGCCGAGGTACTTCAGTTTCTCTAGATTACCATGTAAGACAGCTCTGTGG 

ATCCTCTTCAGATGATACGGTTTAATGGGGTATTGGGGAAATGCGAAGCCATCCGAGCAC 

AAGCGCTCCATGAGGGTGGGCCACCTCTCCCGCTTGCCGTCTTCCATAATCGTCGGCTGC 

AAATTGTAGCCTGCAGCCGTATTTCAGCTCGCCTTCGGGGATCGCCGCCTCCGAAGAGCA 

ACAACGAGCAAAGCAGTCTGTCCACGGACCTCCGCACAGACTCTCAGCGCCTCCCGCCTC 
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TCAGCAGAAACGCCCAACAGAAGGGTTAGAACCAGCGAGCACGCGCACCTTAGCCGGCCC 

TCCCCGCGTACCTGCCCGGGCGG 

Sequence 2676 

CCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTNACTCGGTCCNNNTACGGCAACC 

TCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTTATATGTCAAAAT 

ACACATTTCCCACCTTTGCCCANCNGTNNNAAAACCNTAGGAAGGANGAAAAANCATTTA 

AAAAATGACACNGGAATGTTAATGGAAGCAATGTGATGGTCGTTTTGGAGGTGGAACCCT 

TTCAANAAAGGTAA7TAAATGCCC7TGGTTAAGAAGAAGGCCAAAAGAAGCTTGCGCACC 

TTTTTTCCTGCCATGTGAGGAAGCCAAANAAGCCGGCTGTCTGCAACCTGCAAGAGGACC 

CTCACTAGAAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAA 

Sequence 2677 * 

CCGCGGTGGCGGCCGAAGGTAATCGGGGGCCAGTCTTTATACTGCTGACAGTAATAAATT 

CCAAAATCTTCAGACTGCAGTCCACTGACGACGAGAGTGAAGTCTGTCCCAGACCCAGAG 

CCAATGAATCTGGCTGGGACACCAGTGGCCCTGGTGGATGCAGCAAAGACGAGGAGCCTG 

GGAGCCTGGCCAGACTTCTGGTGGTACCTGCCCG 

Sequence 2678 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTTTTT 

( I 1 1 1 1 1 I I I { I I II 1 1 ! I IAI 1 1 1 1 1 T 1 1 I I 1 1 1 1 1 I 1 1 1 I 1 1 1 1 CTAAGNCATTACTT 

TTTATTTTGMGGATTTGNGAAACTNTTCACATNATGGNGANAGTTTGTNTGATTAANAA 

AAANCAGNTTTTTCATNAAATGCTTGGAGGNGAACNAGTTNTCACCCTGNGANANCCNAC 

CNTCCCATTAACTTNGAAGTTTNTNTTGATTAATANAAAAAAAAAGGGGAGGGNGAAAAA 

AAGGNGGAACATGCTAAAAACCTTNTGACAATCNTNCAAATGTCCCGCGTACCT 

Sequence 2679 

AGGTACTTAACTAGGGTTAGCCGAGGTCAGGAGGGTGGATGCCTTATTATGGGATTAGTG 

CCCTTCTAAGAGGGACAGTAGAGGGCTTGTTCATGCTCTCTCTCTCCATGCACAAGAAAG 

AGCACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCTGACCATGCTGGCACCGTGATT 

GCAGACTTCCAGCCTCCAGGATTGTGAGAAGATAAATATCTGTGGTTTAAGTCACCAAGT 

CTGTGGTGTGTTTTATGGCAGCACAAGGTAACCCAGATGATGCTTTTCACCACTGGGTGG 

TCCAGCCTTTCATGGAAGCTGTCAGCTCCTCAGCCTTGCAAAAAATGCTTGCTGGTCACT 

CTCTTTGGTCCTTTCAGATGCCTTTCTTAGTGGTGCCTCTA 

Sequence 2680 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCATACAGTTTTTCTAAT 
TTTCTAATTAAATGTMGTATTGTGAATATT7TCT 

AGTTT'GATAGTGGTATAAACTGTTTATATTTTGTATTCTATCAGGAAAGTATTACAGTAA 

TGGAATAATTATAATTTCATTGAGTGCGTCTCCTCTTTTTAATATGMTTTTGAGCMGT 

TTTTAGATTCATTTATATGAAGTGTCCA N 1 1 I GTAGGTAAAAGTAGTTGAAAAAGGTAA 

TGTATGGTTCAACCTAATAACAGAAAAtATCTAGCCTTAATTAGAAAAATAAGTCATAAA 

GACMCm GTTT CAAATAAGGGGAAATACGATAGCAGTCATGAAAAACTGAAGGATACA 

AA^AGAATATTTTAATTGGTAGCCATACAAAGTGGAGATATTTTAGAATCTATTTCATTC 

TC1MTATATTTGAGTCATTTTATTTTGGGTAAAATAMTTATAGGTGGAGCATGGTGGC 

TCACGCTGGTTATCCCAAAGCGCTGGGATTATAGGCCGTGAGCCCGCCACTCCTGGCCGA 

A ATCTGTTTTAAGTATAGAACATCATGAAATTGGAT 

Sequence 2681 

CCCGGCCGGGCAGGTACCATGGTTGGMTGATAAANGATATTGTCATTTTTGTTAGCANT 
GAAGTGAGTGGCATCTATGTA I Mil I AAGGTATATAATGAAATTGTGCCTAGGGGAGTN 
ATAATN C ACTCTATGTA 
Sequence 2682 

CGCGGGATGGAAAGAAATGAAGACANTTTTAGACNTGCTAGACTNATGGTTGACTATACA 
NCAACCATCTCAGAAAGAGTTATTCAGATATAGCTTCANACTGATANNTAAATCATATAA 
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ANTAATGTGGTANTCAAAATANGAGTTANGTAACTACTGACANATATAAGGAAAGTCGTA 

CCANTTCNGAACTAAAAACAATGGTCTATGTTGCTGGANGAACAATGTGGGGAGGGT 
Sequence 2683 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAAATG 

AAATTGGAGCTGGACCATTTAGTCAGTTCATTAAAGCAAAAACTCGGCCATTACCACCCT 

TGCCTCCTAGGCTAGAATGTGCTGCTGCTGGTCCTCAGAGCCTGAAGCTAAAATGGGGAG 

ACAGTAACTCCAAGACACATGCTGCTGAGGACATTGTGTACCT 

Sequence 2684 

CCGCGGTGGCGGCCGAGGTAC 1 I I I I I I I 1 M 11 1 II I U 1 1 I I I I NCTCCACATCTTCA 

CATTCTTTCTGAATAACAAGCCTAAGTCCCCATGACCTGATCACCTGTCAGGTGATCACC 

TTGGCCATCACCTTGGCCTTTATTCACCTAGTGATGCTTCTCATAGTAATGGATTTGTTG 

TCTCCANATGTATTTGAGTCACTGCATTTTTGGMTGGACTTTAAGATGTAAAGGNGTAA 

TTCTACCTGAGCAGAAGTGACAAGGGGCCTTTCCATNTGCACCCCCAGCC 

Sequence 2685 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I 1 M I I I N I 1 I 1 I M 1 I 1 i 1 GAGACAGGG 

TCTTGCTTTGCTGCCCAGGCTAGAGTGCAGTGTCGCAATCTCAGCTCACCGCAACCTNTG 

CCTCCCAGACTCAAACCATCCTACCCTTAATCTTCCAAGTAGCTGGGACTACGGGTGTGT 

GCCATCATGTCCAGTTMTTTTTAAATTTTTTGTANAGMCAAGGTCTNACT^ 

CAGGCTGGGTCTTAAACTNCTGGGCTCAAGCAATNTGCCTGCCTCCTTCTCCCAAAATGC 

TTGGGA TTAGAGGCATGAGCC ACAATACCTGGCCACTATGCAATTTGTATGTATTTTTAA 

ACGATAAI' I rTTTTTTTTTTTI I AAACAG AGTTTTGNTTTNTCNCCGAGGCCANAAGNGC 

AGNGGGCACAATCTCAGNTANACTGCAACTTCAAGCCTCCCAGGT 

Sequence 2686 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGGAGCTACAACAGGC 

AGGCAGGGGCAGCAAGATGGTGTTGCAGACCCAGGTCTTCATTTCTCTGTTGCTCTGGAT 

CTCTGGTGCCTACGGGGACATTGTGATGACCCAGTCTCCAGACTCCCTGGCTGTGTCTCT 

GGGCGAGAGGGCCACCATCMCTGCMGTCCAGCCAGAGTGTTTTTTCCAAGTTCCCAAC 

TATAAGAACTTCTTTAGCTTGGTACCTGCCCGGGCGGC 

Sequence 2687 

CCGCGGTGGCGGCCGAGGTACTCGCGGGCCTTGGAGGGGGCCACCGACAGAGCTGACATC 
ACCGTCATGATGCCTGCATTCCAGGAGTCACGGATGGGAAAGTAAATCTTTGGAGGGGCT 
GGGAGCTGGGGTATCTGCAGGCCTGTCTGCTGACACACGTACTAGAATGTTGGCTATAAT 
AGTTCTGTTCTTACAACACATGAMTrTTTTCGTTT^ 

ATGTTCATTTCTACTCACAAACATGTTCTTGGTGTATTTCTTATGCAAACAATCTTCAGG 
CAGCAAAGATGTCCGTTACATCTAAACTTGAATMTAAAGTTTTACCACCAGTTACACAT 
NNAAAAAAAAAAAAAAAANGTACCTGCCCG 
Sequence 2688 

AGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACACAACAGAGGCAGTTCC 

AGATTTCAACTGCTCATCAGATGGCGGGAAGATGAAGACAGATGGTGCANCCACAGTTCG 

TGTGATCTCCAGCCTGGTCCCCTGGCCAAAAGTCCGAGGGATACTGCTACTCTGTTGACA 

GTAGTAAGTTTNCAAAATCTTTCAGGTTGCAGAACTGCTTGATGGNGAAGAAGTGAAATC 

TGTCCCAGATNCACTGCCACTTAAACCTTTGATGGGGACCCNACTTTTGCAAACTGGGAT 

TGCANCNTAAGAATGAGGGAGTTTAGGGGGGCCTTTCCCTGGTTTCTTGCTGATNCCAAT 

TTTAAATAGATATTAATGGACTTGACTTGCCCC 

Sequence 2689 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTTGTATCNGG 
ACTTATGGTG GCC AC C AAATATG AAGTG AGTGTCTATGCTCTTAAGG ACACTTNG ACAAG 
CACACCAGCTNATGGAGNTGNCNCCANTCTGGNNAANGCNNGNCCACCAAGAAGGGCTNG 
TGTGACAGATGCTACTGNNACCACCATCACCATTAGNTGGAGAACCAACACTGAGACCAT 
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CACTGGCTTCCAAGANNGATCGCCGTTCCANCCAATGGCCATGACTGNCAATCCACAGAA 
CCATCAAGCTCATATGTNCACCAAGCTACACCCATCATCAGGTTTACAACCAGGCACTGA 
CTACAAGATCTACCTGTACCTGCCCG 
Sequence 2690 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTTTTTT 
I I I I I I I 1 I I I I { 1GAGACAGAGTCTCACTCTGTAGCCCAAGCTGGAGTGTAGTGGCGTG 
ATCTTGGCTCACTGCGACCTCCATCTACTGATTTCAAGTGATTCTCCTGTCTCAGCCTNT 
CGAGTAGCTGGGATTACAGGCGTGTGCCACCATGCCTGGCTAAi III)! I GTATGTGTAG 
TAAMGATGGGGTTAGCTGTTGTTTGATTTTATATAGACCGTGACATGTCTTATGATATG 
AGMTGGGCTGGAACTATAGCCTCGGGGTCCTTGCCCTTTCCTMGGACTTACTGTTTAT 
TGATCAAGTAAAAGTGAMCAGGAGCATGTTTAAAATMTGTAGCCTCATTTTAAAATT^ 
TCTGC ACTCTTAC AAGTCAATAGATC ATG G 
Sequence 2691 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCGAAGA 
CAAATGATCCTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCA 
CAGGTTC AATATTTAATGCTTCAGAAGCATGTGAG GTTCCCAAC ACTGTC AGC AAAAAC C 
TTAGGAGAAAACTTAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATT 
CTGTTGACAAAGGGAAAACCTTCAAAAGCAATG 1 Mill] INCCTCCACNACCAACAGAA 
ACATGCCANTTAACCTGCCCGGGGGCGGGCCGNTTCTAGAAACCTAAGTTGGATCCCCCG 
GGCCTGCAAGGAATTCGATATCAAGCCTTATCGATACCCGTCNNACCTCGA 
Sequence 2692 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTGGGTGCTGC 

AGCACTGGAGTGCGAACTTGTGGCTGTGGTTATGAAGGAGCCTCGTCAGCTATGCCTTTA 

ACCCCTGGTGAAGCAGTGCCAGTCACACTTTGCCAGTCAAACGGAGAGGCAGCAGTAGTA 

AGACCAGACCAAATCAAACCAAACCAAGCAAACCACGGGGAATGGAGACACAGAGTTTCA 

TTCTGTCACCCAGGCTGGAGTGCAGATGCAGTGGCATGATGATAGTGCACTACAACTTGG 

AACTCCTGGGCTCAAGCCATCTTCCCTCCACTGCCTCCAAGTAGCTGGGGACTACAGGTC 

ATGTGAGCACACAGCATAAAGGCAGNTGCCCACAAGCCAGGAAGAGAAGCATCATCAGAA 

ACCCG ACCATACTGGCTTCCTG ATCTCAGACTTTCAG C CTCCAG AATGGATTATTGCCT 

Sequence 2693 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAGAGAGACACACA 

GATCTAAGTCGGGGGCCAGAGGGAGCTCCCTCAACTAGGCAGCACAGCCCTACTGATCAG 

CGCATGTGTGTAAGGAACCACCTCAGGCCAGAGAAAGAGCCATGTGAGAAGATTAGAAGG 

AACAGTGGTGCTGACACAGGACCTACiGGTAGGGCCTGTTCCCACCACCCTAGCTGGAAAA 

CCTCATGATTTATGGGGCACTGGGTA3AAAACACAAGTGAGAAACCCTAGATGGGCACTG 

CCACCTGACACAGCTATACAGCAGGACCTGAAAGGATCACACTTGCATCGTAGGACAAAG 

CTCAAAAGTACCTTGCCCGGGCGGCCCGCTCTAANAACTAGTGGGATCCCCCGGGGCTGC 

ANGAAATTCGATATCAAGC 

Sequence 2694 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGAGGTTTCATGACTCACCACA 
GGGGCACCCAGCAAATTGGCATCCGAGCCAAMTGAGAGTGCMGCTTTTGAGTTTCCAA 
TGTAAAGCTCATCAAACTTTG ACATTACTTC 1 1111 1 ATATAAGGGGC I 1 1 I 1 1 I 1 I I I I 
TTTTTGCATAAAAAATTTTAAAAAANCAAAT^ 

NCTNNTGNCCATANAAACNTTGCNTGTNCTGNCCCNAGGNAANNTGGAANNTTCGNANGG 
TNTNGCNCTCNACCCTGGGTAACNANCAMNNTTTNTTTCAAAAAAAAAAAAAAAAAAA 
Sequence 2695 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATACCCCCACACCAGGCTAATTTTTCCTATTT 
GTAGAAACACGGAGTCTTCCTATGTNGCCCAGGCTGGTCTCGAACTCCTGGGCTCAAGCA 
GNNNGCCCACCTCACCCTCCCAAAGTGCTGGGATTATAGGCATAAGCCACCACGCCTGGC 
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CAATTTCTGTTAAATTTTACCTTCATTATTCTCTAATTCTAGCCTGCTTCATTTCTGTCA 

GCGGNACTAG TCAC TGTTTTCTACATTGTCCATCGGGAATGTATGCCCCATTAGGATGGG 

GACTTTGCCTATTTTGTTTACCATTATAGNATCATTANTTGAAATAGTGCCTGGNCGTGT 

TTCCAAAAAAGTACATTATACTATTCTCTTNAAATAAAAAACCACAGTAAANAAATTTGT 

CTTTCAA 

Sequence 2696 

ccgcggtggcggccgangtacaccmgcacctatttttataacttagctncccatggnga 

gataatggcttgcgtgcatnttatgtatccataacatacatacaaggctcggnctttnca 

atgggatmcanttcacaactcttcgatttgaantgtaatgaatctggcgacaaggattt 

ttctctaatggatnccaaagttagccagaacttttaatgtnaagatgaaaaagggtgtaa 

ggtgttatattntcttcaatncctttaccacaggaggctaactccacaatttccctcatg 

tttctcattcagaaaaaaaaaaaatmttaaatttttgtgtntccgm 

gaatngctttcttatgntgcctgatngttttnnannttncttgnamntntaaanctttg 

ggnttctttaanaaantttt^^ 

TTTGGTTATTAAAAACAATTTN I I 1 I i ii NTTGGG 
Sequence 2697 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCCTAGCTCC 

AGCTTCCCCTGGGAGACTGTGCATCTCCTGGCTCCACTAACACCACCTTCTTCTGACCTT 

CCAGCCTAGAGATGATGACTCTGCCAGCCTAGATGGGCTCTGGGTTGTCTCCCTATTCCT 

GTTTGCTTTGTAGATTTCCCATTATGCTGTCACCAACTCCCCAGCCTAAGCCCTCTCTAT 

TTTAAATTCTCAAGTGGATTATGTTCCTGATTAGTCCCTGACTGATATACCACTCTCCTC 

ATGATCTCTGATTAGTTTTCCTGTTAGGTTGTTG 

TCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTGCAG 

Sequence 2698 

CCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAACTTGTGT 

GCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACG 

GCAACCTCC TTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCAAGGAAGC 

TTTTCTTTTGAGCTATTTACAGCTTTTAGCAATTGAGTAAAGTATACTCCTGTGAACA 

ATTTGGAACATACTTGTTTCTCTCTAACTGATTTCTCCAGAATTTGGAACTAGCTGTAGC 

TGAGACCMTGATGCTGACCTCCCTCAAMGCTGCATTTCTGAATTTCTGAAGGCAAACT 

GTCTGCCTATATTGTACAGCAAAATAATTTCCTGTATTAAAACCGTTTTTATTGCTCAAA 

AAAAAAAAAAAAAAAGTACCTGCC 

Sequence 2699 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAGGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATTC 

CTTGAAATTTGCAAGTTGTCTGGGT 

Sequence 2700 

CGGNGGCGGCCGAGGTACGCGGGGAGTCTTGGTTTTCTGCTAGTGCTGCTGCTGCTGGGA 

GGACGACGGACGGCAGCGGCCAAGCAAGAAGAAAGACGTGGCAGCAAGCGGGAGTCGGGG 

ATAGTGTCATCGGTTCGGTAAAGTTTCTGTTCTCTGATTTTGGGGGATTNTTNCCTGCCC 

AAAAAANANNNAAAAAAAAAANTTGTTCCTGCCCG 

Sequence 2701 

CCGGGCAGGTACACATGCTAGGGTCCAGGACAGCAGGACCAAGCCAGCAGAAACAGCCTG 
AGCCCACCGCAGACTGGCCTGGCTATACTGGACAATGCCACTCCTCCTGTACCT 
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Sequence 2702 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAG 

ATGTGCGGGAGGCATATCGCCGTAGAGGCTGGGCCTTCAAGAAC 

Sequence 2703 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCGGGATGGCAACA 
GATGGTGCGTGCCGGGGACCGCGTTAGCAGCAGCACTCTTGAGAAACCAGGAAAAGAGCC 
CCGCGTACCT 
Sequence 2704 

CCGCGGTGGCGGCCGAGGTAC I I I I I I 1 I 1 I I I I I 1 I I I I I I I I I I I 1 I I I'CATTCTTTT 

CTTATAAGTAAGTTGTCCATAAGTGAACACTTNAAAAGCAACAATTCCNTCCCCTGGATT 

CAAGGAANAATCCTATGCATAATCATTCACATGCATNATGACCCANATTNTTACTGGGTN 

GTTTAAAAAGGGTGAANCAGNGGTCCTTTGAGTGTTCAAACCCANTTCCCAAGGAGCACA 

NATNCNCCTGMTCCCATNTTGGGTTGCAAGCCCTGTTGAGTAAAAAANAAACCTTNTGG 

TTN C AAC ATTCATCCTTTTN ATG ATAATTG CC AAAAMTTTTTAAC AAAC ATTAATATAA 

TGCNNAACATCCCAGTACCTGCCCGGGCGGCCGNTNTTAAAACTAGGTGGGATCCCCCGG 

GGCT 

Sequence 2705 

AGGTACTTTN 1 1 1 11 H 1 i 1 1 1 1 I 1 1 H 1 CAATNTTTATTTTGGTTTTCtNACAAAGGTT 

GACAGTTTTCCATMCAGGTTGTAAGAGTNTrGAAAAAAAANTTCAM 

CGGG 

Sequence 2706 

CCGCGGTGGCGGCCGAGGTAC i I I I I I I I I I I I I I 1 I I I I I I I I I GNTTNACATTTGGGG 

GTAAAGTGCAAGTTTGTTAAAAAGATATTGNGGAATGTTAAGGCTGGAGTACAAACAAAT 

CCATCATCCAGGTAGTGAGCATAGCACACAATTGGTAGTTTTTCAACCCTTTTTTCTCTC 

CCTCCCTTCTCATGTTCCCCAGTGTCTATTGTTCCCATCTATATGTACATGCATACTCAA 

TGTTTAGTTCCCACTTATAAGTGAGMCATGTGGTATTTAGTTTTAT^ 

CAGTCCANATTTGNGGCTCACAGGCTGGGTANATAAANACTTTTGTGTCCAAATTTCCAA 

CCAGTAGCTGCTATACACACATATCTGGGCCTGCCGATTTTCCCAT 

Sequence 2707 

GGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTC 

ACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACC 

ACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCA 

AATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCT 

ATCGCATCTGCCTGGGCMGTTCACCTTCCCTGGGATGTCCCTGGACAANAGACAAGGAG 

AGGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTG1CCCGCTGGAACC 

CATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAA^TNCACCAGTG 

AGAAATTCCTT 

Sequence 2708 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTACTCAAAAAGAATCA 

GTCCTGAAAGGGTGGGCAAATGTGCAGCTCTCCCTCAGGGTCGTGGGATCTGGGTCCAAC 

AGTCTGGCCTTTCAGCCTTCAGGCTGTTTTAGGCTTGMGTTTCGTGTGGCTGATCAAAG 

AAAGAAGGCATACGCAGATTTCTACAGAMCTACGATGTCATGAAAGATTTTGAGGAGAT 

GAGGMGGCTGGTATCTTTCAGAGTGTAMGTAATCTTGGAATATAMGMTTTCTTCAG 

GTTGAATTACCTAGMGTTTGTCACTGACTTGTGTTCCTGAACTATGACACATGAATATG 

TGGGCTAAGAMTAGTTCCTCTTGATAAATAMCAATTAACCAAATACTTTG 

Sequence 2709 

CCACGGCTCATAGCCTGTGGGTTGGTGCTGATCGGCATGTTCGTGATGAAACTCTAACCG 

TCGGACAAAACTAAAAMGGMTGGGTTATGACAGTCGCTTTCTGGTGTGTGTTGATCGC 

TATTTTTCTGCCGTATCTGTGCACGGGCGTGGCCAAGTTCAGCGGTGGCAAGTTCGGGCC 
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CCGGCAGAACCATGACCCCCGAGCTTTCCTGGAAACCCTCGAGGGGTTCGCCAAGCGTGC 

CCACAGTGCGCAACTCAACAGCTTTGAAGTGACCCCGGCCTTCGCGGCGGCGGTGATCAT 

TGCCCATTTGGCCGGCGCTGCGTCGCTGGTGACGATCAACGTATTGGCGGTGCTGTTTAT 

TACCAGCCGGCTGCTTTACATCATCTGCTACCTGGCGGACTGGGCGATTCTGCGGTCGGT 

GGTATGGGCGGTGGGG 

Sequence 2710 

AGATACCACTATGCCTGGCTAATATTTTTA I I I I I I I ATTTTTGTAGAGACAGGGTCTTG 
TTATGTTGCCCAGGCTGTTCTCAAACTCCTATCCTCAAGCCATCCTCCCATCTCAGCCAC 
TCAACTTGCTGGMTTACCAGCGGAAGCCACTGTGCCCAGCCCAGACTCCCTTTTACCTA 
GTATATCATGTCTMTTTCTAACAAAAMTTA^ 

ACCCTAAAACAAACAAGCAAACAAAAAACCCAAAACAACAAAAAGTTTCMGAAACAGAG 
CTAGCTGACAGMCTATATTCAGATATGGCAGGTATATTAGAATTATCAGATTGGTAATT 
TTAMCAACTATGACTAATATGGTAAGAGTGATMTGGAAAAAATAGACMTTTGCAAGA 
AC 

Sequence 2711 

GCGAATNGGAGCTNCACCGNGGGGGNGCCGGGCAGGTACCCATNTAAACCCCAAACCTCG 

GGCTCCAGAAGCAGATGAGCAGATGAGGAGACGCCTCTATTCCGTGTCGTNTAGAATGAG 

CAAACAGTTGTCTTCACCATCCCAGACAAGCAGATGAATGGTGAAACGATNCATCATGAT 

AAAAGAGAGATGAGGNNCATTTGAACGCCNAGAGGAGTTCAGTCGAGGGTGGTCGGAGAG 

GTTTGGCCACCCAGCAGGAAGATTATCTTCCCGTGCCATCCCCGCTTNCAGCTCTGCATC 

CATCCTTCAGAGAACCACCTCCACCACTCAATAAAACCCTGCATTCAACCTGCNAAAAAA 

AAAAANAAAAAAAAAAAGTACCTN 

Sequence 2712 

CCGGGCAGGTAC rTTTTTTTTTTTTTTTTI GGTCTTATAATATTTGTTTGACACCTCAAA 

CCTCTTATTTTCATTTCAGAGACTGATTAAATACAGTCTCAAGGTGATAAMTAACACAG 

TAGGATAAAAATAAAGTGACAAGTGGCAAGGGATGGTTAGTCATCTTTGATACCTGTGGA 

TACTCAAACCCACCATTTAAMTGTCTTAGGTGTTAGATGTGGTAGGTGTTTCTATCAGG 

AACACGTGGGAATAAAAACAMTTTAGAACTCACATCTGGGCTTCTCACTTCTAAAACAA 

GGGTCAGGAGGCCAGAAGGCGGAGCAGCGCATCAGTTAGGATTTGACAGGACC 

Sequence 2713 

AGGTAC I II I I n I II 1 1 I J I II I I i I I J GGGCAGCTAAAATCTAGGATGGNGTTAAGTT 

TCTTCATTTTGTCAATTATATATAAAAAATTAGAAACAACACCGAATCTGCATTTCTTGG 

ATGAGATAGTTAATAACAAACTATTTCTCAATATTTGTATACTAAAAACTAGTGAAGGTG 

TTATGTGTTTCANTATCTTATCTCTTATTTGAACATGGGTTTCTGAAAGGAGCCTATATA 

ATAATATAAATGGTATGTAGTAAATGAGGCACTGTCTTGGCTGGGACTGCTATAAAAAAA 

TTACCATAGACCATTGACTAAACCACAAACATATACTTCTCACA 

Sequence 2714 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTNCTGCCTGGGTTA 

TGGGANTNAAATNTTGNCCAGGGACATCCNAGGGAAGGGGAACTTNCCNAGGCAGATGCG 

ATAGACAGCGCTCATAGGAATCCGCTGCAGNTGNACACAACTCAGCATGATGAATTNNTA 

TNTGC AGG ATC AAG AN AGTCTTGTCTGTG ACCAC N NG AATTCTCTCCTTCTN ATTGCTCC 

AGTGGTCTATCTTTGTCAAGAGCCAGAA 

Sequence 2715 

GGCGAATTGGAGCTCTCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTCCTGAAA 
GGGTGGGCAAATGTGCANCTCTCCCTCAGGGTCGTGGGATCTGGGTCCAACAGTCTGGCC 
TTTCAGCCTTCAGGCTGTTTTAGGCTTGAAGGTGGG I I I 1 1 GCTAGGACCC7TGGCTGTC 
TCTCTGTCAGATGGATGACTAAGTGAGTTTCGTGTGGCTGATCAAAGGAAGAAGGCATAC 
GC AG ATTTCTAC AGAAACTACG ATGTCATG AAAGATTTTGAG G AG ATGAGGAAGGCTG GT 
ATCTTTCAGAGTGTAAAGTAATCTTGGAATATAAAGAATTTCTTCAGGTTGAATTACCTA 
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GAAGTTTGTCACTGACTTGTGTTCCTGAACTATGACACATGAATATGTGGGCTAAGAAAT 

AGTTCCTCTTGATAAATAAACAATTAACAAAAAAAAA 

Sequence 2716 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I rTTTTTTTTTTTTTTTTTTI I AATGNGCANA 

GGATTGACCTGTGCATGCTTTTGATCTCTCATTCAAAGGATCAATATTAAATAAAATTGT 

CATGAGOTGTGTTGMNACAGGGCGCTTTCAMTAGAGGTAATTTGCTCTTGTGTTGTAA 

GAGGAACATGTCAACAAAGATAGGAAATGAGGGTGATCGTGCANATGGCTTGTATCTTAT 

ATATGCAAAGGAGTTCCCGCGTACTTCTGTCTTCCAGTTTTCCACTTCAAACTTCTATCT 

TCTCCAAATTGTTTCATCCTACCACTCCCAATCAATCTTTCCATTTTCGTCTGCGTTTAN 

TAMTGCGTTAACTAGGCTTTAAATGACGCAATTCTCCCTGCGTCATGGATTTCAAGGGC 

TTTTAA 

Sequence 2717 

GCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC I M I I i I M M M I I il l M U I I 

GAGTTCCAGGATACAGGTGCANAGTGTGGAGGTTTGTTACATAAGTAGATGTGTGCCATG 

GTCGTTTGTTGCACCTATCMCCCATTATCTAGGTTTTAAGCCCCATATATGTTAGCTAT 

TTGTCCTAATGCCTTCTTTCCCCTCACCCACCAACCGCCCTCAAGTCAGTTTTNTAANAG 

TATTAATCAAG AAACCATNTN ATAATC ACACCAAAGCATATTTNTAC N CAAG ATNTAAAA 

TNCTAGGATATTTGCTAAANATA 

Sequence 2718 

CCGCGGTGGCGGCTCCTATGGATGCTTTAAGGCAGTGTTTCTCAACCAGAGACAAGCTTG 

CCACCCACTGAATATTTGTCAGTGTCCAAAAATGTTTTTGGTTGTCATGACTGGAGGTGG 

GAGGAGTGTGCTATTGACATCTAGTGGCTAGAGGGCAGGTATGCTGCTAAATATCCTAAC 

ACACAGGACGGCCCCTCTGCAACAAAGAACTACCTGACCCAAAATGTCAGTTGTGCCAAG 

GTTGAGACACCCTGCTGTAGGGGATAGAAAGAAGTGTAAGAAATGGTTTCAAACTCAAGA 

ACAATGATTAGAGAGATGGATGGATGGATAGATAGGTACCTGCCCG 

Sequence 2719 

AGGTACTGTATTTCCAAAAGTAAGTTTCTATTCCTAGAACTAAAACAGAAAAGMGTGTT 

GTGAGATACATAAATGAAGCCMTAAATTTAACCAAGGGCTTACCATTTTCCTGCAGGAT 

GTT 

Sequence 2720 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACCATCAG 

CACTGCTGATTTTCAGTTTTGCTATCTATTCCAGGTTTGTCCTTACTCCCAAAAAAACTA 

ACTAGAGAGCACAACTGAAAAAAATATATAATTCTTTGGAAACTGTCCCTG Al I 1 1 l A TG 

CAAATGATATGCTTCAACAGCATTTCAGATCCATCCATGTTGAAACCfGTCCGTTCTTCA 

TGCGTTGCTCAAGTTGGTGATGCTCTGAGGGTGATGCTGTTGCCATAGCTGTTTCTGTTG 

GACCGCAAGCTGTTGAGACTGGTCTGAAGTTGATAGGAGATTTACAAGAGGGGACACACA 

ATTTGCCGGAATGATCAAACCTACAATCCGTTGCCCATCTTCCGTTCTTAGCCGCACGAT 

CTGCATCTTCACGTTTGTGCCACTGACAGATGCTAGAACACCCTCAAC 

Sequence 2721 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAAAAGTTTAAATCCTAA 
TGTATTAAAAAAGTTTMTAGCACTATMTTTCAGMCTTTATATCATGCTTATCATACC 
ATCCCATAGTATGGGAATCTTTCCATTTCAGTAAATGTAGATCCACAGTAATAI I I M AA 
GGGCTATG C ATTCCATTGTCTAG ATGAACCACAGTTCATTG AAAC AACTGTCCG ATG ATG 
AACGCTGAAGTTGATCGAGCGGCCGCCCGGGCAGGTACTGCGGCGGGTAGGCCTAGGATT 
GTGGGGGCAATGMTGMGCGMCAGATTTTCGTTCATTTTGGTTCTCAGGGTTTGTTAT 
AATTTTTT ATTTTT ATGG G CTTTG GTG AG GG 
Sequence 2722 

CTATANGGCGAATATGNAGCTCCACCGCCGTGGCGGCCGCCCGGGCACGGTACTCTGTTT 
TTTCAAACNTTTNNTATTACNTTTTTATACTNNNTTTTATTTNTNATT^ 
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TTTTTNATGTTTTNCATATCCTTGACTTT 

AGCCAAAGCATTGANGTTCACTTGGTTCACCAAAGGCCCGACCCCAAACACATATACATN 
CAGNATAATCCTCCCTTGGGTTTTTGCGATCCTTGCCAATGTATANCAAGTCCCGGATCT 
CATNAATGCCCNCGTACCTGCCCG 
Sequence 2723 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATATTTGCTTGTTGGC 

TTTATGATAACTTAGAMTMTATTCTGGATTAACTGTGTGACTCATGAGAGAGTTTGTG 

CAACTATGTTTCCAAATTTACTTGATTTTCMGACTTACACTAGAACTGTGCGAACAAGG 

TAATAAGTAAAGCATATGTATTAATACAACCTTTGGTCAACTCTTGGGTAGCCCCAGTGT 

TGGCCTCCCGCTTTGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGCAGCCAG 

CCAACATAGAGATAATCTTAAAGACTCCTMNAAAAAAAAAAAAAAAAAAANGTACCTGC 
CCG 

Sequence 2724 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTTTTTACTGCATC 

CTGCMCAAATATCACTGAMTGATTCATTTCTGGAAGCATTCTACTGGAGCATCCCGTG 

GCCTGGAACAGAGCTCTGGGGACTGGCATCCTTCCACTGACTTTGATGCTGATGACAGCC 

CTGATCGTGTGATTCAGCTGGCCTTAACCCGACCCACGTGCACGTATTCCTCAGCACATC 

TGGAGCTGAAGTCGAGAGCCTCTGTGGGAACGCTTGGCAGCCCATGCTGTTCTAAGGCTG 

GAGCAGGATAGACTGCTTTCATCTCTGATGTTAGAAAGCAGACCTGTTTCAGGGTTTGGG 

GMGGTTGTTCGATATGAACTGGGTCCTCTCTMTTATTTTTACTGTATGTGTGACTTCT 

TCCTAGAAAGMTGGAAGAATGTTTATGTTAGAACATTTTATCTATTC 

Sequence 2725 

TATAGGGCGAATTGGAGCTCCCC GCGGTGGCGGCCGCCCGGGCAGGTAC' i rTTTTTTTTT 

m m m u i mrc ) 1 1 n i ii i m m m m ) n n 1 1 1 1 < m 1 1 1 n m n 1 1 n 1 1 n 

»■ m i u i I l I I I U II I II I I 1 111 I AAAAAAAAAAAAANGGGTNTTTTNNNTNAAANC 

^^^^^^^^f^^'^' ~* nnisjnni^<3y^yx t >^^0o^^i s ^i f^i r*^ c^csAX^v^^s^^x^^j^^^^rc^ i^si 1 1 i"rvi rvi tvi xv-^.rsi r^i ivi g<s^i — frsi cs 

NTTTTTTI I ANNGGGGGGGAAAAAAANAAAAAAMGGGGGNATTTTAGNNATTNNGTCCC 
CNAANTTANNCGNNGGGGNNGGNAANCCCCCGGNCNTTAANNAAANNGGGGAAAACNTTT 
NTNTTTNNNGGNGNGGNNCACCCCCCCCNTNNTNGTNTTTCTNNAAAANNNNGGNGGAGA 
ATNANCNGNTCTTCANANCANGAAAAAAACNCNTCNCCNNCCCCNCTNGAACTTGGGTAA 

GNNTAAAAMNAAGGGNCCCCCNGGGGGNGAGGAATTCANTTNTAGGNTTTTTNNACCCC 

CCAANCGGGGGGGG 

Sequence 2726 

TATAGG GCG AATTGGAGCTCCCCGCG GTGGCGGCCGAGGTAC I i I I III II I II 1 I I H I 
mm uGGGGll I 1 1 1 1 ) U I I M 1 I I 1 1 1 1 ) I 1 1 1 I 1 1 1 1 U 1 1 CNGGGGGGANCATGG 

GACCTTTNTTCGTTAAAACATCAGGCTCCAAATNTGGACTTTNANCAAAANCCNTTGCCG 
GGAACA 

Sequence 2727 

m m M i M I I I I I i II I n I I rCCGAAAGGAANCAAAACCTGGNGGGTTAATTAATCCT 

TNTTGATGCTGGGCAGCANTAATAGGATCAAGAANAAAGCCTTTCTCCTCCTGTGCTCCC 

AAGGATGCCCANCGGGANACTGTGAGGANATGGAACACAGCCTTCACTNTGTTAATTGTT 

CAANAGCAAATCACCTGCCGTGAGGACCATGACANTCCTGGGGGCCATTTGTANAATATT 

CATCCCCANACTNTGMCTGAGTCTTGAAGTCTGAGTAANAATTTGCTGGGTGAAAAAAA 

GGGTMCGTTNTTTCAAAANGGAGAAACCAGCTTAAACAAGGGTCCCGGGGGGCAGAACA 

TCATGATAGGAGTAGGAATGCAGGTGAGGTTTTGGTTTTCTGCAAAATATTGGGTACCTG 

CCCGGGCGGCCGTTNTANAACTAGGG 

Sequence 2728 

CCGCGGTGGCGGCCGAGGACGCGGGTATGCATGCATTTTCTCTGAGTTATATTTTAAAAT 
CTATACATTTCTCCTAAATATGGAGGAAATCACTGGCATCAAATGCCAGTCTCAGACGGA 
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AGACCTAAAGCCCATTTCTGGCCTGGAGCTACTTGGCTTTGTGACCTATGGTGAGGCATA 
AGTGCTCTGAGTTTGTGTTGCCTCTTTTGTAAAATGAGGGTTTGACTTAATCAGTGATTT 
TCATAGCTTAAAATTTTTTTGAAGAACAGAAC I Mill I AAAAACAGTTAGATGCAACCA 
TATTATATAAAACAGAAC AGATACAAGTAGAGCTAACTTGCTTAAAAAAAGGATG GAGG C 
TCTGAAGCTGTGACTTCATTATCCCTTAATACTGCTATGTCCTCTGTAGTACCTTAGATT 
TCTATGGGACATCGTTTAAAAACTATTGTTTATGCGAGAGCCTTGCTAATTTCCTAAAAA 
TTGNGGATACA I MM I CTCCCATGTATAATTTTCTCACCTT 
Sequence 2729 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCCAAAAGTGGGTCTTG 

AAATATAATTGTTTCTGAGAGGAACTTATTCTCAGGATA I I I I I GTAAGAGGGACAAAAC 

TACTTMCAGTGTGTATCCTTGCAATAAGTATATTAATGAATACGATACMCCATTTTAT 

ACAGGCAAAAGATGGTTTAATGCCAAAGCACAAACTCATAAACCCAGTTGATGAGGAGCC 

AGMCAATGGTCATTACTGTATAGAGATGTTTTAAAAAGTCACCATATAAGCTTTGAAAT 

AATTCTAATGAAAATTATATGACCTTGAGGCTGGGCGCGGTGGCTCATGCCTGTAATCCC 

AGCACTTTGGGGAAGGCCNAGGTGGGTGGATCACAAGGTCAGGAGATCGAGACCATCCTG 

GCCAACATGGTGAAACCCTGTCACTACTAAAATACAAAAAATTAGCCCGGGCATGATGGT 

GCATGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGGCGGGTCGCTTGAACCTCAGA 

GGCAGAGGTTGCAATGGGCCAAGACGGTGCCACTGCACTCCAACCTGGGGGACAGAGCAA 

GACTCTTGTAAAAAAAAAAAAAA 

Sequence 2730 

AGGTACTGTCATGCMGGATGGGAACACATTCTGGGAMTGTGTCGTTAGGCMTTTTGT 

TGTGAGAACATTATGGAGTGCATTTACATGGACCCANATAGTANAGCCTACTACATACCT 

AGGCTGTATGGCATANCCTGTTGCTCCCAGGATACAAACCTGCNTAGCATGTTACTATAC 

TGAGTACTCTTTGGAAGCAGCTGGACTTGGCTTGATCGTANGANAATNCGGNGTCCAGNT 

CNCTGGGCANACTTCTCCATGTGTTNCTACATACTGGAACGCCTTNACCAAGCGGAGGGN 

NTNTTNCACGCTTCAGGCCCACTGGGAGATCGTTGACGCTTNAAATGCTTGATGACTCCA 

TTGGGGTCAA 

Sequence 2731 

NCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI M M M I M M 
M M M I M M I M M M M M M M I I GGGGAACCATNTTTNAATTTATTGACATTGNC 
AATTN ATNAACAAAACAGGA I M M I M M ICC 
Sequence 2732 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCNNGGGACCTTNGCNNTCNAAA 

ANACCANCTNTGACACAGACTAAAAGCATCACCTGCTCAACTCTCAATATTGCTTTATCA 

GCATCTTTTCTCTGTAGCTCCAGGGGAATCTTTTCTCTAAMCATTTATGCCCTTGCTTT 

GGCTAGAMCACATTMCTGGTTTACTCATTGAGAATCCAGCATATTTAAGAGGTGACCC 

TGTGI MM IGTGATATTGAGGCATTCATACAGAGCTGCAGTTAGACGGGGTTACGGGGG 

CTAAAAGCAGAAAAAAAATTCCATTTCATCGGGATGGAACTGAAGGATTTTATTCTATAA 

AGCGGCCCTGGTTGAATCTGGCAATTC 1 i I I I GCCAAGATCCCTAGCAGAAGATTTAGCC 

CATGTCCTTCCC 

Sequence 2733 

TACTTAGGGCGMTTGGAGCTCCCCGCGGGGGCGGCCGAGGTNCMGNANTTNAAATTTC 
TTTAG N CC NCTCAMTTCTTGCTTCTAACTTATGGCTCTTCATTTTCTCCTTTTTACGGG 
GCTGACGCACCCANTCCTTAGGGGTCTTTTCTGTTTCTAAGACATGATGI I M I IGTTTG 
TTGNCCACCTTAGGAGCTTACTTTTTGGTTCACAGTCTGAATTATNTACAATGTCCACAA 
TTCCATGGTCCCCTTTNAATGGATATGCTATTANACACAAGTTCCTGNCTTCATCTTNTT 
CAAAGCGCACAGATACCACACCTTTGNACCGGCCCGGGCGGNCGCTCGGN 
Sequence 2734 

CGCGGTGGGCGGCCAAGGTCCCNGGNNAAAATNAGGAGGANCCCGGAAGGATGGGNTGCA 
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NTTGNGGAGGAGGGTTTCAAAGGAAAAAGGTCGGAAAGCAAAGGCCTGAAAAGCCAGAGG 

CAGGTGGAGAGAGGGTGGAAAGTGAGCAGCGGGCCTGGGCTGGAGCCGCACACGCTCTCC 

TCCCATGTTTAMTAGCACCTTTAGAAAAAATTCACAAGTCCCCATCCACAAAAAAAAAA 

AAAAAAAAAGAAAGTACCTGCCCGGGNCGGCCCGCTNTAGAACTAGGTGGATCCCCCGGG 

NNTGCAGGGAATTTCGATTATCAAGNNTTATTCGATACCCGTCGACCTCGAGGGGGGGGC 

CCCGGGTACCCCAGCTTTTGTTCCCTTTTAGTGAAGGGTTTAATTTGNCNCNCTTGGGCG 

TAAAT 

Sequence 2735 

TGGCGGCCGAGGTACGCGGGGGCACGTGGGAAGCTCGCANCAGCTGGGGAGGAGCCAAAN 

CCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGATGAGGAMCAGCAA 

CCCAGANAGGAGAACTAACCCACACAGGATCATTTNGCGMNGANCAAANGNTTAAATAA 

CCMTACCTGNACTTT7TTTTAGGACAAACTTANTGCANNTTTGMGGANGCCAACCATG 

GATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATCATTANC 

TTGCTGTGCTGNTTTAGAGCCCTGGGGGCCGATCTATGCTTTAACACCATCTTACTAACC 

ATTATTGCTATGGTGGTATACACTGCCTATGTTCTTTATTCCAAATCCACATTCCCCTGG 

CTTGGGAAATTT 
Sequence 2736 

CCGGGCAGGTACTGAGTCMGGACGTCTTTAACGTCATGGACGGCTCCTTTACCCACGCT 

TTCTAGATCTTCGACTGCATCTTTTCCTAGTTTTCCGAGTCCCCCCACAGCl 1 1 1 1 nGC 

TCCGTCTAGGCC7TTTTCCAGAAGGCTGGATCTCTGCTTCCTTGGCTTTTGGTGCCTGTC 

TTGGCTAACCCTGGGGTCTTCACCTGGCATTTTCCTTTTGAGCTTGCTGATGCTTCATGG 

CAAGGGTTCCCCGATCCTGGGGCAGAGGCGGCCCTCTGGATCATAANGCACAAGACCANG 

GCTCCTGCCAGAGCTGTCAGGAAGAGGAGAGTCATTGAACCTNATGCTTCTGTGTTGCTT 

GGAGTGGGTATTGCCCACCCAAATCCTTGGAAGATCTTTGGGAATCTAGGGTCCCCGCCG 

TACC 

Sequence 2737 

AAGGTGCATTCTGCTTCCTGCAGGGGCTTGAAACACCAAGGCACTCCAGGGATCCTGGAG 

TCAAAGCAGCAGCCCCGGTTGTTGCACTCCTTGGGGGTGACATGGGGGTAGCCGCAGTCC 

ACCCTGTCCTTGGCTGGCACGGNACAACTGGTTTTGCAANACANGGCCCAACGTACCTTC 

GGCCCGCTTCTAGAACNAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnCGTAATCATGGTCATANGCTGTTTCCTGTGTTGAAATTGTTA 

TCCCGNTCACAATTCCACACMCATTAC 
Sequence 2738 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAGAG 
TGCCGCTTAACACAAAGCACCCAACTTACACTTAGGAGATTTCAACTTAACTTGACCGCT 
CTGAGCTAAACCTAGCCCCAAACCCACTCCACCTTACTACCAGACAACCTTAGCCAAACC 
AmACCCAAATAAAAGTATAGACCAATNGAAAATTGGAMCCTGGGCGCAATAGNATAT 

AGTACCTCGGCCCGNTTCTAGAACTAGT 

TGGGAGCTCCACCGNGGNGGCGGCCGAGGTNCTGGGGATNTmAAAATATTCACAGTTN 
ACAATGATCATGCTTGTTCCTANAGAMTTGNGGGCCANACACTTAAGTGAAAGCAGAAGT 

GTTTTGGGTGACTTTCCTACTTAAAATTrTTG 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACi 1 1 1 1 i u 1 1 1 1 1 1 
M I I 1 1 I I I I II I NGAAAAAGTCATGGAGGCCATGGGGTTGGNTTNAAACCAACTTTGGG 
GGGTTCGATTCNTTC 

CCGGGCAGGTAC I 1 I 1 1 I II I I II 1 1 1 1 1 I I 1 1 I I 1 1 1 1 NAANATGCTTTCAGTrTAATA 
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GCTGTTTNAAA l rTTTTTTTTTTTTTTTTTI I NAGGNTTTAGGCTGAAAACNNAAAGCCT 

GCTGCTCCTGGGTATCATTTCTNAACTTCTCCTTACAGCTATNAGTACTGANNTCAAGGG 

ACCGTTTTTAACCGTCATGGGACGGNTNCTTTACCCACNCTTTNTAANATNTTNGANTGC 

ATCTTTTCCTANTTTTCCGAGTCCCCCCANAGC i I 1 I 1 I rGCNCCGNCTAGGCCTTTTTC 

CAAAAGGNTGGAANCTNTGCTTCCTT 

Sequence 2742 

CGTGGTCGCGGCCGAGGTACATTTCCTTGTAGACTCTGTTAATTTCCTGCAGCTCCTGGT 

TGGTTCTGGAGCAGATGATCTCAATGAGAGAGTCCTCGTCGGTTCCCAGCCCCTTCATGG 

MGCTTTTAGCTCAGAAGCGTCATACTGAGCAGGTGTCTTCAATAGGCCCAAAANTCAAC 

CNTCTTCCAAGTGGGCCAGNATAAAGGCTTGACTTTNAAGTGGCTGGATGCAAAGGTNCC 

I I I I I 1GGGTCCCTTCTCTGGGTANGGCCGAAANGGCAATATCCTGTCTCTTGTGCATTG 

CCCCGCGTACCTGCCCGGGCGGCCGCTCTAAGAACTAGTNGGATCCCCCGGGGCTTGCAG 

GAAATTCGATATCAAGCTTAATCGATACCCGTCCNACCTCGAAG 

Sequence 2743 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAG 

GGAACCGGGCGGAGCGGCCAACATGGCGGAACGCAGGAGACACAAGAAGCGGATCCAGGA 

AGTTGGTGAACCATCTAAAG AAGAGAAG G CTGTGGCCAAGTATCTTCG ATTC AACTGTCC 

MCAAAGTCCACCMTATGANGGGNTCACCCGGGTTGATTATTTTATTGCTTTCAAAAGC 

ANGTGGACTGTCTTTTGGATTCAMGTGGGCAAAGGCCMNAAMGGAGAGGAAGCTTTA 

TTTACMCCAGGGAGTCTGTGGTTGACTACTGCAACAGGCTTTTAAGGAAGCAG 11(111 

TCACCGAGCCCTAAAAGNTAATGAAANAAATATAACNAAANNCCAGAAANAAGGGGTCCC 

TTGGGCNCGTTNTTAAAACTAGNTGGGATCCCCCGGGCT 

Sequence 2744 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACACGGGGGCT 
GTTGGTGGCGGCGGAGACAGCTGTGAAGTGTGAGGTTCTTTGTCTGCTGGCAGCTAGGGG 
CGACGAGGCGGNACGTCATGGAAGTGAAGGATGCCAATTCTGCGC 
Sequence 2745 

CCCCGCGGNGGCGGCCGAGGTAC I I I 1 I I I II I I I I I I I I II I I 11 I I i II I I I I i II 1 N 
AC I 1 I I 1 I I I I I I I I I I I I 1 1 1 I I I I 1 I GGGNTTTCTTACAAAGGNNGACATTTTCCANA 
ACAGGNGTAANAGNGTTGAAAAAAAAATTCAAATTTTTGGGGGANCGGGGGAAGGANTTA 
ANGAMCNTGTTTTCCNNAAATGCTNTGGAAACAAAN^ 

CCCMTTTAACCCAAAGTTNMTNNGNCNAAAAAAAMTGGGAANGGGNTTNCNNCNTTT 

CCGTTTTAAAAAAAGAAAGGAAG 

Sequence 2746 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI II I I II I I 11 h I 
TTTTGTTCTGAAACCATAATTGCTTGGCCCTTCAGTTGAATAGGTTGAATGTTCATCTAT 
ACCATGCCCTTTTTCCACTGTTTCTCCATTAATCTCTCACTTGTCTTTAATTGGCACAGG 
AGTTTTCCCCTGAAAATCCTTTGAACTCTTATTCTAAGGATGAGAI I I i 1 1 1 I I I 1 IGCT 
AATCCATAGAAAMTGGTCACTATTTCTATTTGATTTATGAAAACTGTTTTTTT/^ 
TGTTCTTMTTTGTTGTTTCTATTATCCCTTTTCTCCTTCGTAGGTTCC I I I I 1 I IGTAA 
TTC ATC TTG TTAAT AAAGTTCTTC C TC ACCTAGTT AATTTC AG AAAATTAATG TC AAGTA 
GGATAAGGCATTAATTCAGGCTAGCAAGGATTCTTTCTGGTT 
Sequence 2747 

TGGAANCTCNACCNGCGGTGGCGGGjCCGAGGTACCCCAGAAACTTAAAGTATAATAATAA 

AAAAAAGAAAAAAGAAGTTCATTNATTTTGACTCCTACAAAGGCMGGMTAAGGGTTT^ 

TGNTGTTGTTTGTTG I I I I IGAGACTGTCTCGCTCTGTTGCCCAGGGCTGGGAGTGCNAN 

CTGGCCGTAATTTTTGGATCMCTtNCAMCCCTCNCGCCNTCNCCMGGGTTTCAAAGN 

CAANTTNTTCTTACCTTCCAANCACTTCCTAAAGATAAGCCTGGGNGNACTAACCAGGGC 

ATNCACAAACCANCCCAACNATNCAAGGGGCTNAAAT 
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CC^GG^AG^TACAGGAGGCCGACAATTTGGTGACCAGTGATGGCAGGCCACTCAGCTTre 
AGTAGCCATGTCCGCCACAGGCCCTGCGGCACATCTCAGCTCCCTGGGTGCAGMTTCTG 
ACATCATGGCCTTCATGCCCGTGCTCAGTGCGTGGAGCTGTGAGMCATGGAGGGGGGTr 
GGGCGGTGTTAGGGGGCCTCCACCATAGGGACCAACCCTGTGCACCACTTACTGAGCATC 
TACTCATGCCCAGCTCANCTCTGAGGTCCCCGCGTACCTCGGCCGCTCTAGAACTAGTA 

Sggushcs 2749 

AGGTACTGGTTGMTTCAGGCATCTTTCTGTGMTCTCGCTTATCAGGCTGGAGTAAAAT 

TCTTCTGCATTGGGTGGCTTATAATTCAGAGTTCCAAATGACCACAGAATCTTGGCAACA 

TCTTTACTTCGACAGTGTGCCACTCTAGGAGGCAAAGACGCAGCCACTGCATTCATTACT 

CCTTCATTCAGGAAGCGTMGGCCGAGCAGTAAAGAGTCAGGTGCATGACACCTTCAACT 

CCCAGGGAAGGAATTCGCTGAGGAGCTATCTCTCCAATCTGCTTCATGAAATTGATGTGA 

TCCACGTGAGTGAAACGGAACATTTTA 
CACAOAC^ 

GTTCTCTCANNTGTAAAATGGAGAGGCTGTATCTCCCAGAGAAGTCATGCAGNAGAGATG 
AGGGNGGAGTCNGGACCAAATNTCAGGCCCAGACTCCTGGCTACCANCATGTGCCAMGT 

ACCTGCCCGGGCGGCCGCTCTAGAACTAN 

TA^MCTANAAAATCCAGCANCAGAAGAGCCATGGCTCACTGCTAATCACATAACACACA 
CAGGAGTCTATTTATGCATGGMTGTGGCANATTnTTMCM 

TACACCAGAGMCTCATACAGGAGAGMGCCCTATCAATGCAGTGAGTGTGGAAAAGCCT 
TTTCACAGAAGTCACTGCTCACGGTTCATCAMGAACGCACTCAGGAGAAAAACCGCATG 
GGTGCAGCGAATGTCANAAAGCTTTTAGTAGGAAGTCACTCCGNATTTTACATCAGAGAA 

TTCATACTGGAG 
Ssousnc© 2752 

CCGCGGTGGCGGCCCGAGGTACTA 1 I I I 1 1 1 1 I I I 1 1 I I i GGATmAG ^^21 G rr 
GTTTCACCACGTTAGCCAGGATGGTCTGGATCTCCTGACCTCGTGATCCGCCAGCCTCGG 

CCTCCCAAAGTGCTGGGATTGCAGGCGTGAGCCACACCACGCCCTTCCTATTCTTGAGTT 

TTAAGTGTTTTTTATTTTTTATTTTTTGCATG 

GTGTTTTGCMTTATTTTCTCTCATTCTGTGCCTTTTCTTT^ 

TTTGCAGTTCAGAAATGTTACTTTTMCATAGTTCGACTCAGCAGCT^ 

ACCATATTTTGGTTTTTGGTGTTGTATCAMGGAGACAGAGTGAMGATACToAAATACA 

CACCTGAGCTGTTTCCTCGAGGGCTTTGACTTTGTAGCATGGCTGAGATTTCAT^GCTC 

TGTAGCTCTGCCTTCGGGGTTGCAGAGGATGMGAGACTTrGGGGAGGCTGGAAAAGTTG 

GAGAAAAACTGGCAACTAAGCATCTCGCAAGCTTCANGGGTCTTCTTCTrTTCTTCCTAA 

AAGGGATTCCTGGAAAANGAAAGTACCTGGCCGGGCCGnnnnnnnnnnnnnnniinnnnnn 

nnmnmwrmnnnnnnnATTCGATATTAAAGCCTAATCGAAACCNTCNACCTCNGANGGG 

GGGGCCC 

ATTTTVATCCTCACCCCTGACT 

TGTTCGACATGTGGTCAGAGACTGGCTCCAMGATATCTCCCAMGAGCAAAAGCCACCA 
TCTTCCTGTTGGTGCCCAGACACAGGTGCATGAGCTCCAAGGA.CTCCTTCCTAGGC^NC 

TCTCTGCTGCCTGACCTAMCTNACCTNATTTTTGGGAGCT 

GAGATGTGTTGGATGGGAACTAGGGGTTTGTANGCTCCATTGTGAGCAATACTGATGGGA 
AATGACCCCAACTGGTATGGATATTTTGCAT 

CCGCGGT^ 

GGTGGCCTGCCTGGAGTCACCAACACAAGTGCCAGCATATGTCACCCTAGGGCAAAAGGA 
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TAGATAAGCTTAGTTCACCGGCGCTACCACTGAAACCCAGAAATGGGACCATACAGCATT 

TCAGTGTAGCCAGCACAACTTCACTGTAGTCTCTAGCAATAACCACACTGTAACACACTG 

AGGAAACCACAGATGCCACTAATGATATTCACAGCCAAAGAAATCACACAGACTTCACGA 

TTGAATGCAAAGCCAACAAACCCCACCCAACGAACATTATAGTCACATCTTCAGGAAAAA 

AATCCTCACCCTTAAAGAAAGTAAATTCAAAAATAAAAAGTGACTTTCATATCAGATGCA 

CAGAAATCAATGTACCTGCCCG 

Sequence 2755 

GGCGGCCGCCCGGGCCGGTACNCGGNATCNGCNCATATGGGCATCGGCATTGNCGNGGNC 

ATG ATATTTG GC ATG ATCTTC AATATG ATCTTGTGCTG N G CTATCCGC AGGAACCGCG AG 

ATGGTCTANAGTCAGCTTACATCCCTGAGCAGGAAAGNTTACCCATGAAGATTGGNGGGA 

TNTTTTGTTTGTTATGCANATGNTTNGNTNGATGTTNGTTGTNTGACTGTTTGCCNNCTA 

ATGTAAGTATTCNTNTCANCATTGACTACGATAAACTCATGANGGGTACATCACATTGGA 

ATGTC 

Sequence 2756 

GTACGCGGGGGTAGACNGAAAGAAGAA.CNTGCNTNCTAAGNNNNNNNNCTNNGNNGAGAT 
GCTGNCTCCTANNTNNNTNNTANNATNNNATANGNGAACCTCTTTGCCAGGAACTATTTA 
TAAACATNCTGCAGGAAAATGGTAGCTGAGACCAATGATGCTGACCTCCCT 
Sequence 2757 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCGTCATCATCTCC 

CTAGACAGTGACCAAACTACATGCAMTAMTTTGAAAACCGGAAAMTTTAGAACACAG 

TAAGCAGAATAAGCATGTAGATACCAGGGACAAGACAGTAGTTACTTACACTGTGAACTC 

CTAAAGCTTTCCAGACTGCGAAGCTAATCACACCAACCCCACACCATGCATACAGGGAGC 

CCCAGCCCAGAGCTCGCAAGGCAAGGGAAGACCCGCTTTCCGGTAATGCAGCCGTGGCCA 

TACTTCCCTTATTGAACCATTCAGGGCTTTTC I 1 1 1 1 AGCCAATGATAATGTTGTAATAA 

ATCCAGCTAGCATTCCCGCTGeAGCAACGGTACCTGCCCGGGCCGGCCGNTCTAGAAACT 

AGTTGGGATCCCCCGGGCTGC 

Sequence 2758 

CGAGGTAC I I I 1 I ( I M I I I I I I I I I I I I 11 I I I GGGTGTTTGACTTCTGATAACTTCTA 

AGTCTTACTCCTCTGTATGTCTTTTGTTCCACATCTGAGCAAGCTAATAAGAAAGCCTGG 

GTGCTTTCTGCTTTGGTGCTTACAGGAGATCCAAACCATACAACCCTCTGCTTGCACACA 

GGAACTCTCACCTCAATCTCACTTTCTAACCACAAGCAAAACCCAAAGGCAGTCTCCTTT 

CCTTTCTCCCATAAACCATTGCGGACCTACTTGGGAGGCCTGCACCCCATTGTCCACAAA 

AACTTGAATTCTGTAAGTAATAA 1 UN 1 A AGCTCTTTCGGAGTGGGTTTCACATCTCAG 

TCTCAACATCTGAATTAAATATTGACTGANAGTGAATCCTGCACCTTCAGAATGGCTACA 

AAATGC1CTCTTANAGGTTTGTGTGTGTCTNTGCTTACATAACTATCTGNTATTNAAATG 

TTGACTA CTTTTTTTC ATGANTTNTAAC ATN TTAATTATTTTAAAAAAAATCCTNTN N NA 

NTTCCAGMTTGGCTNTTTNCTTNMNANATNGNTTTNNTNCTTACTGGGGCTTAAAAAC 

TTTTTT 

Sequence 2759 

CGAG3TAC 1 1 t Tl I I I I n I I I I I I I I M I I I I I I GCGCTCGGGGGGGGGCGGTGAGGGG 
AGGMGAGTTATAGTAGAAGATAAAGAAAAAGAI I I I I I I I I AATGCAAGCATCAAGAGC 
CCTGAAAATGGCTAACTAGGTAAAGTA 
Sequence 2760 

GACTACTATAGGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCCGAGGTACTCI I i I (' I 
rTTTTTTTTTTTTTI 1A ATCAGACAGGGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTG 
GTGCGATCTCGGCTCACTGCAACTTCAGCCTCCTGGATTCAGGCAACACTCCTGCCTCAG 
CCTCCCACGTGGCTGGGATTACAGGTGCCTGCCCCCATGGCTAATTl 1 1 IGTAI I I I I IG 
TAGAGATGGGGTTTCACCATGTTGGCTGGGCTGGTCTCACTCTCCTGACCTCAAGCAATC 
TGCCTGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCCCCAGCCTGA 
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GC CTTTTTTTTTTCTAATGCATCC AAGGTTAAG6 GG AAGACGCAAATAACAGGACTACTC 

TAAAAGGAAACCTGTTTGAACTCTGTGAGATCAGTCATC AGTCTC AGT ATTCC ACAGGCA 

CACCTTAATTTCATTGTAAAAAGANATATATATT^ 

CT 

Sequence 2761 

GCGGCCGCCCGGGCAGGTACCTTGAAATCAGATAATCTCGACATCCAGAAATAGCCTTTG 

ATTTGTTCTTTCTCTGATGTCTTGTGTCCTTGGGATCAMGCCTCTTTC ACTGG CACCAA 

ATMTGCCCAGTGAGGAGTTCTGGCCATATMTCCTTTGCGCTGMCGTTTTTGATCCAC 

CACTATCTTCATATTCCTGGATGAGCTCCTGGAAGCGGTTATAATCGATCCATGTCCACC 

ATTCACCACCMTTTTAMCTTTCTGTGTGCTATCAGGAGGCMTTAGAGTGTTCGTGTT 

CACATGCAATTTCATATTCGGGGATCAGATCCACCAACTCGTGGACAAACTGTACCT 

Sequence 2762 

CCGGGCAGGTACT H II ill i II I I M I H I I N H M I TTTTCNAAAGGCTTNATTTTA 

AGTTMTATGTATACANAGAAAGNGTNTATATTTTTGGGNANAAAATNATGCNTTTCNTA 

GGACAGGANTAATACATTTAGNTGGGGNTNTTATAAAANAGCNCCAANACNTTAGGNGGG 

GNMCCAAACTGGTTATGGGGNTTGATNTAANCCAAACCCATNACCCATAAANTNTTTNA 

GGTNGNAAGTAAGCANTATTATAACTTGGANTCNAGGGGGAGNAAAAGGGCCCCCCTAAN 

NGGGGAGGGNTNCCCANTTGGTCTAAANANTGNAAAATTTGGNANTTGTTGCCAAGNTCC 

ACCAGGCAGGGGGCAMGNACANTNATTTTCCTCCAACTGNNTCAGGAGGGGTTAAGNAG 

NTAACNCTGGCCTMGCTTGNAAAAAAGGGCCCCTACNTTCTTTGGCCGTTTTAAAMCT 

AGGGGGATCCCCCNGGNCTTCAAAAAATTCNAAATTNAAAGTTTNATCNATACCTGTCGA 

NCCTTAAGGGGG 
Sequence 2763 

TGGGTATCTGCGCTCTGCTGAMGCCAGTTACCTTTNGGAAAAAAGAGTTTGGTAGCTCT 
TGATCCCGGCAAACAAACCACCGGTTGGTAGCGGGGGGTTT 

Sequence 2764 

CCGCGGTGGCGGCCCGAGGTAC I nTTTTTTTTTTTTTTTTTTTn I GGCi 1 \ \ nu n 
TTTTTTTTTTTTTTTTTTTTC C C NTTTT AAAAAAG GTATT AATT AATT CTTTTTTTTTTT 
TTTTTTNGGGMGGGTGMTGCCCTTTTGGTTTTTG GCCAT 

AAMCTAANTTTGANANATNAATNCNMGGAAAAAMTTTTTTCCAAAGNCCATNNNAAA 

NTGGCNCCCATTTTTTGGCTTTTGGGGGG 

Sequence 2765 

AGGTACTTTGTCTCTGTGTCTCTTTCTTTTCCAAGTCTCTCCTTCCACCTAATGAGAAAC 

GCCCACAGGTGTGGAGGGGCAACCCATCCCTTCACTGAACCATTTTTATTCTTCCAGAAA 

TGTGATTGATAACAGTAAAGCCACACTCCTCAAGTGCCTGAAATACCCCTCATTGTCTTC 

TTCAGGTGGCAAGGGCTCTGGAACAGCCACATAAAGCCCTTGAAGATAAGGATGGTCAAA 

CAAAATAATATCATACCTGGAGAAACTCAGATCTTGGTAAGATTTACTGGTTGGGAATCC 

AAAGTTAATGCCAAGAAGCAGCCGCCAGTTGGGATCAAATGTGAGCCTATGGATCAAGGT 

GCGTACCTGCCCGGGCGGC 
Sequence 2766 

CGAGGTACTCTGACATCATCTGACATGGCAGCTCCCCAGCCATGGTGGTGGTGATAGAAA 

TATAAATGGTGACAGCAGCAGCAGTGTGATAGTCACGGTGGAGGGAGATTGCTTTTCTGC 

AGCCATGCTTCTGGCATCTCCCAACCTGGGTGGGACCCAGCTGAAGATCTGCAGAGAAAC 

C AG CC ATTTCTG AG AGCAAGGCAATC AGTGGG C ATTTACTGGAGG GTGTCTAGGTTCTT A 

ACATCTCACTGTTATTTGGGCTCATGGGATCATAAGAATGACAAAATAAAGAGAAGTCCT 

CATTTCTGACCACAAGAAGCTTGACACAATTCCAGTAAGTCTTGGATTGAAAGGAGAGAC 

ATGTTTTCATGTGTTTATATTCCCTTMCAACCAATATACCTTGACATTGAAGAGCCAAG 

ATGTGAGTGGGAGGTAGTGCTATGGTTTGTATGCATTrCCCAAAGTTTATGTGTTGGAAA 

CTTAATCCTCAATGCAATGGTGTTGAGAGGTGGGACTTTTAAAAAGTGATTAGGTTATGA 
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G GGCTCTGC ACTTATAATGG ATTAATGCTGTTATC ATG G G AG AAN ATGG G AATGGA 
Sequence 2767 

AGGTAC1 1 I I I I I I I I n I I I I I I I I i I I I 1 I GGGGATGGAGTCTCGC I I 1 1 ITGCCCAG 

GCCGGACTGCANTGGCGCTATCTCAGCTCACTGCAAGCTCCACCTCCTGGGTTCACGCCA 

TTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAAGCACCCGCCACCGCGCCCAGCT 

AMTTT^GTATTTTTAGTAGAG ATGG GGTTTCACCATGTTAGCC AAN ATGG NCTCGAT^ 

TNCTGACCTNGTGATCTGCCCGCCTTGGCCTCNCAAAGTGCTGGGATTACAGGCATGAGC 

CACCGCGCCCGGCTTTACTTTCCGNAACTTATTGAGAGATGTCATGATCCTTATGGACCC 

CTTCAGGGTTTNTACACAGAAAAGAAAGAGACAAGAAACAGAGGGCCCTCGAACTCTCAA 

AAATAG GCC AAGATGG CATAATTAAGTCTATGAATCAAGATAGGTTTTCTAAGTGGAGAG 

CMTACTAMTTTCTTGTMGGTGAAGACTAAAGGAM^ 

AATTCTCTAAGAATTAGAAAAAATTAACTTCA I I I I IAAAC 

Sequence 2768 

CCGGCAGGTACATTTCTTGTTTGTC I rTTTTTTCTTTI I CTTGTCAATTCTTGGGTGGGA 
GAAAAG CCTTCTAATATTTCCTCATCCAAAAACTCATAGTCTG AGA I ! I 1 ! GCAGTTAAT 
T1TATCATGATTAGTCTAGAATACGACTGCCTTTCATTTTGCATTCACCTTCTTCGTCTA 
ATAATGATTAATGCCTTCTGTCAG I Mil I CTCTTTGCCATTATGAATGGAAAATGAAAA 
ATTGTGAATTCCCATCGGTGTTCAGTGAAAGCTAAAAGTAGACAACAGGTATGATGTCTT 
GAMGATGATGATGAAATACTTTGGACAATACAATGAGAAAACTAATATCTTAAAAACTA 
TTTAGTTA7TTTTAGCATAGTTGAGAATAATGAA 
AACTATTAAAAACTATTAAAAMCAACAGTTTTTACATTTGT^ 

MTACGCTCACTAGGATTCCATCATATTTCTTAGATTTATAAAGAGTTCAGTGGACTCAT 
TTGATAATTG CAAG ATAG CATGAGTTrTAGTCTGTTCTTTTAAAAAATAAGGTTTTCTCT 
TTG GTCTTTG G G AAG ACTTTT AAG AAAG AAT 
Sequence 2769 

CGCCCGGGCAGGTACACTTCACTTTCTCGTGCTCTTCCTAAGAGAGTAGTGTTTTCTTCC 

TCCCCACCGAGAAAAAAAATAAAAGCAACMCTGGMCTTCTTCCTTACCCTGCCTTGTC 

TGCAGACCCTGGAGGCCCCTTCCATTTCCCCTTGTCCTTTTCCTTGTC I I 1 1 M GGTTCC 

CGCCGCAGTGGTCGAGTCGCCCTCGAACACGCTGAACAGCTCATCTCCGAATGCGTCCGC 

CATTTTTGGGAGCAGTGAGAGCCCCGCGTACCT 

Sequence 2770 

AGGTACTGAGTCMGGACGTCTTTAACG1CATGGGCGGCTCCTTTACCCACGCTTTCTAG 

ATCTTCGACTGCATCTTTTCCTAGTTTTCCGAGTCCCCCCACAGCI I I I 1 i IGCTCCGTC 

TAGGCCTTTTTCCAGAAGGCTGGATCTCTGCTTCCTTGGCTTTGGTGCCTGTCTGGCTAA 

CCCTGGGTCTTCACCTGCATTTTCCTTTTGAGCTGCTGATGCTTCATGGCAAGGGTTCCC 

CGATCCTGTCGAGCGGCCGCCCGGGCAG3TACGCGGGGCACAGCGGCTTCCTTGATCCTT 

GCCACCCGCGACTGAACACCGACAACAGCAGCCTCACCATGAAGTTGCTGATGGTCCTCA 

TGCTGGCGGCCCTCTCCCAGCACTGCTACGCAGGCTCTGGCTGCCCCTTATTGGAGAATG 

TGATTTCCAAGACAATCAATCCACAAGTGTCTAAGACTGAATACAAAGAACTTCTTCAAG 

AGTTCATAGACGACAATGCCACTACAAATGCCATAGATGAATTGAAGGMTGTTTTCTTA 

ACCAMCGGATGAMCTCTGAGCAATGTTGAGGTGTTTATGCAATTAATATATGACAGCA 

GTCTTTGTGATTTATTTTAACTTC 

GTGAGAMCCMCTACGGATTGCTGCAMCCACACCTTCTCmCTTATGGCTTTTTACT 
AC AAACTACAAG AC AATTG NTGN AACCTG CTATAC AAGGTTATTTTAAAN AAATG G ANGG 
NCAAAAAAAAAAAAAAAAAAA 
Sequence 2771 

CCGGGCAGGTACTTTGCCTACGGCAGCAACCTGCTGACAGAGAGGATCCACCTCCGAAAC 

CCCTCGGCGGCGTTCTTCTGTGTGGCCCGCCTGCAGGATTTTAAGCTTGACTTTGGCAAT 

TCCCMGGCAAMCAAATCAMCTTGGCATGGAGGGATAGCCACCATTTTTCAGAGTCCT 
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GGCGATGAAGTGTGGGGAGTAGTATGGAAAATGMCAAMGCMTTTAA^TTCTCTGGAT 
GAGCAAGAAGGGGTTAAAAGTGGAATGTATGTTGTAATAGAAGTTAAAGTTGCAACTCAA 
GAAGGAAAAAAAAAAAAAAAAAAAAAAGTACCT 
Sequence 2772 

CCGCGGTGGCGGCCGAGGTACCAAAAAGACTCTCAAAAACCAATACTCCCACGGGCAAGG 

GAATAGCCAAGTTTGTTGCGGTTTCCAATGAATGACATCAGCCCTGTGTAGGTCTCAATC 

AAAATGGGTTCAGTTMCACCATCAGTTTCTTTCCTCTTCCAGATCCAGTTGAATTCTTG 

TGGGCATTCTGGATAGCTGGAACAAGCTTAGACATGAACCCAGACAACTTGCAAATTTCA 

AGGAATTTCTCACTGGTGTATTTCATAGGATGCTCAGTGAAAGTAGCATAAGGAACTTCA 

GTGGACCATGGGTTCCAGCGGGACAGAAGAGACTGCTCCTCCGGACTCCCCCAGTAGATC 

CTAAGGCCTTCTCCTTGTCTCTTGTCCAGGGACATCCCAGGGAAGGTGAACTTGCCCAGG 

CAGATGCGATAGACAGCGCTCAGAGGAATCCGCTGCAGCTGCACACAACTCAGCATGATG 

AAGTCGTATTTGCAGATCAAGAGAGTCTTGTCTGTGACCAGTAGAATTCTCTCCTTCTCA 

TTGGTCCAAGTGGTCTATCTTTGTCAAGAGCCAGAAGCCTTGAATGGTCTCTTCAGAAAG 

TCTCAGCTACGTGACCTTTCAAGTTTTCCATGGCAGTCTCAATGGCCCCCNGGCCGNTCT 

AG AACTAGGTGGGATTN N 

Sequence 2773 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGTTTTAAACCTGTT 

CTATTCTAACCTTGGMTGGAGTCACATGTTTCAGCAGCMTATGC TGTTT CCCTTCCTC 

GGTTTCTCACATCACCCTCTGGAGAATGGTGTGGAMCTGCCCTGTTTTTACATCAAGAT 

AAAAGCTTCAGGAAGTTACTGGAGAGTGGAATGCAGGGGTCATTCTGGATAACACAAAAT 

MCTATTTGATTAGACCAACCATTTGGTCTCTAAAATTGGGTGCTAAAGACAAAGATAGG 

AAATTCAAAAMTCACCACCTGTAGAAGAAAAATCAGTTMTMTTTGACAGATTTAAGG 

GCCTCACCTTTCTCTCACAGMCTTTACCTCATMCACCCTGTTTGGGAAAATTGGAAAA 

TATCTTCTCCTTACCTCCTAGGGGATAAACMGAGTTTAAATGTTGCCACCAACTCAAAC 

AAAACACACCTCATTCTTTATGTTCTGGAGGGAGAGGGAGTCCTTATTTTGCTTCAAAGT 

TTCCTTTCTGGAGCTTC 1 HUM I AAGCAAAAAAAAAAAAAAAAAAAAAAGTACCTCGG 

CCCGCTCTAGAACTAGTG 

Sequence 2774 

CCGGGCAGGTACAATGAGGCAGATGCAGATGCAGTGTGTGTCACTCCAGTCTTGAAGGAG 

TTACACTGGCTCTCCAGCAAGGGTATAATCCATATCAAMCCAAMCACTTTATCTTTTC 

CATTGCTAAACTTCGGTTCACAAACACCCGATGATAGGCTTCTCGACGATACTTTTTCAG 

GGCATGCTTATCCATGTTAGCAGACATATCTGCTGCATTCTGTAACCGATCACTCCAGGA 

GGTTGACATTTTATTTTAACTGTATTTTGT ATTACH 

CGAGGCGAATGGATCACTTGACGTCAGGAGATCACATCACTGTACCT 

Sequence 2775 

AGGTAC 11 1 1 I 1 i H HI H 1 1 H 111 1 AA^4CNNNAAANCCNTTNN^INNAANTTTCCAAA 

NTGGTTNGAANTTTNCCNTTTTAAGGTTTTNAACCNNNTTTTTNNAAMCC 

TCCGGNAANANTAAGGTTCNNGGNGGCCGTTTNAAAANNANGGAANCCCCCGGNNTGAAG 

AAATTCAANTTAAACNTNNTCAANCCCGNCAACCTCAAGGGGGG 

Sequence 2776 

TCACTNNNAGGGGCCGAATTGGNAGCTTCCACCGCGGTNGGGCCGNGCCCGGNAGAGGTT 
TMCCTTTCCGGGGGQNCCCACCGGTTTNMGNGGGGGCCCGGGGGGG TTTTAAAGGG GG 
GTTTGGGGGGTTTTGGGGAAAATTTGGGGGGGGGGGGNNNNNGNNNNN ITTTTTTTTTTN 
GGGGGGGGGGGGGGGGGNNNNNNNNCCCCCNNGGGGGGGGGGGGNNNNNNNNNNNNNGAA 
AGAATTGGGCCCANCGGGGGGAAACAACTTAACCTTTGTTTGGCCAAACCGGCCCTTTGG 
GCCGGGGGGCC I MINI i NITTTTCNANNAAAAAAAATTNGAAAAACCGGGGANCAAAGA 
AAAACCCCGGGGCCCCCCCN I I I I I I I I I I AAAANCCCCCAAAAACCCAAA 
Sequence 2777 
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CCGGGCAGGTACGCGGGAGCTAATCTTTGATAAAGACTAACTATGGGCAAGGCTTTACTC 
TTTG TTTGTAGTAACTTACTTAATC CTCATTATMCCCCTTTATATAG ATGCTGTATTTT 
CTCCATTTTACAGAGAGGAAGTAGAGCCTCAGAGATGTTATGTCACTTGTCAAAAGTCAC 
CCAGCTAGTGAATGGCTGGATCAAACTGGATCATGTCCAAGTAGCCTGGCTTCAAAGTTC 
ACACGCTTTACTATATTTATTC I 1 1 I I I I I TTCTTTTGAGATGGAGTTTCGCTCTTGTTG 
CCCAGGCTGGAGCACAATGGCATGATCTCAGCTTACCACAACCTCCACCTCCTGGGTTCA 
AGCGACTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATGCGCCACCACGCCTGGCTAATT 
TTGNA 

Sequence 2776 

CCGGGCAGGTACAATAGCCGTGATCTCGGCTCACTGCAACCTCCATCTCCCAAGTTCAAG 

CAATTCTCCTGACTTAGCCTCCCAAGTAGCTGGGACTACAGGCACATGCCACCACGCCTG 

GCTMTTTTTGTATTTTTTGTAGAGACGGGGTTTCACCGTGTTCGCCAAGGTGGTCTTGA 

ACTCCAGGCCTCAAGTGATCTGCCCACCTCGGCCTCTCAAAGTGCTGGGATTACAGGCAT 

GAGCCACCACACCTGGCCTGGATTAGTMTTTCGGATTTCCAAATTCAGGTCAAGAAGTG 

GMTTTATTATGGTGGGTTCTTGTGAAAAj^AATATAAATGTGGCC 

GTTTTTTT1 IAAACAAAGGCTACTCAAACCATTACCTTCCTGGCCCCCTTGAAAAGAAAA 

AAAAMTNTATCCATCCCATTTTGCTTTCTCTMTTTTGAAAAGTCAGAAAGATCTCAAA 

GGTTTNTTACTACCGTACCTN 

Sequence 2779 

AGGTAC I II 1 1 I i I I I I I I I I I I I I I II I I n I I CAGCATTGCCACATACCTGGAGTTCC 

TCTTTAGTTGGTGGCAGTCAGCAGGGTTCTNTGAAMCAATTTATAAACAAATTGNGGCT 

GTTCTGAAACTAGGGCAACCGATATCTTGAGTTTCATCTGTTTAGTGTTTNAAACCTGGN 

CCTGATGTTAGTGTNCCCATATTAAANATTTGACTGNGGGCCTTCCNAATGCCAATGTAT 

CCCGGATGGATGGCCTAAGGTCCTTAACCACAAATGTTAATGCTAAAAATCATGATGATN 

ATGTGTGTGTGCGTGTGTGTGTGTGTGTGTGTGTGTGT 

Sequence 2780 

ATTGTTATCCCGCTCACAATTTCCACCACCAACATACGAAGCTCGGGGGAGCANTAAAAG 
TGTTAAAAGCTCTGGGGAGGTGGCCCTAAATAGGAGATGTAGCCTATACCTCANCAATTT 
AANTTNGCCGTTTGNCAGCTTCACCTT 
Sequence 2781 

ACGACTCACTATAGGNGCCGAATTGGTAGCTCCACCCGCGGTGGCGGCCGAAGTTGCCTC 
TTTTAGGAATCTCTTTGG AATTGGG AGCAC N G ATG ACTCTGAGTTTG AGCTATTAAAGTA 
CTTCTTACACACCAAAAAAAAAAAAAAAAAAAAAA 
Sequence 2782 

AGGTAC I I I I I I I II I I I I I I I I I I I II GGNTTCATTTACTCCATTATGTTACGCTTATG 

CTTGGGANTTCATTTTCGACATATTGCAAAATTTATACTGTGGTCAGA/^TTATrCAGGTG 

TATCTCCTGANCTGTGGAACNTNCTTTACTACNTTAACCCTTATAAATTTTTA^ATTAAA 

CCTTTTAACCCTTTTCAGGGGAATMGGCATTCCCCCAAACCACCAACCTTCTTAACCAA 

NTGGAANACNTTANCTCCTTNGAAAAAAAAAAAAAATTTTAAGCCCCCA^ 

TTMTTTANAAAACCTTTTrTGGCNTTT^ 

AACATCTTTTGGTACN I I I I I I 1 1 I I CCTNAACCCTTGGANTAANAA I il II ITTNTAAA 
CCNTCTTATTMNTNGCCTTNTTTNCCTTNCCMTTNTTGGNANTTGAGG3NCAACTTAA 
TAGMNANATAACCNTTCCTAATTCAAAMNANNANTNTTTTAAANNGGMCNCTTTNTC 
AAATTGGGCCCCC I MINI 
Sequence 2783 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGGAGGCAGACATGGGAGTGC 
CAAGGTGGCCAGATGGTTCCAGGACTACMTGTCTTTATTTTTMCTGTTTGCCACTGCT 
GCCCTCACCCCTGCCCGGCTCTGGAGTACCTGCCCG 
Sequence 2784 
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GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTTCTCTTTCATCTTCAC inn 
GTTCAGGAGCTTCGACAGCCTTCTCTTGTCTTCTAACGGGACAGTTGGGGCTTCAGGAAG 
GGGACTGAGAATAGGGGTGTG 
Sequence 2785 

ATAGGGCGAATTGGATNTCCCCGCGGTGGCGGNCGAGGTACTTCTGCCTGGGTTATGGGA 

GTAAAATCTTGTCCANNGACATCCCAGGGAAGGTGAACTTGCCCAGGCAGATGCGATAGA 

CAGCGCTCAGAGGAATCCTCTGCAGCTGCACACAACTAANTCATGATGAANTCNTATTTG 

CAAATCAAGANAGTCTTGTCTGNNACCAGTAGAANTNTTTCCTTNTGATTNTTCCANTGG 

TNTATCTTTGTCAANANCCAGAAGCCTTGA 

Sequence 2786 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAATTATGCA 

ACAACAAGATTTGCAGGGAGCATAACCACATAGTTCCCCTGAGTATGCTGCCAGTATGCT 

CCACAMTGATTAAATATAAAATAAACAAGAGCTAACTGTGAATATGAAGTTGTAATTCC 

ACCAACGGGGTTCCAGTGATGTCATAAACCACTAGAGCAAAATTCATGTCACCAAAGGAT 

CCCAGATGGGCTTAGAGAACTGTACTCAGTAGTGGTCTCTACTACAGACGTGGGTCCAGG 

AAACCCTTTTCACCTCTGTTTTCTGCCCCTTCACATCTCTACMGTGCCCTTCNTTCTTG 

GTTTCTTCCTTCMGTTTTCTAAMTCATACTTATCCTGCTTTTCTAMTGTM 

TTTAATTCATTC ATTGGGTCTATCATTC AN G GGC AG ATG AACC AAATACATN CTTTTGG A 

CAACGATTCATAATTATATATACTTTCATTAGAAAAAAAAMGCCCGATATTTMTAGAA 

AATCCCATTTAGGACAAGAGCCTTAAAACGTACCTCGGGCCGCTCTAAGAACTAGTGGGA 

TCCCCCNGGCTGNCAGGAATTCGATAT 

Sequence 2787 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACCCTAGATCCCAAGATCTCCAAGGA 

TTTGGTGGGCATACCCACTCCAGCACACAGAAGCATGAGGTTCATGACTCTCCTCTTCCT 

GACAGCTCTGGCAGGAGCCCTGGCTGTGCCTATGATCCAGAGGCCGCCTCAGCCCCAGGA 

TCGGGGAACCCTTGCCATGAAGCATCAGCAGCTCAAAAGGAAAATGCAGGTGAAGACCCA 

GGGTTAGCCAGACAGGCACCAAAGCCAAGGGAAGCAGAAGATCCACCCTTCTGGAAAAAG 

GCCTAGACNGGAGCAAAAAAAGCTTGTGGGGGGGACTCGGAAAACTAGGAAAAAGATGCA 

GTCCGAAAGATCTAGAAAGCGTGGGGTAAAAGGAGCCGTCCATGACGTTNAAAGACGTCC 

CTTGACTCAGTACCTCGGCCCGCTCTTAGAACTAGTGGGATCCCCCGGGGCTGGCAGGAA 

ATTCGATATCAAAGCCTTATCGATACCCGTCGACCCTCGAGGGGGGGGCCCGGGTACCCA 

GCTTTTTG 

Sequence 2788 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTTTTTTTI 
fTTTTTTTi I NCTGGAAAAAGTCATGGAGGCCATGGGGTTGGCTTGAAACCAGCTTTGGG 
G GGTN CG ATTCCTTCCTTTTTTG 
Sequence 2789 

CCGGGCAGGTACAGCGGGGGAGACTGGCAACCTGAAAAAGGCTGTCATTCTACAGGGCTC 

TAATGANGTTGAACTTGTTGCTGANGGCAACAGCAGGTTCACrTACACTGNTCTTGTANA 

TGGCTGCTCTAAAAAGACAAATTGAATGGGGAAAGGACMTTCNTTTGGAAATTACCAAA 

AAACANAATTAMGGGCCCANTTTACNGNCCCTGNCCCCTTNCNCTTTGGATAAT NTTTC 

CACCNCTTTNTGNGGACCAATNCAGGTTGGGGTGNCCNTGAACTCC AAAAAA AAATTTTT 

CNTTTTTGTTNGGGGMCCAATTNTGNGNCNCCCC 

TAAAAC AATTGGAAAACCTTC NMN ATTTCCTTAAAAMTTTTAAAAAAAAAAAAGN AAA 

AAAAGNAMCMTTTTTTGGACATANAAAAAACNTT^ 

TCTNTGCTTTTCCATGTCCTTTNCCTTTTTTNGTTTAAMCCTTNGCM 

GGAAAATTC NCCTTTTCNCC AATTTTTTCCTTTTN NTTGGGC AACCAATNCCTTAACCCT 

TNNGCCTTTTAA 

Sequence 2790 
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AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACACTGAAAACTTCAAGCA 

AGTAGCACCAGGGCTCCCCCTAAGCATAACAGGGGCCTGATTAGCACCCCAGAAGAACCC 

CTCAAAAAGTATATAAATTTATTGCGTCCTAGCAAAGATCTGCAGCTTCCAGTATTCACA 

CMT GTGAAG ATATTAACAATACAAMTATATTTTGAAAGTCAAACACCTGTAGTTCATA 

GTGCi Mil I AGATTTGTTAAAAAAMTACAATGCTTAGTTTCCTATATAGGATATACAA 

AAGCAAAAAATACATATGTATATAGTAGAAATAAATCAGCCATAGTMTTTACAGCTTTT 

GTCTCCAACCCTCCCCTAATCATAATCTTTATCCCACACTTCAGTATATACTTTCTGGGA 

TCAAAGGATCAATTTATTCTGAACCAGG 

Sequence 2791 

NCTTAGGGCGA ATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACAAGGACTGCATCGAGTC 
CACTGGAGACTATTTTCTTCTCTGTGACGCCGAGGGGCCATGGGGCATCATTCTGGAGTC 
CCTGGCCATACTTGGCATCGTGGTCACAATTCTGCTACTCTTAGCATTTCTCTTCCTCAT 
GCGAAAGATCCAAGACTGCAGCCAAGTGGAATGTCCTCCCCACCCAGCTCCTCTTCCTCC 

TGAGTGTCCTGGGGCTCTTCGGACTCGCTTTTGCCTTCATCATCGAGCTCAATCAACAAA 

CTGCCCCCGTACCTGCCCG 

Sequence 2792 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCGCGGCCCGN 

TAAACATGTGTCACTGGGCAGGCGGGGCCTCTAATACTGGTGATGCTAGAGGTGATGTTT 

TTGGTAAACAGGCGGGGTAAGMTTTGCCNAGTTCTNTACTTTTTTNACCCTTTCCTTAT 

GAANCATGCCTGGGTGGGGTGACAGTGNAGGGGTAAATAAATGACTTTGTTGGGNTGAAT 

TGTMGAATATTGGGCCTGGTTMTTGTCAAGTCAAGTGTTTTAAATCTGACCNCAGGCT 

TTATGCCGGANGAAGAATGTTTTCATGGTTACCTTATACTAACATTAGTCTTCTATAGGG 

TGATAGAATTGGTCCAATTTGGGGTGTGAAGGAAGTTCMGTTATATGTTTGGGGATTTT • 

TTTAAGGTANTGGGTCCCCGCGTACCTCGGCCCGCTCTAAGAACTAGTGGGATCCCCCCG 
GGCCT 

Sequence 2793 

CCGGGCAGGTAC i MK rNNTTNTNNTTTTNTTTTTNNGGGCAGGGGCANATGGCTGTTT 

NTTGATCCTTNAGTCTTGNTTNMTTANATCATACTTTTGCCAGCGAGANATGGAAACAG 

AGGAAGGGATCTAGNAGAACGNCCTGANACAGGGGCACCAAGGGTCAGGCTGCTGGAAGG 

CATTGGGGAAGGCTAATCTC CACTGGA CCCCCCAGGCTTGCTTTAACACAGGATTTTGGG 

GCTTGCTNCAGGCTTGTCGCTTTTTTTGGTATCTCCTAATCGCAAAAAGAAGCGTCCAGC 

MCMCCAGGATTCCMGCGCCGNGGCCACACCCGACTTCMCCTTGGCCTTTTCCACCT 

GmACAGCGGGACTTTGCCATATCTCAACTGGGTGACAAAAGTGCTCACAGTTCTGCTT 

ACAAAACTGGACCTNGGCCCGTCTANAACTANTGGATCCCCCCGGGCTGCANGAATTCNA 

ATATTAAGCTTTATCGATACCGNCNANCCTTGAGGGGGGGGC 

Sequence 2794 

CTATGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTCTGCTCAAAATGGCTG 

GTGCTACTGGTGCACTAAGTTGTGAGGAATTACCTCTGCTCGGGACAGAATAAAAAATCC 

TAGGCCTCAAGGAAAAACTGGTGGCATCCTTCTGGGTCTTTTCAAGTCCTTGCATCTAAA 

MTCAATCTTGACAAAAATAAAACAAAAAGCCACAACAAAAGAAAGCCAAACCAATCCCC 

CACCCTAAAGGAAGAAAAAAATTGATGTGAAAGCCCATCCACCCTTCTACCC 
Sequence 2795 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACTCACTGCTGCAGGCA 

GCCCTGCTCTGCGTCAACGCCATCGCAGTGCTGCACGAGGAGCGATTCCTCAAGAACATT 

GGCTGGGGAACAGACCAGGGAATTGGTGGATTTGGAGAAAGCCGGGAATTAAATCACAAG 

CTAATGAACCTTATTCGATCTGTAAGAACCCGTGATGAGAAGTGCCATTGATAATAAGTA 

MCTCAATTGCAATTGTGTTACTTTTATTATTTGGATGAATATCAAGTGGAGAAAATGGA 

GACTCAGAAGANGACATGCCAGTAGAAAGTTATTACTTTGGTCATTATTGGGAATATTTA 

TATCTTAGCTGGCTGACCTTGCACTTGTCAAAAAATGTAAAGCTGAAAATAAAAACCAGG 
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GGTTCTATTTATCTGG' l ITTTTi I A ATGGTGCACTTGNAGGTTCATTCCAAAGAATCAGA 

TCTGGAAGGCAGNACCTTTCCAGACTGGAATTCTGATGCT CANGGTN ATATA GTTC ATGC 

CTNGGNGAGMTGNTNAAAGGGGCCAGACTGGGGCTTNCTTTTTTTAAAAMTTTTCTAT 

CNCCCCCTTCCCAGGGATGAAAAA > 1 [TTTI 1 CAAAGGNTTGGAAGG TNCTTG GACNTAA 

CCCCTGANGGGGNGGGGGGTTTNCCNTNMTAAAMTTAAAMGGGTTTTTTAAAANGAA 

TTTCCCCCCANAAAANGNAAMNAGGGGNANTTNNAAMTTTTTTT 

Sequence 2796 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGCCACGGTCA 

CACACTACCCACAGCGCGGTGGCAGGCCCGCTGCTCAATGAGGAGACGATCACACCATTT 

CGGAGATAAGCAGTGCCATTGTGTCTGGAGGAGAAGAGAGAATTAAAAATAAAATAGAAT 

TCMCAAAAAATATATATAGAAAAAAMGAAAAAAAAMCCAMCAGAAAGTACCTC 

CG 

Sequence 2797 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCGTA 
ATAAATATGCATCCGGAAACAAGATAAAAGGCTACACCTCGTCAGGCATCCTACAAAAAT 
GTCTCAAGTTTTATATACTCTGCAGCATTTCTGTGCGGGGGCAGAAGGGGCTGTTGTGTA 
TTTTCTGAAGTGCTGTGACAAMGGTCCTTTTCACA 

AATTGCTTAACTATMTTAAACMCTTMGAAMGTMCACCMGCTTTAMGCCATm 
TGCTTTGCTGTCATTGGTCCTTATCCAATACAGATCAACATATCATCCAGCACAGCCCCC 
GCGTACCTCGGCCCGCTCTAGAAACTAGTGGATCCCCCGGGGCTGCAAGGGAANTTCCGA 
TATCAAAGGCTTTATCGATTACCGTTCGAACCTTCGAGGGGGGGGGCCCCGG 
Sequence 2798 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTCTTCCCACTCTGCG 

GTTCTGCCAAGCACCTTGCCTGGGCACTTGTGTGTGTGCAAGACTCTACAGTGTGTTGTC 

TGCTGC I I I I I n I I 1 1 1 1CTTCTTCTGTTTCAATTGGCAGCTCTGTTTCCTAAAGTGGA 

ATGAACTGAAGATACAAATACTAATAGAAGATAAAATACCCATAAGCTGAAAAAAAGAGA 

CAGAGAGCAATCATGTTCAATGTCACCATCCTGCCAACCCCCACTCTGTAAAATATACAT 

TATCACCTACTCTCAGCATCTCACTTTTGACACTAAMCTTAAMTMTTTTATTTAATA 

GGGAGAATTTATAATTATTAGTTTTTATTTCAAAAAAMNAAAMTM 

GAAAACCCCTACAACTGGGTTGCAAACTCAGGGCTTT 

Sequence 2799 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGAACTCATCTTCTAACT 

TCTTCTTGAGCTGAAGATACTTGGACTTGTAGCCTCAAGCGCCACAATAAACGACTCGGA 

CGGCGAGAGCCATGGCTCGGGGCTGCCACATCCGCTGAGGAGCGGACGCCCGGGCCACAC 

TCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCGCTCACAATTTCCACACAAACATACGAAGCCCGGGAGCANTAAAGTGT 

AAAAGGCCTGGGGGGTGCCTAATGAGTGGAGCCTAACTTCA 

Sequence 2800 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGT 

GGGGGGGGTCCTGGTCTTTGGCTTCTCGACTCGGTCCTGTTTCGACAGCGAACATGTCGC 

GGCCTGTCAGAAATAGGAAGGTTGTTGATTACTCACAGTTTCAGGAATCTGATGATGCAG 

ATGAAGATTATGGAAGAGATTCGGGCCCTTCCACTAAGAAAANTTCGGATTCATCTCCCC 

GAAGAAAGCTAAAAAATAATTAGGNCGATCTGGAAAGAATTCACAGGNAAGATAGTGAGG 

ACTCANAAGCACAMGATGTGAAGGACCAATGAAGGATGATTCTCACTCAGCAGAGGATA 

GTGAAGGATNAAAAAGAAAGATCATAAAAAATGTTGCCGCCAACAAACGGCAAGGGNGGC 

ATTCTAAAGCAACCTTCTAAACAAGAA 

Sequence 2801 
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TCACTACTTAGG G CGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTAC f rTTTTTTTTn 
I I i 1 I M i I I M 1 GGTTTTTATTTTTGTGG 1 1 1 11 1' G 1 1 I 1 I GTTTTAACCTTTGCAAAC 
ACGATGGTGATG AG G AATGCCTTG CCAGGCTCTCCCAG ACACATCTGTGGTTCTGGGCTG 
TG AATGTTAAACAC AC ACTGGG ATAGACGACCAGG GATG AN GTGTTCCCTATNTTCTCCC 
CGCCCACCANTTGTCAATGTGGCCTAAAAAAAGGCTTTAAAGGGAAAACNAAACTTAAAA 
AAACATTGAGTTTCCCTGCATTTAGCTGAAACAGGGATCTCGTCTGAAGGGCTGGAGGAA 

GCAGCNCGGATCAGCCACTTGCCTCCCGGCCCACGGCTCGCTCTAGAAACTAGTGGGATC 

CCCCGGGGCTG 

Sequence 2802 

NGTGCCAAGCTGCCATTAAATGAATCGGCCAACGCCGCGGGGGAGAGGCCGGTTTGCGTA 
TTGGGGCGCTCTTCCCGCTTTCCTTCGGCTCACTGACTTCGCCTGCGCTCGGGTNCGTTC 
GGCTGC 
Sequence 2803 

AGGTACGCGGGGACTGCTAGGGGCTTAAGCGGAGGGAGTCGAGCCAGCGTCGCCGCGATG 

GTGTTGTTGGAGAGCGAGCAGTTCCTGACGGAGCTGACCAGACTTTTCCAGAAGTGCCGG 

ACGTCGGGCAGCGTCTATATCACCTTGAAGAAGTATGACGGTCGAAACCAAAACCCATTC 

CAAAGAAAGGTACCTTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnAGGGTTAATTGCGCGCTTGGCCGTAATCATGGTCATAAGCCT 

GTTTCCTGTGTGAAAATTGTTTAATCCCGCTCACAATTTTCCACACAACMTTACCGA 

Sequence 2804 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCA 

CGGGCGTCAATTCCATGCCAGAGGGAGGATGAATTAGGGATGAAAAGGAGGAGATAAGGA 

AGAGTTACAAGGTGGGCAAACCCAGGGTCTAGTATCGTTTCTCAGGGGAAATTCGCTGTC 

GGGGAAAATGGAGCACTCCCACATCTATTAAAAAAATGAAAGAAAATCCTGGGGGTGCAT 

CTTGCTCTCTGTAGGACTTGGCCTCATGTGAATTCCCAGGAAGCTGTGTTTCCAGAGGCA 

TGCTCTTTCCAAACCCATCCTGCTGCTCTCAGTGTTCTCCTGCTTCGCCTAATGTGGACA 

GATCTCACCGGGATGCCCACCGTTGGGAAGTCGGGAAGTGAATAAGTGGGGCCTTGCCCG 

GGGGGTTGCCCGGGGGGAGGTGCCTGGGNTATTAGGAGGAAGACTTAAGTG 

Sequence 2805 

CCGGGCAGGTACGCGGGGGGGGTTGTGGAGTGGATGCTTTGGCAAGATGGCGGGGAGCGG 

CGTCCGCCMGTTACTTCTACCGCCAGCACCTTCGTGAAGCCCATTTTCAGTCGGGACAT 

GAACGAGGCCAAGCGGAGGGTGCGCGAGCTCTACCGCGCCTGGTATTCGGGGAGGGTGCC 

GAACACTTGTGCACCAATTTCCAAGCTTGGACATCACNTGTGAAAATGGGACGGGATAAA 

GTCCGAGAAAATGTTTATGAAGAAATGCCCATGTCACAGACCCCAGGGTGGTTTGATCTT 

CTGGTCATTAAGGGAAAGATCCGAACTGGAAGAAAACAATTAAAGTAATGGAAAGCNAGC 

GGGACACATTGTTTATTGCCGGTTCTTCCATGAAAACAAGAAGCCGCCAAGGGCCAAAGG 

GATTTTCCTATCC 

Sequence 2806 

ACCTGTCGTGCCAAGCTGGATTTMTGAATCGGCCAACCGCGCGGGGGAGAGGCGGTTTT 
GCCGTATTGGGGCGCTTCTTTCCCGCTTTCCTTCGCTCACTTGACTTCGCTTGCGCTCGG 
GTCCGTTT 
Sequence 2807 

GACT NCTATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATTTGTTC 
TT ATCT TTTTTCCCTTCCTTTTAATCGCCTAGGCTATm 

AC TTTTAAAATCC AG AGTGGGTTAAMTGCTTATTTTACTCAAAGACTGTTTAC ACTG C A 
CTGTTGAMCATCAGATTATCAAGTTCMGTCTTTMGGGGMCCATCTCTTTTGACATC 
TTAAAATTTAACTCGAAAAGAGTTTCTTTMGCTGTTAGGTGTAAA/^ 
CTTATTTCAAATTTTAAAMCTCATACATATCAGTTCATGTATTCATTTCATTCAGTAAG 
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TATGCAATGTGNCATATACCTGTATACCCAGGGCATTGTGCCTAAAGTGCCTGAGGGCTT 
CTAAGGGTGAAACACCAA I HUM 1GG 
Sequence 2808 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGGTAGGTAAGCATCTG 

CAGCTCATCTGGAATACTACAGTCTTGAGGACGTGCTTTCCATACGCCCTGTGTCTGGTC 

TACATGCAGGCGTGTCTTGTGATAAATCCTGTTACGGGCATGTGTGTGTGAGGGCTTCTA 

TGTGGCCCCCCGACTCCATTTTATTAGAGTTTGGCATMGTTGACTCCATTTTGGTTCCA 

GCAACTTTCACATAAGGAAACTGAGGCTCAGAGAAGTCAAGGGACTTGCCCCCAGGTCCC 

ACTAACACTGTGCAGTGGAGTGATGCAAACCCCAGTTTTCCCAGTGGATGGAAGCTGCTG 

TGGGTTGAAGTGCAAGGGTTGCCGGTGTTGAAACCAGAGGGGGTGGTGCCAGAAGACCAA 

GTGTGGGGAATGTGAAGAAGCCCCGCGTTACCCTTCGGGCCCGC 

Sequence 2809 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I 1 I M 1 I I 11 
I 1 I I I 1 M I 1 1 I ICMT<3TTGTTGTCTANACTtCTATGTAGTTTTAANACTGCTAATATG 
TATTNAATTTCCAGG ( mm I AGTTTTCTATTTCTTTTCACATAACGTTTNTTCTTGCT 
TCAGTCATATAATATCCTGNGTATCTGAATATTAATCTTCATTTTGGTTTNGAAACATTT 

nrm inttaccamtgmgaggttgattgatttggagttttcctggacagtgatataca 

TG AN CTTTimTAAAAAAAGAAAC ATG ATGTTATTAGGG N GTAAN CTTTTTTNTTTTGG A 
AGTAATAAAAATACCTNAAAAATGTATTCATCTTAGGTGGTNCCTTCTAAAAAGG 
Sequence 2810 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTTACATTCTATAAACTTGAATTTCAAA 

CTTCAACACCCCTCACTAATCAGCCATCCGAAGAAAGGATATTGAACATAAGCAAATCCT 

CTTGGACGGCGAGTGTAGAAATCAAGTGGAACATACACATCAACTATAGGACCATAACGA 

CCAAATTCACGCCCGCAAGGTCTTCAGACCTGGTGTCGTCGGCCACGTTCCTGACGAACA 

GAGACGTGTTGGGGGGACGCAGGTAGCGGGACATGGCGGCGGCGTGTCTnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGA 

CCTCGAGGGGGGGGGGCCCGGTNACCCAGCCTTTTTGTTCCCTTTTAGTGGAGGGGTTAA 

ATTTGCCGCCGCTTTGGCCGTA 

Sequence 2811 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGACACCCTAGATCCCAAGATCTCCAA 

GGATTTGGTGGCATACCCACTCCAGCACACAGAAGCATGAGGTTCATGACTCTCCTCTTC 

CTGACAGCTCTGGCAGGAGCCCTGGTCTGTGCCTATGATCCAGAGGCCGCCTCTGCCCCA 

GGATCGGGGGAACCCTTGCCATGAAGCATCA3CAAGCTCAAAAGGAAAATGCAGGTGAAG 

ACCCAGGGTTAGCCAGACAGGCACCAAAGCCAAGGAAGCAGAGATCCAGCCTTCTGGAAA 

AAGGCCTAGACGGAGCAAAAAAAAGCTGTGGGGGGACTCGGAAAACTAGGAAAAAGGATG 

CAGTCGAAGGATCTTAGAAAAGGCGTTGGGGTAAAAGGGAGCCCGTTCCCATGGACGTTA 

AAAAGACGTCCTTGA 

Sequence 2812 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGCACC 

TCTGAGAAAGAAAAAGCCAGACAAACATG AG GATAACAGAAGAAGTTGGGCAAGC CCGGT 

TTATACAGAAGCAGATGGAACATTTTCAAGACTCTTTCAAAAGATTTATGGTGAGAATGG 

GAGTAAGTCCATGAGTGATGTCAGCGCCAGAAAGATGTTCAAAACTTGCGTCAGCTGCGT 

TACGAGGAGATGCAGAAAATAAAATCACAATTAAAAGAACAAGATCAGAAATGGCAGGAT 

GACCTTGCAAMTGGAAAGATCGTCGAAAAAGTTACACTTCAGATCTTGCAAANTNAAAA 

ATTAAAANGGTACCTTNGGCCCGCTCTAGAAACTAGTGGGATCCCCCGGGGCTGGCAGGN 

MTTTCGATATCAAGGCTTTTATCCGGATACCGTCNGACCTCGAGGGGGGGGGGCCC 

Sequence 2813 

ACGCAAGAAAAGAACATGGTGAGCCAAAAGGCCAGCAAAAAGGCCAGGAACnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

CTGGGAAGCTCCCTCGTNGCGCTCTCCTGTTTCCGACCCTNCCGCTTACCCGGATACCTG 

TCCGCNTTTTCTCCCTTC 

Sequence 2814 

CCGCGGTGGCGGCCGAGGTACTTGCGTCCGCTACAGTATGTCAATCGCTTGCCCCAGCAC 

AGCTGTCATTCTTCTCTACAGAAGAGCTTCTCCTCATCAACTGGGGATGGTTACAGTTCT 

TCCTAAAAAAGGATGGCTGCTCTTTTTCTAAAGAGGTTAACACTACAAACTGTAAAGTCT 

GAAAATAGGTTGCATTTAGATGTTTTGGTAAACACATCCTTGCAAAAGACAGCACCAAGC 

ACAGTTGTCCCCTATTGCTTCTGCCCCMGACTCTCCTTCCTAATTCATGCAAAAGCCTT 

TAGTACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGGGATCCCCCGGGCTGCAGGGAATT 

CGATATCAAAGGCTTATCGATACCCGTCGACCTCGAGGGGGGGGGCCCCCGGTACCCAGC 

TTTTTGTTCCCTTTAAGTGGAGGG 

Sequence 2815 

GGTGGCGGCCGCCCGGGCAGGTACGCGGGACCAACAATGTTAGACAAATGTANCCAAACC 
AAAAAATACACATGTTCTATGAATCCATATATGCAAGGTTCACAATAGGTGATAGTAGCT 
ACTCTTTTGGGAAG 
Sequence 2816 

CACTACTTAGGGCGMTTGGAGCTCNCCGCGG TGGCGGCCGCCCGGGCAGGTACTTTTTT 
i I i M I i I m M M i M I I I I i i I I I I I M I M 1 M l | i [ 1 1 GNAAAAAAAATTNTGTTT 

ATTCTCCTTCCCCTTCCCTANGAAATGGGCNNANAGGGGTTNAGGGGCCAACTTTGGCTG 

ACNCTGCAACAAAAACTANTTAMNNCTTGGAAAAAACTNGGGAGGGCACMGGGTTTTN 

GGACTNANGGGGTCACAGGTTGCCCCCTTTGAGGACAGGGCTCCCCCCGGAAGGAATGCA 

AAAAATAATGAAAGGATGAAATGCCCAANTNCCTNG 

Sequence 2817 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGGACAAGCCTGCA 

G AAGC AAGTC AG GCAACTGGCTC AATTTTGTATTCAG CTCAG AGTC ATTTTTTTG ACCTG 

AATAAAGAAAATCCTGTATCCCAAAAAGAMTTCAAAGCCTTGTGAAACTCCACTGACAC 

TAATTAGTGGCGGACGAGGGGACAAGAAGTTGGTCACTAGTGGAACACAAGCATATATCA 

TGTCATTCT TCAG AAAAGCAACAACTGGCCAGAGTTCTTCACCTCCTATCAGGAGTCCAT 

AMTGGATGTTTTATTGATGCGTGAACAGCTATCACGACTCTCAACGAAGTCTTTATCAT 

CATCCAATAAATCTAAGTTCAAAGAGCAGTGCTTTAAGAAAGGGACCATCTTCAGGGAAC 
AG GC AC ATAACTTGCT 
Sequence 2818 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTTACAA 

ACTTTGGAAACAAGTTTGATGGTGTCTTTAAAAGTTAAGCATACACTTACCATAGGATCC 

AGCCATACTTCCCCGGAGAGAAGAAAGTGTGCCTCTCAGCAAGGCTTACACACGAATATT 

CACAGCAGCTTTATTTATAAAAGCTTGAAAATGGAAAACCGGCCCAACCGCCATCCACAT 

GCAAATGGATAACCCACGGTGATGTGTTCCATCCATGCCCGAGGAATACCATTCAGCCAT 

AAAAAGGAACGGATCANAAACGCATGCAACAGTGTGGACCAATCTTAAMTAGTTGTGTG 

GACTAGAATGAGCCAGTCCAAAACAGAATCCTGA 

Sequence 2819 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTC l I I I 1 1 I I I I 1 I 1 
I I I I 1 I 1 I ! I I IGCAGACTTTGGCAACATATTGAGAGCCTGAATTAACACTGAATCCTAT 
G AAAACAGTTTTGTAATTATAATGCTG AATCAG N NTTATGTGTAATTAC ACC AATATGCT 
ACT CTGN GAMCATTTCTTNAGGGAAAATGTTCCCGGGTATATTTTGNATTTNTGCCNCT 
GCCTTTTATAAAAGGTCTCATTATTTCATGAACTTTAATTTACCAGGCCAAGTCC 
Sequence 2820 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGCAAAGCTCACCA 
CAGGTAGCAGCAGAACCTTGAGCCCCCACATCCTGCTGATGGAAATCTGTTGTCTGGCTT 
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CGTTTCGGCAGCAAAGTGTGGGCACACCATCAGGGGTGCTAGATTTATTCCATCGGGATT 

GCGCCCCGCGTACCTGCCCGGGCGGC 

Sequence 2821 

TGTGCCTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGG 

TACGCGGGGGACTCGGAAAACTAGGAAAAGATGCAGTCGAAGATCTAGAAAGCGTGGGTA 

AAGGAGCCGTCCATGACGTTAAAGACGTCCTTGACTCAGTACTATAGCTGTAAGGAGMG 

CTGAGAAATGATACCCAGGAGCAGCAGGCTTTACCGTCTTCAAGCCTAAAACCTNGTAAA 

AAAAAAAAAAAAAAATTTAAAACAG C TATTAAACTGAAAGCATNTGTTAAAAAAAAAAAA 

AAAAAAAAAG GT ACCT 

Sequence 2822 

GCGTATCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGC 

ATCCTTAGGAGACCTGAGTCCTCAAGAAAACCCTCTTCTGGAAGTAGTTGCTCCTTCAGA 

ACGTTTTACAGAAAACACTAATGTAAAAGACACAACTMTGTAAAAGACACAAAAGAGAT 

GTGTTCAAAGACACATCTCTGAAAACACAAACTACAATCATCCTCCTGAGGCAGTTTCCG 

CTGGGACTGCATTCAACTTAGAACCAACTGTTAAACAAACTGAGACAAAATGGGAATACA 

ACMTGTGGGCATTGACTTGTCCCCTGAGCCCAAAAGCTTCAATTACCCATTGCTCTCGT 

CCCCAGGTGATCAGCTTGAAATTCAGCTAACCGAGCAGCTACGGTCCCTCATCCCCAACG 

AGGGGATGTGAGGAAAGTTCATGTCTCATGTTATCTGGACCTTGAAAATGGGAA 

Sequence 2823 

GCGAATTGGATTTCCCCGNGGTGGCGGCCGCCCGGGCAGGTACTGCTGTGGACCTCTTGA 

AACAMTTGTAAAAGNTTGACTTGAAATGATCAMCTGTTTGACAGCAACTTGACTGCCT 

TCCTGAACAAAACTCAAAGTTATAGGGACACAAAAATATGCAGCAACCAACAAAGTAAAA 

TTCACAGTGTNTGGTTTTCTATCAMCTTTTGAGCCACACTAAAACAGGGAAATGACCAT 

AATGAAGGGATAAATCAATCATGTTAAACTGACTCAGAGCTGACACAGGTGTAAGAATTA 

CNAGAAAAGAACATTAAATCAG 1 1 I I I ATAACTGTATACTGTACATCCCCCAAATTAAGT 

NGACAAAAAAAAAGATATAAAAATATTCAAGCTMCATTMGAAAGAAAAACTGGGGCTT 

GGTGCTGTGGCTCAT 

Sequence 2824 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGATATTTACTTTATGA 
ATTAAAGCTGGAGTCAGTGTCAGCCCGAAGGCTGAAGGAAAAAGAGCAACAGATCCAGGG 
AGCATTCACCTGCCCTGTCTCCAAACAGAGCCACCAAGGAGGAGCAGGGGAGCGA 
Sequence 2825 

CCGCGGTGGCGGCCGATGCACAGAGTAGAGCCTTGTGCAGGTAAATTTCTCATCCCAGCA 
CTCTGCAGTGGTA I Mill ICACTGAGGGAGAGCAAGAAAGCCAGGCTAGGCGTCCCCGC 
GTACCTGCCCG 
Sequence 2826 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTT^ 

I U i I I k I l I I I II I 1AH 1 1 1 1 U I U I 1 1 M 1 1 I I I II U I M I I I M 1 1 I M I I I I F 

I I I I 11 I I 1 M II I I 1 I I i II I IA AAAATTNAAAAANNTTTNNNGNNTNAGGGGCCWAAA 

AANGGNANNTTTNGGNNNNNAAAAAAGGNCCCCTTTTTNAAAAAAAANNNNNWNNANGGN 

NNGTGGGGGANCCCCNNCNNNNGGTTTNNAAANGGNAANGGNNANNTTNGGAANAAANTA 

NTTMNNNCCNTNNNTNTTTTTTNAAAAAAAGCCNNTNCCNNCNGGNAAAAAAAAAAAGG 

GNAAANCCNANNATTNGGGNNTATGCCNGGNACCCTTTNAAAGGGTTCCCNAAAAAAAAG 

NGGNCCCNCAAAAAAANNNGGGNANNNCNCGGGGGANGGGAAA 

Sequence 2828 

CGGCCGCCCGGGCAGGTACGCGGGATGGAAACTGGGGAAACATGCATAAGTGCCAATCCT 
TTGAATGTTCCACGGAAACACTGGTGGACAGATTCTAGTGCTGAGAAGAAACACGTTTGG 
TTTGGAGAGTCCATGGATGGNGGTTTTCAGT7TAGCTACGGCAATCCTGAACTTCCTGAA 
GATGTCCTTGATGTGCAGCTGGCATTCCTTCGACTTCTCTCCAGCCGAGCTTCCCAGAAC 
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ATCACATATCACTGCAAAAATAGCATTGCATACATGGATCAGGCCAGTGGAAATGTAAAG 
AAGGCCCTGAAGCTGATGGGGTCAAATGAAGGTGAATTCAAGGCTGAAGGAAATAGCAAA 
TTC ACCTACACAGTTCTGGAGGATGGTTG CA 
Sequence 2829 

AGGTACI I I n I I I I I I I I I I I I I I i n I I I I I 1 IGGI I I I I I ACATCACCATGCAGGTT 

ACATTCATCTTCCACTGGAATGACTAGAGCCCTCCAGGTAGTAGCCTGACTACAGAAGAA 

CACAGGACTGGCTCCTGGGGGCAAACAGGCTCTCTTGCTTCTCCTCATTGGCCATGCCTT 

AGCATGGTTCCTCCCTATCTCCTACTCAAGCAGGTCCT 

Sequence 2830 

GTGGCGGCCGAGGTAC I MINI I NTTATTN I 11 I 11 I NCTATTGGCNCTAGAGGGGGTA 

GAGGGGGTNCTATAGGGTAAATTCGGGGCCCTATTTTCANAGATTTTTAGGGGAATTAAT 

TNTAGGACNATGGGCATGAAACTGTGGTTTGCTCCACAGATNTCACANTCATTGACCCGT 

NNTTANTACCCCCGGTCTGAGTANTNGGTGGAAAGTGGTTTGGTTTANANGTCCGGGAAN 

TTGCATCTTTTNTTTNAGCCTAATGTGGGGACAGCTCATGAAGTGCAAGACTGTCTTTGN 

GATGTAAATNATTTATANCGAANGGGGGGCTTC 

Sequence 2831 

AGGTACATCGGGCACATTACCACACCAGGCGAGGGTCATTCGCTTCCTTGTCTGTCACCT 

GGGGTTTATGACAATCTCGGTGGTAGAGATTATGGCACTCCTGACATTCTACTAATTGAT 

TGCCAGATGCCACCATCATTTGCCTACAAACAACGCAGGCCAATCCCATCTCCATGGCAA 

AATCATCAGCACTGGTCTCCTCAAAACTGGAAAGGTCAGCCATAGGTAAATCCTTGCTAC 

TTTGGACAGTAATGGGAGATGACTGTGTTTCTGGTTTCTCCAATCTAGGTTTCTTTC5GAA 

TATCAACTCCTTCAGTGATGTCTGATTTCATTTTATCAGCAGGTCTCTTTTCAGCTTCCT 

TCTTTACCT7TTCAGTTGTGAGGACCTTGCCATTATTATTACCAGAAGGAAGACTGGATG 

ATATTTTGGGCTCTTGCTTAATGGAAATG 1 1 I 1 I I GTGCTTGAAATTTTGGGTGGCTCCA 

CATCCTTTTGAGATGGACGGTAACTGGAATCAATGCCCCGAGCCAAAGA7TCATCAAGCC 

CCGCGTACCTGCCCG 

Sequence 2832 

i 1 1 I n 1 I I I I N 11 II I I I NTTTTNCTTGTGNNCGCGCGTAATNCGACTCATATAGGGCG 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnCCGGGCAGGTACi rTTTTTTTTTTTTTTTTTl 
rTTTTTI I GGGATTGTGAAAACTAATCAGTCCTATANACTTTCCTCTTGTGGATGTGCAG 
ACACAGCCTAATAGAAAAGGGCTATTTGGTTGGGAGAATTCCCAGGANAAATATCTGTTT 
CCATCCAATGAATATAAAACCCCAGCTTNTTCTGGTGT7TTTCCCTGANAAGGGAAAAAT 
AAGAGCAC1GGTTAAGTTTGGGACCGNATCCTTCTTCTCTTGGGACTTCTCTTCCTCTTA 
ATTATGNCNC;CTNGNGTAATNhrTNCAGGCCTGTTTATTTCCCGTCTGCAAANNTAATGTA 
GGGAAGGAGCTTATCTTATTCAGC1 Mill 

Sequence 2833 

ATAGGGCGA.TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGTTTTTTCACGCCTGCAA 
TCTTGGAG T CTAGGTTGCCTTGTCTCTCCTATTTTTAAATMGTGA MTTTGG GAGATTG 
TAAAATCTOjTAAAGTTTGTTTTGTGAAAATAAAATG 
AAAAAAAA AAAAAAAAAAAAAAAATAAAAAAAAAAAAAAAAA^ 

M I 1 1 1 1 I 1 1 1 1 1 1 1 1 11 1 1 M I M M 1 1 I CCGGNNTTNAAAAANNNAAAANCNCCNTTT 
TTNGNA/vMNGNGGNANAAAANCTTTNCCNTNTTTGTNCAGGG NANTT NCCCNTNNAANT 
TNCCCK'rTNAANCNGGGGGNGNAAAAANCAGCCNNTTTNNCCNTTTTNTTTGTTA/^^ 
TTTTTNAAAAAAAAAAAAAAAGGGGGGGNTNTTTTTAAAAA 

Sequence 2834 

CCCGCGGTGGCGGCCGAGGTACTTTCTTTTTTNTTTTNTTTTTTCGCAA 
ATCCTTTTGAAACTTAC N NTCTTCAG NTATC AATATTACTGCATT 
Sequence 2835 

TAATACGACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCG 
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GGGATCTCACAGAAAATGACCAGGTGTGGCTCCAGCTTCCCAATGCCGAGTCAAATGGCC 
TATACTCCTCTGAGTATGTCCACTCCTCTTTCTCAGGATTCCTAGTGGCTCCAATGTGAG 
TACCTGCCCG 
Sequence 2836 

AGGTACGCGGGGAGTCCTGAAAGGGTGGGCAAATGTGCAGCTCTCCCTCAGGGTCGTGGG 

ATCTGGGTCCAACAGTCTGGCCTTTCAGCCTTCAGGCTGTTTTAGGCTTGMGTTTCGTG 

TGGCTGATCAAAGAAAGAAGGCATACGCAGATTTCTACAGAAACTACGATGTCATGAAAG 

ATTTTGAGGAGATGAGGAAGGCTGGTATCTTTCAGAGTGTAAAGTAATCTTGGAATATAA 

AGMTTTCTTCAGGTTGAATTACCTAGAAGTTTGTCACTGACTTGTGTTCCTGAACTATG 

ACACATGAATATGTGGGCTAAGAAATAGTTCCTCTTGATAAATAAACAATTAACAAANAA 

NAAAAAAANNAAAANANAGTACCTGCCCG 

Sequence 2837 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCCGTTCCTCCAGTGCCCAGAGATGCTCTCCG 
CACCAAGCCACAGATGTGGAGGAGGCAGGTAGGGGGTCAAAGAGGGGTGGTATCGGTTAT 
TCAGGAC 1 II I I I ill I CTTAAATATCCTGTGCTTCTTTCAATCATTTGAAGGTAAAACC 
AGGTCCTGTGAGTGGTAAACTGA I 1 I I I GGTTCTTATGAAGGTATTATTTTCTGTGTAGA 
TATTTGTTAACTTGGTGTC CTTGCAGGGTAGAC G ATCAGCGAAG CTTTCTGTTCCACCAT 
CTTGCTCCTCTCATTTTAGACCTAATATATTGACTATAAGTGTTNCCCTCTGGAAGTATT 
GACTATATCTGCATCAATACTAAGAAAATAAAATNTAATCATAGAGACTGAAACATATTA 
A 

Sequence 2838 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCAGAACTTAAAG 
TATAATAATAAAAAATAAAGTATTAC AG AAC ATG GTC AATTATATTAAAAAGTAG ATT AA 
TTAATGTCCATATATAC A I MM I GGTGGAAATCAAACATAATGAAAACTATTTCACATA 
ATTAATATTTTGTTTA 1 MM 1 ACTTATTAATTTATTGC AG AG ATG GG AGTCTTGCTATG 
TTGCCCAGGCTAGTCTCAAACTCCTGGCCTCAAGTAATCCTCCCACCTCAATCCCCCAAA 
ATGCTGGGATTTCAGGAACATGCCACTGCACTTTATTAGTGTTACCTGTCATAAAGTTAA 
TTACTCATAATTTTATTTCAGTATATATATATATATATATATATACACACACACACCATA 
CACACCATAGTAGAATAAATTAATTATATTTTCTACAAAAATTATTT^ 
GTACAATGAGTATCATTTTCACATCTTATCTTAAAAATGCCTGGAGAAGGATTCA GGNCC 
CTGCCMTTTGGTTTCTTTGCCAAAAATCNGGTCCTTGTNTTCTTACCACTTAAAl mm 
TTCCAAAAATTATTTN N N CTTAAG ATTATCCC ATTTGTTTAAATTTTTG AAAGGTAAAAN 
TTTCAAAAAAAGG NCCCCGGCAAGGGTTNAATGGANNTTTTGGGG 
Sequence 2839 

CGATGGCCTCTTCTAGGCAGATCCGTCTACAACTGGTAACAGCAACCATATCG AGTTTC T 

CTTTATCACAATACCACGAGGCTTATCATCATTGCCATGCCTTTAGTAACATCACTTTTT 

TAMTGGGCATTTTCACAATGTAATCTGCAACCTACATATAACATGCACTCTTTATTGAA 

GGCCGTACCTGCCCGGGCGGCCGCTCGAGAGACACATTACCTGAATGAGCAGGTGAAAGC 

CATCAAAGAATTGGGTGACCACGTGACCAACTTGCGCAAGATGGGAGCGCCCGAATCTGG 

CTTGGCGGAATATCTCTTTGACAAGCACACCCTGGGAGACAGTGATAATGAAAGCTAAGC 

CTCGGGCTAATTTCCCCATAGCCGTGGGGTGACTTCCCTGGTCACCAAGGCAGTGCATGC 

ATGTTGGGGTTTCCTTTACCTTTTCTATAAGTTGTACCTCGGCCGCTCTAGAACTAAGTG 

GATCCCC 
Sequence 2840 

CGAGGTAC I Ml U HI H 1 1 1 1 1 1 I 1 i I I Ul 1 I GGNTTANAGACAGGGTCTCACAATG 
TTGCCCAGGCTGGTCTTGAACAATCCTCCCACCTNTGCCTGCCAAGTAACTGGGATTATA 
GGAATGAGCCATCATGCCTGGCTCAAATATACTCCATTATAATCATATGGGACCCTGTCC 
TATATGGGGTC TGTC ACTAACTGAATC ATCGTTATG C ANC AC ATG ACTGCTATGTGTATA 
TATGTATTTATTTATAAGAAATAAATATTACATACCAGAATTTTACAGTTTTGTCCTCTA 
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ACTCTTTGGAATGCTAAAGGACAGACACTTGTN 1 I I 1 i AAATACCCCCATTGTACCTGCC 
CG 

Sequence 2841 

CCGCGGTGGCGGCCGAGGTAC I M I I ) I 1 1 I I R I I M II I H I I I I I I H I NACAACTC 

CTTAAAAAGGTAAAAGCCATCGCTTGAGCCAGGGAGGTTGATGCTGCAGTGAGTCATGAT 

TGCACCACTGCACTCCAGCCTAGGTGACANAGCCAGGCCCTGTCTTAAAGAGGAAAAACC 

ATTCCTAGCTCACGGGGCCACATGCCATANTTTGCTGACCCCTGMTTCMCCTTTNTGC 

TTTTCTGCAAGCTGCCTCTCTCTCGAAATGTTTTGACATCTAGCTTGCTGACACCTTTGA 

TTCAAGCCTGGCGCAGCGGGACTGGCTGANCTCACCTGCCCGGGCGGCCGCTCTAAAACT 

AGTG 

Sequence 2842 

GGCGAATTGGAGCTCCCCGCGGNGGCGGCCCGAGGTACCACGATAAGANGCANGCCAAAG 
TATTGTCATGAAAGGTAGGGAANGAGCATGGNGAGGCCTCAGTGGAAAGGTACAACCGC 
Sequence 2843 

AGGTACTGGAGGTTGGCTGTGGTGTGGGAAACACAGTCTTTCCAATTTTACAAACGAACA 

ATGACCCAGGACTCTTTGTTTATTGCTGTGA I I I I l CTTCCACAGCTATAGAACTGGTCC 

AGACAAATTCAhl^TATGATCCTTCTCGGTGTTTTGCCTTTGTTCACGACCTGTGTGATG 

AAGAGMGAGTTACCCAGTGCCCAAGGGCAGTCTTGATATTATCATTCTCATATTTGTTC 

TTTCAGCAATTGTTCCAGACAAGATGCAGAAGGCTATCAACAGGCTGAGCAGGCTTCTGA 

AACCTGGCGGGATGATGCTTCTGCGAGATTA 

Sequence 2844 

CCGGGCAGGTACTAGAGGTAAATTTTTAATMCATACTAATGTGAAATGGTATCTGGTTT 

TTATTCAGACTTTTTGTGAGCATATTCTTATAGACACTAGAAATAAATACATAGAAGGAG 

ACTTTGAGCACAAGGGAGAAGAGAGTTATTAGCACTCCTGAGGCCTCTGGAATGCTGCTT 

TTCTATCCAGGTATCTTGCTTGCTACCAGCAGAATTGACTGGTATTGGTTCTCACTGGCC 

AGGGAGCCTCTGCTCTGTTCCTGGMTATMGGCTTTTATTCAGGAGCAAATCATTGATA 

GTAACTTTGCTTTATCTAGCCATTGACTTCAGCAAA 

Sequence 2845 

GTGGCGGCCGAGGTACGCGGGGATTGTGTGCAAAATCAGAGAGGGGTGCAAGATCCTGAT 

TTTTCAGGAGTTCAAGCGACAATGGCAGCCCAATACGGCAGTNTGAGCTTCAACCCCAGC 

ACACCAGGGGCCAGTTTGGTGAGAGGGCATTCAGTGCTCCCCAGACCAGGCTGCAGGGAG 

GGCCTGGAAGGCAAGAGCCCANAAATTCCCAATTGAGAATTGTGTTAGTGGGTAAAACCG 

GAGCAGGAAAAAGTGCAACAGGAAACAGCATCCTTGGCCGGAAAGTGTTTCATTCTGGCA 

CTGCAGCAAAATCCATTACCAAGAAGTGTGACiAAACGCAGCAGCTCATGGAAGGAAACAG 

AACTTGTCGTAGTTGACACACCAGGCATTTTCGACACAGAGGTGCCCAATGCTGAAACGT 

CCAAGGAGATTATTCGCTGCATTCTTCTGACCTCCCCAGGGCCTCATGCTCTACTTCTGG 

TGGTTCCACTGGGCCGTTACACTTGAGGAAC-AGCACAAAGCCACAGAGAAAGATCCTGAA 

AATGTTTGGAGAGAGGGCTAGAAGTTCATGATTCTCATATTCACCCG 

Sequence 2846 

CCCGGGCAGGTACTTTGGGAGGCCAAGGCAGGTGGATCAACTGAGGTCAGGAGTTTGAGA 

CCAGCCT AGCTAATGTAGTGAAATCQTGTCTCTACTAAAAATNCN AAAATN N GCTG NGTG 

NNGTGGTGCATGCCTGTAATCTCAGCTACTCAGGAGGCTGANGCAGGAGAATTACTGGAA 

CTCAAGAGGCAGAGACTGCAGTGAGCCAAGATCGTGCCATGGCACTCCTGCCTGGACAAC 

AGAGCAAGACTCCATCCTTAAGCTCCCTGACTTGCTGCTCGTNCTTTCTCTGGCAAAATC 

TCATCTTGGATGAGCCCCGACTTCGTGTTTGCTCAATGACACCACTCACCTGAACCANAA 

AAAAAAANCAAAAAGTGGTCAGAAGATATCATCATAAATTCAAATTATTTACATAAAAAA 

TT 

Sequence 2847 

CCGCGGTGGCTCTGTAATGCGGCGTAAGAAGAAAACTGGCTCCCAACAGAAGCAGAAAAG 
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AGAAGCGAAAAGCAATAATACAGACTGTCACCTCTAGGGTTTATGCTGTCACTAGGATAC 

ATCTTTAGCATTCTACTCTGCTTAGAAATAAAGCACAAGAAGATAAAGTATTATGGAAAG 

CMTTGCCCTATTGATTATTTCCTAGGTGTCTAMTCACMTCATTTAAGCTAATTATAC 

ACCAGCATATGATATCTAACCAGAGAAGAAAGGTCTTACCCACGTGTGCTCAGAATCATG 

CTGCTATGAAATGCCCGAGTTTGGGTTCCTTTTTATAAGTAAGCTCTGTAATGCCAAACT 

TCTTTTCTTTATACATAGTGAGGGTAATCACAGGGAGTCAGAGCTTAGGAAATAATCCAC 

MCTTTAGTATCCTCTTCAMTGTCTTGCTTTATGGTCTTGTGTA TAAGC CAGGATACTC 

CAAMTTTCTTTTMTCMGACTTGTTTCCTTAMTATCACCTMGTm 

ACCATTCCTCTTTAAAGAGGATCTTTCCTAATAATTCTT 

Sequence 2848 

TATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTNGCANNTNCTNTTTTNT 

TTNTTNNCTCNCNTNTGGGAATATCTNGAATTTAAACTAGTAATTCAGCTCCTCTTCAGC 

ACTCACAATATANTCTGTTGCCAATGCCTGATGANACACATAGCTNATGACCATNGCACT 

TCTCTTTTTTTCCATGGTTCGCNTTNAGGAGGTMTTTNCTTCANCNACTANAGGATNTC 

CTTCCTNANCAATTNAMGAAATATTCGGTCMNTGATTTTT 

Sequence 2849 - 

CGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGA1 U I I 1 1 I INT Nl u M 

TTCCTATTGATGTNATNTTATTGCCGNCGTAAATAAGGAGNAGGTAAAACATTTTGTGCT 

TTAACAAGTGAGGCACAGNTTAAAAGTTTTAAACAGCAAT 

Sequence 2850 

CTATAGGGCGAATGNAGCTCCCCGCGGTGGCGGCCGAGGNACTTTAI 1 1 1 I I I ! I 1 I i I I 
rTTTTTTTTn I GGAGAAGGAGTTTCGCTNTTGTTGCCCAGGCTGGAGTGCAATGGCGCG 
ATCTNTGCTCACTGTMCCTCCGCCTCCTGGGCTCMGCMTTT TTNT GCCTCAG CCTCC 
TGAGTAGCTGGGATTACAGGCGTGTGCCACCACTCCCGGATAATTTTNTTTTGTATTTTT 
AGTANAAATGGGGTTTTACCATGTTGGTCAGGCTGGTCTCNAACTCCTGACCTCAACTGA 
TCCACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCTACTGTGCCTGGCC 
C G AAG ATTTC ATATTTATAAG AAG AG AG AAT AATTN TCTTTAAAC C AAAG TTT AAAATAT 
CAGGAGAAAACATAATAAAAGCATCATTATGAMGTATCTAAATCTTTCATTTTAAATT 

Sequence 2851 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAGAGGGCATAATAATCAGTCTCACAGTAG 

GGTTCACCATCCTCCAAGTGAAAAACATTGTTCCGAATGGGCTTTCCACAGGCTACACAC 

ACAAAACAGGAAACATGCCMGTTTGTTTCAACGCATTGATGACTTCTCCAAGGATCTTC 

CTTTGGCATCGACCACATTCAGGGGCAAAGAATTTCTCATAGCACAGCTCACAATACAGG 

GCTCCTTTCTCCTCTACAAATCCAATGTAGGCCATTGTATTTTTGCAGTGAGCGCAGTTG 

AATTCTTCTGGGTGCCAAGATTTCCCCAGTGCCACTAAGAATGGTCCTCTGACAGAAGAA 

AAAGAGGCCAACTAAGTTTGAGAATAAAATTTCTAGAGTTAGCCTGGAtATGGACCTCTT 

TTATGCMTGTCCAACCAATTAAATACTTGCTCMTTTTCCTTCTTAAGGACTGMCTCA 

TGGGTGTTGGGTGTTGTAAATTCACAGAGGGACTCTGCATTGATGTCCCCGCGTACCTNG 

GNCGCTCTAGAACTAAGTGGAT 

Sequence 2852 

AATGCGGCCGTAAGAAGAAAACTGGCTCCCAACAGAAGCAGAAAAGAGAAGCGAAAAGCA 

ATAATACAGACTGTCACCTCTAGGGTTTATGCTGTCACTAGGATACATCTTTAGCATTCT 

ACTCTGCTTAGAMTAMGCACAAGMGATAAAGTATTATGGAAAGCAATTGCCCTATTG 

ATTATTTCCTAGGTGTCTAMTCACMTCATTTAAGCTAATTATACACCAGCATATGATA 

TCTAACC AG AG AAG AAAGGTCTTACCC ACGTGTG CTCAGAATC ATG CTGCTATGAAATGC 

CCGAGTTTGGGTTCCTTTTTATAAGTMGCTCTGTMTGCCCAAACTTCTTTTCTTTATA 

CATAGTGAGGGTAATCACAGGGAGTCAGAGCTTAGGAAATAATNCACAACTTTA GTATC C 

TCTTCAAATGTCTTGCTTTATGGTCTTGTGTATMGCCAGGATACTCCAAAATTTCTTTT 

AATCAGACTTGTTTCCTTAAATATCACCTAAGTTNTATTGNTTCTTACCATT 
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Sequence 2853 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAATATTAGAAAAAACAAACAAAAACGGAA 

A CAAT CCCTAGTAGCATTCCTTGGCGTGCAAAGTCAGCAGGAAATGATTATCCATTGACA 

GTTTTCATGATAAGTCTTGGMGCATTCAAAAAGCAGAGGGMTCATCGGGGATTGTTCA 

GATTAGGTTCTGTGGGATGCCTTGTTCTTMTTTAACTCTGAGTTCCAGGATTTCATTCC 

CTTTATAACATCTCTAACAGGGTCATGTTCMTTTAGTGCAAGTGCAAAATGAAAATGAA 

ATGAAACCAACAAATACATTAGGAGCGAAGGCTGATCGTTAAGATCTGTAGTCCCCGCGT 

ACCTCGGCCGCGACCA 

Sequence 2854 

CCGGGCAGGTACCTGTTACTGGATTTGGCTCCGGGGTCCTGCAGGACCAGTTCATGCCTT 

GTCTCTCAGCCCCCTGCAAACAGCACAGAGCATCCAGTTCTGCCCTGGAGCCTGGCACTC 

CCACTGCACTCCTCTCAAGCCCCAAGTTTTCACTGGCAAGGGGACAGCATTTATAGGACT 

CTTAAGTGCATCCCCTAATCCTGTCAATGTCTCACCTGATTCCCTCGGGGTCCAGGACTT 

GGTGTTCCAGTTGAGGTAGGGCTGGGCATCTGGTTTCTGAAAANTGAGGTCTCTGGAAGT 

CT 

Sequence 2855 

CCGCGGTGGCGGCCGAGGTACAAGTTCCTAGGCAGGTCCCAGCACACTCATAATAAGTGT 

GATAC AAC AGGGTTTTAGTC ATGATGTTCCCCGACC ACGTAGTGTGG GTGCAG TGG AGG C 

ATCCCTCCCAGGGTTCTCCTTTTAACATGGATAAGGGGAGTAGCAAGAACAAAGTGATGA 

GTAGCATGTTCATACCCAGCATGGACAGAAMGGCTTTTTCCTGGGGGGAGAAGGCTAAG 

CAMGTGGCAGCTTAATAGCAAAGTAATTAATATGACAAGGAAAATTACTGGACTTCAGA 

TTTCTAACCACATTTACTTCCCTGATGATGACTCAAGCTT 

Sequence 2856 • 

GAACCCACCGCGGCGGCGGCCGCCCGGGCAGGGACGTNGNGAGGGGGNGAGNAAACCCGC 

GCAGGGGCTNCTGCTGAGGGGGCAGGCGGAGCTTGAGGAAACCGCAGAGAAGTTGTNCCC 

TCTAAGAAAGANAGAGATNAANACAACTACTNAAAAAANATAGACAATAGGCNACTAAGA 

NANTGCTTATCNGGNAAGTTTNNAACGGAANGGGAATAGCCCAAGANAANAAGAGAAAAN 

ATGAAGACCCAGAAGAGTAGCAGGAGGAAGGAAAAGANAAAAGGGGCACTAAAACATGAC 

GGAGGGGCGAGANGAAGCNNCCACANGGAGAAAAAAAAGCATGNCAAAAGAAAATGGAGA 

GAAAGGACNACAGAGCCCCGAAAGAANACCAANAGAAGGGCAACGCNCANAGANNAAAAG 

GAAGGNGACCGAAACAGCNNAAAGNCNNAGGGNAAAANGNGGGAGGCGACCAAAAAAATG 

GGAAGGCGAACCNNAGAAAAAGAGAACAAACCGAANGGGGAAGAANAGACCNNGAAANCC 

ATGACGCAAGGGAGAAACCGCGANANNNAAAGCCCAGGGAANGNAANNACCAA\NCGCAN 

ACCAAAAGGGAAAGAAA 

Sequence 2857 

AGCGGCCGCCCGGGCAGGTACGCGGGGACCTCAAGCCATGGGCCTCTGCGA.^GCGCCGGC 

GTGCCCACGGCTAAGAAACTTCCTCTTTCTGCTCCCGGGAACGAAGGCTGTAGCAGAGAA 

GGCCTTCAAGTTTCGAGACCAGTTCCAGCCGCAGCCCGAGCACTGGTGACCTTGAATTAA 

GTCTAATGTTGGGATTGGAAGACGCTTCAGACATCCGCTGCCGCCCTGGGCAAGCGACCA 

GCTCAAGGTTACCCCACAGGGACGTGCTCTGGTCCCACGGTCCAGTGCTCT7TTGTCGGT 

TAGTTTTGTCATTTGTAAAATAGGAACAGTAGATAGTGGTAGGAAAGTGGTTGTGAAAAC 

TTAATAGAAAGCTAAGCGCCTTGGTCGGGCGCAGTGGCATGTCACCCCTGCAATTCTATC 

ACTTTTGGAGGCTTGAGGCGGGCCGGATCGTTTGAACCCAGGAArrCGAAACCAGTTCTG 

GGCAATATGAAAACCCCATCTTTTGCAMTAAAGTACCTTCGGCCGCTTTTANNAACTAA 

GTGGGATNCCCCCGGGCTTGCAGGGAATTCNATATTAAACCTTATTCGATACCGGNCGAC 

CCTTCGANGGGGGGG 

Sequence 2858 

CCGGGCAGGTACCAGGCAGGATGGAGCAGGATGGTGTGAGATTTCATCACACTACTCAGG 
ATAGTGCACAATCTAACACTTATGAATTGTTTATTTCTAAAGTTTTCCATTTAATATAT^ 
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TGAAATGAAGTAGACCCTGTGTGTAGCTGAAACTTTGGAAAGGAAAACTATGGATAAGGG 

GGACTACTAGMTACACTTCMTAAAATAGCATTTTAACAAAATCCTAGTGTCCGTAGAG 

TAGTTCTCCATCTAGCTGTCCATCTTTCAAATGTGGAGAGAGGTGAGATGAGGGAAGMG 

TTATGGTCAGAGAGGGGAGACAGGTGTGTGTGGTGTGGGGTGTGC 

Sequence 2859 

TAGGGCGAATTGGAGCTANNNGGAGCTNNACCGNGGNGGCGGCCGAGGTACGCGGCTAAC 
MGMTTCTTNMCAGCAGAMTTAACGAAACMCMCCTCATCTACTGATTTTCTGGCT 
AGAGCTTATGGTGTTGAAATGGCCAAAGAATTTGTTACATNAGCACCA 
Sequence 2860 

AATNTGGGGGAAGGCCCTACCCACCCCCGCCGGGNTGGGGCCGGGGGCCCCCGGGAAGGG 
GTTAACCTTTAN N N NTNTCCNTNCNTTTN N N N NTATATNTNTN N ATTTNTTTTGGGGGAA 
AAATTGGTTNAAAAAANTCC I I I 1 I I 1 I AATTTTAAAAAANACCAAGGTTGTGGTTNCTT 
TTTTCCCCCACCGGGGTAAGGGTAAAAAMGCCTTTTTNGGGNCCACCCANTTACCNAGG 
GNTTTTTAAAAAAAAAAAATTAAMTTCCANCCCCCCNCCCCATTTAAAA I I I I lAATTT 
ACCCCAANAAAATTTTTCCCCTTCNTTTTTMTTCAAAM 

AAAGGGNTGGGTTTTTTNTTNCAAAAAATTAAACCCANAAA I i 1 t I I I I I 1 I 1 1 ICCCCC 
GNCTCAMTTAANAAAAAAAAMTTAAC^^ 

TG GTATTTATAAAAATTAAC AACCC ANGGGGGGGTTTATN AAG GGG AA AG N N AAAAAAAA 
AGGG AAATT A 1 I 1 1 1 1 1 1 1 1 CCNTNAAAGGGGGGCCCANAAMTTTTTAMCCCTTTAAA 
NGGGGGGG AAATTCCCCAN 1 I 1 11 I i 1 GGGGGGGGAAAAAAAAAAAAAAGGGNTTTNGGA 
AAGNGTTCTACCCCTTGGGNTGGGGGGGGNAATTTAA 1 1 1 I 1 1 1 I I AAAAAAAAAAAAAA 
TTNNAATTCCCAANCCCAAGGGTTTNANAACCNAA AAAGGG GNAAATTCCCAATTGGGNC 
CTTTTTGGGGTTTTCCCCCTTMCCNACGGGTTTMTTTTTGGGCCGGGGGGGGCCCCCC 
MNANAACCCNCCTTTTTTAAAMGGTTTGGGAAAAAMGGCCCTNNNAAAAANNNGGGT 
NGGI'I I I I I NGGGGGGGGGNTGGGNAACC 1 1 I I I I 
Sequence 2861 

CAAACCCCGGNCGGGGGTGGGGCCGGGGGCCNCGGAATTAACCCCTTCCAAACCCCGGGT 

NGGGGGCCTTGGNCNTTGGATNCATTNTNCNCGGGTTGGGGGTNCNGGGGAATTGGNTTC 

CCAAATTGGGAAAAAGGGGAAAGNGGTTCCNGNAATGGGAAAGNCNAANGGAAATGGNCC 

TTNTNAAAACCGGTTGGGCCAAGGAAAACCAAAANGGAAAAACCAAAGNCCAAGGGCCCT 

TAACC I Hill I GGTTGGGGGAAAAATTTGGGGGGAAATTCCCCCCCCCAAAAACCAAAA 

ATTGGGTTNCCAAAAAGGGAAACCAAGGNCCCCCGGTTCNTTGGGTTGGGAACCAATTCA 

CCCCAACCCCTTCCGGATTGGGGGCCCCCTTCCAAAAAGTAAATTGGGGGCCNAAGGGNT 

TCCMCCCCNTTTTCCCAATTTTTGGGGGCCCAAMTTTAANGGCCCAAACCAAAGGNCC 

CCAANTTCTCCAAAGGGGGAAMGGCCCTTTNCCTTTTTNCAAAAGGNCCGGGCCAAATT 

TCCTTTCCGGGGGGAAAGGNCCCAAAGGNTTn"CCAAAMCCTTTG GGCCC CCCAATTNG 

GGTTTTTCCCCCGGGCCCCCGGGGGGAAAAANGGGGGGNCCCCC I 1 1 1 1 NCCCCTTNCCC 

AAACCCTTGGGGGGTTAAACCCCCCTTTNCGGGGGGCCCCCGGGCNTTTCCTTTAAGGGA 

AAAAANCCTTMGGTTTGGGGGGMTTTCCCCCCCCCCCG GGGGG GCCTTTGGGCNCAAA 

GGGGGNAAAATTTTTCCGGAMTTTAANTTC^ 

TTMCCCCCCGGGTTTCNGGGAANCCNTTTTTTAAAGGGGGGGGGGGGGGGGGGGCCCCC 

0 

Sequence 2862 

GGNCCGNAATTGGTGTAGACTNCCACCGNCGGTGGCGNGCACCGNAGTGNTACNCCGNGG 

AAANNGTNGCTNCGTGNNNMNTGTGNAGNGTCNGCTCNTTGTCTCAATTGCGTTAGGAAT 

GGGGACACTGCTTGGCGAGCCANGTCNCAAGGCACCCAATTTCAATGTAGTCCTCACAAN 

CACCGATCGCCTCTCTCNAAGAACCGTGGGGATGGGGNTNCCCTTTCANGCNCTAGGTCG 

GNGNAATNCGCCANATAANCCTTCNTNTATCNTGTTNGAACTTGACTAACNACNACNAAA 

AACCG GCCATAGTGATAGTCG ATNTAAAN GGGGTCTTTCATAG AGACACNAAATGTCTCC 
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GA Nil I NAAAGTAAAAAGGTTGTCAGGCACAAACTTTTTCCAATTCAGTACGANTTGCT 

TACCAACTGGCGGAAGCNNGCCTTTCAATTMGTNAMGCCTTAATGTTTCGGTGTAGTT 

GCACACTCNTCTGTTGAGTCATTTGNTCTAAGGAGCAAAAAAATTNATTGTAATANTTNT 

CCCATAAGTTCCTTCTNAATNACAGCMTTNAAACTCTTGTNGGCCNAAANCNCTTNNTT 

TTAAGGGGGGGGTNGTNGTGGGGGTCCCCCCCCCNTGGTTTTAAAANCNCCCCAAANTCC 

C 

Sequence 2863 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCATGGTACGCGGGGGCGGCGGC 

GGCGGAG'GTTGCAGTGAGCCAAGATCGCGCCATTGCACTCCAGCCTGGGGGACAAGAGTG 

AGACTTAGTCTCAAAAAAAAAAAAGAAAAAAAAAATCAGGGATATAGTTCATATCCCACT 

TCTTTGTTTACACCGATGTCCCTGAATATCAGCCTGTAGCTAATGGACTTGGGATTTCTG 

GTCTAAGTGGGCCTCCTGGGGATGGGGTGGTACCTnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCC 

CCGGTACCCAGC 1 I I I I GTTCCCTTTTAGTGGAGGGGTTAA 

Sequence 2864 

CGNCGNGTNGGGCGGGNCCCGAAGGGTAACCTTTANCAGGGGGGGAACNCGCTCAGGGGG 
GGCCCCTCGTAGNAAATTCCGGTNAATTCGCTGNAGGANCCCTCCTTGNAAAAGAAGCCA 
NGTTCNNAAGGGGTTTTG 
Sequence 2865 

TATTGTTTTAGTCCAAAGATACCCCTGGCACAAGCGACANTAATATTACAGGGACATTAA 
ATTAAGGAAAGAACCAACTGATNTGTGTCTATGAGAGAAGANCTTCNGGGTGAGGAACTG 
GGMCAAAGGAGGAAGACNCTNTTAGCAAGCCCTGGGGCCCNCCTTTATTGAC 
Sequence 2866 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGCAGAGG 

AAACATCCTGTCCAATTCTAGGTTTTCCTCCCTTGGCCTCCTTTCCCCAGCATTGTCTAC 

CCTGGCCCACTTCCTGCATTCTCCCCATGCCCTGCTATTTCTGATTCTTTGCTTCTCCTA 

GCGAGATACTTTCCTTATATGATAGCTGCTGAGAAGTTTCCCAGAACTGCTAGAGGAAAA 

GAAGTGGGGAATTTAGGAAATATCCCTCACTGACCTAACTCCATTATCTTCACTCTTTCA 

TTCTTCCTGCCACCTCATGCCCATTCTCTTTACTGTCTAGCATGCTGAAAGAAGGAAGTG 

ATCTAAATGCCAG C GTGTTC AGTGGTAAATATTAGTTGGTGCAAAAGAAAAAC C ATG ATT 

Sequence 2867 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAGT 
TGAMTGTTCTGTCAAATGTGTCTCACATCTACACGTGGCTTGGAGGCTTTTATGGGGCC 
CTGTCCAGGTAGAAAAGAAATGGTATGTAGAGCTTAGATGTCCCTATTGTGACAGAGCCA 
TGGTGTGTTTGTAATAATAAMCCAAAGAMCATAAAAAAAA 

ANTCCCTNGGCCNCTCTANAACTAGAGGNTCCCCCGGNCTGGAGGAATTCGANTTCAATC 

TTANCCNTACCNCNCACCTCGAGGG 

Sequence 2868 

CGACTCACTATAGGGCCGAAATTGGAGCTCCACCCGCGGTGGGCGGCCCGAGGTACAGAG 

GGGTCTGTTTCTAAGTCTGGAACCTCAACACGAGGCTGGGATGCTTTCACAACTGTGCTC 

TCTCCCACTGTCCAGCCANMNMTGGTTGGTTCTTTCCCACAGTGCTTTCCCCCCCATN 

CCTTTTCAACACCTCCTTGTTGGNAGNGGGAACTCATTNGGGGAACCGGNNCCCAANAGA 

AGGGAAAAANCCNTGGGGGANTTCMCCTTTCTTGGAACAAGGGGAAAAATCGGGGANCC 

AANAGTCCNACAAAGNCTTTGNCCNCCAAGGGAAANNCCCCANGGNTTTGGCCTTNCTTG 

GGG 

Sequence 2869 

CCGCGGTGGCGGCCGAGGTACAATTTCTTTrGGTTCATTTTCAGATTCAGGCATTTTCCC 
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CCTTGGCTCTCAATGCTGTTTGGGTTTCCAACAATTCAGCATTAGTGGGAAAAAGTGGGC 

CCTCATACACAAGCGTGTCAGGCTGTCAGTGTTTGGTGCACGCTGGGGAAG AATTTACT T 

TGGAAAGTAGAAAAGCCCAGCTTTTCCTGGGACATCTTCTGTTATTGTTGATGl Mill I 

TACCTTGTCATTTTGGTCTMGGTTGCCATTGCTGCTAA AGGTT ACCCGATTTCAAAGTC 

CAGNATACCAAGGCATGTGGGATATTGTTTAGCCTACCGTTTTACTCACAGGCCAGGCGA 

ACTGGACAT 

Sequence 2870 

ATTGGAGCTCCACCCGCGGTGGCGGCCGAGGTACTTTATATAGATTTTCAAACAAAAGAT 

TGATTGTGC I 1 1 I 1 GMTAAAAAAGATAGGGTATNTTCCAACCTCACTGACTTAMCCTT 

GTGGAATACAGCACTTCTAMCTMCMCNCACGTCAAAAMGAACA ANAT TTGAATAAA 

TCCCCAGGTGGGGATGCCTCAAAAMTTNNTTCTTmMGAMhJGGTmAAGGGATCT 

GGCTTAANTTTCCTGGCTTAATNTAATCATTMTTTGGCCTTNGCGGGCNAAAAAAAAAN 

AAGNAACCACAAGGAAGGCTTGGCCTTNGTTCNTTAAAAAATTTTTNNTTCCNCAAAAM 

CNTTCTCCNCCCTTCNCGNNAGNTTTTTCNAAGGCCAANTNGGCCAAACCANCCTTCCAA 

GAAMTTTTTNGGAAAAMNAAAAAMC^TTTNNTT^ 

TNACCNATTAANTTGGGGGTTTAAANCCCCCGAAGGTTNCGNAACCAAl MM iGGAATT 

TTAAGNCCAAGNGGCCATTTTT 

Sequence 2871 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGAGAAGAAGGGGGCTGGAACAAATAAGTTACG 

GCTGATTTTACTCTGCAAAATATATMGGGGTTTTCAAATTAATAACCATTCAGGGTCCC 

CAAACAGCCTAATCTTTGTCMTTGGTGCCAAATTATGCAAGGTGCTGCCAATTCTGGTT 

GTCTAACAAATCAAGACCAGTGTATGTTACTCCTCGGCTGAAGCCTGATTGATTCTGGTC 

ATTATATGGGCT 

Sequence 2872 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGGGACCTGCAATTACTAC 

GCAAACGCTTACAGCTTTTGGCTCGCCACCATAGAGAGGAGCGAGATGTTCAAGAAGCCT 

ACGCCGTCCACCTTGAAGGCAGGGGAGCTGCGCACGCACGTCAGCCGCTGCCAAGTCTGT 

ATGAGAAGAACATAATGAAGCCTGACTCAGCTAATGTCACAACATGGTGCTACTTCTTCT 

TCTTTTTGTTAACAGCAACGAACCCTAGAAATATATCCTGTGTACCTGCCCGGGCGGCCC 

GCTCTAGAACTAGTGGGATCCCCCGGGCTGCAAGGGAATTCGATATCAAAGGCTTATCGA 

TTACCCGTCGACCTCGAGGGGGGGGGCCCCGGTACCCCAGCTTT 

Sequence 2873 

ATTTGGCTCATGGTTCTACAGGCTATATATGAAGCATAATGCTGACATCTGCTTCTGATG 

AGGACATCAGGMGCTTAAMTCATGTTGGAGGTTTAAGGGGAGCCAGCATA TATCA CGG 

NGAGAAAGGAGCANGAGACANAGGGGCNTACCACACTCTTTAAACAACCAGATTTTGCTT 

GAATTCAGAGCAAGAACCTNACTCACTACCACCAAAACAGCATGGAATCCATCTCATGAA 

GGTATTCTTCTTGCATNGACTCAAACACCTCCCATTCAGCTTCCACCTT 

Sequence 2874 

CTAATACGAC1CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTT 

TATTTAGTGTTGTAGGAMTGTTGGGTTACTTCTTAAAAACGAAACCAAAGAAATTCAAA 

AGTCCCAAAGAAAGAAAGCAGGAAATAATAATTCTATAATCCAAAAACGTTGGGCGATCC 

TTCAGTTGGAGGAAGAGGGCGTCAGTTAAGTAGCTCACACAGTAGATATGGAGACACCAT 

ATGGAGATACCGGAGTTAAGTTTGGTGGATACTAGGAATTAAGTTCTCCACCTAAGGCAA 

TTAATTTTTCAGCCTTGAGAGATAMTTAGTAGGTTCTAGGAAAAAGGAAAATNACAGTT 

GACTGGGGAGAAAGGGTGGGNAGGGGAGGGATGGGTGCCGTCATT 

Sequence 2875 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACNCCACAANNCC 

AGNAGATTNIGTCTCCTTNCCTGNGGCCTCATCTANCTTACTTATTCAGACTTTCTGGACA 

CAAGGGGTTTGTTATCTTCATGAAANTTTMTNTGTGTNACGTTAGAAATGAANTTAGAA 
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AAAGATGTTCTTGCTGTAAACACTGGGCNGNAGTANNNTCANAAGAAGACTGTGGTTCTT 

GGATG 

Sequence 2876 

CCGGGCAGGTACTTTTCCCAAAATGCTATATTGCAATNGGACTGGAGGCTATAAGTGGTC 

TACNGCTCTGGCTCTAAGCATCACCCTCNGAGGNTTTGGAGCAGACCGTTTCTACCTGGG 

CCAGTGGCGGAGAAGGCCTTCGGGTAAGCCTCTTNCANCCTTCCGGTTGGCCTTGGGTAA 

ATANTGGNACGCCTGATAGGACNGTTCCCTGGATTCATATNAGAAGTTTGGNNTTATNGT 

TTGGNACNCAAACCAGGATGGGCTCCTTTTGGGTAACCCTTCGGTCCGCCTTCNTAGAAA 

ACTAAGTTGGGATACCCCCCCGTGGNCGTNGCAANGTAAATATTCGNAATNAATTCCAAA 

ANCCTTTTATTCCGAAATAACCCCGGGTGCTGAACCNCCTTCNNNAGNGGGGNGGGGGNG 

GCCTCCCCGGAGGTTAACCCCCCCAANGCCN I I I I I I I I I GGGTNTTCCCCCC I I I I I I A 

ANGGNTNGGAAGGGGGGGTTTTAA 

Sequence 2877 

TAGGGCGAATTGGA GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC1 MINIMUM 

n i n n n 1 1 n 1 1 icataggtanaaggaacttgccttgagtctcaaatnaaacttggg 

ACTTAGGACTTTTCAGNGGAGGCTGGAATGAATTAANACTTTNGGGGACTATTGGAAAGA 
NATGATTNTNTTTTGCAATGTGANAANANCNTNCANTGTTGGGGGCTGGAGGGGAATGAT 
GTANTTNAAATATTTGACCCTTCCAAATCTCATGTTGAAATGNGACCCCCAT 
Sequence 2878 

AGGTACGATTTTTAAAATCTAGCACATTTAAMCACTGTTAGTTCAAC 
TGGAGAATTTTAGG AATACTCAATTTAC ATATCTGTATAAATCAN G AC AGAGCTCCTTTT 
AAN N AG ACTTTATAAATCTAATATTATC AATACCCCCCC AAATTTC ANGG AAG N AAC ATT 
ATACAC 
Sequence 2879 

AATTGNAGCTCCACCGCGGTGGCGAGCGGCCGCCCGGGCAGGTACTGAGTCAAGGACGTC 

TTTAACGTCATGGACGGCTCCTTTACCCACGCTTTCTAGATCTTCGACTGCATCTTTTNN 

TAGTTTTCCGAGTTCCCCCCACAAGC I I M I I GCTCCNGTCTAGGCCTTTTTCCANAAAG 

GCNTTGGAATCTNCTTGCTTTCCTNTGGCTTATGGTTGCCTGTTANTGGGCCTTAACCCT 

GTGGTCTTTCAACCCTTGCNAI JIM I CCTTTNTTCGAAGCCTTGACTTGGAATTGCCTT 

TTCATTGGGGNANAGGGNGANTTTCCCCCCCCGAAT 

Sequence 2880 

TAAGCNGTGTGCGCCTTTCTCATAAGCCTCACCGCCTGGTAGGGTTATCTCAANTTTCTN 

GCTGGTAAGGATACCGTATCCGCTTCNCAAAGNCCTGNGGACNTGGTGNTGTCACTCAAN 

CCCCCCCCCGCTTTCANGCCCNCGAACNCGCTTGTCGGCCCTTTATTNNCGGGTAAACTT 

AATACGTNCNTTTGAAGTTTCCCAACNCCCGNGATATAGGNACACCGAA 

Sequence 2881 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTGACATCC 

CACACTACCTGATTTGATACACTAAGAAGGGCTTATCATTTCTGTGGTACTCTTGGCCAA 

CAATGCACGTGTAACATTTATTCATTGAAACACATTAGGCAGMTTGCAGGACATTCTTC 

AAAATAGCTGTCCAATACTCTTCAAAAGTGTCAGGTCCTGGAAGACAAAGAGATACTGAG 

GAACCATCACAGAGTGGGAGAGGACATAGAGTGATAAAAACTAACTGTGATGTGGAATCC 

TACATTGGATCATGGACCAGAAAGACAGCACTGATGGGAAGACTGATGAAATCTGAATAA 

GTCTGTANTTTGGTTTAAAG 

Sequence 2882 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGAGGATNAACAC 

GAANCNGATNAACCTCANAAATCCTGTCTGGCTGNCAGATTNCAAGTAAAAAAAAAAAAA 

GGTGGGTTGGGGGGACCCnTTCTTTNTAGTTGTCTTNAAGGAAAATTMTTATACT^ 

TTTTNGTTCNGGTCGAAATTTn"ATGANATATCTCTCACTTGNCTTCCACTTTGAACCGG 

TCAAAGCTCATAGCTGCCAGCTCTGAATGAGGAGGGGAGAAGCCCCTGGGTCTTTCTTTG 



Page 43 J of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



AA 

Sequence 2883 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACTAT 

TGTAGAGACTATCCGGGTTAGTTTGCAAGGGGAGCCAATGATGAGTAATTTGAAAGAAAT 

TCACCTGGTGAGCMTGAGGACCCTACTGTTGCTGCCTTTAAAGCTGCTTCAGAATTCAT 

CCTAGGGAAGAGTGAGCTGGGACAAGAAACCACCCCTTCTTTCAATGCAATGGTCGTGAA 

CAACCTGACCCTCCAGATTGTCCAGGGCCACATTGAATGGCAGACGGCAGATGTAATTGT 

TAATTCTGTAAACCCACATGATATTACAGTTGGACCTGTGGCAAAGTCAATTCTACAACA 

AGCAGGAGTTGAAATGAAATCGGAATTTCTTGCCACAAAGGCTAAACAGTT 

Sequence 2884 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGAC1 1 1 1 I I I I I M I III I 1 1 
III I I I 111 I GAGAGGAAAACCCGGTAATGATGTCGGGGTTGAGGGATAGGAGGAGAATG 
GGGGATAGGTGTATGAAC 
Sequence 2885 

CCGCGGTGGCAGCTCTAACGCTGTGGMGCCTTAATTAAAAAGGGTGCAGACCTAAACCT 

TGTAG ATTCTCTTG G ATACAATG CCTTAC ATTATTCC AAACTCTCAGAAAATGCAGGAAT 

TCAAAGCCTTCTATTATCAAAAATCTCTCAGGATGCTGATTTAAAGACCCCAACAAAACC 

AAAGCAGTTGAGTGATGTCTCTTCCCCAAGATCAATAACTTCGACTCCACTATCGGGAAA 

GGAATCGGCA i Mil IGCTGAACCACCCTTCAAGGCTGAGATCAGTTCTATACGAGAAAA 

CAAAGACAGACTAAGTGGACAGTACCT 

Sequence 2886 

AGACCMGACCGAGACTTGGAGACCTGATTGAGATTTCTCGCTTTGGCTATGCACACTGG 

GCCGTCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGTGAAATTGCTGGA 

GCTGGTGCGGCCAGTGTCCTGTCTGCCCTGACCAACAAAGCCATAGTGAAGAAGGAACTG 

CTGTCTGTGGTGGCTGGGGGAGACAACTACAGGGTCAATAACAAGCACGATGACAGATAC 

ACACCACTGCCTTCCAACAAAATCGTCAAGCGGGCAGAGGAGTTGGTGGGGCAGGAGTTG 

CCTTATTCGCTGACCAGTGACAACTGCGAGCACTTCGTGAACC 

Sequence 2887 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 I I I I 1 1 M 
1 I I M M i III I i MI NGCTCCAATMGTTAANAATTTTTAGTTCACTGGAAAAAACAGA 
TCTTTGATATNTGG ACTATTCAGCATTTTCTATGTCTG N NGGGGAACAGGCCCCCAAATC 
TGGCCATAAACAAAATCTNTGCAGCACTGNGACATGTTCGNGATGGCCATGATGTCCACN 
CTG AAAGTTGN GGGTTT ATTAN AATG AGGGCA 
Sequence 2888 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACGCTGCATCCTTTTC 

TATGCTCTCCCTGCTGGTGATAATGGGAGATACAGACAGTAAAACGGACAGCTCATTCAT 

AATGGACTCGGACCCTCGACGCTGCATGAGGCACCACTATGTGGATTCTATCAGTCACCC 

ATTGTACCTGCCCG 

Sequence 2889 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCCTTGGNATTGNNGAATGAN 
CAANCATTGANAGCACATGAGATACTCCCATCAGAATCCATACACANGGACTACTGAAGA 
AAGTCCTTGGGAT ' 
Sequence 2890 

CCGCGGTGGCGGCCGAGGTACTTTGCACCAACTGTATCTTCAGATACACAGCAAGTTTCC 

TCCAAGCCTTTATGCTACCAGGATTTCTAAAGCACACCAAGAGGAAATAGCAGGTGCTTT 

CCTAGTGGACACTGGGATCCACTTATCAAGTCAGCTGGCTTCACATTTNAAGCCTTTCAT 

GCANGGTGGTTTTGGACAGTA 

Sequence 2891 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATTCCACCAAAGTGA 
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TTCTAAATCCCTACCACG I 1 1 I 1 AAAGTTTTCTTTTAAGCTTGTTAAGAAATAGTAAGAG 

AAGGGGCTTCCTTTTCGATCCGCCATCTGCGGTGGAGCCGCCACCAAAATGCAGATTTCC 

ATAAAAACCCTTACGGG GAAGACCATC ACC CTC AAG GTTG AACCCTTG G ATATG ATAG AA 

AATGTAAAGGCCAAGATCCAGGATAAGGAAGGAATTCCTCCTGATCAGCAMGACTGATC 

TTTGCTGGCAAGCAACTGGAAGATGGGATGTACCTCGGCCGCTCTAGAACTAGGTGGATC 

CCCCGG 

Sequence 2892 

NCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGCACAATGAAATCAATNCTCAGAGA 

AGGCAGATAATTCTCCACGAAGCCAGAAAACTAATAAATGAACAACTTGGGTGAAATGTN 

CCACCAGACGGTGTGATATTTANTNGCCCANAAAGCTGCCAAGGGGTTGAATGACNCTAT 

CTGMGATNTGANCCAGTTTGCTCTCCATAGGGAGGATTTCATCAACAGGAAACAGATGC 

C 

Sequence 2893 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAGAAGACAATGCCGAGGT 

GCTGGTGACCCTGGGGGTCTGAGTGGATGATTGGGGCTGCTGCGCTCAGAGGCCTGCCTC 

CCTGCCTTCCAATGCATATAACCCCACACCCCANCCAATGAAGACGAGAGGCAGCGTGAA 

CAAAGTCATTTAGAAAGCCCCCGAGGAAGTGTAAACAAAAGAGAAAGCATGAATGGAGTG 

CCTGAGAGACAAGTGTGTCCTGTACTTTTTTC I II I I I I M I I I AGTAG ATTAATGAATG 

ACTCACTTTACTTACTTGAAATATTAGGAACTGAGTTAAAGGCAACTCCT 

Sequence 2894 

GGCGGCCGAGGTACGCGGGGGAGTCCTGAGCTGCGTCCCGGAGCCCACGGTGGTCATGGT 

TGCCAGAGCGCTCTGCATGCTGGGGCTGGTCCTGGCCTTGCTGTCCTCCAGCTCTGCTGA 

GGAGTACTGTGGCGCTCCGTGAAATTAGACGTTATCAGAAGTCCACTGAACTTCTGATTC 

GCAAACTTCCCTTCCAGCGTCTGGTGCGAGAAATTGCTCAGGACTTTAAAACAAGATCTG 

CGCTTNCAGAGCGCAGCTATCGGTGCTTTGCAGGAGGCAAGTGAGGCCTATCTGGTTGGN 

CTTTTTGAAGACACCAACCTGTGTGCTATCCATGCCAAAACGTGTAACAANTATGCCAAA 

AAGACATCCAAGCTAGCACGCCGCATACGTGGAGAAACGTGCTTAAANAATCCACTNTTG 

AATGGGGAAACAT 

Sequence 2895 

CCGCGGTGGCGGCCGAGGTACGCGGGMGAACTGAAATTTrrAATTAAMTTTATTTCCC 

CTATTAGGCTGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCG 

GGCGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGTCAACATGGTGAAACTCTGTC 

TCTGCTAAAAATACAAAAATTAGCCAGGCCTGGTGGCACACGCCTGTAATCCCGGCTACT 

CGGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAGAGGTTACAGTGAGCCGAG 

ATTGCGCCCTGCACTCCAGCCTGGGTGACAGACAAGACTGTCTCAAAAAAAAAAAAAAAT 

TATTTCCCCTATTTACTATTTTAAATTCCCAATCTGTATGATTTAGAATTTCCATGCA^ 

AAAAGAAACAGCAATGTTTTTAAGAACAAGGGAAT 

Sequence 2896 

CGAGGTACCTGTTACGAACTGAATGTTTATGTCCCCCCAGTCAGCAATGTGATGGTGTTT 

GGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCG 

CACCTTCTTCTGCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCC 

TCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGA 

GMGTATATGTTTGTGGTTTAGTCAATGGTCTATGGTAATTTTTTTATAGCAAGTCCCAG 

CCAAGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAGGCTCCT TTCAGAA 

ACCCATGTTCAAATAAAGAGATAAGATACTGAAACACATAACACCTTCACTAG 1 IN I I A 

AGTATACAAATATTGAGAAATAGTTTGTTATTAACTATCTCATCCAAGAAATGCAGATTC 

ATGTTGGTTCTAATTTTTTATATTAAATTGACAAAATGAAGAMCTTAACACCATCCT^ 

ATTTTAGCTGCCCAAAAAAAAAAAAAAAAAAAAAGTCCCTGCCCGGGCGGGCCGN 

AACTANTGGATC 
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Sequence 2897 

CGAGGTAC1 iTTTTTTTTTTTTTTTTTTTTTI ( GGAAAAAAATAAGAATAGTTGGAGAAT 

GTTAAAACATTATTGAAAAACATATCGACTAAAAAAGATAAAAACAATCCCAAATCACTT 

TAATACTTTAAATAAATTCACAATAGAATTCTACTCAGTGTTAAAATACTGGAGAAGGCT 

GGGTGCAGTGGCTCATGACTGTAATCTCAATACTTTGGGAGCCTGAGACCGGAGGATTGC 

TTGAGCCCAGGAATTCAAGACCAGCCTCAGCAACATGGCAATACCCCATCTCTACAAAAA 

AMTTTTAAAAATTAGCTGGGTACAGCTGGCACAGTGGCTCATGTCTGTAATCCCAGCAC 

TTTGGGTGGCTGAGGTGGGTGGACCACTTGAGGTCAGTTCAAAGACCAGCCTGGCCAACA 

TGGTGAAACCTCATCTTCACTAAAACAAAACACAAAAATTAGATGGGTNTGGTGATACAC 

ACCTGTAATCTCAACTACTCGGGAAGGCTGANGCCAGGANAATTGCTTTAACCCTGAAAG 

GGCAMNCTTGCANTGNAGTCAAAMTTGCCACCACTTCCCTTTNACCTTAGGGGCCNNA 

GNTANGNGAGACTCCNTCTTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2898 

CCGCGGTGGCGGCCCGAGGTACATGCTACACCACTTCTCCAGACTTGACCTCTATCAAAG 

GGCATTTGGGTATGGCCCGAGATGGGAGTAACCTCTCTTTTCAGCTGTGTCAAATTCTTA 

CACCATGTTCTTTTG I Nil ICCTCATAATACAAGGCCAAGCACATCTGCAAAAAAGACA 

TTGCTGGAGGAGGTGCAAAGAGCTGGAAACCAAGTCTCCAGTCCTGGGAAAAGCAGTGGT 

ATGGAAMGCMTGGAAAGAGCATTTTGAAMTGCCATTCCACTGTTTTCTGGCCTTTAT 

GATTTCTGCTGAGAAATCCACTGTTAGTCTGATGGGGTCTCCTTCATAGCACCAATGACC 

TGMGAGCCTTGTTGAAGGMGACTCCATCTGATGACTCAGAGCMGTATTTTTTAGTGT 

GTTATTGTTATTAGCAGAAAGAGGGCCATAAAATACATGGGGCAAGATGAATATATCTTA 

GGCAAAAGAAGAAAATATTCAAATTCTTATGTTATTTTATCTMTTAT7TT 

Sequence 2899 

CCGGGCAGGTACTMTTTACATTCCCACCMCAGCATATAAACATTAATTTTTATCACAT 

TCTTAACTGTGTCTATTA 1 1 1 I I IGTCI Mil 1 AAAGT AT AAT AA 1 II I 1 AAAAAGTAAA 

GAAG AAAAGAAAG AAAAAAAATTG ATAGG ATCTTG CTTTGTCCCC C AGGCAGGAATGCAA 

TGGCACTATCATAGCTTACTGCAGCCTTGAACTTCTGGACCCAAGTGATCCTCCTGCCTC 

AGTCCTCTTTGTAGCTGGAACCAGAAGTGTGTACCACTATGCCTGGCTMTATTTTTATT 

TTTTTAI I I I IGTAGAGACAGGGTCTTGTTATGTTGCCCAGGCTGTTCTCAAACTCCTAT 

CCTCAAGCCATCCTCCCATCTCAGCCACTCAACTTGCTGGAATTACCAGCGGAAGCCACT 

GTGCCCAGCCCAGACTCCCTTTTACCTAGTATATCATGTCTAATTTCTAACAAAAAATTA 

AAAAAACATTGTMGAGGAAAAAAAGAAACCCTAAAACAAACAAGCAAACAAAAAACCCA 

AAACAACAAAAAGTTTCAAGAAACAGAGCTAGCTGACAGGAACTATATTCAGAAATGGGC 

AGGTATATTTAGAAATTATCGGATTGGTAATTTTAAACCAACTATGACTAATATGGGAAN 

GAGGGGATAATGGGAAAAAAT 

Sequence 2900 

CCGGGCAGGTACGTCCATCTTCCAGCTGCTTGCCAGCAAAGATCAGTCTCTGCTGATCAG 

GAGGAATTCCTTCCTTATCCTGGATCTTGGCCTTTACATTTTCTATCGTATCCGAGGGTT 

CMCCTCGAGGGTGATGGTCTTCCCCGTAAGGGTTTTCACGAAMTCTGCATTTTGGTGG 

CGGCTCCACCGCAGATGGCGGATCGAAAAGCCCCGCGTACTCGCCAGTCCAGCCGCTCCT 

GCACACACATTCACCACAGTCACAGTCGCCGTTACCTCCGCAGAGCAGCCCTTTGTGTCT 

CACGCAGGAGAAATTGTCACACTGGCAATCCCGCGTACCT 

Sequence 2901 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGTCCTGAAAGGGTGGGCAAATGTGCAGCTCTC 

CCTCAGGGTCGTGGGATCTGGGTCCMCAGTCTGGCCTTTCAGCCTTCAGGCTGTTTTAG 

GCTTGAAGTTTCGTGTGGCTGATCAAAGAAAGAAGGCATACGCAGATTTCTACAGAAACT 

ACGATGTCATGAAAGATTTTGAGGAGATGAGGAAGGCTGGTATCTTTCAGAGTGTAAAGT 

AATCTTGGMTATAAAGMTTTCTTCAGGTTGAATTACCTAGMGTTTGTNACTGACTTG 

TGTTCCTGAACTATGACACATGANTATGTGGGCTAANAAATAGNTCCTCTTGATAAATAA 
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ACAATTG NCNANTANTAN ANTIC AAN N AN AAAN N GTACCTGCCCG 
Sequence 2902 

CCGCCCGGGCAGGTACGCGGGGAGATTTTTGATATCTCAATCTGCAGCCATTCTTCAGGT 

CGTAGCATT TGGAG CAAAAAAAAAAAAAAAAAAAAAAGGAGTTTC 

TG AAAG ATGTTTTTGTTTCTTTCTTTGTAAAGG C CTTG GATATTGAAAAATACC AAGGC A 

GAACAGTTGGACAATCTATTTCTTGAGCCAMTTTAATTATTCTTAi ) 1 I I GTAATCAGT 

CATTGGCTTCTTATCTGGATGMGGCTTTTGGAGGAGAACCAAAACGACAAGTTCCAAGA 

AGAAGATGAAGCTCCGCCTCCGCCGCTTAGTCCCAACCCTGCCCAGGAAGAAGGGCCCGT 

GGGGCTTTGCCTGTGCCCGTCCACCAAAGGCTGTCATGTGTCTCGAAATCAGCAGCCCTC 

CCCATCCCAATCCCAGGCAGCTTGTGTGTACCTCGGCCGTCTAGAACTAGTGGATCCC 

Sequence 2903 

CCGGGCAGGTACTGTAACTCTCAGTCTGAGACCGAAAGCCTCAAGACCTGGGGGGCCTCT 

GGTGTGTAAGTCCTGGAATCCAAAGGCTAGTGAGCCTGGAGTTCTGATGTCCATGGTGGC 

AGAAGTCTGTCCCAGCTCTCAGAGAAAGACCAGCTTGCCTTCTGTATCTGTTCTCTCTGG 

GCCCCTGGCTGATTGGATGGTGCCCATCAACAATGAGGGCAGATCTTCCCCACCTCATCC 

ACTCACGCTGACACATTAATCTTCTCTAGAAACACTCTGAGACGTGCCCCAAATAATGCT 

CTATTAGGTTTCTAGGTATTCCTTAATCCAGTCATGTTGACACCCTAAAATTAAGTCTAC 

AGGTCTACCCCTTGCAACTTGGCACCCATATGCATGTTTTTAAANCCGTATTTTA 

Sequence 2904 

CGGGGCAGGTACGCGGGGGTGCCTGCCTACCAGACACCCATTCTTGCACGATCACCATTA 
AAATTCTCTCTTCC ATTGTTAAAAAAAAAAN N N CAAAATAAAAAAAAGTACCTTGGCCGC 
TCTANAACTAGT 
Sequence 2905 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTATTTGGGAGAGCC 

AAGTGGAACTCTTGAAGCAAAATAAGTGTTTCCCAGCCTTCACATCCTAACACTGTCAAT 

GAACTCTCTTAAGTTGAGTGAAGTGGAAGCGTGCTGAGTTACTGAAGTGCCAGGTCTTTA 

ATCCAATCAGCTCACMCCTCTTTCCTGACTTTTGCAATATAGATTGTCAATAAAGGTTA 

GCTGTCCCTTGAATGCTCATGGCAAAGAAGGTAC I I I I I i I I It II I I II I I I I I I GGAA 

ATGCTTATGTCTCCATTTATTTTATTTTA ( I I I I I I ATAAAAAAGCAGGCATAAAATACA 

ATTACATTACTACGAAGATGCAACAAAATTTTAAAAAAGAAAAAGGGGTGCAA I I 1 1 I I I 

CAGAGAGGACAGCTGATCAMTATTTATAATTTTCTAAACCATGCAGTTCATTACTTATT 

ACAATTNCAAACAAAACTCATTATTATGGGGATGGGAGTCAGGGAGAGG 

Sequence 2906 

GGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACI rTTTTTTTI I GNTTCACTCAAT 

TGGGCAATNAACATGGAGAGNCACNTNCTCTGATGGTATCTCNGANAATACTGGTTGTNG 

GACTGGCCAATAGTGCCTTCGGGACTGGGNGCACCCCCAGGNCTGCGGCAGTTGTCACAG 

CGCCAGCCCCGCTGGCCTCCAAAGCATGTGC 

Sequence 2907 

CCGCGGTGGCGGCCGAGGTACCCAAAGCCATATTTGTTAGAGTAATCAACCCATTTGGTG 
ACCCACTGA/ATGATGTGCTCAGCTGCTCTTTGGGAATGCAATCAGCTTCCGGCATGTTT 
TCCAGACATCCCCGAAGAACCCTTGCCTGTGTCTGCAACACTTCCCATGGTACAGATGGA 
TTCAGTTCAAAACCTGGCTTTACTACCCCGCGTACCTGCCCGGGCGGCCG 
Sequer;ce 2908 

CCGGGCAGGTACCAATTAATGACAAGAGGTTAGATAGAAGTATGCTAGATGGCAAAGAGA 

AATATGTTTTGTGTCTTCMTTTTGCTAAAAATAACCCAGMCATGGATMTTCATTTAT 

TMTTGATTTTGGTAAGCCMGTCCTATTTGGAGAAAATTAAT^ 

ATTTTCTCAATATCACCTGGCTTGATAACA I I I I I CTCCTTCGAGTTCCTTTTTCTGGAG 

TTTAACAAACTTGTTCTTTACAAATAGATTATATTGACTACCTCTCACTGATGTTATGAT 

ATTAGTTTCTATTGCTTACTTTGTATTTCTAATTTTAGGATTCACAATT^ 
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CTATTTTn'AACCTGTTGCACCTAAACATGATTGAGCTAGAAGACAGTTTTACCATATGC 
ATGCATTTTCTCTGAGTTATATTTTAAAATCTATACATTTCTNCTAAATATGGAGGAAAT 
CACTGGCATCAAATGCCAGTCTCAGACGGGAGACCTAAAGCCCATTTCTNGGCTGGAGCT 
ACTTGGCTTTGCGACCCTATGGGGGAGGCATAAGTGCCCTGGAGTTTGGG GTTGGNC TTT 
TTTGTAAAATGGAGGGGTTGGCCTTAATCAGNGATTTTCATTAGC CTTAAAAAi 1 1 1 1 iG 
GAAGAACNAGAACC 1 I 1 1 I 1 1 1 1 NAAAACCGGTTGGATGCCACCC 1 1 I 1 1 1 1 1 1 

Sequence 2909 . 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTNN1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 I TNTTTTTACAG 

TCTNATAGGTAGAAGGAACTTGCCTTGAGTCTCANATGAGACTTGGGACTTAGGACTTTT 

CAGNGGAGGCTGGAATGMTTMGACTTTTGGGGACTATTGGAAAGANATGATTATATTT 

TGNAATGNGANAAGAGCATGCAATGTTGGGGGCTGGAGGGGAATGATGTAATNTANATAT 

TTGACCCTTCCAAATNTCATGTTGAAATGNGACCCCCATTGTTNAAGGTGGAAGCTGATG 

GGAGGTGTTTGAGTCATGCAAGAAGATCTTCATGAATGGATTCATGCTGTTTTGGTGGNA 

GNGAGTGAGTTCTTGCTCTNMTTCAAGCAAMTCTGGTTGTTTAAAGAGTGTGGAATGC 

CCCTCTGTCTCTAGCTCCTTTCTCACCGNGATATATGCTGGCTCCCCTTAMC 

Sequence 2910 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGG 

AATCATrTTGCTTCCAACCCCATTTAGCCTGCCATTGAAATGCAAAAGTCTGTTCCAAAT 

AAAGCCTTGGAATTGAAGAATGAACAAACATTGAGAGCAGATGAGATACTCCCATCAGAA 

TCCAAACAAAAGGACTATGAAGAAAGTTCTTGGGATTCTGAGAGCCTCTGTGAGACTGTT 

TCACAGAAGGATGTGTGTTTACCCAAGGCTACACATCAAAAAGAAATAGATAAAATAAAT 

GGAAAATTAGAAGAGTCTCCTGATAATGATGGTTTTCTGAAGGCTCCCTGCAGAATGAAA 

GTTTCTATTCCAACTAAAGCCTTAGAATTGATGGACATGCAAACTTTCAAAGCAGAGCCT 

TCCGAGAAGCCATCTGCCTTCGAGCCTGCCATTGAAATGCAAAAGTCTGTTCCAAATAAA 

GCCTTGGAATTGAAGAATGAACAAACATTGAGAGCAGATCAGATGTTCCCTTCAGAATCA 

AAACAAMGMCGTTGMGAAAATTCTTGGGATTCTGAGAGTCTTCCGTGAGACTGTTTC 

ACAGAAGGATGTGTGGTGNACCTTGGCCCGCTCTAGAACTAGTGGGATNCCCCNGGCTGN 

ANGGAATTCGATATCAAGCCTTATCGGATACCGTCCGAACCTCGANGGGGGGG 

Sequence 29 1 1 

CGAGGTACCGCGGGGAGAGGCCACTTCCTTCCCCAATCCTTCCGTTTCAGCTGGGAGAAT 

GGGACAAGATTCAGGTGCCCCCAATGGGGAGAGGGCGTGGATGCGGCTGGCTTCCTGCTA 

TAAAAACACAGAGCAGCCTACCCTCTACCTGGTTGATCTGGTAGTGTCTTCTGCTATGGC 

GTGCAGGCAGAGAGGAGGCTCTTGGAGTCCCTCAGGCTGGTTCAATGCAGGCTGGAGCAC 

CTACAGGTCGATTTCTCTCTTCTTCGCCCTTGTGACTTCAGGGAACTCCATAGATGTTTC 

TCAGTTGGTAAATCCTGCCTTTCCAGGCACTGTCACTTGCGATGAAAGGGAAATAACAAG 

TGGAGTTCCCAAGCAGTCCTGGCACCAAGAAATGGCATGCATCTGTGGTGGATCCTCTTG 

GTCTCGACATGCCCGAACTGCACTTACATCCTGGGACCCAAAAAAAGCTTACCCTGAGGG 

CTACCTATGATAACTGTACCTGCCCGGGCGGCCGNTCTAGAACTA 

Sequence 2912 

CCGCGGTGGCGGCCGAGGTACCTGTATCCCCAATAATACAGCCCCTGATGTAACTATCAC 

TAAGGGTCTATAAGGCCTCACCACTCTAGGAAATGAACTAGCTGAAAATTCGGGAATGAA 

TGATCCCTTTACAGGTGTAATGGAAAAAATGGTTTGGTAGATGGAAAGGACTAATGGCCT 

CTA1TCTCACCTCCCTTGTGATCATAATAGGGGTGCTCATTCTCTTAGGGTGCTGTATTA 

TACCCTGTGTCTGCGGACTAGTTCAAAAGCTCATAGAAACAGCCCTTACCAAGACCTCCC 

ACAACTCTCCCCCACCCTATTCAGATAAACGTTTATTTCTAAGCAACCAAGAAGAGCAGT 

GAAGCCAGATCCTGTTAGAAATATATGAAGAGGAAGGACTACAAAATCAAAAGGGGAAAA 

TTATTAGACAAAATGAGTTGCTCTACAAAGGTCCAACTTCGTTGNCCTTTGTTCTGGAAC 

CmGTTCTGTAMCTGCCCTTNCCGCAAAAACTGTCTTTCCCGCCGACCAAAACTGCTC 

TTCACTATCCTGNAATCCATACCCCTACTCTATTTGAAACTGCCAATCGGGAATCAAGCT 
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TACATTGGGGGTGGGTCAACCCCCNCCCCATGGAGGNAAAAAACCCNNAAAGCCAAAGCT 

TTTGATGCCATTAGGGGTAAAAAA NCCC TTTCTTTNCCTGNCCAANGNGCTCTTGAAAAT 

GGNTACAGGCCCAGCNAGCCCCCTTTTGCANAAAATAAACCTTGNCTTTGTTGGAACAAA 
AAAAAAAA 

Sequence 2913 

ACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGA 

AGGTAGATTTAGAGGTGAGGGGCAATAAGTAAATGACTCGCACAGATGGTGTTCATGCTC 

MTGTTTCTCTTCTGAACMTTTTCAGATTTGGTGTTAAAGCACCTTCTGGCCAGGCGTG 

GTGGTTTATGCCTGTAATCCCATCACTTCGAGAGGCGGAGTCGGGTGGGGATTGCTTGAG 

GCCAGGAGTTCAAGACTACCCTAGGCAACATAGTGAGATGCTCGTCTCTATTAAAAAAAA 

AMGAAAAAAAMGGGGACCCAAGGCCATATATCTGTTTCATTCCTAGAAACTCTCCACA 

GTGTCAGATGAAGACTGCAAATCCAGTGGTCCTACATTCACAGATAATTCCTAGCCTTTG 

GTTAGTGAGGAGAATGGGATGGGAGGACACAAAATGAGGACTTTTTTGT 

Sequence 2914 

GGGGNGATACATAAAG7TTTGCATTTTCCAATAAATGATGACTGTGATTCANNGAAATAT 

GGCA TATGT GTAATTAAAMCCACTTTTTATAAAAACTATCTTTGCTGCATGCTGCTCTA 

TGACATTTTCTGTGTGTTATTTTATTNCTTTTCCTTTTMTGCTGGTCACTGCT^ 

TTAG NC AACTTGGACCACCCTCTAG AGG GG CTTN AACTGTTATTTTCAATAC ATTGTCAC 

AGGGACAGAGGAGGGTGTTAGGGAACTCTGTGACTAAAGAGTTTTCCATAMTGGGTGGA 

CCTCAGTTTACAAGCCTTAAGTTATTGGGCCAGGAGGGGGAACAGATCATGANTAGGAGA 

GTGGAAAAGAA 

Sequence 2915 

AGGTACACTTCTCTTTCCCTGMTATGACTTGCCTCTGGTGAACAGMTTTTGCAGATGT 

AGTTAAGCTCTAAAATCAGTTAGTGTTTGGCTCATCAGTAGGGAAATTATCCTGAGTGAG 

CCTGATATACACTGATATGACAGGCTATTCTTGCTGACTTGATGAAGCAAGCAGCCATGT 

TGAGGAAATCCACATGGCTAAGAACTGTGACCAGCCTCTAGGAACTGTAGGCAACATTTA 

GGGCCTGAGATT ACCCA GCTAAAAGTCAGGAGATAGGCTTGGGATTTNAGTCATACTGTC 

ATAAAAACATACATTTTTGCCAGCAAACTTNATGACCTTGGAAGCAGACTCTTTCCCAGT 

CAAGATTCCAAATGAAAATTCANCCCAGCTTGACA 

Sequence 2916 

CCGGGCAGG TAC H iTTTTTTTTTTl iNTTANAAC I 1 1 I I I 1 1 I NNTATTTTTTATNCCA 
TNATTCCATC I \ I I I CTTCTTCCACTTGACCTGCGATTCACC AACATTG GTCC AG AC ACC 
ATGCATGTNAACNGGGCTNNACCCCCATCCATTGA" r TTAA 
Sequence 2917 

cggccgcccgggcaggtacttggttaatctagca1ttgttagccaatgatgtcctttggt 

attcatcaaatttaccacagcatgggggggcctttatattcaggttttgcccaagagtta 

gcttatctgcttgttgtgctagcagggctgttgct3ccaaggccctcaaacatgagggcc 

mcacttagamccctgtctmttgtttagagaga";aggccaccagctttggccagtgtc 

ccacagtctgggttaaaactattaactatcai l i ! '* i i i i ctgacacatattgtgtaaaa 

aaaaaaaaaattataaaaaaagtacct 

Sequence 2918 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGATCA 
TGTCTCTCTGAAGAAC AGTTAAAGTGTC TTCTGGATGAATG CATACTTAAACAAAAATCC 
ATCATTAAACTTTCTTCAGAAAGAAAAAAGGAAGACATTGAGGACGTAACACCTGTGTTC 
CC C C AGCTTTC C AGGTCC ATC ATCTCTAAATTGCTAAATG AATCAGAAACAAAGGTCCAG 
AAAACTGAGGTAGAAGATGCAGATATGCTTGAGAGTGAAGAATGTGAAGCTTCTAAAGGC 
TACTATCTCACTAAAGCCTTGACTGGACATAACATGTCAGAAGCTCTTGTCACTGAAGCA 
GAGMTATGAAATGCCTTCAATTTTCCAAGGACGTTATTATTAGTGACACAAAAGACTAT 
TTTATGTCGAAGACTCTTGGCATTGGGAGA 



Page 437 of 635 



WO 01/51628 PCT/US01/00798 

Table 1 



Sequence 2919 

TCCACCGCGGTGGCCCGGCCCGCCCGGGCAGGTACGCGGGAAGAAATGGAAATTCTCTTG 

MGCTGGATTTCCCACTTGCTGCGGGTGTGTGCTTTGCTTAACATGTTAAAACCTGCAGG 

CGTGGTGGCTCATGCCTGTAATCCCAACACTTCGGGAGGCTGAGGCAGGAGGATTGCTTG 

AGCCCAGGAGTTCAAGACCACTCTGCACMCAAAAAAACCCATCTCTACCAAAAAAAAAA 

AAAAGAAAAAAAAANNNGTCCGCGGGGCTGACTGTTAA GGAAAGAGCAC CCACATCTGCT 

CCTACTTAGCTTTTTTTCTGTGGTTCTTACACAGTATTCCTTTTTT^ 

AGACTCCTCCTTTCTTTTCTmCTTGAMGAGTmAAACAGATMGAT 

ACTGATCTCTACTCCCCCAGTTTGMTGGTAAMTTTGAATGGTAAATTCCCATGAACAT 

ATATGGAAATGTCTTTATCCTACTT 
Sequence 2920 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGTGGGCGATCTGGGGGCCAGGTT 

CGGGATGAGTATCGGCAGGACTACGATGCTGGGAGAGGAGGCTATGGAAAACTGGCACAG 

MCCAGTGAGTGGTGAGAGCTCTGTCAGTGACAAACACTCCTTTGGCCTGTTGAATTTGC 

TGMGAACATCACCTAAAGTCTGCACACGAGCCCATTTTTACCAAGATTTGATCAGTGTC 

TTTACTGAGCTGGAAGCCTCTGAMGTTATTAAAGGACAGAATC CAAAA GAATGCCTTTA 

ATTCTTGTCTGAGMTCTTGGCCATGTGTCAGATTATCAGAACMTTTTGTTACCAGGTC 

AGAMTTGTGTTCTTTGACAACAGATTGGATCTGTMTGTT GATTA GTCTTTAGCCATAA 

CCACTACACTTTTAGAAAGACAGAAAMTGTAAAGAA TTTGTTTT TACCATAATGAGTCT 

TAAGTAG GTTCATG ATCTACATTG N GGCCTGGGATTATTTTTTTAATTTA 

Sequence 2921 

CCGCGGTGGGCGGCCGCCCGGGCAGGTACCCTT TCTGAT TGTAGAGAACCTTGTTTCTGC 

AGGAAGCCTAGCTCCAAGCACGCAGTCTGTAGACATTTTTGCCTTTGCCCTTGAAATGCT 

TGCAAAATACTTTGTTAACAAAAGCTGCAMGAGAGMCATGCCGTGTGCCTTTAGTTAG 

CACAGCGGGCAGCCTCAGTGAAACTCTTAGGTTAAGCAGTTAAGTCCTGGAACCCAGAGC 

TGCTGTGTATTTCGAGCGGGCAAGTTTATCTTTTGCTATACTTATTTTCMTTCMTTAC 

ACCACGATTCAAATAATTCCCCTCCTAAAACCAAAAAGGAGGGAAACCGTCAACTCCATT 

GCAATTACTTATCTTCCTCTTCTATCTCTGGTATACGCCGGGGCATAGAATGCTCGTATA 

CATCTCTTTAACAACCACAAACCTTAAGCCATGTAGATGAAAGTTAGTGCATCAACGGGA 

TACAGTTCCATATTGCCTTAAACCTCCTTGGTTTAGACACACTAACATTTATACCA AATT 

GCAGATTATTCTGCAGAGANGGMTTGCATGTTGGGTTGGATATTTAGTATGMCTTTTT 

TCAGMTATMTATTTCTTAGTTATCAAMGTANGTTGGAAMCATTTGCAAGACTATGA 

MCATAGMTTGCTGGTTTTATATTTTMCTGGCAGMTGGGGMTTTCACTGGCCTTAN 

ATTAATTTAAT 
Sequence 2922 

TNTAAACTCCCGGCGAATNGGTTTTNCAACGCGGTGGCGGCCGCCCGGGCAGGTACAGTN 

GCACAATCATAGCTTAGTGNAGCCTCATTCTTTTGGGCACAAGCGCTCCTCAGCCTCCCT 

CCTCANCCTCCCAGTTGCTGGAACTTACAGGTGCACCACCATMCCAGCTAGTTTATTTT 

TTGTAGAGACAGAAGTCTCACTTTTTTTCCTAGACTGGTCTGAAATAGACCATTCTATTT 

GCTAGAATGGTCTCAAACCATCCTTCTGCCTCAGCCTCCCAAATTGCTGGAATNTCAGGC 

ATMTTCATCATGCCTAGCCATTTCTTTTTTCTGCTGCATTGCAGAGTTAAGAACCAATT 

TCATTTGTCCTGCACTGAGAATGTGCATTCCCAGCAGATCATGGTTGAGCTCAAGCACAC 

CTGCAACTCAGCCAGCCTCTTTGAAGGGTGCACAAGTTACTAACTGTCAACGTTGCACAA 

GCCACCTTTAGCAGGTATGTTCMGAACTTACATTCCCACCCTTTCTAAAACAGCTCACA 

GTMCAAGMGMCGGGATTTMCTTTTGGACATGCACACTCATGAAAACCAATGGATTT 

TGGAAAAGCCAAAATTGGTTGAAAAGAATAAGGGGGG 

Sequence 2923 

TAAACTCCCGGCGAATGGTTNTCAANGCGGTNCCGGCCGCCCGGGCAGGTACGCNNTCAA 
AGTTAATGGGGTGGTTTNATCTCACANNTTNAGAACTCGTTCACCTCCAAGCATTNCATG 
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ANAAAGCTGCTTCTTATTAATCATACAAACTCTCACCATGATGTGAAGAGTTTCACAAAT 

CTTTCAAAATAAAMGTAATGACTTAGAAAAAAAAAAAAANAAAAM 

Sequence 2924 

NCGGGGGAGAAGGCGGTTTGCGTATTTGGGCGCTCTTCCNGCTTTCCTCGCTCACTGNAC 

TCNCNTGCGCCTCGGTGCGANTCNGGTNTTGCNNGCTGAAGCCGGTTATCAAGCCTCACT 

TCAAAAAGGGCGGNANATTACCGGGCTTATCCCACANGNANATTCAAGGGGGGGAATTAA 

ACGNCCAGGGAAATAANAAACCATTGNTGNAAANCCATAAAAANGGGGCCCANGNCCAAA 

AAAAGGGCCCCNATNGGAANAACNCNGNTAAAAAAAAANGGGGCACCGNNCGGTNTGNCN 

TTTGGGCCCGN II I I ! I I I 

Sequence 2925 

CCGCGGTGGCGGCCCGAGGTAC M I I I I I i I I 11 I 1 1 I I I I I I I I M 1 GGGGNGCTTAAA 

ACAACNNCCATTTTATTTAATATCTCATGATTTTGTGGTCAGGAATGTGGACAGAGCTCA 

GCTGGGAGATTGTTCCACTCCATGTAGCATCAGAAGACAGGTCTCTTGGTGATATTCAGC 

TGGAGAATAAGTTAATATGCAGAGTTCAAGATGGTTTCACTCACATTGACACCTTGGCAG 

GGATGGCTGGAGGGCTGGGCTCAGCCAGATCAGTTAACTGGCGTGCCTGCATATGCACTC 

TCCACCATGGTGGTCTCTAGAATTGCTCAAATCTAGAATTGGCGATGCATCCCG 

Sequence 2926 

CCGGGCAGGTACCCGGGCGTGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAG 

TTAGGCTACTCGAGGGTCTGGGACCCACTTGAGGAGGCAGTCTGTCCGTTCTCAGATCTC 

CAGCTGCGTGCTGGGTGCTGGGAGAACCACTACTCTCTTCCCCGCGTACCTnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCGCCG 

CTTTGGC GTMT CATGGTCATTAGCTTGTTTTCCTGTGGTGAAAAATTTGTTATCCCGCT 

TCAACAAATTTTNCAACACAMCCATTACCGAAGC 

Sequence 2927 

CCGCG GTGGCGGCCCGAGGTACGCAACATGACATTGGCTGGTGTAAAGATCTTACAATTA 

nit l AAAGTTTC ATTGTATTCATTTG ATTATATAAGTTTCTTG CC ATC ACAAC AAAC AC 

TCCTATTCATGTCTAATTCGATACCTGGTGAGTCCACATAAGCATTTGGGTCCCCACTGA 

TGAGAGCACCATGGCCAGGTCACCCACTTGATCAGGTTCTCTTGCTTTTTGNTGNGCATC 

TCATCTTATTGTAGGCATTCAAACTGTGGGCTCAGGCGAGCCTTTCCAGTGTGATCATAT 

TAATTTTATATTTTATGTGTTMCATGCGCCTCCAATAACTTGCTTTCCTTGCTCACCAT 

GGGTGATGCATTTTTCCTACTTTATGTCCTCGCTCATTGATTATGATGAAGCTGCTGACT 

ATTTGAACCTGGTACCTGCCCGGGCGGNCCGCTCTAGAACTA 

Sequence 2928 

CCGGGCAGGTACCCGGGCGTGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAG 

TTAGGCTACTCGAGGGTCTGGGACCCACTTGAGGAGGCAGTCTGTCCGNTCTCAGATvTC 

CAGCTGCGTGCTGGGTGTTGGGAGAACCACTACTCCCCCGCGTACCTnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTATTATCAAGCTTATCGATACCGNNGACCT 

TTNAGGGGGGGGCNCGGTACCCAACTTTTGTTCNCTTTTAATGAAG 

Sequence 2929 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTATATCTGAAAAC 
CTAATTTCTTTCTTTTTTTGATAAGGA 

CTGTGMGAGMTTGTTTCTATAGTAACTGGTCTGTGATCTTTTGTGGCCAAGAG^ATAG 

CAGGCAAGAATTAGGGCCTTGACAGMTTTCCACGAAGCTCTGAGMCATGTTTGTTTCG 

AATGTCTGATTCCTCTTTGTCATCAATGTGTATGCTCTGTCCCCATCCTTCACTCCTCCT 

CAAGCTCACACCAATTGGTTTGGCACAGGCACAGAGCTGGTCCCTAGTTANAGTGGGCAT 

TTATGTTAAAAAAAAAAATAGTTCAGAAATC 

Sequence 2930 

CCGGGCAGGTACGCGGGGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGC 
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TGGATGACCAGAGTGCTGAAACCCACANCCACAAGCAGTTCCANATTATATAAAGCGGAA 

AGNCAAATGATGAGNAGCAATGAAGCATTCCGATGTGA TTTGA TATGNCATGGAACCTTT 

NCAMGTCAGTCCGTTGMNTTCCACCAGCCCATTGTAATTTTACAACAGNNCATTGAAA 

GTATATTGCTNGGGTTTGTTAAGAACNCCCCAANAANGTTAACGG GAANG GAAGNATTNT 

MCCACCCTTGG AAAMTTTTCCN GTTATTTTTTrCTCC AATGG AAAT^ N AATTAG 

GATTNGCCAATTCNTATNTTTGGTAGGGGTTTCAAAAN TTTANA NNAAAN GGGGAG GNNA 

AAAAAAAANAATTTANCCANMNTTTTTNCTTTCCAACC I till I NGCCNAAl M I i iAA 

GGTTTCCNAAAAAAAAGNAAAAAAAAAAAAATTGGGC 

Sequence 2931 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCG GCCGA GGTACTCGCCCGACGGGT 

CAGTGCTGACAGTTGCTTGTGAACAAACTGAAGTTCTGCTTTTTGACCCTATATCTTCAA 

AGCACATAAAAACACTTTCTGAAGCTCATGAAGACTGTGTAAATAATATCAGATTCTTGA 

TAACCGGCTGTTTGCTACCTGCTCTGATGACACTACAATAGCACTATGGGATCTGAGAAA 

ATTGAACACCAAAGTATGCACTTTACATGGTCACACTAGCTGGGTGAAGAACATCGAATA 

TGATACTAATACMGACTCTTAGTMCATCAGGATTTGATGGAMTGTCATTATT TGGG A 

CACTAACAGGTATACAGAAGATGGGTGTCCACATAAGAAATTC7TTCACACACGTTTTCT 

CATGCGAATGAGGTTAACACCAGATTGTTCCAAAATGTTGATTTCAACCGTCCTCTGGAT 

ATCTCTTMTTTTGCATGACCTTGACTTMCTMGTCTTTAGAAGTAGGCAGCTATCCCA 

TTTT 

Sequence 2932 

GGCGAATTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCAACTAGAGGA 
TTTCTGTGACCTTCTCAMCTCTAAAATTTTTGGCACTGGACTGTTCCACTTTGAAGGGT 
CAAGAAAGAAATCTGGAGGTTGCAGAAATCAGGGTCAGGCGCACATACACGCACGCACAC 
ACACNCACACACACACACACAGATCCAGCAGCATCCTTGGAACATCCTGAAGTCAAGAAA 
GGCACAGCACTGTCTGCCCCGATAGAAGGAATTCCAGAAAACAGCCACCGAGGTTGGGCC 
ACAGAATTTCCAAAGAGAGCAATCTACAAACTGGGGGTTGATTTCCCAGTACCTGGATCT 
AANCANTCAAACACACCGAGAAGGG A I I I I I I I I I I FGAAAGCTGCCCACCTTCCAATT 
Sequence 2933 

CTCCCCGCGGTGGCTAAACATTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCTn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnCNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnCATAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTTNCAAACNGGAAACCTG 

TNATAGCCANNTCCATNACTGAAATCGGNCAACNCCCCGGGGAGAAGGCGGTTTGCNATN 

TNNGGCGCTCTTCCCCATTACTNGTTTAANTGGACNANTTTNNAANTNTGGACGTTCNNC 

TTCGGNAAGCGGANNTCAACNTCACTCAAAGGGCGGGTAAATACCGTTTATTCACAAGAA 

TCAGGGGGATAACCNCAGGGAAAANAACATTGTGAAGCAAAAAGGCCAACCAAAAGGCCN 

GGAAACCNTAAAAAGGCCCCGTTGGCTGCCGNNTTTNCATAGGCTCCCCCCCCCTGNNCN 

ANGCATCACAAAAANTCNNCNCTNAN 

Sequence 2934 

GCGAATGGACTCCACCGCGGTGGCGGCCGAGGTACl rTTTTTTTTTTTTTTTi INTTTNN 

TNNAGGATACAGGGTCTTGCTCTCTTGCCCAGCCTGGACTGCAGTGACAGGTCACTGTAG 

CCTNAAACTCCCGGGTTCAAGTGATCCTCTTACTTCAGTTTCCCAAGAGCT GATAT TACA 

GGTATGCACCACCATGCCCAGTTAATTTTT AMTnTTCT ATTGCTGTCTGTATTTTCAT 

ATCAAGATAAAAGCACAAGTTTAGATTTACC ! I I II I I I I AAAAAAATAATCTTTTACTG 

GGAGGTGGCAAGAGACCCATTTCTTCAGAATTCCTGTATGGCTCATATTCTGNGTCAGGG 

ATTTGGGGGGAAMGACAATTCCNGGGATAGATGAACCCCCGCACTCATNTCTTTCAAAC 

TTAAACTCTNCAATGCCACAATTTT 
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Sequence 2935 

CGCGGTGGTCGAGCGGCCCGCCCGGGCAGGTACGCGGGCATGATCAAAGAGCAACATCTG 

AGTGTGTCCTGAGTGAAGTTCTGTAAAGATAAGAAGCAGGACTTGCCACCGCATTGCAAG 

AGAAAGTTTACCAGCATCAATTAATTAAAATAGAAGGCAGCCAATCGGCATTGGGCACAA 

GGGAATGATGCTTGTATTGGAGCATGATGAAGAGATGTCAGCTTTGTCTCAGCACCAACA 

TCACTATTAACACTTGCTATCTGTCACCMTCAGCAATCTGTTCCTGATGTAATdGAGAG 

TTCCAAAGCACAGCTCAATTCTTCCACCTCTCTGAGAAGAAGTGGGTGTGTAGGCCTTCT 

GACAGTGGGAGAAAGTTGCACTCTTTGGGACTCAGTTTCTTCTCAAAACTCATTCANGAA 

ATCATCACGTGTCACTGTTAGCTAGGTCCAGGATGTTTGAATAAGAATTGTGGTTATATT 

ATAGTAAATTG N GTAAAAAATCAGTTAAGTTATGGGTAATAGGTATGGC AAGTTCC ATGT 

GCC N AAG AATC TTATA AAGTTTTTTACATTTAGTAATTC ATTTMGT^ C CC 

TACAANGTATNATTT7CCTTATTTTAAAGAGGCAAAACTGAGGTCCCTCGCCCGCTCTNA 

ACTAGTGGATCCCCCGGCTGCANGNATTCGATTTCAAGCTTATNNGTCCCNCCNACCTNC 

AGGGGGGCCCCGG 

Sequence 2936 

GAATTGGAGCTCCACCCGCGGTGGCGGCCGGGCGCAGTGGCTCACGCCTGTAATCCCAGC 

ACTTTAGGAGGCCAAGGCGGGTGGATCAATTGAGGTTGGGAGTTCGCGACCAGCCTGACG 

GATATGGAGAAACCTTGTCTCTACTAAAAATACAAAATTAGCCAGGCCTGGTGGTGCACA 

CCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTGGAACCCAGGAGGCAG 

AAGTTGCGGTGAGCCAGATTGCACCATTGCACTCCAGCCTGTGCAACAAGAGTGAAATTC 

TTGCCTCAAAAAAAAAAAAAAAAAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTTAATTTGCGCGCTTGGCGTAAT 

C AATG N G N C ATAAGCTTGTTTCCTGGTGTG AAATNG NTATN CGTTC AC AAATTCC AC AC A 

ACATANCGAGCCCGGNAGCATAAAGTGTAAAA 

Sequence 2937 

NGGCGGCCGCCGGGCAGGTAC 1 1 Ml 1 11 NNTGN T1 I I U i I U 11 11 NTTTN NCAG AG A 

CAGCTTTNTATTCTGTCACCCAGGCTGGAGTGCAGGGGCACAATNTCAGCTCACTGCAGC 

CTCCACCTGTTGGGCTAAATCCATNTCCACTTCAGCCTCCTGAGTAGCTGGGACCACAGG 

CACGCATNACCACTCCCGGCTAATTTTGNTTTTCATAGCACATTACATTACCGATTTCCC 

AGTTTATAAGTAACCMGATGCTCAMCAAGCAAATATTTTCCTTGAGTATGACTTACTN 

CAAAAAAGAGAMGMCAGAGAAATTGGTTCTGAMGNGCATTTTrCTAMGCCCATTTC 

AGTATTTGNTAGCTCANATCATATACAAGTTTTAATCAAAATGCTAAAGAATATTNTATT 

AATANTCTTCTATTCCATC 

Sequence 2938 

AGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGA 

ACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTNCCACC 

TTGCCCAACAGTNGAAAAACATATGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAA 

TGGANGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCT 

TGTAAGAAGAGGCCANAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAAGAAG 

CCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATNCTN 

ATCTTGGCTTTCCMCTTCCAGAACTGTGAGAAGTATATGTTTTGTGGGTTAGTCAATGG 

CCTATGGTAAT 

Sequence 2939 

CCGCGGTGGCGCGGCNCGCCCGGGCAGGTACGCGGGGACAATTGGTTCATTTTTATAGGT 

GTTGACCGTTATGCCTATAAATAAGCCTCCTATAGACATACAGAAATCATATCCTGTGGA 

ATTAGAATATAAGACTTGGTAAAAGAGATTTCAAAGTATTTTACTTAACTTGTATACTT 

AAATCATTTAATCCAGACTGAAGTTGTAAAAGCCAGCCAGTGTTTTCAATATAGACTTCC 

ATGTTTGACCATCTGAAAATGAAAAACACTAAAAACATCACATGCTGTTTAGGAGCTGGA 
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AATTTTAATATTTGACTTCAAGTAGATGG I 1 1 1 i AACTCCTGAAATCGAACTACGTTTAA 

GTTTGTATGTTTATTACCTGTTTGAGCACTTAAGTGCAATTGTGGGAGCGGGGATGTCAA 

GTTCATTTATGTGACTCTTTGGCTCAACTTACATMTCTTTGTTTTGATATCACAGTTGT 

CTMTTATTTTACTTTGTAGCTTAAGGCAGGCTGMTTGTTGATAAAATGGAAAAAGTAG 

TATATTGTTATATAAGCTTCTGAGGTGTGTTTTGTTGTATAAGCCCTGGAGGGTAAAAAG 

TCATNCCTTATGTATAGTAGTTAAAGGCATAAAACTGTGACTTTTAGATATTCACCAGAA 

CCAGACTTATTTGGATGGGGATAATAACCCMTGGATTAANCATTTGGTTGCTTTTGGTT 

TAATTTATCCGGGT 

Sequence 2940 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTAAACATG 

ATTGAGCTAGMGACAGTTTTACCATATGCATGCATTTTCTCTGAGTTATATTTTAAAAT 

CTATACATTTCTCCTAAATATGGAGGAAATCACTGGCATCAAATGCCAGCTCAGACGGAA 

GACCTAAAGCCCATTTCTGGCCTGGAGCTACTTGGCTTTGTGACCTATGGTGAGGCATAA 

GTGCTCTGAGTTTGTGTTGCCTCTTTTGTAAAATGAGGGTTTGACTTAATCMGTGATTT 

TCATAGCTTAAAA I 11111 I GAAGAACAGAAC I 1 1 I 1 I 1 AAAAACAGTTAGATGCAACCA 

TATTATATAAAACAGAACAGATACAAGTAGAGCTAACTTGCTAAAGAAAGGATGGAGGCT 

CTGAAGCTGTGACTTCATTATCCCTTAATACTGCTATGTCCTCTGTAGTACCTCN 

Sequence 2941 

AGGTACCGCGGGGAATACAAAACAAATAAGCCATCACGCCTGCCCTTCCTTGATATTGCA 

CCTTTGGACATCGGTGGTGCTGACCAGGMTTCTTTGTGGACATTGGCCCAGTCTGTTTC 

AAATAAATGAACTCAATCTAAATTAAAAMGAAAGAAATTTGAAAAAACTTTCTC 

CATTTCTTCTTCTTC I kill I AACTGAAAGCTGAATCCTTCCATTTCTTCTGCACATCTA 

CTTGCTTAAATTGTGGGCAAAAGAGAAAAAGAAGGATTGATCAGAGCATTGTGCAATACA 

GTTTCATTAACTCCTTCCCTCGCTCCCCCAAAAATTTGAA I 1 1 I I I I I CAACACTCTTAC 

ACCTGTTATGGAAAATGTCAACCmGTMGAAAACCAAAATAAAMTTGAAAAATAAM 

ACCATAAACATTTGCGCCACTTGTGGCTTTTGMTATCTTCCACMGAGGGAAGTTTAAA 

ACCCAAACTTCCAAAGGTTTAAACTACCTCAAAACACTTTCCCATGAAGTGTGATCCACA 

TTGGTAGGTGCTGACCTAGACAGAAGATGAACTGAGGTCCTTGGTTTGGTTTGGTCATAA 

TAG AAAGG N GCTAATTAATAG N ATTTCAGATACTTGN AG AATGGTG ATNGGGCTAG AANA 

ATTTGGANAANAAATACTCCTGTATTGAGTTGTATCGGGTGGGGGATTTTTTAAAA 

Sequence 2942 

CCGGGCAGGTACTTGGMTGCTGTGTGGAGTCCT1TTTCTACTTTTGTTCCTGAGAAAGA 

GAGATTCACAAACTTCCCACATTCTTCCAGTGGCATCTAAGGATCCCTTCCTAAATGGCC 

TTTCTCCCAGCCAATAGACTGAGCAAACCATGTCCCTGATGCTTCCTGCCCACCACCATC 

CATTCTTCCMCCAA.CAGCTTCTCCGGCCTTTCACGTGCCATGCACTTTGCTAAGAGCTT 

GAGGGAMTACAGCAATGAAGAAGAGATGATTCTGTTTGGACCAGTGCTCAGGGATTCCT 

CCTATTAGAAATCAAAGAAGGCTTCCAAMGCCCTACAATTTTTTCCCTGTACTAAAGAA 

AATAGCTTCTCAATGGTACTAAATTTTATCTCATACGCAAATTGGAATGTATACCCCTTC 

ACTCTGAATTGCTT3GAAATATTNCAGGAGAAATAAATCAACTCTGTGTNANGGTCTTCT 

AAAAAAGCAGMGTCAAAGGTCACCAGAAAGACCTTTAAAATCAACTATGTTTCTGACAA 

TGTTGCTTTCGMACTGGTACCCCACTTCCGCAAAATCACCATTGNAAAATCCTTTGTGG 

TGTAATTAAGMTAATCATGAATGCCCTGAATGGTCAAAAATGAGTGGTTTTGGTTTTGG 

T 

Sequence 2943 

ATGGGCGAATTGGAGCTTNCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTTATTTCAT 
TTACTAGCATTTTAGGATCCAGAAGMTTCCACCAGATTGCATGAGTTAGATTGGGAAAT 
GGGAGTGGGAGATAATATTGGGAGGTATCTATTTTAAGTCAGGGGCTTACTACCGATTTA 
GTTCTCACAATAACCATGTGGAGAAGCTGTGACATTTrTMTTTACAACCTTTCTGGGGC 
TC AG AC ATAAAGTTACCTATCCAANGTTG C AGTTGGGTAGTGGTGGG ACCAG C ATGG AC A 
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ACTCATTGGCCCTGCCTCAAAAGCCATACCTNTTCTCCTGCTATGCAGAATCTGTTTCTC 

CTGAATCTCTGNGATGCTGGTGGGAATTGTTTGCATAGAGGAANGACANTAACCCTGCCN 

TCGTGAGTTAATGTCCGGGCTGGTCACAAGNGGTTCATGCCTGTAATCCCAACACTTTGG 

Sequence 2944 

GGCGGCCGAGGTACATCMCMTGACTGCTTCTTATTTCTCTTTGAATCTCCAATACCTG 

TCACCAATGTTTGGCACACAATAGGCATTCGATAAATATCTGTTGAGTGAACGCAAGAAA 

GAATAAACTCCCACGAACATCTAAAACAACAGTCCTGCAAACAGAGAGTTAGCATACAAC 

AAACCATGTGTCAACATTCCCTCTTGTTAGTCCCTGCAGATCTTAGAGACTGGAAAGCGG 

G GG G ATATAGAAGGACTGGAGGGATATATTCC ATATATCACTG G MGGTAATAAAGGTGG 

CCTGGTTGAAATACTCCATTGACATAACTAACAGTCACCAAACAAGCCAAGTGGAACTCT 

ACAGCTACCTTAAAACCCTCTTCACAATAAC 

Sequence 2945 

CCGCGGTGGCGGCCGAGGTACATGGTTTCCATTGCAGCGAATAGACTAATATTCCTTCTG 

CTGGCAACCTAGACAAGTTCCCTAAGCTTATGGGTAAAAATGGAGTCAAAGAGAAGTGCC 

AGTATCCAATCTTCAMTATTTCGCTGAGGATCACATGAATTTAAAAAGAAACGCTCCGA 

ACAGGAAAAACAATTGGCCAAGCCTTGGGAAGGAAACAAGAGGGANGGCATCCAGAGACT 

CTGGCTGCCTGATGTAGGGCCTTCGAGGTTGAACAAGGGTGAAGGGGGACCCTCACATGG 

GAACAGCGTGATGGAGGCAGTCTGCTTATGAGCTGGGAAGTGAATTGTGTAAACATTATT 

GMGTCAGTTTGGMGTTGCAAGCCACAAAAAGCCCTATCTGAAAACTTGTTGAAATTAC 
CCTTTG 

Sequence 2946 

CCCCGCGGTGGCGGCCGAGGTACACCGAGMCATCACCTGACTTCCCTAAACTTCAGTTT 

CCTTGTCCTTAAGAGAAAATAAAAATATCTTCCTCAATGATTMTTTCTTAAAAATCAAT 

AAMTGTGAAATAATATCTGGGATGGATTGTGACAAATATTATGAATGAAATTAATACAG 

TTTCCTTTAATTATTTATCANAGTTCAGAGAAACACAAATGAMTCATGAATACTMTTA 

ACTGTG TGCTTTGTTTAGAAAATGCATTACATCTCTTCTCAGTATTCAAGCATGCCTTCT 

M ) ll lATTTGAAATTATCAATAGCATTCTAATCTGNCTTCATACTGGTGCTAGAAATTC 

ATCACAGTTTTAAAATC 

Sequence 2947 

NCACTCCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTAAAAAA 

ATAAATCTACCACTAAATATAGTAAGGGAGTTTTAAACTGAGGTTTTTGAAGTCTTTA^ 

AATTTATTTAGAGACTTCTTTTGTTGGAAGTATGGTGGCTAGATGCCATGAAAAACCCCT 

GAAAACAAGTGTMGTGCTAGGTMTATGTTGAAACATATCTTTTGAAATGTATTACTGA 

GCTGGCAAGAAAGTATGCCATTGTAGAGAGTGAAAATAAGTGAAGGCAGAATCCTGGGAG 

GTMGCCAGCCTGAAGATATTGATGACACTGGATGGCCTTMGTTTCCATTTTGACTGGC 

ATGTAATCCACCAAAGATTCCCACAGAAATC CG AGGTTCCAAATGGTAAATCCG TGGTG A 

CATTGGGGTGACCCGAGAAATGAACTCTTGTGCAGAAAGGAATGGTAACAACACTTGTCT 

CGTCAACTTTGGCTCTAGGTAGGGGAAANGAAAACAGCTTTCTAGAGAAATGGTNACCCC 

AAGCTAGCCCTCATGAGANATTCAGCTAGGGATNGATGCTATCTGTGTAGTGCCAAAAAA 

AAAAAAAAAA 

Sequence 2948 

MTTGGA GTTTN CCGCGGTGGCGGCCGAGGTACGTGGCGTAGCATTCCATGTTCAGCTTT 

GACATTTATTTTCTCATATCMCCCTTTTTACACGTGAAACACAATCTCGCTCTTGAAGT 

CTTAACTG CATGATCTGTGAMCCTGTATTTATATTTTCTTC AGGTATTCTTG TCGTGTG 

TGTGTCCTAAACAMCCAAAAGAAMCTTTCCAAATCTAAAGTATTCATTCTCCAATTGG 

AGCAAGAGGAGTCAGTTAGATACTATCACGGCATTCATTTGTGGCTGGCTTGTCATATTT 

ACTTATGATTGATMTAAATCTATTTTGCTTTTTAGAGCCTCCCGAGAAGCCATCTGCCT 

TCGAGGTATTTAAGTTTCATGATTTCATTTTGAATGACT^ 
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AAGTATACATTCTTTATTAATCATTTTC 

C AAAAAGTCTGTTC CAAATAAAGCCTTGG AATTGAAG AATG AACAAAC ATTG AGAG C AGG 

TAAATTTTTTAATGGAACTATGCCMGACCAATATT^ 

CTTCTATTCCCAATGCTTTAT 

Sequence 2949 

CCCGCCCGGGCAGGTACGCGGGTATNGGAGACGATTGAAAAAGGATTTGAAGAAAGGANT 

ATTGCCAAGTCTCCACTTTCTCAGGGNCTGAANTGMCTTTTGCTCACAGCAGGTGAACA 

CAGGCTCCCTCCCACACTGAGACTTCCTCCCCTGCCGGTGTGCAAGTCTGT TAAGTA AAC 

AGAGACGTTATACTGAGAMGGMCATTTCTCTATTTTGGCTATGAGTNGGTATTTTTAA 

AGATCATCCTCCAAACATTTNCCCCTATCCTTGGTTATTTTTTCACTAGAAAGAANATGA 

ACTGCTTAAAAAAAAAATTCAACTCCATGACAAGAAAAACCCA 

Sequence 2950 ' 

ATAGGGCGAATTTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATTCTGAGCGATCTGCA 

TTTGGTGATGATGGAAGCCATGGACATGCACTAGATTGTCTTGTGGGAGAATATGGAGTA 

GTAAGAGAAGGAGAAGACCTAGGATTGAGCCCTGAGCACCTCTGGCTTAATGTTGGATGG 

AGGAAATTGAATCTGTAAACAATACCGGGAGGCTGCAGCCTGAGAGGCANAAGGAACTGG 

GGTGTTTGGGATTATGGAAGCCAAGGGAAAAAGCCTGTCTCACAGCGGGMGGGAGGTAT 

CAACATTGTAAGCTGCTTCAGATAGGTTATGTAGGATGCCGACTGAAAAATATCTGTAAA 

ATTTGGCMCCCCGATTCATTGGTMTCCTAGGGAAGTTGCTTTTTTGGG 

Sequence 2951 

CGAGGTACCACCTGAAGTTCAGGGAGTGGGCATGGGGCCTAAGGCAGGCCAGTGAGAACT 

CAGAGGCACTAAGATGTATGTTCCCACTTGCAGAAGAGGTGCATGCTCACCCAGTAGTTG 

CTAAAAAGGCTGAAGGTAGGTGTAGAGATGCTTTTGGGGCCATGGTTACCACCCTTGAGG 

TAAGGAGGGAGGGTGAGTCTGCCTAAAAATGAACCAAATCGAGAGTGCAACCACMCTGC 

ACTAAATGGAAAGAAACAGANATAAGAGAGATGGGCTCCTCTCACAACACATTTGGGGAC 

CTGAACTCAGCCCCGCCTCACCCTANGTACCTGCCCG 

Sequence 2952 

CCGCGGTGGCGGCCCGAGGTACAGTAGGGTATTACACAGCAAAAAAGCCTTCCTGCTGGA 

ATATTTCAGCATTTCAGTTATTTATTACATTAGCTTTATGGCATATCACMCCTATTTGA 

GACAGGTAGAGTGAACCCTTTTCTGCCACACAGAGGCACATTAAGTGGCCATCATGACTG 

GTGAGACTTMTTGTTtTACACCGGGCAGTTCTCTGTCGTGATTTAAATCCCTCGTAAAG 

CAGCTGACCCCAGTTCTCAGGATGATTTGGATCCCCCGCTAACTTGGAAAATCAGAGCCA 

GCTGAGAGTTGGCTTGCACATTTTAGCATCTGGTAAAAGCGTGCAAGCTAGTGAGATGTG 

CAGGGMGGAGTCATTGGCMTTGTATGTGAAMCAClTMTTAATrGTTTATCTCTTTT 

MTTTGGTTAMTGATCATAAAATTGCTCGCATTATTTT.GAAGATGTCTTCATGTTTATA 

AAACTGTCTTCATGTCTTC AC ATATTTATAAAAC AAAGT FNC AATCATTT 

Sequence 2953 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTGTTCA 

GGGAATTACAGCATAGATATGAMGATGTCTGAAAA/iTTGGAAATATCAAATAAAAATTA 

AGAAGCCTAAAACTACATCAGATTAATATGTAGTATAGTCTTTCACATCTGGCCATGATG 

GATGGAGCAATGGGATTATATATACATTCTCACAGTAAACAGTTTTCAGATATTGAACAA 

CAAAAAGTAGGCAGTGCTAGCCAGAGATTCCTTGGAGCCAAAAAACAAAACAAAACAAAC 

MTAAAMTTGGTAATCTCTATATTGCTCC/,GTTTGCTGCTTGGAGAAAGTTCTCAGGCT 

GCAGCTCAGAGAGGAAAAATCCAGAGCTCAGTGGTCTGAATGAAGTTAAAGAGGATTTGG 

G G C AGC AAATG CCAATTTAG AATTCAC AGGCC ANAAGTGAGAGG AAGTGTAN AG ATGG AG 

AGGGGTGTGCCTGAGGGAAGANAGAGAAGAGCCANNANAGGTNGAACTNCCGAGAGAACA 

AGTTGC AAAGTAAAAAAN N AANTAANGACTGGGG NGCTGCAGAC AGGTTTC AGG NTTTAA 

GTNTGGATCCNGNGCCCCCCATGTGAGGAACCTTCAGGGCAAATANCTTCTGGAAANGAA 

GCNGATTGNACAATGCTNAATNGGACAGAAAANGGGTTAAAAATAATTNNCTNGGCCTCC 
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TMTNNGCCAAGNTNGGCNCCTTAATTTTCCCCANAGGTTGGGGAAATTACCCCNAAAGG 
GTTTGGCNTAATACCAGGNNNCTNGCCCGTTTTANAACTANGGGNNCCCCCGGCCTNNAG 
GAATTNGATTAAACTTTTCNTTCCCGNAACTNNGGGGGGCCCGG 
Sequence 2954 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGTCGAATGT 

AGAAGTGA/.TAAATAGCAAGACAGAGTCACCCGGCTGGCTTGGACCTGGACTGGCCAAGC 

TGGTGGTCTACACCATGATTTGCTGGATAATCCATGCAGCTTTCCCTGCATAGACACAGA 

TGCAGCCCCTCTGTCCAAACATGATGCTCCCTCTGTCATTGATGGGAACCAGATGGAGTC 

G CGCCTGTAAC ATG ACTG AGC AC TG ATTTCAGTAAAGTAAATGAATTACCAAATTGTAGT 

CATTATAATCTATCTrCATCTTAGTGACACACTTGACTGGCTGCTATAAACCAGGATTTG 

AAGAGTGAATGTGGATTTACAGACCTCTCATCTCTGCCCACAGGATCCAAATCTGGCAGA 

ATTGTCATCTCTCCCGCGTACCTCGGCCGCTCTAGAACTA 

Sequence 2955 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAAACAAACCATATCTAAACCATAGC 
AGTGGCATCCTTTGAACTCACGGATCCAGTCATAACTGAAGTTCCATCTACACCTGGGGT . 
CTTCAGACCCAGGCTCTGGAAAAGATCCTGTTACTTACAAGCAAAAGAAATTCTGCCTCT 
TACCCTCAGCATGTGAGTACCT 
Sequence 2956 

CCTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATGGTTTCCATTGCA 

GCGAATAGACTAATATTCCTTCTGCTGGCAACCTAGACAAGTTCCCTAAGCTTATGGGTA 

AAAATGGAGTCAAAGAGAAGTGCCAGTATCCAATCTTCAAATATTTCGCTGAGGATCACA 

TGGAATTTAAAAAGAAACGCTCCGAGCAGGAAAAACAATTGGCCAAGCCTTGGGAAGGAA 

GACAAGAGGGAGGCATCCAGAGACTCTGGGCTGCACTGATGTAGGGCCTTTCGAGGTTGA 

ACAAGGTGAAGGGGGACCCTCACATGGGAACAGCGTGATGGAGGCAGTCTGGCTTATGAG 

CTGGGGAGTGAATNGTGTAAACATTTATTGAAGTCMGTTTGGAAAGTTGGCAAGCC 

Sequence 2957 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCTGATCAAGCTAGGTCATGT 

GCTGGACCCTGGGAAAGCGGACACAAATATCCTGCCCTTACTGGCTGAATACGTAATGAG 

GGGGACCCAACATGGGAACATATAAATGTGCAATTTCAATTTTTTTTAAAAAAACACCCT 

GATAAGAAATCAAMTGTGCAGTTTCAAAGTGTGGCAAAGCCATGCAAGAAATGACCGGG 

CACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGGGGCGGGGGGATCACTTG 

AGGTCAGGCATTCGAGACCAGCCTGCCAATGTGGTAAAATCCCATCTCTACTAAAAATAC 

AAAAATTAGCTGGGTGTGATGGTGCATGCCTGTAGTCCNAGATTATTCANGAGGCTGAGG 

CANGAGAATGGCTTGAACCCANGAAGCAAAAGGTTGCAATGAGCCAAGATGGCACCACTT 

GC ACTTC ANCCTGGG CG AC AG ANGG AG ACTCTTGTC AAAAAAAN N N AAN NNNNNGGNNNN 

NNNGTTCCTGCCNGGCCGGCCGTTTTAAACTAGTGGATCCCCNGGCTGGAGGAATTCCAT 

NTTAAGCTTATNGATACCGTCNANCTTCAAGGGGGGGCC 

Sequence 2958 

AGGTACGTTGTTCCTCTCTATGTGTCCATTAAAATCCAGTCAAGTACAAACCA TGAGA TT 

CCATGATTCACAATGCCCCAGAACACCCTCCAAAAAAAGAAMTAAAATACCATTTTTAT 

MTAAATCACTATMGATAMTGCATTGMCACTAAAAGCATCATAAATATTTTATTGGC 

CATGAAATCTTGTTAAATTAGCAACTACCTTCCCGTGTAAAATGAGGGCTGATGTGGAGT 

GACACCACCGTTACCTGAGCCAGGATGATAAGGGGGTCCTCTGTGCTTTACCTCTTTAAG 

GAAAGTMCTCTGAMCAACCAATCACTTTTTGTTCTTTTTCTGTTT^ 

TCTGCCAATAAAGCTAACCCCCCTCTGCTCAGCTTATTGAAGAGC 

Sequence 2959 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTCGCTAGAAACAGTTAAGTAA 

CAATACAAAGCGCCCCTGAGTCTAGTTGCTGGGGAGGCCAAATGTGAACTGGCTCTAGAA 

AGATCTGGGTGGGCTGAGAGACCATGTGGTAATAGCATAAGCGAAGACAGAGTGGACATG 
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AGTCTGGGCTGCTTTGC AGGTC AG ACG NG ACAG AAG AAGGTACCCAAGGGCAGG GTANAG 

GGAMGAMGGACACTTTGGCTTNTGTAGAAAGTCGGTGAGTTTGTCGGGGGAGATTTGG 

GATCAGGGAAAATGAGAATGGATTAMTTTGTAATTTTTATTAGGGCCTGCTTGGGGTAG 

AAAAGGTTNTGGGGACCTTTTTCCTTTTCATGCCCCTGGAGGATAGCCATTGTTNGCCCA 

NGCCCGGGGGATGGGGACATTCCTTTCACTATTCCCTACCCCCCCCAT 

Sequence 2960 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGGCGGCCCGGGGGCCATTGAGACTGCCATG 
GAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTC 
TTGACAAAGATAGACCATTGGAACAATGAGAAGGAGAGAATTCTACTGG 
Sequence 2961 

TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGGCACAATCATAGCTTAGTGCAGCCT 

CATTCTCTTGGGCACAAGCGCTCCTCAGCCTCCCTCCTCAGCCTCCCAGTTGCTGGAACT 

TACAGGTGCACCACCATMCCAGCTAGTTTTATTTTTTGTAGAGACAGMGTCTCACTTT 

TTTTCCTAGACTGGTCTGAAATAGACCATTCTATTTGCTAGAATGGTCTCAAACCATCCT 

TCTGCCTCAGCCTCCCAAATTGCTGGMTTTCAGGCA TAATT CATCATGCCTAGCCATTT 

CTTTTTTCTGCTGCATTGCAGAGTTAGMCCMTTTCATTTTGTCCTGCACTGAGAATGT 

GCATTCCCAGCANGATCATGGTTGAGCTCAGCACACCTGGCAACTCAAGCCAGC 

Sequence 2962 

GATACCAGGCCGTTTCCCCCTGGGAAGCTTCCCTCGTTGCCGCTTCTCCTGGTTCCGACC 

CTTGCCCGCNTTACCCGGGATACCCTGTTCCGGCCTTTTCTCCCCTTTCNG GGGN AAAGC 

GGTGGGCGCCTTTTCTTCATMGCTTCACGGCTGGTTANGGTATTCTTCMGTTTTCGGG 

TGNTAAGGGGTACGTTTTCCGCCTCCAMGGCCTTGGGGGCTTGGTTGGTTGNCACCGGA 

AACCCCCCCCCCGGmCMNCCCCCGGACCCGCTTGCCGNCCCTTTTAATCCCCGGGTT 

AAANCTTATTCCGGNTCTTTGGAAGGTCCCAAAACCCCCCGGGG 

Sequence 2963 

CCGCGGTGGCGGCCGAGGTACAGCTGCATGCCATACTGCAGAATAAATAAATAAAACACA 

TTCCCACCACTTTTACTACTCTGATAATCTATGAAMTCAACACTTTCATCAATTATACA 

ATATAAATATAAAAACAATTAAAGCAMCAAGTAMGCAACCAAMTTTTGAAAGATGAA 

CTmAGTGCCCTGCMCMTACATTTAAAAAGACMTTGMGTCTmACTTGGACCTA 

CAACTTAAGGTCCAGCTCACCAGTACTGGCCCCG 

Sequence 2964 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCACAGAACCTCAAA 

AGTGGGATATGGAGGGAGAGAAGGAGAATGAGAGAAGGCACAGAGGCAGGAAGAACAAGT 

CCGGTTTGGCTGGCAGGAGGCCATCTGTGAGAAGCCATCTGTGAAGCCAAATTCTTGATT 

TCTAAAAAGAAAAGATTCCTTACTCTGGTGGGTAAAGCTGGAGGGCAGAGGGCAGAACCA 

CCAMGGCmGTGCANAGAACAAAACTGAAGCCGTGCTAGAGAAAAACCATACAGTGTT 

GGGCAGGACAGATGACAGGACAAACGGTCTGAAGCTTAGGAAGCTTGACTTCACCAAGCC 

CNGTGCNCACGTTGCAAGCATTTCCTATAGAAACACNGTTCTTTGACCCAACATGCCAA/\ 

AACCATAGNTTTCTGTCTTACAGAAATNAAGGGCTTTTC 

Sequence 2965 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGTATTT 

TTGCTMCTGTMCTTCCCACTTAATCAACAAAAACAAAMCACTCATTTCTGAACATTC 

AGTGCATTCATGATTAATCTTMTTACACCACAAAGGTATTTTTCMTGGTGATnTGCG 

GGAGTGGGGTMCAGTTTCGAAAGCMCATTGTCAGAMCATAGTTGATTTTAAAGGTTC 

TTTCTGGTGACTTTGACTTCTGCTTTTTTAGAAGACCTT^ 

TCCTGGMTATTTCAAGCMTTCAGAGTGAAAGGGTATACATTCCAATTTGCGTATGAGA 
TAAMmAGTTACATTGAGMGCTATmcmAGmACAGGGGAAAAMTTGGTAGG 

GGGCTTTTGGAAGCC 
Sequence 2966 
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TATAGGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGAGGTACATGTGGCCTTTCAGCTAG 

TGCAGAGGATGGAAACCAGAGTGGGCTGATGATTGGATGCCAGGCCTGAGCCAGGTGAGT 

CCTGTTCTCATTATACTGTAAAGTGAGCTGCTCCAAGCCTCCCTGATTCTTAGCACTCCC 

AGTCCCTGCAGGGGTCCCTGAGMGGTTAATCTTGCA7TAGGCCCTCTGCTGTTTTCCCT 

CTGTAAGCCTTACTCCACTTTGCAGTGAGTTCTAGTATAACTTCTATTCCAGTAAGATCA 

AATGGTTAAGGACTCCAAAAMGAMTTTGGTGAACACAAAGGGTTTGTCATACTTTAAC 

TTTTGTCCCTTTCCC 

Sequence 2967 

CCGCCGGGCAGGTACTAGCTGTTTTTGCGACTTTTAATAATAACACCTACTAGTTACTGA 

GTGTCATCATCACCAATGTCATGATTGCCACGATTTATTGAGTGCCTACTACATGCAGGC 

CCCAGGTCAGNGCTTTACACTCTTACTGTCATGTTGACAACAGCTCTGAATGACAGGAGC 

TCTGTTCAGTGTTTTACAAGTGAGAAMCTAAAGACACTAATAATTTTTAAAM 

CACAGTTCTATGATTAGTAAGTAAGGGAGCTAGTCCT 

Sequence 2968 

CGAGGTACTCTCACCCCTAGCTCATTTTCATTCCCTGGCTGACCTACCAGCAGAACCTCC 

GCCC CCCTGCGGCCCGATACCATCATTTTTAATGAAGTTAA I 1 i I 1AI 1 I i 1 ATTTTTAT 

TTTTACTATTTTGATAACCACTTTCTGTGAAATAATAG I I I 1 i I I I i I CAAAACTTGGCT 

CATATCTCCTTTTATGTTAGCAGTCATGGCATAGAAAAGCTGCCCACAGAATGACATACA 

TCAAAAACAGTTTTCTGTTTAGACGGATAGCTTTAAGTGTGTGCAAACAGGTGCCTCTAT 

GTGTGGMGGAAGACATGCTAGATTATTTGGGGTCCACCTGCAGAGGAATGAGCTAATTT 

ACAGAAGAGTGAAGCAGGCTCTGCACTGAAGGCCAGGGTTAAGGCATATAGCTAATGATA 

AAATAAATGAATGTGACTTTCTAAAAATGTCATGGGGGAGAAGAAAATGAGGAGGAAGGG 

AAMCTGGCTTCTATAATCTTGTAATGGTACAAATAGAATTGCCATGTGACTCAAATCTT 

CTCAGAAGGTAMNGGCATANGGCACTTTTTTTAGTGGNGCATCTTTGTCATTAAACCAA 

AAATAATTTGGGANGGAAATGATTTCNCCCCAAAAGTTTAAAATTCCTCGGTCACTGGCC 

TCMTTTTNTTTCCTTAAAATCCCTANGGAATCTTCCTGGGGAATGAAAATNGGGAGCCC 

TGTGCCATTTTTCCTAAAAGTTNGGGCC I MM ( GGGGGGTTT 

Sequence 2969 

CCGCGGTGGCGGCCGAGGTACATGTGGCCTTTCAGCTAGTGCAGAGGATGGAAACCAGAG 

TGGGCTGATGATTGGATGCCAGGCCTGAGCCAGGTGAGTCCTGTTCTCATTATACTGTAA 

AGTGAGCTGCTCCAAGCCTCCCTGA7TCTTAGCACTCCAGTCCCTGCAGGGGTCCCTGAG 

AAGGTTAATCTTGCATTAGGCCCTCTGCTGTTTTCCCTCTGTAAGCCTTACTCCACTTTG 

CAGTGAGTTCTAGTATAACTTCTATTCCAGTAAGATCAAATGGTTAAGGACTCCAGAAAA 

GAAATTTGGTGAACACAAAGGGTTTGTCATP£TTTAACTTTTGTCCCTTTCCCAGGTGGC 

GCATGCTTCTAGGCCAGCCTCCCAATTCCTAGCTTGCTTCATGCTCCTGGCCTTTGCCTA 

CTC AAAACAATCTGTCCTCCCACCCTGTCA T CTCTGTTC AAAAGTAAATACCTCCCTCCC 

TCTGCTTTTGCCCTAAAGTTC 1 I 1 1 I I CAACTCACCTAACATCTAACT 

Sequence 2970 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGAGAGGGGTCTTGCCCTCATCCCCTGC 

CCTGTCTCTGAGACGTGTCCAGTAAACC1GAGGTCAGCACCTCAGGAATGAAATAATGTC 

TTTTGCAGCAACTTGAATGMGCTGGAGGCCATTATTCAGTGAAATAACTCAGGAATTAA 

AAATCAAATG CC ATATGTTCTC ACTTATAAGTTGG AGCTAAGCCATGCGTATGCAAAGG C 

ATGTGGGGTGACATAATGGACTTTGGAGACTCANACATGAGGGTGGCAGGTGGATGAAGG 

ATGAAAAAACTACCTGTTGAGTACCTCGGC 

Sequence 2971 

CCCGCGGNGGTTTTCGAAAACTGCCCGNGANAGGCGGI M M M I I GGAGGTTGAATGCA 
GGGTTTTNTTGAGNGGTNTNNGTGGNTCTCTGAAGGATGGATGCAGAGCTNGAAGCGGGG 
ATGGCACGGGAAGATAATCTTCCTACTGGGTGGCAAACCTCTCCGACCACCCTCGGCTGA 
ACTCCTCTCGGCGTTCAAATGTTCCTCTTCTCTCTTTCTCTGCTGTATCGTTTCACCATT 
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CATCTGCTTGTCTGGGATGGTGAAGACAACTGTTTGCCCATTTGCTTGGCGATAGGTGAT 

TGTCTCGCCGCTCAGCTATAGGCGATGGTCTCACCACTCGGTGATAGGCGATTGTCCCAT 

CTAGGTCGCCAAAATGTGTCTGGAA7TGGTGGGTTCTTCGTCTCACTGACTTCAAGAATG 

AAGCCCGCGGACACTCGCAGTGAGTGTTGCAGTTCTTAAAGGCGGCATGTCTAGAGTTTG 

GTCCTTCTGATGGTCGGAAGTGTTCACAGAATCTTCCTTCTGGNGGGGTCCGGGGGCTCC 

CTGGCTCANGAGGNGAAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnATCAMCCGNCACCTCNAGGGGGGGCCCGGGNCCCACTTTTGGTCCCTTTAN 

TGANGGGTAAATGCNCCCTGGCCGAAACATGGGCATANCTGGTTNCNGNGGGGAAAAATG 

GTATCCGNTCACAAATNCCCAAAAAAATACC AN CCCGGG AG NC ANN AAAAGGGGAAAACC 

CCGG 

Sequence 2972 

CCGCGGTGGCGGCuCGCCCGGGCAGGTACGCGGGTGGCCCGaAGTTAGATAGATGAGGAG 

GGTATTTATCTGAGACAAACACTGGGAGGAMTTTCAACCAGANGGAATATTAGCATGTG 

AGAAGGCATANAATCATAAAAGGCATGTTGTGTTAAGGAGCTACAAGTAGCACAATTTGG 

CTAATGTGTACAGTGAGAAGAGAGTTGTGGGCAGATAGGTTGGAAAAGAAGCCGTGAGTT 

AGATCATGMGAGCCTCGGTAAGGAGTTTGTTTTAAAAACMTTACTGAAAGATTCGGAG 

CAAACACAATATTTATAGGTTGGAAATATTGCCAGAATGGAAAAAACACCCTTACCAAAC 

CGTTCCTCGGC 

Sequence 2973 

CCGGGCAGGTACCTCATCTCTGATGATTAGTGATGTTGAGCGTTTCCCATGCCTGTTGGA 

ATGGCAATCCGTTGGAAATGTCTTCTCAGTTACTACCCATATGTATGTTGCTCAGGCCAC 

TGNNTANTNCNAGCNCTGTCGTTTGNATAGNTACTTGGGGATTGTTTGTCGTTGTTAGCC 

AAGACAAATGTTACACTGCTGTTCCTACAGTGAGTAGTGGGAGTGGGGTATGGGGACATA 

TGCTTATGCCCACAGTTCCATCANAAATCCCACTGCCTTAAGACAGTCTCACGGGAACCC 

ATTCCCACCTTGCAGCACTGGGTAGCCCTGTGGCATTTTGACTTTGGATGTATCTGCATT 

GTTGMGTCACTCAGCATGTTGCCATGCATTTATCCACTTAGGTCAATGTTAAGGGAAAG 

TTCATCGCCTCTGCTTCTTCANTAACTGATGTCTGGTGCATCAACTGAATGCATTTCACG 

CTTTTAAGCAGTTCCCACGGATCTATGGGGTNAAAAAAGGCTTT 

Sequence 2974 

GCGGCCGCCCGGGCAGGTACCCCCCTATTAGGGAAAAGGGCTCCATCCTTTTCCCTAAAA 

CTTACTCCTTTCTGCCTCTGGAAAAAGAAGGTTCAGCCCATTCTTCACTCCAGTCCACTG 

CTTGTGTG ATC ACTACAG ACC ACTGCTTG NG N GATTTTCCCATTTATAAATGATACTTTC 

ATNTCCTTTCTMCTNTTACTTTTGCTCACTTTTTAAAGCCAAGATGTAGCCATCTAGCC 

CCTCCATGATGTCTTCCCCAGTGCTCCCATCTCTGCCTGTCTCTGCTCTTANATATCTTT 

CCATACAGTGCTGATTGAATGTGTATGCTCTAAAACCAMCTTTCTGGGTTTAAATGCTG 

GGGCCTTCACTAACTAACTTATTTACCC 

Sequence 2975 

GGGCGAATTGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACTTGGGGCGGCAGGTGGAG 

TTATAATTATAGCTTAGAAAAAAGCAGCATCAGCAGCTGTTGCTTCTGTTTGTTCCCAGC 

TATMCTGTCATCAGTGGGTAAACTGGCATAAACAGGCCTCTACAAGTGGAAAAACATTT 

AACCCCTGTTCTTGTCACACTCTGGGTCCTCTCAGTGGCCAGAGAACAGAAAGAAAATCT 

GACTGTGCACGAGCGAAGGGTGACACCGGGCGTGCAGCACCGACCTGAAGGCACCTCTTC 

GGGGGAGCCAGAAATTTGAAGATTCTCTTTCCTGTCGGAGACTACAAACCAAATAAATAG 

GGTTTTATGTTACTCTGGTAGTGAACCGATGGTGGMCCAGTTTGTGAATTTGCCCTGCC , 

AGA 

Sequence 2976 

TTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGG 

TCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGA 

CCACTGGAACAATGANAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTG 
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CAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGT 

CTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGG 

AGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAA 

CCCATGGTCCACTGAAGTTCCTTATGCTACT7TCACTGAGCATCCTATGAAATACACCAG 

TG AG AAATTC CTTG 

Sequence 2977 

CGAGGTACTAACAGTCAGGAACAAATAGAACACATAGAGAGGACAGTGGGAGGTCCCCCT 

GGCTAAAGCAGAGGGTGTATTGAGAGAAGGAAACCTTTGCCTCAATGGATCGTAGGCAAA 

CTGAGAAGGACCATMCACCTTGCTGAGAAGTTTGAATGTGATTCCTTGGTGACCACAGC 

AGGAGGTCCTGAAGTCCCTGTCCTTCAGATGCAAACAAGGAGGCTGTCCCATACAGACTA 

TCAGGGAAAAMTGCACTGTTTATCAAAAGATATGAATGTTCTTGACAGGCAAAGCCCTG 

CTCAAATGAAATGGCAGTGCAGGGAGATGATATTCAGTGTGATGCAGCTGTGAGTACCTG 

CCCG 

Sequence 2978 

AGGTACCCGCTTGTGTGGTTTCCATGATCCTTCTTCTGCCTGGAATGAGGCTCCTCTTGA 

GTCCTGAGAGCTTCATTTCTGTCTGTCTTTTGGGCTGTGCCAAAGCCCCTCTGCCTTCGT 

GGACTCCCCACTGATTCATTACCAATTTTCCTCTCCTCTGGGCTCTTCTTAGATTGCAAA 

TGGTGCCTCTGGATTGGGTACCTGCCCG 

Sequence 2979 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGGTTTGT 

CACAGAGAAAAATTATGATGCTTTAAAAACACACACAGAGAAATMTAGGTTTTATGCAA 

ATGTCAAGACATTATAAATAAAGAAAATAATGTTATTTGTTATGCAGGCTAAACACTGAC 

CAACTCACTCAATATATTCTGGAAGTGGCAAAATAAAATTGGAAAAGTAGAACAAGTAAT 

TTAACATACTACACTTATCTTCATATAAATTACTGCAATGAGGCGCCATCACCATCACCA 

TCAAAAAAAGAAGMTTCCTACAATCAATATGTGGGATTTTAGCTAAAAAAGAAAAGTCT 

ATTTAGAAGTCAGTCACAGCCACATGGTTACCTCAAGCCTTACCTCATCACTGCTCCACC 

CGGTTT 

Sequence 2980 

NGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTGCTTCTGGTTTCAG 

CACACATGGAGATATAGAATTGAAATTGCTCTGCTTTCTTTACAAATGACTCAGAGTGTG 

CTTGTCCTTCCCTAAMTGTTGCCAGCATGTAAAGGATTTTCAATGAGCAGTGCAATAAG 

CCAGCAACTGGGMGGCTTGGAGCGGCACTGATATAACGTGGTGCTGTCACACAGATAAT 

CAGGGCTGGAGCCAAGTCTCGGTAGCGCTGTGCATTATCCTGGTCACATACTGGAAGTTT 

TGTTTATCCAACTTGGTTGCTCTTGAAGCTGGTGAGAGCCCTACCAGGATGTGCiAGTGGA 

CTTCCTAGCAGAGGCAGTACCTCGGC 

Sequence 2981 

TTCCTAGTCACATCTGTTGTTGCAGCATGTTATCAGTAGAGCCTCCTTTi"ACTC 1 TAAAA 
ATGCCCTGGTTTGGATGATTGATCATACCTGCCCCCATCCCCACCCCACCACCCAAAAAG 
TGCAGCCACCCTGTGGTGTGCGGTGTACCT 
Sequence 2982 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGAGACTCC 

TATTTATTTATACCAGTAAGGGCCCTCCCACTGCCTTTGTCTCCAGTAGGTCAGCTGCAT 

CATCAGCTTTAAMGCCCCCAAATATTGCTGAGCTTGCTTTTTrAAAACTTCTC^ 

ACTTTATAAATTTGTTTCAGAGCACTATAATAAAATATTTCTTGCAAACATGGACTATAA 

GTGCCAGAGACACTGGAGAAATCTGAGGGTGGGGAGAAAGGATAATCTATCAGCTTGNAA 

GCTCCAAA 

Sequence 2983 

AGGTACAGGATGCAAACACTTCTAAAATTGTCTTCTGGGATTTCTGTAGATAAAAGCTAG 
TAAAAGACCTGAGAAGAGATTTTTATTGAGCCTATAGCAATTTGGTTTCATACCCTGAGT 
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AGTMCAGATATTACATTCAAAGATGCTTAATTACCTTCACAAAGTATGAAATTCCAAGC 
CTACAAATTGAATCAGCAATTCAAACTTGAATTTGGAACTTTTCATTATTTATGTGGA^ 
TTCTATTAGCAGAGAACCAGCTATACTTCTTATTTCCATAGGAGAGAAAAGAGCACAAAG 
CAGAACAAGAGGATTCCCTACAGGTGTGGAMGAAATATTTCCGCTTGGATATTCCTACA 

GTTACAGGAAGAACAACCTTAGGGAAG 
Sequence 2984 

GGCAGGCCAGTCTMCTATATAGCTTTCCCTTTCCAGCCACTTCCCTGATTCTTATGTCA 
TTGTCATACCTGGGGTTTGTAGTGCCTGCAATGAAAACTCCCCACTTGGAGTAGAGGCAA 
AGCG AAAATACACTAAGCTTAAATACAGGTTAGTTTTC AATG TCTGTG CCTCATGC CTTC 
TTTCTCATTTTTGCTACCCCTTTTCTCCCTTGGCATAGTGACATCGATAGTTCTTTCTAA 

TGACACTMTTGCTTCCTTTACMCCCTTAC^^ 

GAGGCAGAGATTTGAGTACCT 

Sequence 2985 

TCCGCCCCCCTGACCGAGNCATCACAAAAAATCGACGC TCAAGT CANTAGGGTGGCCGAA 

MCCCGMCANGGACCTATTAAMGNATACCCATGGNCGTTTTTCCCCCCCTGGGGAAAG 

CCTTCCNCTTCGNTGNCGCCTCCTCCTTGTTTCCCGACCCCTTGCCGGCTTTACCCGGGA 

TMCCCTTGGTCNCGNCCCTTTTCTNCCCTTTNGNGGAAANTCGTTGGGCCNCCi inn 

TCATTAGGCTTCAACGGCTTGN7TMCGGTMTTCTTCAAANTTTCNGGNAGTTAATAGG 

TTNCGGTTTCCNCNTTCCCAAANNCTTNGGGCCTTGNTGGTGGCCAACCANAAACCCCCC 

CCCCC 
Sequence 2986 

GGGCGAATTTTGCTCCANCGCNCTGGCGGCCGCCCGGGCAGGTACCTCCCTCATCCTNAT 

GGAMGAGGNTTATGAGAGAAGGGAGGGAGGCCTTTGGCAGCCAAGGGAAGACAAGTTTC 

AGGAGTTCATGAAGCACCATCCCCCACCCCCAGTGAAGACCCTACCAGMGGTCTTATGT 

TCTGAGTTATATTGAGTGACTGCCTCATTACCACTCCCTACTGTTATGTGGCAAGTGTCA 

GGGCTGTGGCAAGGGTCAGGCTATCATCATTGGCCCCTTCAATAGACTGACCCTAGCATT 

AGGGCAACCATAAGTCCCTGTGTGCCCAAGACAATCCTAGTTTACATCTGAGGCCCCAGT 

GTCACTATTMTAGTGACCTCCmCACTCTCAMGTTGTTCCAGTTTGGGCAATGAATT 

ATATGATCAAAGGCCCCAT 
Sequence 2987 

CCGGGCAGGTACGCGGGGAGGCCATCTCGCTATAGGAAAGGAAAGTGGAACAGCATTCAT 

CCTCAACATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGACTGATCAGTGCAGGTG 

TAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCACAAGTGAAGTAGAG 

ATACAGGAGATAGGGAAGGGAAGCTGGAAGCAGAGGTCACTGGAGGGAGAGGGAGATGGA 

CACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCTCTCAATTGTGGGACCC 

CTCAAAATGTGTACCTCGGCCGCTCTAGAACTAGT 

Sequence 2988 

AGGTACCCAATCCAGAGGCACNATTTGCAATCTANNAAGAGCCCAGAGGAGAGGAAAATT 

GGTAATGAATCAGTGGGGAGTCCACGAAGGCAGAGGGGCTTTGGCACAGCCCAAAAGACA 

GACAGAAATGAGGCTCTCAGGACTCAAGAGGAGCCTCATTCCAGGCAGAAGAAGGATCAT 

GGAAACCACACAAGCGGGTACCTGCCCG 
Sequence 2989 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTCTGCACTGAGAGAAGCACTCAAAACCACCA 

TCACCMTAGCGCTACTAAAGACAGTGCTTTGGAGCTTCCMGCTGATAGATTATCCTTT 

CTCCCCACCCTCAGATTTCTCCAGTGTCTCTGGCACTTATAGTCCATGTTTGCAAGAAAT 

AJTXJATTATAGTGCTCTGAAACAAATTTATAAAGTTTGCMGAGAAGTT^ 

MGCTCAGCAATATTTGGGGGCTTTTAAAGCTGATGATGCAGCTGACCTACTGGAGACAA 

AGGCAGTGGGAGGGCCCTTACTGGTATAAATAAATAGGAGTCTCTGGTACCTGCCCG 

Sequence 2990 
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AGGTACGCGGGCTAAATACTGTGCTCTGACATTGATGCTAATATTCCATATTATCAGGGC 

TTCTGTGGTGTTTAGGCCTCTAATTTTTCTCTTCAGTAAGGGGAATAACTTTGCTGTGTC 

AGCGCAGTGCCTCAGGAGGGTTGCAATCTAGAAACAGTGATTGTTAAAATTGCCTATAAA 

CACACAATATAAACAATATGCCACAGACACAGAGGCCCCCAGCATAGTTTCTGGAAGGTG 

AAGTCTCACCTCTGCCAGAGTAGAGTCACAGTGGGAATTGAGGCTTGGCATCTGGGTGCC 

TCTGAAGCTCTTCCTGTGGCTCCTGTTAGACAATTGACCCCGAAACACAGAAGATAGAGA 

TTGCACATTGCGTTCAGTAATTGTTGTTTATTTATGTTATTTCTGA 

Sequence 2991 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGAGACAAGGAGAAGGCCTTAGGATCTACTGG 

GGGAGTCCCGGAGGAAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTTGAAGTT 

CCTT ATGCTACCTTTCACTTGAAGCATCCTATGGAAAATACACCAGTTGAGAAAAATTCC 

TTTTGAAAATTTGCCAAGTTGTTCCTTGGGGGTTTCAATGTTCTTAA^ 

TNCCAGNCNTATTCCCAGNAAATTGGCCCCACCAAAAGGAAATTTCCAAAACCTTGGGGA 

ATTCCTTGGGGAAAGNAAGNGGAAAAAGGAAAAAACCTTGGAATTGGGGTNGGTTTTAAA 

ACCTTGGAAAACCCCCCAA I M I I I I I GGAA7TTTGGGANGGAAACCCCTNAACCAACCA 

AGGGGGGNCTTGGAMTNGTTCCAATTTTNCCAATTTTGGGG 

Sequence 2992 

TTTGCATGCACGGACTTCTACATACATAATACTTAAACAGTTCATTGTATTGAAACACGC 

TTAAAAATGGCCCTAGCACTTGCTGCAATCCTGCTGTGTAGGAGTTAGCCAGGTTGTGTC 

TGCTCCCTCTAGACTATTCTATACATCAAGAACTAAAAAAGAAGAGCAATCCCAGCATGT 

GGCTGGATGCACATCAAAGCCMGTCTGTCATGAATTATTTTCCTTCCTGTTCCTTTCAT 

TCAGGTCATGAAGCACACATACTGCTACAGGAAAGGGCAAAAGGCAATGTCAAGTGCTGT 

GTAATTG NTGGGTTCTTTAGTGGCAAAGGCCAG 

Sequence 2993 

AGGTAC I I I i n i n I i n 1 1 n n I I GACTGGAGAGAGAGAGATGTTTTATCTTTGCCA 

AAGCATGTGAGAGTMGTAGCAMCATCATGACACTTCACCCCTCAAATATTTTCCCACG 

TATCACTTAAGAACAAGGGCATTCATTCTGTTACATGACCACAATACAATCAGCACAGAT 

AGATATTAACATTGAGACAATGCTCTGATGTATTGGTATCTACTATGCAGTTCATAATCA 

MTAAGGGTTTTGGTAACTTGGGATCCATAAATGGTCTGCAAGGTCAGCCCCTGTANACC 

TGCACTTAGTGGGTGCCATTTCCCTGTCTCAGGTGCCTGCTCTACTGGTGATATTTTTCA 

CTGGCTCTGTTGCTCACTTAAAA 

Sequence 2994 

AGGTACCCACAAGCATACGTAGAGCTGGCCTACCTCATTAATTACTATATCTGCAGTGCC 

TCAAATATACAAATTTTAAAMGAAAAAAAGTCATATGCTACTGATGGGGTATTGTATTA 

TCTGAATTAAATGGTAAGGAATTATTTACCATGCAGAAATCGTATAAATCI Mill IGGT 

AGTAGGAAACGCTAMTGCTCATTTTCTTAGACTATGAGGCATGTCAMGGTTGAAAAAA 

GAAAGAAAGCATAGCTTACCTCCTTTTATMGAATAAAAGAACTA^ 

TCMGTAAGCCTCTTTCAGGAMTCAMGGATCMCACAGATTTAATTTCACCTTCTGAA 

ACATCTACATCTGTGGTCTTTTACAATGAGAATGTGCCCAA 

Sequence 2995 

CCGCGGTGGCGGCCGAGGTACAAGGCCACTGATTATCTAGACCTGGCTTTCTTCTTCAGT 
CTCATCATCCATCCCCACCCCACATTCTCTCTCCCCATCAGGTTTGTGCAAATGTTGTTT 
GGGATACTCTTCTCTCTTTGGAATCCCTCCTGTTTGGAATACTCTTCTCTTCTTGCCTTT 
GGCATCTCCTCACCACGCACGTTCTGCTAATTCCTAAACTGCTTATAGATTGTGGCTTAG 

ctttgac rrm icaggatgcctctgctaacccctaaattcaaggtaggtcccgcgtacc 

TGCCCG 
Sequence 2996 

CGAGCGGCCGCCCGGGCAGGTACGCGGGGAGTGCTTCTGGTTTCAGCACACATGGAGATA 
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TAGMTTGAAATTGCTCTGCTTTCTTTACAAATGACTCAGAGTGTGCTTGTCCTTCCCTA 

AAATGTTGCCAGCATGTAAAGGATTTTCAATGAGCAGTGCAATAAGCCAGCAACTGGGAA 

GGCTTGGAGCGGCACTGATATAACGTGGTACTGTCACACAGAT AATCA GGGCTGGAGCCA 

AGTCTCGGTAGCCGCTGTGCATTATCCTGGTCACATACTGGMGTTTTGTTTATCCAACT 

TGGTTGCTCTTGAAGCTGGTGAGAGCCCTGCCAGGGATGTTGGGAGTGGACTTCCTAGCA 

GAGGCAGTACCTCGGCCCGCTCTAGAACTAGTGGGATC 

Sequence 2997 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCCTATGCCAGCGTCAGAGCTG 

CTTTTGTGCATGCCCTTGGAGATCTATTTCAGAGTATCAGTGTGCTAATTAG TGCG CTTA 

TTATCTACTTTAAGCCAGAGTATAAAATAGCCGACCCMTCTGCACATTCATCTTTTCCA 

TCCTGGTCTTGGCCAGCACCATCACTATC7TAAAGGACTTCTCCATCTTACTCATGGAAG 

GTGTGCCAAAGAGCCTGAATTACAGTGGTGTGAMGAGCTTATTTTAGCAGTCGACGGGG 

TGCTGTCTGTGCACAGCCTGCACATCTGGTCTCTAACAATGAATCAAGTAATTCTCTCAG 

CTCATGTTGCTACAGCAGCCAGCTGGGACAGCCAAGTGGTTTCGGAGAGAAATT 

Sequence 2998 

CGTGCCAGCTGCATTAATTGAATCGGCCAACGCCGCGGGGAGAGGCCGGTTTTGCGTTAT 

TTGGGCGCTCTTTCNCGCTNTCCTCGCCTCACTGACTTCGCCTGCGNCTTCGGGTCCGTT 

TCCGGCCTGCCNNGCCGAAGACCGGIMTATTCAAGNCTCACCTTCAAAAGGGCGGGTTANA 

TTACCGGNTTATTCCACCAAGNAAATCAAGNGNGGGAATTAAACCNCCMNGGAAAAAGN 

AAACAATTGTTGNAAGNCAAAAAAAAGGGGCCCAAGCCAAAAAAAGG GGCCCCAGN GGAA 

ACCCCGTTAAAAAAAAAAGGGCCCCGGCGGTTTTGGCTTTGGGCCGGi iiuini 

Sequence 2999 

CCGGGCAGGTACCTTGGCTACATGAAGACCACCAGCAGCAGCTGGGACAACCAGCACCCT 

GTGGAACTGCACAGCATGCATAGAATACGTCCTCCCTTCAGTTGGCTTGGGTCAGCTTAG 

GTCATGGGCCACTTGGACTGATAGCAGTTTCCACAGAAATGCCTCAAGATGGTAGAATAA 

TCCAGATCTCTTTGCATGGGGCATGGTGTCCCCGCGTACCT 

Sequence 3000 

CCGGGCAGGTACCATAAATGTTGTTTCTTMGAGAATGTCAAGTTTCAGTAAATTTACTN 

TGMCTTCANAGTAGTGTGCAATMCTTTGATAAATMGCCGAT CTTATG AM TTTGA AA 

GTTACCGTTTTATAATTTATATTTATMNAATTAMGTGNTTGGAATTTTTCT 

MCCTTACCCCTGNATGTTCACCATTATTTTGGATTTTTAAm 

TNTTATCTTTTNNCAAAAAAGTCAACGCCTTTCAC NTTTC 

NAGGCCTTTAATTTTTCCTTTGGCGAAATCCCTMT^ 

TTCCTTGGCCTTGNTTTTTATTTCCCTAAAAANANTGGCCNANNMTTAAAAAATTGGGN 
CCTTTTTTCCAAGGCCAACCAACCTTN GTTTTTTTAATTAAN G NNTANTTTAAATTAAGG 
CCCT 

Sequence 3001 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGAATG 

CTGTGTGGAGTCCTTTTTCTACTTTTGTTCCTGAGAAAGAGAGATTCACAAACTTCCCAC 

ATTCTTCCAGTGGCATCTAAGGATCCCTTCCTAAATGGCCTTTCTCCCAGCCAATAGACT 

GAAGCAAACCATGTCCCTGATGCTTCCTGCCCACCACCATCCATTCTTTCAACCAACAGC 

TTCTCCGGGCTTTCACGTG CC ATGC ACTTTGCTAAG AGCTTGAGG G AAATAC AGC AATGA 

AGMGAGATGATTTCTGTTTGGACCAGTGCTCAGGGATTCCTCCTATTAGAAATCAAAGA 

GGCTTCCAAAAGCCCTACAATTTTTTCCCTGTMCTAMGAAMTAGCTTCTCAATGTAA 

CTAAA 

Sequence 3002 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACl 1 1 ■ I I i n u \ 
nTTTTTTTTTTTTI I 'ATGCGACAGAGTTTGTTTTATACT GCCTC ATAATTACCTCTTTT 
CCTTTTTCTTTCTTTGCTACCTTGGTCAAGTTTCCCTTTATT^ 
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TTTTTCTATTAATTCANAAGTTTTAATTTGCTTTTCCAGTTTCCTAG 
ATCCTGAACAATCTCTGTATTTTTCTTTTATTATATCAGAAGTCTCTCTGTAGTTCAGGT 
GAATTTATAGACAGTCATCCTTACACTACCTATATAAAAT 
Sequence 3003 

CCGCGGTGGCGGCCGAGGTACTGGGGGCTCCTAAACCATATATATCTTGCACCATTCTGG 
GG ATATTTG C C AG GC CTG GGGG ATC AAAATATACCCC AAATATTAAACC AGTTG AAGG GT 
QATT TGAGCCTACTTTTTGMCACTTTGTATCCCCTGTGTGCCMGGAAATTCAGGGTCT 
TTTTCGTAACTTGATCAGATGTCTCCCAGGTTGGGCTACACAC 
Sequence 3004 

CTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACAGCTGCATCACACTG 

AATAT CATC TNCCTGCACTGCCATTTCATTTGAGCAGGGCTTTGCCTGTCAAGAACATTC 

ATATCTTTTGATAAACAGTGCATTTTTTCCCTGATAGTCTGTATGGGACAGCCTCCTTGT 

TTGCATCTGAAGGACAGGGACTTCAGGACCTCCTGCTGTGGTCACCAAGGAATCACATTC 

AAACTTCTCAGCAAGGTGTTATGGTCCTTCTCANTTTGCCTACNATCCATTGAGGCAAAG 

GTTTCCTTCTCTCAATACACCCTCTGCTTTAGCCAGGGGGACCTC 

Sequence 3005 

CCGCGG TGGCGGCCGAGGTACCATTGACTACTTCATACTGCTACAAAATGCATTTGAAAT 
mm l CTG GCC AAGG AACCTATTC ATCCACTCTCTCC AAATTAAATACTTACTAG AG A 
AATGTTG GC ACTTATTAAC CTTGTAATATG AATTAG AAAC AAACTTG AC CTTC ATATATT 
CTTTTACACTTATGAATCTATTTAACTGCATGATTTATCTGCATGATTCAGMTTTTCTT 
GATACCATGTAACTGMGTAAGATTTAGAAATATATTTTCTTGATACCCTGTAACTGAAG 
TAAGATTTANAAAGCTGAGGTGACAGTTGTCACCCATAATGCTGATAAATATAGGAATGA 
GTCGATTATAAATTTAAACCTTTTGTGGCATTA 
Sequence 3006 

GCCGTGGCGGCCGGGGGCCA7TGAGACTGCCATGGAAGACTTGAAAGGTCACNGTAGCTG 

AGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACA 

ATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACT 

TCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCT 

GCCAGGGCAAGTTCACCTTCCCTGGGATNTCCCTTGGACCANGAANACAGGGANGAAGGG 

CCTTrrANGGATTTNNNTTTGGGGGGNGGTCCCCGGAANGGAAACAG l HIHHUIll 

GTCCCCC NGNT GGGAAAACCCAAATGGGGNNCCCCNTTTGAAAAMTTTCCCTTTNAAAG 

GCNTCCNTTTTGGCNACGNGGGCCCNTTCCCCTTTNNTAAAAAAATANCNCCCCCCGNGG 

G 

Sequence 3007 

CCG CGGTG GCGGCCGAGGTAC I II I i I I I 1 I 1 I I 1 1 I I I I I I 1 1 1 1 1 1 I \ GCAAATTTAC 

ACAGTTTTCTTAGGAATGCTCTTTC I 11 11 1 1 1 F CTCTTTGG ACTCTAC ATTCCTGTAGC 

TCCTATTCTTCAAGGGCACAGGGGCTTACTTAMCATAAAATTTTACAACTTAACTGGAA 

GGGTGCCCTGCTTGCAAATGGAAAACTGCCACAGAGCANACTTAACTGCAGCAAGCAGGT 

GACANAGCAAGGMCAGGATCCAAAATTATGATTTTTTGNGNGCTACTGAAGTAAAAGAA 

TTTMGAAGGTTTTACTCAGGGTCATAGGCTGGGTTGGAANACTGGGAAAAGGGAAATCA 

GTAGTATCTATATCT 

Sequence 3008 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAA 

GACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTG 

ACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACT 

CTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCT 

CTGAGCGCTGTCTATCGCATCTGCCTGGGCANGTTCACCTTCCCTGGGGATGTCCCTTGG 

ACAAAGAAAACAAAGGGANNAANGGCCCTNAGGGATNTACTNGGGGGGAAGTTCCGGGNG 

AMCCAAI I II I II I I I 
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Sequence 3009 

CGACCCCGCAGGCCGGAGCACCTACAGCATCGGCCAGCAACGCAATTTCGTCGTGACCAA 
GGGCGCACAGGTACCGATCCCGGTATTCGGCTCGGTGGTGGCGAATACCAGGGCCTTACC 
GGCCGGCATCTACACAGACACCCTGACGGTGACTCTGGATTGGTAAAACATGAGGATGGA 
CATCATGCATGCACTTGTATCGAAAGCAG T 1 I I I CCATTGGTGGCGCTGGTGTGGGTATC 
GGGCATCCAGGCAGCCACCACCGTGACCGGGCAGATCAGCTCCACGCTGATCCTGACCAA 
CAGCTGCCTGGTCAACGGCGTGGGCGGTACCACCGGTTTGAACTTCGGTACGATCAATTT 
CGGCACCACCAACAGCCAGTTCACCACCCAGTCCGGCCAAATGCTGGGCGGCGG 
Sequence 3010 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTAAACTCAGA 

TGGCTTATGACTAGAAMGGTCACACCTTTTAAAGACGCAGCTGCGTTCCCTCAGCTGTC 

AGGCTTGCTATTCGCAAACTCTGTTACAAGACACCATCTTAGAAGAATGTTCTGAAAAAA 

GCMGATGCAATCAAACAAATATGTCGCCATAAATAGGAAGAAAAGTCTTAGTTGAAATC 

TGATTTCTTGGGAMCATTTTAAAATGTATTTCTTTCAAAAATAAAA^ 

AGAAAAGMCTACCTTCGAAATAMTATACTCGGAAATTGCAGACTTGATTTAGATTTCT 

TCCTAGGAAAGAAAAACGTTTAAACCAGAATCATAGACACCTATAGCTGGAAGTGCCC 

Sequence 3011 

GGGCGAATTGGAGCTCCCNTTTTGTGGCGGCCGCCCGGGCAGGTACCAGTTTTATCACNT 

AMTGCCGCAGGAATMNTNTGATCATTTACTTAGATTACTGGGCTTTTTGTTGGCTTCT 

TTGTTCATTTTTAATCTGCAAATATAAAAGATTGTGTTCATAATTCTTTAGGTTGM 

GAAATGATGGAAGAAGACTTCAACTTCCAGGAGAGATTTATTAGCTAGTGGATACTCAGT 

GCTCCCACCAAMCCACTAGAAAAAAATGTATTTAAAACAAATAAATCAACTAGTTCAAG 

GAAATAGAGAGCTGAAAAGGCAAAGAGACCTTGCGAAGTAAAGACTCCAGATAAGGCAAC 

ACTTAGTGAGGTGAACTGACTTTTAAAGACTTTATCCTTGGATGCATTTCATAATTCTGG 

GCATGTTT 

Sequence 3012 

CCGGGCAGGTACAACCACCCCCCGGGTATCCATAACCTAACAATAACAATTGTTAAGCAT 

TCGGCCACACATGTTTGTTCAGCTCCCCCACCTCCTCCCTATCCTCCCAGGTTCCCACTG 

AATTATATATCATTTATTCATAMTAATTCTGTATATTTAGTTACAGCTCAATGMTTTC 

CACAAAGTGAACATGCTCATGCAAACCACACCTGCACTGGGAAATAGAAATTACAAACAC 

CCGGAGCCCCCTCTGCAATACTCTCTCAGTCCTTTCCTCACTCCTTCC AAAAGAGT AGCT 

ACTATCACCTATTGTGAACCTTGCATATATGGATTCATAGAACATGTGTATTTTTTTG 

Sequence 3013 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTAC I I I I 1 1 M I I M II I M I 1 1 I I I I IGGGAG 

TNTTTAAGATTATCTCTATGTTGGCTGGCTGCGGTGGCTCATGCCTGTAATCCCAGCACT 

TTGGGAGGCCAAAGCGGGAGGCCAACACTGGGGCTACCCAAGAGTTGACCAAAGGTGATA 

TTAATACATATGCTTACTTATTACCTTGTTCGCACAGTTCTANTGTAAGTCTTGAAAATC 

MGTAAATTTGGAAACATANTTGCACAAACTCTCTCATGAGTCACACAGTTAATCCAGAA 

TATTATTTNTAAGTTATCATAAAGCCAACAAGCAAATATGATGTACCT 

Sequence 3014 

GTGGCGGCCGAGGTACGCGGGTCCTCCAGCCCCTCATGGGAGCCGGACTCCCTGAGACTC 

AANAAAATGCACCCGTCCTGCGTTCCTCCTGCCTGGGGTGACCCATAAGGAACACCCCCC 

CCTTCTCAACAACCACAGAANATAAATGCCAGGGGACCACCAACATGGGCATGGGGCTGT 

GATGCCAATCTTAAGGGAAAACACAAGTAAAATTGCTCCAGNACTGAGCTGGGGGCAGCT 

TGCTCTTTCTTGTCCATGACTTTGAGGCGGGATCTCGAGCGTCTGTCTGACGGGCCTTCT 

GTCTGCTGTCCTCTCCAGGTTTCAGACATTGAATTCAGGATTGCAGCCCTGANGGCCCGC 

AGGGCTCACNGTGAAGCCCTCGGGAAAAGCCCCGGA 

Sequence 3015 

CGCGTATACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAT 
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CTCTGTTTCTGAGGAAGCAGCAGAGGAGCCTATGTGATCCAGAGAACTTGTTCTTGGCTT 

CTGGAAGCAGAGATGCAGGAATGGACCAATAAGGCATTAAACAAATTCATTTAGAATCTC 

MGAAAGAMTATCTAACAGTATAATCATAATAACAAMTTTTGATTTCTTCAGAAGCAC 

TCCAATAGCTTCACCCGCATTAGCTAATCTTGTGCAGTGAGGAGGTGCAATTCCCTCTAG 

TCTCCCGCGTACCTGCCCG 

Sequence 3016 

CCGGGCAGGTACACAGATCCCAGTTATCAACATAGCCAGATGGCTGGTGGTAGAAGCAAC 

CCTGTCCCTGTAGCACCTCTCTGTGTCCTGCTCAAMGCATTAAAACATCATTCTTAGAT 

MGMC AGAG AC AAAGG ATATGTCTTTMC ACTTTACAGTCTATAMGTTTTTC CACAG C 

CACTGTCTCACTGMTCAACCCCTAAGAGACAGAMTGAAAATCGCTGTTTTTACAAACT 

GAGAAATTACGGGCCTTTGAAGTGTTTCACAGCCTTCACAGCCCGCTGGTAAGAAGCAGA 

GATCAGGCCTTGGACCTGGGTCTTGTGATTCCAAAGCCCATGATCCTTCGGATTCATCAA 

GGCTCCACTCCAAGTGCCTACTGGGGACTGCCT 

Sequence 3017 

AGCTCCACCGCGGTGGCGGCCGANGTACGCGGGGAGAACACCAGGTGCTGTTCTCAACAA 

CGTCCGGCGCTTGACCAGTGCATTGAGAAGGTTGGTCCCCAATAACATCTATCTCAAGAC 

CACTTTCCACAGCCTTTTAATCTCCCACTCCCAGTCCTCTCTCTTTGTTCTCTTTCCACT 

CTGACGAATTAAATTTAGCCCTTATTCTTTTTCTTGCTTTAGAAAAGTTCCTCGM 

AGACGTGTANAATGTTAGTGTCCTCANCNTATNTCGCCCAAATCCCTCCTTTTATAGGTG 

CAGAGACTGAGGCCCACAGAGGTTAAATGCCAGGCTCAAGGTGGTACCTTGCCCCNGGGT 
TTTT 

Sequence 3018 

AGAAAGTGGCGCAGCAATGGAAAGTGTCGCGCGAACAGCAGGACGCGTTCTCCGTTGAAT 

CGCACCGCCGCGCCATCGCCGCGCAGCAAGCCGGTTTCTTCAAGGCCGAAACGACGCCCG 

TCGAAATCATCACGCGTACGCCGAACCTGGCCACAGGCCAGGTGGACGTATCGCGCCGCA 

CCGTCGACACCGACGAAGGCGCGCGCGCCGATTCGACGCTGGAGTCGCTGGGCAAACTGA 

AACCCGTGTTTGCCGCCAAGGGCACCGTCACGGCCGCCAACAGCTCGCAGATGTCCCGAC 

GGCGCCCGGTGCGCTGCTGATCGTCAGCCGAAAAAATCCTGCGCGAACATAACCTGACTC 

CGCTGGCGAAAATTCTTCTCGTTCGCCGTGC 

Sequence 3019 

AGGTAC GCGG GTGACTGAAAAGGGCACAGGAGMCTTTCTGAAATAATAAAAATGTCCTA 

TATCTTGTTTTGCATCGTAGTTACACAGGTGTAGACAGTTGCTAAAACGCTAAAAACTAA 

A TGTGTA AGATTTATGCATTTTAGTGTATTTAAATTATAACCCCCAGAAAGCATTGCCTC 

ATTTTTTCTGGAGGGTAAATATGCTCGTTAGGAGCATMGGGGATGATT7TGAGAAACCC 

TTGCCCTGTCATTCTGTCAGTAAACCGCCACTAACTTTTCTGGTTTCAAAAAAGCAAGGT 

CCATTGATGAAACTCCTTTGTGCCTTAAGTGACAAATAACATTGATAAGGCCTACTGTCA 

ACAAGGAGTGTGGTTTGTTATAT 

Sequence 3020 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCTCTGCTTAAAACTCTTCAGAGGTT 

CACCATCGCTTCCAGAAAAAGATCAATACTCCTGACGTGGCAACCTGTCGAGACTCAGGA 

CCGCTAGGAAATCAGCTCTTGTTTACCCCTCTATCCATTACACTACGATCCAGCTGGGTT 

TAGGCATCACTTCCTCTGAGCAATCCTCCCTGATGTCATGAGGCAGTGCTGAGTGCCTTC 

TCTGGACCCTCTGTTTACTTGCCTGTTAAATCCTGAACATC 

Sequence 3021 

AGGTAC AAG G C ATTTTTGCTMCTGTAACTTCCCACTTAATCAAC AAAAACAAAAACACT 
CATTTCTGAACATTCAGTGCATTCATGATTAATCTTMTTACACCACAMGGTATTTTTC 
MTGGTGATTTTGCGGGAGTGGGGTAACAGTTTCGAAAGCAACATTGTCAGAAACATAGT 
TGATTTTAAAGGTTCTTTCTGGTGACTTTGACTTCTGC I I I I I I AGAAGACCTTACACAG 
AGTTGTATTTATTTCTCCTGGAATATTTCAAGCMTTCAGAGTGAAAGGGGTATACATTC 
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CMTTTGCGTATGAGATAAAATTTTAGTTACATTC 
GGAAAAAATTGGTAGGGGCTTTTGGAAAG 
Sequence 3022 

ACTACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCTCTGATCCTAGGATTTTTTTTTT 

CCCTGCTTTGGCTGCCTCTGTTTTGGTTCATGTGTCAAGCAGAGACGGGGAAAGCCAAAC 

GACACAATGAGCGTTCTCAGAAAGGAAACTTCTTCGGAATGAAAAGCTTTGGCCACATTG 

GAMGGGTAGMGTCTGAGAGAAACTTTTTCATCAGGGAGACTAGGTCGATGTCTTAGTC 

CTTTCAGGCTGCTGTAACAAAGTACCT 

Sequence 3023 

TATTCCACAGAATCAGGGGATTAACCGCAGTGAAAGNAACATGTGTAGCAAAAAGG CCAN 

TCNAAAAAGGTCCANNGTAACNCTNTAAAAAMGGGCNCGMGTTTGCATGGNCNGTTTT 

TTTTNCACAATAAGGGNCTTCCGNCTCCACCCCCTGGACNTAGGCCATTCNAACAAAAAA 

AAAATTCCNTACGGCNTTCCAAAAGGTTCAAGTAAGGGGTTGGGNCAG NAAAAAC NCNCC 

GNACCAGGGGNACCTTAATTAAAAAAGGMNTAACCCCAAGGGGCCGGTTTTTTCNCCCN 

CCCNTTGGNGGAAAAGGACCTTCCCCTCTTTCGGTTGGCCGGCCTTTCTTACNCTTGGTT 

TTCCNCGGMCCCCCTTGNCCCCGGNCATTTTAACNCCGGGGAATTAACCCCNTTGGGTT 

CCCCCGGCCCCC! I I I 1 I I NCCTTCCCCCCTTTTTCC 

Sequence 3024 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGGCAAGGAAGGAATTT 

TTCTGTMTTTGGAAAAATGGGAACTGTGAGAAAGAAGGAGTTGGAAGCTCATACTTAGG 

GAGCATCTACAAGGACGTCTTTTTCACGTTGGTTGGMTATCCAMTCAAGGATTATTTC 

AGMTCACCCAGATGATTAAAAMCTACTGAGATCCAGGTTGTATTTCAGCAGTTCTGAC 

MTTGCTCTGGGTCGMGCTTGMTCAGTAGTTMGAAAMCMCAAACAAMCAAATTT 

TGGGGCTTTTCTCACTGATTTACAGTTAMGCTCATTTACTCTTCCTATGACTTTAGATG 

G AGG ATATTTN C AAGTCTTC AGG ATGG AG AC ATG G AGG GG AAG 

Sequence 3025 

CCGCGGTGGCGGCCGAGGTACACTCCAGTATGTAGATMCTTTTTGTGTGTAGCCCAACC 

TGGGAGACATCTGATCMGTTACGAAAAAGACCCTGAATTTCCTGGCACACAGGGGATAC 

AMGTGTTCAAAAAGTAGGCTCAAATCACCCTTCAACTGGTTTAATATTTGGGGTATATT 

TTGATCCCCCAGGCCTGGCAAATATCCCCAGAATGGTGCAAGATATATATGGTTTAGGAG 

CCCCCAGTACCTGCCCG 

Sequence 3026 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGAATGACTGAGTAGTCACAAATT 

CAGAGAGCTGCTGGGAGGTAGATGAGTTGGGGCTGGGAGGTGTCCATGGGATTTGGGGGC 

TTGAGGGTCACGGTCACCTCAAGACAGCAGGATGGGGTGGAGAGCANGTGAGGGGCGATC 

AGAGAAACAAAGGGAGTGAGTGAAGGCAACTCTGTCTTTGAGGGGACAGCAGAGCAAGCA 

GGCTGAAGGTCTGGGCGCAGAATGTGGTGCACAGGAGGTGGGTGGGAGGGAATTTACATA 

TTCACCCATCTGACCTGCTTTGTGACCTTCTCCMTATTTGTTGTAGNA GAAGA AATAAG 

AGAACACGTGTGGCCAGACACCATGGCTCATGCCTGTAATCCCAGCACTTTTAAGGG 

Sequence 3027 

AGGTACTGTTGAGGGACTCACCATAGATCTGAGGGAGGAAGGAGCTTAGATTTGGGAAAC 

ACTGGGGTGGATCCCTCTTCTGTTGAGTAGGGGCACTTAAGACACCCTGCTGTTGTGTTG 

TTCCTCCAGTGCTAGATCCCAGAACAAMTTACTTCCTCCCACCACCTTTCAGATTTCTC 

CTTTGGTTGCCTCTTGCGTTACTTTCAGGGTTTACAGTTGAATTAAACAGGTGGGAAGAA 

AGAGGTCTCTGCCATCTTTTCTATACCAGAAGTCTCATACAGTGTATTCAATAACACTAT 

TAATTAAAATTAATAATACTATTACAATTAAAATMTTACTTTMTTA 

GTGTCTTATTTTATGTGTATTTTTACCA 

Sequence 3028 

CTATACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAAGAGT 
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GTGGCCTGAACCCAGCAGGGCCTTG6GTGAGATCTCAGCTCAGCTACTTCATAGTGGAGA 

CACCGTGAGAAAGCTATGTAACTTCTCCAGGCCTCAGCTTCTTCAGCTGAAAATGGAGCT 

TAACAAAAAAGCACTTTCCTCATAGGGTTATCATGAAGAGTAAAGAAATCAAGTAAGTTT 

AAAGTGCTTAGAACAAGTGTCTNGGTGCCTGGTCATTACTTCATTAAAGTGGTTAAACCT 

CATTMCCAGTTTGTTNNACCCTTGCCCCCGGGGCGGGNCCGTTTTTTAAAAAACTAAAN 

NNGGGAATCCCCCCNGGGGCTTGGNMGGGAAMTTTCCAMNTTTTCAMNCCTTTTTT 

T NCGAAAAN CCCGTCNAAACCCTTTTAANGGGGGGGGGGGCCCCCCGGGAAANCCCNAAN 

CTTTTTTTTNGGTTCCCCCTTTTAAG 

Sequence 3029 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAATGACCGCAGCAGCATATTTCCTAAGAT 

CCCTGAAAACAGCTAAGATCCCTGAAAGCAACTCAAATGTCCATCAATAGAATGATGGAT 

ATTATTCTACCCCATTGAAACATTTCACACACMGTTTTGTC 

ATTTATTTA I I I 1 I CATGTCAATGAATTCAMGGAATCACACATAGGTTTTTGTATAGAA 

ACAGCACTACATTTCTGTAAGGGGTGCTTTTTCAAGATATCTGTGTTCACTAGTTGTAGC 

CTGCGACTGGAGGGCTCTGTGTCAAGCATTCCACTTGCTACTGATTCCTACCATACCCCT 

CTTTGTTCCCAGAACATTGTACCTCGGCCGCTCTA 

Sequence 3030 

CCGCGGTGGCGGCCGAGGTACGCGGGTGTGCAGCCACAGTGAGCTTCTGAACATTTCTTC 

TCAGACTAAGCTCTTACACACAGTTGCAGTTGAAAGAAAGAATTGCTTGACATGGCCACA 

GGAGCAGGCAGCTTCCTGCAGACATGACAGTCAACGCAAACTCATGTCACTGTGGGCAGA 

CACATGTTTGCAAAGAGACTCAGAGCCAMCMGCACACTCAATGTGCTTTGCCCAAATT 

TACCCATTAGGTAAATCTTCCCTCCTCCCAAGAAGAAAGTGGAAGAGAGCATGAGTCCTC 

ANA TGGA AACT TGMGTC AGGGGAAAATGGGNGGCTTCACCAAATTATTTTTGNGCATGG 

GGGTTTTAAAGTTTTTTTCCCTTTGGNAAMTTAAM 

Sequence 3031 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCCCTGAGCTGCATCAT 
TTAMGACCAATTGTTAAGGACCAATCTGAGGGCCTGTGCATTAGTTAAAACTCCTTATC 
TGGTCATAGAGGGTGGTGTGGTTTGGACGCGTTCCCCTTGTCTGTGCAATTCCCAACAGA 
GGCAAGTGCGCATTTGAGACAGGTTGAGGGTTCAAGCTGTCCAATCCCTGCTCATCTG 
Sequence 3032 

CCCACACACACAGGGCGCACGCAGGCACACACTGCAGACACACACACACATAAACACACA 

GAGTTCACTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAAAGTGACAACACCGGAAAC 

AAGGTAATTCATGTTAAAACACAAGGGTTATATCAGTAAGAGATGGGATCCCCGAAGTAA 

ACCGTGGAATTTGMTCMGCTTNGMGAGCTAAAAAAAGAAATTGGAGTTTCAACATTN 

ACCTTCTTG N A TCCTTAAGAAATNCAG NAAGTTC AAAATAG AAAAC NTTACAGTTTTCN G 

GATNCCAAAAGGTTNGAAAACCNTTTTTGGGGGNATTTA^ 

Sequence 3033 

CCGCGGTGGCGGCCGAGGTACTGTCTTTTGAAAGCAGAAAGGTAGAGCAATCATTAGGGT 

GGCACGAGGGCATTGAGAGTGCTATGCATTGTCATTTACTCAATAAGTTGTAGTATTAAA 

GACGGCCCACGGGTGTGCATCAGGCTGTATGGGTAGGAATATTCTAGAGGTTTTTCTTTC 

CTGGGCTT^ATTCCCCAACCAGATAAMTACCTGATGTTTAAAATTTTGGGTTTTTAAAA 

AAATCACCCCAAGAAACCTTCTGTTTAACAAGTCCCTATATGTACCTGCCCG 

Sequence 3034 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAGATTC 

AAAGCATAGCATGTCTACCTGAACATCCTCAGATCCTGCTGACATCACCAGTGGAGGTCC 

TCTGCTGATTGATATAAATCATTTTCAAATCCAGCAATGCATTGATGACTGTTAATTCCA 

TGCTCCTCAAACTTGCCCAGAAATGCCTCTAATAGGTAAGGGAAAGAGAAGATACCTCAG 

TATGTATCCTATGGCAATGCATCTCAAAMTGATACTTATTTAAAGACTCTGTTTTTAGG 

TAAAAAGAAATTATTCAAATGTATTCTGAAAATGTTATTGGTTTTAATATAAAATGATAT 
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GTTCTCCAAMTCTTTAAGTGGGGGATTGAAAAGATCCCAGGGAAGAATGTTATTCTTGG 
ATATTCTTG G ATC ATTT 
Sequence 3035 

CCGCGGTGGCGGCCGAGGTACATTGACAGTGCTTCTGGTTTCAGCACACATGGAGATATA 

GAATTGAAATTGCTCTGCTTTCTTTACAAATGACTCAGAGTGTGCTTGTTCTTCCCTAAA 

ATGTTGCCAGCATGTAMGGATTTTCAATGAGCAGTGCAATAAGCCAGCAACTGGGAAGG 

CTTGGAGCGGCACTGATATAACGTGGTGCTGTCACACAGATAATCAGGGCTGGAGCCAAG 

TCTCGGTAGCGCTGTGCATTATCCTGGTCACATACTGGMGTTTTGTTTATCCAACTTGG 

TTGCTCTTGAAGCTGGTGAGAGCCCTGCCAGGAATGTTGGGAGTGGGACTTTCCTAGCCA 

GGAGGGCANGTACNCTGCCCCGGGACGGCCCGCTTCTAGAAACTAG 

Sequence 3036 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACCACG 
CCTAGATAACTTTTTGTATTrTTAATA 

CTCAAACTCCTGACCTCAAGTGATCTGCCTGCCTGGGCCTCCCAAAGTGCTGGGATTACA 

GGCATAAGCCAGCACACCTGGCCGTTAAACTAGTTATTTTAATCATGCTACACAAAGAAG 

ACATTTCTCCAAAGGAAATATATAAAGGGAAACACTGTTGAAATAAATGGAAAGCTAAAA 

GTCTCAGCATAGAAGCTATAAAAAGAAACAAGTGGAAATTGTAGAACTGAAAATACAAAT 

AANCAAAATTTACTGGAAGGTCTCAACANCAATGAAAGATTGACTGAGGGGAAAGGAGGT 

CAGTTAAAATTTGAA 

Sequence 3037 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAATGACCACAGCAGCATATTTCCTAAGAT 
CCCTGAAAACAGCTAAGATCCCTGAAAGCAACTCAAATGTCCATCAATAGAATGATGGAT 
ATTATTCTACC CCATTG AMCATTTCAC ACAC AAGTTTTGTG TTTTATTTTTATTTATTT 
ATTTATTTATTTTTCATGTCMTGAATTCAAAGG^ 

ACAGCACTACATTTCTGTMGAGGTGCTTTTTCAAGATATCTGTGTTCACTAGTTGTAGC 
CTGCAACTGGGAGGGCTCTGTTGTCAGGCATTCCACTTGCTACTTAGTTTCCTACCATAC 
CCCTTTTT 
Sequence 3038 

CCGCGGTGGCGGCCGAGGTACACCGCACACCACAGGGTGGCTGCACTTTTTGGGTGGTGG 
GGTGGGGATGGGGGCAGGTATGATCAATCATCCAAACCAGGGCN I I I 1 1 AAG AGTAAAAG 
G AGGCTCTACTGATAAC ATGCTGCAACAACAGATGTG ACTAGG AAn nn n nnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnATCATAGGTCATAGCCTGTATTCCTTGAGTGAAATTNTTATCCCGTTCACCAATTTT 
CCACACNAACAATACNTAGCCCGGGGTAGCCATAAAAGTGNTAAAAGCCCTGGGGGGTT 
Sequence 3039 

TANGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGAGGTACCAACTGAAAAATAAGACAAC 

CAATCCCATCATGGTGCCMCCACTCTAGTTGTCTTGGGCAAGTCACTTATTTATCCATC 

TTGGCTTTCTTGCTCAAAAATGATGGATAGCTACACCTCACAGTGAGGATTCAGTCATGT 

AAGCAAAGTGCCTTGAAGACAGGAAGCCAGACATTATAGGGGCATAATTTTTGCCCTGAA 

ATGCATTTGGTATGGGTTGGGCATCAGATGTTTCTGATTTGTGAATTAGTCTGAGAATGT 

TAGACATGCMTGGTTTMGGATTGCTAGGAGCAAAACAGMCCATGTTGTGTGTTTATG 

GGTMTCAGCCAGTTTGGCAAACCACCCAGTGGGCTGGAAACTTGTTACCCTGCCCGGGG 

NCG 

Sequence 3040 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGTGTG 
GGTGTCTTACCTTGCATTAGMTATCCTAGGCCTTTCTMTTCTTCCTCTTGAATTCCCT 
GTGTCACTCATCTGAGGTCATCTCTTGGATTTAAATACATTTGGTCTGAAATTCTCAAAC 
TGCTGAATAAAATGCCATAGTAAGTCCAACTATATGGTTTCATGTAGGGTAAAGTAACCA 
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TTGTGGAAGGATTACGATAAAACTGTGTGACAATAAATAGAATAAAAGATGGCAAAACM 

TCTGTAMTTACAAAGTGTTATAAAMTGCATTGAATTTCTAGNGGTTCTTACACTTTCA 

TATTTCTCATTGGCACATCAATCAGACCTTGCCCTTCCTTAAAGTGGGGGAGGGAAGNAA 

AAAGGGCTG 

Sequence 3041 

ACTNCTTAGGGC GAATT GGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGGGGAGTGTAAA 

ATATTTATGTGGCTTTTCTCCTCGCATTTGCTAGCATGAATGCAGGGGCATGAAAGTCTG 

AATAGGGCTTTCAACCATCACCTGGGGCTTAGAAAATGAAAATCAGAGTTTAGGATACCG 

TCAAAAAGGGCCACATTGTAGAGTGGATAGCAATCCATTCAAAAGCTCAGCACTAACAAA 

TATTAMCTTCAGATTTAMTCMTTCM 

TCATAACTCCTGGCCAAAAGAAAATATCACTTCTTTTGGGMGMTATAGCATCATTTAG 
ATCCTCAAATGAATGGCTACACATGGGTCAGACCGAAAGTGTTTTAACCAGGCCAATTCA 
AAAATAATTCAAG 
Sequence 3042 

CTACTATAGGGCGAATTGGNTCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACAGC 

TGCATNACACTGAATATCATCTCCCTGCACTGCCATTTNATTTGAGCAGGGCTTTGCCTG 

TCMGMCATTCATATCTrrTGATAMCAGTGCATTTTTTCCCTGATAGTCTGTATGGGA 

CAAGCCTNCTTGTTTGCATCCGAAGGACAGGGACTNNANGACCTNCTGCTGTGGTCACCA 

AGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGNCCTTCTCAGTTTGCCTACGATC 

CATTGAG GCAAAGGTTTTCCTTNTCTC AATACAC 

Sequence 3043 

CTATAGGGCGAATTGGTNCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGA 

AGACTTGAAAGGTCACGTANCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTT 

GACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGAC 

TCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCC 

TCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGA 

CAAGAGACAAGGAGAAGGCC 

Sequence 3044 

CCGCGGTGGCGGCCGAGGTACAATTTGCATGCACGGACTTCTACATACAGAATACTTAAA 

CAGTTCATTGTATTGAAACACGCTTAAAAATGGCCCTAGCACTTGCTGCAATCCTGCTGT 

GTAGGAGTTAGCCAGGTTGTGTCTGCTCCCTCTAGACTATTCTATACATCAAGAACTAAA 

AAAGAAGAGCAATCCCAGCATGTGGCTGGATGCACATCAAAGCCAAGTCTGCCATGAATT 

ATTTTCCTTCCTGTTC CTTTCATTCAGGTCATGA AGC ACACATACTGCTACAG GAAAGGG 

CAAAAGGCAATGTCAAGTGCTGTGTMTTGTTGCJGTTCTTTAGTGGCAAAGGCCANCAAT 

GTTGCAGCCAAA 

Sequence 3045 

CTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGC 

CAGGAACTATTTATAAACATCCTGCAGGAAAATGTCAGANATGGGAAGAAACAAGAACTT 

TGACATGCTTGGTGTTCTTGCCCAAGCTTTGA/vNAAGTTTACAAAGTCTATATGTCANAA 

TACACATTTCCCACCTTGCCCAACAGTNGAAAAACATAANAAGAGAAAAACATTAAAAAA 

TGACAAGGAAGTTAATGGMGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTTCATA 

AGGTAATTAATGCCCT 

Sequence 3046 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACCCTATCCAGAGGGTTCTACCCCATGTAGAAC 

CTCTCTTTTCTGTTCACCTGATTAGGTGTAGAAGTAGCCAGTCAAATTCATCCTCGCTGC 

CACTGTTATTTCTGCCCCACCTCTMGCATmCTTCmCTCAnCACCACTTTTTT^ 

ACATGTGTAATACATCAGTAACTAATTTATAATATTCCACTCCAATCCATCAACCTCAGT 

AAATTTAATATTCACATCATCAGCCCTTCATATCCATACATTACTTCAAATTCTTTATTC 

TGTTTGACTCCATGGATTTGCATCACCTCTCCACAGCAGCAAGGCCAAAGACCCAAACAC 
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Sequence 3047 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCACCTTGAGCCCGGCATTTAACCTCTGTGGGC 

CTCAGTCTCTGCACCTATAAAAGGAGGGATTTGGGCGAGATATGCTGAGGACACTAACAT 

TCTACACGTCTAMCTTCGAGGAACTTTTCTAAAGCMGAAAACGAATAAGGGCTAAATT 

TAATTCGTCAGAGTGGAAAGAGAACAAAGAGAGAGGACTGGGAGTGGGAGATTAAAAGGC 

TGTGGAAAGTGGTCTTGAGATAGATGTTATTGGGGACCAACCTTCTCAATGCACTGGTCA 

AGCGCCGGACGT 

Sequence 3048 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCAG 

GCCTGGGGCCTGTGCCAATAGAGTATTACTGGATATAGGAATAGCTCTGACACCTCAAGG 

TCACCCTAGAAGTTGMGGATTTCAGGGCTCACATATTTTAGACTCAAACTTCTAGAACA 

CTGTTCTGACCCAGCTGAGCTATTGAAGATAGTTTTCACGGCACTGCTATCCTTGATTGA 

CCTCCTAGGGGATCTGTCTCTCTTGAGAATAGGCCTCAACTCTAATTTCCTTGGGCCCAA 

GACCAANAGTGGGCAAGGGGAGCCTAGCCTGCACCTTGCCACCTGGTGTTGGCCATTATC 

AGGGTGCCAGGAGCTGAAGTGGCACATGTTAGTGTCTACAGAGTGCCACGTGGGTAGTGC 

TAGGTCTTNCAGGAGGGCTCAGCCACTGATGACTTTAAGGCATGCAGGTCTCCAACATGG 

CTTGCAGGGAAA 

Sequence 3049 

CCGGGCAGGTACTTGTGTGGGTGTCTTACCTTGCATTAGAATATCCTAGGCCTTTCTAAT 

TCTTCCTCTTGAATTCCCTGTGTCACTCATCTGAGGTCATCTCTTGGATTTAAATACATT 

TGGTCTGAAATTCTCAAACTGCTGAATAAAATGCCATAGTAAGTCCAACTATATGGTTTC 

ATGTAGGGTAAAGTAACCATTGTGGAAGGATTACGATAAAACTGTGTGACAATAAATAGA 

ATAAAAGATGGCAAAACAATCTGTAMTTACAAAGTGTTATAAAAATGCATTGAATTTCT 

AGTGTTCTTACACTTTCATATTTCTCATTGGCACATCMTCAGACCTTGCCTTCCTAAAA 

GTGGGAGGAAGAAAAAGGCTGCACCTTGTCTGGGCTGAAGGTCTGGGCCAGAACACTANC 

AAAACI 1 1 I IAA 

Sequence 3050 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTCGTTGTTGT 

TTATATCTAAGAAAGAAAAAAATGAATMTTATCTGAGTTTCCTGAGCTTTATGACCAAT 

AGCATTGTTTCTCAATATA I Mil I CAATGGATGGGTATATCAGAGCAATAAGCTAAATT 

ATGCTGTAGTAACCAACCACCCGCTTCATTTTNGGTGAATTGACATAAAGATTCATTACT 

TGCTCACACTAMCTTCCTCCTGAGGTTTGGTTCTGTGCTGCATTTTGAATTGCCTTCAC 

TTTGGAACTTATCATGGGGCAGCAGTTACTATATGGAT7TTTATTGATCACTGTGTATAG 

AGAAAMGAACATCAGGGACTCTTCTGGCAACAGTTAATTGCTCCAGTCTGGAAATGTTG 

CACAACTCATTGTCTGGAATTAGGTTGCAAGGGCAACATTCAATCACAAGAAGGGAGACC 

AAA 

Sequence 3051 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGACATGATCATAGCTCACTGCAGCCTTGA 

ACTCCCTGGGCTCAGGTGATCCTCTCACCTCAGCCTCCTGAGTAGCTAGGACTACAGGCA 

TGCACCACGACGCCTGGCTA I I II 1 1 CTTCTTCTTCTTC I I I I I I I I I I I II l IGGTGGA 

GAGGGGGCTTTGCCATTGTTGCCCAGGCTGGTANGATCTGTCCTTTTTAACAAGGAATTA 

GAAAGGGCTGCTTCGTATATCTTCATGACACCTMCAAGCTGTTTTCTCTCTTTGTTCCT 

TCAGAMTGCATGCAGTTTAATTTTTCTTTACCTATACAGGATMCTCTTATCATAACAA 

ACAAACAGTAAGGAAAACATAAATACGAAAGCAAACATTACCATAAATTCACCCCCAGGA 

TCACAACTTGGACCATTT 

Sequence 3052 

TTTCCTCCCCGTGATTACGGCCTGCTGGCTTCTCCAGGACTGGGTTAAGGCACAGCTAAA 
GCTGTTCTCTTGGCCCTGCAGAGCCAAACCCTGGACCTTTCCAGGAAATGGATAAATACC 
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ATTTCACTTCATTCATTCACTGCTGCCTGTGGTGCCTGAATACATACCCTGTGGCATCAT 
CTGTGAGTTATCCACACTTCCCACAGCAGATGCATAGGGAGGAGGTACCTGCCCG 
Sequence 3053 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGATGC 

ACACAGATGGGAGGAGCTGCCTTGAGCGAGAGGACACTGTCCTGGAGGTGACAGAGAGCA 

ACACCACATCAGTGGTGGATGGGGATAAACGGGTGAAACGGCGGCTGCTCATGGAAACGT 

GTGCTGTCAACAATGGAGGCTGTGACCGCACCTGTAAGGATACTTCGACAGGTGTCCACT 

GCAGTTGTCCTGTTGGATTCACTCTCCAGTTGGATGGGAAGACATGTAAAGATATTGATG 

AGTGCCAGACCCGCMTGGAGGTTGTGATCATTTCTGCAAAAACATCGTGGGCAGTTTTG 

ACTGCGGCTGCAAGAMGGATTTAAATTATTAACAGATGAAGAAGTCTTGCCAAGATGTG 

GATGAGTGCTCTTTGGATAGG 

Sequence 3054 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGCCC 

TAGCAAACTCTCCTAAGTGGAGAATGAGGCCTGGGTCTCTTGACAATTAAAGCTTTGCTC 

TGTTCTTGGACTAAATCCAGACATAACCACGTCAGGCTCCTCATCTGCTGGGAAGCAGAT 

TGCACTGGATTTCACAGTCACGGAAAACTTTTGACTGTGTGTACCT 

Sequence 3055 

CCGCGGTGGCAGCCAGAGGCAGTCCTGGATGAATTCAGGCTGCAAAGTGCCCTGTGACCT 

GGCCCTTGGGAAGCCACGCTCTGATGAGGTTTGCAGATCTCCCCGCAATCCAGGGCCAAG 

CATGAATGAAGGTGTTAGGCATTTCAAGGATGCGACTAGGACGTGCTGCTGTCTCAGGTT 

CACATTATACTGAAATAAGGTCCTGGCCAGACAGCACCATATAGTTAATTGAGGTGTTTG 

GGAGATACTCAAATGGGTGTTMTATTTCCCAGGCAATATTTTCAAGGCTCAGAGCCTCT 

ATTCAAAAGATAACTGAAAGGCTAGCAAAGGGTATTGCAAAGGTATATTCTGATTTCTGA 

CAGGAAGATCGATGGATGGACAGTGAGAGTATAAAGACCTTAGGGTTTGGGGTTCTGATT 

( 

Sequence 3056 

AGGTACTCACAGCTGCATCACACTGAATATCATCTCCCTGCACTGCCATTTCATTTGAGC 

AGGGCTTTGCCTGTCAAGAACATTCATATCTTTTGATAAACAGTGCAI Mil ICCCTGAT 

AGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGGACCTCCT 

G CTGTGGTCACC AAGG AATCACATTCAAACTTCTCAN CAAGGTGTTATGGTCCTTCTC AG 

TTTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTCTCAATACACCCTCTGCTTTANCC 

AGGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGTTCCTGACTGTTAGTACCTGC 

CCG 

Sequence 3057 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCGGCCCGGGCAGGTACTGCCTGTGCGT 

AGCTG C ATTAAACCAAATGATCAAG AATATCAAG ATACATCTTCCTGG ATCTTC AA.l CCC 

ACTTAAAGATTTGGAGMCATATCATTTTATATTAAAACCAATAACATTTTCAGMTA 

TTTGAATAATTTC 1 I I 1 I ACCTAAAAACCAGAAGTCTTTTAMTAAGTATCATTTTTGAG 

ATGCATTGCCATAGGATACATACTGAGGTATCTTCTCTTTCTCTTACCTATTAGAGGCAT 

TTCTGGGCAAGTTTGAGGAGCATGGMTTAACAGTCATCAATGCATTGCTGG/JTTGAAA 

ATGATTTATATCAATCAGCAGAGGACCTCCACTGGTGGATGTCAGCAGGGATCTGAGGGA 

TGTTTCAGGTAGACCATGCCTATGCTTTGGAACCTCCCCCCGCGT 

Sequence 3058 

CCGCGGTGGCGGCCGCCGGGCAGGTACCCAACAATTTGAGTAACGTCACTTGTCAGAAGG 

AG AC A I 1 1 I IGAGTCATGCCATAGATTTGAGCCCTGCTCCATCATTTACTTGCTGTGGGA 

CTTTGGACCTGCATCCACCCTCTCAGAACTTGTGTCTTCATTGTGTATCACCTATTTTAC 

AGATTGTTGTGAGATTAAACAAGGTAAGATACAAAAGGGTCCTAATACACTGACTGTTAA 

AACTGAGCAGTGACTTAGAATAATCTCTTGAGCAGTCCAGGGCACATATTGACCTCTTAA 

GTCTCCTTTAGAGTTATAATATTAGCATAATAACACTGAGCTTTAAGAAAAGTCTTTAAG 
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CeAGTGATTCCTGTGTATATACCCTCCAGAGGGATTCCAAATCCCCAGGAACTGAATGAA 
AGTC ATGGTTGAATG CACTCGTATGC A 
Sequence 3059 

CGGGGGGAGCCGGGCTCCCTGAGACTCAAGAAAATGCACCCGACCTGCGTTCCTCCTGCC 

TGGGGTGACCCATAAGGAACACCCCCCCCTTCTCAACAACCACAGAAGATAAATGCCAAG 

GGGACCACCAACATGGGCATGGGGCTGTGATGCCAATCTTAAGGGAAAACACAAGTAAAA 

TCGGCTCCAGGACTGACCTGGGGGCAGCTGCTCTTTCTGTCCATGACTTTGAGGCGGGAT 

CTCGAGCGTCTGTCTGACGGGCCTTCTGTCTGCTGTCCTCTCGCAGGTTTCAGACATTGA 

ATCCANGATTGCAGCCCTGAGGGCCGCANGGCTCACGGTGAATCCCTCGGGAAAGCCCNG 

GAGGAAGNCAAACCTCCCGATATTTCTCCCTNGAGTGGCTGGGAAACTTGGCAAGAGAC 

Sequence 3060 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTATATCATGTAACTAT 
TATGAGATGTTCTGCTGTGATCACGTGGCCTGCCTGTCATCAGGAAAGGGAGAAAATGGA 
G GGG AGGACTCTTAC ATTC C AGTCAATTCACGTAGCCTCTTG ATG CC AAAAATTAG AAAG 
G GG AAN C NTATC ATATGTGTACCTCG GCCGCT 
Sequence 3061 

TATACGACTNCTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC l I I I i I I 
I I n I I n I I I I I I I 1 1 I IGGGATGGAGTCTTGCTGTCACCCAGGCTGGAGTGCAGTGGC 
ATGATCTCGGCTCACTGCAGGCTTCGCCCCCTGGGGTTCACGTGTGTTTCTAATTTTTAA 
GGCCCTACTTTAATTTCTACTGTCACTAAATAAAAMGM 

AACCCTAGACATCATGTTCCTGCTTCTGCTGGAAAAAGAGACAGGATATNACCATCTCAA 

ACCAGCACCTTTNAGCMCTGAAGCCTTTTTGTCCTCACCCTGAGTGTTAANACAATGGC 

TCCCTNACACATCACAGGCCCTGGAAGTGCTACAACAGCTGAATGGGGTTTTCATGGGAC 

ATAAAATCACTTTCCCACTACCTNCCACCTACAAACCT 

Sequence 3062 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTTTCANGCCACTGGTGG7TTGCCAAACTG 

GCTGATTACCCATAMCACACAACATGGTTCTGTTTTGCTCCTAGCAATCCTTAAACCAT 

TGCATGTCTAACATTCTCAGACTAATTCACAAATCAGAAACATCTGATGCCCAACCCATA 

CCAAATGCATTTCAGGGCAAAAATTATGCTCCTATAATGTCTGGCTTCCTGTCTTCAAGG 

CACTTTGCTTACATGACTGMTCCTTACTGTGAGGTGTAGCTATCCATCATTTTTGAGCA 

AGAAAGCCAAGATGGATAAATAAGTGACTTGCCCAAGACAACTAGAGTGGTTGGCACCAT 

GATGGGATTGGTNGTCTTATTTTTCAGTTGGGTACCTCGGCCGCTCTANAACTAGTGGGA 

TCCCCCGGGC 

Sequence 3063 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGT 

TTCTCCAGCTACAGTCTGTCACAGCTTCCCTTGGCTAGGAAAGGGAAATCCCTTGACCCC 

TTGCACTTTGTGGGTGAGGTGACACCCTGCCCCAGTTCGGCTCACCCTCCGTGGGCTGCA 

CCCACTGTCCAACCAGTCCCAGTTGAGATGAACCAGGTGCCTCAGTTGGAAATGCAGAAA 

TCACCTGCCTTCTGCTTCGATCTCNCTGGGAGCTACAGACCAGAGCTGTTCCTATTCAGC 

CATCTTGGAAGCAACCAAAAGACTGATTTTTAAATGGAAAAAAMTGTTTTCCTGACAGA 

GTATAAAAATATAAAAGAGTATCTAACCCACTGCAAAGAGAGGTCTTATGAAATTCAAGG 

GGTGAACCCTGGCTTTCTGTGAAAAGAAAGGCCGTCTT 

Sequence 3064 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTATAGC 

ACAAGCTGATGATATGCTGGCAACTGAACATTTGGCTTCAATGTCAGAGTAATAGAGTGT 

GGCAAACTGCAGAGCTCATGCTCCGCCTAAAAGGGGCAGCTGCTGCTCAGCTCCTGCCAA 

TTACCACCATGCAGGAATTTGGCCTAGTGTGCTCAGCTCCTCCTGTTTTTACCAGAAGCT 

AGGAATCTGGGTTTTTATGTGAAATGGCCTAA I I I I I AAATGTTAGCATCCACTCTGAAA 

AAAAAGCAAAACACCGTGTGGGCCAAATAAAACATTTGCTGGCCAMTGCAGCCTGATTT 
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GCTCTGTTTTCAACCCCTGGCACAGAATTTTGTTCATTTCTCTCTTGTGGTACCTNGGCC 

GCTCTAGAACTA 

Sequence 3065 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTTCAGCCACTGGTGGTTTG 

CCAMCTGGCTGATTACCCATAAACACACMCATGGTTCTGTTTTGCTCCTAGCAATCCT 

TAAACCATTGCATGTCTAACATTCTCAGACTAATTCACAAATCAGAAACATCTGATGCCC 

AACCCATACCAAATTGCATTTCAGGGCAAAAATTATGCTCCTATAATGTCTGGCTTCCTG 

TCTT CAAGGCACTTTGCTTACATGACTGAATCCTCACTGTGAGGTGTAGCTATCCATCAT 

TTTTGAGCAAGAAAGCCAAGATGGATAAATAAGTGACTTTCCCAAGACAACTAGAGTGGT 

TTGGCACCATGATGGGATTGGTTGTCTTA i J 1 1 I I CAGTTGGTCCTGCCCG 

Sequence 3066 

CCGCGGTGGCGGCCGAGGTACCCAGCAGAATTCTCACATGTGCTTATTTTGAAATTAAAC 
CAATGCAAAAGTTTTTACCCTGAGGTGTTGCTAATTCATGMCCTATTCTCTTGACAGTT 
TTTCCCTCCAATATMCCTATATCTTATTAAATGTCTTTTCTTCTTCTAGATTGTT^ 
ATAGTAGCTCAATTTTAGCTTTGTTC AGGCTCCAAATTAG AAAAAC GCTG AG ATTAGCC A 
MTATTTGMCACTGTAAGGTATTAATCAGTTTAATCTGAAAAAAAAGGTATAAGAACAC 
TTNCTACAGTTCCAGAAGAGTGAGCTGTCCTGGCTGAGGGTTATTTCATAAGCACCTACA 
TGTATGGACACTAATGGTGAGTCATCACCAGAGGAAGATGAAAAATCTGGA 
Sequence 3067 

CCGGGCAGGTACCTATAGCACAAGCTGATGATAGCTGGCAACTGAACATTTGGC7TCAAT 
GTCAGAGTAATAGAGTGTGGCAAACTGCAGAGCTCATGCTCCGCCTAAAAGGGGCAGCTG 
CTGCTCAGCTCCTGCCAATTACCACCATGCAGGAATTTGGCCTAGTGTGCTCAGCTCCTC 
CTGTTTTTACCAGMGCTAGGAATCTGGGTTTTTATGTG^^ 

TTAGCATCCACTCTGAAAAAAAAGCAAAACACCGTGTGGGCCAAATAAMCATTTGCTGG 
CCAAATGCAGCCTGATTTGCTCTGTTTTCMCCCCTGGCACAGAATTTTGNTCATT 
Sequence 3068 

GACMCCMTCCCATCATGGTGCCAACCACTCTAGTTGTCTTGGGCAAGTCACTTATTTA 
TCCATCTTGGCCTTCTTGCTCAAAAATGATGGATAGCTACACCTCACAGTGAGGATTCAG 
TCATGTMGCAAAAGTGCCTTGAAGACAGGAAGCCAGGCATTATAGGAGCATAATTTTTG 
TCCTGAAATGCATTTG GTATGGGTTG GGCATCAG ATGTTTCTG ATTTGTGAA7TAGTCTG 
AGAATGTTAGACATGCAATG GTTTAAGGATTG CTAGGAG CAAAAC AG AACC ATGTTGTGT 
GTTTATGGGTAATCAGCCAGTTTGGCAAACCACCANTGGCTGAAACTTGTACCTGCCCG 
Sequence 3069 

TACGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCGCTCGACACCCTGTCGA 

TGGGCATGTCGGCCGACTTGCGCGCCGCCGTGCTGGAAGGCGCCACCATTGNGCGCGTGG 

GCAGTGCCATCTTCGGTTCCCGTAATTACTCTCACTGATTGAATGAAAGAATCACCATGA 

CGACAGAATTGAACATCNTCTTCGTGGGGCGGAGGCAATATGGCCGCCGCCCTGATCGCC 

GGCCTGGCAGGCAAATTGACCCTGGGCGGCAATATCCACGTGATCGACCCCCATGCGCCG 

GCGCTGGAAAAATTGCAGGCGCAGTTNGGCGTCACGACGGNGACTGCCGCCGGTGAAGCG 

CTGCGGGGCGTGGG 

Sequence 3070 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTAAGCGAAAAGCTCTTTGCCCAGACCA 

TGACTCAGATAGTTCAGGTAAAGATCCTAGAAACAGCCAGGGGAAAGGAGCTCACCGACC 

AGAACATGCCAGTGGTAGAAGGAATGTCAAGGACAGTGTTAGTGCTAGTGAAAGTGACCT 

CAACTGTGTACCT 

Sequence 3071 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATG 

GAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTC 

TTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAG 
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ACTCTCTTGATCTGCAAAATACGACTTCCATCATTGCTGAGTTGTGTGCAGCTGCAAGCC 
GGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTTCACCTTCCCTGGGATG 
TCCCTGGACAAGNAGACAAGGGAGAAGGCCTTAGGGATCTACTGGGGGAGTCCGGAGGAG 
CAAGTCCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCCTTATGCTACTT 
Sequence 3072 

CCG G GC AG GTAC AACC ACCCC CCGGGTATCCATAACCTAACAATAAC AATTGTTAAGCAT 
TCGG CC ACTCATGTTTCTTCAGCTCCCCTACCTCCTCCCTATCCTCCCAGGTTCCCACTG 
^^ATATATCATTTCATTCATAAATAATTCTGTATATTTAGTTACAGCTCAATGMTTT 
C CAC AAAGTG AACATGCTN ATG CAAAC C ACACCTGCACTGG GAAATAG AAATTAC AAAC A 

CCCGGAGCCCCCTCTGCAATACTCTCTCAGTCCTTTCCTCACTCCTTCCCAAAAGAGTCC 
CGCGTACCT 

Sequence 3073 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGGTTCGACAGGTTGGGTGTC 

TGCTCTCAGCCTGGGCTCCCGCAAGCGGGGGAAGCAGCTCCAGCTGGGGGGAAGCGAAAA 

AGGGACAGGACCTTCANAGACAGCCCACTGAGGGCTGAGTGTGCATTGCTGGGCTTCTGG 

AATCTGTGGTGAGGGCCGGTCCCCCAACTAATTGAGCAAGAAATAGCCCCTCAGAAACCC 

ACTTGAGGCCAGGCCGCGGTGGCTCACGCCTGTAATCCCAGCACCTTGGGGAGGCTGAGG 

TGGGCNGATCACCTNAGGTCGGGNGTTTNAGACCANCCTTGACCAACATGGGAAGAAACC 

0 

Sequence 3074 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGTTTCCATTGC 

AGCGAATAGACTAATATTCCTTCTGCTGGCAACCTAGACAAGTTCCCTAAGCTTATGGGT 

AAAAATGGAGTCAAAGAGMGTGCCAGTATCCMTCTTCAAATATTTCGCTGAGGATCAC 

ATGGAATTTAAAAAGAAACGCTCCGAACAGGAAAAACAATTGGCCAAGCCTTGGGAAGGA 

AGACAAGAGGGAGGCATCCAGAGACTCTGGGCTGCACTGATGTAGGGCCTTTCGAGGTTG 

AACAAGGTGAAGGGGGACCCTCACATGGGAACAGCGTGATGGAGGCAGTCTGGCTTATGA 

GCTGGGGAGTGMTTGTGTAAACATTTATTGAAGTCAGTTTGGAAGTTGGCAAGCCACAA 

AAAGCCACTATCTGAAGAGCTTGTTGAAATTTAGCCCT 

Sequence 3075 

CTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTAGGTGGTTCGCCTGTCA 

CTGACACCGNCATCTCAGMTGCCAMCTTTTGGAGGAAGGTTTGAGTCTTCCAGGCTGN 

CTTCTTCCATTCTTCTGTAGCTTCGTTACCCTCCGTGGTATATTTATAATCTCACAAAAG 

TGCAGTTTACANTNAATTMTNCTTTTAGGGCAAGCNGGGAAACATCCTCCACTCAAGGG 

TTNCGCCNTTTCCACCTACNGGCGTNCCCCCGCGGTACCTTGCCCCGGGGCGGNCCCGCT 

TCTTAGAAACTMGTTGGMTCCCCCGGGCCTGCAAGGAMTTTCGGATATCAAAAGCTT 

TNTTCC AATACN CCGTNCCAAKCTTNTAAAGGG 

Sequence 3076 

AGGTACAAGT TTCA GCCACTGGTGGTTTGCCAAACTGGCTGATTACCCATAAACACACAA 

CATGGTTCTGTTTTGCTCCTAGCAATCCTTAAACCATTGCATGTCTAACATTCTCAGACT 

AATTCACAAATCAG AAACATCT GATGCCC AACCCATACCAAATGCATTTCAGGGCAAAAA 

TTATGCTCCTATMTGTCTGGCTTCCTGTCTTCAAGGCACTTTGCTTACATGACTGAATC 

CTCACTGTGAGGTGTAGCTATCCATCATTTTTGAGCAAGAAAGCCAAGATGGATAAATAA 

GTGACTTGCCCAAGACAACTAGAGTGGTTGGCACCATGACGGGATTG 

Sequence 3077 

CGGNCGCCCGGGCAGGTACCTTCTCAGCCAGCAGTTAGCCATTGTGGAGGTGCCAGGGGA 

GTTAATGCCCTTAGGAGCAACTCTTGGTCCTTGGAGGCAGGAGTCAGCACCCACTGAACG 

TCAGCATACATTTCCCACCTTCCAGCCCTTCCAAGGGATGATCCTCAGAAGTGTGTTCTG 

CTGGGTTCCTCCAAAGCTCTCAGTGAGACCCAGCTCCAGTTGCCCATAGTGGCAACCCAG 

CTCAACCATGCCCTCCTAACACCTTTTCTTTCTTCCCTGACCTTTCTGCCTGCACTCAAG 
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TCTTTTCTCAGTTTGCAAAACCAGGATACACMTATTTAGTATCTCCCAAACGTAAA 
Sequence 3078 

GGAGGTGCCAGGGGAGTTAATGCCCTTAGGAGCAACTCTTGGTCCTTGGAGGCAGGAGTC 

AGCACCCACTGAACGTCAGCATACATTTCCCACCTTNCAGCCCTTCCAAGGGATGATCCT 

CAGAAGTGTGTTCTGCTGGGTTCCTCCNAAGCTCTCAGTGAGACCCAGCTCCAGTTGCCC 

ATAGTGGCAACCCAGCTCMCCATGCCCTCCTMCACCTrTTCTTTCTTCCCTGACCTTT 

CTGCCTGCACTCAAGTCTTTTCTCAGTTTGCAAAACCAGGATNCACAATATTTAGTATCT 

CCCAAACNTAAAATAAGTGGAGTGCNACAATTCAAAAGAAAATTCAGGATGCATAGTAGN 

GNTTTAGAGAACAACATG 

Sequence 3079 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACACACTCTCTCCCCATATCTCATGGCTCTGT 

GCATAAATGGAGTGGGCACAGGGGTGGGGCATGTGCCCTGATGGCTCTTTGCAAGACTGT 

CACTGTTCTGTCTTGGGAGGTATAAGGTGAACAGGACCTTAGAAATGCTGGGAGCAAGCT 

CTAGTAACAAAGGCTCTTTGTGGTCAGGACAACAGAGGGTCTATTGCCATCTTCTCAGAA 

CGCAGGAGCTCCCGCCAACAGGTGGGGAAGAAAGAGGAGCTGATGTCAGCGGTGCTAAGC 

TGTCATCCCAGGGCAAATGTGAGGCTGAACATGGTGAAGTCTACAGCGAACCACCCGGGC 

TAGCTGCCCTATGGTACCT 

Sequence 3080 

GGAGAGAGAGAGATGTTTTATTTTTGCCAAAGCATGTGAGAGTAAGTAGCAAACATCATG 

ACACTTCACCCCTCAMTATTTTCCCACGTATCACTTAAGAACAAGGGCATTCATTCTGT 

TACATGACCACAATACAATCAGCACAGATAGATATTAACATTGAGACAATGCTCTGATGT 

ATTGGTATCTACTATGC AG NTC ATAATCAAATAAGGGTTTTGGTAACTTG GG ATCCATAA 

ATGGTCTGCAAGGTCAGCCCCTGTAGACCTGCACTTAGTGGGTGCCATTTCCCTGTCTCA 

GGTGCCTGCTCTACTG 

Sequence 3081 

CGAGGTACGCCGGGTGTGCAGNCACAGTGAGCTTCTGAACATTTCTTCTCAGACTAAGCT 

CTTACACACAGTTGCAGTTGAAAGAAAGAATTGCTTGACATGGCCACAGGAGCAGGCAGC 

TTNCTGCAGACATGACAAGTCAACGCAMCTTCATGTTCACTGTGGGCAGACACATGTTT 

TGCAAAGGAGAACTTNAGGAGCCAAACAAAGCCACACTTCAATGTGGCTNTTGCCCCAAA 

TTTTACTNCATTTAGGGTNAAAATCTTTTCCCNTCCCTCCCCAAGGAAGAAAAAGGTGGG 

AANTAGGAGCATTGGAGGTTCCTTCACCATTGGGAAAACTTTGGAAGTTCAAGGGGAAAA 

ATTG GG AN GGGCCTCCACCCC AMTTTATTTTTGTGGCC ATTGGGG G GTTTTTAAAG NTT 

TTTTCCCTTNGNAAAATNTNAAAAGCTATNCCAGGGGTTTTGGGTCCTTTTTNGNGGGT^ 

TTNACCCC 

Sequence 3082 

GGNGGGGGGGGGGGGGGGGNGNGNGTAGANNGAGATATNACGCTATACNATCAnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnAGCGGCCGCCCGGGCAGGTACAGTGGCAAAGTTT 

CTACGGGCAACTCGTCCATGTTTGCCCAGCTGAAAACATAACTGCTTCTCCCTCAGCCTN 

ACCACCCTTCCCCTTGGGGAAGACCAACACCCCAATATTGCCTCTAGAAGACTNAGCCGG 

CCAAAATANCTACTTTTCCACCTCCNTCCCAGCAACCCTTAGCTCCCGCTAAAAGNCTAC 

AACCCACNCGAAGTNGGGCNAGATCACCTTGAGGTACCTCGGCCGCTCTATTAACTAGGT 

GNGATTCCCCCGGGNCCTGCNAGGGAATTTCNATTATCAANGCNTTANNCGGATACCCGG 

TGCGAACCTTCCGTAGGGGGGGGGGGCNCCCGGGTNACCCCCAGACC I I M I NTGGTGTC 

CCCTTTTTANATTGGMGNGGGTTTTAMTTT^^ 

Sequence 3083 

AGGTACCCATGAACATAGTTTTTACTCTTTTTAAAAGCTGTAGGCACTGGAAGGTCATGT 
GTTCAGCCATGACAGTGG AAGGTCAGTGTTCAG C C ATGACAAGGCCCAATGTCATGGGC C 
CTCTGNCATNATCCTTTGTCNCGTAGTGAATCCCCCANMCACTTTTTGCATGGCTTTGT 
TTTAANMCAGAAAATTTTTTTCCTNGCAGGNTTCTTTTTTGGGGGAG 
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TGGAAAAAGGTTTTTAAACCCCCCCTTGGACAAGNTTn*GANTTGNAAAAGGAAAACNAA 
GCTNGGAGGGGNCNTTTAANNGNANGGGCCAAGGGGTTANNGGCCCCTTT 
Sequence 3084 

AG GTACAAAGAAAC AGCTTC AATCCTTTTGGGG AATGGC AGGGTTTTG C AGAATATGTGT 

TCCAAATTTGGGGTCATAGCAAAACCCCTTTATAAAGCAACAAGGGGGCCAAGAAAATGA 

GTCAATGGAGTGGGCCCTAGAAATGAAGGGGAAACTTTTGCCAAGTTAAAACAGGCTCTT 

AACAAAAGCTCCCACTCTTGGCATCCCAAACNTTCATTAMGCATTTTTCCTTGTATATA 

GCAGGNAAAAAAAGGGCATAACNTGTGGGGAATACCTAAACCCCAGGAAATTGGGGATCA 

AGAAACCCCATTACTCAGACCCCATTCTACCTTTTTTCAAAAAGGAAANATNTGTAAACC 

AGGAGGTTANGCNCTTNCACAAGGTGGACCCCAAAGGCTTGGCCCTTGGCAANGGCCAAA 

ATTAAGCCAAGGCCCCACCTNGGTTTA A 1 M ill 1 NAGGTTGG 

Sequence 3085 

CCGGGCAGGTACCCGGGCGTGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAG 

TTAGGCTACTCGAGGGTCTGGGACCCACTTGAGGAGGCAGTCTGTCCGTTCTCAGATNTC 

CANCTGCNTTGCTGGGATGCTGCGAANAACCAACTACNTNTTNNCCCCCCGCGTACCTTN 

NGGCNCGCTTCTTAGGAACTAAGNTGGGATCTCCCTCGGANCTTGTCAGGGAATTTCTAT 

TAT TCAA GCCTTATTCGGATACNCGATCANACCTCGAGGNGGGGNGCCTNGGNTACCCAG 

ACATTTTGNTTCCCCTTTTMGTTGAAGGNGTTAATTGGCGCCGCCTTGGCCGTAAAANC 

ATTGGGTCAATAANCTTGNTTTTCCTTGTNGTNGAAAMTTTGGTTAATTCCCGC 

Sequence 3086 

CCGGGCAGGTACTTGTGTGGGTGTCTTACCTTGCATTAGAATATCCTAGGCCTTTCTAAT 

TCTTCCTCTTGAATTCCCTGTGTCACTCATCTGAGGTCATCTCTTGGATTTAAATACATT 

TGGTCTGAAATTCTCAAACTGCTGAATAAAATGCCATAGTAAGTCCAACTATATGGTTTC 

ATGTAGGGTAAAGTAACCATTGTGGAAGGATTACGATAAAACTGTGTGACAATAAATAGA 

ATAAAAG ATGG C AAAAC AATCTGTAAATT ACAAAGTGTTATAAAAATG C ATTG AATTTCT 

AGTGTTCTTACACTTTCATATTTCTCATTGGCCCATCAAATCAAMCCTTTGCCTTCCTA 

AANTGGGANGAAAAAAAMGGCTGCCCCTGTCTTNGGCTGMNGGTTTTGGCCAAAACAC 

TTTNCCAAAACTTTTTAA 

Sequence 3087 

CCGCGGTGGCGGCCGAGGTACGCGGGATTGCTATGTTGGATGATTTGGAGCATGCAGAGG 

CAGATACAGTAGTAGAATCAGCTTTCTGCCTCCATCTTTTACTCCGCTCCCTGTCGCTCT 

TTAACTCAGCAGTTTTTCTCTCTCAGTCTCCCAACTTGTGTGAGTGTTTTATATTCTACA 

TAAATTGAGGGTGGAGGGCAACTTCTGGTTGTGTTGAATGAGGAGATTGGCAGAATTGAC 

AAAAGCAAAATGAAACAATCTCAGCATTCTGGAAAAATTGAGCAGGGGCATACAACAATC 

TGAGAAGTATTTATGCCTGTAACTGCTGAATTTCAGGTGAGAACAGTGAGACTGTATGGC 

ATTTGGCCTGGAGCTATTCCTATGCCCTCTCCCATCTGTTT 

Sequence 3088 

CGGCCGCCCGGGCAGGTACTGGTTTGACAACCACCTGTTTCCAGCTCCACAAGCTAGCAT 

GTAATTCCCTTGGTGAGAGTAATTCGTTCAAGGATGMCTATNCAACTCMTTTTCATAG 

AGTTGTTGGAATAMCTGGCAAMGGCTATCTTTTGCAATNACCACTTGTTTGCTTGTAA 

AGAAATGGATAGTCAAGCCCTNGGAACCTGNNTGGGAAGGNTCACCCATTATTTGNGGNT 

ATAATAGTTGNNCTTGGCAMTTMTAGGTTTTGTAAAAAAAGTAAGGGANCCCCNTNTA 

GGACCCCMGCCNTTGCAAAATNTTTNGGGGNNCCCACCCCATTTTTGGNAGGCAATANC 

C AATTTTTTG NTAAAG ACTCC ATG NGG NTGC AAAACCTTCC ACC N AGGMTT7TTTTTTC 

CNTCCAAAATTTCMTTTTTTGGMGNGNAAAAAATTTANMGGTTT^ 

TTTAAGCCCCCNAAAAGGNCTAA 1 1 I 1 1 I 1 I I GGGGAATNTNGGCCTTGGCCCCCNCCTT 

TTGGCN AAATTTATTTAAAAAN GAAGGTTCC CTTG GAAAATGTAAAATGG G CCACCTTTA 

A 

Sequence 3089 
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TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCTGCATTAGGG 

ACAAGAAATCTCTATGTTGAACAAATACTGCAGGTGGTGCTGATGCTAAACTTTTACAAA 

CATTGGTTATACAATTMTCAACAGTTATrrGATGATTCAGCACACAMTMCTCTTTGC 

AACCAGGT7TACGTTACTGGGMCAMCTMCTAGGTTTCCCCAGGTGTGGAGAGCTTTC 

ACACAATATACCCTMCTTACATAAACAAMTTACCCAGGTAAATATTTACTAAACAGTT 

TTAACATAAATGATAGTGATACACAAATTTGAGTAATAATTCAAGA^AATAAGTNGCACA 

ATTAG G CTGTTAACTGCTG GTTATCTGTTTAATGTG AAATACC C ACTAGCTTNCCTTC AC 

ecu 

Sequence 3090 

AGGTACTTCTGTCGTCANGATGCAGGTGGTTATGAACTTGGATGCAGTCAGTAGCCAGAG 

ACTTGGCCATCTACACTCTTCAGCATGGGTCCCCCTGACCCTAAAAATCATACAGTCCCT 

GGGCCAAAATGGCCTCATCCAATCCAATGTCCTGTCCAGGGCTTGATCTCTTCTATTGCA 

GTTCTAATGGTTGTCCAGCATCTTCTTGAGCGTCAACATCACTTGTGGATCCTGGACGCG 

TACCTGCCCG 

Sequence 3091 

CCGCGGTGGCGGCCGCCCGGGCATGGTACTCCTACAAACTTAACAACTTTAGCTTTTCGG 

GGMCAGTGGTGGTCTCCACAGATAGCTAGAATCTTCTAAGATGAAAATAAACTGTTTCA 

TCCAG CTGCTTTCTGCCTGTTAACCTTTG CCACTTCC ATTCCTTC AAGTG GTG NAAATCC 

CCAGGGTTGTCTTGTATTACACACATCCTTGTTAAGTAAATATACCACTGTTGATTTCTT 

CTTCCAGTGCAACATTTGGATTAAATAAAAATCTTTAGTTTTCAAAAANAACAACACTNA 

ANAAAAANGTTCCTNGGC 

Sequence 3092 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTI 

rTlTTTTTTTGTTTTTATTTI I AGTAGAGATGGGGTTTTGCCATATTGGCCAGGCTGGTC 

TCGAACCTAAAATGTGMCTATAAAATMGACTAGACAGCTTTACTGATTTTTGTGGTTT 

ATCATATGAAMCAGAATTTTTGAAAAGCNGGGAAAAMTCAAACCAGTTCATT 

TAAAATTACTTTGGGAGGCCNAGGCGGGAGGATCACCTGAGGNCGGGAGTTTGAGACCAG. 

CCCGACCAACATGGANAAACCCCCGCCTCTACTAGAAATGCAAAAATTAGCTGGGCATGG 

TGGCGC 

Sequence 3093 

CGNGGTGGCGGCCGAGGTACCTNANTTGGAAATGCAGAAATNACCTGCCTACTGNNTTGA 

TCTCGCTGGGAGCTGCAGACCGGAGCTGTTCCTATTCAGCCATCTTGCCAGCAGGTATTT 

C7TTATAG CAAC NCAAGAAC AGCCTAATACAATTTC CAGTATN AACGGGCTGTTACAAGT 

TCCCCCTACCCCTTCCCAGTGGGATGGAGACAGTGGAGTTGNCTAAACTTTCAACATCGT 

CCAGAAGTAAAGAGGCCACTCGCTCCCTCTTNCCTCTGGTGTCAATGGAGGCCATGTGGA 

GAACANNTAATGAGACACTCCTATTCTTTCCAACTAGGATG 

Sequence 3094 

CCGGGCAGGTACTCACAGCTGTATCACACTGAATATCATCTCCCTGCACTGCCATTTCAT - 

TTGAGCAGGGCTTTGCCTGTCMGAACATTCATATCTTTTGATAAACAGTGCAI I II I IC 

CCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGAC 

CTCCTGCTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGTCCTT 

CTCAGTTTGCCTACGATCCATTGAGGCAMGGTTTCCTTCTCTCAATACACCCTCTGCTT 

TAGCCAGGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGTTCCTGACTGTTAAGT 

ACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTG 

Sequence 3095 

ACGCGCACCGGCGTGAGCTGGCAGTGGCAACGCAGGCGGCGGTGGCCGCTGCGCTGCCGC 
TGGAACAGCTTAGTTCTTGACGTGCAGCCAGCGCGCGTGCGCGGCGCGGGCCAGGTAGTC 
TAGGATGAAACCGAGTATGCCGATCAAGCACCGANNGGCGGGCCATNANTTCCGAATACG 
CGAGGCGGTCGCGCGTGTCGAGGATGAAGTAGCCGAGGCCGGCCGAGACGCCCAGCATTT 
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CCGCCGGTACCAGCACGATCCAGATGATGCCGATGGCCAGGCGCACGCCCGTNAGAATAT 

NGGCCGCTCTANAACTAGTGGAT 

Sequence 3096 

CCGCGGTGGCGGCCGAGGTACTCACAGCTGCATCACACTGAATATCATCTCCCTGCACTG 
C CATTTCA TTTGAG CAGGGCTTTGCCTGTCAAGAACA7TC ATATC TT7TGATAAACAGTG 
CAi i ; u ICCCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGGACAGGG 
ACTTCAGGACCTCCTGCTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGT 
TATGGTCCTTCTCAGTTTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTCTCAATACA 
CCCTCTGCTTTAGCCAGGGGGACCTCCCACTGCCCTCTCTATGTGTTCTATTTGTTCCTG 
ACTGTTAGTACCTGCCCG 
Sequence 3097 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGGACCCCGGGNAGTTGGCT 

AAAACNCTAAAAACTAAATGTG TAAGATTTATGC ATTTTAGTGTATTTAAATTATAACC C 

CCAGAAAGCATTGCCTCATTTTTTNTGGAGGGTAAATATGCTCGTTAGGAGCATAAGGGG 

ATGATTTTGAGAAACCCTTGCTCTGTCATTCTGTCAGNAAACCGCCNCTMCTTTTCTGG 

TTTCAAAAMGCMGGTCCATTGATGAMCTCCTTTGTGCCTTAGTGACAAATAACATTG 

ATAAGGCCTACT GTCMC AAGAGTGTGTTTGNTATATCTTTTGTTTTTAGTGACTAMTT 

TGTAGGMTGATATTTTTATGGGATTTACTCAATCTCACCCA 

Sequence 3098 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGCTGTTTCTCAG 

AGGAAGCAATGGAGGCTTGCTGGGATAAAGGCATTTACTGAGAGGCTGTTACCTAGTGAG 

AGTGATGAATTMTTAAMTAGTCGAATCCCmCTGACTGTCTCTGAAAGCTTCCGCTT 

TTATCTTTGGAAGNAGCMGMNTTGGNCACTCCMGGGACAmATrTMTTAAAMG^ 

AACAACTGTCCANGTGCAATGAAGGCAAAGTCATAGGTCTCCCAAGTCTTACCCCATTCC 

TGTGGAAATATCMGTTCTNGGGCTTTTCTCTGTCATGGTAGCCTCAACTTTTCTCTGAC 

CGGGTGCATTTTCTTTTCTCTGGTTTCTAAAANTTC 

CCACTTTACTTTMTTATCTGGTTAAMTTTTTGGGGACCCCMCAAAAAG TC 1 MIMA 

GCCACCTNGTNGGGNGGNNAAAAAAAGAAAAA 

Sequence 3099 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGCTTTTC 

CTGTAGTTGAAAATCTACCATCAAACATGCTACTACCTAATGACGAATTCAGAGAAGGTA 

AAGAAAGTATGTAAACCTTTCCCTCACTGGTGGCCCTTATTAGCTTAAAAATGTAGTCTT 

CATATAATATAG NC AAGGACATTN ATGGC AAGTCTAACA 1 I I M I ACGACTAAAGAAAAA 

AGTGTAATTACTATATTMTANGAAAGACTTAATTATCTTTCTATTCCAGCCACTGAAGT 

TATTATGACAGAATTGTTATACAAAGGTNACATAATAACATGTAGCCAAAATATATAGGA 

GMGATATTATAGAAATATATTGGACMTTAGTTMTAMTGTTATTTTTCAAAATGNTT 

ATGAACATTTTGTGAC ATTTGGN CAGC 

Sequence 3100 

CCGCGGTGGCGGCCGAGGTACTTCTGTCGTCAGATGCAGGTGGTTATGAACTTGGATGCA 

GTCAGTAGCCAGAGACTTGGCCATCTACACTCTTCAGCATGGGTCCCCCTGACCCTAAAA 

ATCATACAGTCCCTGGGCCAAAATGGCCTCATCCAATTCAANGTCCTGNACANGGGCTTT 

GANCTCTTTCTMTTGCAGTTCTAATGGGTTGTCCAGCATTCTTCTTGAGCGTCAACATN 

NCTTGATGGATCCTGGACNCGTACCTGCCCG 

Sequence 3101 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCGCAGGTACATACAGCTT 

CATATCTGCTMTGGAGAAMTAACTTAAACTTGGCCCTGCCAGGCMCCCATTTTCTTC 

CAGTTTAAAACTTAGTCTTTTTCAGGTTAGCTAGGCCAATTCCAGGTGCTCGTCCTACCT 

TCTTTCTCCCATTTCTACACCACCCTTCTTTTTTGTCTCTCCCTGGGAGATATCTGAGCT 

TAAGGAAAAATTCGTTAAGAGTGGGGATCGGAGAGACCTCTGTTCCACTAGTTGCCTTCT 
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GGACATTTTCTGGTCTCCACTGTTGATGTATCCATCCCTTCTGGCCTTTGGGTGCACTGA 
GGCATCTTGCTATTG NCTATGGTTATGCC ATAGCTCCTAG NC AAAAGGCTCAC ATTCTTT 
TTTA 

Sequence 3102 

AGGTACCCTTAAAGAGCCTTTTCAGAGATTTCGTTTT^ 

CCATAACATTTGAACGCTGGTGATGTGCTCTGTAGCAGTGCTGTGTGTTGTGGACCCCTG 
G AAGCCTAC G CACCATTG AGTATTTG ATA1 fTTTTI I GCCCCATCCTCAAGGAGCAGTTG 
TTGCTCCTTAGGGGTAAGTCGCCCCTGCTGAGAATGCATGCTCTCGATAATGCTTGCTTC 
ACATTGGTCATTGTGGAATA I I I I I ATAAAAGAGAGGATCCATTTCCAGCTTCTTTTCTG 
CAGGAAGAGAACCTGTGGTTATGTCTTACTCCCCCATTTTGT 
Sequence 3103 

AGGTACGTGTGCGCATGCATAGTCTTACCCCACTACACCACTGGGCACCTAGCCCAGACA 

TTGGGCTAGATGCTTACATATATTATTTCCTCAGAGGACCTAGATTAATATACATATTTC 

CAGGAATTATATCCATTTATTTA I (Mil I CTAGGGTGACCTTA I Mil I ACCCTGTAGC 

CAGTGAAAG GTAAGC CTGTTTTTCTTGACTTAGTGCTGTAAACAGCC AAATG ACAAAGTG 

ACCGCCTTGCTGGCACAGCAGTGCTTGCCCACTGTGACATTTCCCCTCAGCTTCTTGATT 

TTTCJGCCCCAAAATGCAAATACTACAGGACTCCCATTAACTGGTTGAGA 

Sequence 3104 

CCGGGCAGGTACCCTGGCCTAGATGCCAGTAGCATATTCCCCAGTTGTCAAAACCAAATA 

GTTCTCCCAGCAATGCAAATTATCACCTCATGTGTGCGTGGTATTGGGGGTGGAGGAACA 

TTGTCACAGTTGGGGACTACTGGGCCGGAGGTCTGTCTGTGCAAAAGGGGAATCTCAACC 

ATCACTCTGTAGTGACAAMTGAAAATAAMGAAAMGCTATTTTTMCATTTCATATTC 

TCTCTCTGCTTTTCACATTTTGCTTTCTACTTGCCAAACTTTCCCCATTATCAGAGCTCC 

TTGTGTTTCCTAGCCACCT 

Sequence 3105 

CCGGGCAGGTACCTGGGACACCAGGCAGTAGCCCTGGAGTAAGATCAAGTGGGAGACACA 

TATTCCTCCCCTAATGGGACTGCAGTGTGCATTCTGCCTGTCAATCCATTCCTAGCATTT 

GTGGGCAAAGACCCAACATCCCCCTTTAAACACAGATTCCAGGGTCAGGCTTGGAAGGGC 

TCAGAACTAGGAAGTCACGCTTAAAACCTTTAAGTTGGNCTCTTAAATTCMCAAAAA^ 

CACAAACAACCTTATTTTACCCACTCTNCAANTAAAAAAACAAACAAACCCTGTATANGT 

TTGGAAAATAATAAAATNTAATTGCATTCATATTCGA 

Sequence 3106 

CCGGGCAGGTACACTTCCCCCTCCTCCTCCCCCGACCATGCCTGAATTCAGACCTCTAGC 

CCTTGGATCTGTTATOTGACCCAAGAAGTCTTGTCAATTTTAGAAGGCTATCACCTTTTA 

ACAGCCATTTCTGC1GCTTCCTGCACACTTTCTMTTTCTCCTCTGCTTTCCCTGGAAAT 

CTGATGCTTGACTTGGACACAATTCAACCCTGTCTCCTTATAGAGACAGCTGCTGAATGA 

ATGCTAGAAATTTA/ \ \TTCTGCTTTTAGCTTAGGGTTCTCTGCCTTCTTTAAAAAAGTAT 

TTCTAGAAAACATATACAATATATGCTGCTGCAGATTCACC I I I I I I 1 

Sequence 3107 

AAGTCATTCTCAGGAAAAGGTGAGTGTGGGACCAGGCAGGGAAGCCAGTTTCTGTCCTTT 

ATTTGGAACTATAATCAAGGCAGACAACCACAGTAAAGAGATGCAGAGGCAGGGAGTAGT 

GTAAAACCACCrrACACACAGCTTCCTGGCTTTCCCTTCACTTTAGGTTGTAGCTCAAGT 

TATTCTTGGGCTAAAAAATAAGCTCTGTTGGCCCAATTAAACCAGAATTAATCACCAATT 

CCCCATCTTTATCTGGTAATTTTAGTTCAACATTGCTCAGCTGATGGCTCATTAAGCCAA 

GAACTCTCCCAGGTCTCAGTTACCATTTTCAGACAA 

Sequence 3108 

CCGGGCAGGTACTCACAGCTGCATCACACTGAATATCATCTCCCTGCACTGCCATTTCAT 
TTG AGCAGGG CTTTGCCTGTCAAG AACATTCATATCTTTTGATAAAC AGTGTA I I l 1 1 IC 
CCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGAC 
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CTCCTGCTGTGGTCACCAAAGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGTCCTT 
CTCAGTTTGC CTAC GATCCATTGAGGCAAAGGTTTCCTTCTCTC AATAC ACCCTCTGCTT 
TAGCCAGGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGT 
Sequence 3109 

ACAGAGTCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCATGATNAATGAGTGAATGAC 

TTGAGTCAGTGAACGTGAAAATGTTTGGTANATTGTAATACNCCGATATTGTGTAATACG 

TAAAATACTATTAGGAACTACAGCAATGTCTTATATTCCCAGCTAGGCCATGCACAGGAG 

GCAGTGTGCCACACCACACTGAAAAGGNGGCCAGCCCTCACTCANAGCCTGCCTTCACCA 

CCTGCCCG 

Sequence 3110 

CCGGGCAGGTACCCAATCCAGAGGCACCATTTGCAATCTAAGAAGAGCCCAGAGGAGAGG 

AAAATTGGTAATGAATCAGTGGGGAGTCCACGAAGGCAGAGGGGCTTTGGCACAGCCCAA 

AAGACAGACAGAAATGAAGCTCTCAGGACTCAAGAGGAGCCTCATTCCAGGCAGAAGAAG 

GATCATGGAAACCACACAAGCGGGTCCT 

Sequence 3111 

AGGTACGTGTGCGCATGCATAGTCTTACCCCACTACACCACTGGGCACCTAGCCCAGACA 

TTGGGCTAGATGCTTACATATATTATTTCCTCAGAGGACCTAGATTAATATACATATTTC 

CAGGAATTATATCCATTTATTTA'I I I I I I I CTAGGGGNGACCCTTA I Mil RACCCTGN 

AGCCAGTGAMGGTMGCCTGTTmCTTGACTTAGTGCTGNAAACAGCCAAATGACAAA 

GTGACC GCCTTGCTGGCACAGCAGNGCTTGCCCACTGNGACATTTNCCCTCAGCTTNTTG 

ATTTTTCTGCCCCAAAGATGCAAATACTACAGGGACTCCCATTAACTGGTTGAGAGGCTA 

ACGCGCAACTGCNAACTTCACCAGGTCTTTCCTGTGCTGNTTCTCTTTTCTTTCCCTACC 

T 

Sequence 3112 

CCGCGGTGGCCGAGCGGNCGCCCGGGCAAGGTACCCTCTCTGTTCCCCTCCCTNCACCCT 
TCCCCTCCTTTTCTGTTTTT^ 

ATTTGAAAGAAGTCAGTTTTTGTTAGAGGCCTGGATATGANCNAATNTGTTTNTNAACCN 
AAATNACNACNTGMCCANTTCTGNGGGCCTGMCATANANAAAATTTGTTGCATTTATT 
TAACG 

Sequence 3113 

CCGGGCAGGTACTCAATAAAATAAGAGATACTATAGTAAAACTCTTACAGGAAAAATTAT 

TAGAATTTGTGCTATACCAAATCMTTTCACAAAAATGAACTGCCTGTCCACTGTTTTTC 

CACAAAAAATCCAGGCCATCTACACCTTGCTGGAAGGATGGAACCTCAACTGCTATACTC 

CTTAGATTGCTATACAGCAATCTAAGTGGCCCCATCACAGCTCCTCTCCTTGAACATTTG 

TTGAGATCAAGTTGCAGCTGGGCCAGGCACAATGGGTCACAGTTAGACAGCTGAGGGCAT 

TCCAAAGAGTAGTGTTCCTTCCTCCACATTATCTGCCAAAGTAGTC 

Sequence 31 14 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGACTGT 

AATGAGCTATGATTGCACCACTGCACTCCAGCCTAGGCGACAGAGCTAGATTCCATTTCA 

AAAAAAAAGGTGATGATGGTGGGGCCCGGCATGGTGGTTTACACCTGTAATCCCAGCACT 

TTGGGAGTGCTGAGGTGGGAGAATTGCTTGATCCCAGGAGTTCGAGACCAGCCTGGGCAA 

CATTGAMGACCCCGTCTCTTAAAAAAAAAAAAAAAGAGAGGTAATGGTGGCTCCCTTTC 

TCTCATCAACACACTCTGGGGGATGAAAAGGATGATAATGTCTATGTGTGTGATAGATTG 

TAGAACGACTGCCANTGTGTTGAGCGMGAAAAGGAGTATGTCTGCTGAACAGATTAGCT 

TCTGGTCTCCAATCAGTATGTGTGGTGGCGTAGGGGGACT 

Sequence 3115 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGATCGAACACCTCTGATAGTCCGAGTATGG 

TTACCTCAATAAGTAACCATTCCTCTTGCTCCAAGCCTGATGGGMGGTAAACATAAGAA 

TAGAAATTTCTTGAGCTGAAGTGAAGTGAAGGAACTCCCTCCCGAAAACCATGGCATGTC 
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ACGAGGAACGTTTATTGTANGTTAGGCGGTCTGGTGCCTCTAGGGGGGACAATCAAATGA 
TACAGGACCTCGMGAGTCTTACTTTGGATTTCTGTGGCTTTATAATGTAGTATATTATG 
TAGTTCATTGCTGTGGCTTTATGC \ I I 11 1 ATTATTTACAAGTGATTTACTGATTTGTG A 
G CTCTTG AAACTGTGTCATTGCATGTTATGTTTACCCTGATAGTtGCATTTGCTTATAAT 
TC I I I I 1 GCTT 
Sequence 3116 

CCGCGGTGGCGGCCGAGGTAC 1 1 I M I t I I I I ) I ) M I I I I I M I I I GNTTTGAGATGGA 

GTCTCGCTCTGTTGCCCAGGGGAGCGCAGTGGCGCGATCTCGGCTCACTGCAACCTCCAC 

CTCCTGGGTTCAAACAATCCTCCTTATTTTTAAAAAGAGAAAAACNACCTCAAAAAAGTA 

AAGGGGATTTGCTCATAGAACACATGGAAAACCAGAAACCAGGAAGAAAATGAACATACA 

G TTTTC AAATCATGTG TTTTCATACAMGIATCGTATTTAAAACTAAAAACAATTCTGAA 

ATTAGTAATAAACCCATTTTCACAAATAAGGATCTAAAGCTTAAAGAACCTGCAGGCATA 

CAATTAGTCAACAGCACAGGCAGAACAGAAACTNTGAGTCTATGTGATTCTA 

Sequence 3117 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGCTCTTGAG 

TCGGCAGATGTTGTAAGGGACTGTGTAGATCAACCTTTAGGACAGGAGGTAGCACCTAAA 

AGTGAGGACCAGCTGTGGTGGTGGCAATAGAGTTTATGCTTGACCTTTGTTAATCGGGAG 

AAGTTCTTGGGCATTTCAGATGATGGGTAGGGCCATGGAACTCTCAGTAGTCCTGGTCCC 

ATGATCTGCCTCTGAAACAGGAGGGGTGGGATGTGGTAGTGGGATCCACTTTGTTCTCGT 

GCTCCTGACCCAGTGAGGCTCTCTCCTGCAGTCCAACACCGGCAGCAAGTCACAAGAAGT 

TTGTCTTNANGACCATGAAACTGCCTCAAGCTTGTAAAGCTTTCCTGCCTAGGTC 

Sequence 3118 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAACAATTT 

GAGTAACGTCACTTGGCAGAAGGAGACATTTTTGAGTCATGCCATAGATTTGAGCCCTGC 

TCCATCATTTACTTGCTGTGGGACTTTGGACCTGCATCCACCCTCTCAGAACTTGTGTCT 

TCATTGTGTATCACCTATTTTACAGATTGCTGTGAGATTAAACAAGGTAAGATACAAAAG 

GGTCCTAATACACTGACTGTTAAAACTGAGCAGTGACTTAGAATAATCTCTTGAGCAGTC 

CAGGGCACATATTGACCTCTTAAGTCTCCTTTAGAGTTATAATATTAGCATAATAACACT 

GAGCTTTMGAAAAGTCTTTAAGCCAGTGATTCCTGTGTATATACCCTCCAGAGGGATTC 

CAAATCCCCAGGAACTGAAT 

Sequence 3119 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTCCCACCAGATACTCACTTGGTGAA 

TTGAAAGAGGAGCTGTAGTGAAMTACTGTCTTAGGAAAAACACAGCTCTTGATTGAGGC 

TATCAGCAGGTTAATGAATMCTGTTTTCAGACAACA'fTTCACCCCTAGTTTGAGCTGCT 

CTTAGAATGAATGTGGCCAGGTTAATGTGACCAGCAGACATAATGCAAAACCAAACTAAG 

GTGGTAAATTGGCATCTTGGCTTTGTTGCACATGCAGGAAAGATCCAGGACTGGTGTCAG 

AGGTATACTGTCTTTGGCAGAAGAGTGCTTGTCCATrXACCAGAGTACCTGCCCG 

Sequence 3120 

GGCGANTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAACTGAAAAATAAGA 

CAACCAATTCCATCATGGTGCCMCCACTCTAGTTGTCTTGGGCMGTCACTTATTTATC 

CATCTTGGCTTTCTTGCTCAAAAATGATGGATAGCTACACCTCACAGTGAGGATTCAGGC 

ATGTAAGCAAAGTGCCTTGAAGACAGGAAGCCA3ACATTATAGGAGCATAATTTTTGCCC 

TGAAATGCATTTGGTATGGGTTGGGCATCANAToTTTCTGATTTGTGAATTANTCTGAGA 

ATGTTAGACATGCAATGGTTTAAGGATTGCTAGGAGCAAAACAGAACCATGTTGTGTGTT 

TATGGGTAATCAGCCAGTTTGGCAAACCACCAGTGGCTGAAACTTGT 

Sequence 3121 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAATTTAGGGAA 
CG AATC ATCCC ACTTTTTGTATTGAGAACAGATTTGGGG AAGTAGTG GCAG G AAAGTGTA 
GMGTTGGAAGATCCTTTAGTAGGAATATCCTCTTGCAATAATACGATGTTGATCCAGGA 
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TCTTAATTATGGMGCATTAAGTAGAACAGATTTGAAGATGCACTTAACACAATTTGCGG 

TGCACTCTGAAAAAAAAATGAAATGAGAGAAATTAAAATGACTCCACAGTTGCTGTCCTG 

AGTATCTAGMGAACGAAGTTGTAATTTACTTGAATGGAGAAGACAATTCGGGGAGCACG 

TCTGCATGAAAAATGAATCATCTGGTTTGAATAAGTTAGGTTGAAGAT 

Sequence 3122 

ATCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGGCGATACCGTAGGCGCCGTG 

TTGAGCGCCCATCCCGACATCGCAAAAGTCTCCTTCACCGGCTCAACGGCTGTCGGCAAA 

ATGATCCTGCGTACCGGCTCTGAGACCTTGAAGCGCGTCACTCTGGAACTGGGCGGCAAA 

TCACCGGTGCTGATTCTCGATGATGCGGATCTCAAGACCGCTGTGCCCATGGCTCTCCAG 

GCCGGCTTCATGAACAGTGGCCAGGCGTGCATTGCCGGCACCCGAATCCTGGTACCTGAA 

AATCGCCTTTATGAAGTTGAAGCCGCTGATGATCCAAGGAGGCGGCCAAAGTACAAGTCG 

GTAATCCCCGTGAGCCAGCCACCGCTGTCGGCCCCATGGTCAGTCATAAACAGTGGGAGC 

GGGTGCAGCGCTACATTCAGATCGGGATTGAG 

Sequence 3123 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATACAGCTTCATATCTG 

CTAATGGAGAAMTAACTTAMCTTGGCCCTGCCAGGCMCCCATTTTCTTCCAGTTTAA 

AACTTAGTCTTTTTCAGGTTAGCTAGGCCAATTCCAGGTGCCCGTCCTACCTTCTTTCTC 

CCATTTCTACACCACCCTTC! MM I GTCTCTCCCTGGGAGATATCTGAGCTTAAGGAAA 

AATTCGTTAAGAGTGGGGATCGGAGAGACCTCTGTTCCACTAGTTGCCTTCTGGACATTT 

TCTGGTCTCCACTGTTGATGTATCCATCCCTTCTGGCCTTTGGGTGCACTGAGGCATCTT 

GCTATTGTCTATGGTTATGCCATAGCTCCTAGTCAAAAGGCTCACATTCTTTTTTATTAT 

TATTTTTT 

Sequence 3124 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGATCTAACCAAGATATGTTGTTTTTCTCATC 

CACCAGTCACTTTCTCAGTCCTTTCTGTATCCCTTGCAATTTGAACAAAGCTTGGTGAAT 

AGTGTGCACACAAAAAGCACACTAGGTGAAAGACAGATACATAAAAAGGGTAAAGTCAGG 

ATATTTTAACAMCCTATCAAGCTCTAAATATAAGCCTCCTTGGTAGTTTTCTCTTTAAC 

CCTCTCTCCACTGTTGGATGAMTTTGCTGCATTCAATTCCAGTTCCCACCCCAACTTCC 

TTCTTAACACMGGTCAGGGTTAAGCCTTCGGTGCTTTAATCCAGAGGGAAAATTACTTA 

TTTTAAAAAGCAGTAAAAAACACCGCATTCCTTTGGCACAGGGATGAAGACC 

Sequence 3125 

AGGTACTCACAAGCTGCATCACACTGAATATCATCTCCCTGCACTGCCATTTCATTTGAG 

CAGGGCTTTGCCTGTCAAGMCATTCATATCTTTTGATAMCAGTGCATTTTTTCCCTGA 

TAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGACCTCCT 

GCTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGTCCTTCTCAG 

TTTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTCTCMTACACCCTCTGCTTTAGCC 

AGGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGTTCCTGACTGTTAGTACCTGC 

CCG 

Sequence 3126 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGTGAAAATTGAACCATGATCA 
GCCCGGATCTCTGACGGACTATGATCAGCACAATTTCCAGCCAATTCTTATAACTCTTAA 
GAGGGCAATMTCTTGAGACACAGATA7TTANCGCCTCTTCCACATCCAATGTTTTCTAC 
CATGAMTCATGTCTTTGTAAAATGATGTGAAAAACATTATMTACATTTTATATAAGTA 
GATTTTAAMCTGTGATGAATTTCTAGCACCAGTATGAAGACAGAATTAGAATGCTATTG 
ATAATTTCAMTAAAAAAAGAAGGCATGCTTGAATACTGAGMGAGATGTMTGCATTTT 
* CTAAACAAAGCACACAGTTAATTAG 
Sequence 3127 

AGAGAGATGTTTTATTTTTGCCAAAGCATGTGGGAGTAAGTAGCAAACATCATGACACTT 
CACTCCTCAMTATTTTCCCACGTATCACTTAANAACAAGGGCATTCATTCTGTTACATG 
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ACCACAATACAATCAGCACAGATAGATATTAACATTGAGACAATGCTCTGATGTATTGGT 

ATCTACTATGCAGTTCATAATCAAATMGGGTTTTGGTAACTTGGGATCCATAAATGGTC 

TGCAAGGTCAGCCCCTGTANACCTGCACTNAGTGGGNGCCATTTCCCTGTCTCAGGTGCC 

TGCTCTACTGGTGATATTTTTCACTGGCTCTGTTGCTCACTTANA A1 Mil 1 CTCCCAGC 

TGTTGACATTTTAT 

Sequence 3128 

GNCGAGGTCATGTGACCNAAGGACCGGCCCAAACGCAGATGCAGACAGATTCCAGACTCT 

TCAGCAATCCTTCCCTTATCTGGGGCAAAGGCTGTGATGCCTACACAGTCTCTGNCATTN 

CATCCCTGATGCTCCAGTTTTTAGGAGANACCACCTNGTCATGTTCAAACTTCCAGAAAT 

GGTGGTCCTGTGGAAAGAATCCTAGGCCAGCAGCTAGGAGACCTGCTGTGAGTCCCANCC 

CTGCCCATCACAGCTATGTCAGGTGGTTCANGACATGTTGGGCTTCAGGTCCCCATCTAT 

MTCAAAAAGTTTGTTTCTAGCTTCAACAATCTATGACTATTATT 

Sequence 3129 

CCGCGGTGGCGGCCGAGGTACATGTGACCCAAGGACCAGCCCAAACGCAGATGCAGACAG 

ATTCCAGACTCTTCAGCAATCCTTCCCTTATCTGGGGCAAAGGCTGTGATGCCTACACAG 

TCTCTGTCATTTCATCCCTGATGCTCCAG I I I I l AGGAGAAACCACCTCGTCATGTTCAA 

ACTTCCAGAAATGGTGGTCCTGTGGAAAGAATCCTAGGCCAGCAGCTAGGAGACCTGCTG 

TGAGTCCCAGCCCTGCCCATCACAGCTATGTCAGGTGGTTCAGACATGTTGGGCTTCAGG 

TCCCCATCTATAATCAAAAAGTTTGTTTCTAGCTTCAACAATCTATGACTATTATTTCCG 

TTTACTGATAGAGTAACTAATTACTTCATTATAGGTTTTCAAGACCAGTTCCTCCTGTTT 

GGTTCAATTAGCAAAATGCCMCATGATTGTCATTGATTTT 

Sequence 3130 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTGATCTTAAAACAG 

CTCTGGCTGTAAAATCACTTTGCTGGAGTCAATGAGGTAAGAGGCTAGAACACAATACTA 

GGGATCTGACCCCTGCAGGCTGGCCATGATCCATCCCTCAGGCCGNCTTCCCANAGCAGA 

AAAATCCAGNGAGCCACTTMTGAGACNTTTTCTTTAATGTTTTCTTTAACTN 

TGCAACTGAAATTTATCANGAGCTTGTACCCCTCCAAAANCCTNTCTCTTTCTTTCNCTN 

TGCCTCTNTCCCTCTTGCTGCTACTC I I I \ I ACTGCTTTTATGCACC AAACTTCATGGTT 

mrm iaatggmtgangtttcttggggaanaaaagattnnangattcatggtaacac 

CAATGAAAATTGT 
Sequence 3131 

TGGTNCTCCCCGCGGTGGCGGCCGAGGTACGTGGCGTAGCATTCCATGTTCAGCTTTGAC 

ATTTATTTTCTCATATCAACCCTTTTTACACGTGAAACACAATCTCGCTCTTGAA 

MCTGCACG/JCTGTGAAACCTGTATTTATATTTTCTTCAGTGTATTCTTGTCGTGTGTG 

TGTCCTAAACAAACCAAAAGAAMCTTTCCAAATCTAAAGTATTCATTCTCCAATTGGAG 

CAAGAGGAGTCAGTTAGATACTATCACGGCATTCATTTGTGGCTGGCTTGTCATATTTAC 

TTATGATTGATAATAAATCTATTTTGCTTTTTANAGCCTCCCCGAGAAGCCATCTGCCTT 

CGAAGCCCTGCCATNAGAAATGCAAAAGTCTGTATCCAAATAAAAGCCTTGGAAATTGA 

Sequence 3132 

CGCGGTGGCGGCCGGGGGCCATTGAGGACTGCCATGGAATGACTTGAAAGGTCACGTAAG 

CTGAGAC1TCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGA 

ACAATGA W AAGG AG AGAATN CTACTGGTCAC AG AC AAGACTCTCTTG AN CTGCAAAT ACG 

ACTTCATCATGCTGAGNTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCA 

TCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACATGGAAGACAAGGAGAAGG 

CCTNATGNATCTACTGGGGGAGTCCCAG 

Sequence 3133 

TGTCACCCAGGCTGGAGCAAGACTCCACTTGTGAAAATTTTCATCAGTTTCCTCCAGATG 
TAAAAGAATCTCTTCGGCACACTCCAGGGAAGGGGCACCCNGTGATTTTTCTCCTTCACG 
CTCTGGAACAACCGCCTCAGCATCGCCTGCAGCATCGCCTGGTCCTCGCTGGANAAATGG 
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GCCATNTTTGGGGAAGCAGGGGTCCGCGAGGTTTCGCCTNTGGTNCCTN 
Sequence 3134 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACATATATG 

GGACTTGTTAAACAGAAGGTTTCTTGGGGTGA 1 1 I I I I I" AAAAACCC AAAATTTTAAAC A 

TCAGGTATTTTATCTGGTTGGGGAATTAAGCCCAGGAAAGAAAAACCTCTAGAATATTCC 

TACCCATACAGCCTGATGCACACCCGTGGGCCGTCTTTAATACTACAACTTATTGAGTAA 

ATGACAATGCATAGCACTCTCAATGCCCTCGTGCCACCCTAATGATTGCTCTACCTTTCT 

GCTTTCAAAAGACAGTACCT 

Sequence 3135 

GAAATCCGGCGAATTGTTTTCCCCNCGGNNGCGGCCGTTCCTAGTCACATTTGTTGTTGC 

AGCATGTTATCAGNAGAGTTTTCTTTNACTCTTAAAAATGCCCTGGTNNGGATGATTGAT 

CATACCTGCCCCCATGCCCACCCNACCACCCAAAAAGTGCANCCACCCTGTGGTGNGCGG 

TGTACCTTCGGCNCGCTCTAGAACTANTA 

Sequence 3136 

TTAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCTGTGCGTA 

GCTGCATTAAACCAAATGATCAAGAATATCAAGATACATCTTCCTGGATCTTCAATCCCA 

CTTAMGATTTGGAGMCATATCATTTTATATTAAAAC^ 

TTGAATAAATTTC I I I I I I ACCTAAAAACMGAGTCTTTAMTAAGTATCATTTTTG AG A 

TGCATTGCCATAGGATACATACTGAGGTGTCTTCTCTTTCTCTTACCTATTAGAGGCATT 

TCTGGGCAAGTTTGAGGAGCATGGAATTMCAGTCATCAATGCATTGCTGGATTTGAAAA 

TGATTTATATCAATCAGCAGAGGACCTCCACTGGTGATGTCAGCCAGGGATCTGAGGGAT 

GTTCAGGTAGGACATGCTATGCTTTGAATCTCC 

Sequence 3137 

CCGCGGCCGCCCGGGCAGGTANTNGCTGTTTNGGGCGNTTATCNATAATANTTTCTACTN 

GTTACTGAGTGTCATCATCACCAATGTCATGATTGCCACGATTNATTGAGTGCCTACTAC 

ATGCAGGCCCCAGGTCAGTGCTTTACACTCTTAACTGGCATGTTGACAACAGCTCTGAAT 

GACAGGAGCTCTGTTCAAGTGTTTTACMGTGAGAAAACTAAAGACACTMTMTTTTTA 

AAAATATATCCACAGTTCTATGATTAGTAAGTAAGGGAGCTAGTCCT 

Sequence 3138 

AGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAG 

ATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTAT 

TTATAAACATCCTGCAGGAAAATGCAGTGAAGTAGAAAGAGACAGGGATATCCCAGAAGG 

TTATGCAAAACATCAAGAAAAGATGAGAGGTCAGAGATGGGAAGAAACAAGAACTTTGAC 

ATGCTTGGTGTTCTTGCCCMGCTTTGAAGAAGTTTACAAAGTCTATATGTCAGAATACA 

CATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGAC 

AAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGA 

Sequence 3139 

AGGTACCCTAATCCCACGGAACTTCCCTTGAACCAATACATTGCCACCTCCCCACTGAGA 

GATGCCAGACTCTTCTCCCAACTCTCCCAACCACCTTAACCAAGTCAGCCCCTCCCACAT 

AGTCCCTAGTTAAGATCAGMTGAGCTTTATCTGCTTTCTTAAACTTTGTTGTTCCTACC 

TCTGCAATCCCTCTTAGGCTATGGAACTACTGCGGGATGAATTCACCAAGCCGCAGCTGG 

GACTGCCTTTGTACCTGCCCG 

Sequence 3140 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGAACTATTGG 

AGCACCTAAGAGGAGCACCTACCTTGAATTTAGGGGTTAGCAGAGGCATCCTGAAAAAAG 

TCAAAGCTAAGCCACAATCTATAAGCAGTTTAGGAATTAGCAGAACGTGCGTGGTGAGGA 

GATGCCAAAGGCAAGAAAGAGAAAGAGTATTCCAAACAGGAGGGATTCCAAAGAGAGAAG 

AGTATCCCAAACAACATTTGCACAAACCTGATGAGGAGAGAGAATGTGGGGTGGGGATGG 

ATGATGAGACTGAAGAAGAAAGCCAGGTCTAGATAATCAGTGGCCTTGTACCTGCCCGGG 
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CGGCCGCTCTAGGAACTAGTGGATCCCCCGGGCTTGCAGGGAATTNGAATATCAAGCCT 
Sequence 3141 

ATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCGCACACCA 
CAGGGTGGCTGCACTT I l I GGGTGGTGGGGTGGGGATGGGGGCAGGTATGATCAATCATC 
CAAACCAGGGCATTTTTAAGAGTNAAAGGAGGCTCTACTGATAACATGCTGCAACAACAG 
ATGTGACTAGGANCGGGCCGCTCTAAA 
Sequence 3142 

CCCGGTGGCGACCTCTGTCTAAANTTCTGAATCCCCTCCNACAAAGACCCATTACAGCAG 

CATGGCCATGCAGCTGTGCTCACATCTCACCCCTGCACTGGCCAGGAACACATCTATCTT 

TCCTTTGGGTAGGGTCACCCAACTGCTCTGCCACTTCCAGCTGTGAAAAGGGCATCTATG 

TGACAGACCCCTCTGCAGTTTGAAACTGTGTGACAACTTTAACACCCAACTCAGCATCTG 

CATGGGTTTCTGAGMTTACCTATATCTT^ 

TAAAAAAAAAAAMGTGACTCGTGATCCCTGAGTTCCTATNTAGNCMTTTTACTTCACT 

AGCTAAAGGAAACACTGGGATTTAAAAT 

Sequence 3143 

GTNAAGAGCTGGAAGCAACCCAGTTGTGCATCAATGGGTAGNAATGGATGACTAAATTGC 

AGNGTATTCTTACAATAAAATACTATAGAGCAATGAAAATAAATAAAACTATTGGTGTTN 

TTANTGCNAAANCANACCATTGGGATAAAATTCTTTTACCAACCAAATTGTNTCAGGACN 

AAMTTGTTNGCCTCAAAACCCCAAAAAAAATTACAGCATGGTCNCCTATTGATATCCCC 

ANTANTAATGNCNACAGGGTTTCCACGAAATAANGGTGGATTCNGCTTANGTCTTACCTT 

CCTTTTTTGGG 

Sequence 3144 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCGCTTGTGTGGTTTCC 

ATGATCCTTCTTCTGCCTGGAATGAGGCTCCTCTTGAGTCCTGAGAGCTTCATTTCTGTC 

TGTCTTTTGGGCTGTGCCAAAGCCCCTCTGCCTTCGTGGACTCCCCACTGATTCATTACC 

AATTTTCCTCTCCCTCTNGGGCTCTTCTTAGATTGCAAATGGTGCCTCTGGATTGGGTAC 

CMGACTTGCAGTGCCTTAT7TTGATACCACTAATATGAGACCATGTTATATGCCTCTTG 

AG ATATCATTCCCTTTCTAGC AATAGAC AGTG ATAAN GC ATN G CCGTAT AGAGGGGTN N A 

ACGNTACGGACTTNTGNANGTTNATTCCATTCTCAATANGACAAAAAGTAAAAAGGTATG 

CTAAATTTCAATC 

Sequence 3145 

NGCCGAGGTACMGGCATTTTTNCTMCTGTAACTTTCCACTTAATCAACATTNAACAAA 

MCACTCATTTCTGAACATTCAGTGCATTCATGATTMTCTTAATTACACCACAAAGGTA 

TTTTTCAATGGTGATTTTGCGGGAGTGGGGTAACAGTTTCGAAAGCMCATTGTCAGAAA 

CATANGTTGAATTTAAAGGGTTCTTTCTGGTG \CTTTGACTTCTGC 1 1 ill IAGAAGACC 

TTACACAGAGTTGTATTTATTTCTCCTGGAATATTTNAAGCAATTCAGAGTGAAAGGGTA 

TACATTCCMTTTGCGTATGAGATAAAATTTAJiTTACATTC 

TTACAGGGAAAAAAAATTTGTAGGGGCTTTTTNGGAAGCCTCTTTGATTTTCTAAT^ 
AGGGAAATCCCTGGAGNCACCTNGGTCCC^AACNAGNAAAATNCATCT 
Sequence 3146 

ACTACTTACGGGCGAATTGGAGCTCCCCGOGGTGGCCGGCCCGCCCGGGCAGGTACTAAC 

TTGGTCAGGGCAGGGTGACACATAAAATTAACCATCACAGGGAAGGGTAGGGCTGGAGAG 

GCAGACTGTGGCCAGGTTACAATGCGCTGAGGCTAAGGAGACTGTGTTTATCCTGTACGC 

CAAGTGGGTCTTACCTCTGAAAG\"CTTTTGGGGTGGGGACATTCATGGACTTCAAGAGAC 

CTGTGAATGCCCTAAGATTATAAAGTNAAATCTGTGAGTCTGTAACTAAAGCTAAAGCTA 

TTTTTCTGGGGCCCACCATCTAAAGAAGATTCTGAAGCCTTAGGGTAGCCGTGGAGGANA 

CATGAAGGTCCATTTTGCATGGTAGAACCCTGCCTGGCTCTTGC 

Sequence 3147 

AGGTACGCGGGGGTACGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGA 
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GATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTA 

TTTATAAACATCCTGCAGGAAAATGAGTCTATATGTNAGMTACACATTTCCCACCTTTG 

CCCACCNGTTGGAATAANAANNNTAANAACAAAAAAGGTTCCTTGCCCGGGGCNGGCCCG 

NTCGAGCAAAAAGGCCAGCAAAAAGGCCAAGGAAACNCGTA 

Sequence 3148 

CNAGCCGGGGAGCATAAAGGTGTTAAANCCNTGGGGGNTGGCCTTAAATGNAGGTGGAGC 

TMNCTCACMTTTAAATTTGGCCGTTTNGCCGGCTTCAACTTGCCCCCGCTTTTTCCCA 

AGTTNCNGGGNAAAMCCCCTGGTCCGNTGGCCCCMGNCTTGGCCATTTTAAANTGGAA 

ANTACNGGGGCTCAAAACTGCCCGCCGGGGGNGGAGGAAGGGGCNGGGTTTTNTNGGCNG 

GTTAAATTTGGGGGGGCCGGCCTTCCTTNTCCCCGGCCTTTTCCCCTTNCGGNCTTTCAA 

CCTTGGMCCTTTCCGGNCTTTGGCCGGCCCTTCCGGGGNTNCCGGTTTTNCCGGGGNNC 

TTTGGCCNGGGGNCCNGAAAGGCCCGGGGTTMTTTCCAAAGGCCTTTTCAMCCTTTC^ 

AAAAAMGGGGGCCGGGGGTTAAAAANTAAACCCGGGGGTTTTTMTTTTCCCCAANCCA 

AGGGAAAAMTTTCTAANGGGGGGGGGGGAAATTAAAACCCGGCCMNNGGGGAAAAAAA 

0 

Sequence 3149 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGTTCAAAT 

AGTCATGCACGCTCATCATAATCAATGAGCGAGGACATAAAGTAGGAAAAATGCATCACC 

ATGGTGAGCAAGGAAAGCAAGTTATTGGAGGCACATGTTAACACATAAAATATAAAATTN 

ATATGATCACACTGGGAAAGGCTTGCCTGAGCCCCAGTTTGAATGCCTACAATAAGATGA 

GATGCACAACAAAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCA 

TCAGTGGGGACCCAAATGCTTATGTGGACTCACCAGGTNTCGAATTAGACATGAATAGGA 

GTGTTTGTTGTGATGGCAAGAAACTATATAATCAAATGAATACAATGAAACTTTAAAAAT 

AATTG 

Sequence 3150 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAAATGACCACAGC 
AGCATATTTCCTAAGATCCCTGAAAACAGCTAAGATCCCTGAAAGCAACTCAAATGTCCA 
TCAATAGMTGATGGATATTATTCTACCCCATTGAAACATTTCACACACMGTTTTGTGT 
TTTATTTTTATTTATTTAATTTATTTA 1 I I I I CATGTCAATGAATTCAAAGGAATCACAC 
ATAGGTTTTTGTATAGAAACAGCACTACATTTCTGTAGGAGGTGCTTTTTCAAGATATCT 
GTGTTCACTAGTTGTAGCCTG CG ACTAG AGGGCTCTGTGTCAAGCATTCCACTTG CTACT 
GATTCCTACCATACCCCTCTTTGTTCCCAGAACATTGTACCTGCC 
Sequence 3151 

GCGAATTGTTCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGGCCCGAAGTTA 

TGATAGATGAGGAGGGTATTTATCTGAGACAAACACTGGGAGGAAATTTCAACCAGAAGG 

AATATTAGCATGTGAGAAGGCATAGAATCATAAAAGGCATGTTGTGTTAAGGGAGCTACA 

AGTAGCACANTTTGGCTAATGTGTNCAGTTGAGAAGAGAGTTGTGGGCAGATAGGTTGGA 

AAAGAAGCCGTGAGTTAGATCATGAAGAGCCTCGGTAAGGAGTTTGTTTTAAAAACMTT 

ACTGAMGATTCNAAGCAAACACAATATTTATAGGTTGGAAATATTGCCAGAA 

Sequence 3152 

GGTGGCTGCACTTTTTGGGTGGTGGGGTGGGGATGGGGGCAGGTNTGATCAATCATCCAA 
ACCAGGGCATTTTTAAGAGTAAAAGGAGGCTCTACTGATAACATGCTGCAACAACAGATG 
TNACTAGGAACG 
Sequence 3153 

CGAGCGGCCGCCCGGGCAGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATGAAGTC 
AAGGGCCTGGTNGAGGCACCACTAAGAAAGGCATTTGAAAGGACCAAAGAGAGTGACCAG 
CAAGC A I MM I GCAAGGCTGGAGGGAGCTGACAAGCTTCNCATGAAAGGGCTGGNACTC 
ACCCCANNTGGTTGGAAAMNGCNNTCCANTCCTNGGNGTTTACCTTTNGTNGCCTTGCC 
CCATTAAAAATCACCAACCC ATCAAG N AACATTTGG NTTGGA 
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Sequence 3154 

CCGGGCAGGTACACTTCTAGATTGCTCAATGAAAAGAACTCTCAATCCAAAATTTATGTC 

TAGCAAAGATATCCTTCCAGAGTGAAAGAGTAATAAAAAGAGTCTCAGATGAAGCAAAGC 

TGCTTTAAAAGAATTGCTTTAAGAGAGTTAGNTAGAAGAGAAATGATACCAGMGAAAAC 

TTGATATATTAGGAATTATAGAACAGCAATAATGTATATACCTGGGTAAACATAATAGAT 

GATGAGTTCTTCTAAGNMCTTTGAAMTTGAAAGCAGAMTTTTMCATCATT^ 

CMTTTATTTCAAAATAATAGAAGAAAAGCATGAAACTGGCAGATTGGAGGCATCGTTAG 

CATGCCCCCCCNACTTGGAAAGAAAATATTAATAGTGTGTANGAGATTCACCCTGAACTT 

TTTCCA 

Sequence 31 55 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGAGTCATGGTGAA 

AAAGGAATCGTGGTTATGTCACTGGATCAAAGAAAAAGGAGAGGAAATCAGAAAAATGAC 

AATMTTTTTTCCTTGGTATAGTCCCAGTAAAGGATAAACAAAAAGGMTTATTCCCCGA 

ATGCCCCTCCTCTTTATGAATACACAGTATACCCAGCTGGCTTGACTATGATCATTACAC 

ATCTGCACTGCATCCATGCATTTGCAGCCTTTTGACAATACATATGGCCCATATGCTTCA 

AGAGAGATGAAATGCTTGTCTCAGTTAATCTACATGTCTACTTCTGGCTTCAATCTTAAT 

CTTTCAGATGTTCACAAATCAACAAGATAATGTTTCTAGAGTAATGTGGGNAAATC 

Sequence 3156 

GGTGGCGGCCGCCCGGGCTNGTACGCGGGGAGGTTACTNGTAGGACNTTATACGGCAACC 

TCCCTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGGCCTCTTTAAATATACAC 

TTCTCTGTAGTGTATGCTAGAAATGGAGTGGCTGGAATAAAAGTGGCTGAATCATCTTCA 

ACTCTAGTAGCTGAGACCAATGATGCTGACCTCCCTCAAAAGCTGCATTTCTGAATTTCT 

G AAGG CAAACTGTCTGCCTATATTGTACCT 

Sequence 3157 

CCGCGGTGGCGGCCGAGGTACTTTGTGGGTTGAGGGTAGGAGGCTAAGACTGCCAGGGAG 

GAGGTCCCGGCTTCCTGCTCCTACCGGGGAGCAGCAGGTGAAACTCTGACCACGTTACTG 

CAATCCTGACATGCTCCAGTGGAGTGGCGACA Mill CTTCTCGAGGCAGCTTTTGGGGA 

TC TGTAT TACTTTCATGTGGACCACTTGCTAGTTTTGATTTCAATGATAATT^ 

CCI I I I I I I I I I I I C AAGCCTGTCTTCATAG NGATGACAACACATTTAACATTTGTTTTG 

ATTTT 

Sequence 3158 

AGCTCCCGCGCGGTGGCGGCCGAGGTACGCGGGGAAGCTAMGAGGAGATCGTTTTTATT 
TCATAGCGAGAAATGGCTTCCGGGTGCTGAGTAGAAGGGGCAGGCATGTTTGG1 AGTGGG 
CTCTCCAGCCCTAGAAGCATGCACGCAGAGCCTCTGTAGGAC l l rTTTTTTCTTTTiTI 
AAGACAGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAATGGCGCAATCTCGGCTCACTT 
CAGCCTCCACCTCTCGGGTTCAGGCGGTTCTCCTGCCTCAGCCTCCCGAGTGGCTGGGAT 
TACTAGGCACCTGCCACCACACCTGGCTA A I IMHIll I G TAT7TTTAGTAGAGACAGT 
GTTTCGCCATGTTGGCCAGGCTGGCCTCGAACTCCTGACCTCGGGTGGNCCGCCTGTCTC 
GGCCTNCCAAAGTGCTGGGATTACAAC 
Sequence 3159 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGVACGCGGGAG 

ACTCATTAAAGCTCCTAAATGTATACATATGTATCAGAAAGGAATGTGAACTATTAAGGA 

MCTTGGTAGACAATGAAAGAATTTGGAAATCTGMGGCTTTAAGAGGCAG/xGTTGTAGA 

TCMCCCCTCATACCAAATAATCAACTCCAAAGTAAAACTTTGAATATAAACTAACTGCA 

AAAGAAACCTTCGCGATATGTAATAAGATGGCACACGCTGTGCTTCTGAGGACATGTGTG 

GATGCCTCCCCCACGTCCT 

Sequence 3160 

ATCCGGCGAATTGTTTTCTC NCC NCGGCGGCCGGCCGCCCGGGCAGGTANTTGG AGTGCT 
GTGTTGAGTCCTTTTTNTTNTTTTGTTCCTGAGAAAGAGAGATTCACAAACTTCCCACAT 
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TCTTCCAGTGGCATCTAAGGATCCCTTCCTAAATGGCCTTTCTCCCAGCCAATAGACTGA 

AGCAMCCATGTCCCTGATGCTTCCTGCCCACCACCATCCATTCTTTCAACCAACAGCTT 

CTCCGGGCTTTCACGTGCCATGCACTTTGCTAAGAGCTTGAGGGAAATACAGCAATGAAG 

AAG AG ATGATTTCTGTTTGGACCAG TG CTCAGGGATTCCTCCTATTAGAAATCAAAGAGG 

CTTCCAAAAGCCCTACA A 1 Hi 1 1 CCCTGTAACTAAAGAAAATAGCTTCTCAATGTAACT 

AAATTTTATCTCATACGCAAATTGGAA 

Sequence 3161 

CTATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAT 

GGCAACTGAGGCAAAGAGCTCTTAAGTTACTTATCGAAGAATATCCAATAAGTAGGTGGT 

AGAGCTAGGATTTACACCCAGCAGTCAGCTTTATTTTCAACCATGCAATGCCTCCTTAAC 

AAAGGAGATAGGTTAGGAAAGGCAGATCTGGGCAAGTGTTGTATTATACAACAATAAAAA 

GGCTGTGGGGTCAAAAGCTATGGCTGTTAAAAGGTTAGGGTAACAGCCAGAAGAATCACC 

CTTAGGMGGTTAAAATTTAGGACMCATCTATGATCCTTCCCAMCCTTAMTCTTCAT 

GCTTTTAAAAAGAATTTCCTGCCCAGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTT 

GGGAGGGGGAGGCGGG 

Sequence 3162 

ATCCGGCGAATTGTTTCTCCCCGCGGCGGCGGCCGAGGTACAGGGGTGNGAGAAAGAATG 
CTNGAAAGGGCAGCCTTTNNAGTGTTTTATCCTAAGACCATCTAAACGTGAAGGAAGTCC 
ATTCCAGCTAGTGAGGAGGTGCCTAACACAGTGGTTTCAAGCCTGACTGACCACAGAGTT 
ACCTTGGTAGCATATTAAAGGTCCTCATTCCTACATCTCACCCTAGAGCACCTGTTTTCA 
ACCTTAGATGCACAGTNGAATTAACTGAGAAACTTTTAAAGCCATCATTATCCAGGCCCA 
AATGC AGTATTG NTAAATC AG AATTTCTGGGG ATGTTGCTCAGGCATC I I 1 1 I lAAATTT 
AATTCCCTGGGTATACCAAGGATTGAAACTTGGACACTCTATATACTGGAATCAAA 
Sequence 3163 

CCGGGCAAGGTACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTG 

CTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCT 

TCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGTGTTGTTACCACTGCTTC 

CCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTACCT 

Sequence 3164 

GCGGTGGCGGCCGCCCGGGCAGGTACCTGGAAAGATGAACANGATTGCAGCAGATTNAGA 

CAGGTANAAGGAGAGAGATTCCAAGTGGAAGGATAAAGGAGAGGGAAGAGAAGCAAAGAG 

TGCAGCTTCCAAGAAGATTTTCTGGTTTCTGGCACTGGTCANAGATAGCTTGGGAGGAAA 

TAGGCANAj\ ! AGGGTnTGGAGAMCAGAGCTGCAMGCCAAGTTAGATGTTAGGTGAGTT 

GAAAAAAGAACTTATGGCAAAAGCAGAGGGAGGGAGGTATTTACTCTGAACAGAGATGAC 

AGGGTGGGGAGGACAGATTGTTTTGAGTAGGCAAAGGCCAGGAGCATGMGCAAGCTAGG 

AATTGGGAGGCTGGCCTAGAANCATGCNCCANCTGGGAAAGGGACA 

Sequence 3 V55 

CCGGGCAGGTACCACATCCCTGTCATTCAACAACTTCCCCTTCCAAGAGGGTGCTGATCA 

CAGACTTTfTACTGTAATGGGGCCAAGAAGAGACAGTTACTTAMCTMTAGAGTTCTTT 

TTATAACAAGAGACTGCAGATATCGCTACAGAAAGAATACCGGTTGTATGTTTCCTGAAT 

TATTACT/nAAAACATTTGTTTAAATGTCTCATATATAAAGGTGATGGCCTTATCTCTGAC 

AATGAA 7 TAGGGCCTAAGGAGAAAGCAAATTGGAACACTGTTAACATTAAAAAGTCAGCT 

AGTCTTTGAMTACCATTATAATACTACCTTGGAAGATACAGCAACTCAAATCTTGTTC 

Sequence 316<B 

AGGTACTCTTGG M I I I 1 AAGACAAAGAGCAAATCCTCCCCTGCCAGGATTGACTTTTGG 
CTC 1 1 1 I m 1 CAMCCTCACTGCTTTTTGGTTTAGTTGTCATAAAATGCCAAGCACCAT 
GAACAGGGCTCCATGAAGGGGCTCAGAGGTAGGAGGGCTGTGATTAGGAGAAGGCTTGGA 
CTGATGGGCNATTTGAGTGCTCAGAATTAGAGTGAGGGGGTGGGGGTGCTGCAGGGACAG 
ATGCTGGGGAAAGACACCCTGAAGGGCAAAGGGAGCAACAATGGCTGCAGTACCTGCCCG 
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AGGTACGCAACATGACATTGGCTGGTGTA^ 

GTATTCATTTGATTATATAGTTTCTTGCCATCACAACAAACACTCCTATTCATGTCTAAT 

TCGATACCTGGTGAGTCCACATAAGCATTTGGGTCCCCACTGATGAGAGCACCATGGCCA 

GGTCACCCACTTGATCAGGTTCTC7TGCTTTTTGTTGTGCATCTCATCTTACCCGCGTAC 

CTGCCCGGGCGGCCG 

Sequence 3168 

GGGAGGAGCTGCCTTGAGCGAGAGGACACTGTCCTGGAGGTGACAGAGAGCAACACCACA 

TCAGTGGTGGATGGGGATAAACGGGTGAAACGGCGGCTGCTCATGGAAACGTGTGCTGTC 

AACAATGGAGGCTGTGACCGCACCTGTAAGGATACTTGGGNAGGTGTCCACTGCAGTTGT 

CCTGTTGGATTCACTCTCCAGTTGGATGGGAAGACATGTAAAGATATTGATGAGTGCCAG 

ACCCGCMTGGAGGTTGTGATCATTTCTGCAAAAACATCGTGGGCAGTTTTGACTGCGGC 

TGCMGAMGGATTTAAATTATTAACAGATGAGAAGTCTTGCCAAGATGTGGATGAGTGC 

TCTTTGGATAGGACCTGTGACCACAGCTGCATCMCCACCCTGGCACATTTTCTTGTTGC 

TTGCAACCC AAGGGTACCTCGCCGCTTTAG AACTANTGG ATn nnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnCNTCNACCTCNAGGGGGGGGCCCCNGTACCCCANCTT 

TTNTTNCCTTTTANTGANGGTTAATTGCNCCCCTTG 

Sequence 3169 

CCG GGC AGGTACTCACAGCTGC ATC AN G NTGAATATC ATCTC CCTGC ACTGCCATTTCAT 

TTGAGCAGGGCTTTGCCTGTCAAGMCATTCATATCTTTTGATAAACAGTGCA l I I I II C 

CCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGAC 

CTCCNGNTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGTCCTT 

CTCAGTTTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTCTCAATACACCCTCTGCTT 

TAGCCAGGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGTTCCTGACTGTTAGTA 

CCT 

Sequence 3170 

AGGTACi I II I I I! I II I I I I M 1 1 I II I ! I GGGGTGGAGTCTTGCTCTGTCTCCCANGC 

TGGAG TGCAG TGCTGCTATCTTGGCTCACTGCAAG CTCCACCTCCTGG GTTCACACCATT 

CTCCTGTCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGCCCGCCACCACACCCGACTAA 

TTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTAGCCAGGATGGTGTTGATCTC 

CTGACCTNATGATCTGCCTGTTTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCA 

CCATGCCTGGCCTCTCACATGCCTTTTACTCCTTTGTGGTTGCTGCTCATCTTTCAACTT 

TCCTTTCTAATTTGTCAAATGTTTTCCTATTTCCAGCTTCATTAGMGCTGTAGCATC^ 

TGCATATTTCCTGTTMCMTTGGCTTGGTCTATTTCTCTGTAGAACACCCAGAGTCCTG 

AAAGTAGTAAAGACCTTGGCCTTGAAGCAGGAGACCAGGGTTCMGTTTCATTTCTGTCA 

TTTATGAGATGGGTAGCACAGGAMGTTGCATCTCTATTTTANGCTTAATTTTCTTACCT 

GTAAA 

Sequence 3171 

CACTTAATCAACAAAMCAAAMCACTCATTTCTGMCATTCAGTGCATTCATGATTAAT 

CTTAATTACACCACAMGGTATTTTTCMTGGTGATTTTGCGGGAGTGGGGTAACAGTTT 

CGAAAGCAACATTGTCAGAAACATAGTTGATTTTAAAGGTTCTTTCTGGTGACTTTGACT 

TCTGCI I II I I AGAAGACCTTACACAGAGTTGTATTTATTTCTCCTGGAATATTTCAAGC 

MTTCAGAGTGAMGGGTATACATTCCAATTTGCGTATGAGATAAAATTTAGTTACATTG 

AGAAGCTATTTTCTTTAGTTACAGGGAAAAMTTGTAGGGCTT^ 

TTCTAATAGGAGGAATCCCTGAGCACTGGTCCAAACAGAAATCATCTCTTCTTCATTGCT 

iGNATTTCCCTCAAGCTCTTAGCAAAGTGCATGGCACGTGAAAGCCCGGAGAAGCTGTTGG 

TTGAAAGAATGGATGGTGGTGGGCAGGAAGCATCAGGGACATGGTTTGCTTCANTCTATT 

GGCTGGGAGAAAGGCCATTTAGGAAG 
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Sequence 3172 

CGAGCGGCCGCCCGGGCAGGTACGCGGGAGACTGAGTGGGGTCACAGCACAGGGCACTGT 

CTTGCCTGGCTTTATCTGAGCCAATCACACCTCTCCTGGCCACTATCTGTGGTCTAGCCC 

CCTTTGTGCAGAAAGAGAAAGAAGAGCCTTGAGGACCAGCCTAGTCAGGCTGAAGAAATG 

TCMCMTTGGGAGTTTTGAAGGATTCCAGGCTGCGTCTCTGAAGCAAGAGGGAGATGAC 

CAACCCTCTGAGACTGACCACCTATCGATGGAGGAAGAGGACCCGATGCCAAGACAGATT 

TCAAGGCAGTCAAGTGTGACCGAATCAACTCTTTACCCCAATCCTTATCATCAGCCTTAT 

ATCTCACGGAAGTACTTTGCTACA 

Sequence 3173 

CCGGGCAGGTACAGATGCAAAAACTCANNTCCACAAATGTCCCGTGGCTTATCTAAAGCT 

ACACAGCTAGAAAGTGTCAGGACGGGAATGAAACACAGAGAGTGACCTGTCTCTGCGCCT 

GTGCCCTGTCCATACTTTCTGTCCCCAAGAGAGCAACACCATAGATCCAAACCAAAACCC 

CAAAAAGNNNAACTCTAGGAAATGCACAGGCTCCACTTCACTCCCAGGAGATTCCTATGG 

ATCTATGGAAGAAAATGAGGCAN TGAC C GAG GAATTTAAACTCTGGG GTGAAATC ATTTC 

CTCCCAATTATTCTTCGTCTAATGACAAAGATGCACCACTAAAAAAGTGACCTATGCCCT 

CTAACANTCTGAGAAGATCTGAGTCACATGGCAATTCTATTTGTAACATTACAAGATTAT 

AGAGGCCAGTTTTCCCTTCCTCCTCATTTTCTTCTCCCCCATGACATTTTTAGAAAGTCA 

CNTTCATTTA7TTTATCATTAGCTATATGCCTTAACCCCTGGCC 

Sequence 3174 

AGGACAGAGGGTCTGCMTCATGGTGCAGCCACAAGTAAAATATGAGTAGTATTTATGCT 

TTTCTACACTCTCATATGTATTGCCCAGCTTATTTACTTATTTACCTGTTTACTTATTAA 

MCCMCCCCCACCTTCAGTGGATGMTATTTGCCAACTCAAATGACAGTAAAAAACGGT 

GCAGNGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGACAGGCAGATGGCTTGG 

GGCCGGGAGTTTGCGACCAGCCTGGCCAACATGGTGAAAACCAGTCTCTACTGAAATTAC 

AAAAAGTAGCCGGGCATGGTGCCACATGCCTGTAATCCCAGCTACCTCGGAAGGCTGGGG 

CACGAGAATTGCTTGAACCCAGGAGGCAGAGGTTGCAATGAGCTGAGATTGCACCACTGC 

ACTCCAGCCTGGGTGACAGAGTATGACTCTGCCAAAAAAAAAAAAAAAAAAAGAGTAGAA 

GGANTTATAGCCCNCAGAATTTTCCAGGGCAATTCCAGAGGAGACCCAACAACTGTTAGG 

GACTATGCCCCTATTGNTAAAATGTNTATATAACCTCTGAGGCTGATGACTTTGAGAACA 

CCTACACTGACTTGAAAATTCA 

Sequence 3175 

AGGTACCCATCAGGAATATCCGGCTCAGAAGCCAGGGCCTCAGAGACTGTTCTCTCACTG 

AGACCTAGAGAGGGAACTCCCTGTGTTATTCTCACTGATGGCCCAGGAACCACCCTTGCA 

AGTCATGACCACCAGCATCATGTAGCACTGGAACTGATCTGGGCAATGACCTCTGTCTAA 

ACTTCTGAATCCCCTCCGACAAAGACCCAAGACAGCAGCATGGCCATGCAGCTGTGCTCA. 

CATCCCACCCCTGCACTGGCCAGGAGCACATOTATCTTTCCTTTGGGTAGGGTCACCCAA 

CTGCTCTGCCACTTCCAGCTGTGAAAGGCATCTATGTGACAGACCCCTCTGCAGTTTGAA 

ACTGTGTGACMCTTTAACACCCAACTCGGCATCTGCATGGGTTTCTGAGAATTACCTAT 

ATCTTTTGTGGTCTCTTTGCTGATTCCCTGTTrCATTAAAAAAAAAAAAAGTGACTCGTG 

ATCCCTGAGTTCCTATATAGCCAATTTTACTGACTAGCTAMGAAACACTGTATTTAAAA 

TGACAAACCTAGCAACAATTAGGCAAGCTCTCATCAGGACTCCATGCAGGGCTGNGTGAT 

TGCCTAAAAAAGTCT 

Sequence 3176 

AGGTACATATACCAACTGTCAGTCTGTGGTAACGTAACCTTAGACGCTGCCTGACCACAA 

AGTTTGG AAATTATATTTAAC C AAAAG AAMTAAATAAATAC AG C AG GTTAC CTTGTG AA 

GAMTGCCATGTGTTCTAAACAAATTAAACCCACGGTCTGACATTTGTATCCATCTATGC 

TTTTGAATTGAGGGGATTCCTTTAGTTATAATGTGCCAAGTGAGATTAGTMCCMCAGG 

AGGCTTAGAAAAGCAAAtCCATACTGGTTCATCACACACAAGAAAGAAATACCTGGCTAA 

AATGAAATCACTACAGAAGCATCGAGGACATTGTTTTTTAAAAT 
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Sequence 3177 

CCGGGCAGGTACGCGGGAGACTAGAGGGAATTGCACCTCCTCACTGCACAAGATTAGCTA 

ATGCGGGTGAAGCTATTGGAGTGCTTCTGAAGAAATCAAMTTTTGTTATTATGATTATA 

CTTGTTAGATATTTCTTTCTTGAGATTCTAAATGAATTTGmAATGCCTTATTGGTCCA 

TTCCTGCATC TC TG CTTCC AGAAGCCAAGAAC AAGTTCTCTGGATC ACATAG GCTCCTCT 

GCTGCTTCCTCAGAAACAGAGATGGTACCT 

Sequence 3178 

AGGTACGCGGGGCACCGGGCACTGTTCTAAGCCCTGGAGATACATCAGTGAGCCAGATAG 

ACACATTTCTTGCmCATGGMCTTAAATAGTATATGTMTCAACCTGTTmCCA^ 

TTTAGAACAATGAAGAGTAGGGCAGGAGTGGAATGAGGGTCAAAAGTAAGCCTTGGAGTC 

CTGCCCCAAAGTTTTCTAATTAGATTATGAAGGAAAGACCTTATGCAACTAAATTACCCA 

TTTGGGTCAGGTGTAGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAGGTCAAGGCAGG 

AGATCACTTGAGGCCAGGAGTTCAAGACCAGCCTGGCAAACATAGGGAGACCTCATCTCT 

ATAACAAATTTAAAAATTGGTAGGGCATAATGATGTGCACCTGTGGTCCCAGCTACTCAG 

GAGGCTGAGGTAGGAGGATCATTTGAGACTGGGAAGTCGAGGCTGCAGTGAGCTGTGATC 

GCGCCACCGCACTCCANCCTGAGAGACAAAGGGTCATTTTGTGTCCTATTGATATTTGTA 

ATMTTAATGGTCTTGTTGCCCCTTAGGGNTGCTTNCTTTTT 

Sequence 3179 

AGGTACTCACAGCTGCATCACACTGAATATCATCTCCCTGCACTGCCATTTCATTTGAGC 

AGGGCTTTGCCTGTCAAG AACATTCATATCTTTTG ATAAACAGTG C A I I I 1 1 I CCCTGAT 

AGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGACCTCCTG 

CTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGTCCTTCTCAGT 

TTGCCTACGATCCATTGAGGCAMGGTTTCCTTCTCTCAATACACCCTCTGCTTTAGCCA 

GGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGTTCCTGACTGTTAGTACCTGCC 

CG 

Sequence 3180 

CCGGGCAGGTACCTTGCCCTGAGGAACCTACAAGTTAGCCTCCCTCTAATCCTGGCATTC 
AAAGTCCTCCACCTTCCGAATTACACCTG I I I I I I CAGGCATTGTCTCCCACTTTCCTAG 
CMGCCTTTGGCTCCAGGAAACTGTTCTGTTCATTGACGACAGMTGTATTTTGTCCCGT 
ATGCCTCCCTTCAATCTGAAGTTC A I I I I I AGTA AH II 111 I ATTAGC AGTG ACTTTCC 
CCATTTACTAATGCCTTTCTCTTTAGCTCATTT^ 

CCTTCTGAATTGCTATCATATTCTGTCTCTGTGTCTAATCAGAAAGTTAGGTGAGCTGCT 

GTCTCAGCTCCTGTGTTAATCCAGGTACCT 

Sequence 3181 

AGGACTAGCTCCCTTACTTACTAATCATAGAACTGTGGATATAl I I I I AAAAATT ATT AG 

TGTCTTTAGTTTTCTCACTTGTAAAACACTGAACAGAGCTCCTGTCATTCAGAGCTGTTG 

TCAACATGACAGTTAAGAGTGTAAAGCACTGACCTGGGGCCTGCATGTAGTAGGCACTCA 

ATAAATCGTGGCAATCATGACATTGGTGATGATGACACTCAGTAACTAGTAGGTGTTATT 

ATTAAAAGTCGCAAAAACAGCTAGTACCTGCCCG 

Sequence 3182 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTCTACGTGGTGGTTGGAGCTGCCAAGG 

GCACCCAACCTGGTCAACAAATTAGTGTGCTCAGCCAGAGGCAGTCCTGGATGAATTCAG 

GCTGCAAAGTGCCCTGTGACCTGGCCCTTGGGAAGCCACGCTCTGATGAGGTTTGCAGAT 

CTCCCCGCAATCCAGGGCCAAGCATGAATGAAGGTGTTAGGCATTTCAAGGATGCGACTA 

GGACGTGCTGCTGTCTCAGGTTCACATTATACTGAAATAAGGTCCTGGCCAGACAGCACC 

A TATA GTTMTTG AGGTGTTTGGG AGATACTC AAATGGGTGTTAATATTTCCCAG G CAAT 

ATTTTCAAGGCTCAGAGCCTCTATTCAAAAGATAACTGAAAGGCTAGCAAAGGGTATTGC 

AAAGGTATATTCTGATTTCTGACAGGAAGATCGATGGATGGACAGTGAGAGTATAAAGAC 

CTTAGGGTTTGGGGTTCTGATTTGGCCTTGGTCTGTTGAATGATTGATTCATTCATTCAA 



Page 48 1 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 

CAAATATTTATACAACACCCACAGGGGGCAG 
Sequence 3183 

CCGGGCAGGTACTTAGCACATGCCAGGAACTGTTCTAAACCCCTTCTGTGCGTTAGCCCA 

TTTAATCCTCATAGTAACTCAGTGCTATCTCTGATATCCCTGTGTTACAGTTGAGGGAAC 

TGAAGTATAGAGAAGGTTGAGCAATGGCATGCAGGGCATAGAGAAGGGTAAGGACGACTG 

TAAACAGGTCMGAAGTGGMGGTCAGATAATTTACCCTGATTGCACAACTGGTAAGTAG 

TG GTACC AG ATTG AGGCTCAAGTCTTCACTCTC ACAG CAGGGTACTTTCTGC AACTTC AG 

TTTCTTATTACCATTTCTGCCACTAGCATGCAGAAGACGCTAGATGATTATTGAATCAGT 

GAGTTTCTCATAGGTAAGATGGAMTGATTCTTAGTAGGGTTTTGTGAGMTAAAATTGG 

AAATTGTACATTTCTTTMCATATTTTGGGTAGTAATAGCTATTATGATGCTAGTGGATA 

ACACAAGAGTTCAAAGGTGACCAGTTAGATGTGGAGGAGGGAAAGAGTTGTGTAAAAGAT 

GATGAGTGTTCTAGCCTGAT 

Sequence 3184 

AGGTACGCGGGGAGACTAGAGGGAATTGCACCTCCTCACTGCACAAGATTAGCTAATGCG 
GAAGCCAAGAACAAGTTCTCTGGATCACATAGGCTCCTCTGCTGCTTCCTCAGAAACAGA 
GATGGTACCTGCCCG 
Sequence 3185 

CGGCCGCCCGGGCAGGTACAGAMGAGGTTTGTCTGACTTAMGTGATTTGTCATCTTTC 

TTTTCCACTCTCCTAATCATGATCTGTTCCCCCTCCTGGCCCAATAACTTAATGCTTGTA 

AACTGAGGTCCACCCATTTATGGAAAACTCT7TAGTCACAGAGTTCCCTAACACCCTCCT 

CTGTCCCTGTGACAATGTATTGAAAATAACAGTTGAAGCCCCTCTAGAGGGTGGTCCAAG 

TTGACTAAATGGGTAGCAGTGAACAGCATTAAAAGGAAAAGAMTAAAATAACACACAGA 

AMTGTCATAGAGCMCATACAGCAMGATAGTTTTTATAAAMGTGGTTTTTAATTACA 

CATATGCCATATTTCACTGAATCACAGTCATCATTTATTGGAAAATGCAAAACTTTATGT 

ATCACCCAGAANGAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNGTCCCT 

Sequence 3186 

TGCCTTGTAGCGGCAGCCGGGTCGGGAACAAGGTGTAGTTCTGGTTGCCGTCGCCAGCCA 

CGCCGTCCGCCGTGCCGCCGGCTGTGCCCGCGCCGGCGGGCTGGCCGTACAGCGCGGGCG 

GCCCGGCCGCGTAGATGCCGTCCGCGCCCGTGATCTGGTCCACGCCGTACACCACCACGG 

CGCCAGCGGCAAAGGCCGGCCCCAGGGTGGTCAGCAGCTGCTGTCGCATGGTGGCGGCCG 

CGCTGAAGCGCGTGGCTTCGTCCGCCGCGCTGCTGCTGAGCACGGGACTGACCGTGGCGC 

TGATGGCGTCGGCCAGCGTCTGTTTCGCCT 

Sequence 3187 

CCGGGCAGGTACCTTGCCCTGAGGAACCTACAAGTTAGCCTCCCTCTAATCCTGGCATTC 
AMGTCCTCCACCTTCCGAATTACACCTGTNTTTTCAGGCATTGTCTCCCACTTTCCTAN 
CAAGCCTTTGGC 
Sequence 3188 

CCGGGCAGGTACCAGCACGTGCAGAAAAGAAAGAGTCTCCGCTTGTCTTTGTCTGATTCT 
CCTGTCCTCTCCATGGAAGTTACATTTTCTGTAAAGGATGAGCTGAAAATTCTCCTGGTC 
GTTGCCAGTTGAACTTCTGCTGTGCTCTGGGAAGGCATTCTCACTCTGTTTATGTTGTCT 
AAAGTGCAGACATGGATGTGCAGGTTTGCTAGAACCTCCTGAGGATGTGCM 
Sequence 3189 

CCGGGCAGGTACACAGCACCCCCTGGAGTTGGCAATGCGGAGGCTCTGGCTGTTGATCCC 
AAAGAAGATTTAAATTGCAGTCCATTATATTTT^ 

TTCCAAATAGATTAAAATATAATCTGGMTCTTMGTGCCTTAGTCCTATTTAGGTAACA 
ATACTATCTAAGAAAAACTCTAGAAGAATTCACAACAAATTGCTAACTATAGTTATCTCT 
GGCAGAGGAAGTAGCATGGGAAAGGCGTGGGAAMTrTTTTACTTTGACI MINI ICAC 
CTGGTGTCCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCCCGGTACnnnnnnnnnnnnnnnnnn 
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nnnnnnnnnnnnnnnnnnnCGCTTGGGCGTAATCATTGGGTCATAGCTGTTTCCTGTGTG 
GAAATTGNTATCCGCTTCCAATTTCCACACAACATTACGAGCCGGGAGCCATAAAGTGTA 
AAAGCCCTGGGGTGCCCTAATGNGGTGAGCCTAACTCACATTTAAATTGG 
Sequence 3190 

CCGGGCAGGTACGCGGGGGAGATGCTGCCACCTAGG7TACTTGTAGGACCCTA7ACGGTA 

ACCTCCTTTGCCAGGAACCTATTTATAAACATCCTGCAGGAAAATGCAGTGAAGTAGAAG 

AGACAGGGATATCCCAGTAAGGTTATCGCNAAACATCAAGAGAAGATGAGAGGAGTCTAT 

ATGTCAGANTACACATTTCCCACCTTGNCCAACAGTAGAAAGAANAAAAAANAAAAANNA 

AGGTNCCCTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNGCTGCCAGGATTTCAATATA 

CAMGCTTAATCGATMCCCGTNCGACCTCCGAGGGGGGGGGCCGGG1TCCCAGCTTTTT 

GTTTCCCTTTANNNGAGGGGTTAATTAGCC 

Sequence 3191 

AGGTACAGGGGTGGGAGAAAGMTGCTGGAAAGGGCAGCCCTGMGTGTTTTATCCTAAG 
ACCATCTAAACCGTGAAGGAAGTCCATTCCAGCTAGGTGAGGAGGTGCCCTAACACAGTG 
GTTTCAAGCCTGACTGGCCACAGAGTTACCTTGGG 
Sequence 3192 

AGGTACAGGGGTGGGAGAAAGMTGCTGGAMGGGCAGCCCTGMGTGTTTTATCCTAAG 

ACCATCTAAACGTGAAGGAAGTCCATTCCAGCTAGTGAGGAGGTGCCTAACACAGTGGTT 

TCAAGCCTGACTGACCACAGAGTTACCTTGGTAGCATATTAAAGTCCTCA7TCCTACATC 

TCACCCTANAGCACCTGTTTTCMCCTTAGATGCACAGTAGMTTAACTGAGAAACTTTT 

AMGCCATCATTATCCAGGCCCAMTGCAGTATTGTTAAATCAGMTTTCTGGGGATGTT 

GCTCAGGCATCTTTTTTAAATNTMTTCCCTGGGTATACCAAGATTGAAACTTGACACTC 

TATATACTGAATCAAAAG CTCC ATGGGCAAGGTGTAGGAATCTGTATTTTTTAATTACCA 

TCTCCCAAATCTACTTAGTTTTGTGGAGCAGCGTTTGAAATGGACTTGTGACAGGTATGG 

ATGAAGTGAACAAGAGACAGTANAAATCANGCGTGGTGGGGAATTGAAAGAAATGGCAGT 

GGCAGTGGTTCCAGTTTAGAGATTCTNAAGANGCTCAAGATGGANCCCTCANAGACTTTC 

ANTGGTTT 

Sequence 3193 

CCGGGCAGGTAC I I M I I I I I I I I I I I I I I I I GGAGTCTTTAAGATTATCTCTATGTTGG 

CTGGCTGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAAGCGGGAGGCCA 

ACACTGGGGCTACCCMGAGTTGACCAAAGGTGATATTAATACATATGCTTACTTATTAC 

CTTGTTCGCACAGTTCTAGTGTMGTCTTGAAAATCAAGTAAATTTGGAMCATAGTTGC 

ACAAACTCTCTCATGAGTCACACAGTTAATCCAGAATATTATTTCTAAGTTATCATAAAG 

CCAACAAGCAAATATGATGTACCT 

Sequence 3194 

AGGTACCATCTCTGTTTCTGAGGAAGCAGCAGAGGAGCCTATGTGATCCANAGAACTTGT 

TCTTGGCTTCTGGMGCAGAGATGCANGMTGGACCAATAAGGCATTAMCAAATTCATT 

TANAATCTCAAGAAAGAAATATCTAACAGW 

CAGAAGCACTCCAATAGCTTCACCCGCATTAGCTAATCTTGTGCAGTGAGGAGGTGCAAT 

TCCCTCTAGTCTCCCGCGTACCTGCCCG 

Sequence 3195 

ACCTTGGCGCCAAGCCGCGCCAGACCTTCATGCAAGTGATTTTGCCGCTGTCGGTGCCGG 

GGATTGCGGCGGGTTCGATCTTTACGTTCTCGCTGACCCTGGGGGACTTCATCGTGCCGC 

AGCTGGTGGGCCCCGCCGGGTTACTTCGTCGGCGGTATGGTCTACGCGCAACAAGGCGCT 

ATCGGCAACATGCCCATGGCCGCTGCATTCACCTTGGTGCCAATCGTGCTGATCGCGGTT 

TACCTGTCCATCGTCAAACGTCTGGGGGCCTTCGATGCACTCTGAAAAAGCTTCGTGGAG 

TCTAAAAATCGCGGCCTGGGGCCGGGTTGGTTTTCCTGCACTTCCCGATCCTGATCATCT 

TTCCTCTACGCCTTCAACACCGAAGAAAGCGGNGTTCAGCTTTTCCACCCCAGGGCTTNA 

CCCCTGAAGTGGTTTCAGCGTGGCCTTCTCGCGGCCGGACGTGCTGGAAGCCATCAAAGC 
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TGTCGCTGNAAATCGCCGNCATCGCCACGCTGATTGCGATGGTACTCGGCACCCTGGCGT 

CGGCGGCGTTGTATCGCCCGGGAGTTCTTCGGCAAGCA 

Sequence 3196 

AGGTACCAGTAAATGCTGCCTCATCACCTACACCCTTGGTTTACTTCTCAGATAAACTCA 

GGMGTTTAAATGGCATTCTGAGAATCTGTAGCTGCAACATAGTGAGCCCAGGAGTCCCG 

GTCCAGCAGCTAGTATTAAATTCTCTGTTGCTAGCTTGGACCTCTCTCGATATCTAGGAA 

TTGTCTGCAGTGTGGGGTGGCACAGTMCMCCATTTGTTTCTCAACAAGAATCTGATAA 

TGTTGCTTGMCCTCCTGAACCAGTGCTGCATGCTCTGCTTCTTCATATGTTTGTCCTTA 

GATTTGACCAAAAACAATAACTGATAATGTAATACAGATTCCCCAGCTCCCTGTGAGCTG 

GGTTG AGCCC AGGAAGTCTGTAC C TG CCCG 

Sequence 3197 

AGGTACAGGGMCTATTGGAGCACCTAAGAGGAGCACCTACCTTGAATTTAGGGGTTAGC 

AGAGGCATCCTGAAAAAAGTCAAAGCTAAGCCACAATCTATMGCAGTTTAGGAATTAGC 

AGAACCGTGCCGTGGTGAGGAGATGCCAAAGGCAAGAAGAGAAGAGTATTCCAAACAGGA 

GGGATTCCAAAGAGAGAAGAGTATCCCAAACAACATTTGCACAAACCTGATGGGGAGAGA 

GAATGTGGGGTGGGGATGGATGATGAGACTGAAGAAGAAAAGCCAGGTCTAGATNATCAG 

N GGCCTN GTCCTGCCCGGGCGG N C G CTCTAG AACTAGTGG AT 

Sequence 3198 

CCGGGCCG GTACTAACAGTCAG G AAC AAATAG AAC ACATAGAG AGG AC AGTGGG AGGTCC 

CCCTGGCTAAAGCAGAGGGTGTATTGAGAGAAGGAAACCTTTGCCTCAATGGATCGTAGG 

CAMCTGAGMGGACCATAACACCTTGCTGAGAAGTTTGMTGTGATTCCTTNTTNACCA 

CAGCAGGAGGTCCTGAAGTCCCTGTCCTTCAGATGCAAACAAGGAGGCTGTCCCATACAG 

ACTATCAGGGAAAAMTGCACTGTTTATCAAAAGATATGAATGTTCTTGACAGGCAAAGC 

CCTGCTCAAATGAAATGGCAGTGCAGGGAGATGATATTCAGTGTGATGCAGCTGTGAGTA 

CCT 

Sequence 3199 

TCACTGNCCGCTTTCCNAGTCCGGGAAACCTGTCGTGCCAGCCTGGATTAATNAATCGCC 
CANCGCCCCGGGGGAGAGGGCNGNTTTGCCGTTATTTGAGGCGCTNTTCCGCTTTCCTCG 
CTTNANTTGACTTCGCTTGCGCTNCGGTCCGTNCGGGCTTGCGGGCGAAGC 
Sequence 3200 

AGGTACCAGTAAATGCTGCCTCATCACCTACACCCTTGGTTTACTTCTCAGATAAACTCA 

GGAAGTTTAAATGGCATTCTGAGAATCTGTAGCTGCAACATAGTGAGCCCAGGAGTCCCG 

GTCCAGCAGCTAGTATTAAATTCTCTGTTGCTAGCTTGGACCTCTCTCGATATCTAGGAA 

TTGTCTGCAGTGTGGGGTGGCACAGTAACAACCATTTGTTTCTCAACAAGAATCTGATAA 

TGTTGCTTGAACCTCCTGAACCAGTGCTGCATGCTCTGCTTCTTCATATG7TTGTCCTTA 

GATTTGACCAAAAACAATAACTGATAATGTAATACAGATTCCCCAGCTCCCTGTGAGCTG 

GGTTGAGCCCAGGAAGTCTGTACCTGCCCG 

Sequence 3201 

CCGGGCAGGTACCTGGATAATGGTGACTGMTGATTGATGGMCTGACTTTTTGCTTTCC 

TTTTTTCTTTCTCCACTGGTGGCCCTATGTTCCTATTAAGAGAATCTTCCAGCCAGGCGT 

GGTGGC ACACGCCTGTAATCCCAGACTTTG GG AG ACTG AGGC AGG CAG ATC ACG AGGTCA 

GGAGATTGAGACCATCTTGGCTAACACGGTGAAACCCTCTCTCTACTAGAAATACAAAAA 

ATTAGCCAGGTGTGGTGGCACGCGCCTGTGGTCCCAGCTACTCGGGAGGCTGAGGCAGGA 

GAATCGCTTGGGCCGAGGAGGCGGAGGTTGCAGTGAGCCGAGATCCCGCCACTGCACTCC 

AGCCTGGGTGACAGAGCAGGACTCTGTCTCAAAAAAAAAAAAATCTTTCTGCCTGTTTTT 

TGAAATAGCCGCAGGAGTGAGGCMTCTCANCTCCTGCTCACACATCTTTTTGTAGAATT 

TCCATCACAMGAATGGNTATTAGAATCTCTACTCTTGGCCCAAAGNGTCTGNTNGCTTT 

TCTNCCAATCANGA 

Sequence 3202 
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CCGGGCAGGTACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGC 

TTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTT 

CCCGGGACTCTGCGTTGTCACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCC 

CGGGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTACCT 

Sequence 3203 

CCGGGCAGGTACCTTGAGTAACATTGGGAAGGACCACTCATTGAATGAACCATCAATCAA 

ATGATACATACTGCATCCTTCCAGAAAACATCTCAAAGCACTTTGCAGATGTTAACATTA 

AAGTTAAACTGAGTGTMGGGAGCCTTACAAAATAGGCCAGCTTGAATGCCTTATCCAAA 

AATGTAGAACTGCTTCAAGGCAGGTGTCAGTGTTTTAMGAAATTGAAGCATCTTGGGTT 

GAAAGAAAAATAAGAAAATCTCTATGGATGTAAAGATCATTCCCAGTTCAAAAGCTAANG 

ATGGGCAGCANGGTCCGNGTCC 

Sequence 3204 

CCGGGCAGGTACGCGGGGGGGCAATAGTGAATGAATGGGACCATCTGCGTGGGTCCCTAC 
ACGAGATGCTTGGAGGGGCTGGAGGTGGTGAGGGCTGCAGGAATGGCCTAAGGTTTGGAG 
AAT 

Sequence 3205 

CCGGGCAGGTACCAGAGACTCCTATTTATTTATACCAGTAAGGGCCCTCCCACTGCCTTT 
GTCTCC AGTAGGTCAGCTGCATCATCAGCTTTAAAAGCCCCCAAATATTGCTGAGCTTGC 

nrn \ aaaacttctcttg c aaactttataaatttgtttcagagc actataataaaatat 

TTCTTGCAAACATGGACTATAAGTGCCAGAGACACTGGAGAAATCTGAGGGTGGGGAGAA 

AGGA TMT CTATCAGCTTGGAAGCTCCAAAGCACTGTCTTTAGTAGCGCTATTGGTGATG 

GTGGTTTTGAGTGCTTCTCTCAGTGCAGGGTGAGCCAGGTTTCTCAGTGGCATTAGATAT 

ATACTCCATGACATTGGGACTCCCCCCGCGTACCTN 

Sequence 3206 

AGGTACAAGGCATTTTTGCTMCTGTAACTTCCCACTTAATCAACAAAAACAAAAACACT 
C ATTTCTG AACATTCAGTG C ATTC ATGATTAATCTTAATTACAC C AC AAAGGTATTTTTC 
AATG GGTG ATTTTGCGGGAGTGGGGTAACAGTTTCGAAAGCAACATTGTCAGAAACATAG 
TTGATTTTAMGGTTCTTTCTGGTGACT^ 

GAGTTGTATTTATTTCTCCCGGMTATTTCMGCAATTCAGAGTGAAAGGGTATACATTC 

CAATTTGCGTATGAGATAAAATTTAGTTACATTGAGAAGCTATTTTCTTTAGTTACAGGG 

AAAAAATTGTAGGGCTTTTGG AAGCCTCTTTG ATTTCTAATAGG AG GAATCCCTG AG C AC 

TGGTCCAAACAGAMTCATCTCTTCTTCATTGCTGTATTTCCCTCAAGCTCTTAGCAAAG * 

TGCATGGCACGTGAAAGCCCGGAGAAGCTGTTGGTTGAAAGAATGGATGGTGGTGGGCAG 

GMGCATCAGGGACATGGTTTGCTTCAGTCTATTGGCTGGGANAAAGGCCATTTAGGAAG 

G 

Sequence 3207 

CCGGGCAGGTACTTGGAATGCTGTGTGGAGTCCTTTTTCTACTTTTGTTCCTGAGAAAGA 

GAGATTCACAAA/;TTCCCACATTCTTCCAGTGGCATCTAAGGATCCCTTCCTAAATGGCC 

TTTCTCCCAGCCAATAGACTGAAGCAAACCATGTCCCTGATGCTTCCTGCCCACCACCAT 

CCATTCTTTCMCCAACAGCTTCTCCGGGCTTTCACGTGCCATGCACTTTGCTAAGAGCT 

TGAGGGAAATACAGCMTGAAGAAGAGATGATTTCTGTTTGGACCAGTGCTCAGGGATTC 

CTCCTATTAG/\AATCAMGAGGCTTCCAAAAGCCCTACMTTTTTTCCCTGTAACTAAAG 

AAAATAGCTTCTCMTGTAACTAMTTTTATCTCATACGCAMTTGGAATGTATACCCTT 

TCACTCTG^ATTGCTTGAAATATTCCAGGAGAAATAAATACAACTCTGTGTAAGGTCTTC 

TAAAAAAGCAGAAGTCAAAGTCACCAGAAAGAACCTTTAAAATCMCTATGTTTCTGACA 

ATGGTTGCTTTCGAAACTGTTACCCCACTCCCGCAAAATCACCATTGAAAAATA 

Sequence 3208 

CCGGGCAGGTACGCGGGATTTTTATACMTTAATGATATTCTCAATAGTAATCTTTGTGT 
GTGTATATATATAGAAATAAATTCTAAATGTAAGTTAATATATTTATTATTTTTCTAM 
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ATATATAAATATATATATGCACACAGGCTATTTAATTTTATTAGATGATGCTATTTTAAT 
TCAGAAAAAAATGACATTTATATTTTGATTTAGGTTAGTATAAGCCCTTAGAGGTGTT^ 
G AC AACTCTCTTAATTTN G N GGTTTTTACTGTTTATTTN G ATTTTATATAAN CTAAAATA 
CCATTGGTTTTTN 
Sequence 3209 

CCGGGCAGGTACCTAGCACCAAMCGTGTTTTGCCACATTATGAGGAGAAATTAAGACAC 
ATGCAAATGGCCCAGAGTTTGGTGGTTTGGCAGTCACAAGGAGGAAAGAAATGTGTGACT 
CCAATAAGAGTCCCTCTATAGTCATTTGGTTCCCTCGGCCCAGAAAGACAAAGGGGAGTT 
TATTCAGAGTGACATAAAAGAAGGGTCGCACAAGCAAATCAGTGACGGGGCAAAGCATTG 
ATGGTTTGAAGGAAAACATGCATTATCTTTTCTTAACGGTC 

TATTCATTACATTGGCCAACCAGCCTCTAAGGACTTTGGGCAGCCCAATGGAAGGCTCTC 
AAAGGCCTTTGAGTTTAGCTTTTGGGCTTACGATGGTTT^^ 

AAAGCCATCCTGATGTGGGACTCCTGAAGTGCGTCCTACTGAAAGCCAAATGGAGAACTG 

AGGAATGTCATCCTCTCCACCCAGTCATGTACCT 

Sequence 3210 

CCGGGCAGGTACGCGGGTCATCATCATAGTGAGCCCAGAGAGCCACTGCCCAGCAGCATG 

CTCACACCACCTACCCTAGTGTAGGTAATAGGTCTACGCTAGGACCCCGTGCTGGGCTCT 

CAGCCCATCATGAGATT7TGGTGGATTTMTGGCAGGTAGGAACTTATTTATAGTGGACT 

GATAATTGCTTTATAATTCCTTGGTAATGACAGCTCAGGGAAGGTTTCACAAGGTCATGA 

TCAGGAGACTTGMTTGTTACTGGATGTAGGAATTGTTTCACTGCTCTTMCTTGCTCAA 

ACTGGGGCAGGTTCCAGGAACTTGMCTAAAAATATCTATTTMGCCTNTCTCTCTTTCT 

CTCCTCCCCAACTTTTTTCTGAAAGCCTTGATT^ 

ATGTTGGGTCMGACGTGTTTCTATAGGAAATGCTTGCACGTGGCACTTGGCTTGGTGAG 

TCAAGCTTCCTGAGCTTCANACCGTTTGTCCTGTCATNTGTCCTGCCCA 

Sequence 3211 

CCGG GCAG GTACAAGG CCAC TG ATTATCTAG ACCTGGCTTTCTTCTTC AGTC TCATC ATC 

CATCCCCACCCCACATTCTCTCTCCCCATCAGGTTTGTGCAAATGTTGTTTGGGATACTC 

TTCTCTCTTTGGMTCCCTCCTGTTTGGMTACTCTTCTCTTCTTGCCTTTGGCATCTCC 

TCACCACGCACGTTCTGCTAATTCCTAAMCTGCTTATAGATTGNGGGCTTAAGCTTTGA 

CTTTTTTTCAAGGATGCCTCTGCTAACCCCCTAAAATTCAANGGTAGGTGCTCCTCTTAA 

GGTGCTCCATAGTTCCCTGTACCT 

Sequence 3212 

AGGTACACACTGAGGACCTTTCATCCAGCACGTGTGGAGGCTCCTCGAGGGGACTCGGAG 

TGAAGACAGACTTGGTCTCTGCCCCCTCGGGTCTTACAGTCAAGCTCCAGTTGAAGTGTG 

GTCATTACTACAGTAAATGAAGCACAACCCATTATTATCTCTGCCTCAGGGGATCAAGAG 

GAAGTTTCTAGAGTAGGAGACATTTCAATCAGGGCTTGGGGAGCCTGCCAAAGGGAATGA 

GGGCACGTCCCACATGAAGCAAAGGCATCTGAGTTCATGGCCCCGCTGGGGATTGGCAGG 

CATGAGTTGCAGCTGCAGGGCAATAAGCGTAGGGGAGTGAGTGGAGAAACCCAGGTGGGG 

0 

Sequence 3213 

AGGTACACACAGTCAAAAGTTTTCCGTGACTGTGAAATCCAGTGCAATCTGCTTCCCAGC 
AGATGAGGAGCCTGACGTGGTTATGTCTGGATTTAGTCCAAGAACAGAGCAAAGCTTTAA 
TTGTCAAGAGACCCAGGCCTCATTCTCCACTTAGGAGAGTTTGCTAGGGCAGGTACCTGC 
CCG 

Sequence 3214 

CCGGGCAGGTACGCGGGACAGCAGCCAGCCGGGACAGCCAAGTGGTTCGGAGAGAAATTG 
CTAAAGCCCTTAGCAAAAGCTTTACGATGCACTCACTCACCATTCAGATGGAATCTCCAG 
TTGACCAGGACCCCGACTGCCTTTTCTGTGAAGACCCCTGTGACTAGCTCAGTCACACCG 
TCAGTTTCCCAMTTTGACAGGCCACCTTCAAACATGCTGCTATGCAGTTTCTGCATCAT 
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AGAAAATAAGGAACCAAAGGAAGAAATTCATGTCATGGTGCAATGCATATTTTATCTATT 
TATTTAGTTGCATTCACCATGAAGGAAGAGGCACTGAGATCCATCAATCAATTGGATTAT 
A 

Sequence 3215 

AGGTACCTTGCCTTTGGTTACCCCTTCTCTTTACAAGGGCAGGGTAAAATATGGGGCTCA 

GCTAGGCTGTGGGAATAGGGAATGGTGTGGGAGAAGCTTGGGCCTAGGAGCAACATCTGT 

GGGACTCTTCCTTCTATGCCCTTCTGAATCTATTGAGAAGGATCAGCTAACTCTTTGGTG 

CTATGCTATTAGAAACCCGGAATTAAAATAGMTATTTCTTAAAATGGAAGGGCTTGCAG 

TTAACTCAGGTCTGTACCTGCCCGGCCGGCCGCTCTAGAnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnCTTATCGATACCCGTCGACCTCAAGGGGGGCCCCGGACCC 

ANCTTTTGGTTCCTTTAAGT 

Sequence 3216 

AGGTACATTTTTAGCTAGTTACCACATGGTGCCAAAACCCTGTAGCACTCCGTTAAGACC 

ATCTGTTAAAATGGCTGTCAGGTATCATGTAGATMTTGACTGGTTTTCTCACCTTTTAA 

TTAGTTATTTTGTCATTGTGTTGACGCAACTTAGGTGAMGGTTTTTCTTTCCTAAG 

TCATAGGTCATTTGCACTTACTCAGTTACAAATGGCTATTTGGGGGGCACTGACGTCATG 

AATGATTTCATCTTACAGGTCAGAGCTATCCTATCTCCCAAAACAAC I I I I I CATCTTCT 

AAGATGATGAAGGTCTAATTAATCTTGGCCTTCACAACTGCTCTCTCAGCACTGGAAAGA 

GCCATTCTCCTTTATACTGCATCTAAGCAATTTACATTAAAATAMTACGTTTCAAAGCT 

CTATATTCATACTGAGGTTTTGAATGAAAACAGGGAAGAGAGMCCACTTCTTTATAATG 

GACAATGAACTCACTGAATCATGTGCTGGACTTTTTAAAGGGTAACAAATCAACCCTAAA 

AATGATTTAAAAAA 

Sequence 3217 

AGG TACAA GTCTGAAGCCAACCTTGATTTTTTTGAAGGGATATACACTAAATTATAGAAA 

ATTATTTTAATATCAAGAACAAATGGCTAAGGGGACAGGATTTACTGCATGAGTTTTAGC 

ACTGAGT7TTTCTGAGTCATCTAGGGGAAGGGAGATGGGTAAAATTGTAACAAATGGCAA 

AATACACACACAAAAGGTTTTCTTAAATAACAATAGCACTGAAAAAGAGiSTAGTCCGGAA 

GCCCCAGTCATTGCCTTATGTCTCTTTAAATCATTCTTCMGTGGAGATTTATTTCGACG 

CCA7TCTCCTGCCTCAGTCTCCCGAGTAGCTGGGACTACAGGCGCCCG 

Sequence 3218 

CCGGGCAGGTACCCATTAAGTAG I I I I I CGACCCATAAACTCTCCTGTCCCAGTAGTTCA 

CAGTATCTGTTGTTCCCATGTTTATGTCTGTGGGTGCTCAATGTTTAGGTCCCACTGATA 

AGGGAAAACGTGGTATTTGGTTTTCTGTTCCTGCATTAATTTGCTTAGGAAGGAGGATTT 

TCTTAMCATCAGTCTGCCTTTTCTCTMGGCTTTATGATATGAGGCATCCGTATTCATT 

GAATATGGAGCTTAGAAATATAGGTATTGAGGGGCCACCACACCAGGCTGACTTCTTGAT 

CTTGAATTTTATAAGGATTCTAAAGGGCATTGGGTAGTTCCAGGACATCTGAGACC 

Sequence 3219 

GAGCAGCTAAAATCTAGGATGGNGTTAAGTTTCITCATTTTGTCAATTATATATAAAAAA 

TTAGAAACAACATGAATCTGCATTTCTTGGATG^»GATAGTTAATAACAAACTATTTCTCA 

ATATTTGTATACTAAAAACTAGTGAAGGNGTTA1GTGTTTCAGTATCTTATCTCTTATTT 

GAACATGGGTTTCTGAAAGGAGCCTATATAATAATATAAATGGTATGTAGTAAATGAGGC 

ACTGTCTTGGCTGGGACTACTATAAAAAAATTACCATAGGCCATTGACTAAACCACAAAC 

ATATACTTCTCACAGTTCTGG 

Sequence 3220 

AGGTACTCACAGCTGCATCACACTGMTATCATCTCCCTGCACTGCCATTTCATTTGAGC 
AGGGCTTTG CCTGTCAAG AAC ATTCATATCTTTTG ATAAACAGTGC A I I 1 1 I TCCCTGAT 
AGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGACAGGGACTTCAGGACCTCCTG 
CTGTGGTCACCAGGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGTCCTTCTCAGT 
TTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTATCAATACACCCTCTGCTTTAGCCA 
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GGGGGACCTCCCACTGTCCTCTCTATGTGTTCTATTTGTTCCTGACTGTTAGTACCTGC 
Sequence 3221 

TTGAGACTGCCNTGGAAGACTNGAANGGTCACGTAGCTGAGACTTCTGGAGAGACCATTC 

AAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTAC 

TGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCNTCATGCTGAGTTGTGTGC 

AGCTTGCANCGGATTCCTNTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTTCACCTT 

CCCTGGGAATGTCCCTGGACAAGAGACANGGGNAGAAGGCCTTAGGATCTACTGGGGGAA 

GTCCCGGAGGAGCAAGTCTCTTCTGTCCCGCTG 

Sequence 3222 

ANGTACGCGGGAGGATGGACAGAGTAAGCACACGTGTTANATAGAAAAGATAGAAAAGAA 

TTGCTGTTTCCTGATAAAAGACTAAATNATGAGAAAACACTTAAACTACCGAACACCTGA 

AMCACTGAATAANATTTGAAGCCATCTTTTTTACAAAAAAATAN GCCTGTNN GAN CTTC 

CAAGGGAAAAAGNAAAAGGGGTAAAANTGTTTNCAAANAACACAAAAAATNGGNCTCCCN 

TGTTGCCAAGTCNNCCNCANCMCTTCTACNTTCTACTACTTTAANTTNGGGTNANAAGG 

CATCCAAATAAAAAACCCCTTAA 

Sequence 3223 

CCGGGCAGGTACAGCATGTATGGCCATAAAACCACTCATCACACTACTGAAAGACCCTCT 
GCCTC ACTCTATG AAC AGC AC ATGGTG AG CCCC ATGGTTCATGTCTATAGAAGTCCATC C 
TTTGGTCCAAAGCATCTGGAAGAGGAAGAAGAGAGTNAATGAGAAAAAGAANGGAANGTG 
ATTGCAAAMCCATCTTNCCAAMGAAMGTTCTTTT^ 

CATTTACACCCCACCCCCCANCAGGGGGTTCCAMTATNATTGNAAAAAATTAACAAAAA 
AACCTCATCTGGAAACCNCAANATTNCAAANCCAGNAAAA I I I t I I I I I AANTCCCCTTT 
TNCTCAAAAGGNAATTGGCCCCAAGGCCTTCCAATTTTGGGTNAACCCANGTAAAANACC 
AN I 1 1 I I III I AAAGCAAAAAAMNAMNGTAAAAMGGGGGGGNAAMCCCTTTTTCTA 
AGTGCCCAANACNTTNGGGGGGGAAAATTTCCAACCNAAGGGAAAAATTAANCCCCTTAA 
AMGGGGGNAAAAAAAAAACCMTTTTTGGCCTTCCAMGGNCNTTTCCCCAAGGCCCCC 
TTGGAATTAATTTGGGGAAGGGGGCCMCCAAATTTTAATTTCC 
Sequence 3224 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGAATGCTGTGT 

GGAGTCCi I I I ICTACTnTGTTCCTGAGAAAGAGAGATTCACAAACTTCCCACATTCTT 

CCAGTGGCATCTAAGGATCCCTTCCTAAATGGCCTTTCTCCCAGCCAATAGACTGAAGCA 

AACCATGTCCCTGATGCTTCCTGCCCACCACCATCCATTCTTTCAACCAACAGCTTCTCC 

GGGCTTTCACGTGCCATGCACTTTGCTAAGAGCTTGAGGGAAATACAGCAATGAAGAAGA 

GATGATTTCTGTTTGGACCAGNGCTCAGGGATTCCTCCTATTAGAAATCAAAGAGGCTTC 

CAAAAGCCCTACAATTTTTTTCCCTGTAACTAAAGAAAATAGCTTCTCMTGTAACTAAA 

TTTTTATTCTCATACCGCAAAATTGGAATGTATACCCTTTC 

Sequence 3225 

CCGCGGTGGCGGCCGAGGTACAAGATGCACACAGATGGGAGGAGCTGCCTTGAGCGAGAG 

GACACTGTCCTGGAGGTGACAGAGAGCAACACCACATCAGTGGTGGATGGGGATAAACGG 

GTGAAACGGCGGCTGCTCATGGAAACGTGTGCTGTCAACAATGGAGGCTGTGACCCCNCC 

CTGTAAGGATACTTCNACAGGTGTCCACTGCAGTTGTCCTGTTGGATTCACTCTCCAGTT 

GGATGGGAAGACATGTAAAGATATTGATGAGTGCCAGACCCGCAATGGAGGTTGTGATCA 

TTTCTGCAAAAACATCGTGGGCCAGTTTTTGACTGNCGGCTGCMGAAAAGGGATTTAAA 

ATTATTAAACANGATGAGTMGTCTTTGCCAAGGATGTTGGGATGAGNTGCTCTTTGGGA 

Sequence 3226 

CGCCCGGGCAGGTACTACGCTCTTGATACGAAAAAATNTTATCACGTGCCCAGCCTCTCT 

CACAGTCAGAAATAACCATGTGACACAGTTCTAGCCAACGATAACATGGATCAGCTACAT 

GAAAGAGGGATGATGGTATTGGAAGCTTTTGTTTTCTTTTMmGGGGAAAN 
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AATTACAAAGTTTTTCNA I I I I 1 1 I 1 GAATTATTGAAACCAAGGGGNGAATTGGNGGAAA 

AAATTAACCTTGGGGTTTTAGNNAANGGAAGGCCNGTTGGGGNTATTCCCCCTTTGGGGN 

CAAMTAGGNGCCTTANCNCNCCCCNCTTTCGANAAGTCNCCTTGGGGNAAAAATCCCCT 

CAAGCTAAACCNCC I I 1 I 1 AAAAATTGGGAATTAAAAGGAAAGGGTCCAATATTCCCCTT 

GG I ! I M I i 1 1 IGGCCAANCTTTTGGGANAAAAAGGTTATGGNCCCCTTTTTNTTGGGNC 

TCCNTTCCNCCCAATTGGNCCCCCNTTCCCCCCAANTNCCCCTTNGGTTAACNCCTTACG 

NGGCNCCCGGCCTTTCNTTMGGAAAACCTTTAAGGGNTGGGGGGAATTTCNCCCNCCCC 

GGGGGGGCCTTTNGCCAAAGGGGTAAAATTTTTCTGGAMTNAATTCCAAMGGGCCCTT 

TTTATTNCCGGNAATTTACCCCCGGGTTGCTGNNAACCTCCTTCCGNGAAGNGGGGGGG 

Sequence 3227 

CCGGGCAGGTACAAGCCCTGGACCTGGCCCATTCTTCATGTCCCCTTCCCAAGCCTCCAA 

GCCCACATGGGCACTTGCCAAGATCAGAGCTCCAGGGGCCTCCAGGGACGGGGTTTTCAG 

TCCTTTGGGACCATGATCCACAGTAGAGAANATTGATCCTAANTTTGNANNAACACCNAG 

GGAANGCCNAACCACCAATGGTNAAATGGNAAACCAAGCCCAAANGNCAACTCCCCAGAN 

CGNANTTCANTTCNNTAATATGGGCCAAANCNAAGTTCGCAANAAANCAACCCCACAATG 

NGCCGGAANTGCCANTCCGTTANAGTTGGNCAAAACCAAAGGGCCNCCAAAAAACCNAAC 

NNCTNCAAAGNTTTGGNAATTCCAAATNNTCTTAAAANTNGGTCCAAAGACCAAAGGCCT 

CAATANATCCATCTCCCCAMGTNCCGGGMTTTCAATTTNTTTAAAATTGGNCCAAAAN 

CAMGGTCNNCAAAAAAACCAMCNCCNCNAAGNTCNGGAATTTCCAATTTCNTTAAAAA 

TTGGGCCNAAANCCAANGACCGCCANAAAAACCAANCNCCTCNAAGGTGGGGAAATTCCA 

AT 

Sequence 3228 

TTGGAGNTCCCCGCGGTGGCGGCCGAGGTACCCAATGAAGTGAATGTGCCCCCACAACAT 
TTAACAGTNATNTCAGCAGGAAAGGGAAAACGACTGCGAAAATGTAAACAATTCCTTGCT 
TCAGCTACTTGTTAACTATGGCTGCAGCTGTCTGAGCCAGCTGTCCAGTGGTAGCCTGAT 
TCCCAGCGNGCATNTGGGAAGCANNTGNGAAGAAATGCCNATAAAGAATNNNNAAAGGAC 
C AN ATATGCTAG ATGGAAACCAACTAACCATGCTAAG GCTTCC ACCTCCATATGTG TGG A 
GGGAAAGCCCTAACTCTGCCTCCTGGGAATGCCAGTGTCTCCTTCTTAGTCAGTTAATTT 
TTACAACCTTTCCTTGTTGCTTCCAGNTCATNCTGTCTTCTTAANCTCTTTGCTTGTGTT 
TCCCAAAAACTTTA \ I II T I II I J CATAAAATCATTAATATTGCTTTTCCCACTT 
Sequence 3229 

TNAAATTGNTTTATTCCCCCTTCACCNANTTTTCCCCCANAANTCATTACCGGAAGTCNC 

GGGNGNAGGCCANTTANAAANNTTGTTAAANAAGNCCCTTGGGGGGGTNGCCCCTTANAT 

TGGAAGATGGNAGGCCTTANANCCTTCCCACCAATTTTAANAATTTTGGCCNGNTTTTG^ 

CNGGCCTTTCNACCTTGGNCNCCCCNGGCCTTTTTTTANCCCMGATTTC CNGTG CGGNA 

AAAAAACCCCCTNNGTTTCCNGGTTGGNCCCCCAACGNCCCTTNGCCCMTTTTAAAAAT 

TNGTAAAAAATTTCNGGNGGNCCCCAAAAACCCGGCCTGTCCNGGGNNNTGGTAAGGAAA 

GGGGGCCCGGGNTNNTTrrGGCCCCGGTTTAATTNTTGNGGGGGCCCGGCCNTTCCTTTT 

TTCCCCGGCC 

Sequence 3230 

GCTCCCCGCGGTGGCTCGCCCGGGCAGGTACGCGGGGTGGGGGGAAAATGTGTATGTGAG 

MTGATAAAATACATTGTATGCATATATGCACTGGGTCACTGNAATATGTATTTCC"'ACT 

GTGGATCATGGCCMCAGAGTTTGAAAACCTCCATTCTTTGAGGTAGAAAAGCTGACTGG 

ANTATCTACCCTGGGTGGTTGGAAGCTGCCAAGGGCACCCAACCTGGTCAACAAATTAGT 

GTGCTCAGCCAGAGGCAGTCCTGGATGAATTCAGGCTGCAAAGTGCCCTGTGACCTGGCC 

CTTGGGAAGCCACGCTCTGATGAAGGTTTGCAGATCTCCCCGCAATCCAGGGCCCAAGCC 

ATGAATGAMGGTGTTAGGCATTTTCAAGGGATGCCGGCTAGGACCGTGCCTGCTTGTCT 

TCAAGGGTTT 

Sequence 3231 
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ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAGAATTTGAAAAACACAG 

CTGCTGMGAACATGTTGCTCAAGGAGATGCCACTCTTGAACATTCCACAMTTTAGACT 

CCTCACCATCCTTAAGTTCAGTGACTGTTGTGCCTCTGAGGGAATCGTATGATCCAGATG 

TAATTCCTNTGGTTGGCCAAMGAACTGTTTTNGGAAGGTNGCACAGCCAGCACCTCCCC 

TGCGGATCACTCTGCTCTCCCTAACCAAAGTCTGACTGTTAGGGAATCAGAAGTCCTTAA 

GACAAGTGACAGCAAAGAAGGTGGNGAAGGTTTCACAGTAGATACACCAGCAAAAGCAAG 

CATCACTAGCAAAAGGACACATTCCAGAAGCTCACCAGGCTACTTTATTGGATGGTAAAC 
AAGGGAAAGGTAATT 

Sequence 3232 

GCTCCCCGCGGTGGCGGCCGAGGTACCCTGTTACTGAGGCAATACCTTTCTGGGTAATTT 

TCTCAATCCTCTGGGTATATTGAGGTTGCCTACTGTGGCTGATGAGGAGGCCAGCTATTA 

TTTCTAGCCTTCTCTGTCCATTGAGCATTGTTCTATCTGCTTCTTTTCAGTAGTTATTTG 

CCCTGTNAGTTTCCTCACATGTGTGCACTGATCAATACTTAAGCCTGAAACTGTCTGCAG 

CACTCCAGATCTTTATTTCTCTTTTACTTTGCAACTTGTTCTCTCTGTAGTTCTACTCTC 

TTCGCTCTCTCTCTCTTCCCTCAGCACAACCCTCTCCATCTCTACACTTCCTCTCACCTC 

TGCCCTGTGMT TCTAA TTGCATTTGCTGTCCCAAATCCTCTTTAACTCATTCAGACCAC 

TGG AGCTCTGG ATTTTTCCTCTCTG AG CTG C AGC C 

Sequence 3233 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATATANGGCAGACACG 
TTTGCCTTCAGAAATTCAGAAATGCAGCTTTTGAGGGAGGTCAGCATCATTGGTCTCAGC 
TACTAGAGTTGMGATGATTCAGCCACTTTTATTCCAGCCACTCCATTTCTAGCATACAC 
TACAGAGAAGTGTATATTTAAAGAGGCCATTTTTCCTGCAGGGATGTTTATAAATAGTTC 

CTGGCAAAGGAGGTTGCCGTATAGGGTCCTACAAGTAACCTCCCCGCGTACCTGCCCG 
Sequence 3234 

CCGCGGTGGCGGCCGAGGTACCCTGTTACTGAGGCAATACCTTTCTGGGTAATTTTCTCA 

ATCCTCTGGGTATATTGAGGTTGCCTACTGTGGCTGATGAGGAGGCCAGCTATTATTTCT 

AGCCTTCTCTGTCCATTGAGCATTGTTCTATCTGCTTCTTTTCAGTAGTTATTTGCCCTG 

GTANGTTCCTCACATGTGTGCACTGATCAATACTTAAGCCTGAAACTGTCTGCAGCACTC 

CAGATCTTTATTTCTCTTTTACTTTGCAACTTGTTCTCTCTGTAGTTCTACTCTCTTCGC 

TCTCTCTCTCTTCCCTCAGCACAACCCTCTCCATCTCTACACTTCCTCTCACCTCTGCCC 

TGTGAATTCTMTTGCATTTACTGCCCCCAAATCCTCTTTAACTCATTCAG 

Sequence 3235 

ATACTCAAAAMCATATATTTGCATGTGCCAGGAAAAGGTAAGGAAAAGGGAACCTTCAT 

GTGCCAGCCCC T GTGCTGAGCCCTGGAGCTGTAGAGATGAGTGAGGTGGATTTTTTGTCT 

GTAAAGAGTTTGCCGATGGGACCAAGTCTGTCATCTCTCTTTCCTGGCCTGACCGAGGAC 

AAGCACTTNTTAGTAACCCTGTGAATACCGATGCATCCCGTCAGTCAGCAGCTGTTTGCG 

GAACACCAGCTGCAGGCCAGGCA 

Sequence 3236 

NAACCGTAAA/v^AGGGCCCGCGTTTGCTTGGCGTTNTTTTNCATAAGGCTCCCGCCCCCC 

TTGAACCGAGt^CATCACCAANNAATCCGTACCGCTTCAAGNTCAANNAGGGTTGGCTNAA 

MCCCCCGG/iCAGGGAACTTMTATAAAGNATANCCCMGGGNCGGTTTTTCCCCCCCCT 

GGGGAAAGGCCTTNNCCCTTCGGTTGGCCGCCTTCCTTCNCTTGTNTNTCCCCGGAACCC 

CCCTTGGCTCCGGCCTTTTACCCCGG 

Sequence 3237 

CGAGGTACCCAATGAAGTGAATGTGCCCCCACAACATTTAACAGTCACATCAGCAGGAAA 

GGGAAAACGACTGCTGAAAATGTAAACAATTCCTTGCTTCAGCTACTTGTTAACTATTGN 

GCTGCAGCTGTCTAGAGCCACGCTGTCCACGTCGNGTAAGCCTGGATTCCCAAGCCGTTG 

CCATCTTGGGGAAAGCCANGTGGGGGAAAGGAAATTGCCCANATTAAAAGGAAATAAATG 

TAAAAATGGGGACCCCAAAATTAATNGCTTAAGGTGGGGAAAAACCAAACATAAACCCAT 



Page 490 of 63 5 



WO 01/51628 



PCT/US01/00798 



Table 1 



TGGCCTTAAAAGGGGCTTTTCCCACCCCTTCCCATTATTGGTTGTNGGGGGAGGGGGGAA 

AAAAGGCCCCCTTAACCTTCTTGGCCCTTCCCTTGGGGGGAAATGGCCCCA AGGTGG GTC 

CTTCCCCTTTCNTTTAAGGNTCCAAG7TTTTAAMTTTTTNTAACCCA 

CCCCTTTTGNTTTGGGCNTTTCNCCAANTTCCAATT^ 

AMGGTTNCTTTTGGCCTTTGGTGGGTTTTTTC^ 

TTTTTNTTTTCCAATTNAAATTTTTCCA 

Sequence 3238 

CGAGGTACATNGTGGCCTTTCAGCTAGTGCAGAGGATGGAAACCAGAGTGGGCTGATGAT 

TGGATGCCAGGCCTGAGCCAGGTTGAGTCCTGTTCTCATTATACTGTAAAGTGAGCGTGC 

TNCAAGCCTCCCTGATTCTTATCACTCCCAGNCCCTGCAGGGGTCCCTGAGAAGGTTAAT 

CTNTGCATTTMGCNCCTCNTGCTGTTTTCCCCTCTGTNAAGCCATTACTACCACTNTTT 

GCAGCTGNAGNTGTCTAAGTATTAACCTTANTATTTTCCCATGTGTANCATTCANCATGN 

GTTTTNANTGGACCTACCNATTAAMNAGTANMNTTTTTNGCTGGANAACCAACAATAA 

AGNGGNGNAATTTGGNTNCAATMCCTTTTNAAA^ 

CTCCCAGNGGTTGGGNCCCGNCCAATNGNCCCTCTTCTTNAGNGNGCCCAAGAACTCCTT 
NCCCNCNAAANTTATATTCCTAMGNNCTTTTGGCCTTTNNAAATNGNCCTCCCCNNGGG 
GCCGCCTTNTTTGGCTCCTTTACNTTTCCAAAAAAAG 
Sequence 3239 

CTATAGGGCGAATTGGGAAGCTCTCACCGGCGGTGGCCCGGGCCCGCCCGGGCCAGGGTA 
CCNTCCTTCCCTANTGCATTCCTGCTGNTGGGGGAAAGGTTGTGGGAATAACCTCACCAA 
GAATGAATGGCCMCAGGGGGCAATTGGTTATTTTCAAGGGGCCACCCCAACCAAGTGGC 
CAAGCCAAGTTGGAAAATTGGAAMTTGGAAMGGTGGAAAAANTGGGGTTATNTTTAAT 
TCCCCATTTTrCCCCTTGGGGAAAAMGGGGTTCNCCAAGGGGGGTTNTTNGGGGCTTCC 
TNNGCNAAGGGGGNCCCCAAAGAAMNNAAACCCMGNCCTTTTTTMNGCCTTTGGTTG 
GCCCCNTTTTAAANCCCCCCAAGNTTCCCTTGNGGAAGNAAAAAGGCCCCAAAGCCAATG 
GNCCCCGGTTAAAATTCCAACCCGGGGGGGGGNAAGGGGAAAAAAACCCCCCCANTTCCT 
llllllll I AAAAAGGGGGGGCCNTNCNCCTTTCNGGCCTTTCCAAAGGGGTTNGGGGGT 
TTGGAAACCCAAAAGGGGGTTGGAMGGGGGTTGGGGGGTTCCAAATTCTNTAATTNGNC 
CCTTGGGTTNCNTTTTTTAATTTC^ 

ATNAAACCmGGGTGGGGCCCmCCTTTGGAAACCAATTTTGGGAATTGGGGCCTTNA 
AMNTAAATTNmCCCCATTTANTNTrAAAmCCAAGGGG GGGC CNmCCTTTNGGN 
TGGGGGTTGGNTTTTTTTAANGGGGCCCCNTNCCTTAAAMTTTT 
Sequence 3240 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGCA 

CCAGGAAGTTACTTAAAAACATCTGCAGGAGGACATTGGTAG CCTTT ACTAATACAGCAG 

TCTCTTTTTTGAGTAGTTCATTAATTGTTAGAACACMTAGCACTTTTGAMTAAAA 

ACTAGCATCCCAGATTGAAAGACTTTACTTGCCAGNTAAGCACATGGGCTGCTGTTAGAA 

GAAAGCTAACTTGGCTGGGTGCAGTGGCTCATGCCTGTAATCCCAACACTTTGGGCGGCC 

GAGACGGGTGGATCACCTGAGGTCANCAGTTCGAGACTAGCCTAGCCAACATGGCAAAAT 

CCCATCTCTACTAAAAATACAAGCAATTAGCCGAGCCATGGTGTCACCGTGCCTTGTAAT 

CCTAGCTACTGGGG 
Sequence 3241 

CGGGCAGGTACTAGCTGTTTTTGCGACTTTTAATAATAACACCTNCTAGTTACTGAGTGT 

CATCATCACCAATGTCATGATTGCCACGATTTATTGAGTGCCTACTACATGCAGGCCCCA 

GGTCAGTGCTTTACACTCTTAACTGTCATGTTGACAACAG CTCTG AATGACAGGAGCTCT 

GTTCAGTGTTTTACAAGTGAGAAMCTAAAGACACTAATMTTTTTAAAAATATATCCAC 

AGTTCTATGATTAGTAAGTAAGGGAGCTAGTCCT 

Sequence 3242 

AGGTACGCTTCATTTCCTCCAGCACCTTGAAGACAGCTTGGCATAGAAATAATTGTTGAA 
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TCAACATGTGCATAAATAAATGACTGATGCAATGGGCAAACCCTTTGCAGTGCTCTTCCC 

TGCCTTACCTGTATACCATAACCTNCACTGTGGCAGGGACTGTGGTGACCTCACTTGCCA 

TGGCCTCCCAGCTTATTCTTAGTGTCTGATACACAGTTGGATGATTAATAAATATTTGCC 

ATCTCAGTGAGTGTATGGACATGGTGAGTCCTCTGCCAATCCTCCCAAATCCATGCCTGC 

CACACCTCCACATTGCAAACCTGTCCTCCCACACTGCAGCAAGAGCCAGGCAGGGTTCTA 

CCAfGCAAAATGGACCTTCATGTCTCCTCCACNGCTACCCTAANGCTTCAGAATCTTCTT 

TAGATGGTGGGCCCCACAAAMTAGCTTTAGCTTTAGTTACAGACTCACAGATTTTACTT 

ATAATCTTAGGGCATTCACAGGTCTCTTGAAGTCCATGAA 

Sequence 3243 

AG GTACC ATCTCAAAG AACAC CTGTATGTATG C AACGTATTTAAGCAG AAAG AATG GGCT 

CAAGATAACCCTTGATTCTAG7TCCCTACAGCATACATGTGAATGGGAAACATTCATAAG 

TTCCTATTTGCCAAGGGCATGTAAATGTGATAAAGCAGATCCGAAGACACATGGGAGTGA 

GACTAGGAAAGTATTAAGAACATGGGCCTTGACCGGGCGCAGTGGCTTACGCCTGTAATC 

CTACCACTTTGGGAGGCCGAGGCTGGTGGATCACAAGGTCAAGAGTTCGAGACTAGCCTA 

GCTAAGATGGTAAAACCCCGTTTCTAAAAATTAGCTGGGCACAGTGGCGGGCACCTGTAA 

TCCCAGCTACTTGGGAGGCTGGGGTCACTTGAACCCTG 

Sequence 3244 

GGG GCCATTGAG ACTGCC ATG G AAG ACTTGAAAGGTCACGTAG CTG AGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGANACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTAT 

GCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAMTTCCTTGAMTTTGCAAG 

TTGTCTGGGTTNATGTCTAAGCTTGTTCCAGCTATCCANAATGCCCACAAGAATTCAACT 

GGATCTGGAAGAGGAAAGAAACTGATGGTGTTAACTGAACCCATTTTGATTGAGACCTAC 

ACAGGGCTGATGTCATTCAT 

Sequence 3245 

CCGGGCAGGTACGCAACATGACATTGGCTGGTGTAAAGATCTTACAATTATTTTTAAAGT 

TTCATTGTATTCATTTGATTATATAGTTTCTTGCCATCACAACAAACACTCCTATTCATG 

TCTAATTCGATACCTGGTGAGTCCACATAAGCATTTGGGTCCCCACTGATGAGAGCACCA 

TGGCCAGGTCACCCACTTGATCAGGTTCTCTTGCTTTTTGTTGTGCATCTCATCTTATTG 

TAGGCATTCAMCTGTGGGCTCAGGCAAGCCTTTCCAGTGTGATCATATTAATTTTATAT 

TTTATGTGTTMCATGCGCCTCCMTMCTTGCTTrCCTTGCTCACCATGATGATGCATT 

T1 TCCT ACTTTATGTCCTCGCTCATTGATTATGATGAGCTGCTGACTATTTGAACCTGGT 

ACTTTTACTTGATCACCTCTACGCAGTTTCACTCCCATT^ 

CAGTGTCCTCTCTCAGTCCTTATAGGCCTAGATG'rrCATTGTGGTTTCAGCATCTATTTG 
ATCTCCCCTCTCGTGTTTCATTAGCATTTCTTCMTAGTTTCTCAGGGTTATTTCTCATG 
AATCACCATTGNATACTTCTTAAAAC 
Sequence 3246 

CCGGGCAGGTACGCGGGTGTTCAGGGAATTACAGCATAGATATGAAAGATGTCTGAAAAA 

TTGGAAATATCAAATAAAAATTAAGMGCCTA/\AACTACATCAGATTAATATGTAGTATA 

GTCTTT CACA TCTGGCCATGATGGAATGGAGCAATGGGATTATATATACATTCTCACAGT 

AMCAGTTTTCAGATATTGAACAACAAAAAGTAGGCAGTGCTAGCCAGAGATTCCTTGGA 

GCCAAAAAACAAAACAAAACAAACMTAAAAATTGGTAATCTCTATATTGCTCCAGTTTG 

CTGCTTGGAGAAAGTTCTCAGGCTGCAGCTCAGAGAGGAAAAATCCAGAGCTCAGTGGTC 

TGAATGAGTTAAAGAGGATTTGGGGCAGCAAATGCAATTAGA 

Sequence 3247 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCTGTGCGTAGCTGCATTA 
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AACCAAATGATCAAGAATATCAAGATACATCTTCCTGGATCTTCAATCCCACTTAAAGAT 
TTGGAGAACATATCATTTTATATTAAMCCMTAACATTT^ 

TTCI I 1 1 I ACCTAAAAACAGAGTCTTTAAATMGTATCATTTTTGAGATGCATTGCCATA 

GGATACATACTGAGGTGTCTTCTCTTTCTCTTACCTATTAGAGGCATTTCTGGGCAAGTT 

TGAGGAGCATGGAATTAACAGTCATCMTGCATTGCTGGATTTGAAAATGATTTATATCA 

ATCAGCAGAGGACCTCCACTGGTGATGTCAGCAGGATCTGAGGATGTTCAGGTAGACATG 

CTATGCTTTGAATCTCCAAACTTGAA 

Sequence 3248 

NAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAACAGTCAGGAACAAATAGAACACA 

TAGAGAGGACAGTGGGAGGTCCCCCTGGCTAAAGCAGAGGGTGTATTGAGAGAAGGAAAC 

CTTTGCCTCAATGGATCGTAGGCAMCTGAGAAGGACCATAACACCTTGCTGAGAAGTTT 

GAATGTGATTCCTTGGTGACCACAGCAGGAGGTCCTGAAGTCCCTGTCCTTCANATGCAA 

ACAAGGAGGCTGTCCCATAC AGACTATC AGGGAAAAAATGCAC TG TTTATCAAAAG ATAT 

GAATGTTCTTGACAGGCAAGGCCCTGCTCAAATGAAATGGCAGTGCAGGGAGATGATATT 

CAGTGTGATGCANCTGTGAGTACCTGCCCG 

Sequence 3249 

CCGGGCAGGTACTTAGCACATGCCANGAACTGTTCTAAACCCCTTCTGTGCGTTAGCCCA 

TNTAATCCTCATANTAACTCAGTGCTNTCTCTGATATCCCTGTGTTACAGTTGAGGGAAC 

TGAAGTATANAGAAGGTTGAGCAATGGCATGCAGGGCATAGAGAAGGGTAAGGACGACTG 

TAAACAGGTCAAGAAGTGGAAGGTCAGATAATTTACCCTGATTGCACAACTGGTAAGTAG 

TGGTACCAGATTGAGGCTCAAGTCTTCACTCTCACAGCAGGGTACTTTCTGCAACTTCAG 

TTTCTTATTACCATTTCTGCCACTAGCATGCAGAAGACGCTAGATGATTATTGAATCAGT 

GAGTTTCTCATAGGTAAGATGGAAATGATTCTTAGTAGGGTTTTGTGAGAATAAAATTGG 

AAATTGTACATTTCTTTAACATATTTTGGGTAGTAATAGCTATTATGATGCTAGTGGATA 

ACACAAGAGTTCAAAGGTGACCAGTTAGATGTGGAGGANGGAAAGAGTTGTGTAAAAGAT 

GATGAAGTGTTCTAGCCTGATGGTGAAANGACAGTATTAATGGAMTAATTAATTTGGAA 

0 

Sequence 3250 

GCGAGGTACCGTAAGGGGAGTTTTGTTCCTACTGCCCATGAGTCTGTGGTCCCACATGTG 

TTGCTTTTGTCCATGTCATCATTTNCACAAGGATACATNTGAATGCTGTATTTATGCCTN 

ATTTNTANATACATGGGGAACTGNMCTATGTTTTTAGCTGAAGGT TCTTACATCA AAAT 

TATGATATTCCTAACACTGCTTCCCAGANMGACCCCCAGAGTATGTTTTGTTTTGAGAC 

AGAGTCTTGTTCTGTTGCCCAGGCTGGAGTGCAGTGGCATGATCTCGGCTCACTGCAACC 

TCTGCCTCCCAGGTTCAAGCGATTCTCCTGGCTCCCTAGTANCTACAGGTGCATGCGTGG 

CTA A1 I 1 1 1 GTATTTTTAGTAGACACGGGGTTTTGCCATGTTGGCCAGGATANGTCTCAA 

A 

Sequence 3251 

AATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTTAACTAGGGTTAGCCCGAGGTCAGG 

AGGGTGGATGCCTTATTATGGGATTAAGTGCCCTTCTAAGAGGGACAGTAGAGGGCTTGT 

CCATGCTCTCTCTCTCCATGCACAAGAAAGAGCACACAAGTGAGAAGGAAAGAGGCCCTC 

ACCAGAACCTGACCATGCTGGCACCGTGATTGCAAACTTCCAGCCTCCANGATTGTGAGA 

AGATAAATATCTGTGGTTTMGTCACCMGTCTGTGGTGTGTTTTATGGCAGCACAAGGT 

AACCCAGATGATGCTTTTCACCACTGGGTGGTCCANCCTTTCATGGAAGCTGTCAACTCC 

TCAGCCTTGCAAAAAATG 

Sequence 3252 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGAGGCAGCTTTT 
TTCTGCTCTCTGTTGACTTCTGAAGCCAGCCTCATGATCGTTTCTCTGCTAGCTTTTGCT 
TCCATCTCATGGACATTTATAGTCTCTTCAAGAATAACAATTTGTCCTTTCACGAATTCA 
TTTTCTTTGCGCAGGTCTCTMGCTTTAAAATTAAATCTTCATCTGATGCCTTGTCATTC 
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CTCTCATCTGGATCCAAGCAGTCACGCAGTTGAGTCAGAAATTCCTCATGTTTCCTGCAA 

TTCTTTGAAACTTGTTTCTTATTCTCCTCATTTTCTTTTGMCACTNT^ II MCA 

TTTAACTCTACAACTTTCTTCTTTAATTCCTGGNTCTCCTTGATTGCANCGTGAGCTGTG 

ATTTCTGTTCTGA7TTTAGAAGTGGAAAGTGCTGCTGATTCTTCTTCTAGCTCCTGAACT 

CTGTCTCTCANAGAGGTAAGGAGAGATGATTTTCTGGCATTGGTTTCCTTGTAGCTCTNC 

ATTTCANCTTTCAATTCTTGACMGAAACTTCTTTAGAAAGCATCTTGGATCGGANGNCT 

T GAAA CTTAAACTGANCCCATTCAAATTTGGACCAAAGTTGCTGCMGTTCACTTNGAGC 

ATTTTGMCCACAATTCCCAAMGGGGGTTAACTGCTCCCGGNGNGANCNGGACCTTCCG 

AAAGAAGATCGTNAGGTTNTTGCGGGGACANTTGAACTGMCCMTTCCCCCATTTNNGG 

AACTACTGCCAANCANNCNAANCCCGGGGGNCCCCATTCCCAGGCCAAAGGGGGA 

Sequence 3253 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATGCACACAGATGGGAGGAGCTGCCT 

TGAGCGAGAGGACACTGTCCTGGAGGTGACAGAGAGCAACACCACATCAGTGGTGGATGG 

GGATAAACGGGTGAAACGGCGGCTGCTCATGGAAACGTGTGCTGTCAACAATGGAGGCTG 

TGACCGCACCTGTAAGGATACTTCGACAGGTGTCCACTGCAGTTGTCCTGTTGGATTCAC 

TCTCCAGTTGGATGGGAGGACATGTAAAGATATTGATGAGTGCCAGACCCGCAATGGAGG 

TTGTGATCATTTCTGCAAAAACATCGTGGGCAGTTTTGACTGCGGCTGCAAGAAAGGATT 

TAAATTATTAACAGATGAGAAGTCTTGCCAAGATGTGGATGAATGCTCTTTGGATAGGAC 

CTGTGACCACAGCTGCATCAACCACCCT 

Sequence 3254 

CGGGCAGGTACGCGGGGCTGCTTAAAACTCTTCAGAGGTTCACCATCGCTTCCAGAAAAA 

GATCAATACTCCTGACGTGGCAACCTGTCGAGACTCAGGACCGCTAGGAAATCAGCTCTT 

GTTTACCCCTCTATCCATTACACTACGATCCAGCTGGGTTTAGGCATCACTTCCTCTGAG 

CAATCCTCCCTGATGTCATGAGGCAGTGCTGAGTGCCTTCTCTGGACCCTCTGTTTACTT 

GCCTGTTAAATCCTGAACATCTGGAAGGGAGTCGTGTTCAAGCAACATATCTACGCCTTG 

CAGCACGGTGCCTGGCCTGCAGCTGGTGTTCCGCAAACAGCTGCTGACTGACGGGATGCA 

TCGA 

Sequence 3255 

CCGCGGTGGCGGCCGAGGTACCAGGCTAGGCAGCTCTGGAGAAAGCAGAAGTGGATAAAT 

AAGGTGTGGACTCACCAAAGACAGTTCCAAGGTCAATTTCACTCTGACACACTCTCTGTG 

ATCTTCCACAGTCAGCACAATGCCTGCCCCTGTGGGTGTTGTATAAATATTTGTTGAATG 

AATGAATCAATCATTCAACAGACCAAGGCCAAATCAGAACCCCAAACCCTAAGGTCTTTA 

TACTCTCACTGTCCATCCATCGATCTTCCTGTCAGAAATCAGAATATACCTTTGCAATAC 

CCTTTGCTAGCCTTTCAGTTATCTTTTGMTAGAGGCTCTGAGCCTTGAAAATATTGCCT 

GGGAMTATTAACACCCATTTGAAGTATCTCCCAAACACCTCAATTAACTATATGGTGCT 

GTCTGGCCAGG 

Sequence 3256 

ACATTAATTTGCTTANGMGGAGGATTTTCTTAAACATC 

CTTTATGATATGAGGCATCCGTATTCATTGAATATGGANCTTANAAATATAGGTATl'GAG 

GGGCCACCACACCANGCTGACTTCTTGATCTTGAATTTTATMNGATTCTAAAGGGCATT 

GGGTAGTTCCANGACATCTGAAAACCTTGAGGGMGGTGCATTCCTCTGATTTTrGTGCC 

TCTGCCCAGCAGTCTGGCTTCTAACAAAANTGGTCAAAGGAATGCCTGACTCACATGAAG 

GCTTCTTGCAAMGTGTTCCAAAAGGAAAAAAGAAAGGGACCAANTGTCCTAACTTCCTC 

CAAAAAGACTCATAAGACAANCATTGGCTTNCAACCTGCCAAAAATGGGGGGAAATTTNG 

TTTTTACAATTCACCCAGGGAAAANGN 

Sequence 3257 

AGGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGATTTGTG 
TG ACCTTCCACCTTAATTTTCATCACATGTG G ACTCTAGTATCAC CTGG ATTATTC AGAT 
TGATTTTCTTACTTAAATTCAGTAGAAAAATTGTGAATCTGACTATGAGGGGAGGCAGTA 
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TGMCTCATTAAATATGTATTCCTGTTTGGAGAAATATTCGCTGGATAGCTAGAATTTTA 
GATTGGGAAATCANGTTTTACATCAGCMTCAMTGMTAGTAGmAGAAGTmAm 
TTTAACTTCTCATTTTGAAATAATTTCANATTTACAA 
Sequence 3258 

CCGCGGTGGCGGCCCGAGGTACTAGCTCCCTTACTTACTAATCATAGAACTGTGGATATA 

TTTTTAAAAATTATTAGTGTCTTTAGTTNTCTCACTTGTNAAACACTGAACAGAGCTCCT 

GTCATTCAGAGCTGTNGCAACATGACAGTTAATAGTGTAAAGCACTGACCTGGGGCCTGC 

ATGTAGTAGGCACTCAATAAATCGTGGCAATCATGACATTGGTGATGATGACACTCAGTA 

ACTAGTAG G TGTTATTATTAAAAGTCACAAAAACAG CTAGTACCTGCCCG 

Sequence 3259 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGTTCAAATAGTCAGCAG 
CTCATC ATAATCAATGAGCG AG G ACATAAAGTAG G AAAAATGCATCACCATGGTGAGCAA 
GG AAAGC AAGTTATTG GAGG CACATGTTAACACATAAAATATAAAATTAATATG ATCAC A 
CTGGAAAGGCTTGCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGATGCACAACA 
AAAAGCAAGAGAACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATCAGTGGGGAC 
CCAAATGCTTATGTGGACTCACCAGGTATCGAATTAGACATGAATAGGAGTGTTTGTTGT 
GATGGCMGAAACTATATMTCAAATGMTACMTGAAACTTTAAAMTAATTGTAA 
Sequence 3260 

CCGGGCAGGTACCTGGTGCAGCCCATATCCATACACAGTCTAACTCTGGATCCCACACTC 

TAAACCGCGATGCAGTAATACCTGGTTTCAGTGCACCTCAGATTCGTCCTCCTAGGCTGC 

CGCTGGAAGACTAGCAGCTCTTCATTCTGTTTTCCTTTTCATTGTCATAGA7TATTTGTT 

ACAACTGTCTGTGGMTGAAATCAGMTATTTTTAACAAATGGATATAAATCATGCGTCA 

TTTCTCTATGGNTAACATGTGTCTCCTTAAGTTGAAAATGAAATTCCAAACGGACTAGGC 

TATATACATGATTGTTTTCCACTCTGCCTCCCTTTTAAAGATGTCACAACCTGCCATTTA 

TMTGCAGAGCGGTTTCTCGCTCCATCTNTCCTATTTTCTAACCCCGGTAACAAAAACCC 

TCCTGCAGACAGCACCTTTT 

Sequence 3261 

ATGCTCNCTTNGCNTANNCATGGTCATMNTGTTTTNCTGNCATCCTGGTCTTGGCCAGC 

ACCATCACTATCTTAAAGGACTTCTCCATCTTACTCATGGAAGGTGTGCCAAAGAGCCTG 

MTTACGGTGGTGTGAAAGAGCTTATTTTAGCAGTCGACGGGGTGCTGTCTGTGCACAGC 

CTGCACATCTGGTCTCTAACAATGAATCAAGTAATTCTCTCAGCTCATGTTGCTACAGCA 

GCCAGCTGGGACAGCCAAGTGGTTCGGAGAGAAATTGCTAAAGCCCTTANCAAAAGCT 

Sequence 3262 

GTTCCTGGAATTCTATTATGTGAATGTAAACCTATATMTATATTTAGCCATTTTCTCAN 
7TGGATATTTTGGTTATTTCTATGTTGNC 

CCCCTTTGACTCCTTTCTTTCTCTCTTACTCCATGTGGNANACTAAATGTGGCTGCTA 
Sequence 3263 

ACCTTCTCACACCACATCTTCCAGGATTACACTTTTATGCTACACCTGCACTGATCAGTC 
TTCTACTCCAGTCTACATTTTGTCTTCGTAAAMTGTTGAGGATGAATGCTGTTCCACTT 
TCCTTTCCTATAGCGAGATGGCCTNCCCGCGTACCTGCCCGGGCGGCCGCTCTANAACTN 
AGT 

Sequence 3264 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACAGCTGCATCACACTGAATATCAT 

C TCC CTGC ACTGCC ATTTC ATTTG AGCAGGGCTTTGCCTGTC AAGAACATTCATATCTTT 

TGATAAACAGTGCATTTTTTCCCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCT 

GAAGGACAGGGACTTCAGGACCTCCTGCTGTGGTCACCAAGGAATCACATTCAAACTTCT 

CAGCAAGGTGTTATGGTCCTTCTCAGTTTGCCTACGATCCATTGAAGCAMGGTTTCCTT 

CTCTCAATACACCCTCTGCTTTAGCCAGGGGGACCTCCCACTGTCCTCTCTATGTGTTCT 

ATTTGTTCCTGACTGTTAGTACCTGCCCG 
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Sequence 3265 

CCGGGCAGGTACAGAMGTTCACATGAGAATGCCAAAGTATCTTGTATTCTGCATATCTT 
ACATTGTGATGGAAMGAAAMTTCAGAMTATTAAMCATATTTAGTTTTATATT^ 
ATGATT ATAAG TCTCCAA 1 1 I 1 1 CTTCAATATTCTAATAATCTTGTCTTCCAAAAGTAAA 
ATMTAATTTTCACAGAAAAGCAGCCAAGTACTGAAAATAACAGCTACAGTTGI I H I IG 
ATAMCATMTGAGAMCCATTAAGCMTGCATTTATTTAAAAMTTAGTTMTTTTATG 
TAMGTTTATCCAGTCAMTGAGTATGTAGMTAATTTATTATAACACCTTCATAGTCAT 
TCCTGTATTACAGAAATCTCTTTTAAGTAATGCAAAGAAGTTGGATGATTTAI rTTTTTA 
CTTTTTTACTTTCCAAATATAAATTCT 
Sequence 3266 

GCGGGACAAAAGCCCAGAACACACATGCTCTCAGGGGGCCCACCAGGGTTAAAGAGACTG 

AAAAMTAATTTTGGCTGCGAATTTAAGTGAGGATGAACACCACGAGGAGCTTCATATGC 

AGAAATGMGTCCTGGCTTTCTTAAAAAAAATAAAAACACTAATGCCTCAACAACAACTA 

CAACAGAMCCCTTTGGGTTTATTATTTCMGCCAGGTCATGCTAGTTTCACMGAGTTT 

GTGAAAATGGATGCACATGAATAGAGCCTTTAAAAACTCGTGATTAAAGAGAAAGATTTC 

ACMGTTGATGGTTTGGTAGMTGGCAGTCATAAAMCAA7TGCTCTTTGGTATATGTAT 

TGGGTNAAAAAAAAGTT 

Sequence 3267 

AGGTACTGAGAGGAGAACACACAAGCAGACAGATGGAGGTGTCCACAAGACAGTTGGGAG 

TTTGAGTCTAAMTTTGAGGGATTGATATAGATAGTTATATTTGGGGTTTCTTCACTGAT 

CAGTGTTAGTTAATAGCCTAAGAGTAAATGATACCTAGATTAAGAAGAGCAGCGGGCTGT 

GGACCGMCTTTGGGCACACCATCAATGACGAGTTACCCATGAAAAAGCCATTAGAAGAG 

TTG ATGGAG CATGAG GAGACCATGAG AGGACAACGTCACAGG AGTTGGGMG AGGGAGTG 

AGCGACACTGACAAGTACCTGCCCG 

Sequence 3268 

AATGGTCAACATGGCATCCTATGTTTTGGAGTCTAACTGGCGNGGGTGTCCTGCCTCTTC 

CATCCACTTTGCACAAGTTACTTATCCTCTCTAAGCTTCAGGTTTCCTTTTCCTCATCTA 

AAGAATGGAMCACGTCTGTTAGGATTCTTTGATTGCAAGCAACAGAAACAGATTCACTT 

TTGCCAGAMGGGTAATGTATTGGAATMTCTTAGTTCTCTGCCANAAACAAAGGAAAAG 

GCCTTGCAAANAATGGAAACTAGGCAACTCCATGGATCGAGATGACAGAAGAAGTATATT 

CTCCTCAGCATTCTGACACCAGAATGACTCCACCCTAACCATTTGTCATCCTGGGTTCTC 

ACAGGGTAAAGCTTGATTGGCT 

Sequence 3269 

AGGTACGCGGGGCTGCTTAAAACTCTTCAGAGGTTCACCATCGCTTCCAGAAAAAGATCA 

ATACTCCTG ACGCGGCAAC CTGTCG AG ACTCAGG ACCGCTAGG AAATCAG CTCTTGTTTA 

CCCCTCTATCCATTACACTACGATCCAGCTGGGTTTAGGCATCACTTCCTCTGAGCAATC 

CTCCCTGATGTCATGAGGCAGTGCTGAGTGCCTTCTCTGGACCCTCTGTTTACTTGCCTG 

TTAAATCCTGAACATCTGGAAGGGAGTCGTGTTCAGCAACATATCTACGCCTTGCAGCAC 

GGTGCCTGGCCTGCAGCTGGTGTTCCGCAAACAGCTGCTGACTGACGGGATGCATCGGTA 

TTCACAGGGTTACTAAGAAGTGCTTGTCCTCGGTCAGGCCAGGAAAGAGAGATGACAGAC 

TTGGTCCCATCGGCAAACTCTTTACAGACAAAAAATCCACCTCACCCATCTCTACAAGCT 

TCAGGGCTCAACACAGGGGCTGGCACATGAAAGCTCCTTTNCT 

Sequence 3270 

GATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAACTGGGAGGGTTTTAGTCTGATAG 

CCACMTTTTGACCTAGGCAGGAAGCTTTACAGCTTGAGGCAGTTTCATGGTCTGAAGAC 

AAACTTCTTGTGACTTGCTGCCGGTGTTGGACTGAGGAGAGAGCCTCACTGGGTCAGGAG 

CACGAGAACAAAGTGGATCCCACTACCACATCCCACCCCTCCTGTTTCAGAGGCAGATCA 

TGGGACCAGGACTACTGAGAGTTCCATGGCCCTACCCATCATCTGAAATGCCCAAGAACT 

CCTCCCGATTAACAAAGGTCAAGCATAAACTCTATTGCCACCACCACAGCTGGTCCTCAC 
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TTTTAGGTGCTACCTCCTGTCCTAAAGGTTGATCTACACAGTCCCTTACAACATCTGCCG 

ACTCAAGAGCACCCGCGTACCTGCCCG 

Sequence 3271 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAAGCCAGGAAGTAAATTTG 

CATTACAAAAGTCAAATTCCTTTCCCTGGTGGTAGCCATGCAAAAGACTTGGCCACTTTT 

TCACTCTTGTCCAGGGAAGAAATTCTAGAAGTAGGAACCCATCTTAGTCAGAGTCAGGGC 

CACATAAAAAGAGCAAAAAGGTCATCTGGCATAGACCACCTCCCTGCTTCCTAATGTACC 

TGCCCG 

Sequence 3272 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGTGGCGTAGCATTCCATGTT 
CAGCTTTGACATTTATTTTCTCATATCAACCCTTTTTACACGTGAAACACAATCTCGCTC 
TTGAAGTCTTMCTGCACGATCTGTGAMCCTGTATTTATATTNTCTTCAGTGTATTCTT 
GT 

Sequence 3273 

CGCCCGGGCAGGTACACAGGTAACCACATTTAGATGGACTGGGATGTTGCCACACATACA 
AGCATTGATAACTGGCTTCTCATTACCTGAATACATTCTTCTGTCAGAGCAACAGACTCA 
GCTATGCTTCTGGCAAAATTGTTCTTAATTCTCTATTGATTMTTTATTCGGTAAGTATT 
TATTG 

Sequence 3274 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAACAGTCAGGA 

ACAAATAGAACACATATAGAGGACAGTGGGAGGTCCCCCTGGCTAAAGCAGAGGGTGTAT 

TGAGAGAAGGAAACCTTTGCCTCAATGGATCGTAGGCAAACTGAGAAGGACCATAACACC 

TTGCTGAGAAGTTTGAATGTGATTCCTTGGTGACCACAGCAGGAGGTCCTGAAGTCCCTG 

TCCTTCATATGCAAACAAGGAGGCTGTCCCATACAGACTATCAGGGAAAAAATGCACTGT 

TTATCAAAAGATATGAATGTTCTTGACAGGCAAAGCCCTGCTCAAATGAAATGGCAGTGC 

AGGGAGATGATATTCAATGTGATGCAGCTGTGAAGTACCTGCCCG 

Sequence 3275 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCTCTGTTTCTGAGGAA 

GCAGCAGAGGAGCCTATGTGATCCAGANAACTTGTTCTTGGCTTCTGGAAGCNGAGATGC 

AGGAATGGACCAATMGGCATTAMCAMTTCATTTANAATCTCAAGAAAGAAATATCTA 

ACAGTATAATCATAATAACAAAATTTTGATTTCTTCAGAAGCACTCCAATAGCTTCACCC 

GCATTAGCTAATCTTGTGCAGTGAGGAGGTGCAATTCCCTCTAGTCTCCCGNGTACCTGC 

CCGGGCGGCCGCTCGA 

Sequence 3276 

CCGCGGTGGCGGCCCGAGGTACGTGGCGTAGCATTCCATGTTCAGCTTTGACATTTATTT 
. TCTCATATCMCCCTTTTTACACGTGAAACACAATCTCGCTCTTGAAGTCTTAACTGCAC 
GATCTGTGAAACCTGTATTTATATTTTCTTCAGTGTATTCTTGTCGTGTGTGTGTCCTAA 
ACAAACCAAAAGA^ysACTTTCCAAATCTAMGTATTCATTCTCCAATTGGAGCAAGAGGA 
GTCAGTTAGATACTATCACGGCATTCATTTGTGGCTGGCTTGTCATATTTACTTAT 
Sequence 3277 

GGCGAATNGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTCAATAAAATAAGAG 

ATACTATAGTAA^ACTCTTACAGGAAAMTTATTAGAATTTGTGCTATACCAAATCAATT 

TCACAAAAATGAACTGCCTGTCCACTiSTTTTTCCACAAAAAATCCAGGCCATCTACACCT 

TG CTGG AAGG ATGG AACCTCAACTGCTATACTCCTTAG ATTGCTATACAG C AATCTAAGT 

GGCCCCATCACAGCTCCTCTCCTTGAACATTTGTTGAGATCAAGTTGCAGCTGGGCCAGG 

CACAATCGGTCACAAGTTAGACAGCTGAGGGCATTCCAAAGAGTAGTGTTCCTTCCTCCA 

CATTATCTGCCAAAGTAGTCATCTACCTTCAGGCAGAGCTAACCCAAGGCCTGTCCCTGA 

AA 

Sequence 3278 



Page 497 of 635 



WO 01/51628 



PCTYUS01/00798 



Table 1 



CGCCCGGGCAGGTACAGGGAACTATTGGAGCACCTAAGAGGAGCACCTACCTTGAATTTA 

GGGGTTAGCAGAGGCATCCTGAAAAAAGTCAAAGCTAAGCCACAATCTATAAGCAGTTTA 

GGAATTAGCAGAACG TGCGTG GTGAG GAGATGCC AAAGGCAAGAAGAGAAGAGTATTCCA 

AACAGGAGGGATTCCAAAGAGAGAAGAGTATCCCAAACAACATTTGCACAAACCTGATGG 

GGAGAGAGAATGTGGGGTGGGGATGGATGATGAGACTGAN 

Sequence 3279 

GTACAGGGAACTATTGNAGCACCTAAGAGGAGCACCTACCTTGAATTTAGGGGTTAGCAG 
AG GCATCCTG AAAAAAG TC AAAGCTAAG CCACAATCTATMGC AGTTTAGGAATTAGCAG 
AACGTGCGTGGTGAGGAGATGCCAAAGGCAAGAAGAGAAGAGTATTCCAAACAGGAGGGA 
TTCCAAAGAGAGAAGAGTATCCCAAACAACATTTGCACAAACCTGATGGGGAGAGAGAAT 
GTGGG GTGGG G ATGG 
Sequence 3280 

CCGCGGTGGCGGCCGAGGTACAACAGGCTTGTAGTTCTTTGTGAAGGATGCAGGCAAGAG 

GGATGTGCAGTTTAGACGGGCCCTGACAGCAGCAGCCTCTTCCACATCAAGTGAAGTGTG 

ATTGTTAACGATCATTGCCTGCACAAAGTCATAGTTGCCTTTGAAGTAGTCCACCCAATA 

CTTCATCATGACATAGCGGGGCAGGATTGGGGTGTTCACATTTTGCTGATAAGATGGTGT 

GTTAAAATCTAAAGCTTGAAGMCTGGATAAATTAMGCTTGTAGTTTGAGCTTATTTTT 

TTAGGCTGGCATCTTGAGTAAACTGTTGTCCAAGGGCAGCAGCCAGATTTCCACCAGCAC 

TGTCACCAGAAATGCAAATTCTGCCTGGATCAACCATATACTTCTGTAAGACTTCTGGCT 

TCAGGAAATACTTTGTGGCCCGTACCTGCCCG 

Sequence 3281 

CCGCGGTGGCGGCCGAGGTACAGAAGGCATGAAAGGCTTGGGGCAGACCTATTACATTCC 

TCATCTTCTTACAGCCCAAGAGAATATAGGACTCANAGATAATGATGCAGAATCTAGAGT 

TCTCAAATCTGGGAGAAAACCTCAGGCTCTGGGATTTCCTGTAAGCCCTGCAACTTCTTT 

CTCTCCACTCCCCAGCCCAGAAAAGCTAAACCTCATGAAACTTCATTTAATGAGAAGACT 

TGAGAAATGAAACACTCGGACTTGGAAGTATATCGTGAAGATTATTTGGTCTTCTTTGGT 

TCAAAATCATACGATGCTTCACTATTACCATTCAGAGCATCACAGAAGATTTCCAAGTAT 

CTTAGGCCAAGTGTGGTGGTTCACACCTATAATCCCAGCACTTTGGGAGGCCGAGGTGGA 

CAGACTGCCTGAGCTCANGGAGTTCAAGACCAGCCTGGGCAACATGGTGAGATCTGTTAT 

ACCTACAGCCTTTGGACCTGCC 

Sequence 3282 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGGGAATGACTGAGTAGTCACAAATTCAGAG 
AGCTGCTGGGAGGTAGATGAGTTGGGGCTGGGAGGTGTCCATGGGATTTGGGGGCTTGAG 
GNTCACGGTCACCTCAAGACAGCAGGATGGGGTG 
Sequence 3283 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAATGATATTTTCATGGGATTACTCA 

ATCTCACCCACCATTAGTTGCAGGTGACAAGAAAGCTAAGTTGGCAGATGTTTGTGCTAG 

AAGCTGTGGGTTTACGTCTCCTTTGTGCATGTGTTCCAGACATACCAAGTGGCTTGGTAT 

TTAAACATCATGCTCAGGTGTGCAGGGTAGTTTTTGAGTTATAATAGGTATGCAGGCGCT 

GNGGGATT 

Sequence 3284 

CAGGTACTAACAGTCAGGAACAAATAGANCACATAGAGAGGACAGTGGGAGGTCCCCCTG 

GCTAAAGCAGAGGGTGTATTGAGAGAAGGAAACCTTTGCCTNANTGGATCGTAGGCAAAC 

TGNAAGGACCATAACACCTTGCTGAGAAGTTTGAATGTGATTCCTTGGTGACCACAGCNG 

G AN GTCCTG AAGTCCCTGTCCTTCAN ATGC AAACAN GGAGGCTGTCCC ATACAGACTATC 

AGGGAAAAAATGCACTGTTTATCAAAAGATATGAATGTTCTTGACAGGCAAAGCCCTGCT 

CA 

Sequence 3285 

C CG GGC AGGTACACCTGC ATTAGGG AC AAGAAATCTCTATGTTG AAC AAATACTG C AGGT 
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GGTGCTGATGCTAAACTTTTACAAACATTGGTTATACAATTAATCAACAGTTATTTGATG 

ATTCAGCACACAAATAACTCTTTGCAACCAGGTTTACGTTACTGGGAACAAACTAACTAG 

GTTTCCCCAGGTGTGGAGAGCTTTCACACAATATACCCTAACTTACATAAACAAAATTAC 

CCAGGTAAATATTTACTAAACAGTTTTAACATAAATGATAGTGATACACAAATTTGAGTA 

ATMTTCAAGAAAATMGTGCACMTTAGGCTGTTMCTGCTGTTTATCTGTTTAATGTG 

AAATACCCACTAGCTTCCCTCCACCCTTTACCCTAACTCCCATAAAATCCTCCAACCAAG 

CCACCAACATATACCCACCATACTTGCCACTCTGTCCTCAAACTCCTGTTTGGGCATTTG 

CTCCTGGTTCTTGAAACTTCATTCTGACAMGCTGAATAAAATTACCTCCTTAGAAAACA 

GTCTGTTCATGATTAAATAACTAATAAAAT 

Sequence 3286 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGCTG f I I I I GTGACTTTTAATAATAACACC 

TACTAGTTACTGAGTGTCATCATCACCMTGTCATGATTGCCACGATTTATTGAGTGCCT 

ACTACATGCAGGCCCCAGGTCAGTGCTTTACACTCTTAACTGTCATGTTGACAACAGCTC 

TGAATGACAGGAGCTCTGTTCAGTGTTTTACAAGTGAGAAAACTAAAGACGCTAATAATT 

TTTAAAAATATATCCACAGTTCTATGATTAGTAAGTAAGGGAGCTAGTCCT 

Sequence 3287 

AGGTACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCG 

GGACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGGG 

ACTCTGCGTTGTTACCACTGCTTCCCGGGACTCTGCGTTGTTACCACTGCTTCCCGCGTA 

CCTGCCCG 

Sequence 3288 

TGGCGGCCGCCGGCCAGGTACNCGGGGGGACAGGCCATCTCGCTATAGGAAAGGAAAGTG 

GAACAGCATTCATCCTCAACATTTTTACGAAGACAAAATGAAGACTGGAGTATAANACTG 

ATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCAC 

AAGTGAAGCAGAGATACAGGAGATAGGGAAGGGAAGCTGGAAGCAGAGGTCACTGGAGGG 

AGAGGGAGATGGACACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCTCTC 

AATTGTGGGACCCCTCAAAATATGTACCT 

Sequence 3289 

CGCGGTGGCGGCCGAGGTACTCACAGCTGCATCACACTGAATATCATCTCCCTGCACTGC 

CATTTCATTTGAGCAGGGCmGCCTGTCAAGMCATTCATATCTTTTGATAAACAGTGC 

AN I I I I CCCTGATAGTCTGTATGGACAAGCCTN CTTGTTTG CATCTGAAGGAC AGGGAC 

TTCAGGACCTCCTGpTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGTTA 

TGGTCCTTCTCAGTTTGCCTACGATCCATTGAGGCA^AGGTTTCCTTCTCTCAATACACC 

CTCTGCTTTAGCCAGGGGGACCTCCACTGTCCTCTCTATGTGTCTATTTGTTCCTGACTG 

NTAAGTACCTGCCGGCCGGCCG 

Sequence 3290 

CCGCGGTGGCGGCCGCTCGGGCAGGTACTAGCTGl'fTTTGTGACTTTTAATAATAACACC 

TACTAGTTACTGAGTGTCATCATCACCAATGTCATGATTGCCACGATTTATTGAGTGCCT 

ACTAC ATGCAGGCCCC AGGTC AGTGCTTTAC ACTC r TAACTGTC ATGTTG AC AACAGCCC 

TGAATGACAGGAGCTCTGTTCAGTGTTTTACAAGTGAGAAAACTAAAGACACTAATAATT 

TTTAAAAATATATCCACAGTTCTATGATTAGTAAGT/sAGGGAGCTAGTCCT 

Sequence 3291 

TNACTACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCCGAGGTACAATTTGCATG 

CACGGACTTCTACATACAGAATACTTAMCAGTTCATTGTATTGAAACACGCTTAAAAAT 

GGCCCTAGCACTTGCTGCAATCCTGCTGTGTAGGAGTTAGCCAGGTTGTGTCTGCTCCCT 

CTAGACTATTCTATACATCAAGAACTAAAAAAGAAGAGCAATCCCAGCATGTGGCTGGAT 

GCACATCAAAGCCAAGTCTGTCATGAATTATTTTCCTTCCTGTTCCTTTCATTCAGGTCA 

TGAAGCACACATACTGCTACAGGAAAGGGCAAAAGGCAATGTCAAGTGCTGTGTAATTGT 

TGGGTTCTTTAGTGGCAAAGGCCAGCAATGTTGCAGCCAAAAGGGCATTTAAGAATACCT 
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GCCCGGGCNGGCCGCTCTAGAACTA 
Sequence 3292 

GCGAANGGACTCCACCGCNGTGGCGGCCGCCCGGCCAGGTACCTGGATGAAACTACACTA 

GTGGTGGTGGAGATGGAAAGGGACAGATTCAAGACAGATTAGAGAANCANNGGCANCAGA 

GCCTATTGAAGAACANGATNNNAGCCATGGTGAAAAGAGGAGATATCAAGGACGGTGTCT 

GTATTTCGGNCTTATACAACTGGCTGTGGATAGTGCTACTCACTCCAGAATGCTCAGGGA 

GTTTAACATATTCTNGTTTNGTGCATGAACANGGAGGCTAGTAGTAGGGTAGAATTCTTG 
AGAA 

Sequence 3293 

CCGCGGTGGCGGCCGCCCAAAGTGTTGGGATTACAGGCATGAGCCACTGCACCCAGCCAA 

GTTAGCTTTCT TCTA ACAGCAGCCCATGTGCTGACTGCCAGTGAAGTCTTTCAATCTGGG 

ATGCTAGTTTCTTTTATTTCAAMGTGCTATTGTGTTCTAACMTTAATGAACTACTCAA 

AAAAGAGACTGCTGTATTAGTAMGGCTACCAATGTCCTCCTGCAGATGTTT7TAAGTAA 

CTTCCTGGTGCTCTGTACCTGCCCG 

Sequence 3294 

CGAGGTACTAACAGTCAGGAACAAATAGAACACATAGAGAGGACAGTGGGAGGTCCCCCT 

GGCTAAAGCAGAGGGTGTATTGAGAGAAGGAAACCTTTGCCTCAATGGATCGTAGGCAAA 

CTGAGMGGACCATAACACCTTGCTGAGAAGTTTGAATGTGATTCCTTGGTGACCACAGC 

AGGAGGTCCAGAAGTCCCTGTCCTTCAGATGCAAACAAGGAGGCTGTCCCATACAGACTA 

TCAGGGAAAAMTGCACTGTTTATCAAAAGATATGAATGTTCTTGACAGGCAAAGCCCTG 

CTCAAATGAAATGGCAGTG C AGGGAGATG ATATTCAGTGTGATGCAG CTGTGAGTACCTG 

CCCG 

Sequence 3295 

CCGCGGTGGCGGCCGAGGACGCGGGGGCTTAAAACTCTTCAGAGGTTCACCAAATAGAAC 

TTTGGGGAATCCTTTTAAAAATCAAACGCATACTTGGTAGGGGAACTGACAATGGCAACC 

AGTTGCTCAGAGTTATTTGCTCAAAGGCCTACCTGCCTCCTAAGCCTCACTGTTCTTCAT 

CAACTGTCAGGGGTTAMCTTTCAAGAGCTACTCCCAAAAGAGCTGCAGGAAAATTCTGT 

CTAAACAGCCATGCAAAAGTGTCTGGGGATCACTACGGACAGAGGATGATGACAGAGGGC 

CCATGACATTGGGCCTTGTCATGGCTGAATACTGACCTTCCACTGTCATGGCTGAACACT 

GACCTTCCAGTGCCTACAGCTTTTAAAAAGAGTAAAAACTATGTTCATGGGTACCTGCCC 

G 

Sequence 3296 

AGGTACAAGCTCCACTGGAATAAAAAAGACTGTGTGGAAGTGAAGGGGCTCCTGCCCACA 

AGTGTGTCACCCCGTGTGTCCCTGCACTGCGGTAAGAGTGGTGGAGGAGACGGGTGCTTC 

TCAGATGTCACTCTGGCATTCACCTCTCTTCAGGACTGCAAGGGGCCTACTCTGTCACCT 

GTGGCTCTTCCTCTeCTCTCAGGAACAAACMCAAAAATCAAATGACTCTGCTTTTGGGG 

ATGTCACCACCATCAGGACMGTGTAACCTTTAAGCTAAATGAAGGCAAGTGTAGTTTGA 

AAAATGCTGAGCTGTTTCCCGAGGGTCTGCGACCAGCACTACCAGAGAAGCACAGCTCAG 

TAAAAGAGAGCTTCCGCTACGTAAACCTTACATGCAGCTCTGGCAAAGCAAGTCCCAAGA 

GCCCCCGGC 

Sequence 3297 

CCGCGGTGGCGGCCGAGGTACGCGGGCAAGGCTTCCTCTTTAAAAATCATACATTTAATT 

CGGGTAGGGMCGACTTGGCTTCTTTGTTCCTGTGAAAGAAGTATATAAAAATGCCCCCT 

TAGTMCTATGCAGTCAGCTTACTTCTCAGCAAAGTCATTGTGMCTTCTTCCCTGTCTC 

AGGTTGGAAATAGGAACCAATTCAAGGTAGA7TAAAGACTTAGGGCCGGGCGCAGTGGCT 

CATATCTGTAATCCCAACACTTTGGGAGGCCGAGGTGGGTGGATCACGAGGTCAAGAGAT 

CAATACCATCCTGGCCAACATGGCCCGTCTCTACTAAAAATACAAAAATTAGCTGTGCAT 

GGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCAGGAGAACCACTTGAACCCGGGC 

GGCGGAGGTTGCAGTGAGCTGAGATTGCCCACTGCACTCCAATCTGGTGACAGAGCCAGA 
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CTCCGTCTCAAAAAAAAGAAAAAAAGACT 
Sequence 3298 

CCGCGGTGGCGGCCCGAGGTACGCGGGGGGAGTCCCCAATGTCATGGAGTATATATCTAA 

TGCCACTGAGAAACCTGGCTCACCCTGCACTGAGAGAAGCACTCAAAACCACCATCACCA 

ATAGCGCTACTAAAGACAGTGCTTTGGAGCTTCCAAGCTGATAGATTATCCTTTCTCCCC 

ACCCTCAGATTTCTCCAGTGTCTCTGGCACTTATAGTCCATGTTTGCAAGAAATATTTTA 

TTATAGTGCTCTGAM CAAAT TTATAAAGTTTGCAAGAGAAGTTTTAAAAAAGCAAGCTC 

AGCMTATTTGGGGGCTTTTAAAGCTGATGATGCAGCTGACCTACTGGAGACAAAGGCAG 

TGGGAGGGCCCTT 

Sequence 3299 

CCGGGCAGGTACGCGGGGGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGAC 

CCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCAGT 

GAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAG 

GTCAGAGATGGGAAGAAACAAGAACTTTGACATGCTTGGTGTTCTTGCCCMGCTTTGAA 

GMGTTTACAAAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAAC 

Sequence 3300 

AGGTACAGACATTCCCTCATTTTACTCCATTAGMTTCTGGACCTCTTCTGTATTTCTTT 

CTGGGCCCTGCTACCTCCTAATCTGCATCTGGTGCTGGGAACCCAATAAGCCCAGCACAA 

TGCCAGTGTGAGGCATTCTGAGATCCGGGATTGGGTTGTGAGACTCCTGACTGAATTACT 

AGCCTTCTGAGCAAGGCTACCTCTTTAGACCCTTCTCCAGTATAAGCTGCATAAGCTCTG 

GCTCATTCCACAAACTCCTGCACCTAATCATGTTCCTGGACCTCAGACTGTACCTGCCCG 

0 

Sequence 3301 

CCGCGGTGGCGGCCGAGGTACGCGGAAATGTTAGACAAATGTAGCGAAACCAAAAAATAC 
ACATGTTCTATGAATCCATATATGCAAGGTTCACAATAGGTGATAGTAGCTACTCTTTTG 
GGAAGGAGTGAGGAAAGGACTGAGAGAGTATTGCAGAGGGGGCTCCGGGTGTTTGTAATT 
TC TATTTCCCAGTG CAGGTGTGGTTTGC ATGAG C ATGTTC ACTTTGTGGAAATTC ATTG A 
GCTGTAACTAAATATACAGAATTATTTATGAATGAAATGATATATAATTCAGTGGGAACC 
TGGGAGGATAGGGAGGAGGTGGGGGAGCTGAAAAAACATGTGTGGCCGAATGCTTAACAA 
TTGTTATTGTTAGGTTATGGATACCNCGGGGGGTGNGTTGTACCTGCCCGGGCCNGN 
Sequence 3302 

AGGTACCCTGTTACTG AGGCMTACCTTTCTGG GTMTTTTCTCAATCCTCTG GGTATAT 

TGAGGTTGCCTACTGTGGCTGATGAGGAGGCCAGCTATTATTTCTAGCCTTCTCTGTCCA 

TTGAGCATTGTCTATCTGCTTCTTTTCAGTAGTTATTTGCCCTGTAGTTTCCTCACATGT 

GTGCACTGATCAATACTTAAGCCTGAMCTGTCTGCAGCACTCCAGATCTTTATTTCTCT 

TTTACTTTGCAACTTGTTCTCTCTGTAGTTCTACTCTCTTCGCTCTCTCTCTCTTCCCTC 

AGCACAACCCTCTCCATCTCTACACTTCCTCTCACCTCTGCCCTGTGAATTCTAATTGCA 

TTTGCTGCCCCAAATCCTCTTTAACTCATTCAGAACCACTGAAGCTCTTGGATTm 

TCTCTTGAAC TTGC AGCCTGAGMCCTTTCTTCCAAGCAGCAAAACTGGAGCAATATAGN 

AGMTACCAATTTTAATGGTTGGTTNTGGTTTGGTTTTTN GG CTTCCAAGGAATCTTTTG 

GCTAGCACTGGCCTAACTTTTTGGT . 

Sequence 3303 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCCATTAAGTAGTTTTTCGACCCATAAACTCT 

CCTGTCCCAGTAGTTCACAGTATCTGTTGTTCCCATGTTTATGTCTGTGGGTGCTCAATG 

TTTAGGTCCCACTG ATAAG GGAAAACGTGGTATTTGGTTTTCTGTTCCTGCATTMTTTG 

CTTAGGAAGGAGGATTTTCTTAMCATCAGTCTGCCTTTTCTCTMGGCTTTATGATATG 

AGGCATCCGTATTCATTGAATATGGAGCTTAGAAATATAGGTATTGAGGGGCCACCACAC 

C AG GCTGACTTCTTGATCTTGAATTTTATAAG GATTCTAAAG GGCATTGGGTAGTTCC AG 

GACATCTGAGACCTTTGAGGGAGGGTGCATTCCTCTGATTTTTGTGCCTCTGCCAGCANT 
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CTGGCTTCTAACAAAAGTGGCA 
Sequence 3304 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTAACTTGGTCAGGGCAGGGTGACACATAAAA 

TTAACCATCACAGGGAAGGGTAGGGCTGGAGAGGCAGACTGTGGCCAGGTTAC AATGC GC 

TGAGGCTAAGGAGACTGTGTTTATCCTGTAGGCCAGTGGGTCTTACTCTGAAGTCTTTTG 

GGTGGGACATTCATGGACTTCAAGAGACCTGTGAATGCCCTAAGATTATAAGTAAAATCT 

GTGAGTCTGTAACTAMGCTAMGCTATTTTTCTGGGGCCCACCATCTAAAGAAGATTCT 

GMGCCTTAGGGTAGCCGTGGAGAAGACATGAAGGTCCATTTTGCATGGTAGAACCCTGC 

CTGGCTCTTGCTGCAGTGTGGGAGGACAGGTTTGCAATGTGGAGGTGTGGCAGGCATGGA 

TTTGGGAGGATTGGCAGAGGACTCACCATGTCCATACCACTCACTGAGATGGCAAATATT 

TATTAA 

Sequence 3305 

CGGCCGCCCGGGCAGGTACCCTATCCAGAGGGTTCTACCCCATGTAGAACCTCTCTTTTC 

TGTTCACCTGATTAGGTGTAGAAGTAGCCAGTCAAATTCA TCCTCG CTACCACTGTTATT 

TCTGCCCACCTCTMGCATTTTCTTCTTTCTCATTCACCACTTTTTTAAACATGTGTAAT 

ACATCAGTAACTAATTTATMTATTCCACTCCMTCCATCAACCTCAGTAMTTTAATAT 

TCACATCATCAGCCCTTCATATCCATACATTACTTCAMTTCTTTATTCTGTTTGACTCC 

ATGGATTTGCATCACCTCTCCACAGCAGCAAGGCCAAGACCCAAACACTGAAAGTCTCTG 

AGGGCTCCATCTTGAGCCTCTTGAGAATCTCTAACTGGAACCACTGCCACTGCCATTTCT 

TTCMTTCCCCACCACGCCTGATTTCTACTGTCTCTTGGTCACTTCATCCCATACCTGTC 

ACMGTCCATTTCAMCGCTGCTTNCACAMCTMGGTAGMTTTGGGAGATGGTAATTA 

AAAAAATACAGATTCCTACACCTTGCCCATGGGAGCTTTTGATTCAAGTANATAAGAGGG 

GCAAGGTTTCAAATCTTTGGTATACCCAGGGGAATTAAAATT 

Sequence 3306 

AGGTACCATGCTGGAGTTTTGCAAAAGCTTCCCAGGTGATTTTAAT GAGTG TCAAGAGTT 

GAGMTCCTGCCCAAGACTTACTMGGGGTGGCAAMCCAGCCCATTTTTAACCCTCACT 

TTGAAACCCTTCCCACTATTATCAAAAGATTGGGGACGGGAACAAGGGAGGGCATCACTG 

GGAGAGGTTTGTTCTCTGCCTTGTTGGGAATAAGCGAACCGGGATTTCTGTCTCTCTGCA 

CATTGATCTCCCTTCCTCGGAACACCCCTCTTCCACTGTGTCTATGGGTGAATTCTCAGG 

CATTTTTCAAGGTGGAACTCAGTGTCACCCTTTCAGAAAGTCCTTCCCTACCTTATAGGC 

AGAATCGCCTCACCCCTGGTAAAGCGACCAGTCCACTGTGAGGCTTGTTATCCCCGTGTA 

TTCCATCTCATGGGGTCTTCACCTTATGTGTGCGTCCTCGTTTAAGACCACAGCTACCTG 

AACATAAGCAGGGCCGGGTCCTATGTGTCTTTTTAT 

Sequence 3307 

CCGGGCAGGTACGCGGGCTTTGACCAAAGAGTGCTCTGACACACTGCATTCCCTTTCCCC 

TAGGGGTAGCCCATGTTGAGGGCAAGACCTCCAGAAATCCTGCAGCACCTCCGGGTCTTG 

ACAGTTCCCCTGTAGTTGCCACGGTCTCAGTGGGTTCTACCGGGTTGTTTGCANGAAATC 

AACTCACGTGGAGACATTAAGGGCTGAGATTCTCTGGGCAGAATGGCAGCACACAGCCAA 

TAAAGTGCCCACCTNNTCAGCTTGGGmGACNGTGAGGGTGAATGAAACTTTGCGAACT 

GGCCTCCTGATTGCTCTGCTCCCAAGGACTCCTCAAATGGCCACCANNAGNAGTTCCTGG 

GCTTTGGGGGTGCAG AGGAACTC ACCANC ATTTTGGCAG N N AGCAATCTTG N GGC ATGTG 

GAAAGGGGAGCAAAGAAACACCCAACCTACT 

Sequence 3308 

AGGGCCAATTGGAGCTCTTTTGCNNNGGCGGCCGAGGTACGCGGGCTAAATACTGTGCTC 

TGACATTGATGCCNTTATTCCATATTNTNAGGGCTTCTGTGGTGTTTAGGCCTCTAATTT 

TTCTCTTCAGTAAGGGGAATAACTTTGCTGTGTCAGCGCAGTGCTCAGGAGGGTTGCAAT 

CTAGAAACAGTGATTGTTAAAATTGCCTATAAACACACAATATAAACAATATGCCACAGA 

CACAGAGGCCCCCAGCATAGTTTCTGGAAGGTGAAGTCTCACCTCTGCCAGAGTAGAGTC 

ACAGTGGGAATTGAGGCTTGGCATCTGGGTGCCTCTGAAGCTCTTCCTGTGGCTCCTGTT 
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AGACAATTGACCCCGAAACACAGAAGATAGAGATTGCACATTGCGTTCAGTAATTGTTGT 
TTATTTATGTTATTTCTGAAAATAAAAGACAGCACAGGCTGCTTATCTTTAAACCACACA 
GTACCTGCCCGGGCGGCCGCTCTAGAACTA 
Sequence 3309 

GGCCGAGGTACATCAGAACNTTCCAGTAACAGGATGGCGTTNGCAGCTGNCNTNNGAATG 
CTCAGCCTCGCTGGGAATGGGTCCCCTCTTACTTGCTGGGTGACCATGAGCATGTTGATC 
CCACCTCAGAGCCCCCGCTTCCTCCTCCTCTATAAATAGGACCATGGATAGTAGCTTGGA 
GAATTCAATAAAATATAGTGTGCTGAACACTCAGCACGTGCACTAGGAAGAACCAAAAAT 
GAGCAGGACCCTCACTGATATTTAATGTCCCCTAATTTGTCAAGGGACATTATATTGCAT 
MGATTTGCTTTAMGAAAGTAMTTCCTTTCAGGCAAGTTGTTTGAGTCACCAATAGGT 
TTAAATCAMGGMGTTTTTGTCCTCACGATTTTGCAI I I I 1 I AGTGCCTTTCCACAACA 
AAT GTAT AAAAGGAACATTA 1 I 1 1 1 AAGTGTAACTATTTTGGAAACTTCTTACTTAAAAG 
C AATTTTCTACTAC AAGTCAAAGGATTAAAG CAAAAAAAAAAGG N GAATCTGC AGC AGC A 
TATATTGGATATGNTTTCTAGAAATAC TTTTTTAAAGAAGG C AGAG AAC CCTAAGCTAAA 
AGCAGAATTTAAATTTCTA 
Sequence 3310 

CCGGG CAGNTACCTTTGCTGAAGTATCATACACACAAAANTTNCATNACCCTGGCCCTTT 

ATTTTCCATTACCCANTTTTATTTGTGTCATAACGATATCAAGTATATAAAATTGTCTTA 

TTTGGCTCTTTACATGTTTCTCACCTTTATTC 

TTTATTATTCACTGTCAAAATTCTCTTTACTTTTAATTTAAAAATCTT^ 

TGAGTAAATAAGCTAAGMTTGMTTATGTATTTMTCAATTCTATGATCTACATTAGTA 

AATATCTTGATAGGTTTGTTATAAAGACCAGTAAGCCAAGAGGTTTGGCTAAAGGCTGAT 

GMTGTTTAACTGAAATGTCTTATGCAACTAATGTAGACGCTATACGAAAAAAGGAGGTA 

TAATAATTGGCAAMGCMGGGCTTTTGTAGTTTTCAAAATATTAAAAATGGGCTGGGCG 

CAATGGCTCACACCTGTAATCCCAACCCTTTTGGGAGGCCCAAGCAGGCGGATCACCTGA 

GGTTGGAGTTCGAGACCAGCCTGACCACCGTGGAGAAATCCAGTCTCTACTAAAAATACC 

AAAATAGCCAGGCATGGGGGTACATGGTCTGNAATCCANCTTACTTGGGAAGCTGAGGCA 

ANAAAAACACTTGAACCCTGGAAGTGGAGGTGCAATNANCCCANAATGCCCATTGCACTC 

CAACCTGGGAAAACAAGMGGGAANCCTGTCTCAAAAGGACAAAGAAAAAAAAAAAAAA 

Sequence 3311 

CGGCCGNGCCGCCATNCAGGNTTNGCGACCTCGCGCNCGANTNGCAGGTNCANTTNCGTG 

CTCTTGCCAGTACTCGGGCGCGGCAGGGAATAAAAGCGCTCGAATATACGTTCCAGCGCA 

TACGCGGGGATGCCCTCACCGCGGTCGCGCACGGCGATTTCCACCTGGCTGCCCTGGCGC 

TGCGCCGTCAGCTCGATGCAGCTACCCGGTGGCGCGAAGC 

Sequence 3312 

CCGGGCAGGTACCCATCAGGAATATCCGGCTCAGAAGCCAGGGCCTCAGAGACTGTTCTC 

TCACTGAGACCTAGTGAGGGAACTCCCTGTGTTATTCTCACTGATGGCCCAGGAACCACC 

CTTGCAAGTCATGACCACCAGCATCATGTAGCACTGGAACTGATCTGGGCAATGACCTCT 

GTCTAAACTTCTGAATCCCCTCCG ACAAAG ACCCAAGACAG CAGCATGGCCATGCAGCTG 

TGCTCACATCTCACCCCTGCACTGGCCAGGAACACATCTATCTTTCCTTTGGGTAGGGTC 

ACCCAACTGCTCTGCCACTTCCAGCTGTGAAAGGCATCTATGTGACAGACCCCTCTGCAG 

TTTGAAACTGTGTGACAACTTTAACACCCAACTCAGCATCTGCATGGGTTTCTGAGAATT 

ACCTATATCTTTTGTGGTCTCTTTGCTGATTCTCTGTTTCATTAAAAAAA^ 

GACTCGTGATCCCTGAGTTCCTATATAGCCAATTTTACTCACTAGCTAAAGAAACACTGT 

ATTTAAAATGACAAACCTAGCAACAATTAGGCAAGCTCTCATCAGGACTTCATGCCAGGC 

TGGGGTGAATGGCTTAAAAAAGTCTTNCACAGCGGATCTTGGAACCTTGGACCATNGGGG 

CTTGGTGGCCACATTTGACCTAANGGGCTNATNGGGGGNAACC 

Sequence 3313 

NGGCGGCCGAGGTACAAAGAAACAGCTTCAATCCTTTTGGGGMTGGCAGGTTTTTGCAG 
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ANTNTGTGTTCCAAATTTGGGGTCATAGCAAAACCCCTTTATAAAGCAACAAGGGGGTCA 

GAAMTGAGTCAATGGAGTGGCCCTAGAAATGAGGGGAAACTTTGCCAAGTTAAAACAGG 

CTCTTAACAAAGCTCCCACTCTTGGCATCCCAAACTTCATTAAGCAI I I I I CCTTGTATA 

TAGCAGGAAAAAAAGGGCATAACTGTGGGAATACTAACCCAGAAATTGGGATCAGAACCC 

ATACCAACCATCTACTTTTCAAAGAAATTGAACAGAGTAGCCTCACAGTGACCAAGCTGC 

CTGCAGGCMTAGCAGCCACTGTTATTTTAGTGGAAGAAGCCACTAAAATCACCCTGGGT 

CAACCACTGAAAGTGTTAACTCCACATCAGGTAAGGTCAGTCCTGGGAATAAAAGGACAT 

ATCTGAACGATGGGAGAAAGGTTAACCAAATAACAGGCTATATTCCTAGATAATCCAGAG 

GTTACCCTTAAA 

Sequence 3314 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTTAGTCAGGAAGCAGGCATTAAACAC 
ACATATACACTCACMTGCTAAMTAAMTAAAGCTGTTMTTTGATCAATGCCAAGAAG 
AAAAAGAATGGGGCTTATAAGAGAATACAACAAGCGGTGCAACTTAAATGAGGGGGCAGG 
GGAGTATCTGAGAAGCCCTCTGTGAGTGAGTGAC rTTTTTTTTTTTTTTTTTi GAGATGG 
TGTCACACTTTGTCTCCAGGCTG G AGTG CAG AGGTGCG ATCCAGCTC ACTGCAATATCCC 
CCTCAAGCAATTCCCCTGCCTCAGCCTCCCGAGTAACTGGGACTTCAGGCACATGCCACC 
ACGCCTGGCTAAA I I 1 1 II IGTA1 I I 1 1 AATAGAGACGGGGTTTCA . 
Sequence 3315 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGCTGTTTTTGCGACTTTTAATAATAACACC 

TACTAGTTACTGAGTGTCATCATCACCMTGTCATGATTGCCACGATTTATTGAGTGCCT 

ACTACATGCAGGCCCCAGGTCAGTGCTTTACACTCTTAACTGTCATGTTGACAACAGCTC 

TGAATGACAGGAGCTCTGTTCAGTGTTTTACAAGTGAGAAAACTAAAGACACTAATNAAT 

TTTTTAAAMTATATCCACAGGTTCTATTGATTAAGTAAAGTAAGGGGAGCTAGTNCCTT 

GGCACGCTCTAATAMCTTTGTTGGGATTCCCCCCCCGGGGCCTTNGCANGGAAATTTTC 

GTNTAATTNCAAAGGCNTTTAATTGGMTTACCCCGGNNCGMCCCTTNGAAGGNGGGGG 

GGGGCCCCGGGGTTACCCCCAANTCNTTTTTTNGTNTTACCCCTTTTTAAGTTGAGGGGG 

GTTTAAA 

Sequence 3316 

CCGCGGTGGCGGCCGAGGTACTAACAGTCAGGAACAAATAGAACACATAGAGAGGACAGT 

GGGAGGTCCCCCTGGCTAAAGCAGAGGGTGTATTGAGAGAAGGAAACCTTTGTCTCAATG 

GATCGTAGGCAAACTGAGAAGGACCATAACACCTTGCTGAGAAGTTTGAATGTGATTCCT 

TGGTGACCACAGCAGGAGGTCCTGAAGTCCCTGTCCTTCAGATGCAAACAAGGAGGCTGT 

CCCATACAGACTATCAGGGAAAAAATGCACTGTTTATCAAAAGATATGAATGTTCTTGAC 

AGGCAAAGCCCTGCTCAAATGAAAT 

Sequence 3317 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACACTTCCCCCTCCTCCTCCCCCGACCATGCCT 

GMTTCAGACCTCTAGCCCTTGGATCTGTTATCTGACCCAAGAAGTCTTGTCAATTTTAG 

AAGGCTATCACCTCTTAACAGCCATTTCTGCTGCTTCCTGCACACTTTCTAATTTCTCCT 

CTGCTTTCCCTGGAAATCTGATGCTTGACTTGGACACAATTCAACCCTGTCTCCTTATAG 

AGACAGCTGCTGAATGAATGCTAGAMTTTAMTTCTGCCTTTAGCTTAGGGTTCTCTGC 

CTTCTTTAAAAMGTATTTCTAGAAAACATATACAAYATATGCTGCTGCAGATTCACCTT 

I 1 1 1 1 1 1 1 GCTTTAATCCTn"GACTTGCAGTAGAA/\ATTGCTnTMGTMGAAGTTrCC 

AAMTAGTTACACTTAAAMTMTGCTCCTTTTATACATTTG NTGNG GAAAGGC ACTAAA 

AMTGCAAAATCGTGAGGGACAAAAACTTTCTTTGATTTAAACCTATTGGTGACTCAAAC 

AACTTGNCTGAAAGGAATTTA 

Sequence 3318 

TACGACTCACTATAGGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCCGCCCGGGCAGG 

TCCGGGCAGGTACTCTTGTTTAACCATCAGAGGTGATCCCATCACCTTCACAGCCCCAGC 

CTCTGCTCCAGTCCCTNCCCAGCGAAAAGGGCCGCCCATGCCATCCTGCTGCTGGTGATT 
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TGCTTTGTGGTCATGGACTCAGTGGACATCATTATTTTATTAACCGTGTGGTAGGTTTTT 
AACTCANTTATCCTGGATATCCAAAGGTTTGTGGTCCATCTTTAGGCTTCCGTTTGTTCT 
TTGGTACCTCGGCCCGCTCTAGAACTAAN 
Sequence 3319 

AGCTCCACCCGCGGTGGCGGCCCGCCCGGGCAGGTACCCTGAAGGCCGCTTACTTGGAAC 
CGTATCTTCTAAATGCTCTGAAAACCTATCCAAAGGCAATTAAAATCACAAATATCATTG 
AAAACATCTAAGAACTACGTGTMGCACTTCGGGATAGTCAGGTAAAGGCGGCCCTCTAA 
TTATGAAGCATTTTGATMTTAATAGAATTCTAAG 

CATAATGGTTMGCCAAATAACACTTMTCATTTTACCTTNTATTATTGATAMGGCAAC 

ACACTGAAGTAGACAGACAGTAAAGAAAMGTCCACATCTTTTTATTGGGTCCAAATAAC 

CTGATTGNAAATACTTAAGCCTCTGATTTTCATGTTTCANMTTTGTATGTTAATGACCT 

TAGTAGATGTGGCTTTGTCCAGCCTCAAAAAGGGTTGCTCTCCAGCCAGGGGCCCCTTTT 

TTCCANNTACTGCTCCCATCTTCTTTAAGTATATTCAAACTGGTGAGTGAACTCTTTATC 

CCGCGTCCCTTGGCCGCTCTAAGAACTAAGTGGATNCCCCNGGCTGGCANGGAATTTCGA 

TATCAAGCTTTATCGGATCCCGTCNAAGCTTCGANGGGGGGG 

Sequence 3320 

GCGGCCCGAGGTACCAAGAAGAAAGCCATGGGCACTGTCAGTAAAGAGATCTAGAATGGG 

MCAAGACAGGGAGAAAGCCAGGTGCAAGACACAGGAAAAGTTCTAGAMGAAAGCTTTT 

TGCATCTAACATCAGTCTCCCAGA7TCAACCCATCTCAATATTGMGAAATTGTCATCCT 

ATTCCCAAATAMGTGAAATGAGGTTTTTTATGTAAAGTGCTTAGAATGGAGTGTGGCAC 

ACAGTAAGGATCACCTTTAAGTATTACCTATAGATATACTGATAAATACATGGATATGTC 

MGGGAGAGGTTGGCCTTTTAACTTGAAGGAGTCAGAAAAAAACTGCCATGCGCATTTTG 

CTTTGACTTTGAAAATCTGGATGCATGGAAGCTCTATAGMGCACAGCAGGMTTTCAAG 

AGGTATGCTGCCCAATGTGTGGCTGGCAGAGTGCTCACACCAAAGGCTCCATTGGCATTG 

AGTGGGGAGAACATACTAGAAGTGGG I I I \ I CTTGTCATGGAGTTGAA I rTTTTTTTTA A 

GCAAGTTCATCTTCTTTCTAGTGAAAAAATAAACCAAGGATAGGGGAA 

Sequence 3321 

CCGCGGCGGCCGGCCGCCCGGGCAGGTACCTCCTCCCTATGCATCTGCTGTGGGAAGTGT 

GGGTAACTCATAGATGATGCCACAGGGCATGTATTCAGGCACCACAGGCAACAGTGAATG 

MTGAAGTGMGTGGTATTTATCCATTTCTGGAAAGGTCCAGGGTTTGGCTCTGCAGGGC 

CAAGAGAACAGCTTTAGCTGTGCCTTAACCCAGTCCTGGAGAAGACAGCAGGCCGTAATC 

ACGGGGAGGAAACCCATCTTTTAAGGGCTCCTCGCTCAGGTGGTGACAAGGTGAGGTGGT 

CATCTATGCTGTCTTTATCAGTATCTGTCCTAAATACTGTGCTCTGACATTGATGCTAAT 

ATTCCATATTATCANGGCTTCTGTGGTGTTTAGGCCTCTAAl I 1 1 I CTCTTCAGTAANGG 

GMTAACTTTTGCTGTGTCAACGCANTGCCTNANGAGGGTTGCAATCTAAAAACAAGTGA 

TTGGTAAAATTGNCTATTMCNCNCCATTTTAACCATNTGCCNCANACACAGAAGCCCCC 

AACATTAGTTTTTGGAANGNGAMGCTTANCTNTTGCCAAATAAAATCACAGTGGGNM 

TG AN GCTTTGGC ATTTTGGTG N CTTTTG AAGCTCTTCCTG N GGCTTCTGGT AAN AC AATT 

GANCCCCAAACCCN NAANATTGAAATTGCCCATTGGGTTCAN AAAAN G NGGG N AATTAAG 

GNAATTCTGAA 

Sequence 3322 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAAGCAGTCTCTTCTGTCCGCTTGGAACCCCATGGT 

CCACTGAAGTTCCTTATGCTACTTTCACTGAGCATTCCTATGAAATACACCAGTGAGAAA 

TTCCTT 
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Sequence 3323 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGAAGGCCGCTTACTTGGAACCGTATCTTC 

TAMTGCTCTGAAAACCTATCCAAAGGCMTTAAAATCACAAATATCATTGAAAACATCT 

A AGAA CTACGTGTAAGCACTTCGGGATAGTCAGGTAAAGGCGGCCCTCTAATTATGAAGC 

ATTTTGATAATTAATAGAATTCTAAGCCAAAGCTTAAATCCGGATACACATCATAATGGT 

TMGCCAAATMCACTTAATCATTTTACCTTTTATTATTGATAAAGGCAACACACTGAAG 

TAGACAGACAGTAAAGAAAAAGTCACATCTTTTTATTGGG TC CAAATAACCTGATTGTAA 

ATACTTAGCCTCTGATTTTCATGTTTCAGAATTTGGTATGTTAATGACCTTAGTAGATGG 

TGGCTTTGTCCAGCCTCAAAAAGGGTTGCTCTTCAGCCAGGGCCCCI MM I CCAGGTAC 

TGCTCCCATCTCCTTrTMGTATATTCAACTGGGGAGGTGAACTCTTTTATNCCGGCGTA 

CCTTNGGGCCGNTCTAGAAACTAGTGGGATCCCCCGGGGCTGGAAGGAATTCGAATTTCA 

ANGCTTATCGATTACCCGTCGACCTTCGAGGGGG 

Sequence 3324 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGTGTGGTTTAAAGATAAGCAGCCTGTGCTG 

TCTTTTATTTTCAGAMTAACATAAATAMCMCMTTACTGAACGCAATGTGCAATCTC 

TATCTTCTGTGTTTCGGGGTCAATTGTCTAACAGGAGCCACAGGAAGAGCTTCAGAGGCA 

CCCAGATGCCAAGCCTCAATTCCCACTGTGACTCTACTCTGGCAGAGGTGAGACTTCACC 

TTCCAGAAACTATGCTGGGGGCCTCTGTGTCTGTGGCATATTGTTTATATTGTGTGTTTA 

TAGGCAATTTTMCAATCACTGTTTCTAGATTGCAACCCTCCTGAGGCACTGCGCTGACA 

CAGCAAAGTTATTCCCCTTACTGAAGAGAAAAATTAGAGGCCTAAACACCACAGAAGCCC 

TGATAATATGGAATATTAGCATCAATGTCAGAGCACAGTATTTAGGACAGATACTGATAA 

AGACAGCATAGATGGACCACCTTCACCTTGTCACCACCTGAGCGAGGAGCCCTTAAAAGA 

TGGGTT 

Sequence 3325 

CGAGGTACATGTGACCCAAGGACCAGCCCAAACGCAGATGCAGACAGATTCCAGACTCTT 

CAGCAATCCTTCCCTTATCTGGGGCAAAGGCTGTGATGCCTACACAGTCTCTGTCATTTC 

ATCCCTGATGCTCCAGTTTTTAGGAGAMCCACCTCGTCATGTTCAAACTTCCAGAAATG 

GTGGTCCTGTGGAAAGAATCCTAGGCCAGCAGCTAGGAGACCTGCTGTGAGTCCCAGCCC 

TGCCCATCACAGCTATGTCAGGTGGTTCAGACATGTTGGGCTTCAGGTCCCCATCTATAA 

TCAAAAAGTTTGTTTCTAGCTTCAACAATCTATGACTATTATTTCCGTTTACTGATAGAG 

TAACTAATTACTTCATTATAGGTTTTCAAGACCAGTTACCTCCTGTTTGGTTCAATTAGC 

AAAATGCCAACATGATTGTCATTGATTTTGGTGGTTGGGCTTCCAGGAGAGAAACCCAAT 

GGGAAATAGTTCANGATCTTAAGCAGATTTTGGATTTTATTATTCCAAGGGAGTATATGC 

CAGGAACCAGGGTTTGTCATGCATTTCTCAAMGTTCTTTGCCTAGGAAAAAGCTCACTC 

CCATCGGTCTCTTCCATTCTAGGGGATGCCTTAAGACTCTTNNATTTANAAGTCTAGAGC 

CNGANGAGAAGGAATAATCATGGGTAANACTGGCTTTAATGGAANACNTTT 

Sequence 3326 

ACNGGGGGAAATGACACAAACTGACMGTATTTATTTAGCCTCATTCCCATCACTCACCA 

CTGCCACCCANAAAAGTGTCACCTGAGTTTTATTGCATGCAGCCCCAACACTAAAGAA//3 

ATCAGAAATNTTTGTTCCCGTGGTGGAAAATAAAGGGCTATTTGCCCACGTCGCCAGGCT 

TCCTCTGGAMGCATNTGGGMTCATTCCAGAAATTCMTTTCCAGGTGTGTGTTTCTrT 

AAGGAAATATTGTGGAGTANAGAAGGAACGCACTGNGCAGGTGCCATTGGNTGGGGGCCG 

TTGCTATGGCAGATNAGCCTTATTANTGAATAGCAAAGGAAGGAGTCACCAGGAAAA.CAC 

AACATGANCAGGCNGAGTGCCCACCNCCGTGGCCTTGGGGAAGCCNCAATrATNTfGGCT 

TGACATGGTTCCAANAGTGNGGGCATTAGGGAAACCMCNTTTTGAGCTTGCAANATTCA 

ACAAGCATNGGCCATGCCANCCAGGGAAAANTGTGCTTTACTCCAGGAATAAGGGAGCAA 

GGCCCCATANTTTCNTCTTNNTTTCNGGGNCTTCCTTTTGAACTNGMTTGNTTTAANGG 

NNAACCTCTTGAANAAAATTGGGGTCCTTGCCTTTTTGGGGGANCCTTAAAAAAAAAA 

Sequence 3327 
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CCGCGGTGGCGGCCCGAGGTACGCGGGTATTATGGAGATGCTCTGTTCCATCCAGGGAGG 

TGCAGGTGAAAAGAGGGCAGTTCATCCTCCTCACACTTAGGGCAGGGAGCAGCATGCAGG 

GGCAGATCAGCAGCTTTGCATGTTGACAAATCCACTCTCTGCTGCAGATGCCTAGGGGAA 

GTTGCAGACTTAAATTTTCTTTTGTAAAATGGGGGAACACAAACAGATCTTATGTCACAC 

TGGTTACTCCAGAAAAAAAAAGAAAAAGMGGAAGGAAGA^AAGAAAAAGMTAANTATT 

ATATATANAANANAAAAGTACCTGCCCGGGCGGC 

Sequence 3328 

TATAGGCGAATTGGACTCACCGCGGTGGATACCCTCCACTCAAGGAGGTAGAACTCAGTC 

CTCTCCCTTGTGAGTCTTCACTAAAATNAGTGACCCACTTCCAAAGAGTGGGAGTATGGA 

AAGGGAAACATANTANCTNTACAGGGGAGAAAAATGACAAATGACGTCTTCACCAAGTGA 

TCAAMTTMCTGTCACCAGTGATAMGTCATTCANATTNGTTCTAGATMTCTTTCTAA 

AAATTCATAATNCCAATCTAA7TATGAGCTAAAACATCCAGGCANACTCAAGTTGAAGGA 

CATTCTACANMTATCCCCTGGGGTATTTTAAGAGTATTCCCTCAAAACTGTAAAAATNA 

TGGAANAATAA 

Sequence 3329 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTAAAAATGGCACCTT 

CTGTCTTCTGACATCACTGGACAGTATAGGGGGGGAAAAAATCAACCTTAAGAAAAAGAA 

ATTTGATATGAAAAAAAGAATTCAAMCATC^ 

CTGTGTCATCTGATACTCTTCAGTGTCCAATTAAAATCACACTCTATCTGGCAATCTATA 
GTCACAACTACCTAAAAATGTCTTAAGGAAGTAGGAATAAAGAAGGCATGTAGCTTTAAC 
TGAAGCACTACTTTGTTAAAGAATAAAAGCA I I I I I 1 i I I AAATGTCACCTGACACCAAA 
TATCTGGTTGGAAGAGGTGCTGTTTTAAATTTT 

AATTGAAAAAAATAAAAGTAAATTTTCTCTTCCTTGTATAGGCTTAGCATTA 

ACTATGCCTACTCTCMTAAAMTGAAACMCTCAAMTATTTAAAAGATCTTGTTTCCT 

AAGCACTTCAATTCTCAGGTTATACTAAACTTTATGATTTCATATTCCCTCAATAAATCC 

TAAATTCTCTCAATAAAATCCCCGCGTACCCCACAAGCTGCATTCAAACCTCAAATCTGT 

GGGAATGAGTGACTCGACAAAATGGTAATTCNGGATCAAACCTCATCCCCTGACTGNGTG 

AAAAAAGTCCTGCCCGGGCGGGCGCTTCTAAAATAGGNGGATNCCCCCGGGCTGNA 

Sequence 3330 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATATACAT 

TCTCACAGTAAACAGTTTTCAGATATTGAACAACAAAAAGTAGGCAGTGCTAGCCAGAGA 

TTCCTTGGAGCCAAAAAACAAAACAAAACAAACAATAAAAATTGGTAATCTTATATTGCT 

TCAGTTTGCTGCTTGGAGAAAGTTCTCAGGCTGCAGCTCANAGAGGAAAAATCCAGAGCT 

CAGTGGTCTGAATGAGTTAMGAGGATTTGGGGCAGCAAATGCAATTAGAATTCACAGGG 

CAGAGGTGAGAGGAAGTGTAGAGATGGAGAGGGTTGTGCTGAGGGAAGAGAGAGAGGGCG 

AAGAGAGTAGAACTACNAGAGAAGAACAAGTTGCAAAGTAAAAGAGAAATAAAGATCTGG 

AGTGCTGCAGACAGTTTCAGGCTTAAGTATTGATCAGTGCACACATTGTGAGGAAACTGC 

NGGCAAAAAAAAAAAAAAAAAAAAAA 

Sequence 3331 

CCGGGCAGGTACGCGGGAGACTAGAGGGAATTGCACCTCCTCACTGCACAAGATTAGCTA 

ATGCGGGTGAAGCTATTGGAGTGCTTCTGAAGAAATCAAMTTTTGTTATTATGATTATA 

CTGTTAGATATTTCTTTCTTGAGATTCTAMTGAATTTGTTTMTGCCTTATTGGTCCAT 

TCCTGCATCTCTGCTTCCAGMGCCAAGAACAAGTTCTCTGGATCACATAGGCTCCTCTG 

CTGCTTCCTCAGAAACAGAGATGGTACCT 

Sequence 3332 

CCGGGCAGGTACTCACAGCTGCATCACACTGAATATCATCTCCCTGCACTGCCATTTCAT 
TTGAGCAGGGCTTTGCCTGTCAAGAACATTCATATC I I I I I GATAAACAGTGCA l Mill 
CCCTG ATAGTCTGTATGG GACAG CCTCCTTGTTTGCATCTG AAGG AC AGGG ACTTC AGG A 
CCTNCTGCTGTGGTCACCAAGGAATCACATTCAAACTTCTCAGCAAGGTGTTATGGNCCT 
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TCTCAGTTTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTCTCAATACACCCTCTGCT 

TTAGCCAGGGGGACCTCCCA 

Sequence 3333 

CCGGGCAGGTACGCGGGATGAAAACCAATGATTTTGGAAAGCCAAATTGTTGAGAAGATA 

A GGTGG GGATTCCGACTACTAGTA I I 1 I I ACAAATCTGATTGTCGTTGCAGGTTTTGTTT 

T ATTTTG TTGTGTTMTGCTGAGAGTGGAGAATAGATTGGMTATTTGCCTCTTGTGTTT 

C I I I M GCTTTGTMCATTGCAAGTGGTCCACATTTTCCTTTAmAAAATTTAMGTT^ 

GTGCCTGGGGTTTCTGGTGTTAAATAGAAGTGATACTTCTGCATAAAGTATTATGGAGAT 

GCTCTGTTCCATCCAGGGAGGTGCAGGTGAAAAGAGGGCAG 

Sequence 3334 

m III I I IGGACAGTGAGAGTGGTCTTATTTCTTCCCCTAGTTTAGGTCACAGCAGTTG 
AGTCAGGGTATCATGCTATCGCAGGGCACAAATTTCTATCATCATCCATAGATCACTGGA 
TTAGTTGAAAMGTCACTCAAATGCTGACCAGGAGCACATGMCTTCTGTTTATTTCGGA 
CTTTCTG ATAAATGGTG G CC AGGTACCTGCC CGG N 
Sequence 3335 

CCGGGCAGGTACAAGCTCCACTGGAATAAAAAAGACTGTGTGGAAGTGAAGGGGCTCCTG 

CCCACAAGTGTGTCACCCCGTGTGTCCCTGCACTGCGGTAAGAGTGGTGGAGGAGACGGG 

TGCTTCCTCAGATGTCACTCTGGCATTCACCTCTCTTCAGGACTGCAAGGGGCCTACTCT 

GTCACCTGTGGCTCTTCCTTTCCTCTCAGGAACAAAAAAAAAAAAAAAAAANANAAGTAC 

CT 

Sequence 3336 

AGGTACGCGGGGAAGTTTGATTTTTAGAACTCATGTTTATATGGGTACATCCCA I 11111 
TTGATATACATTTAAAATMTTTCTTATGTO 

GCATATATCTTAGACTATAAATTTTAATTGTTATAGTTTGTTCCATTTTTGT^ 

AAAAACTGATACATATCTTTAAMTTGTATATTTAAATTACCTTACATATGAGGAATTT^ 

TATACTTGAAAGTTATTGTTATTAACTTGTTTTCATAATGTGAGATATTCTGTCATAAAG 

CACTTATGTTTTAAGTTGTAATTTAAAATACCCTTTCTTTTTMCA 

Sequence 3337 

CCGGGCAGGTACTCACCTTGTATTAGCTGAGTGACTTTGGTTAAGGACAGTTACTTAATT 

ATCCTG G GTCTC AGTTTCTCTCTTTG AAAAAC AG ATTC ATTCACTAAAATAATATTTATG 

AAGTATTCATTTTGTGATAAGCACTAGTCTAGCTACCGGGGATATACATTTACAAAACAG 

ATTTTAAMTATCCTGTCCTGGTGGACCTCACATTACGTTCATTACTAACCGGGATAATT 

TCAGTGCCTCTCTCATAGGATTGTTGCAAGAATTAAATTGATGAATACATACGTAACACT 

TATAACAGTGTCACAAAATGAGAGCTCAGTAAAAGCCATCATCACCA 

Sequence 3338 

AGGTACTGGGGGCTCCTAAACCATATATATCTTGCACCATTCTGGGGATATTTGCCAGGC 

CTGGGGGATCAAAATATACCCCAAATATTAAACCAGTTGMGGGTGATTTGAGCCTACTT 

TTTGAACACTTTGTATCCCCTGTGTGCCAGGAAATTCAGGGTCTTTTTCGTAACTTGATC 

AGATGTCTCCCAGGTTGGGCTACACACAAAAAAGTTATCTACATACTGGAGTGTACCTGC 

CCG 

Sequence 3339 

CCGGGCAGGTACATACAGCTTCATATCTGCTAATGGAGAAAATAACTTAAACTTGGCCCT 

GCCAGGCMCCCATmCTTCCAGTTTAAMCTTAGTCTTmCAGGTTAGCTAGGCCM 

TTCCAGGTGCTCGTCCTACCTTCTTTCTCCCATTTCTACACCACCCTTCi Ml! I GTCTC 

TCCCTGGGAGATATCTGAGCTTAAGGAAAAATTCGTTAAGAGTGGGGATCGGAGAGACCT 

CTGTTCCACTAGTTGCCTrCTGGACATTTTCTGGTCTCCACTGTTGATGTATCCATCCCT 

TCTGGCCTTTGGGTGCACTGAGGCATCTTGCTATTGTCTATGGTTATGCCATAGCTCC 

Sequence 3340 

CCGGGCAGGTACAGGTTAGCCAACAGGCCAACAAACAACTATGTCAATTACTCCCTAGGG 
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TAGTCCACTGCAACTGCGAGAGTCCACACACATAATTATTAGTGACACAAGCATTTGCTT 
TGGCCA AC TTGAT GCAACTTCCCACCTGAAATAGTAAACTATATGGGCAAATAAGCCCCC 
inn tGCATTTTTATCAAACAGGATTCAGTTAGGGTATTTAATCTAATCAAACAATTCA 
CAAATAGTTAATACATTTTTAAAATGTAGGTAAAGTAAATA 1 1 I i I ATTTAGATTC AAAT 
AAAAGCATGGATATTTGAATTTTATTTCAACCAATAATTGCAGGTATAGGG 
Sequence 3341 

CGAGGTACTAAAGAATGTGCATTTATTTGTGGGGAGGAGAGGAGGTCTAAATAGACAATA 

AACATATAAATATTAATATTGTGAAAGTGGTATGAAGAAAAATAAAGCATGATAAGGAGA 

GAGAGGTGTTCCCTGGCTCTCCAGGGGGCTGAGGTGGAAGAAAAGGACTCCGGTTTTAGG 

TAAGATATCAGANAAGACCTCTCTGATTGACTGAAATGTATGCAAAGGCCTGCATGAAGC 

CAGGGACCATGCCGTGCTGACATNTG 

Sequence 3342 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCAAGAAGAAAGCC 

ATGGGCACTGTCAGTAAAGAGATCTAGAATGGGAACAAGACAGGGAGAAAGCCAGGTGCA 

AGACACAGGAAAAGTTCTAGAAAGAAAGCTTTTTGCATCTAACATCAGTCTCCCCAGATT 

CMC CCATCTCAATATTGAAGAAATTGTCATCCTATTCCCAAATAAAGTGAAATGAGGTT 

TTTTATGTAMGTGCTTAGMTGGAGTGTGGCACACAGTMGGATCACCTTTAAGTATTA 

CCTATAGATATACTGATAMTACATGGATATGTCMGGGAGAGGTTGGCCTTTTAACTTG 

MGGAGTCAGAAAAAAACTGCCATGCGCATTTTGCTTTGACTTTGAAAATCTGGATGCAT 

GGAAGCTCTATAGAAGCACAGCAGGAATTTCAAGAGGTATGCTGCCCAATGTGTGGCTGG 

CAGAGTGCTCACCCAAAGGCTCCATTGGCATTGAGTGGGGAGAACATACTAGAAGTGGGT 

rrrmn igtcatggagttgaai mTrrn iaagcaggttcatcttctttctagggaaa 

AAAATAAACCANGGGATAGGGGGAAAAA 
Sequence 3343 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGTTCAAATAGTCAGCAGCTCATCATAATC 

AATGAGCGAGGACATAAAGTAGGAAAAATGCATCACCATGGTGAGCAAGGAAAGCAAGTT 

ATTG G AGGCAC ATGTTAACAC ATAAMTATAAAATTAATATGATC ACACTGGAAAG GCTT 

GCCTGAGCCCACAGTTTGAATGCCTACAATAAGATGAGATGCACAACAAAAAGCAAGAGA 

ACCTGATCAAGTGGGTGACCTGGCCATGGTGCTCTCATCAGTGGGGACCCAAATGCTTAT 

GTGGACTCACNAGGTATCGMTTAGACATGAATAGGAGTGTTTGTTGTGATGGCAAGAAA 

CTATATMTCAAATGAATACMTGAMCTTTAAAAATAATTGTAAGATCTTTACACCAGC 

CAA 

Sequence 3344 

CCGCGGTGGCGGCCGAGGTACTAACAGTCAGGAACAAATAGAACACATAGAGAGGACAGT 

GGGAGGTCCCCCTGGCT.AMGCAGAGGGTGTATTGAGAGMGGAAACCTTTGCCTCAATG 

GATCGTAGGCAMCTGAGMGGACCATMCACCTTGCTGAGMGTTTGAATGTGATTCCT 

TGGTGACCACAGCAGGAGGTCCTGAAGTCCCTGTCCTTCAGATGCAAACAAGGAGGCTGT 

CCCATACAGACTATCAGGGAAAAAATGCACTGTTTATCAAAAGATATGAATGTTCTTGAC 

AGGCAAAGCCCTGCTGAAATGAAATGGCAGTGCAGGGAGATGATATTCAGTGTGATGCAG 

CTGTG AGTACCTGCC CG 

Sequence 3345 

CCGCGGTGGCGGCCGGCCCGGGCAGGTACCCACAGAATGTGGCCCCAAATGATTCAATTA 

CAGCTAGAAGTTGTTGTCTTCCCAMGCACGAGTCTTTTGCATTCACAGGATATTTTTTC 

TTTCATAAACATTCC*j*GCGAGAATGTAAGTGCGGATGAAGATTGNGTTCTGTCTTCTTCT 

TGTCTTTTTTCTGTTTACTCTGATTTTATAAAATGCAGTGTCATGTCTTAGGACAAATTA 

TCCTCAGTTTGCCAGCAAGGTGAGTTTATAAATTATTTTATTCAMTGAGAAMTTTCC 

CTAGAACCAATAGTAAAGGATGGTGAGTTTCCAGNTGACCCACAATGACGTATTTAAC 

Sequence 3346 

CGAGGTACTTGATCGGAACCCGGCTGCCTCTAAGACTTGGTCTTGGGCAACCCGTGGGTG 
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GGCAGGMGGGAAATTATTTATTTTATTTTGTTAAGGGGAGTGAAGAAAAGAAGACGTGA 
GGACCGCAGTTCACTGTAATGTAGAGTAATTGCAAAAGTGGGCTGCCTGCCCCTAGCTTT 
TCCCGCTGTGGTTGCGGAGGTGGTCTTGGTT 
Sequence 3347 

TCGACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTAC 

CGAn"GTTGMTGTTGCAGCCCTATACCTGCAATTATTGGTTGAAATAAAATTCAAATAT 

CCATGCTTTTATTTGMTCTAMTAAAMTATTTACT^ 

TMCTATTTGTGAATTGTTTGATTAGATTAMTACCCTMCTGMTCCTGTTTGATAAAA 

ATGCAAAAAAGGGGGCTTATTTGTCCATATAGTTTACTATTTCAGGTGGGAAGTTACATC 

AAGTTGGCCAAAGCAAATGCTTGTGTCACTAATAATTATGTGTGTGGACTCTTGCAGTTG 

CAGTGGACTACCCTAGGGAGTMTTGACATAGTTGTTTGTTGGCCTGTTGGCTAACCTGT 

ACCT 

Sequence 3348 

CCGCGGTGGCCCGGCCCGCCCGGGCAGGTACCAGAGACTCCTATTTATTTATACCAGTAA 

GGGCCCTCCCACTGCCTTTGTCTCCAGTAGGTCAGCTGCATCATCAGCTTTAAAAGCCCC 

CAAATATTGCTGAGCTTG I I I I i I I AAAACTTCTCTTGCAAACTTTATAAATTTGTTTCA 

GAGCACTATAATAAAATATTTCTTGCAAACATGGACTATAAGTGCCAGAGACACTGGAGA 

AATCTGAGGGTGGGGAGAAAGGATAATCTATCAGCTTGGAAGCTCCAAAGCACTGTCTTT 

AGTAGCGCTATTGGTGA 

Sequence 3349 

(I I I i I I NTTCCTTGGNTNCTTGCCTTACTTTCAGAMTACAAACAGAACTTTCTAAAAT 

CAGCAMGMGCAATTTTTCACCTTCCAACTGCCAACATACTAATACACAATACTTGGCA 

GAGCTGTCCAAAGCAAGTCAGACCACATACAGTATACCTAAATCCTGACACCTCCAACTC 

TNTTAAC AAATAATTTTTTTTCTCCGTGGTAATG GTCTNTTGGAAGTTAAAATATAG N G A 

GAAATGTGTAAACATACCAATNTATTAAAATAAGACAAGTGAATCCCTTTAAGAATCATG 

A 

Sequence 3350 

GGAGCTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTCCCGGGTGAACCCTCTGCAGTCT 

GCCATTTCTTGGAATCCTCTTAACCMTCTTAGAATTATCTCCCAGCATCAGATTTTGCT 

TATTMGGAMCCAGGCATTGTGAMTGCTCCTTTCTACACTAATCTGAAAGACATAAAT 

AGTGGCAGAGAATGTTAGACATAGAGCGGGAAATTTCAGAGACTTTTGTTGGAAGAATCA 

TCACTCTCCTGCAGGAACTGCAAAAAACACTTGATCTCTGTTTACACATGCCTGCAGACA 

CACAGAGAGGCCCTAATCAGCGGCTTGTGGACAGCCTTCTATCATAGCTACCCACTTTTC 

TCTGTGACCTTCCCTGTTCTTCCACTCCATGTCTATCCTGACTCTCATTCACGCATCACG 

GCTTGGGACACTTGTGCTGCCTTTATCATTTGGGAAAACATGGCGCAGC 

Sequence 3351 

CGAGGTACGCGGAAATGTTAGACAAATGTAGCCAAACCAAAAAATACACATGTTCTATGA 
ATCCATATATGCAAGGTTCACAATAGGTGATAGTAGCTACTCTTTTGGGAAGG 
Sequence 3352 

tagggcgaattggagctcaccgcggtggcggccgtcggtgccggcaaggacaagatcgac 

aacacggaaccggccctgTactcgcagtacatggattggatcaacctgatgacctacgac 

ttccacggcggctgggaaacgagcaccaacttcaatgcccagctgtttgccgaccctgcg 

atccgtcgacgggcatggcgcgcgaatacgtgggcgacaaggcggtgcaatacatgattg 

ccgccggcgtgccgcgcgacaagctgctgctgggcatcccgttctatggccgcggctgga 

cgggcgtggcgccaggaccgaacggcgatggcctgtaccaggcggccacgggcacggcgc 

cgggcacctacgagtcgggcatcgaggactataaaatcctcgtcgccaagaccggcacgc 

gctactaccatcccgtgaccaagcagctgtacctgtacacgggcgcgggtggcaagtggt 

ggagctatgacgacccgaccgtcat 

Sequence 3353 
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TNCTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGTCACCTTGATCCTGTGGCGCG 

CCACCGTGGGCGTGATCAACGGCAGCATGACCCTGGGCGACCTGGTGCTGGTGAACGCCT 

TCATGATCCAGCTGTACATCCCTCTCAATTTCCTGGGCGTCATTTACCGCGAAATCAAGC 

AAAGCCTGGCCGACATGGAACGCCTGTTTTCCCTGCTGAACGAGAACCGCGAAATCGCCG 

ATACGCCGGACGCAAGTCCGCTCGTCACGCGGGGCnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnGAGGGTTAATTGCGCCGCTTGGCGTAATCATG 

GGTCATAAGC 

Sequence 3354 

AGGTACTANGGTTTTGGTTGGGTAGGAGAGACCAAAACCAGGGAAAAGCTTAGAAATCCT 
TTATTGGGGTTTCTGCAGGAAATGCAAGGCAGGTCACAGTTAACAGCTTATTATTGGCTA 
GTTTGAATAATTCCAGTGGGCCCTGGGGCACAGGTATCTAGTTGTCTGGT 
Sequence 3355 

CCGGGCAGGTACCCTCGGTTGCAAGCACAAGCAAATGTGCCAGGGTGGTTGATGCAGCTG 

TGGTCACAGGTCCTATCCAAAGAGCACTCATCCACATCTTGGCAAGACTTCTCATCTGTT 

AATMTTTAAATCCTTTCTTGCAGCCGCAGTCAAAACTGCCCACGATGTTm 

TGATCACAACCTCCCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGGTCATTAGCTGT 

TTCCTGTGTGGAAATTGTTAT 

Sequence 3356 

ACCGGCNAAGGGCGANTTGNAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTGNGTT 

GAAACTGTAGATCAGTTGGGGAATATTNCCATCTTTACAATATTTTCTTCCGATCTATTA 

ACATGAGGAGTCTTTCTATTCATTTAGTTTTAAAAT^ 

TTTGAAGTATAAMTTTGCACTTGTTAMTTTATTC^ 

CTGTTGTAMGTTGTTAATGGAAATGCTTTCTT^ 

GTGTATAGAAGTAAAACNGATATTTGTATGTGGAACTTGTATTTTACAACCTTGCTGAAC 

TAGTTAATTTCCTCTAATAANCTTTTTTTGGTGGATCTTCAAGATm 

GMCCATGTCATCTGCAANATAGGAGAAAAATTTTTACTTCTTTCTTTTTCMTATC 

TGCCCTTNAAATTNNGGNTCTTGGTTTAAATGGCCCCCANCTTAGAACCTTCATTGGAAT 

AAAAAMTTGAAAATTGGGCCCCGGCCCCAGNGGCTTCATTTCCTGNAATNCCCAGCCCC 

TTTGGGGANGGCTTGMGCCGGTTGGGAACANCTTTGANGCCCCGGNGTTCAAAGAACCA 

GGCNTGGGCCMNAATGGGGGNAAACCCCTTGCCTTTTGCTAAAAATAAAAAAAAAA 

Sequence 3357 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCCAATGCCAGCGTCAGAGCTGCTTTTGTG 

CATGCCCTTGGAGATCTATTTCAGAGTATCAGTGTGCTAAITAGTGCACTTATTATCTAC 

TTTAAGCCAGAGTATAAAATAGCCGACCCMTCTGCACATrCATCTTTTCCATCCTGGTC 

TTGGCCAGCACCATCACTATCTTAAAGGACTTCTCCATCTTACTCATGGAAGGTGTGCCA 

AAGAGCCTGAATTACGGTGGTGTGAAAGAGCTTATTTTAGCAGTCGACGGGGTGCTGTCT 

GTGCACAGCCTGCACATCTGGTCTCTAACAATGAATCAAGTAATTCTCTCAGCTCATGTT 

GCTACAACAGCCAGCTGGGACAGCCAAGTGGTTCGGAGAGAAATTGCTAAAGCCCTTAGC 

AAAAGCTTTACGATGCACTCACTCACCAT 

Sequence 3358 

CCGCGGTGGCGGCCGAGGTACCCTCGGTTGCMGCACAAGCAAATGTGCCAGGGTGGTTG 

ATGCAGCTGTGGTCACAGGTCCTATCCAAAGAGCACTCATCCACATCTTGGCAAGACTTC 

TCATCTGTTAATMTTTAAATCCTTTCTTGCAGCCGCAGTCAAAACTGGCCCACGATGTT 

TTTTTGCAAGAMTGATCACAACCTCCATTGCGGGTCTGGCACTCATCMTATCTTTACA 

TGTCTTCCCATCCAACTGGAGAGTGAATCCAACAGGACAACTGCAGTGGACACCTGTCGA 

AGTATCCTTACAGGTGCGGTCACAGCCTCCATTGTTGACAGCACACGTTTCCATGANCAG 
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CCGTTCGTTTCACCCGTTTTATCCCCATCCACCACTGATGTGGGTGTTGCTCTCT 
Sequence 3359 

GGTACCAAAGAACAAACGGAAGCCTAAAGATGGACCACAAACCTTTGGATATCCAGGATA 

ACTGAGTNAAAAACCTACCACACGGNTAATAAAATAATGATGTCCACTGAGTCCATGACC 

ACAAAGCAAATCACCAGCAGCAGGATGGCATGGGCGGCCCTTTTCGCTGGGGAGGGACTG 

GAGCAAAGGCTGGGGCTGTGAAGGTGATGGGATCACCTCTGATGGTTAAACAAGANTACC 

TA 

Sequence 3360 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGATTCATTCTCATCAAATATCATGA 
AGTGGAGAACACGGTGAACTATG7TGGGGTTGATGTCTTTGCGGCATACAGAGAGAGGGC 
CTACCAGTCCACTGTGAAGGTCCTTAGCCACAGATACAGTTGAATAGTAAAACCAAGGTA 
TGC ATTC AAAGTC C CAGTGAG GTAGG G ACC AGTTCTATCAGGTATCTGCCAAGTATATAT 
TTGMTCTCTCCTGGCTGAGTTGGAACMCAGTGGAATTATTTGTTTTCACTCCATGAGC 
ATGAATAGAATACGGTCTTGCGGCTTTA 1 1 1 1 1 AAAGATAATTTTGAATTATTTGACCAG 
GGTTGAGCAATATTMTGGGACCTAGTATATCCGAGNATGTTTTTCACCCTTCATTTCCT 
TTT 

Sequence 3361 

TGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGAGATGCTGCCACCTAGGT 

TACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCA 

GGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACA 

TAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGTTAATGGAAGTCGGCAATGTGATGG 

TGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGC 

TTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCCNGCTGTCTGCAACCTGCAA 

GAGGACCCTCACTAGAAGCTAGCCATACTGGCCTCCTCATCTTGGCTTTCCAACTTCCAG 

AACTGTGAGAAGTATATGTTTGTGGTTTAGTCAATGGTCTATGGTAAl Mill IATAGCA 

GTCCCAGCCMGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAG 

Sequence 3362 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGAAACAGCTTCAATCCTT 

TTGGGGAATGGCAGGGTTTTGCAGAATATGTGTTCCAAATTTGGGGTCATAGCAAAACCC 

CTTTATAAAGCAACAAGGGGGCCAGAAAATGAGTCAATGGAGTGGGCCCTANAAATGAGG 

GGAMCTTTGTCAAGTTAAAACAGGCTCTTAACAAAGCTCCCACTCTTGGCATCCCAAAC 

TTCATTAAGCATTTTTCCTTGTATATAGCAGGAAAAAAAGGGCATAACTGTGGGAATACT 

AACCCANAAATTGGGATCAGMCCCATACCAACCATCTACTTTTCAAAGAAATTGAACAG 

AGTAGCCTCACAGTGACCAAGCTGCCTGCAGGCAATAGCAGCCACTGTTATTTTAGTGGA 

AGAAGCCACTAAAATCACCCTGG 

Sequence 3363 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAAGAACAAACGGAAGCCTAAAGATGGACCA 

CAMCCTTTGGATATCCAGGATAACTGAGTTAAAAACCTACCACACGGTTAATAAAATAA 

TGATGTCCACTGAGTCCATGACCACAAAGCAAATCACCAGCAGCAGGATGGCATGGGCGG 

CCCTTTTCGCTGGGGAGGGACTGGAGCAGAGGCTGGGGCTGTGAAGGTGATGGGATCACC 

TCTGATGGTTAAACAAGAGTACCT 

Sequence 3364 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATGGTTCTTCCTCAAAAA 

GTGGTAATTCCTTAATGTGTTTC I H I I ACCCTTTTTCTTCTTCTTGGAAAGGGGATTTT 

AAGTAAAGAACTGAAGTAATGGAAAGAGTAAGCTAGAAGAATATCTGGGGAAAAAGCCCC 

CAGACACATGGAACTGCTAAGTGCCAGAAGGTGTGCCTGGGAATCTTTTAAGCAGTAGGG 

GTAGGGTAAGGGAATAGCANCAAGTTCAGTGTTGCTTGAAN.CAGAGCAAAGACATAGNAA 

NCAGAAGTTTAAGCAGGAGAGGTAACATGCCAGATGGTTACTGCCTTCAGTTATTAGGAN 

GGACCTCTGCACATACTCAGAGTGAAATGGGGAGGCAATCAGAAGGGCTGGGGCAGAAGG 
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AATGAC CCAATT TGACTTATGTTTAAAATACATNCACTTGAGTTTAAGAATTGACAAAAG 
GGGAAGTTTTTAAAAACCCAGGACTATCAATTCCAAGTCTATGACACTCATCTAGACTAC 
AGATGACGGTGGCTCAAATGTACCTGCCCG 
Sequence 3365 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTG 

CTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGG 

CMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCA 

G AATAC AC ATTTCC C ACCTTG CCC AACAGTAG AAAAAC ATAAG AAG AGAAAAACATTAAA 

AMTGACMGGMGTTAATGGMGTCAGCMTGTGATGGTGTTTGGAGGTGGAGCCTTCA 

GAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCA 

TGTGAGGAGCCAAGAAGCCCGACTGTCTGCAACCTGCAAAGAGGACCCTCACTAGAAGCT 

AGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGTATATGTT 

TGTGGTTTAATCAATGGTCTATGGTAA i M I NI 

Sequence 3366 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGGGGAGA 

CCATTGCTCTATAGCAAGACCAGACTTTGCCCTTCCTCCTCTCAGCCTACTCAACGTGAA 

TATMTGAGGATGAAGACCTTTGTMTGACCTTTTCCCACATAATGAATAGCCATTGGGA 

GACACACTTCTGAACACCACCACTGGAAAATCACACATGCTGAAATGGGAGAGTTCCCTG 

ACCCCCTTGCAGGATATGTGACAGGAGTGTGGCTCATCTGTTCAGCTGGAGTGCATACTC 

AAACCCCTTATGAGACAAGGAGTATGCAGACAGAAGGTGCAGGAACTGGGGTGAGTACCT 

0 

Sequence 3367 

NGGCGAATTGGAGCTTNCCGCGGTGGCGGCCGCCCGGGCAGGTACCACACACACACAGGC 

ACACGCAGGCACNCGCAGGCACACANTCNGACACACACACACACATAAACACACAGAGTT 

CACTAGTCCGAGTTACTGATTTTCTTAGGATTCTCAAAGTGACAACACCGGAAACAAGGT 

MTTCATGTTAAAACACAAGGGTTATATCAGTAAGAGATGGGATCCCCGAAGTAAACCGT 

GGMTTTGAATCAAGCTTCGMGAGCTAAAAAAAGAMTTGGAGTTTCMCATTCACCTT 

CTTGAATCCTTAAGAMTACAGMGTTCAAMTAGAAAACATTACAGTTTCAGGATCAAA 

AGTAG AAACATCTG AG ATTAAGCCTAC ATTTTTAAAAAAAG ATATTGAAAATTACTG G GC 

TTGTAATACCAGCTTTTCAAACATCAATACCAGTATTGGCATTACCTAGATCAAATCTTT 

TCATCAAAGTTGAAAAAATAAAGCATTTGGGGGGATAGAAGGACCATCAAAAATGTCCAA 

TATTGAAATATTGGGTCTTTGCATAGTTNCATTAAAAAAATTTACCCTGCTCTCAATGGT 

TGGGTCANTTCTTCAATTCCMGGCTTTANTTTGGNACAAGACTTTTTGCAT 

Sequence 3368 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCGAGCGGCCCGCCCGGGCAGGTACGCC 

GGGGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCT 

CCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATCA 

CATTTCCC ACCTTG CCCAAC AGTAG AAAAAC ATAAG AAG AG AAAAAC ATTAAAAAATG AC 

AAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTA 

ATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGG 

AGCCAAGAAGCCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATAC 

TGGCATCCTCATCTTGGCTTTCCAACTTCCAGMCTGTGAGMGTATATGTTTGTGGTTT 

AGTCAATGGTCTATGGTMTTTTTTTATAGCAGTCCCAGCCCAAGACAGTGCCTCATTTA 

CTACATACCATTTATATTATTATATAGGCTTCCTTTCAGAAACCCATGTTCAAAATAAGA 

GATAANATACTGAMCACATAACACCTTCCTTAGTTTTTAAGTATACAMTATTTGAGAA 

AATAGTTTGGTTATTMCTATTTTAATCCMGAAATGCANAATCATGTGGTTCTAA 

Sequence 3369 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTTACTCACCCTT 
CCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGGCGTCT 
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GAGAGGACCAAAGAGAGTGACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACAGCTT 

CATGAAAGGCTGGACCACCCAGTGGTGAGAAGCATCATCTGGGTTACCTTGTGCTGCCAT 

AAMCACACCACAGACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAATCCTGGA 

GGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCTCTTTCCT 

TCTCACTGTGTGCTCTTTCTTGTGCATGGAGAGAGAGAGCATGAACAAGCCCTCTACTGT 

CCCTCTTAGAAGGGCACTAATCCCATAATAAGGCATCCACCCTCCTGACCTCGGCTAACC 

CCAGTTAAGTACCTCGGCCGCTCTAGAACTAGTG 

Sequence 3370 

CCGCGGTGGCGGCCGAGGTACTTTGCCAAGCAGTAAAGGATCCAGGAGATAGCACTGGAT 

GTGGTGTCATGTCCTGCAAACATGMCGTTTTCACTTCAGCCTGGAGATCTGC7TCAGAG 

AAATCTTTGGTGTTTTCGCTTTTGGCACTCAAAAGTATGTCCAGAAAATCCCAGCGCCTT 

TTCTGAGTAGTATCTTGTTTTAGCTTATCCTTAAGAGACTCCTTCCGGTCCTGGATTACT 

TTCTCTGTGAACTGATGAAGTTCTTGGTTAAATTTAGAAAAGATTTGGCCTTGAGAGCTG 

AATTTGAAAACCAGGTCGTTGTGATGTAGAAAATTGTTCATGCGCTGGTTGGAG A1 1111 

TGCTAAGGTTGAACACTGCTTTCAGGTATTGAGTCCAGGGTACCTGCCCGGGCCGGGCCG 

CT 

Sequence 3371 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAGTAGTCT 

GTAACCAAAATMTTCATTTGGCACATTTAAGGCATCTGCCCAATAAAATACCTTGTGAA 

TAMTTCTTTAGGAATTATGAATACTACAAGGAGAAAAATGACAGAAAGAAGCTGACTTG 

GGGGGAATGGGAGTTGTTTTTAATAACCTTTTTACCACCCTAGACTTCTGGCCAATCAAA 

TTGAATCATACCTTGGGAAGGGAAACTGCCTGATGCAGCAGCTGAGCCAAAGTCAGCAAA 

TCCTCCGAATAAATCAGCTGATCCTCCTAGAAGTTAAGAAAAAGTTAAAACTGTTAGAAC 

TAGAGAAAAATTGTAMTAATTTCTGGAGGCCATTTTAGTGCCACACAAAAAGTACCT 

Sequence 3372 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCTTTCTCTTACTGATAGTANG 

GATATTTCTGCTTTAGTTATTGTCACCTTAMTATATTTTCMTGTTGAAATCCTCACAG 

CATGTTTGATGAMTCTAGTTTTCAAATTTTCTTAGGTATATTTCTGTCACGTTGGCATG 

ATAACAAATGCMTMCCCAAAAGACCCCAAAAGCTAGTGTAATCCCTTTTGCAATCCAA 

GCATGAGGATTCATCTTCATGTTGACAGTGCGTGMTGTTCGGTAGGCTTTGTCAAGCTT 

GCATACAATAAATTATATATGTCCCTTTTCTTTTAGGGTCTCCTGTTAAAGGATGGTCTT 

CTGAAG GCTAACTGCGGMTG AAAGTTTCTATTCC AACTAAAGCCTTAGAATTG ATG G AC 

ATGCAAACTTTCAAAGCAGAGCCTCCCGAGAAGC 

Sequence 3373 

CCGCGGTGGCGGCCGAGGTACTCACCCCAGTTCCTGCACCTTCTGTCTGCATACTCCTTG 

TCTCATAAGGGGTTTGAGTATGCACTCCAGCTGAACAGATGAGCCACACTCCTGTCACAT 

ATCCTGCAAGGGGGTCAGGGMCTCTCCCATTTCAGCATGTGTGATTTTCCAGTGGTGGT 

GTTCAGAAGTGTGTCTCCCAATGGCTATTCATTATGTGGGAAAAGGTCATTACAAAGGTC 

TTCATCCTCATTATATTCACGTTGAGTAGGCTGAGAGGAGGAAGGGCAAAGTCTGGTCTT 

GCTATAGAGCAATGGTCTCCCCCCGCGTACCTGCCCG 

Sequence 3374 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGCCACTTTCCTGAGCCAAGCACTTCATTA 

TCTTCTTGTCCTGTGAGTCCTAATAGATATATTTAGAAACACTTCTATGAAATATGGAAT 

CCAGGCTTAACCAGCTGTTTGAAATCACAACCCACAACCACTTTGCTGGTGTGTGCTCAC 

ACACACACACACACTGACACATGCTTGCTATACATACATACACACAGTTTCCATCTTTTT 

nTTTTTm I GAGACAG AGTCTTGCTCTTTCGCCCAGGCTGGAGTACACTTGGCTCAC T 

GCAACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CTACAGGCACGTNCCACTACACCCGGCTAATTTTTGTATTTTT 

Sequence 3375 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCTTCCACCATGTGAGGGACACAGCAAG 

AAGGGGCTGACTCTGAACCAGGAAGCTGTGCTGTGCTCACCAGACACTTTGATCTTGGAC 

TTCTCAGCCTCCAGAACTGTGAGAAAGAGATTTCTATTTTTATAAGCCACCCAGTAGATG 

GTACT7TGTTACAGCAGCCTGAAAGGACTAAGACACCGACCTAGTCTCCCTGATGAAAAA 

G^^CTCTCAGACTTCTACCCTTTCCMTGTGGCCAMGCTTTTCATTCCGMGMGTTT 

CCTTTCTGAGAACGCTCATTGTGTCGTTTGGCTTTCCCCGTCTCTGCTTGACACATGAAC 

CA AAAC AGAGGCAGCCAAAGCAGGGAAAAAAAAATCCTAGGATCAGAGTCCACTCTATGC 

C CTTTTG AG CTTC AAAAGG AGAAAG AG AC AAAAG CCA 

Sequence 3376 

CGCCCGGGCAGGTACGCGGGGGAACTTGTGTGCTTAGACCTGACGCTGGNANGAGATGCT 

GCCACCTAN GTTACTTGTAGGACCCN NTACTGTAAN NTCCTTTGCCAGGAACTATTTATA 

AACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACA 

GTAGAAAAACATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTAATGGAAGTCAG 

CAATGTGATGGTGTTTGGAGGTGGAGCCTTCANAAGGTAATTAATGCCCTTGTAAGAAGA 

GGCCAGAGAGCTTGCGCACCTTC7TCTGCCATGTGAGGAGCCAANAAGCCGGCTGTCTGC 

AACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCC 

AACTTCCAGAACTGTGAGAAGTATATGTTTGTGGTTTAATCAATGGTCTATGGTAATTTT 

TTTATAGCAAGTCCCACCMGACAGNGCCTCATTTACTACATCCATTTATATTATTATAT 

AGGCTCCTHTTC AAAAAC CC ATGTTC AAATTAG AG AT AAG ATCTG G AAC ACTTAC ACCTTC 

CTAGTTTTAGT ATAC MTATTGAGAAATAGTTGGTATAACTATCTATCCAISIAAATGCAAT 

CTGTGTTCTAATTTTATTATMTGCAMTGAGAACTTACACATCTAATTTACTGCCAAAA 

AAA 

Sequence 3377 

AGGTACGCGGGGAMGCTTGGTGGAAAAAACACCTGATGAGGCTGCATCCTTGGTGGAGG 

GAACATCTGACAAMTTCAATGTTTGGAGAAAGCGACATCTGGAAAGTTCGAACAGTCAG 

CAGAAGAAACACCTAGGGAAATTACGAGTCCTGCAAAAGAAACATCTGAGAAATTTACGT 

GGCCAGCAAAAGGAAGACCTAGGAAGATCGCATGGGAGAAAAAAGAAGACACACCTAGGG 

AAATTATGAGTCCCGCAAAAGAAACATNTGAGAAATTTACGTGGGCAGCAAAAGGAAGAC 

CTAGGGAAGATCGCATGGGAGAAAAAAGAAACACCTGTAAAGACTGGATGCGT 

Sequence 3378 

CCGCGGTGGCGGCCGAGGAC I I I II I I 1 I I I I ICI II II I i I I I I I I I GGATTGTTGATC 

ACACCTTTATCTTGCTTAATATGTGTCCAGATTCACAAAAATATCTACATCCTTCTGATG 

TATCATTTTAGGGTATGATCTCAGTAATCAGGCAGTGAATTTTCTATAGATTTGTAGGAA 

GTCTAGGACAAGATGCATATTGTTGGCATGCTGTATACTTCANAATGCTTAGTTTATTAA 

TCAACAAAGGAGTCATATCTTTTTGTCACTAATGTCCTCAAAACAATCCCAACCAATCTT 

ACCGCTAAAATTGTATTTTATTATTTTA 1 1 1 1 IATTT 

Sequence 3379 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTTAGACCT 

GACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCT 

TTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCAAGGAAGCTTTTCTT^'G 

AGCTATTTACAGCTTTTAGCAATTGAGTAAAGTATACTCCTGTGMCAAMTTTGGAACA 

TA7TTGTTTCTCTCTAACTGATTTCTCCAGAATTTGGM 

CTTGGGGTGTGGATCGAGACCCCTTTCTGGTAACGAGAGCACAGCTTGGTTTTATACATT 
TTAAGGAGACATGAGACATCMTCAACGTATNTAAGATGAACATTTTGATCTGAAAAGGC 
CTTTTTTA 
Sequence 3380 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTTTI 
TTTTTGCCTGGAGAGAGAGAGATGTTTTATTTTTGCCAAAGCATGTGAGAGTAAGTAGCA 
AACATCATGACACTTCACCCCTCAAATATTTTCCCACGTATCACTTAAGAACAAGGGCAT 
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TCATTCTGTTACATGACCACAATACAATCAGCACAGATAGATATTAACATTGAGACAATG 
CTCTGATGTATTGGTATCTACTATGCAGTTCATAATCAAATMGGGTTTTGGTAACTTGG 
GATCCATAAATGGTCTGCAAGGTCAGCCCCTGTAGACCTGCACTTAGTGGGTGCCA 
Sequence 3381 

AGGTACACCCCCTTTTTACCTTGGACAGTTTTCAGAGAGAAGCCATAGCTGCCTCCTTCC 

TTCACGAGATAGCAGAGCCGGGGCTGGGTCCAAGTTTGCACACCTCCATTCATCACAGGG 

GAAAGTAT ATTAT CACTCAAACCTTGCTCCTTCTGACTTTGACCCMCTCTTTCAAGNTC 

CACCCGTGTTTTCACTGCCTTTCTCATAGGAAATCTCCCATCCCAGTAACCTAGTTAAAA 

GCTCA CNTGG CATTCCCCACTTCTTTCTNGACCAGGATCCCACNAACCTGCCATAATGTN 

TCTNTCTTTTGNTCCACAAMGNACACCMTTTGTATCCTTANAGAAACCTCTTGTCTTC 

CATTCTTNGAAAGGCCAGCCCCTTCTTCTGCNTGGGCCTACACTTTCTCAAACCCAACCC 

CGGGACCAAGGGTGGGCCCCTCGGGTGT 

Sequence 3382 

AGGTACAATGCCAGGAAGATGAATGTGCGTTAATGTTGCTGGAACATGGCACTGATCCAA 

ACATTCCAGATGAGTATGGAAATACCACTCTACACTACGCTATCTACAATGAAGATAAAT 

TAATGGCCAAAGCACTGCTCTTATACGGTGCTGATATCGAATCAAAAAACAAGCATGGCC 

TTACCACCACNTGCCTACTTGNGTGTACCTGCCCGGGCGGGCCGCTCCTAGNAACCTAGT 

TGGGATCCCCCCCGGGGCTGCATGGGAATTCGATATCAANGCTTATCGGATTACCCGTTC 

GACCCTNTGAAGGGGGNGGGGNCCCCGGTTACCCMGCCTTTTNGNTTCCCTTTTTTAGN 

TNGMGGGGGTTTAAATNTGCNGCCGCCTTTGGGGCCGTTAAAAT 

Sequence 3383 

CCGGGCAGGTACCACCACGCCTAGATAACTTTTTGTATTTTTAATAGAGACAGAGTTTCA 

CCACGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAAGTGATCTGCCTGCCTGGGCCTC 

CCAMGTGCTGGGATTACAGGCATAAGCCAGCACACCTGGCCGTTAAACTAGTTATTTTA 

ATCATGCTACACAAAGAAGACATTTCTCCAAAGGAAATATATAAAGGGAAACACTGTTGA 

AATAAATGGAAAGCTAAAAGTCTCAGCATAGAAGCTATAAAAAGAAACAAGTGGAAATTG 

TAGAACTGAAAATACAATMCAAAATTTACTGGGAAGGTCTCAACACAATGAAGATGACT 

GAGGGAAGAGTCAGTAAATTTGAAG 

Sequence 3384 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGTCAGTTACATGAAAATCCCCTTTGGT 

CACTGAGATCCAAACCTTGCCGGTGTGTCATTGAGCCAGGTCAAGTCTTCGATATGACTG 

TGCTCCTACCCTCAACCCAGCCTTTGGCGTTTGTGCCTTAGTGACTGCAGAAGCCTCTTA 

GTCTTCCCATCTCCACCTTGCTCTATACCCCTGCCTTGACCCTAGCTGGCCAATAGTAAA 

AATCCCCTCCCAAAATATTTTTAAGTAMCAMCAMCMTACAAAATAACGTTTAAAM 

AAAAAAAAAAAAAAAAAGTCCCTCGGCCGCTCTANAACTAGGTGGATCCCCCGGGCTGCA 

GGGAATTCG 

Sequence 3385 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTCACAGTTGAGATAGGATTAAACCCCAATTTC 
AAAGCCCCTTACAGAGTTCCACTTACCAGGTTATTTCAGCCGGCAACCAGGAACTAGACT 
GAATGCAATCCTCCCACATCCAAGGTCCCCCGCGTACCT 
Sequence 3386 

CGGCCGCCCGGGCAGGTACACTAACTGCAGTGCCTCATTTCCTAATTTCCATATAAGCTG 

CCCCTCTGACCTAGAGCATCTTTGTTTTCCACCTACCATMCCTTTAAAGCAAGGGTCCC 

TAACCCCCGGGGCCACAGACCCTGTTAGGAACCAGGTTGCACAGCAGGAAG1AAGTGGTG 

GGCAAGTGAGCATTCCCGCGTACCT 

Sequence 3387 

CCGGGCAGGTACCGGGCAGCTTGACCTCCATTGCTTTTGGC 

TGAAGCTCAAAAGGGCATAGAGTGGACTCTGATCCTAGG Al I ill I I I I CCCTGCTTTGG 
CTGCCTCTGTTTTGGTTCATGTGTCAAGCAGAGACGGGGAAAGCCAAACGACACAATGAG 
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TGTTCTCAGAAAGGAAACTTCTTCGGMTGAAAAGCTTTGGCCACATTGGAAAGGGTAGA 

AGTCTGAGAGAAACTTTTTCATCAGGGAGACTAGGTCCGGTGTCTTAGTCCTTTCAGGCT 

GCTGTAACAAAGTACCTCGGCnnnnnnnnnnnnnnnnnnnnnnnnnCGGGCTGCAGGAAT 

TCNAATATCAAGCTTATTCGATACCCGTCGACCTTCGAGGGGG 

Sequence 3388 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACCTAGGTTACTTGTAG 
GACCCTATACGGCAACCTCCTTTGCCAGGMCTATTTATAAACATCCTGCAGGAAAATGT 
GAAGTAGAAGAGACAGGGATATCCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAG 
GAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAG 
AG AAAAACATTAAAAAATGAC AAGG AAGTTAATGG AAGTCAG C AATG TGATGGTGTTTGG 
AGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCA 
CCTTCTTCCTGCCATATGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGGAGGA 
Sequence 3389 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACTGTGGGCTGGAATCC 

TAGC ACCCC AACCTG CTAGATGTAAG GC ACTTACC I I I I I 1 CTTCCTCAGGTTCCTTTGA 

ACAGTGTCGAGCACACAGGCTTAGCTCCATTATGGTTACTTTGGGAATAGCCTCTCGACT 

CAAATTGGGAATAAGAAATGTCACTGTTAAAGAAATCCCTAAAAAAAAAAAAAAAAAA^ 

AAAAAAANGTNCCTGCCC 

Sequence 3390 

CCGCGGTGGCGGCCGAGGTACTTGCAGGCTACGAGAGGTGATTTGCAGCATGC7TCAGAG 

ACAACCATGCTGAATCATTCGTAGATGTCAAAAGTGTTAGAACTGACATGGATCTTAGAA 

GCCACCTGGCTTTCACCACCCATTTTCTCCTGCTTCCCAATGTCCCGGGAACACACGGCA 

CTTAGGAGGGGACTGTAAAAGCAGTAGCTCCCAATTATAACACCAGTCTCCCAGTTTGGA 

ACTTTGAGGTTAGGTAACTCCAAACTTAGCCTACCCTTGACACTCCCCTCTACCCATTCT 

CTATTCAAACCTCAAATCCCGCGTACCTGCCCG 

Sequence 3391 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTCAACACACCCGTGGGTAATTAAC 

CTGGTCATCCCCACCCTGGAGAGCCATCCTGCCCATGGGTGATCAAAGAAGGAACATCTG 

CAGGAACACCTGATGAGGCTGCACCCTTGGCGGAAAGAACACCTGACACAGCTGAAAGCT 

TGGTGGAAAAAACACCTGATGAGGCTGCACCCTTGGTGGAAAGAACACCTGACACGGCTG 

AAAGCTTGGTGGAAAAAACACCTGATGAGGCTGCATCCTTGGTGGAGGGAACATCTGACA 

AAATTCAATGTTTGGAG AAAGGCGACATCTG GAAAGTTCGG AAC AGTC AGCAGAAG N AAA 

CACCTTAGGGGAAATTACCGAGTTCC 

Sequence 3392 

CGGGCAGGTACTCGGGTGACCCAAGGAGGACCATTTTAAACTTCTAAGTTTTCACCGAGA 

TTAACATGCATTTGTTGAAAGACAAACCTCTTTTACCCTACTCCCCCAGCTGCAAATGCT 

TTCAGGGTTCACATCATGTTGGAACATTTGGTTACAGTGTTTCCTAAACTTTGCGGGTAA 

AAATTGTTCMGTAGGTAAAAATGGAGCACATACAAAAAAAAAAAAAAAAAAAAAA^^ 

AANGTACCT 

Sequence 3393 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTT7 

(TTTTTTTTTI ICCTGI I I ( I GTTAAAGCTTTATTTGGTGACACTCCCTCCCAAN ACCAA 

CANMTTGGGGAAAGGTCCCAAAGAGAGGAATCCAACTTTTAA^AACGGCCTCATTCCAT 

ACCTGAGTCAGGCTGCTNTNTGGCTCGCTCGCTCTCCTNTNTAACAAGAGTTATGNGTTC 

TTTCTGTTCTAGTATCAAAAAAGGNGATMCAAGGGAGTCATTTACAANCCANCCACCAA 

NATGGANACTGAGGCCCACNCTTGAGGTCCCCGGAG 

Sequence 3394 

GGCGGCCGAGGTACACACAAAGACAMCCTGMCTTAATTTCAAGGAAAACTTAAACCCA 
TGCACAAATMTTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGTGAGGACTCAT 
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GCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGGGTAAATTTGGGCAA 
AGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGTCTGCCCACAG 
s TGACATGAGTTTGCCGTTGACTGNCATGTCTGCAGGAAGCTGCCTGCTCCTGTGGCCATG 
TCAAGCAATTCTTTCTTTCAACTGCA 
Sequence 3395 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTAATGAA 

AATTGAGGTTCATTCTATGAAAAMTCTCCCCTTCTCCCCTTCCCTTATTGCCTGTCTTG 

GCMTGGCCAGTAGAGTAGTTATTTAAGGGATTTTG I I I 1 1 11 G 1 I I I 1 1 I G I I I 1 1 I IT 

TGGTAGGGGAGGTGGGGAATTTCTGGGTTTGTCTTTNTTTTAATGTATATAAAAGTTTGC 

TAGTTGTTAGTTATTGNTGTTCTTTCACTCTCAACACAG 1 1 1 1 I I GTTTGGAAGTTGTAC 

GAATCAAATGTAATTGAGGGGACCCATAC 

Sequence 3396 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACAAAAATC 

CCCAGAGCCACATAGACGGGGGTGGTAGTGCGAACCATAGCAATATTGGCATAGGTGCCA 

ATCTGTAGCTCTTCCAGAGACTTGGACATGAACTCTGGCTTGGGGAACTCAGTGATAAAC 

AATTTGAGGAACTTCAGAATGCGCTTGTGGGTGAGGATGTACATGCCGGCTTCCCTTAAG 

TTGATTAGATGGGAAACCAAACACCGTCTTTCAAGTGTGAGGCTGTGCCATGCTCCCTCA 

GGGAGGAGGGGCGCTGGGTCTGACCAGGTCTGCCTCTGCCCTCACCAGCTGGCAGACGTG 

AACTGGAGAACAGCCACCAGAAAGGCTGGTGGTTGGGCAAGCTGCTTCACCGACACAAGT 

TGCTTG 

Sequence 3397 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTCAGTT 

GTAAATGCAGAAATCAGGCCATCTTCTGTGTCAATCATGGTGGGAGCTGCAGACCGGAGC 

TGGAACTATTCAGCCATCTTGGAACGGGAGCCCTCTAGGTGATATATTTTGAAGTCAGAA 

ATGGAAAATGTTGTCCAGAGTTCAGGGGAAGAGGATATTGGGTTCTTTTCTCTGAGGTCA 

CTGTTTCAAAACGTTACAGAAGAAAAGGCTMTATTGCTTCAGCCCTTGCCCAGATCCCA 

CAGAAGGTCAGTAAAACCTCCATTCCTGATAAGCAGCTATTCACATAGTGAGAAAGTATG 

GCTTTTGCCCTACTCAGTCTTGCTGTTATTGGAMCAAACTCTCTTTGATTGTCCGTTTC 

0 

Sequence 3398 

CGCGGTGGCGGCCCGAGGTACCAGGTGAACAAGGTTTACCAGAAAGCAGTTGAGACAAGA 

AMTAAGAGCTCANGATGAGGGAAMGAAAMGATTGAGAGMTTTGTGCCCCCAACCAT 

TTNCTCATGACTCTAAGTAAAGAACCACGCTCTNTCCAGGGCAGGGTCTTGAAAGCTCAA 

CTCTCTTAATTTGCCTCCACCTTCAGGGTTATTACCCTCACTTTTACACCMTTATGNAA 

TTTATTAACCTGGGAAAAAGGAAAGTTTGGGGGGACCACCATTGGTTATTTTTGGGGTGG 

AATNTACCATTTNTTAAAACCACCATTTANGGGANAAAAGGTTTGCCTAATTTrGAAAC 

TTNTTTTAATTGGAN I I I 1 I I I I GGGGGGGGGGAGGTTAAAAGNAANTTAA i Mil IGGT 

NGGATTGGCCAAAAATAAAAAATTATTCCC 1 1 1 1 I AAAATTTTGAATTNCGGACNTTTGG 

NCCCAAAAATTTTTAAG 

Sequence 3399 

ATTNGCGAATTGGAGCTCCCCGCTGTGGCGGCCGAGGTACTTTACAGTTTATATGATCTC 

ATTTGATTCTTAACATGAGCCTGTGAGTGAAAAATTCCTTCCCCTCTTCTACAGATTAGG 

ACGTTGAGATTCAGGGAGGTTCAGAGGGATTCAGGGAAAGTCAAGTGGCACCTGGAGTCC 

CGTGGCTAATTTGAGGCCCGGAGGGGGATTTCGAACCCAGGATTTGTGCTTCTTATGCCT 

GGGCTTCTGCTCTCTGGGGCATGGTCTTCCCCCTAGCTTTCCCATTCACTACTTTAGCCT 

AGGGGTCCTACCCTTTATTAAACTGCCAGTGCCTCACTGCTTTTCTCCCCCAAAGACAAA 

AAAAAAGTG Mil rGCTTTTGTTTTG'l I I I I CATGGGCAGAGACCTGGAATTTCAGCTTG 

AGAATTTGTGCCATATGATAAMTAMTCAACANNATGGCTTmCCCCAAAAAAAAAM 

A 
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Sequence 3400 

ATCCGGCGAATTGTTCTCCCCGCGGCGGCGGCCGCCCGGNCAGGTACATCTGANCCACCG 

NCATCTGTAGTCTATTTNAGTGTCATAGACTTGGAATNGATAGTCCTGGTTTTTAAAAAC 

TTCCCC7TTTGTCMTTCTTAACTCAGTGGATGTATTTAAAACATAAGTCAAATTGTGTC 

ATTCCTCTGCCCCANCCCTTCTGATTGCCTCCCATTTCACTNTGAGTATTGTGCCAAAGT 

TCCTCCTAATAACTGAAAGGCANTAAAACCATCTGGCATGTAACCTCTCCTGCTTAAACT 

TCTGTTTCTTATGTCTTTTGCTCTGTTTCAGCCACACTTGAACTTGTTGCTATTCCT^ 

CTACCCTACATGCTTAAAGATTCCAGGCACACCTNTGCACTTAGCAGTTTCCATGTTGTC 

TGGGATGC I I I I 1 I CCCCAN ATATTNTTCTAGCTTTACTCTTTCC A 

Sequence 3401 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTGCC 

GATGGGGAGGGGGATACAGAGGAAGATGATTACGATGACATTATTGAGCCCCTTCTCTCC 

CTTGACCAAGCTTCTCACTGTGAGCTAGGGCCTGCACCCAGCCTGGGTCAGGCCTCTCAC 

AGTGACTCCGAAATGACATCACAGCGATTTTCGTCAGTTGATGAACAAGCAAAACTTCAT 

AAGACTATGTCTCAAGGAGAGATTACCAAGTTGGCAGTGAGACAGAAGGCTTCGGATTCA 

GATATAAGACCTCAGAGAGCTAAGATGAGATTCTGGGCCAAAGGGAAACAAGGGGAGAAG 

AAGACTACCATGAGTGAAACCTA 

Sequence 3402 

GNGCTCCCCGCGGTGGCGGCCGAGGTAC I (TTTTTTTTTTTTTI 1 GAGAAGACATCATTA 

CCTGTATATGCCCCCTCTTCTCAGTTCCCTCAATGGACATTGGCATTGATGAGCCCATAA 

GTACATGATAAGCCTACAAGGACMTAGTGAGTTTATATTTTTATGTMTGATTTTAGAA 

CATGGGATMTATTCCAGAGGNGATGGAGGTAAAACTATTCACATTTCCATATTCCAAAC 

TACTTCAAGAATCACTAGAGTGCAGAAGTTATAACTCTAAATGCTGAATGGAGAAATCAT 

CTGTCCCAAGAACTACACCCATATGACTTTCCAATAAATCTATCTCAAAAGGCCTACCAT 

TTAACAACAACAGCAACAACAAAACTTTAAMGAACCCAAATGGAAGTTTTTGTATAAGT 

TGGGGGTCTTTTTT 

Sequence 3403 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAGCTTTTGGTTTGCT 

GAGGTTTGTCAGATTTTCCAGCTCAGGGCCCAGCCAGCTGGCAGGAAGCANGACAGAGGT 

CACTTGAATTCAGACCACATGTCCCTGTTAAATACATTAGCTTTTAAATCMTCTTTGTT 

CAAAGTCCAATGAGTTGCAAGCCTAATGCTCACCTGCAGAGACAGAATTCCTGAGTGAAC 

GAACAGAGCAGCTCCTCTTCCATCTCCAGAAATGACCTCCACCTTCAACCCCCGAGAATG 

TAAACTGTCCAAGCAAGAAGGGCAAAA 

Sequence 3404 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTGGATTTGCCAGATC 

ATAGCGAGACTGCTTCTTTAAGTGGGACCCTGATACATTTCTGCTCACTGGGTGAAACCT 

CCCAGCCAGGCCCCCCAGCCATGTCCACCTGCACTTATTAGGGACAGAGCTCTGATCTCT 

CCCTGGGAAGGAGTGCCTGGGGGCCAGGGGAGGGTGGCCACCTGGGTTGGTTGGATGACT 

CAGTTCTTCCAACCAACCTCCTCCCCAACAGGGGCAGAGGCAGTTCCCCACCATGACACG 

GTTCCCCCACCACAATACAGCTCTTTTCTTGAGGCATGGACAGACTGCTCCTTTAAGTGA 

TTTACTCCTTCTCGTGGATTGGTTTCTCCCGGCCAGGTCCTCCCAGTCGGGGCTTCCAGC 

C ACCTTG ACCTGTATTCTGTG G 

Sequence 3405 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTN I I I I I 1 I I I I I 
rTTTTTTTl I GGCCNGGAGAGAGAGAGATGTTTTATT II I GCCAAAGCATGTGAGAGTAA 
GTAGCAAACATCATGACACTTCACCCCTCAAATATTTTCCCACGTATCACTTAANAACAA 
GGGCATTCATTCTGTTACATGACCACMTACCATTCAGCCCAGATANGATNTTTAACATT 
GAGACAATGCTCTGATGTATTGGTATCTACTATGCAGTTCATMTCAAATMGGGTTTTG 
GTAACTTGGG ATC C ATAAATGGTCTGCAAG GTC AGCCTNTGTANACCTGCACTTAGTGGG 
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TGCCATTTCCCTGTCTCAGGTGCCTGCTCTACTGGTGATATTTTTTCACTGGCTCTGTTG 

CTCACTTANA 

Sequence 3406 

CTACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTACCTTGGTTTTCAGGT 
TATCTGACAAAGGCAGCTTTGATTGGGACATGGAGGCATGGGCAGGTTCTCCTTTCTTTG 

gaaacacatccatgctgggatgcacagttcttttcctmggattgtgtttactcctgact 

gcmcgtcttggccactgtttggagcagtttgggtctgaaggtcattaaggatctgcttg 

catcttgtttcatgtcctccgtgacagactgcagaccatgttagaagagaaggaaggttt 

gtggatcaagagggttgacattaggtttggtgagatgagttgcttggcacagatcagcag 

mcaggaccctggtgatmggcattctgggctttgctaaccacttatcatcttccattat 

ttctctttcttaaactaaagtMgaccccgamgtgcac 

Sequence 3407 

nactcctatagggcgaattggagctccccgcggtggcagcggccgcccgggcaggtacgc 
gggattgagagcaggtamttttttaatggaactatgcamgaccmtatttcmtattg 
gacattttgatggtccttctatccccaatgcttta i 1 1 1 i i 1 caactttgatgaaaagat 
ttgatctagatgagatactcccatcagaatcccaaccaaaagggactatgmgaaagttc 
ttgggattctgagagtctctgtgagactgtttcacagaaggatgtgtgtttacccaaggc 
tacacatcaaaaagamtagataamtamtggaamttagaagggtctcctgttaaaga 
tggtcttctgaaggctaactgcggaatgaaagtttctattccaactaaagcc ttagaatt 
gatgggacatgcaaactttcaaaggcagnagccctcccgnagaangcccatctgccctt 

Sequence 3408 

GCGAATTGTNTCTCCCCGCGGCGGCGGCCGAGGTACTNTNI I I 1 1 1 J 1 1 1 1 UN I I I 1 1 I 

TTTCGGAGGATTTTTGAAATAGAGGACTGGGAATTTGAGCAAATTTTGAGGMTCTTAAA 

TTGCTATTAGGACACCMTATGACGCAGCMTCMTCATTATTTTCTAGGACTCTGGTTA 

C C AAAGTTATAG AC AGCG AG CC AATTCTCCATTATTTTGGCAAAAGATAATGCAAATTTA 

TGTTACTGTTTGCTAGTTGATTTGATTCACTGCCTAAAATAMCTGATTCCTTCTGTTT^ 

AAAAAATCCATTTTGAAGCCACACNCAACAAAMTCATTCAGTTAGTTGAAm 

TAATTAGACTATGGGNGAAGACACCAGATATTTTAAAAAGCAATCCAAATGACTGAGTTT 

AGAAA 

Sequence 3409 

CCGCGGTGGCGGCCGAGGTACCATTGTCCAGGGAGCTCTGAAAAGCATCCTATGTCGCAC 

CCCGGGAGAGTTTTACCACAACACTCTGAGCTTCCTCAAGGTAAGGGCAGCTCCTGCCCT 

CCACTTTGGGAGTTTGGGAGTCAGAGAATGCCGCCCGTGGAATTAGGAATAATTGGTTCT 

CCTTTCTGGAGCCATGTTCCAATATTAGGAGACGGCATCGGTGTAAGCATCCACACACAC 

GTTTCTCACTTCCGGCCATTCCCTCTGCCCCTGGTGAAMGCTTCAAGTMTAGAGCTTT 

TTCTTTTTCACTCCTTATGCCAGAACATCACACCTACCGTTCCCACTCTTCAATGAGGGG 

TCTATTTTTAATGACTTTGGCTTTTTCTTTTACTT 

Sequence 3410 

ATCCGGCGMTTGTTTTCTGACAGCGGCGGCGGCCGAGGTACATGTTCTTGGCATCCTTT 

CCNTTAAAGTAMCANNTTTAAAGCCACACAGACTTTAACACATTATACCTTAATTCCAT 

CAACAGTGTCCTGAGACTGAGGATAGATAAAATACTTCAGGTCAAAGCAGAGCCTGTGGA 

ATTTTMTTTAMCAATAGATGAGAAAATGTCTTTTGTTGGAAAA^ 

GATGTTAAAATATACCCTGTTTATTATGAATTATTTGGGMCGTAMCATTACCTTTTAT 

CTTACAGACCAGACTGACAAGGCCACCGAGTATGTGCCCCACCTGCTGTGAGCCATCAGC 

TTGCACAGGTACCTGCCCG 

Sequence 3411 

TATAGGGCGAATTGGAGCTNCCCGCGGTGGCGGCCGCCCGGGCAGGTACATACACGTAAC 

GCTTCTGCCTTCCCAGCTGCTGTNTGGTTAGTGGMCTMCAATTTCCTTGATGGAAATT 

GAATMGGTMTCCAAAAAGAAGGCTCTTTTAGATGAAAAAACATGGATAGATTATTAAG 



Page 520 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



CGCCAAGTTTTCTCTTCAACCTACTGCTCACCAGTGCTCCC TCCTCAGCTCCA AGGCTGT 
GGAGGGCTCATGAAGTCACAATGTTCTACTGAGAATTTAGC l ! I I I I I I ! t I I IGGAGTG 
GGTATACCATTTTGTAATTCCACTTGATAATGGCATCTGATT ATTTG ATCATGACATCAT 
TATTTGTATACAATGCAGCAAGAATTTTGTTTTTAATGTAGGCTTTTAAATG 
TGGMCTTGGTTCCATAGAAGGMTCCCAGATAAGGCTTTTTAAAAG 

Sequence 3412 

AATTGTTNCTCCCCGCGGNGGCGGCCGAGGTAC I U M I I I I I k t I I I I I I 1 1 I I I I I I N 

TGGAGGATTTNNAAATANAGGACTGGGAATTTGAGCAAATTTTGAGGAATCTTAANT^ 

TATTANGACACCANTATGACGCAGCAATCAATCATTATTNTCTAGGACTCTGGTTACCAA 

AGTTATAGACAGCGAGCCAATTCTCCATTATTTTGGCAAAAGATAATGCAMTTTATGTT 

ACTGTTTGCT AGTC N G ATTN G ATN C ACTG CCTAAAATAAACTG ATTNCTTCTG NTTT AAA 

AAATCCATTTTGMGTNACACNCMCAAAAATCATTCAGTTAGTTGAATTTGCmATM 

TTAGACTATGGGTGAAGACACCAGTATATTTTAAAAAGCAATCCAAATGACTGAGTTTAG 

AAAATTAAATGAGA 

Sequence 3413 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGACTGCGAGCG 

GAGAAGCGGAGTTTGCAGCCTCGGAATTGGCTAGAGCGCCAGAGCCGAGTCAGCCATAAA 

GCTACGCGCTAGGCTCTTGGCCCTGACGTGAGGCGCGCAGAGATGGCGTAACGGGAATAG 

TTTTCAACGTCTATTTCGAGCGGCCGCCCGGGCAGGTACACAGAGCTGAGGATGAAACAC 

ACCCAGAGTGGACAGAGCACATCTCCACTGGTGATAGATTATACTTGTCGAGTTGGTCAA 

ATGGCCTTTGTTTTCTCAAGCCTGATACCTCTGCTATTGATGACACCTGTATTCTGTCTG 

GGAMTACTAGTGAATGTTTCCAAAACTTCAGTCAGAGCCACAAGGCGTATCTTGATGCA 

CTC ACC AC C ATC AGCC ATG 

Sequence 3414 

CCGCGGTGGCGGCCGAGGTACATTCCAAACACTTTTTCATAGAATGGCTGTAAACATGCT 

CCCTATTTCTGCTTGACATGTTACATCACAGTGCTCTAAAAGCTCTGTCATGCGTATTAT 

ACTCTCAGCATCCTGGATAATAGAGTTCATCTCCAGTTCAAATTCTCCACCAACCAGCTT 

TATCTTGTTTCCTGCAAAATCATGGAGGTTTGGGMGTTCGTTTAGACCCATTCTCGTAT 

GGAGGTTTGTTTTCTTC I I I it I CTTTCCGCGGAGACCTGCCTCCTACTCCACCATCACA 

TGGAACCCACCACTGCTTCTCCGAAGCTCGCTCTCCCCGCGTACCTGCCCGGGCCGGCCG 

CTCTA 

Sequence 3415 ^ 

CCGCGGTGGCGGCCGAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGGAAAACTT 

AMCCCATGCACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGTGAG 

GACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGGGTAAATT 

TGGGCAAAGCACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGTCTG 

CCCACAGTGACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCTGTG 

GCCATGTCAAGCAATTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTCTGAGAAG 

AAATGTTCAAGAAGCTCACTGTGGCTGCACATCTGAGCCATGTCTTCCCATTAANTTGTC 

ATGAGTCAGCAAT 

Sequence 3416 

ATCCGGCTAATTGGTTNTCCCCGCGGTNGCGGCCGAGGTACACACAAAGACAAACCTGAA 

CTTAATTTCMGGAAATTTTAMCCCATGCACAMTMTTGGTGAGCCTTCATTTCCCTG 

ACTTCAAGTTTCCATGTGAGGACTCATGCTCTCTCCACTTTCTTCTTGGGAGGAGGGAAG 

ATTTACCTAATGGGTAAATTTTGGGCAAAGCACATTGAG TGTG CTTGTTNNGGCTCTGAG 

TCTCTTTTGCAMCATGTGTCTGCCCACAGTGACATGAGTTTTGCGTTGACTGTCATGTC 

TTGCAGGMGCTGCCTGCTCCTGTGGCCATGTCAAGCAATTTCTTTCTT 

Sequence 3417 

AGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATGAAGTCAAGGACCTGGTAGAGGC 
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ACCACTAAGAAAGGCATCTGAAAGGACCAMGAGAGTGACCAGCAAGCATTTTTTGCAAG 

GCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCACCCAGTGGTGAAAAGCATCATCT 

GGGTTACCTTGTGCTGCCATAAAACACACCACANACTTGGTGACTTAAACCACAGATATT 

TATCTTNTCACAATCCTGGAGGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTT 

CTGGTGAGGGCCTCTTTCCTTCTCACTGTGTGCTCTTTCTTGTGCATGGAGAGAGAGCAT 

GAACAAGCCCTC 

Sequence 3418 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAGCAGCCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATT 

CCTTGAAATTTGCAAGTTGTCTGGGTTCATGTC 

Sequence 3419 

CCGCGGTGGCGGCCGAGGTACCCTACTGGCCTATGCCGAAAGCACAGGAACTGTGAGGGT 

GTTTGATCTCATGGGMGTGMCTCTTTGTCATTTCCCCGGCATCTAGTTTTATAGGTGA 

CTTAAGCTATGCCATTGCTGGGTTGATATTTTTAGAATATAAAGCAAGTGCACAGTGGTC 

TGCANAACTCCTGGTCATCMTTACCGAGGAGAACTTAGAAGTTACCTTGTAAGTGTTGG 

AACAAATCAGAGCTACCAAGAAAGTCACTGTTTCAGCTTCAGTAGTCATTATCCTCATGG 

AATCAACACAGCTGTTTACCACCCTGGTCCAGACTTTTACTTGTTGGTGGATGTGAAACT 

GCTGAAGTAGGCATGTCAAAAGCTTCTAGCTGTGGCCTTTCTGCCTGGAGAGTT 

Sequence 3420 

CCGCGGTGGCGGCCGAGGTACTTTCACAGCMCATTTAGACTAGTGTTTGACCAACAACT 

GGGCACCACGGTCAAGCCAAGTTGGCATATAGAATTAGCCAGCAAACTTGGTATTCACAA 

TCTGTCATTATAGTCTTATCTATAAACCTAAGGTTTAGAGAAACGCTGGTCATTCTGAAC 

AATTCCCTGTGAATGCAGAGATCAAGGAGCCTCCCCTTAACTGGGTAGGCAGTGTGAAAT. 

ACCTGCAGTGTGCTTGACCTTACTGTTCACCCACTGGCAGCTGTTCCTACAGCTTGGCTT 

CAGATATATAATATTACATTGGCTAACTTCAAAGAGCATTGTTTTCCAAGGATAGAGATG 

GTG ATGGTGTATATTACTTTTACC AG CCC C AG ACCTGTAAG AG GTTCTC ATCTCTTTTGT 

TTAAGATTTTGTTG I I I I J I J IAATG 

Sequence 3421 

CCGCGGTGGCGGCCGAGGTACGCGGGGACACGTTTCCCGTCAAGATGGCGGATACTCTCC 

CTTCGGAGTTTGATGTGATCGTAATAGGGACGGGTTTGCCTGAATCCATCACTGCAGCTG 

CATGTTCAAGAAGTGGCCGGAGAGTTCTGCATGTTGATTCAAGAAGCTACTATGGGAAGG 

AAACTGGGCCAGTNTTAGCTTTTCAGGACTATTGTCCTGGCTAAAGGAATACCAGGAAAA 

CAGTGACATTGTAAGTGACAGTCCAGTGTGGCAAGACCAGATCCTTGAAAATGAANAAGC 

CATTGCTCTTAGCAGGAAGGACAAAACTATTCAACATGTGGAAGTATTTTGTTATGCCAG 

TCAGGATTTGCATGAAGATGTCGAAGAAGCTGGTGCGCTGCANTAAAAAAANNANTNAAT 

TAAGTACCTGCCCG 

Sequence 3422 

CCGGGCAGGTACGCGGGTGACCCAAGGAGGACCATTTTAAACTTCTAAGTTTTCACCGAG 

ATTAACATGCATTTGTTGAMGACAAACCTCTTTTACCCTACTCCCCCAGCTGCAAATGC 

TTTCAGGGTTCACATCATGTTGGAACAmGGmCAGTGTTTCCTAAACTTTGCGGGTA 

AAAATTGTTCAAGTAGGTAAAAATGGAGCACATACAAAAAAAAAAAAAAAAA 

AAAAAGTTCCT 

Sequence 3423 

NAAGCCTTCACTGTCAACTTCGGGGACACCGAAGAGGCCAAGAAACAGATCAACGATTAC 
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GTGG AGAAGGGTAC ATTGG ACATCCTG G ACTG ATTCTGTAATTC N CGTAT C 1 111! tCTC 
CCATGTACCCTTTATTTGNCNTTAGGGNTCTGCTTTTTGGGAGGATTTTTCTT^ 
TCTACCTGAGGCTCAAGACCTNATCTGTTAAATGCTGNTAAAAGGACCT 
Sequence 3424 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTACTCACCCTTCCTC 

TGACAGAAAAGGATGAAGTCAAGGACCTGGTAGAGGCACCACTAAGAAAGGCATCTGAAA 

GGACCAAAGAGAGTGACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACAGCTTCCAT 

GAAAGGCTGGACCACCCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAA 

ACACACCACAGACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAATCCTGGAGGC 

TGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCTCTTTCCTTCT 

CACTGTGTGCTCTTTCTTGTGCATGGAGAGAGAGCATGAACAAGCCCTCTACTGTCCCTC 

TTAGAAG 

Sequence 3425 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGAGATG 

CTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTA 

TAAACATCCTGCAGGAAAATGCAGTGAAGTAGAAGAGACAGGGATATCCCAGAAGGTTAT 

GCAAAACATNAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTNCCACCTTG 

CCCMCAGTAGAAAAAACATMGMGAGAAAAACATTAAAAAATGACAAGGAAGTTAATG 

GAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTG 

TAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCG 

GCTGTCTGCAACCTGCAAGAGGACCCTNACTAGAAGTTAGCCATACTGG 

Sequence 3426 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCATGTTAGCTGTGCA 

TGTCCACCACGCTTTGTCTGTAACTCGAGTAGAAAAAGATGTTGTGTTTTAATTAATCAT 

TCCTTACAATTCAAGATGAACTCCACATACCCGCGTACCCACTGAATGTTGTTACTACAT 

ATTGAGAGTCATTTTATGCATATGCATTCTACCTTTCCTGCTTTATGAGTATTTTTAAGC - 

TTTTAGTTCAAGGTTATATTCANAAMTATTTCCCAGCATAATGATACATCGTAGCCTAA 

GAMTATTTTCTCAATGTAATTCCCTTCCCAGCTACCCAAATGCTACAGAGAAATGTTTT , 

CTACTTGGCCACTATCAGGGTTCGTCATCTATTGTGTTGACTATTMTGGCTTTTTGATT 

GGGTAAGGAT 

Sequence 3427 

AGGTAC I I II I I I I I I I I I I 1 1 I I I I I I CCTATCCAGAGCCCTACTTGATCAAAAACTGG 

TTCTTTAATCCATTGCAAATTATCMGTAGCTTTTAATAACTAACACGGCTATAAACCAC 

CATAACTGTTCTTTTTTCCATATCAATACTATTCACTTACTAAAAAAAAATCTGAGGGTA 

TANAATATGCTGATAACTGTGCTGGGTATTGGCGACACCAGGGAACAAAACAGACGAGGC 

CTCTGGCTCTTCTAAl'GGACAGCTGGGTGCAACTCTCTAAAAATACTACTTCATGATTTG 

ACAGGATATTTGCCAAAGTCCTTCACTACTCTACACAACTATAACAACATTTTC 

Sequence 3428 

GCTCATACAGCTCA^ACAGGAGGAAAAGCATCTCTGGGAGCAGGCAGCCTCATGTGTCTG 

ATTCGCTGCTCAA T 1*CCAGGCAAAACTACAAGAATTACTCCATGGAGTCAAGGACACACT 

TTCGAATCCAGGCAGCCACACTCCAAGCCAAGTCTTCCCTAGAGGAACCAGCAAGCAGAT 

CCATGGTCTCATCAGTGTCCTTTGTAGCTTTTGTGCANGCCAGATTCNCGGGCCTGGGAA 

GACAGNAGNANCAGCTGGNACAGMGCCTCCTGGAAAACCATTTTCAAAAATACCCCCTT 

CCCCCCTGCCCCTTGCNCCTGGCCTTTGGGGGGTCCACCCGGGCAACTTCCCAGGTTGGG 

GATTGGGGNACCAAACCAATTAAGNTGGTTATTCCCGNTGGCCAAGGAAAANCCCCGNAG 

CCTTNGGGCAAi I I I I I CMCCCANGTTGGTTAAGGCCCCAAAGGGGGCCCCTTTTTGGC 

CCCAAAGGGGGCCCAGNAAGCCAATGGNTTGGGGAAG 

Sequence 3429 

ATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCACCCTCCC 
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TACCCCCTTAAAAACCCACTAGGAAAGGAAC3AAGGGGTGGAAACCCATTAGGAGGGAGTG 

AGCAGGCAGATTTGCCCTGAGAGTCTACACCCCTGGCCTGATGCACAGAGAACAAGGATG 

TCCTAGTGTGGCACAAAAATCTAGG G GCC ACCCAAAACTGCTCTCATGTTCTAAAGGTG A 

CTGTGTGGGGTTTGAACC7TGGCGCCCACTGGACAGTGGTGAGCACTCACCACCCTGAGG 

CCTGGCAGCCTGTCTGTCACTGGAGAGCCGTTC 

Sequence 3430 

AGGTACACGTCTCTGTCCGGGCCTCGGCCAGGGTGCCGAGGGCCAGCATGGACACCAGGA 

CCAGGGCGCAGATCACCTTGTTCTCCATGGTGGCCATTGCCTCCTCTCTGCTCCAAAGGC 

GACCCCGAGTCAGGGATCCCCGCGTACCTGCCCGGGGCGGCCCGTCTCTAAAAACTAGNT 

GGNATCCCCCGGGCCTGCAAGGAATTCGATATCAAAGCTTTATNCGATACNCGTCTGACC 

TTCGAGGGGGGGGGCCCNGGGTAACCCAGCCTTTTTGTTNCCCTTTTANTTGAAGGGTTT 

AAATTGGCGCCGCCTTGGGCCGTTAAATTCATTGGGTTCAATAANCCTTGTTTTTCNCTG 

GTGGTTGAAAAMTTTGGTTTMTTCCCGGCTTCACCAAATTTTCCCMCCM 

MTTACCCGAANGCCNCGNGGGGAAGCCAANTAAAAAANTTGGTAAAAAAGCCCCCTNGG 

GGGGGTTGGCCCCTTAAATTGGAAGTTGGAANGCNTAAACCTTCMCCAATTTAAATTTT 

GGCGGTNTNGGCGGCCTTNCMCNTTGGCCCCCGGCCTTTTTTCCAANNTTCCGGGGAAA 
AAACC 

Sequence 3431 

AGGTACATTGTAGGATGCTTACCAGATCCTGCCCACTAGAGGCCAATAGCACCTCATCAT 

CATAGCACCCCTTCCCTCAGTGATAATAAAAAATGTCTCCAGACATTGTTTAAGTTAATA 

AATGTGCCCAGAGCCCAGGCACAGTGGCTCAGGCCTGTGATTTTAGCTCTTTGGGAGACC 

GAGGTGGGTGGATCACTTGAGGCCAGGAGTTCCTAGAACGTTCCTCTGTCAAAAGACAAA 

ATTACAACAATTTAAAATCTAATTGGCTGTTATTGGCCATTCTAGMTTGGTTGACATCT 

CATTCTATGAAATACAATGAGTGCTCTTCAATAAGCTGAGCAGAGGGGGGTTAGCTTTAT 

TGGCAGAAMGGGCTAAAGAAAACAGAAAMGAACAAAAAGTGATTGGTTGTTTCAGAGT 

TACTTTCCTTAAAGAGG 

Sequence 3432 

CCGGGCAGG TACGCG GGGTGGAGACATGGGATTTCTCATGGAAAGTGAAAATAACTCACT 
CAGAATAATGT TTTTT CAAACTCTTCTGTCTTTCCACAGTATGTTCTTGTTACTGATTGT 
CAGAAAGCTGGTTTTGAGACTGCAGCTTGGACTAAAGTAAGTTTTGTCTAGAATTGTTTC 
TCTATTC AGTATAG ACATTTTCATG CTATGATTTATTTCTAAGCCTAG GGGAAACAATAA 
ACAACATCTGGCTATCTTATTTCAMTTTGTTATGTAAAAATTGTGGATTGATCTTTATT 
GGAGCAMTTTGTAATCCAAATTCTCTTGGTGATTTATTTGCTTGGCACCTAATACACCT 
CTAAGCTAACAACAGTCATCTNATTTCAMTGTAAAGCCTCATTGTAAAAACAAA 
Sequence 3433 

CCGCGGTGGCGGCCGAGGTACCATGCGGTCAGCCAAATGTGTMGGAACTTGTTTTGAGA 

G GAGAG AAGAAAATTCTATAGTAGGCGAGACAGGCTTCTCTCTCCTGCTCCTCCTCCCTT 

TCTTTTTCTCTCTCTCCTCTTTCTTCCCTCCTTCTCTCCCTCTTTCTCTTTCTCTCACAC 

AGTTCCTACATTTCTGC/\CTGGGGGACAAACCAGGCTTCTGATCAAAGGCAATCCGAGAC 

TTGGGTTCAAAAGCTTTGATGAAGATTGGTACCTGCTATGCAGATGGGGGGAGGGAAGAA 

GAGACTAGCCTAGGGGCTTTTATATTTCTTTAAAGCATCAGCCATTAATGGGCTAAGTTT 

ACCACT GGGAC CCAGAGAGCTAGTCCCCAAACACAAGCTCAAAATGTATGCATTTTCCAA 

TTCATGGTTTTCAAGGACATGTTCCCCCGCGTACCT 

Sequence 3434 

CTTAGGGCGMTTGG AGCTCCCCGCGGGTGGGCGGCCGCCCGGCCGGGACCTTCTNTTTT 

i i \ i n n 1 1 1 n m iatgtatncatcactcaaattttattaaaggcagcacaatagcc 

CGCTCA) I I I rTGAT TCAGAAATAAACTTGTAACTAAACCATCTNTCTTNAAATAGTACC 
G C AGACTGTCTTTTAAAG N AGCTCATCTTTATTTG ATGCAATCTAAAAGGCCC AAACTCT 
GAAAGAAAATAANAAACTATTGTTAACTATTATTAACAAAGTAATATATCAAATAATTCA 
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CTCAAACTCCTACTTAAGAAACGTGTCATCACCAATAAGATAGTAAAGGCTCTTCTAATA 

TTGCTTTGTTGAGACAATTCATTTCATCTT 

Sequence 3435 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTTAGACCTG 

ACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTT 

TGCCAGGAACTATTTATAMCATCCTGCAGGAAAATGAGTCAAGGMGCTTTTCTTTTGA 

GCTATTTACAGCTTTTAGCMTTGAGTAAAGTATACTCCTGTGAACAAAATTTGGAACAT 

AmGTTTCTCTCTMCTGATTTCTCCAGAATTTGGMCTAGTTAACCAGGMCCCTCTC 

TTGGGGTGTGGATCGAGACCCCTTTCTGGTAACGAGAGCACAGCTTGGTTTTATACATTT 

TAAGGAGACATGAGACATCAATCAACGTATATAAGATGAACATTTTGATCT 

Sequence 3436 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGCACT 

7TACTTGTTCCAGAACTGGAGCAGGATTAAACTGGTCGAGCTTCTTGGAAGAAAT7TTAT 

TTTGTTTATCATCTTTGGCACCATGGAAGAAATGCAGAACAAAGCTGTGGTTTTCTT^ 

GTTTTATTTGTGGAGTGCCANTTGAAAATTTTCAGGTGGGTAGAAATGCCAGGATGGTGT 

TTGGAATGCTGCTCCCTGT7TGCAGCAGCTCTGTTCTTGAGGGAGTTGTTATCTTAGCCA 

GAGTCTCATTTGGTTTGGGACTAAGATTACATATGTGGAGAATTCTGGTTGCACCTATGT 

GCTCTGCAAAGTTGGANNCANA i Mill /AAA 

Sequence 3437 

AGGTAC II i 1 I I 1 1 1 I i 1 I I I i I 1 I 1 I I I I [ I I 1 I ACAGGGNTGTCTTTCCCTCTTTGAC 

AACCCCCATTCCAGCTTCAGCCCTAACATGACAACATAGCTGCAGCAATCTTACGTACCT 

TCCATTTCAAACATTCTTGNGGAACTTAATTGTATACTAAATCGTATGACGTGCCAAGCA 

CTGAGCTMCGGGTTTATATAAGAGGCACAGCATAATGGTAAAAGCAACTGGGGAGTACA 

ACTGAGTTTGAATTGNGCCATTANACAGCCTTCANACTTGAAGTGCAACACCGGCTCCTC 

CCTGGGTCTCAAGNGGATTGGGTCTCTGNGGATTTTAAACT 

Sequence 3433 

AGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGAC 

CCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCANGAAAATGAGTC 

TATATGTC AG AATACAC ATTTNCC ACCTTG CC C AAC AGTAGAAAAAC ATAAG AAG AG AAA 

AACATTAAAAAATGACAAGGAAGTTAATGGAAGCAATGTGATGGTGTTTGGAGGTGGAGC 

CTTCATAAGGTAATTAATGCCCTTGTAAGAAGAGGCCANAGAGCTTGCGCACCTTCTTCC 

TGCCATGTNAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCA 

Sequence 3439 

CCGGGCAGGTACTGTCTGTAGTATCAGCTAGAGCTGAGGGTAGCTGCTCTGTCTGGATAG 
CTAGGC AGGC ACAAAATGGG AGAAG G AAATTATTCAGC AGCCTTGAAGCTTCAAAG ACTd 
AGAGAAAGGAAACTGAAACTCAGTGTGCCCAGCCCACTTGACAGCATGCCAACTTNTCTC 
MCAAAGGGAGCATMGCTGATCCAGCTTTTAGACAGTAAAATTTT^ 
. GAAAGAATCAAGGAATTGAAGTCAGAGGGAAGAAAACATAACAGCTCACTNACTTTTAAA 
AMTCTTTTTATTTAAGGAAAATGCAACCAAT 
Sequence 3440 

AGGTACTATTGACATTTGGGGCCAGGTAATTCTTGGGGGCAGTGGGGAGATGCTGCCCTG 

GGCACCGGAGGATGTCCAGCAATATCGTCTACCTACTAGTAGTCAGCAGCACCGCACCAA 

GCTGTGACAACCAAAAAGGTCCCAGACATTGTCAATGTCCCCTGGAGGACAGAACTGGCC 

CTGTTTTACAGGAAGATAMTGTCCCCTAATCCTTTTGTGTAAAAGGGGGAATG M T I T I 

ACCCTTAGACCGGATAAGCAAGACCAGCTGAGACCTACTGGTATAAAGCATACTGCCTGC 

TCACATGGTGAGAACTCAATAAATGTTTGTATCACTACGTAATGAGAGAT 

Sequence 3441 

AGGTACTTCCTGTAATTGACAGCCTTGCCTTGTTCCTCATGGCATCATTCAAGGTCATCT 
TAAATGAGAGAGGAGGGAMGAAAGAAAAAGAAATCATACGTTATGGTTTTCAAATGCAC 
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CCAGAGGAAAGGCAGGGTTGTTTTTGAAAAGTTTACTTCTATCTGAAAGCTTCTAGCAAA 

TGAAAGCAAACAGTGCAAAGCTGAACTACAAATAATAAAGCACAATTATGGCAGGAATCT 

GTCCACATTCACCCAAGCTACTGTGCCATTAAAGAGGAAAATAGACAGTTGAGCTGCAGG 

TTAGTCACCAACTGGGAACATGGAAGAACGTCAACAATACATACAGGA 

Sequence 3442 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACGCACAAGC 

AGGGCCCAGCTTGGGAGACGCTAGGATAACTAAGACCTGGTCAGCGCCCTGAGGAGTCTT 

GTCTGGATAAAGGGAGACACACACTAGCTTGGACTTATGCGTGCAGCGGAGCTGTGTAGA 

GCAGGAGTGCGTGGAGAGGGCAGCAATGAACTTTGGCTGGAGAGCCAGAGAAAGCTTCGT 

GAGCTGGAACACGGGGTAAAGTAGGAGCTTTCCAGGGCGGAGGGGACTTCTGAAGTGGAG 

GAAACTGCCTGTTCAGGACATGGAGGTAGGGATCAAAGTTCTGAGTCATGTGGGTGGGCA 

CAGAGCAGAGGATGGCTGGGGCGGTCTCAGGGATAAGGAGAAGTTTGGGGACCAGACTGT 

TTTGTCACCGTATCTGCATCCCTG 

Sequence 3443 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAAGTGGCCATGTGGA 

GCTAGTGGCCACCACACTTGACACCACGGATGCAGAACTTTTCCATCACTGCAGAAAGCT 

CTGTTGAACAGTAC I I I M I I I I I I I I I 1 1 1 1 I I I I I I I I 1 1 NCCGCATTANATCACTGN 

GGGTNTTAAACAMTTTCTTNAAAAGCGAACTTTGGGTTTTTACTTTNGCT^ 

GCTGTAGGCATCTTTCAGCTTTATGCANAGGACTGANAATGCAAAGCATTGACTCAATCA 

GGATCCTTTTNTCAGACATAATTAAf4AAGGCTTTGACAGGTTGAANATTCAGCCCCAGGT 

TCCCGNGTACCTGCCCGGGCGGCCGCTNTANAACTAGGGGATCCCCC 

Sequence 3444 

CCGGGCAGGTACTGCAATAACAAAATACAGCAATAAAACAACTGGACACTCCTAGGGGAC 

ACCAAAGATAMGGGCCCATTMTCAGGTGGAGGCCANAGAAACCGGAAGGCTATTTTAA 

AGTGTTTAATATAAATCATGTAAAGGGNAAAAAAA^TCCTTCCCCCAAAAACAAATTNA 

AAGTNCAAAGCCAAA I I I I I CCATTANTCCTTAATATTGGAMCTTTAATTTTTCCATTA 

TTCCTTMTTATNGAACCTAAAAAAAAAAAATGGAAAAAAMTTGM 

TTGGGCCAATT1TAATTTTGGCCTTAACCCGCCCGGAAAATMNTTCCCMTTMCCCTT 

TTTNCCCCCCCCCTTTTGGAATTAATTNTTTAAAAAGGGGMCCMTTGGTTNGGGGNGG 

AAGGAAMCCNTTTTTTAATTAAAANAAC^ 

CACCCAAGGNAAAAGGGGCCAACCCTTNGG I I I I I I I I GGNGGGCCCANCCTTTANNGCC 
CANAAAAAAAAAGGGGGAACCAANGGTTTTAAAA I I I 1 1 I CCAACC AATTGGGAAAATTC 
CAACCGGNTGGGAAAANCCTTTTTTAAAATT 
Sequence 3445 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGGAGGAGT 
GGCATTGTCCAGTATAGCCAGGCCAGTCTGTGGTGGGCTCAGGCTGT7TCTGCTGGCTTG 
GTCCTGCTGTCCTGGACCCTAGCATGTGTGCAGTCCCCGCGTACCT 
Sequence 3446 

CCGGGCAGGTACAGTAGTGGCAACCAGCCGGGAAAATCC/.TAGGATCATTTCAGACCGCA 

TTCAACACAGATACTCATGTAGCCCCGACGTTTCTTGTGAAGGGAATTGATGTATTGCTT 

GAGGGGGTCGATCTCTGGTTCTAAGGTGTTTTGGTTTGTTTCCATCGTGACTACCAGAAA 

ATTATAGAAGGCCAGACGCAGTGGCTCATGCCTGTMTCCCAGCCAGTTTTTCTAATTAA 

GTTTTAATCTGTCTCCTCCACTAGGCCATGAGCCCTAGGAGGAGATTGTCTCTAGAGGGT 

ATCCCCTGTGCCTGGCACATGCATGCAAGTTATGATTTATGTTGATGAATGAATGAATGA 

ATGAATGAATGAATGGTGGATGGGTCCTCT 

Sequence 3447 

AGGTAC rTTTTTTTTTTTTTTTTTI I CGTCACTACCTCCCCGGGTCGGGAGTGGGTAATT 

TGCGCGCCTGCTGCCTTCCTTGGATGTGGTAGCCGTTTCTCAGGCTCCCTCTCCGGAATC 

GAACCCTGATTCCCCGTCACCCGNTGGTCACCATGGGTANGCACCGGNAGACNACCCCAN 
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TNCNAAAAGTTGTATANGGGCAGNACGNGTCGAAATTGGGNTCGNTCTGCCCGCCCACNG 

GNGGGGGNCAGTTGTCNAATACGGCNCCGCCCCGACAGTACNCTGCCCCCGGGNCGGGCC 

GTNTCTAANAAACNTAGTNGGGATTCCCCCCGGGGCCTGCCAGGGNAATTTCGATAATCA 

MGTCTTTNTTNTGAATACCCGGTCNGAACTCTTTNAAGGGGGGGGGGGGCCCCCGGTTA 

CCCCCANGCCI III! IGGTN 

Sequence 3448 

CGGGGGCCGGGCGCTCGGCTGCGGCGAGNCGGNATCAGCNNCACTCAAAGGGCNTGTTAA 

TACCGGNGTATCCACCAGAAAATCAGCGGGGGATCAACCGCANGGAAAANGNAACCANGT 

NGAGCCAANAAAGCGCNCAGCCAAAAGGGNCCCAAGNGCAAACACCGCTAAAAAAAANGN 

GGCCCCGCCGNAANNGCCTNGNGCCGGACNMTTTTNCCCAATTAAGGGGCGNCCCGGCC 

CCCCCCCCCNGGGAACGNAGGNCAATTCCAACCAAAAAAAAAAANGNCGGACCGGCCNNC 

CAAAAGATNCCACGNAAGGGGGGGNGGCCAGGAAAAAACCCCCCCGGAACCAAGGGGGAA 

CCCTNAANTAAAAAAAGGGAANTAACCCCCAANGGGGNCCGGGTNNCTNNCCCCCCCCCC 

CTGGGGGGAAAAAGGGCCNTCCCCNCCCATNCNGGAGNGGGCCCGNCCNNNCCTTNCCCC 

CTGGGNNATNNCCCCGGCAACCCCCCCNGTGGGGCCCCCGGCCCTNTNTAAACCCCCN 

Sequence 3449 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCGCGGGGAGAACTTGTGTGCTTAGACCTG 

ACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCCT 

TGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACAT 

TTCCCACCTTGCCCAACAGTAGAAAAACATMGMGAGAAAAACATTAAAAAATGACAAG 

GMGTTMTGGAAGTCAGCMTGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATT 

AATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGC 

CAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGC 

ATCCTCATCTTGG 

Sequence 3450 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI M II I I 1 1 I II I I M I 
ITTTTTTTTn I ATTTTGGTTTGTTTAGGAGTACACGTCCCTAGAAGCACATAGCTCCAG 
CAACAAAG CTAATCTG AG AGTAATAC CC AAG ATGCC TCCTACCTTGC AAATTTCAAGG AA 
TTTCTCACTGGTGTATTTTCATAAGGATGCTCAGTGAAAGTAGCATAAGGAACTTCAGTG 
GACCATGGGTTCCAGCGGGACANAANANACTGCTCCTCCGGACTCCCCCAGTAGATCCTA 
AGGCCTTCTCCTTGGCTCTTGNCCAGGGACATCCCAGGGAAGGNGAACTTG 
Sequence 3451 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTGCCCAAAATGTT 

TMTGATTTCCAG7TTCATTAACTAGTAGATACATTGAAAAATATAAAATTAAATATT 

CCTATAAGTCAATATTTGCATAATACTGCTAACTTGTTTTTTTATTAGGTCATTCATT^ 

ACTTTACAAATCCATTTCATACTTGTTATTAGATCCCTCTCTGGGCGCATTGTTGGAGGT 

GTGTGGTGGTTCTTTACCCTGATCATAATCTCCTCCTACACGGCTAACTTAGCTGCCTTC 

CTGACTGTAGAGAGGATGGTGTCTCCCATCGAAAGTGCTGA GGATC TTTCTAAGCAAACA 

GAAATTGCTTATGGAACATTAGACTCTGGCTCCACTAAAGAGTTTTTCAGGAGATCTAAA 

ATTGCAGTGTNTGATAAAATGTG 

Sequence 3452 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTTTTT 

I I I I I I I I I I I I A GG ATACTC AAAAAACATATATTTGC ATGTG CC AGG AAAAGGTAAGGA 

AAGGGAGCCTTCATGTGCCAGCCCCTGTGCTGAGCCCTGGAGCTGTAGAGArGAGlGAGG 

TGGATTTTTTGTCTGTAMGAGTTTGCCGATGGGACCMGTCTGTCATCTCTCTTTCCTG 

GCCTGACCGAGGACAAGCACTTCTTAGTAACCCTGTGAATACCGATGCATCCCGTCAGTC 

AGCAGCTGTTTGCGGAACACCAGCTGCAGGCCAGGCACCGTGCTGCAAGGCGTANATATG 

TTGCTGAACACGACTCCCTTCCAGATGTTCAGGATTTAACAG 

Sequence 3453 
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AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACACACCCTCCAC 

TACTCATCACCAGACAGGGAACCCTGGCTTGGGCCCACAGCACAGACCCTCCATCCTGGG 

CCGATTACACTAAGTGATTGCTAACTCACATGTCTCTGGGATGGAGCACCCAGGAGACAA 

GC AAAGTGTTG GAG CAGCAAGTC AGGTG ATGTGGAGC CCAG AGGGCTGGGACAGCTATCT 

CTCTAGGGTCTACTTGCCCTTGTGAGACACTTTATCCCAGCACTTTAGGAGTGCTGAGGT 

CATACCAGCCACATCTCATGTGCAAGATTGCCCAACAGAGATCAGGTCCAAGAGTTCCTG 

TTTTAAGAMGGGGACTTGTTTGATMGGGAAGTCTGGCCATGTTTGTGTGGAGCANCTG 

TGCTGTGCTGGGGGTTCACTTTTGA 

Sequence 3454 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATG 
AAGTCAAGGG CCTGGTA GAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTG 
ACCAGCAAGCA 1 I I I I IGCAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCAC 
CCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAGACTT 
G GTG ACTTAAACCAC AG ATATTTATCTTGTCAC AATCCTG G AGGCTGG AAGTCTG C AATC 

ACGGTGCCAG(^TGGTCAGGTTCTGGTGAGGGCCTCTTTCCTTCTCACTGTGTGCTCCTT 

TCTTGTGCATGGAGAGAAGAGAGCA 

Sequence 3455 

CCGGGCAGGTACGCGGGGAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTG 
TAGGACCCTATACGGCAACCTCCT7TGCCAGGMCTATTTATAAACATCCTGCAGGAAAA 
TG CAGTGAAGTAGAAGAGAC AGGGATATCCCAGAAGGTTATG C AAAAC ATC AAG AG AAG A 

TGAGAGGAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAAAAAAAAAA 

AAAAAAAAAAAAGTACCT 

Sequence 3456 

ACTACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 

n 1 1 n i n n n iggtagagacagggtttcaccatgttggccagnatggccttnatctc 

CCANCCTCATGATC TGCCC ACCTTGGCTTCCCAAATTGCTAGGATTACAGGCGTGAACCA 
CCGCGCCTGGCCTATT^ 

TATTTTTG CATCAACCC AATAAATTTCAAAATGAC AAAACTG N AN ATCC AGC ACCACCCA 

CACTGNCTTATTATATTTCTACAAMGAGTGTTTTTTCCTGGATTAGACAATGCCCAGTA 

ATNACCAAGACAACTACATTTGGGTCTGGCTTCAGAGACGACGACTGTGGAGCATTTGAG 

ACGTCTGACAAACTTGAGGCCATTGNTGCGCCATGTTTTCCCAAA 

Sequence 3457 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACCACTCANAAGCCTTCACTGTCAACTTCGGTG 

ACACCGAAGAGGCCAAGAAACAGATCAACGATTACGTGGAGAAGGGTACATTGGACATNC 

TGGACTGATTCTGTMTTCTCGTATCTTTmCTCCCATTG 

TGGTTCTGCi \ I I iNGGGGAGMTTTTCTTTAACTTNATCTACCTTGAGGCTCAGTCCTN 

ATCTGGTAAAAATGCTTGATAAAAAGGACCTTACCACCTAACCGTTGCTTGTGAAGGTGT 
CAGTTGA 

Sequence 3458 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTCTCC 

CCTTTCACTTTTCTGTCTCCCTAGAGCTTCTGCTGCTTTCTACTGAATTTTGTTTCT^ 

TTGAAGAAGAGGGTAGTGAAGGAAAGGAGGAAGGCAAAACAGCAAGAGAGAGAAGAGGCT 

GTTTGGMCATTCTGGMCAGGTTTGAAACAGCTCANGCAGGGAGAATGGAACGA^GCC 

TGCAATGAGAAACCAGAATGCTGCTATCATTTTGTTCCTGACATTCACACCTCANGTCTT 

CACTTTGGGGAGCGAMGCCTTTTAAGCAGAAAATACCAGCAAGTACCTCGGCCGNTCTA 

GAACTAGTGGGATCCCCCGGGCCTGGCAGGGAATTCCGATATCAAGGCTTTATCGATACC 

CGTCGACCTTCGAGGGGGGG 

Sequence 3459 

TTAGGGGCAANTGGAGCTCACCGCGGTGGCGGCCGANGTACTGACCCTAGACATAGGCAT 
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GGGCTTTGACTTGGNTTGNGGGNNTTATACTGCGGTNCNGCCAGGGGTTTTGTNTNAGTC 

TNAACCATGACTNTAGTCTTAGTTGTATCTTCCACCTTGATCACTTCCCTTGTCACAGCC 

ATAGCCTCCCCTGTAGGTCAACTGTATGTGTCCTGGTCATTGCTCCATCACCAGCTTTGG 

CCTGGGTCACTGATTCTGTTTTCAGTTCTGCCTCAGCCACTGCCTCGGCCTGGGTTTTAC 

CATTCACTCTAGCAGTGGCAGGACCACTGATTCCATCTTGCAGGCCAGCCTCAGCCTTTC 

CTCTAGCCCTTGCTTCTGTCCCAGAGTCAACCATGTTCCAAGTTTAAACGTAAGGACAAG 

TTATATACC 

Sequence 3460 

AGGTACCCTGGACTCATACCTGAAAGCAGTGTTCAACCTTAGCAAAATCTCCAACCAGCG 

CATGAACAATTTTCTACATCACAACGACCTGGTTTTCAAATTCAGCTCTCAAGGCCAAAT 

CTTTTCTAAATTTAACCAAGAACTTCATCAGT7TCACAGAGAAAGNAATCCAGGACCGGA 

AGGAGTCTCTTAAGGATMGCTAAMCAAGATACTACTCAGAAAAGACGCCAGGATTTTC 

TGGACATACTTTTGAGTGCCAAAAGTGAAAACACCAAAGACTTCTCTGAAGCAGATCTCC 

AAGCTTGAAGTGAAAACGTTCATGTTTGCAGGACATGGACACCACAACTACTGCTATCTC 

CTGGGATCTTTTTACTGCTTGGCAAAGTACCTGCCC 

Sequence 3461 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGCGGGTAAGCTGTGTG l I I I I GC 
ATTTAAGCTTTMTMTGGAAAGCGCCCAAAACC 

AAGACTTAGTCTCACCCCTCTTCTGACTTTTTCAAAACAGTTTTCTAATTCCAGAGAAGA 

GCCTGAGCTCTGAGAATCATTCTNCTTATTTTAATTTTCCATMNCTTTNGTCCCCGGAA 

AATGCANTCTGGGGGGAAAAGCATTTTTAAAATAGTAAATCATCTGAGTCTAGTCCAGAA 

TTTTGGAATCCAAGGGGCAAGGCGGATACTTTGTCATAAACAGGANGCAATAATACCGTA 

GATAATTGCAGTCAGTGGAGGAAAAGTNTGGCGCTTGCTGTGGCTCTTGACTGCTTCCTG 

GGATTAAGTTACTAAGACCCAAAATGGTTGCAAACTGGTCCCATTGGCTTTTTGGC 

Sequence 3462 

AGGTACAGACAGGCTTCTCTGCTATCCTCCAGGCAGTGTAATAGTCAAGGAAAAGGGCAA 

CAGTATTGGATCATTCCTTAGACACTAATCAGCTGGGGAAAGAGTTCATTGGCAAAAGTG 

TCCTCCCAAGAATGGTTTACACCAAGCAGAGAGGACATGTCACTGAATGGGGAAAAGGGN 

ACCCCNGNTTATTCNCCAGGTNNCNTTGTTAAGNCATTTCNANTTATNGCAAGGGAAAGA 

ATGGGGCTTTTTNGGNGNCAGNTGGGCCTTGGAATNNAAAAAGGCCMGANTTTTTGAAG 

NATNACCCNCAGGNCTTCCCGTNGAAAACNAAAGNGGTTCNATTNCTTTANTTAACCAAG 

GGTTTTCCTTTCCCTTTTCCACCTTGGAGAAACCAAAATGGTAAAATTTTTCANNGGGGG 

GTNTGGAATTNCCAATTTTTCTTTCNTGNAANGGGGGGGNCTTTGGAANTAMGGGGTTT 

GGCCTTTTTCCCTTCCGGGAATN I I \ I 1 1 NCAACCTTANCCCCCAATCCAATTTTTAAGC 

CCCTTNTGGGGGCCTTNCCTTTCNTNGGTTCCTTTCCAAGGAMGGGGGGGTTAAATTCC 

TTCCTTAAAACGCAACCTTrMGGGGGGGGGCCCNTTTGGGGGGTTAMTTAMTTMTT 

TGGTGTGGGGGGTTC 

Sequence 3463 

GGTACGCACAAGCAGGGCCCAGCTTGGGAGACGCTAGGATAACTAAGACCTGGTCAGCGC 

CCTGAGGAGTCTTGTCTGGATAAAGGGAGACACACACTAGCTTGGACTTATGCGTGCAGC 

GGAGCTGTGTAGAGCAGGAGTGCGTGGAGAGGGCAGCAATGAACTTTNGCTTGAGAAGCN 

AGANAAAAGCTTTCGTGAGCTGGAACACGGGGGTAAAGTANGAGCTTTCCAGGGCGGAGG 

GGACTTCTGAAGTGGAGGAAACTGCCTGTTCAGGACATGGAGGTAGGGATCAAAGTTNTG 

AGTCATGTTGGGTGGGCACAGAGCAGAGGATGGCTGGGGGCGGTCTCAGGGATAAGGAAG 

AAGTTTTGGGG ACCAN ACTGTTTTGTCAC NCGTATCTG CAT 

Sequence 3464 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTATAGCACAAGCTGA 

TGATAGCTGGCAACTGAACATTTGGCTTCAATGTCAGAGTAATAGAGTGTGGCAAACTGC 

AGAGCTCATGCTCCGCCTAAAAGGGGCAGCTGCTGCTCAGCTCCTGCCAATTACCACCAT 



Page 529 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 



GCAGGA ATTTGG CCTANTGTGCTCAGCTCTTTCTGGTTTNTACCCAGAAGCCTAGGGAAT 

CTGGGGI imi lATGTGAAATGGCCCTAATTTTTAAATGTTAGCATCCACTCTGAAAAAA 

AAACA AMC ACCGTGTGGGGCCAMTAAMCATTTGCTGGCCAAATGCAGCCTGATTTGC 

TCTGGTTTTCMCCCCTGGCACAAGAATTTTGGTTCATTTCTCTCTTGNTGGNACCTTNG 

GCCGCTCTAGAACTAGTGGGA 

Sequence 3465 

AATTGGAGCTCCCCGC GGTGG CGGCCGAGGTAC i rTTTTTTTTTTTTTI I CCCTTGTGAC 

TGGAGAGAAGTGTCACATTTTGCTCACTCTCAACCTTCCTCGCCCACCCCCTTCCCGGAG 

AACCTGTGCGGTGTGTAGAGGGTGCTGTGAGCCACCTCCAGCCTCGGGTGGCTGCTTAAG 

TAACTTTCAACTNCTNTNTTNTTMCACTTTTAAMNTNTTTTCGGGAAGCCGCAGGGGG 

CNAAGGCAGCTGC7TTTCCTGGCCTCTTTGGCTGGGCTTC 

CCCGGGG GGCA CCAGCCCACCTTTTATAACAAACCCGCTTNTGCCTGGTGGATAAAAGTG 

GATMGNATTTTATTTCCCAAAAANCAAGAMGAAGGGAAAGCCATTATTTGCCACTGGA 

AAGAAAAAATATTCCTTATCAACNTTAAAGCCTTGGGATAACCAGTAAAAAAACCAGGGA 

CCNCAAGNMNGATGAAMGGAGCCCAGNACATTGGGAAAAMTTTTCTTGGGGATTCCC 

TTCAAAAT 

Sequence 3466 

GAATGGACTCCACCGCGGTGGCGGCCGGGGGATGCGAGGCTCGGAGCACCCTTGCCCGGC 

TGTGATTGCTGCCAGGCACTGTTCATCTCAGCTTTTCTGTCCCTTTGCTCCCGGCAAGCG 

CTTCTTGCTGAAAGTTCATATCTGGAGCCTGATGTCTTAACGAATAAAGGTCCCATGCTC 

CACCCGAGGGACMGTTCTTTCGGTGCCCCNGCANATTTNTTTTGNNCAAAAAGNNCCAA 

AAGGGTCCCCTTTNGGGCCCCGGCNTCTTAAGAAACCTTMGTGGGGAATTCCCCCCCCG 

GGGGGCTTGCNMNGGAAAANTTTCTGNATATATTCCANAGGCCTTTAATTCCGGAATNA 

ACCCCG GTTCNNNA ACCCTTCTCGAAGGGGGGGGGGGGGGCCCCCCGGGGTTANCCCCNC 

AAGG GCt fTTTTI IGGGTATTCCCCCTTTTNTTAAAGNTTGGGANGGGGGGTTTNTAAAA 

TTTTGGNCNGGCCCNNCCTTTTNGGGGCCCNGTNAAAAATTCCAAATTGGGGNNTNCAAA 

TTAANGGCCATTGGNTTTTTTCCCCTTGGGNG 

TCCCCCGACNTTCCACCCANMNTTTCCTCCANCCAACCAAANCNATTTACCCGNAAGNC 
CCCCGGGGGGNAGGGCCAAATTAAAAAAAGGTTGGGTNAAAAAAGGNCCNCCTTGGG 
Sequence 3467 

ACACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATGGAGGCCGAC 

AATTTGGTGACCAGTGATGGCAGGCCACTCAGCTTTGAGTAGCCATGTCCGCAACAGGCC 

CTGCGGCACATCTCAGCTCCCTGGGTGCAGAATTCTGACATCATGGCCTTCATGCCCGTG 

CTCAGTGCGTGGAGCTTGTGAANAACATGGAGGGGGGTTGGGCGGTGTTAGGGGGCCTCC 

ACCATAGGGACCAACCCTGTGCACCACTTACTGAGCATCTACTCATGCCCAGCTCAGCTC 

TGAGGTCCCGCGTACCGGCCCG 

Sequence 3468 

TTTTATNGGGGGACGANTTGGGGGAAGGCTCCNTTCGCGGGTGGGGCGGGCNCNCCCGGG 
GCAGGGGTTACCTTTTNAAAACTCANGGGCCATCTCCCAACTTTTTCAAAAATGGAACTT 
TTTANAAGGTTTGGAGGCTTAGGCCCTTMTTTTTAAAAAAAACTA 
AAAAATCCAGGTGGGGTGGACCATGGAAGGGAACTTAATTTMTTTTAAAAAGGTGGCTT 
TAAAAGCCCCTTGGGGTTTnTAAGGAMGTTTTCTAAAAGGAMGMCCCTAAAGTGGG 
G AAAATTTTG GCTTAN TTAATTG G AAAAAAAN G C C ATG G AAAAAAG G AATTTAAAN G AAA 
AANCCCTGGGGCCTAAAAAAGGGGTTCCAAGGGGTGGCCCCCAAGCCTTTCCGAAAGGGN 
AAAAAAAAGTTTNGGGGGCCCTTTNTTTAMGGGGGGAACCCGGGGGGGAAANGCCAT^ 
TAAAAAAGGGGCCACCCAAGNAAAAACCTTTTTTAACCA Ai i l 1111 1 NGGGGGGGGGGG 
GTTCNAA) I I 1 1 1 1 I I GGGTTTTGGNAAGGTTAAGGGGGTTGGAAAAAGGNTTGGAAAGG 
GGGGGGMTTTTTNGGGGGGGGCCTTTTNGGGAAATTAA A T 1 M T 1 rTGGGGGGCCCTTG 
GGGCCCCCCCCCCAATTTCCTTTCNTTAAATTTGGGTTGGGGGGGGGAAAACCCCTTCCA 
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AAAAAAAGGGGTTTTTAAAAAAAAAAAAAAACCCCCCCCCAACCCTTGGGTTAAAGGGGG 
GTTAAAAAAAAATTTTTTAAAGGG NAAAAAGGGGGGGGGCCC AAGGGCCCTTTCC I I I 11 
GGGGAAAAAAATTNCCTTNGGGGGGGGCCCCCC 
Sequence 3469 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCAT 

GGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCT 

CTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAA 

GACTCTCTTGATCTGCAAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAACGGA 

TTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCC 

TGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGGAGTCCGGAGGGAGCAGTC 

TCTTCTGTCCCGCTGGAACCCATGGTCCCTGAAGTTCCTTATGCTACTTTCACTGAGCAT 

CCTATGAAATACACCAGTGAGAAA 

Sequence 3470 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTTTTTI 

Mill 1GGTCTCTGAGCATTTGCAATTTGAACTGTGG7TCCCTGGCCACAGTTGTCTTCA 

GGAGTTGAAATGGCCANAAAGGGAAATATCTACATGGTTTATCCTTGAGATGGCAAAGAC 

ATCTGCACCAATGCACCACCCCACCGGGAGCATGCATTGCTAGCAGGTGAAATTCACATG 

GAAMGTAGGATTGTGGGTTAGGGTAAGAGATACAGCAATGGTTACAAGTGCAGAATTTG 

GAGGCAAGTAGTCCTTGATTAGAATCCCAATACTAGCATTTGCTAGCTGCATAATCTTGG 

GCATGTCACTCAANCTTCTCTCTGAACCACCCTCTCTTTACCT 

Sequence 3471 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCATCAGCGTTGCACC 

CATCTCCATAGTTGTAGTTTCCACACACTTATACAACTCAAATGTTCACTTTACTGTGCA 

GTCC ATTCCCTCTGTTTTGTGA I I I I I AGG AAATCTTCATAATTTACGGTCTACAAAACT 

ATTATTTTGTTTTCAAGGMGTCGTGTTATATATTTCTGACTGTCTm 

AGMTTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCTGGATTAAAACTGAAGATCAACCT 

ACTTTCAACTTACTAAGAAAGGGGATCATGGACATTGAAGCATATCTTGAAAGAATTGGC 

TATAAGAAGTCTAGGAACAAATTGGACTTGGAAACATTAACTGACATTCTTCAACACCAG 

ATCCGAGCTGTTCCCTTTGAGAACCTTAAC 

Sequence 3472 

AGGTACCCCATCAGCGTTGCACCCATCTCCATAGTTGTAGTTTCCACACACTTATACAAC 
TCAMTGTTCACTTTACTGTGCAGTCCATTCCCTC 

CATMTTTACGGTCTACAAMCTATTATTTTGTTTTCAAGGAAGTCGTGTTATATATT^ 

TGACTGTCTTTTCTCTTATTTCTAGMTTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCT 

GGATTAAAACTGAAGATCMCCTACTTTTCAACTTACTAAGAAAGGGGATCATGGACATT 

GMGCATATCTTGAAAGMTTGGCTATAAGAAGTCTAGGMCAAATTGGACTTGGAAACA 

TTMGTGACATTCTTCAACACCAGATCCGAGCTGTTCCCTTTGAGAAC 

Sequence 3473 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACCATTGCATGCAGGCGTGAGTGATCACAGCAC 

GTCCCAGGAACTTCAGTCATGGTTTTATCTGTATTCCAC7TACCATGGCTGTGCAAATAA 

CTACTGCCAAAACACACTAGTGTAAAACAACCATCCAGTTTCTTCATGGATATTATAGGT 

CAAGAATTTGGACTGGGTGCAGGGGAGATAGTTTGTCCCTGCTCCATGACCTCTGGGCCT 

TGGCAGGAGAGCGTGAAGGCTGGGGCAAATCACCTGAAGGCTCACTCCCTTGTGTGCCTG 

GCAGTTGATGCTGGCTGTCAGCTGAAGGCCTCGGTTCCCTCACATGAGCCTCTCTGTGCG 

GTGTCTCCAAAGGCTAGCTCGTGCTTCCCCACTGCATGGTGACTGGGTGCAAGGACAGGC 

ATTCAAGGGAAAAAGAGCCAGAGGA 

Sequence 3474 

ATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTAGTGG 
CAACCAGCCGGGAAAATCCATAGGATCATTTCAGACCGCATTCAACACAGATACTCATGT 
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AGCCCCGACGTTTCTTGTGAAGGGMTTGATGTATTGCTTGAGGGGGTCGATCTCTGGTT 

CTAGGGTGTTTTGTTTGTTTCCATCGTGACTACCAGAAAATTATAGAAGGCCAGACGCAG 

TGGCTCATGCCTGTMTCCCAGCCAGTTTTTCTAMGTTTTAATCTGTCTCCTCCACTAG 

GCCATGGGCCCTAGGAGGAGATTGTCTCTAGAGGGTATCCCCTGTGCCTGGCACATGCAT 

GCMGTTATGATTTATGTTGATGAATGAATGAATGAATGAATGAATGAATGAATGGTGGA 

TGGGTCCTCTGAGGCTCTAGAAATA 

Sequence 3475 

CACTACTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTm 

minium I ACAGCAATAAAMCGTTTTMTACAGGAAATTATTTTGCTGTACAATAT 

AGGCAGACAGTTTG CCTTC AGAMTCCAGAMTGCAGCTTTTGAGGGAGGTCAGCATCAT 

TGGTCTCAGCTACCATTTTCCTGCAGGATGTTTATAAATAGTTCCTGGCAAAGGAGGTTG 

CCGTATAGGGTCCTACAAGTAACCTAGGTGGCAGCATNTCCTCCCAGCGTCAGGTCTAAG 

CACACAAGTTCTTCCTTCCATCTACCCCCGCGTACCT 

Sequence 3476 

CCGGGCAGGTACGCCGGGGGAGGAAGTGCAGACTGACATGGGAGAGNATGGCAGGCAAAC 

TTAGCCTGTAAGGGTGGAGGTGCTGGAGAGACACTTCCTGCAGGAAGTGGCCTGTAGGCT 

GATACTTGAAGAATGGTTAGGAGTTAGGCAATTGGGGTGGGAGGGAGGAAATGAAGCTGA 

GACAAAGAATAGCAA GGGC CAAAACCTAGAAACAAGAGACTGTAAGCTATAGCAAAGGAA 

GTGGGGTAATGCATGTTTTAGGTCCTGTGTGGGGTGTTTGTGAATACACATGCCGGGGTG 

TGGAGCGTGATGAGAGACAAAGCTGAGAAGTGTGGTGGAAAAGGAAGAGAGAGCTGAGGC 

AGACGATGTAAGATCTGCTGTTACTTCTTGCTGTGTGACCTTGGGTGAAGTGAAGTGGGG 

TGTGCCTGGGA 

Sequence 3477 

GGGCGANTTGGAGCTCTTCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGCNNCTTTCCTG 

AGCCANNCACTTCATTATTTTCTTGTCCTGTGAGTCCTAATAGATATATTTAGAAACACT 

TCTATGAAATATGGMTCCAGGCTTMCCAGCTGTTTGAAATCACAACCCACAACCACTT 

TGCTGGGTGTGT GCTCACACACAC ACACACACTGACACATGCTTGCTATACATACATACA 

CACAGTTTCCATC I I I I I 1 1 I I I M I i I I 1 I I GAGACAGAGTCTTGCTCTTTCGCCCAGG 

CTGGAGTGCACTTGGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCT 

CAGC CTCCCAAGTAGCTGGGACTACAGGCACGTGCCACTACACCCGGCTAATTTTTGTAT 

TTTTAGTAGAGACTGAGTTTTGCCGTGTTGGCCAGGCTGATCT 

Sequence 3478 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGTATGACCTCAGCATTCCTAAAG 

TGCTGGGATAAAGTGTCTCACAAGGGCAAGTGGACCCTAGAGAGACAGCCATCCCTGACC 

TCTGGGCTCCACATCACCTGACTTGCTGCTCCACCACTTTGCTTGTTTCCTGGGTGCTCC 

ATCCCAGAGACAAGTGAGTTAGCAATCACTTAGTGTAATCAGCCCAGGATGGAGGGTCTG 

TGCTGTGGGCCCAAGCCAGGGTTCCCTGTCTGGTGATGAGTAGTGGAGGGTGTGTGGTAC 
CT 

Sequence 3479 

CCGGGCAGGTACGCGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGC 

AGGCGGAGCTTGAGGAMCCGCAGATAAGTTTTTTTCTCTTTGAAAGATANTAGATTAAT 

AC MCTACT TAAAAAATATAGTCMGTAGTGTTTACTMGMTATTGCTTTANCCGTTTA 

AGTTTTTTTAAACGTTAAATTTTTAMTN A NAAA 

N ATTATGG AAGGAACTTTAGTAN N NAANTTAN N C ATTGAAN G N AAATGGG NAAAAAAGAA 

TTAAAAAANGGGTTTTCTTAAAAMCCATTGTACCGNGAAGGGTTTGTAGAATTGAAM 

CCTT TCmCATTGGGGAGGTTCATATNAAANNTGMTATTACNACATACACATTATAGG 

TTTTCCCTTNNGGNCNCGCCTTTCTTMGAAMCCTANGTTTNNGGAATTCCCCCCCC 
Sequence 3480 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTGTTTAACCATCATGAGGTG 
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ATCCCATCACCTTCACAGCCCCAGCCTCTGCTCCAGTCCCTCCCCAGCGAAAAGGGCCGC 

CCATGCCATCCTGCTGCTGGTGATTTGCTTTGTGGTCATGGACTCAGTGGACATCATTAT 

TTTATTAACCGTGTGGTAGGTTmMCTCAGTTATCCTGGATATCCAMGGTTTGTGGT 

CCATCTTTAGGCTTCCGTTTGTTCTTTGGTACCTGCCCG 

Sequence 3481 

ACTGCTGTGGGCTGAAGGGAGGAAGATGGTGTCTGCAAACCAGGAAGCAGCCTTGCCAGA 

CACAGGATTGGCCACAACCTTGACCCCAGACTTCCAGCCTCCAGAACTGAAGGACTGAGT 

TCATTTTCAGCGTGTCTGGCTTTTCCATTTCTGTGGCCTGGGAAGGMTTTCCCTGCACA 

GGCTCTCTTGCCTGCCACCATGTAAGACCTGCCTCTGCTCCTCCTTCACCTTCCGCCATG 

ATTGTGAGGCCACCCCAGCCATGTGGAACTCTCCTACTGCACTGAGAAGCACGTGTTCTC 

CATTTCCCTGGGGGAGACCATTGTATTGGGCAGTTTGGAACAAAACACCATGG 

Sequence 3482 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTCTCTGTTCCCCTCCCTCCACCCTTCCCCT 

CCTTTTCTG I I I I I AAGATMGGTGTTGATTTGGAATTCAGCAACTGCAAGTCCATTTGA 

AAGAAGTCAGTTTTGTTAGAGGCCTGGATAGAGCAATATGTTGTAACCAAATCACACGTG 

AACATTCTGTGGCTGAACATMGAMTTGTGCATTATTAACGTTGTGCTTCAAGGTGGAA 

TGCTGAAAGCACTTGGAAAGTAGTTGCCAGAGAAGAGCTCACTGAAAACATTATTAGCTT 

TATATATCCACTTGTAGAATCTCTGTGCAGTGATACATCCCTCCCTTTCATCTTGATCAT 

AAATATAAGACTTAAAGGAGGCAGAGTGAACGTGCTTAGGAAGCCCACTGAGAGCTCTCA 

GAAAGGCGCTTAATACCTTTCTTAGTAAGTTGAAAGTAGGTTGATCTTCAGTTTTMTC^ 

AGMGACAGATTGCTGCTTCTTCCTGGCTTGAATTCAGGTTATTTCAGCCGGCAACCAGG 

AACTAGACTGAATGCAATCCTCCCACATCCAAGGTCCCCGCGTACC 

Sequence 3483 

CCGGGCAGGTACGCGGGGGGGGGAAAANCCAAAAACAACAACAAAAAAACTCTCTTACAG 
MTTTTCCTTAACATTAAAAAAMCTTGTCGTATTTTTCAAAG^ 

AATTG CTAAAAGTATATTTAAAG AC ATC TAGCCTTCC ATATGTAAAAAATATT7TAG ATA 
AAG ACAGG N AC AGG ACTC AG AATAATATATTGG GTG ATTCACTGTATCCTCC ACTTTTAC 
ATTTAAAGAAATTCAAAATACGTGATTAAA I I I I I I I I I M I I AATGGATGGGTTGGTTC 
TCGCTTAACCACTTGGGTGCTAGTGGTCTTGAACTGTGATACAGTGTAGACAGNATTTGT 
AACCTTGCACAGCCTCTGGGGGTAACTGTCAAACTGTCAGATTTAACTT 
Sequence 3484 

GATCCCANCCTTTCTAGGGCTGGGTAGTGGGAACCTCCCAAAATTGGAGTTCGGGTTAGG 

AAATCCTCACATGCAGTTTGGGAGAGGCTCAGGAGGAGGAACCTGAGCCATGTCCTCTGT 

TCTGCAGTGAAAAGAAGATTAAATATGACCAGTGCTTGATCCCAAACGCCCTGCTGGAGC 

TGGCCCTGCTTGCTTATGGAGCMGACAGAAACCAAGAGGCCATCAMCTTTTGGAATCT 

GCCAAGTAAGGCTTGGGTAACCCTCAGTCTGTCCTTTTGCTCCCTCTGAACTCAGCTCAA 

GGAGCTGGCTGGGGACAGAGTTGCCCAGAGAGCANTGAGTTGGAGTTGCTAGCAAAGGCC 

AGTCTTGGCAGTCTAGAGGGTGGACCTTGTGTCCCTCGAATCCAGCTCTCCCCTGTCATA 

GGAAGACTTNTTTCTTAATGTATAAATGCCCCAGTTTNGACTCCCANAAGNCAGGAAGTA 

AGT7CCCAAAGGTTCACATCANAAACCAACTGCNTGGCTGNGGTGAAGACCATTGCCTCC 

TNGCTTTCCAGGTAGGGGGNCTATTNCCCTAAAACCAC 

Sequence 3485 

CCGCGGTGGCGGCCGAGGTACAGGGCCCTGAGATGGAAACATGCCTGATGGGTTCAAGAA 

TTAGGTCATTGGCCAGTGTGGTTGAAAGGAATGATGGGTGGGGAACGTAGTAGGAAATGC 

GGCTACAAAGGAAAAGAGCCAGAAACTAAAGGCTCCTGAAGGCAATTGCTAGGATTAAGA 

CTGGAGAGCCATTGGAGGCTTCTACTCAGAAGATTGTCATGGTGTGGCTTAGGTTTTTAA 

MGAATTTCTCAGCATGTTAGTTGGCTTGGGCTTCCATAACAAAATGCTACATACTGGGT 

GGCTTGAACAACAGACATTTATTTTCTCATAGTTCTGGAAGCTGGAAGTCCAAGATGAGT 

ATGCCAATATGGTCAGAATCTGGTGAGGGCTTTCTTCCTGGCTTGCAGACAGCCACTTTC 
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TCCCTGTGTCCTCATAGGGTGGAGAGAAAACAACATCTTTGCTGTGTCTTCTCATAAGGA 

TCTTATCAGATCAGGGCCCTCCCCTTGTGACCTCATTAAATCTGAGTTACTTCTATAAAG 

TCCCTATCTCACATTAGGGGTTAGGGTTTTAGCATATGAATTTAGTAGGGAGACAATCCA 

NTCCATTGCATTGTANCTGATTTGGAAATAAAGATT 

Sequence 3486 

ACAMCACAAAAMGTTTTAmAAAAAMGAGTTTGACATTTAAM 

TTAAAGTGACACCCTGTTTCCTTGGAMCACAATGCAGAACCACAAACTCTGAGACTTAT 

TATAGCGAAAGTATTGCCATTCAGTTCAATCTAATGGNATGAGGTTAGCTTTGGAGGCCA 

AAAGAMCTAGTTTCTAATTCTTATTCTTCTCCCCACTAGACTTGTGGCCTAGGTAATTT 

TGTMTCTTTCTGAGTCTGTTTTCTCATCAGGAGCAGGATAATTCCTAAGTAATAGATCA 

TATGTAGAAGTGAATATGATCTCATGGCAATGGACATCAACCATAGTTAATTATTAAGAA 

TATATTTAC ATTG AGCTATCCTTTATCTACTTTAAAATTGCAG AC N AAC AACAATTG ACA 

AAGMTAG ATAAAATGTTCTAAC ATAAACATTCTTCCATTCTTTCTAGG AAGAAG N C AC A 

CATACNGTAAAAAATAATTAGAGAGGACCCAGTTCATATCGAACAACCTTCCCC 
Sequence 3487 

AGGTACTGTGTTGACCACTTTTTGAGGCTACTTAGGACCTGAGCAGAACTGAAATGCATT 
TGGCTCCATGGGAGAACCACAAGCAGCTCCAAATTAAACTCCT1TAGAGACACAACACAA 
GTATAA ACAA GCCTGAGCACTTCTTGGACCAAAAGTCACATTACAAGCTCTCAGAGGCAC 
ATGCMTTTTATTTGCTTCTTACAATTTAAACTCTGAAAGTTTAM 

TTCTATTGTAGCTCATTTTTTTTAACATGCTACTAAAATATAMGAGACTTTATTGTGCC 
TAAGAAACTTGCCTGTC ACGATCTTTATC CACCAG ATG ATAC CTTGCCCTAAATGCTACA 
AGCCGGGCATAATATGCAGGGGCTGGAATAGAGACGGAGCGAGTGCACCTCACATAGGTG 
TGACACAGCTGGTAAGTCAGTAGCTGGAGTTCATCTGCAGTGAAGCAGTTGTCATCCCAC 
AAGACCTG G TAATGTG AGGGACGGCTGGTTC CTG AGG AG AG AGTTTTATGATT AATGTTT 

rTTrnrn igttcgttttttggttttgg I i I 1 M I I 1 1 1 1 m 
Sequence 3488 

I i m n m j i i i h m I GGAACAAACACAAAAAAGTTTTATTTAAAAAAAGAGTTTGACA 
TTT AAAAGTTTG AAATAACTATT AAAGTG AC NCC CTGTTTCCTTG G AAAC AC AATGC N N A 
AC C AC AAACTCTGAGACTTATTATAG CG AAAGTATTG CC ATTC AGTTCAATCTAATGGTA 
TGAGGTTAGCTTTGGAGGCCAAAAGAAACTAGTTNCTAATTCTTATTCTTCTCCCCACTA 
NACTTGGGGCCTANGTAATTTNGTAATCTTTCTGAGTCTGTNNNCTCATCAGGAGCAGGA 
TAATTCCTAAGTAATANATCATATGTAGAAGTGAATATGATCTCATGGCAATGGACATCA 
ACCATAGTTAATTATTMGMTATATTTACATTGAGCTATCCTTTATCTACTTTAAMTT 
GCAGACAMCMCAATTGACAAAGAATAGATAAMTGNTCTMCATAAACATTCTTCCAT 

TCTTTCTAGGAAGAAGTCACACATACAGTAAAAATAATTAGAGAGGACCCAGTTCATATC 

GAACAACCTTCCCCAAACCC 

Sequence 3489 

CCGGGCAGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGT 

AGAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTGACCAGCAAGCATTTTT 

TGCAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCACCCAGTGGTGAAAAGCA 

TCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAGACTTGGTGACTTAAACCACA 

GATATTTATCTTCTCACAATCCTGGAGGCCGGAAGTCTGCAATCACGGTGCCAGCATGGT 

CAGGTTCTGGTGAGGGCCTCTTTCCTTCTCACTGTGTGCTCTTTCTTGTGCATGGAGAGA 

GAGAGCATGAACAAGCCCTCTACTGTCCCTCTTAGAAGGGCACTAATCCCATAATAAGGC 

ATCCACCCTCCTGACCTCGGCTAACCCTAGTTAAGTACCT 

Sequence 3490 

CCGCGGTGGCGGCCGAGGTACCCTTCCACCATAGCTGTAATCTTCAAAAACAGGAACACA 
GTGACAGAGACTGTTAAAGGAACTTCTTCACTGAAGGTAAGAAGAATAACACAAATTTGA 
AAAGGG CCTCTATCAG ATATAGG GAAACTTCTAATG ACCTG G ATTG AATACC AGAC ACAG 
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AAATGTATCCCTGTCAGCACGCTGAGTATCATGGCCAAAGTATAAAGTTTGTGATGTTGA 

AAAAAAAGGCTGGATCTGACTATGATGTCAGATGTTAAATTTACTGCTAGCTCTGGGTGG 

TTTAMCAATTCAGAACCATTATTCATTACATAATGTGAAAGTGAGTGCTGAGTCTGTGA 

GTGCTAATGTTAAGGCAGCTGGAGAATTTAGAAACCCTAGATAAGCTAATTGTGGAGGAA 

ATTTACTTGCC AAAG CAAATCTTCAATGTGTATGAACG CTCCCTATTCTGGAAATGGATG 

CTGAAAAAGATATTCATCCATAAGGAGGCCAAGTCAATGTCAGGTCTCMGCCTTTTAAG 

AACAGGATAAAAGTCTTGCTTGGGGGCAATATTTCAGGCTACAMTTCAMCCCTTTGTG 

ATCTGGCACTGGACAAACCCTGGGGCCTTCAAGCA 

Sequence 3491 

CCGGGCAGGTACACMGAAAGTTCAGAGATGAGACCTCTGGTTGTATTCCACCTTTGGGA 

CATGGGGGATGTCTTTAGTTCAAAGTCACAAATAAATGCAGGTTCTACAATTCAGAGGCT 

TCATATCCCTGCTGGAGTATTACATGTTTATTCAGGATGGACCACTTTTCTTAGCAACAG 

TTTCTAAACCTTTGCCAGGTCTGGGAAGTCTGGCAGGAGAGATTTCTAAGAACCAATCAT 

TCCTGCACACACTTCTTGAAGATAATATACATTATTCCCTAGTTATCTCTTCCTAGGTTT 

TTGTAGGCTCATTTCMTATTACAACAATCTTTATTGGAAAACCCCAAGTATmGTCTT 

GAAAAATCAGCAATCCAGGTATTAAAAATAGCATGGAATGCCCAATTTTACTTTGATAAT 

TACATGGTAGTCAGTTTCCGCTCCTGGCTAAAGCCTTGGATTTTCTTCTGGGCAGTTCCT 

AAAGGCACAGGTGGCATGGAAGAAATCATTCCATTTTCATCTCCTCCCCTTATTTGATTG 

GTGTCTGGTTACCAAAAGAGTCATCAGGCCCTTGGATTACCTTCTTGAACTGATCTTCTA 

GATCGCAAGTCTAGGTT 

Sequence 3492 

AGGTACGCGGGGGGCCTGATTATATGTGGCAACTGCATGTAGAGAAATCTAATTGCCATA 

CCTTCCTTTTTCTMTTTCTTTTMCAAAACTGCTTTTTCTCTCCCTTTGGC^ 

GTAACATTGCTTCCTTCCATCATAGCCCTGAATTATCAGATCTCTCTACCCAGCAATGGT 

TCTATTAACAACCTCMTCTTAAAACACAGCCAAATAATTAACTCAAATTTATTTTTT^ 

TTCTCCCCTACCCCTTCCTTTTACAGCCAMGAACCTGAAATTGGAAACTTAAATGTTTC 

TGACATAACTCCCGAGAGCTTCAATCTCTCCTGGATGGCTACCGATGGGATCTTCGAGAC 

CTTTACCATTGAAATTATTGATTCCAATAGGTTGCTGGAGACTGTGGAATATAATATCTC 

TGGTGCTGAACGAACTGCCCATATCTCAGGGCTACCCCCTAGTACCTGCCCG 

Sequence 3493 

CCGGGCAGGTACATAC3AATTATTTCTTCAAGTATAATTCAAAATAATATGGACATTATCA 

TGTTCTGCATTACAATAATGGGATGTCATCACCATTGCTAGAATACTGGCATGATTCTTC 

TGAGCAGAAGTTGAAACTGTAMTTTAAACCTTTTAATTATCACCTTACCTGAAAGAGGT 

TAGTTMGATATTCACACAGTATGTATTATATTAACCATATCACACTTAAGTTATTAAAT 

TCAGACTATTTGTMCTTATTGTTAtAGGGCCTGCCGTATGGCTTAGGATATTTGAGTAA 

TCATATATTTAAAGTAAAA^.CTTTGGGCTGGGCACAGTGGCTCACACCTGTAATCCCAGC 

ACTTGGGGAAGCTGAGGTGGGCAGATCAGTTGAGGTCAGGAGTTCTAGACCAGCCTGGTC 

AACATGGCGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGCACAC 

ACCTGTAATCCCAGTTACriTGGGAGGCTGAGGCACAAGAATCGCTTGAACCCGGGAGGCG 

GAGGTTGCAGTTAGCCMGATCGCCCTGCTGCACTCCAGCCTGGGCAACAGAGGGAGACT 

CTGTCTCCAAAAAC 

Sequence 3494 

AGCGGCCGCCCGGGCAGGTACAGGATGGGAAGGCTTTGCTATGGATCCCAGCCTTTCTAG 

GGCTGGGTAGTGGG/ACCTCCCAAAATTGGAGTTCGGGTTAGGAAATCCTCACATGCAGT 

TTGGGAGAGGCTCAGGAGGAGGAACCTGAGCCATGTCCTCTGTTCTGCAGTGAAAAGAAG 

ATTAAATATGACCACTACTTGATCCCAAACGCCCTGCTGGAGCTGGCCCTGCTGCTTATG 

GAGCAAGACAGAAACGAAGAGGCCATCAAAC77TTGGAATCTGCCAAGTAAGGCTTGGGT 

AACCTCAGTCTGTCCTTTGCTCCCTCTGAACTCAGCTCANGAGCTGGCTGGGGACAGAGT 

TGCCCAGAGAGCAGTGAGTGGAGTTGCTAGCAAGGCCAGTCTGGCAGTCTAGAGGGTGGA 
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CCGCGGTGGCGGCCGCCCGGGCAGGTAC I rTTTTTTTTTTTTTTTn I A CTATTAGAATA 

TTTGCTCTATGCGT7TCGACATTTCGTTCTGCTCACTGTTACACGTCCTAACACCTAGAA 

GAATGCCTACCACATGGTAGGCATTCAGTAAATATTTCTGAATAAATGAATAAGCAAAAG 

AACTTTGGAGCCACAGACAAACCCTCAAAAACAGGAAACTTGGAAAATGCAAAAACACAT 

AAACCCCGACTACAAGTGCATGGTCTTTGGCTGACCATAACCTACAACTGACTGTAATGA 

GGAAAGC ACGCCC 1 ITTTTTTTTTTI I GAGACGGAGTTTCGCTTTTGCTATCCAGGCTGG 

AGTGCAATGGTGCAATCTCGGCTCGCTGAAACCTCTGTCTCCTGGGTAAAAGCAATTCTC 

CTGCCTCAGCCTCCTGAATAGCTAGGACCACAGGCATGAGCCACCACACCCGGCTA 

Sequence 3500 

CCGGGCAGGTACATGAATGTGCAGAGTGTGACTCAGTTTCCAAGCTCACTCATGTGGCTC 
TTGGCAGAAGACATAAGTTCTTTGTGACATGGGCCTCTCTGTAGGACAGCTTACAGTGTG 
ACACCTGGCTTTTTTCAGAGTAACTGAGCAAAAAGCCAGAGAGGCAGAGCCCCCAAGATT 
GAAGCCACGGAC I I I I 1 I I G AGACAG AATTTTG CTCTTATAGC CCAG GCTTG AGTGC AGT 
GGCGTGGTCTCAGCTCACTGCAACCTCCACCTCCTGGGTGCAAGCAATTCTCCTGCCTCA 
GCCTCCCGAGTAGCTGCCACAGAC I I I ! ! ATG ACCTAATCTTGGATGTACCT 
Sequence 3501 

AGGTACGCGGGGTCCTTCCACCATGTGAGGACACAGCAAGAAGGGGCTGACTCTGAACCA 

GGAAGCTGTGCTGTGCTCACCAGACACTTTGATCTTGGACTTCTCAGCCTCTAGAACTAT 

GGGGTTCTGCTCTTGTCACCCAGACTGGAGTGAAGTGGCACAATCTCGGCTCACTGCAAC 

CTCCGCCTCCCGGGTTCAAGCGATTCCCCTGCCTCAGCCTCCAGAGTAGCTGGACTGCAG 

GCGCACGCCACCACTCCTGGCTAAI I I ITGTAI I 1 1 I AGTAGAGACGGGGTTTCACCATG 

TTGGCCAAGATGTTCTTGATCTTCTCACCTTGTGATCCGCCCACCTCAACCTCCCAAAGT 

GCTGAGAGACTAAGACACCTTTCTAGACAGAGAGGAGGCCGATGGCAGACATTCTCAGAT 

AGGTTTGTAGCTATTGACCTGGCTTGCATCAAAGGAGATGAAATCCCTTAGAAATTTGAC 

AGCTTCTATGAAAATTTACAATCAAGAAGGCATAAGAACAACTGCTGCAGCTTCANAACT 

ATTGCAAAAAAATAAAATGTCAACAGCTGGGAGAAAAAATT 

Sequence 3502 

AGGTACGCGGGGGCGTTCGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGG 

GAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGG 

AACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTCCCAC 

CTTGCCCAACAGTAGAAAMCATAAGAAGAGAAAMCATTAAAAAATGACAAGGAAGTTA 

ATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGCCC 

TTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAAGAAG 

CCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCTCA 

TCTTGGCTTTCCAACTTCCAGMCTGTGAGAAGTATATGTTTGTGGTTTAGTCAATGGTC 

TATGGTAATTTTTTTATAGCAGTCCCAGCCMGACAGTGCCTCATTTACTACATACCATT 

TATATTATTATATAGGCTCCTTTCAGAAACCCATGTTCAAATAAGAGATAAGATACTGAA 

0 

Sequence 3503 

CCGGGCAGGTACAAGCTGGCATACCCATTGGGATGGGAGGACTCTGGGCTTTCAGAGCTT 

CTTCAMTGGTCTTTGTATGGCAGACTATAAGTAACTCA/O/AAAATTCTGTCTTCTCCTC 

CACATGGGTGCTGTATTTCACAACCTTCCCTACAATTAG1TAAGTGTGGCCATACAACTG 

ATCATGACCAGTGGAACTGAGATGGGAGAATTGCCTTGGCCCCTTCATGGGACTCCTCAC 

AAAGGGGGTGGCTCATTTACTCAGCCACCATGCACTCAAACCCCTTACAGGAAGGAGAGA 

ATGCAGGTGAGCTGGGGTGAGCGCTGTTGGGCTCCGGCCCCATGGCAGCATCTAGGGGTA 

TTACAGTGTTCCTTTAGCTTTGCCATCCATGGATGGCTAAGTGTTAACCAACTCAGTGGA 

GAGTCAGGGTGACAGGC l"l I I I ATACCCCGCCCTCTTGGTACCT 

Sequence 3504 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTGAAGCAA 
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AGAGGAAAGCAGACCCATGGCTCAGGACACATGGGAGTCCCACCAGCTTGAAAAAAATTT 

CACAGAGCTGGCATTTCCATCAGTGCAGACACGGCTCTGAAAACAGTAATTGCTGCCTTA 

TGGGCATGATACGCCCATTTCTACTCTGACCTCCTCCCAGGAGAAACTCTTCTGCAAATG 

ACCACAGTACCTGCCCG 

Sequence 3505 

CCGGGCAGGTACCAAAGAACAAACGGAAGCCTAAAGATGGACCACAAACCTTTGGATATC 

CAGGATAACTGAGTTAAAAACCTACCACACGGTTAATAAAATAATAATGTCCACTGAGTC 

CATGACCACAAAGCAAATCACCAGCAGCAGGATGGCATGGGCNGGCCCTTTTCGCTGGGG 

AGGGACTGGAGCAGAGGCTGGGGCTGTGAAGGTGATGGGATCACCTCTGATGGTTAAACA 

AGAGTACCT 

Sequence 3506 

AGGTACCAACCGATGACGGGAATACTTCAGCTGCAATAGGTCCCCAACTTCATCTATCTC 

CTTTAMGCAGTATCAAGTATTTCTTTGGTATGAGCTGCTGACAGGCAAAACCTGGCTCT 

GGACTCAATAATTGGGGTGGCAGGAAATCCAACCACAACGACACCGATGTTCCGCTTCAG 

CATCTCCCGTCCAAAGGTGCCAATTTTGGCAGGCATGTAGAGCATCAAAGGCACTACTGG 

AGAGTCTTCATTTCCATAGACCCGCGTACCTGCCCGGGCGGCCGCTCGACCTGGGTCTGC 

AGTAAGGAGCATTGCTATTGGACACATAGCTGCCAAGACAGTATCCAAGAATTGTCCACG 

TGGTCCCTGGGACTGCCCTCCACTGGGACAGTTAGTCTGAGGTTTGTTCCTTGGTTGAGT 

AGTTACTCTGTGGACATGG ACATAGTGCATTATTTCAGGG G TTCCTG AAATGTCTGTGTG 

TGGTCAGAAGGAGGCTTAGCCCCAGAAAGGAAACACCTCTGCTTCCTGCCTAACTAAACA 

GGCAGTGATGGTCTCTTCTCCATCTGGAGAAGCTTGGG 

Sequence 3507 

CCGGGCAGGTACGCAGGGAGCCCCTCATGGGAGCCGGACTCCCTGAGACTCAAGAAAATG 

CACCCGTCCTGCGTTCCTCCTGCCTGGGGTGACCCATAAGGAACACCCCCCCCTTCTCAA 

CAACCACAGAAGATAAATGCCAAGGGGACCACCAACATGGGCATGGGGCTGTGATGCCAA 

TCTTAAGGGAAAACACAAGTAAAATTGGCTCCAGGACTGAGCTGGGGGCAGCTGCTCTTT 

CTGTCCATGACTTTGAGGCGGGATCTCGAGCGTCTGTCTGACGGGCCTTCTGTCTGCTGT 

CCTCTCGCAGGTTTCAGACATTGAATCCAGGATTGCAGCCCTGAGGGCCGCAGGGCTCAC 

GGTGAAGCCCTCGGGAAAGCCCCGGAGGAAGTCAAACCTCCCGATAT7TCTCCCTCGAGT 

GGCTGGGAAACTTGGCAAGAGACCAGAGGACCCAAATGCAGACCCTTNAAGTGAGGCCAA 

GGCAATGGCTGTGCCCCTATCTTCTGAGAAGAAAGTTCAGTAATTCCCNTGAAAAGGTCA 

AG G N AAAG ATG ATN G ATTCTTTT 

Sequence 3508 

CCGGGCAGGTACGCGGGGGCTTTCTCTTACTGATAGTAGGATATTTCTGCTTTAGTTATT 

GTCACCTTAAATATATTTTCMTGTTGAAATCCTCACAGCATGTTT3ATGAAATCTAGTT 

TTCAMTTTTCTTAGGTATATTTCTTGTCACGTTGGCATGATAACAAATGCAATAACCCA 

AMGACCCCAAMGCTAGTGTANTCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCAT 

GTTGACAGTGCGTGAATGTTCGGTAGGCTTTGNCAAGCTTGCATACAATAAATTATATAT 

GTCCCTTTTCTTTTANGGGTCTCCTGTTAAAGATGGTCTTCTGMGGGCTAACTGCGGAA 

TGAMGTTTCTATTCCAACCTAAAGCCTTMGAATTGATGGACATGCAAACTTTNAAAGC 

AGGAGCCTCCCGAGAAAGCCATCTGCCTTCGAGCCTGCCATTGAAATGCAAAGTCTGTTC 

CAAATAAAGCCTTGGAAATTGAAGAATGAACAAACATTGAGAGNAGATGAGATACTCCCA 

TCAGAATCCAACNAAAGGGACTATGAAGAAAGGTCTTGG 

Sequence 3509 

CCGGGCAGGTACACACAMGACAAACCTGMCTTMTTTCAAGGAAAACTTAAACCCATG 

CACAAATAATTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGTGAGGACTCATGC 

TCTCTCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGGGTAAATTTGGGCAAAG 

CACATTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGTCTGCCCACAGTG 

ACATGAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCTGTGGCCATGTCA 
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AGCMTTCTTTCTTTCAACTGCAACTGTGTGTAAGAGCTTAGTCTGAGAAGAAATGTTCA 
GAAGCTCACTGTGGCTGCACATCTGAGCCATGTCTTCCCATTAGTTGNCATGAGTCAGCA 
ATAAAG CGGGTATG TTGATGTCTATCAATCTAATTCCTATGTTCTGMCTCANGGAAAGA 
AATANGAGAATCA 
Sequence 3510 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACC ATTCAAGGCTTCTG G CTCTTGACAAAG ATAG ACC ACTGG AACAATGAG AAGG AG AGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTTCCTTA 

TGCTACTTTCACTGAGCATCCTATGAAATACACCAG TG AG AAATTCCTTGAAATTTGCAA 

GGTAGGAGGCATCTTGGGTATTACTCTCANATTAGCTTTGTTGCTGGAGCTATGTGCTTT 

CTAGGGACGTGTACTCCTAANCAAACCNNNTTNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNGGTCCCTNGGNCCGTTTTANAA^CTAGNGGGATCCC 

Sequence 3511 

AGGTACCATGCGGTCAGCCAAATGTGTAAGGAACTTGTTTTGAGAGGAGAGAAGAAAATT 
CTATAGTAGGCGAGACAGGCTTCTCTCTCCTGCTCCTCCTCCCTTTC I I I I I CTCTCTCT 
CCTCTTTCTTCCCTCCTTCTCTCCCTCTTTCTCTTTCTCTCACACAGTTCCTACATTTCT 
GCACTGGGGGACAAACCAGGCTTCTGATCAAAGGCAATCTGAGACTTGGGTTCAAAAGCT 
TT GATG AAGATTGGTACCTGCTATGCAGATGGGGGGAGGGAAGAAGAGACTAGCCTAGGG 
GCTTTTATATTTCTTTAAAG C ATCAG CCATTAATGGGCTAAGTTTACCACTGGGACCCAG 
AGAGCTAGTCCCCAAACACMGCTCAAAATGTATGCATTTTCCMTTCATGGTTTTCAAG 
GACATGTTCCCCCGCGTACCTGCCCG 
Sequence 3512 

AGGTACTAGGTTTTGGTTGGGTAGGAGAGACCAAAACCAGGGAAAAGCTTAGAAATCCTT 

TATTGGGGTTTCTGCAGGAAATGCAAGGCAGGTCACAGTTAACAGCTTATTATTGGCTAG 

TTTGAATAATTCCAGTGGGCCCTGGGGCACAGGTATCTAGTTGTCTGGTCCCAGCTCTGA 

GATGATTTAAAACAGGGAAATACTGGCTTAGTGTGTGAGAGTGAGAAAAGGAGCTGATTG 

TGGCTTTATACTTGGGATTCATTGGTATGTATATGAAAGACATATTCTCAGTGGACCCCC 

TCACTAAACTGCCCAGGAAGGGCAATCTTTTCCCAGGTCTGNAAGGCCCCCCAGGNCCCT 

GCCCGGCCGGCCGTCTAGAACTAGTGGATN 

Sequence 3513 

CCGGGCAGGTACAGTCAACATGCATGATGTGTAACTCCATCTCCATAGGCTAGAGGCATC 

CTAAGTGCTGCAGACCAGACCTGGACAATAAATGTCAAGAGTCACCACCACCCCGAAAAG 

AACTAGAAGTCTGTAGATCTGCTGAGGAGCCCAGTCATTAAAATACAGGATGGGGTTCAC 

AGAGACTCAAGGTCTGCCTCTGMGCTTTTCTTGGTAGGATCTGAAAACCCAGAAGATGC 

TTTAAATCCTCACCCTAGTGAACAGATGCAGGTTCCCTGGTCCTTGGCCTGACTGCAGTC 

AGAGAGCCCAGGAGTTAGCTGGTTTTCTCCCTCCATTTTTGGTGTGAAAAGAGCCACTGC 

CACAGCAGCCATGGACTTAGGATGCTAGAGTGCATGGTCTGGATGCAAAACTTTTGCCCT 

GAMTGTGGCATCMCACAMTACAGAGTCCAGATTTTTTCATCTTCCTCTGGTGATGAC 

TCACCATTAGTGTCCATACNTTGTAGGTGCTTATGAAATAACCCTCAGCCAGGACAGCTA 

CTCTTCTGG AACTG NAGGAAG TGTTCTTATNCC7TTTTTTTCA WATTAAA 

Sequence 3514 

CCGGGCAGGTACCCTGGACTCATACCTGAAAGCAGTGTTCAACCTTAGCAAAATCTCCAA 

CCAGCGCATGAACMTTTTCTACATCACAACGACCTGGTTTTCAAATTCAGCTCTCAAGG 

CCAAATCTTTTCTAMTTTAACCAAGAACTTCATCAGTTCACAGAGAAAGTAATCCAGGA 

CCGG AAGGAGCCT CTTA AGGATAAGCTAAAACAAGATACTACTCAGAAAAGGCGCTGGGA 

TTTTCTGGACATACTTTTGAGTGCCAAAAGCGAAAACACCAAAGATTTCTCTGAAGCAGA 
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TCTCCAGGCTGAAGTGAAAACGTTCATGTTTGCAGGACATGACACCACATCCAGTGCTAT 

CTCCTGGATCCTTTACTGCTTGGCAAAGTACCT 

Sequence 3515 

AGGTACTTTGTTACAGCAGCCTGAAAGGACTAAGACACCGA CCTA GTCTCCCTGATGAAA 

MGTTTCTCTCAGACTTCTACCCTTTCCAATGTGGCCAAAGCTTTTCATTCCGAAGAAGT 

TTCCTTTCTGAGMCGCTCATTGTGTCGTTTGGCTTTCCCCGTCTCTGCTTGACACATGA 

ACCAAAACAGAGGCAGCCAAAGCAGGGAAAAAAAAAATCCTAGGATCAGAGTCCACTCTA 

TGCCCTTTTGAGCTTCAAAAGGAGAAAGAGACAAAAGCCAAAAGCAATGGAGGTCAAGCT 

GCCCGGTACCTGCCCG 

Sequence 3516 

AGGTACGCGGGGACTTCCTCATAGACCTTGGATGTGGGAGGATTGCATTCAGTCTAGTTC 

CTGGTTGCCGGCTGAAATAACCTGAATTCAAGCCAGGAAGAAGCAGCAATCTGTCTTCTG 

GATTAAMCTGAAGATCAACCTACTTTCMCTTACTMGAAAGGTATTAAGCGCCTTTCT 

GAGAGCTCTCAGTGGGCTTCCTAAGCACGTTCACTCTGCCTCCTTTAAGTCTTATATTTA 

TGATCMGATGAAAGGGAGGGATGTATCACTTGCACCAGAGATTCTACAAGTGGGATATA 

TAAAAGCTMTAATGGTTTTCAGTGAGCTCTTCTNTGGCAACTACTTTCCMGTGCTTTC 

AGC 

Sequence 3517 

AGGTACTTTGTGGGTTGAGGGTAGGAGGCTAAGACTGCCAGGGAGGAGGTCCCGGCTTCC 
TGCTCCTACCG GGG AGC AG CAG GTG AAACTCTG ACCACGTTACTG CAATCCTGACATGCT 
CCAGTGGAGTGGCGACATTTTTCTTCTCGAGGCAGCTTTTGGGGA TCTGTATTACTTT CA 
TGTGGACCACTTGCTAGTTTTGATTTCMTGATAATTTCTTCCTTCCl 1 1 1 1 III I I I I I 
CAAGCCTGTCTTCATAGTGATGACAACACATTTAACATTTGTTTTGATTTTACCCTCTC 
TCTCTCCCCACTCTCAGTCTGCANCCAGGAGAGCAGGNACGTCCTGTGCNAACTGTCANA 
CCACCACAACCACACTCTGGAGGAGGAATGCCAA 
Sequence 3518 

CCGGGCAGGTAC M I I N H I t I I I I M M I I I I I I I I I ACGTAATACACGTTCCTATGT 
TTTCAAAMTTAMTAACAATTTAAAACGCATCCATTGTTATACTGTTCCTCACTATAAT 
TCCTGAACCTCCATGTAAACCCATTTTTATATCTATCAATTTTTGTTGGTGTCAATTT 
TGTTTTTAMTAGCATGACATGCCTTTTT^ 

TATCTCTAATGGAAAACTGMGTTTATCTATGTTACATTATGCCACTGAAATAAACATAT 

CCCAGCTGTTGAACAAGTGACACTTCCTGTCTACCACCAGAAACAGCAGCCCCAGGATAC 

G 

Sequence 3519 

CCGCGGTGGCGGCCGAGGTAC I rTTTTTTTTTTTTTTTTTI I GCAGGTTGAATGCAGGGT 

TTTATTGAGTGGTGGAGGTGGTTCTCTGAAGGATGGATGCANAGCTGGAAGCGGGGATGG 

CACGGGAAGATAATCTTCCTGCTGGGTGGCCAAACCTNTCCGACCACCCTCGGCTGAACT' 

CCTCTCGGCGTTCAAATGTTCCTCTTCTCTCTTTCTCTACTGNATCCGTTTCACCATTCA 

TCTGCTTGTCTGGGATGGTGMNACAACTGTTTGCTCATTTGCTTGGCGATAGGTGATTG 

TCTCGCCGCTCAGCTATAGGCGATGGTCTCACCACTCGGTGATAAGGCCGATTGTCCCAT 

CTAGGTCGCCAAMTGTGCCTTGGMTTGGTNGGGGTTCTTTCGTCTCACTTGACTTCAA 

NAAATTGAANCCCG 

Sequence 3520 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTCCTGCGTTTGTAGATG 

GAAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTAC 

TTGTAGGACCCTATACGGCMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGA 

AAATGAGTCAAGGAAAGCTTTTTCTTTTGAGCTATTTACAGCTTTTAGCMTTG 

GTATACTCCTGTGAACAAAATTTGGMCATATTTGTTTCTCTCTAACTGATTTCTCCAGA 

ATTTGGAACTAGTTAGTCTATATGTCAGAATACACATTTCCCACCTTGCCCAACAGTANA 
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AAAACATMGNMGAGAAAAACATTAAAAMTGACMGGAANTTAATGGGAAAGCAATTG 

TGGATGGTGTTTGGAGGTGGGAAGCCTTCAGAAAGGTAATTAATGCC 

Sequence 3521 

AGGTACTTAGAGTCCAATTTTCCTACCTTGGCTTGTGCATGAGGTTGCCTGGTTGCCACG 

GTGCCTGCTTTTCTCCCTGTGTCACCTCCACTGCCTCCTGAATAACAGTCTCAGGTTTGT 

CTATGGCCTCTGCAGGGAGCCGGGNATTCCCAGACAGGAGCAAGGCCAACTTTAAAATNA 

AGGGTGGGGGACGGCCCTCTTCCCCCNTCTTTGGGGCCCGGGNAGTTCCCAAAACTTCNC 

ACCGCAAGGGCAACCAGGAAGAATCCCCCAAGNATGGGGNCNCCCCCCAAGTTTTNTGGT 

GNAGGAAAMCCMCATTITTCAATNCTNGTTCCCAAAAACCNCCCAAAAAAGGAAATNG 

GGAACCTTCAAGNAAGAACCACCANAAAAAGCAAAACAAAATTNGGGAAAAGGAAGNAAG 

AAAI I I I I I I I AAAATTGGGGTTGGAATTCCGGTTGCC AAAGG N AAATGGGGGTTGGTCC 

TTGGGTTAGGGCCAAGGGCCCCCCGGCCGGTTACCACTTGGCCCCCCGGGGGCCG 

Sequence 3522 

GCGGGCAGGTACGCGGGCMCTATTGCMTGAAAAATTTCMCAATTTTTTAATGAAAGG 

ATTCTGAAGGAGGMCAAGAACTTTATCAAAAAGMGGTTTAGGTGTTAATGAAGTGCAT 

TATGTGGATAATCAGGACTGTATAGATTTMTTTGAAGCCANAATTAAGTGGGGGGAAAT 

ACCTTGGGAATA I ITTT1 IGGGGAATTGAAANGAAAAAAAATCCGCCCCTTTTCCCCCCC 

AAGCCCCAAANGTTGGNATTCCAAMCCMCCTTTTTTAACCMTTTCTTTGGCNAAGGT 

CTNCCAACCCCCAAAANAAAAGGCCCAACCCAAAAGGGGGNAATTNCCAAI I I I I I 1 1 I I 

NCGGGAACCTTTNCAAACCTTNAATTTTTCCCCCCCAAGGCAAAAAAAAAATTCCCTTAA 

AMGGGCCTTGGGGGGCCCMGGTTFrTTCCMTTTMGNNGNNAAMNTTMTTTNCAA 

AGGNAAGNTAMCCCGNAAACCCGGNAMNMGGGGGCCCTTTTTCCNMTTTTTAMTT 

TCCAAAGGGGGCCCAAN I I I I II I I IGGGGCCCGGGGGGGGGGGGGGCCCAAANGGTTGG 

GGTTTGGCCCTTTMTTTGGGAAAAAAAACCNAAAANCCCCC^ 

TTNGGGGGGAAG 

Sequence 3523 

ACTATAGGGGCGGGACATTGGAGCTCCACCCGCGGGTGGCGGCCCGCCCGGGCAGGTACC 

AAGAAAACTCCATCCACAAGTCCTTGCTGAATAATCAATCGCTGAGCCTCATCTCTAGAA 

MTTTTGGTGGGTGGAAACCATGGCTGGGGACCCGGTGGGGAATAGCCATTGTTTGGTGG 

GCAAGAGCCTCTGTGNAAAGAAGGGCCAGGTGGGGGGGGCCTACCCGGTTGGAAGGTTGC 

CCCCAGGGCCGTNAAAAACCATTCCC I 1 1 1 I I 1 1 1 1 CCCTTCCCCAAAGGCCGGAGGTTC 

CCCTTTTCCTTTCCAAACCCCCGGCCAAMCCTTGGAAAAAAGGGGGGCCTTTTTCCAAG 

GNTTGGGGGGGA A l nTTTTTl CCTTAAATTTAAAACCNTTCCTTTGGCC 1 1 1 I 1 1 1 1 I I 

TCCTTGGGGGCCCCCTTGGGAAGGGAAAAAGGNTTCCCCCMNTTTrTGGGCCTTTAACC 

CCCAAAGGGGGGGGAAAMNTTNTTTCCmCCCTTGGGNMTTTMTTTM 

TCCTTTCCAAATTAMGGGGGGTTTGGGAAATTAAATTTGGGCCCTTTCCCTTGGGTTGG 

GAAAAAACCCTTGGGGCCAMGGGCCCCCCAMCCCTTTTTTNCCTTTAMCCCCCTT^ 

TTGGGAAATTAAATTTGGGGGGAAAATTGGGCCCAAATTTAAATTTAA 

CTTTGGGGGGTTNCCCCCCTTTCCGGGGGGCCCCCCGGCCCTTTCCCTTTAAAGGAAAAA 

ACCCTTAAGGGGTTNGGGGGGAAATTTCCCCCCCCCC 

Sequence 3524 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGTCCGTAGCTGAGACTTCTGGAGAGAC 

CATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAAT 

TCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTG 

TGTGCAGCTGCAGCGGATTCCCTCTGGAGCCGCTGTCTATCGCATCTGCCTGGGCAAGTT 

CACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTA 

TGCTACTTTCACTGAGCATCCTATGAMTACACCAGTGAGAMTTCCTTGGAAATTTGCA 

AGGTTGTCTTGGGTTCATGTC 
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Sequence 3525 

AGGTACGCGGGGGMGTTAATGGAAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGA 
AGGTAATTAATACCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATG 
TGAGGAGCCAAGAAGCCGGCTGTCCTGCAACTCTGGCAAGGANG ACCCCT TCNCTTANAA 
MNCTNAGCCCATTACCTTGGGGCCATTCCCTTCAATCTTTGGGC CTTTTTC NCCAAACC 
TTTTCCCCAGGAAACCTTGGTTGGAAGGAAAAGGTTAATTAATTGGl UN i GGTTGGGG 
TTTTNTN ANGG NTCC AAAANTGGGGGTTNCCTTAAATTGGGGGTTCAAAAN I I I I I i u I 
TTTTTTAAATTAAGGNCCCAAGGTTTCCCCCCCMGTCCCCCAANAANGAAACCNAAGGT 
TNGGGCCCCCTTTTCC AA I Hill 1 1 NMCCCNTTAAACCNAATTTAANCCCCC AAATT T 
TTTTAMTTTAATTTNTTAAATTTTT^ 

TTNCCAAGGNAAAAAAMCCCCCCCAATTTGNTTNTTNCCANMNMTTTANNACGGGAA 

AGGGMTTTAAAAMGGNAAATTAACCCTTTGGGAAAAAAAACCCAA 

CC AACCCCCTTTTTNC AAACCTTTAAGG NTTTTTTTTTTTA 

Sequence 3526 

ATACGACTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTAC llililli 

rTTTTTTTTTTI 1 " 1 GCCTGGAGAGAGAGAGATGTTTTATTTTTGCCAAAGCATGTGANAG 

TAAGTAGCAMCATCATGACACTTCACCCCTCAAATATTTTCCCACGTATCACTTAANAA' 

CAAGGGC ATTC ATTCTG NTACATG ACCAC AATNCC AATCC NC NCC AAG ATAGG ATATTAA 

CATTGACACAATGCTCNGATGTATNGGTNTCTACTATGCAGTNCATAATCAAATAAGGGT 

TTTGGTAACTNGGGATC 

Sequence 3527 

AGGTACAAAAGGTCTGTCTTCCATCAGACCCAGTGATGCTGCGACTCACACGCTTCAATT 
CAAGACCTGACCGCTAGTAGGGAGGTTTATTCAGATCGCTGGCAGCC TCGGC TGAGCAGA 
TGCACAGAGGGGATCACTGTGCAGGTGGGACCACCCTTACTTGGGNCTTTTTNCNAGCCA 
AGGGGTTTTNNTGGGGGATGGTTTTTTACA^ 

TMGCAGGCCAAAGGGNGCCTTCACCAAAANACCAAGNAMNATTANCTTGGTCAATTGN 

GGMGGGCTGGCCTTNGAAAANCAACNAGNGGGGGAAAAGNGGTTCTTTGTGNTTNCCCN 

TTGNCCACACNGGGGGCCGGGGCCTCCGGCCTTCTTAAGGAAAAACNTTANGGTTGGGGG 

NAATTCCCCCCCCCCGGGGGGCCTTGNNCNMGGGGANMTTTTTCCGGAATTNATTTCN 

MNAGGGCCTTTTTMTTCCGGGANTTMCCCCCGTTTCNGGMNCCC CTTNNTGGAA AG 

GGGGGGGGGGNGGGGCCCCCCCCGNGGGTTMCCCCCCCAAGGCCTTTTTTTTTTTGGGT 

TATTCCCCCCTTTTTTTAAAG 
Sequence 3528 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACCTGAGAATCACAAGCGGTGTGG 

TGGTGTGTCAGTGTGGCCTGTCCATCCCATCTCATGTGAGTGTTCCACACACAGATTTCA 

TGATACTTC7TCCCACATGGATCTACT7TCTTGAAGATCCTCCAGTGCTGACTTGGAGCC 

CATTGTGCATATTTCTTCCCAGTTTTATGTTCAGTGGTGTCACATTGGTAGCTTCAAATC 

AGCCATGGTGAGAATATCTATGGCATGAAAATTGGCAAACAGTACCTGCCCG 

Sequence 3529 

AGGTACGCGGGGAACAGAAAGCAGCCCAACTTGCTACTGAAATTGCAGCATATAAAGTCA 

AAGTATGATGAAGTAAAGGAGTCTTCGAGAAGCTGCTGAACAATAAAGTGACACCGTCTT 

GGACAGGAANGAATTACANACAATANGAGGGCCAACCTGTANATTCAAAGGATCTTGGAA 

AGGACCGGGGAMCCTTGCTTCAAGGAAGGACCCCTTGGGGTATTTAAGGAAAAGAATGG 

GTTTGGCCCCGTTGCCTAAAAAAGGAAAAMGGAMCCTGGGGGTCCCAAMGGTTTCCA 

AAAACCACCTTAAATTGGGGATTAAMCCMTTGGACCCCTTTTGGGGAAACCGGTTGGA 

GGCCGGAAGGAAMTTCNTTGGAAAAAAAAACCTTCAAAAAAAGGAATTGGAAMTTGGC 

CANAAAAAAAMTTTTGGCCAAGGGTCCACCAAAATTNMTTGGCCTTMGGCCTTCCM 

GGAAGGGGCCCCACCCAAAATTTAAAAGGCCCCCCAAGGCCTTTANAAGGGGGAAGGNTT 

GGAAAACCTTTTGGCCCCAAAAAAGGGGGCCCCCCCCCCCCMGGGGGAAAMGGTTTG 
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Sequence 3530 

TACGACTACTATAGG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC 1 1 1 ) I 1 11 

rTTTTTTTTTTTTn i 1 GAGAGTACTTTCACATGCTATACCACATTTAATTAATTCTAAC 

AACAGCACTG GAAAATAAGTATAATTATG CTCTTTAGTTCCTTCTANACAG AAAGCTTTC 

ATGGGGATGAAAGACTGTTCCTTAGCTCANAGTAGGGAATCAAAGTAGGCGGGGCTTAGT 

TCAGAGTGGGCGGGGCTTANCTCAAMTTGGCCATCAATTTGTANAATTGAAGTGTGAAC 

TNTNAGGATGAAAAAACAGGGCTGCCAGGGGTGTNTAACTTGCACCAANTNNCCCTACTT 

TTANATTTGGGCCCNAACCCGGCCCMCTTTAAGTTTTTTCNAAMTGNTTTAA 

GGNGCCTTTGANCCNCTGGANTTNCCTTTTG 

TTCCCCTNNGNCCCCCNGGGGCNTTTTTTTTTTTT 

Sequence 3531 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTAGATGG 

AAGGAAGAACTTGTGTGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACT 

TGTAGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAA 

AATGTGAAGTANAANAGACAGGGGATNTTCCCAGANGGTTATTGCAAAACATCAAGANGA 

AGATGAGAGGAGTCTATATGTCACGAATACACATTTCCCACCTTGCCCAACAGCANAAAA 

AAAAAAANAAAAAAAAAAGTACCTGCCCGGGCGGCCGNTCTAGAACTAGT 

Sequence 3532 

CCGGGCAGGTACATAGCCCCAGAAGGGCGGACTGGCCGGAGTCCAGGGATGGCAGCCAAC 

GCCCCATAACAGAGATCAGCATTGGACTACAAGAAGAGGCAAGGAGAAATCAAGGATCAA 

MTTTMGTAAMGAAMGCCAAGTCATTAAAAATAGACCCCNTNATTTGAANAAGTNGG 

GAACCGTAGGTGTTGATGTTCTGGCATAAGGAGTGAAAAAGAAAAAGCTCTATTACTTGA 

AGCTTTTCACCAGGGGCAGAGGGAATGGCCGGAAGTGAGAAACGTGTGTGTGGATGCTTA 

CACCGATGCCGTTCTCCTAATATTGGAACATGGCTCCAGAAAGGAGAACCAATTATTNCT 

AATTCCACGGGGCGGCATTCTCTTGACTCCCAAACTTCCAAAGTGGAGGGGCAG 

Sequence 3533 

CCGGGCAGGTACAGAAGGGCGGTCCTGCTGACTTGGCTGGGCTAGAGGATGAGGATGTCA 

TCATTGAAGTGAATGGGGTGAATGTGCTAGATGAACCCTATGAGAAGGTTGGTGGATAGA 

ATCCAGAGCAGTGGGAAGAATGTCACACCTTCTAGTCTTGGTGGGAAAAGGAAAGGCCCC 

TTTTTGANTTTATTrTTTCCCCAANGGCCTTANAGAAAAAMTTCCTCCTM 

TTNTCCCCTCCCCCCTTGGGTCCTNGGATTCCACMCCTTTTGGTAACTATCCCTCCCCT 

TNCNCCANTGMTTCTTCTTAAAAAAGNAAMGGGGGAATNTTTANGGTTTGGGGATTGG 

GGTAAGCTTTCCANTANACCACCAATTGCNAANCTTTCNGGCCAAACNAAGTTGGGNNTC 

NAANAAAAAAGGAAAAANCCGNGGNGNCCACNTCTATNCCAAGGTNTNACCCCCNTTCCN 

GNGCCNCCCGNCNTTTCCTTAAAGTAAAAACCCTTAANGGTTTGGNNGNAATTNCCCCCC 

CCCCC 

Sequence 3534 

CGAGGACGCGGCAATAGAAGCCGGCGTAAAGAGTGTTTTAGATCACTCCCTCCCCTTCTA 
CTCAACCTGGTGACTCATCTCTCCGATTGCAAAGAGCAAGACACGCCACTCCGCTTCAAC 
GCCAAAGCGGGGATCTGCTGGGGCACAGTATGCAATTTAAGGAAAACCCCCAAAGGCTAC 
AGGCGACCTGCTGATCAGGAAAGAATTTCGCTCTTGTCAAGTACCTGCCCG 
Sequence 3535 

AGGTACTCAATGGACCMTCTGAATTMGCTTTTCCTG7TTTTCATAGAAGTGTTC7TTT 
CTCTAAACTGCTGTAATTTTTCTTGACTTATTAGGCTGGAGGACGTGTGCCAGTGTTAGA 
CACACTCATAGAACTTGTTACACGAGGCCGATCTATCCCCGTACCTGCCCG 
Sequence 3536 

NNATTGGAGCTCCCCGCGGTGGCGGCCGAGGACGCGGCAATAGAAGCCGGCGTAAAGAGT 

GTTTTAGATCACTCCCTCCCCTTCTACTCAACCTGGTGACTCATCTCTCCGATTGCAAAG 

AGCAAGACACGCCACTCCGCTTCAACGCCAAAGCGGGGATCTGCTGGGGCACAGTATGCA 
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ATTTAAGGAAAACCCCCAAAGGCTACAGGCGACCTGCTGATCAGGAAAGAATTTCGCTCT 

TGTCAAGTACCTGCCCG 

Sequence 3537 

GACTACTATAGGGCGAGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGG 

GGCTGCCTGGGGTGACCCATAAGGAACACCCCCCCCTTCTCAACAACCACAGAAGATAAA 

TGCCAAGGGGACCACCAACATGGGCATGGGGCTGTGATGCCAATCTTAAGGGAAAACACA 

AGTAAAATTGGCTCCAGGACTGAGCTGGGGGCAGCTGCTCTTTCTGTCCATGACTTTGAG 

GCGGGATCTCGAGCGTCTGTCTGACGGGCCTTCTGTCTGCTGTCCTCTCGCAGG7TTCAN 

ACATTGAATCCAGGATTGCAGCCCTGAGGGCCGCAGGGCTCACGGTGAAGCCCTCGGGAA 

AGCCCCGGAGGAAGTCAAACCTCCCGATATTTCTCCCTCGAGTGGCTGGGAAACTTGGCA 

AGAGACCANAGGACCCAAATGCANACCCTTCAAGTGAGGCCAAGGCAATGGCTGTACCCT 

ATCTTCTGAGAANAMGTTCAGTAATTCCCTGAAAAGTCAAGGTAAAGATGATGATTCTT 

TTGATCGGAAATCAGTGTACCTT 

Sequence 3538 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTATATTTTGTAAACAAAAATGACTG 

GCTTTTCMCCACCCCCTAGTCAMTCCACCACAGACmCCTGGTAGATTTAAGAAACC 

CAGTCTTAMCTGCTGTTTGCATATGTCTTCTGATGTTGATTATTTAAAAAAAAACAAAA 

MCGGCCATTTCGGAMTTTCATATTGACAGTTTTCATTTATAGTACGCGGGGACTGGGA 

GAGCGGGCTGAATGCAATGGAGTGTGCATTACATTTGGAAAAAAATGTGAAtCAGTCACT 

ACTGGAACTGCACAAACTGGCCACTGACAAAAATGACCCCCATTTGTGTGACTTCATTGA 

GACACATTACCTGAATGAGCAGGTGAAAGCCATCAAAGAATTGGGTGACCACGTGACCAA 

CTTGCGCAAGATGGGAGCGCCCGAATCTGGCTTGGCGGAATATCTCTTTGACAAGCACAC 

CCTGGGAGACAGTGATAATGAAAGCTAAGCCTCGGGCTAATTTCCCATAG CCCG TGGGGN 

GACTTCCCTGGTCACCMGGCAGTGCATGCATGTTGGGGTTTCCTTTACCTTTTCTATAA 

GNTGGCCTGCCCNGGCCGGCCGTCTAAAACTAAGNGGATCCCCGGGCTGGAGGAATTCAA 

TATCAGCNTATNGATCCGCCAACTTGANGGGGGGNC 

Sequence 3539 

ANGTACATCCAAGATTAGGTCATAAAAAGTCTGTGGCAGCTACTCGGGAGGCTGAGGCAG 

GAGAATTGCTTGCACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGACCACGCCACTGCACT 

CMGCCTGGGCTATAAGAGCAAAATTCTGTCTCAAAAAAAGTCCGTGGCTTCAATCTTGG 

GGGCTCTGCCTCTCTGGCTTTTTGCTCAGTTACTCTGAAAAAAGCCAGGTGTCACACTGT 

AAGCTGTCCTACAGAGAGGCCCATGTCACAAAGAACTTATGTCTTCTGCCAAGAGCCACA 

TGAATGAGCCTGGAAACTGAGTCACACTCTGCACATTCATGTACCTGCCCG 

Sequence 3540 

CGAGGTACGCGGGGGACCTTGGATGTGGGAGGATTGCATTCAGTCTAGTTCCTGGTTGCC 

GGCTGAAATMCCTGMTTCAAGCCATGAAGAAGCAGCAATCTGTCTTCTGGATTAAAAC 

TGAAGATCAACCTACTTTCAACTTACTAAGAMGGTATTAAGCGCCTTTCTGAGAGCTCT 

CAGTGGGCTTCCTMGCACGTTCACTCTGCCTCCTTTAAGTCTTATATTTATGATCAAGA 

TGAAAGGGAGGGATGTATCACTGCACAGAGATTCTACAAGTGGATATATAAAGCTAATAA 

TGTTTTCAGTGAGCTCTTCTCTGGCAACTACTTTCCMGTGCTTTCAGCATTCCACCTTG 

AAGCACMCGTTAATAATGCACAATTTCTTATGTTCAGCCACAGAATGTTCACCGTGTGA 

TTTGGTTACMCATATTGCTCTATCCAGGCCTCTMCAAMCTGACTTCTTTCAAATGGA 

CTTGCAGTTGCTGMTTCCAAATCMCACTTTATCTTAAAAACAGAAAAGGAAGGGAAGG 

GTGGAAGGAGGGGAACAGAAANGGTCCTGCCCGGCCGGCCG 

Sequence 3541 

TGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCGTAGGGTTTTTAATACCAGGTTCA 
CCACAAACACGAGTAATGTGTTGTCTGTGGAGTTATGACAGGTGGTAGAAAGTTCCCCGC 
TCTATTATAATATTATGGGACCAGCATTGTATACACAGTCCATCCT TAACCA AAACATCA 
TTATGCTGCCATGACAACATTTGTTTGGTTTTCTTGTTTGTTGTTTGl I l I \ AAGACCAG 
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GTCTGGCCCTGTCGCCCANGTTGGAGTGCAGGAAGAAAACTAGGAGAGTATGGTGTCACC 

GAAGCCCAGGGAAAAGCGTTCCCGAAAGGGAGCGGTCCACANGACACATGAAGTTAGAGT 

ATCCGGAGGATTGAGCAACATGGAGA7TGCTGCTCATCGCMCTACAGCANTTTCAATTG 

AGTTAAGGACCTGAACGCCAGACTACAGCGGAGGTGACAAGAAGTAAGAAAGTGAAGACT 

AAAAATGTAAATMTTTCAGNTGTMTTGTGAAAGGAGAGATGATTCAAAGATAAGTTNA 

GGAAGAAGTGGAGCAAAGGCATTACTTCTTTT 

Sequence 3542 

CTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGC1 I I I I CTCTGAAAGCAAACGTGTGTC 

TTTTACACCAGGGCTTTCTCCCCACCCCAGGGGGTGTCTTCCATCCTTTTGTGGCTCAGT 

TGAAGGCGAAAAGGGCTCCAAACCACTAACTAACCAGAGGAGAGCCCCTTCTTCCACCTC 

CAGGGAGMTTTCAGATTTMTTTGTCCGAAGATAGCGTGCTCTCTTCTTACTCATTTGC 

CATCATTACGAGGAAAACAAACCACCACCTTGGCTTCAAGATCCTGGGTAGAGGCTCACG 

GTCTTTTCAACCATCTTTGGCGAGGCCTTGCTTCCTTCCACTCGAGCCTGTTGACTGGGG 

CTGCTTTTAACCCTTTCCTATTTGCTGAGAATGCAGCCGTGTGACAGTAACTGAACATTG 

GTCTAAAGTCTTTCCAAAAGGTCAAGGTTCACAAGAACATCTGCTCAAATTAATGACCAT 

GGGGGATATGAAGACCCCAGACTTTGATGACCTCCTGGCAG 

Sequence 3543 

NNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAAGAATTAGGCACAGAAAACTGT 

TTTAATTAGGGTAACAGAAAGCAGCTGCAATACAGAGTGATGAATGTGAGCTTTGAAAGT 

AMTTTGGTGGGTTTCAGGATTTCTGAGG AGGGAAGTCAMC AGTTCTAMG AGTTG G CA 

AGGCCAATTGAGAGGGTGGTAAGCTATCCTAAGCCCTTTCTTCTACATTMCTCTGCTTT 

CCAGTTAGTCTTCAGGTGCTTTAANAATTTGAGATCC I Mil I CTCATTGTGTGGTCCAT 

GGACTAGMGTATTAGCACTACCTGGAAACTTGTTCAAAATGCANAATTCTGGGCCCCAA 

MCAGACCTACTAAATAAAAGTCTCCATTTTAAAGAGATTGCTAAATGATTCATAACTTG 

TTAAAGTTTGAGAGGCATTACCTTAGATTCTGTCTTCTTNTTACCTTTCAAACCCAGCTG 

AMGTGGGTTTGTATTTCCCTTTACTTTCGACCTTTANGGAAGTACCTGCCCG 

Sequence 3544 

AATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAA 

GGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATA 

GACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATC 

TGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCT 

GTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAA 

GGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGG 

AACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACC 

AGTGAGAMTTCCTTGAAATTTGCAAGTTGTCTGGGTTCATGTCTAAACTTGTTCCAGCT 

ATCC AG AATG CCCAC AAGAATTC AACTG 

Sequence 3545 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTACTCACCCTTCCTCTGACAGAAAAG 

GATGAAGTCAAGGGCCTGGTAGAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAG 

AGTGACCAGCAAGCATTTTTTGCAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGA 

CCACCCAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAG 

ACTTGGTGACTTAAACCACAGATATTTATCTTCTCACAATCCTGGAGGCTGGAAGTCTGC 

AATCACGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCTCTTTCCTTCTCACTGTGTGCT 

CTTTCTTGTGCATGGAGAGGGAGAGCATGAACAAGCCCTCTACTGTCCCTCTTAGAAGGG 

CACTAATCCCATAATAAGACATCCACCCTCCTGACCTCGGCTAACCCTAGTTAAGTACCT 

GCCCGGGCGGCCG 

Sequence 3546 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCGGGCAGCTTGACCTCCATTGCTTTTGGCTT 
TTGTCTCTTTCTCCTTTTGAAGCTCAAAAGGGCATAGAGTGGACTCTGATCCTAGGATTT 
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TTTTTTCCCTGCTTTGGCTGCCTCTGTTTTGGTTCATGTGTCAAGCAGAGACGGGGAAAG 

CCAAACGACACMTGAGCGTTCTCAGAAAGGAAACTTCTTCGGAATGAAAAGCTTTGGCC 

ACA7TGGAAAGGGTAGAAGTCTGAGAGAAACTTTTTCATCAGGGAGACTAGGTCGGTGTC 

TTAGTCCTTTCAGGCTGCTGTAACAAAGTACCTGC 

Sequence 3547 

CGCGGTGGCGGCCGCCCGGGCAGGTACTTTACTCACCCTTCCTCTGGCAGAAAAGGATGA 

AGTCAAGGGCCTGGTANAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTGA 

CCAGCMGCATTTTTTGCAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCACC 

CAGTGGTGAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAGACTTG 

GTGACTTAAACCACAGATATTTATCTTCTCACAATCCTGGAGGCTGGAAGTCTGCAATCA 

CGGTGCCAGCATGGTCAGGTTCTGGTGAGGGCCTCTTTCCTTCTCACTGTGTGCTCCTTC 

TTGTGCATGGAGAGAGAGAGCGTGAACAAGCCCTCTACTGTCCCTCTTAGAAGGGCACTA 

ATCCCATAATAAGGCATCCACCCTCCTGACCTNGCTACCCTAGTTAAAGTCCTC 

Sequence 3548 

TAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAACT 

TGTGTGCTTAGACCCGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCT 

ATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTAT 

ATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAAC 

ATTAAAAMTGACAAGGAAGTTAATGGMGTC^GCAATGTGATGGTGTTTGGAGGTGGAG 

CCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAAGAGCTTGCCACCTTCTTC 

CTGCCATGTGAGGAGCCCAAAAGCCNGCTGTCTGCAACCTGCAAGAAGACCCTCACTAGA 

AGCTAGCCTTCTGGCATCCTCATCTTGGCTTTTCAACTTTCAAAACTGTGAAAAGTATAT 

GTTTGTGGGTTAATC N ATGG N CTATGGTAATTTTTTTATT ACC AGTCCC AG NCC AAN ACA 

GNGCCTCANTTACTACTTCCATTTATTTTATTATATAGGCTCCTTCAAAAACCCATGTTC 

AMTTANGAAGATTAGATCCTGGMCCCATACCNCCTTCCTAGTTTTAATNTA CAMT TT 

GAAAMTAGTTGGTATTNACTATCTTATCCANAAAGCCAAATCTGTGGTTCTAATTTTAT 

TTATANTGCCAAATGAAAAACTTACCCCTTCCTNATTTTACTTGCCCAAAAAAAAAA^ 

Sequence 3549 

CCGGGCAGGTACGCGGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGGACCCTATACG 

GCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTC 

AGAATACACATTTCCCACCTTGCCCMCAGTAGAAAAACATAAGAAGAGAAAAACATTAA 

AAAATGACAAGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTC 

AGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCC 

A'rGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTA 

GCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGTATATGTTT 

GGGGTTTAATCMTGGTCTATGGTMTTTTTTTATAGCAGTCCCACCAANACAGTGCCTT 

ArTTACTCATACCATTTATATTATTATATAGGCTNCTTTCAGAAACCCATGTTCAAATNA 

GAAATAGATNCTGAACCATTACANCTTCCTTGTTTTTAATTTCA MTTTGG GAAATAG^ 

GGTNTTACTTTCTCTTCCAAAAATGCCGAATCATGGTGCTTCTAATTTTTATATATNATT 

GGCCAMTGMNAMCTTAAC ACCATTCTTAAATTTTACTTGNCCCN N G AATGG AAAGAA 

TGAAAAAAAA 

Sequence 3550 

CGGGCAGGTACGCGGGCTTTGACCAAAGAGTGCTCTGACACACTGCATTCCCTTTCCCCT 

AGGGGTAGCCCATGTTGAGGGCAAGACCTCCAGAAATCCTGCAGCACCTCCGGGTCTTGA 

CAGTTCCCCTGTAGTTGCCACGGTCTCAGTGGGTTCTACCGGGTTGTTTGCAGGAAATCA 

ACTCACGTGGAGACATTAAGGGCTGAGATTCTCTGGGCAGAATGGCANCACACAGNCNAT 

AAAGTGCCCACCTCCTCAGCTTGGGTTTGACGTGAGGGTGAATGAAACTTTGCGAACTGG 

CCTCCTGATGCTCTGCTCCCAAGGACTCCTCAAATGGCCACCAGCAGCAGTTCCTGGGCT 

TGGGGGTGCAGAGGAGCTCACCACCATTTTGGCAGTCAGCAATCTGGGGCAGGTGGAAGG 
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GGAGCAMGAAACACCCAACCTACTCTTTTCATGGGATTACAAAGTTCCTCAAGGGGCAT 

TAACTTTCAAACTCTTGCTGCCTTC l"l I I I It I GCGCCCCAGATTCATTCCATGGGTTCT 

NTGACANGTTCGGAMCTCTTNCTTTAGTATTCTACTTTTGGNCATGATCATTTACCTGT 

AACTTTN GGTTTTATTTTCTTAAAAAAAAN CAGTANCTNATTGGCTGG AGTC N ANCATTT 

TTGGMGCCAAAAAAAAAAAAAAAAAAAAMCCNNCCCCNTTTTAAAANANGNGNACCCC 

CGGGGGGGGGGAATTTNNNTAANNTTTTNANCCCCCCCCCC 

Sequence 3551 

CGACTACTATAGGGCNAATTGGAGCTCCC.CGCGGTGGCGGCCGCCCGGGCAGGTACGCAC 

AAGCAGGGCCCAGCTTGGGAGACGCTAGGATAACTAAGACCTGGTCAGCGCCCTGAGGAG 

TCTTGTCTGGATAAAGGGAGACACACACTAGCTTGGACTTATGCGTGCAGCGGAGCTGTG 

TAGAGCAGGAGTGCGTGGAGAGGGCAGCAATGAACTTTGGCTGGAGAGCCAGAGAAAGCT 

TCGTGAGCTGGAACACGGGGTAAAGTAGGAGCTTTCCAGGGCGGAGGGGACTTCTGAAGT 

GGAGGAAACTGCCTGTTCAGGACATGGAGGTAGGGATCAAAGTTCTGAGTCATGTTGGGT 

GGGCACAGAGCAGANGATGGCTGGGGCGGTCTCAGGGATAAGGAAGAAGTTTGGGG 

Sequence 3552 

AATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCACAACACTCACCAGGACTGGAGTGC 

CACCTATGACCTCATTGCGTAATGGGATGGCTTGTCCTCAAATGGGCCCGGTCTTGGACG 

AGCCTGAGGATGTCTACAAGTGGAAGAAAAGAACCACTGGAGCAGAAGGTGGGGAGGATA 

AAATTTGGAGCAAGATTCTCAAGGAAGCAACAAGATCCTAAGGAAAGGAGAAAGAGACCA 

CCTACTGCATTTCTGTCACTCGCTGAAAAGGACACTCTGTCAGAAAATCTTCTAGCAAAC 

TTCAAAGGGCAAAATCACCCCTTGTTACTGATAAAGCCCAGAGAGCTTCAGCAGCTAACA 

TTCCCTGGACAGGGCACAGCAAGGATTTGAACCTAGGTCAATCTGGCCAGAACACCCACA 

AGCTTTNCTTAACTCAGTGTGCTATCTCCCACGACTANGGTCGCTACTGCTTTATAATCA 

CCTTTGTAGCCACCAATGGATTTTGTTCATCANTATTTTTCAGGC 

Sequence 3553 

CCGGGCAGGTACGCGGGGGAGGAAGTGCAGACTGACATGGGAGAGCATGGCAGGCAAACT 

TAGCCTGTAAGGGTGGAGGTGCTGGAGAGACACTTCCTGCAGGAAGTGGCCTGTAGGCTG 

ATACTTGAAGAATGGTTAGGAGTTAGGCAATTGGGGTGGGAGGGAGGAAATGAAGCTGAG 

ACAAAGAATAGCAAGGGCCAAAACCTAGAAACAAGAGACTGTAANCTATAGCAAAGGAAG 

TGGGGTAATGCATGTTTTAGGTCCTGTGTGGGGTGTTTGTGAATACACATGCGGGGTGTG 

GAGCGTG ATGAGAG AC AAAGCTGAG AAGTGTGGTGGAAAAGG AAN AG AG AG CTG AGG C AG 

ACGATGTAAGATCTGCTGTTACTTCTTGCTGTGTGA 

Sequence 3554 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGTGCTCCTCTGAGAACCATCTG 

TTGGTCTGATGGTGCTAGATCTGAAGCTGGTGAGCCTCTTGGGGCGGCCTCAAGGGCTAA 

CAAGATCTCAAATGCCCTTCTCTCCATCATTTGTGGTATCCTGGATTATTGTGAAAATGT 

ATAGGGATTCCTGGGGAAGAGTTAAGGATTAAMTATAAAGGTTTAAATATAAATMCTT 

TATATATATATG I 1 1 I I ICTTGTAAACCAATATGTTATATAGTAGGAAGTTTAATTCCTT 

AACTTTGGTNTTTCTCCCTAGCCTTTTTATAGCCAACAAGAGCAAAAGCACACTGCAATG 

TCC AAACGAATTGCAGGCG N NTTCTGCTCCCTGAAGCCTGGGAATTAATTTATATACTCA 

GTCCCTTTTCCCTTGGCTGGC 

Sequence 3555 

AGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATGAAGTCAAGGGCCTGGTAGAGGC 

ACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTGACCAGCAMCATTTTTTGCAAG 

GCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCACCCAGTGGTGAAAAGCATCATCT 

GGGTTACCTTGTGCTGCCATAAAACACACCACAGACTTGGTGACTTAAACCACAGATATT 

TATCTTCTCACAATCCTGGAGGCTGGAAGTCTGCAATCACGGTGCCAGCATGGTCAGGTT 

CTGGTGAGGGCCTCTTTCCTTCTCACTGTGTGCTCTTTCTTGTGCATGGAGAGAGAGAGC 

ATGAACAAGCCCTCTACTGTCCCTCTTAGAAGGGCACTAATCCCATAATAAGGCATCCAC 
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CCTCCTGACCTCGGCTAACCCTANTTAAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnCGGTCCCCANCTTTTGTTCCTTTANTGAGGGTTAATTGCCC 
Sequence 3556 

CTGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGTGGCTGGCGAGC 

AGCGGCCTCCCCGCAGGGCCAGGCAGGTGGGCCANAGCTTTTGGTTTGCTGAGGTTTGTC 

AGATTTTCCAGCTCANGGCCCANCCAGCTGGCAGGAAGCAGGACAGAGGTCACTTGAATT 

CAGACCACATGTCCCTGTTAMTACATTAGCTTTTAMTCMTCTTTGTTCAAAGTCCAG 

TGAGTTGCAAGCCTAATGCTCACCTGCAGAGACAGAATTCCTGAGTGMCGAACAGAGCA 

GCTCCTCTTCCATCTCCAGAAATGACCTCCACCTTCAACCCCCGAGAATGTAAACTGTCC 

AAGCAAGAAGGGCAAAACTATGGCTTCTTCCTGCGAATTGAGAAGGACACCGAGGGCCAC 

CTGGTCCGGGTGGTT 

Sequence 3557 

AGGTACCTTGGTTTTCAGGTTATCTGACAAAGGCAGCTTTGATTGGGACA TGGA GGCATG 

GGCAGGTTCTCCTTTCTTTGGAMCACATCCATGCTGGGATGCACAGTTCTTTTCCTAAG 

GATTGTGTTTACTCCTGACTGCMCGTCTTGGCCACTGTTTGGAGCAGTTTGGTCTGAAG 

GTCATTAAGGATCTGCTTGCATCTTGTTTCATGTCCTCCGTGACAGACTGCAGACCATGT 

TAGMGAGAAGGAAGGTTTGTGGATCMGAGGGTTGACATTAGGTTTGGTGAGATGAGTT 

GCTTGGCACAGATCANCAGAACAGGACCCTGGTGATAAGGCATCTGGCTTTGCTAACCAC 

TTATCATCTTCCATTATTCTCTTTCTTMCTMNTAGACCCGAMGTGCACTTTTGAAGT 

CTGGTTTANATGGCGAGATTCCNAATCTACAATGATCNAGTACCTGCCCG 

Sequence 3558 

CCGGGCAGGTACTTTCACAGCAACATTTAGACTAGTGTTTGACCAACAACTGGGCACCAC 

GGTCMGCCMGTTGGCATATAGAATTAGCCAGCAAACTTGGTATTCACAATCTGTCATT 

ATAGTCTTGTCTATAAACCTAAGGTTTAGAGAAACGCTGGTCATTCTGAACAATTCCCTG 

TGAATGCAGAGATCAAGGAGCCTCCCCTTAACTGGGTAGGCAGTGTGAAATACCTGCAGT 

GTGCTTGACCTTACTGTTCACCCACTGGCAGCTGTTCCTACAGCTTGGCTTCAGATATAT 

MTATTACATTGGCTAACTTCAAAGAGCATTGTTTTCCAAGGATAGAGATGGTGATGGTG 

TATATTACTTTTACCAGCCCCAGACCTGTMGAGGTTCTCATCTCTTTTGTTTMNATTT 

TGTTGN1 I 1 I I 1 1 I AATGCTCAATAAAAGAAGCTTTATTCTTTCATTAAA 

Sequence 3559 

CACCCAGANTTTTGTTCCNTTTAAAGAGGGTTAATCNNCTGCGCTTGGCNNNATCATGGT 

CATAGCTGGTTTCCTGGGCAGTTTGTTCCACTCAACGCTGGGTAAATTAGTGCCAGTGTT 

TTGTGCAGGGGTTCCCCATCCTCTTCGACTTCCTCCAGAATTTGCGCCATTTCCACTTCC 

CCATGATGTACCT 

Sequence 3560 

AGGCGGCCGANGTACCAA/vTGAAOTGTGAAGACAANGCCATCCACCACTTTATAGAGGGT 

GTAAAMTAMCCANAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCAAAATGCGACTTTCTAAAAA1 GGAGCAG ATTCTGAGGCTTTGCATGTCTTGG CATTC 

CTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAG 

TCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATANAGATGTGG 

TGCCCACTANGCTACTGCTTG/\AAGGGGAGCTGAMTTCCTCCCCCAAGTTTGGTGTTCA 

AMTATTGTAATGACTGGTATTGCAAAAAGAATGGACTNANNACCTTGGCCTTNCNCTGG 

ACAGGGTTNTTTTTACCCCCTGATTTGCTTGGNTCTTGAGAAAACCCCAAGGGAGTTNTG 

GGAAAGGGACCAAAATTGGGGGGT 

Sequence 3561 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGACCTTGGATGTGGAGGATTG 

CATTCAGTCTAGTTCCTGGTTGCCGGCTGAAATAACCTGAATTCAAGCCATGAAGAAGCA 

GCAATCTGTCTTCTGGATTAAMCTGMGATCMCCTACTTTCAACTTACTAAGAAAGGT 
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ATTMGCGCCTTTCTGAGAGCTCTCAGTGGGCTTCCTAAGCACGTTCACTCTGCCTCCTT 

TAAGTCTTATATTTATGATCAAGATGAAAGGGAGGGATGTATCACTGCACAGAGATTCTA 

CAAGTGGATATATAAAGCTAA7AATGTTTTCAGTGAGCTCTTCTCTGGCAACTACTTTCC 

AAGTGCTTTCAGCATTCCACCTTGAAGCACMCGTTMTAATGCACMTTTCTTATGTTC 

AGCCACAGMTGTTCACGTGTGATTTGGTTACAACATATTGCTCTATCCAGGCCTCTAAC 

AAAACTGACTTCTTTCAAAATGGAC TTGC AG TTGC TG AATTCC AAATC AACACTTTATCT 

TAAAAACAANAAAAAGAAG 

Sequence 3562 

MTTGTTTCTCACCGCGGCGGCGGCCGAGGTACTTTGTTACANCAGCCTGAAAGGACTAA 

GACACCGACCTANTNTCCCTGATGAAAAAGTTTCTCTCAGACTTCTACCCTTTCCAATGT 

GGCCAAAGCTTTTCATTCCGAAGAAGTTTCCTTTCTGAGAACACTCATTGTGTCGTTT^ 

CTTTCCCCGTCTCTGCTTGACACATGAACCAAAACAGAGGCAGCCAAAGCAGGGAAAAAA 

MTCCTAGGATCAGAGTCCACTCTATGCCCTTTTGAGCTTCAAAAGGAGAAAGAGACAAA 

AGCCAAAAGCAATGGAGGTCAAGCTGCCCGGTACCTGCCCG 

Sequence 3563 

CCGCGGTGGCGGC CCGC CCGGGCAGGTAC I I I I I I M 1 I I 1 I I 1 1 1 1 1 1 1 1 I I I NGCNGC 

TTAAAACAACAGCCATTTTATTTAATATCTCATGATTTTGTGGTCAGGAATGTGGACAGA 

GCTCAGCTGGGAGATTGTTCCACTCCATGTAGCATCANAAGACAGGTCTCTTGGTGATAT 

TCAGCTGGAGAATMGTTAATATGCANAGTTCMGATGGTTTCACTCACATTGACACCTT 

GGCAGGGATGGCTGGAGGGCTGGGCTCANCCAGATCAGTTAACTGGCGTGCCTGCATATG 

CACTNTCCACCATGGGGGTCTCTANAATTGACTCAAATCTANAATTGGCTGGAAGACATA 

GTGGGAGAAATAAACTTAGCAGCCACATTTAGTCTACCACATGGAGTAAGANAGAAAGAA 

AGGAGTCAAAGGGGCCTAGTTT 

Sequence 3564 

CGCCCGGGCAGGTACGCGGGGGTAGATGGAAGGAAGAACTTGTGTGCTTAGACCTGACGC 

GGGGGAGGAGATGCTGCCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGC 

CAGGAACTATTTATAAACATCCTGCAGGAAAATGAGTCTATATGTCAGAATACACATTTC 

CCACCTTGCCCAACAGTNNAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAA 

GTTAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAAT 

GCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGCCATGTGAGGAGCCAA 

GAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATC 

CTCATCTTGGCT TTCCMC TTCCAGMCTGTGAGMGTATATGTTTGTGGTTTAGTCAAT 

GGTCTATGGTAATTTTTTTATAGCAGTCCCAGCCAAGACAGTGCCTCATTTACTACATAC 

CATTTATATTATTATTTAGGCTCCTTTCAAAA 

Sequence 3565 

CCGGGCAGGTACTC ACAGTTG AG ATAGGATTAAACC CC AATTTC * AAGCCCCTTACAG AG 
TTCCAC7TACCAGGTTATTTCAGCCGGCAACCAGGAACTAGACTGAATGCAATCCTCCCA 
CATCCAAGGTCCCCCGCGTACCT 
Sequence 3566 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAAA77TACCAGAGAATTACAGCAAAAGTA 

TTCTGACTCTTAATGTCTTTCTCACTGGTGGGAAGCCATTAGG3ATGGAAGCACCTGACC 

ATGGAGAGCTGTGTTCTATTTGCCATAGGTGAGAATAAGCATGTCtGCATGGCCACACAT 

GTATATAATTTGTCATATACACTCCGTATAGACCAGAAATTTTCATAGAAAATCAGCGGA 

AMCCTAGTTAACMTTCACACTGTGATTCAG 

GTTCTCAGGTGATGCTGATGCTGGTGGTCCTTGGACGTTGCTCTGGGTAGCAAGAGGAGA 
GGCCTTTTGTGGGAAAAATGTGGAAGCAGAGCAACTGGATAGAAAGGTCAGGACAGAAAA 
GGTCAGGAGAAAACATCATGGGTCCTCTTTAATTTTGAGGATGTTTTT 
Sequence 3567 

CCGGGCAGGTACTTGTGACAGGCAGACGTGATTGCAGCCACGAACACGATGAACTCACTG 
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AAGTCCACCTGGGCATCTCCATTGACGTCCAGGTCCTTGAGCAATTTATCCACGGCATCC 

TTGTCTTTTCACTCTCAGGGACACAGCTCCTGGGGAGGGCGGCAGAGACCCTGACAGTCT 

CGCCTCGCAGGCGTCTTTCTCCAGGCTAGAAACCAATGACTCATCTCTCAGGCTCTCTAG 

CAMCAGCGGGGCCCACACACCACAGTTTGTGGTCGAATGTTTCCCTGAGGGCTACGGAG 

TCTGGTCTTGCCGAATACCCCCGCGTACCT 

Sequence 3568 

CGCGGTGGCGGCCGCCCGGGCAGGTACAGTGGCCCCCCGTGAAAGACAGAATTGTGGTTT 

TCCTGGTGTCACGCCCTCCCAGTGTGCAAATAAGGGCTGCTGTTTCGACGACACTCGTTC 

GTGG G GTCCCCTGGTGCTTCTATCCTAATACCATCAG ACGTCCCTCC AGAAN GAGGAGAT 

GTGAAATTTTTAN ACACTTCTTGC AG N GGTATCTTGCCTGC N ATTCCTTGACG CGNGTGC 

ACNTCCNCTAGCACGGGTTGAATTTATGTTNCCCAAGAAGTCTCGGNCTTGCCAACNCTC 

CACTCGGGMCAACCNTCAAGAMCACCGCTTTCTTGGCAAGNCCTGTNGGCCCTTCNGG 

GCTTCCACCAAACCACCAATAATTTGGMCNTGGCCTTCNTGGACCTTATTGAACTTANC 

TTCAAAAAAAAT 

Sequence 3569 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAACTCAGCCAT 

TTTCTCTCTCCTCTATGTGTGGTGACTTCAGGAAACCCAGGCCCAGAGGATATAAAGCCA 

CMTGCTTCTTTCTCCTTCAMCCACTCTTTGGATATTCCGGTTCCCACTGCTTCACTGC 

GCTTCACTGGTCAGTGCGTCACTCAAACTAGCTGAAAGAATTGTCT AACTCA TACACAGG 

GACAAAGTCCAGCAAGCTTATTTAACCGGTTACACMTTGCCTTTC 

TTCCTAAGTGTTTCCAGAGGCTTGGTTCCTCTTCACCTGAGGTAGAGATTGACCAATTAT 

TTATTCCAGTTAAMGTAGGCTTATTTCAAAAGGCAGAGAATTTGAAGCTATCGGGGAAA 

GGAACTGTGGCAGATAGTGCCTGCTGTGTGT 

Sequence 3570 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 

n 1 I III > 1 I 111 I 1 1 1 1 t ACGGCTTGGAATATCTTGAATTTAAACTAGTAATTCAGCTC 

CTCTTCAGCACTCACAATATAGTCTGTTGCCAATGCCTGATGAGACACATAGCTTATGAC 

CATTGCACTTCTCTTTTTTO 

GAGGATATCCTTCCTCATCMmANAGA^ 

ACGGNTCTTACCTTCTAATTTTCCATTTAT^ 

GGGTAAAC ACACATN CTTNTGTGAAAC AGTCTC ACAG AGACTNTN AN AATCC C AAGAACT 
TTNTTC ATAAG NCCTTTTGTTTGGATT 

Sequence 3571 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTN 1 1 II 1 1 1 I 1 1 i I II I I III I H I llAAG 

TGACAGAAAACAGGTTATACGTAACGTTAAAAATAATAAAATGGAAGACTATGCCTTCGG 

GGCTCATTTCTATAGTTTGTTGTTAGAGAAGTTTCTCTGAACGTGTGGAGCACAGGGAAA 

AMTTGGTCTATAGGTTGTTGGGTTATAGACCCTTC CTTTT ATATTCATATGTATTTATA 

TGAGAATAMCATGACCCAMTATTCTACACTTAATATTTTGCAATCAGGAAAAAATGAC 

TTGATAGAAGTATTTACCCTGCCAGGCGTGGTGGCTCACACCTGTA/vTCCCAGCACTTTG 

GGAGGCCGAACCAGGAGGATCATCTGAGGACAGGAGTTCGAGGACAGCCTGGCCAACATC 

ATGAAACCCCATCTNTACCAAAAATACAAAAAATTAGCCAGGCATGGTGGTGCACACCTG 

TMTCCCAGCTACTTAGGAAGCTGAGGCANGAAAAATTGTTTGA ACCCA GGANGCGGAGT 

GAGATCCATNATTGCACTCCACCTGGGCGATAAAGMCCMNGCTTTTCTTAAG AAAA AA 

TTAAATAAATTAAATTCNNGTCAGGCCCNGGGGGTCACT1CTGTMTNCCACACTTTTGG 

AGGCCANGGGGGCCATTCACGANGGCAANANATCGAAACATCCTGGCTAAAACGGGGAAA 

CCCCTTNTTTCTTAAAATCCAA 
Sequence 3572 

CCGCGGTGGCGGCCGGGGGATGCGAGGCTCGGAGCACCCTTGCCCGGCTGTGATTGCTGC 
CAGGCACTGTTCATCTCAGCTTTTCTGTCCCTTTGCTCCCGGCAAGCGCTTCTGCTGAAA 
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GTTCATATCTGGAGCCTGATGTCTTAACGAATAAAGGTCCCATGCTCCACCCGCAAAAAA 

AAAAAAAAAAAAAAAAGTACCTCGGCCGCGACCACGCTTCGAGCGGCCGCCCGGGCAGGT 

ACAATCAAAGTCCTTGGCCACATTGTAGAACTTTGGGGGAAGCTCGCTCCAACCGACTAC 

TATCACCTTCGCCGTTCCGGTTTTCAMTCCTGAGTCMGCCAATAAAAAACAAAACAAA 

AAATGAAACAAGAAAACAAATAAAGCCCGCC^ 

TTTTGTCMGAMGGGTGTMCGCAACTMGTCAGAGTCCACCTAGAAACATTTGCGGTG 
GACAATGGAGGGGCCTGACTCATCATACTCCTGCTTGCTGATCCACATCTGCTGGAAGGT 
GGACAGGCGAGGCCAGGATGGAGCCCCCGACCCCACNGAGTACCTTN 
Sequence 3573 

CGCCCGGGCAGGTACTGTGTTTAGAAGGGATTACCAAAGGCTGGAGCAATTTAAGCTATG 

CACTAGCCTTGCCCTCTTAGGTTGCATTCTCTTTAGGCCACTGGTTCTCAAACATTAGGG 

TGCACTGTAACCATTACAGAGCTTGTGAAAAGTGCAGAAACCTGAGCCTCACCTTATGAG 

ATTCTGATTCAGCCAGGGGGAGGACAGGAATCTGTATTTTTCATAGCAACCTCAAGTAAT 

CTTTATTCTGAGGGTCTCGGGATTGCAGGTTGAAAAACACAACCTTAATCAGCAGTAAAT 

TCTTCJCTACTCGGCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCC 

GAGGTGGGTGGATCACAAGGTCAGGAGTTCGAGACCAGCCTGACCAACATGGTGAAACCC 

CATCTCTACTAAAAATACAAAAATTAGCCGGGCGTGGTGGTGCTCGCCTGTAATCCCAGC 

TACCCAGGAGGCTGAGGCAAGAGAAATCACTTGAACCCGGG 

Sequence 3574 

CGCCCGGGCAGGTACGCGGGGGCACCCGTCCTGCGTTCCTCCTGCCTGGGGTGACCCATA 

AGGAACACCCCCCCCTTCTCAACAACCACAGAAGATAAATGCCAAGGGGACCACCAACAT 

GGGCATGGGGCTGTGATGCCAATCTTAAGGGAAAACACAAGTAAAA7TGGCTCCAGGACT 

GAGCTGGGGGCAGCTGCTCTTTCTGTCCATGACTTTGAGGCGGGATCTCGAGCGTCTGTC 

TGACGGGCCTTCTGTCTGCTGTCCTCTCGCAGGTTTCAGACATTGAATCCAGGATTGCAG 

CCCTGAGGGCCGCAGGGCTCACGGTGAAACCCTCGGGAAAGCCCCGGAGGAAAGTCAAAC 

CTCCCGATATTTCTCCTCGAGTGGCTGGGAAACTTGGCAAGAGACCAGAGGACCCAAATG 

CANACCCTTCAAAGTGAGGCCAANGCAATGGCTTGTGCCCTATCTTCTGAAGAAAAAAGT 

TCAATAATTCCCTGAAAAGTCAANGTAAAAGATGATGATTC I 1 1 I I GATCNGGAAATCAG 

TGTACCCTTGGGCCGCTCTAAAAACTAAGTGGATCCC 

Sequence 3575 

CGCGGTGGCGGCCCGCCCGGGCAGGTACTTAACTAGGGTTAGCCCGAGGTCAGGAGGGTG 

GATGCCTTATTATGGGATTAGTGCCCTTCTAAGAGGGACAGTAGAGGGCTTGTTCATGCT 

CTCTCTCCATGCACAAGAAAGAGCACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCT 

GACCATGCTGGCACCGTGATTGCAGACTTCCAGCCTCCAGGATTGTGAGAAGATAAATAT 

CTGTGGTTTAAGTCACCAAGTCTGTGGTGTGTTTTATGGCAGCACAAGGTAACCCAGATG 

ATGCTTTTCACCACTGGGTGGTCCAGCCTTTCATGGAAGCTGTCAAGCTCCTCAGCCTTG 

CAAAAAATGCTTGCTGGTCACTCTCTTTGGTCCTTTCAGATGCCTTTCTTAGTGGTGCCT 

CTACCAGGC CCTTGACTTCATCCTTTTCTGTCAG AGGAAGG GTGAGTAAAGTACCT 

Sequence 3576 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTGAAGCAAAGAGGAAAG 

CAGACCCATGGCTCAGGACACATGGGAGTCCCACCAGCTTGAAAAAAGTTTCACAGAGCT 

GGCATTTCCATCAGTGCAGACACGGCTCTGAAAACAGTAATTGTGCCTTATGGGCATGAT 

ACGCCCATTTCTACTCTGACCTCCTCCCAGGAGAAACTCTTCTGCAAATGACCACAGTAC 

CTGCCCG 

Sequence 3577 

L I I I I I 1 1 1 I 11 1 1 I I I 1 I I II I I \ GCAAATTTACACAGTTTTCTTAGGAATGCTCTTTC 
TTTTTTTTTTN N NTNGGGACTCTACATTCCTGTANCTCCTATTNTTCAAGGGCACAGGGC 
TTACTTAAACATAAMTTTTACAACTTAACTGGAAGGGTGCCCTGCTTGCAAATGGAAAA 
CTGCCACANAGCANACTTAACTGCAGCAAGCAGGTGACAGAGCAAGGAACAGGATCCAAA 
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ATTATGATTTTTTGTGTGCTACTGAAGTAAMGMTTTMGMGGTTTTACTCAGGGTCA 
TAGGCTGGGTTGGAAGACTGGGAAAAGGAAATCAGTAGTATCTATATCTAATTATGGGGA 

0 

Sequence 3578 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGAAAATGTAACCCGA 

TACAGCACAAGACATCCCCACCACTGTGGATACCAAAGATTTACCTCCAACGGCCATGCC 

AAAGCCACAGCATACATTTTCTGACTCACAGTCCCCTGCTGAGTCATCTCTGGGCCTTCT 

CTTTCACTGTCTGCACCCGCTCCTGGGGATGTTCCCAAAGACACATGTGTTCAA TCAC CC 

ATMGCAGTTTCCCATGCACTGATCTAAMGTGT CAGA AAACCATAMGGATGTGTTTTG 

CCTGTGTCTCGTTAGAACAATGAGAAMTGCCACTTTTAGAACTTGGAGGAGAAACAACC 

CCTNCTTTGTCCACAGAGCCGTAGTCCAAAAGCTGTGGGAAGTGAGTGTCCATTCAGAGT 

CCTCGTCCAGGTCAGGTCCTTCGTGCTCCCCCGTGGAGAGCACCCAGGATGTGAGCTCCC 

AGGGTNATNCCAGGAGAGCTCTGAAGTTAAGAAGAAGTGCAAGTTGCCAACTTGTCACAG 

TAAATGAACCTGAAAGGNGGTTTTCCG 

Sequence 3579 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCGGGCAGCTTGACCTCCATTGCTTTTGGCTT 

TTGTCTCTTTCTCCTTTTGAAGCTCAAAAGGGCATAGAGTGGACTCTGATCCTAGGATTT 

TTTTTTTCCCTGCTTTGGCTGCTCTGTTTGGTTCATGTGTCAAGCAGAGACGGGGAAAGC 

CAMCGACACMTGAGCGTTCTCAGAAAGGAAACTTCTTCGGMTGAAMGCTTTGGCCA 

CATTGGAAAGGGTAGMGTCTGAGAGAMCTTTTTCATCAGGGAGACTAGGTCGGTGTCT 

TAGTCCTTTCAGGCTGCTGTAACAAAGTACCT 

Sequence 3580 

CCGGGCAGGTAC I rTTTTTTTTTTTTTTTTTTTI I GGTAACAGATCTAATCTTCCACGTA 

AGCCTCTTTACTGGAGAAGAGTAGATGGGAGGGCAGGTGGGATCTATTTAGACAAATCCG 

ATATATTCAGAATGATTCAGTAGGCAAAGAATTCCTCCTGGTAACGAATCTGTGTGTCAA 

GGTCCTTACCTTGGCTCTCACTGTCCTCAGGATTATGGAGGGGAAGATGATGCAGAATTC 

GCACATGTCTTCTGGATCATGGGGGACTTTGCTGAGTGTGCCTGCCATGGTCCCGATCTC 

TGATTGATAGGAGTGACTGCTGAGATCCTGGATCTGACCACTGAGATATGTGGCCCAAGT 

GCCATTCACCCTNGCAGGGTAACATTCTGGCTTCCATGGGGCAAAAGCCCATTGG 

Sequence 3581 

CCGCGGTGGCGGCCGAGGTACGTAACAAGAAGGGGAACACTCCATTGTGGCTAGCAGCAA 

ATGGTGGACACCTCGATGTGGTTCAGTTACTGGTGCAAGCAGGTGCAGATGTGGATGCAG 

CAGATAACCGCAAGATAACTCCTTTATGGCAGCATTTAGAAAGGGTCATGTGAAGGTGGT 

GCGCTACTTAGTCAAAGAAGTCAATCAGTTTCCATCAGATTCTGAATGTATGAGATACAT 

AGCAACCATCACTGATAAGGAGATGCTGAAGAAGTGTCATCTTTGTATGGAGTCAATAGT 

ACCTGCCCGGGCGGCCGCTCGACCCGGGCAGGTACCCACAGAATGTGGCCCCAAATGATT 

CAATTACAGCTAGAAGTTGTTCTCTTCCCAAAGCACGAGTCTTTTGCATTCACAGGATAT 

TTTTTCTTTCATAMCATTCCTGCGAGAATGTAAGTGCGGATGAMGATTGTGTTCTGTC 

TTCTTCTTGTCTTTTTTCTGTTTACTCTGATTTTATAAAATG CAGT GTCATGTCTTAGG 

CAAATTATCCTCAGTTTGCCAGCAAGTGAGTTTATAMTTATTTTATTCAMTGAGAAAA 

TTTCCGCTTAGAACCAATAGNAAAGGATGGTGAGTTTCCAGCTGACCCACAATGACGTAT 

TTAACTCACAAGTGAAACTCAGCTCTGCGGGTTTAATAAGTT 

Sequence 3582 

ATTGGAGCTCCCCGCGGTGGGCGGCCGAGGTACTTTNl nTTTTTTTTTTTTI rTTTTNC 
TTTTTTAAMGATCATTTTTTAACTTTTCATTTTGMG 

TGCAAAAACAGTAGAAAAGTTCCTGTGGACTCTTCATCCAACGTCCCCAAAAGTTAGCTT 
TGAGGCTGGGCGCGGTGGCTCACGGGCACTCATCACAGCACTTTGGGAGACCGAGGTGGG 
TGGATCAGCTGAGGTCGGGAGTTTGAGATCAGGCTGGTCGACATGGTGAAACCCCGTTTT 
CACACTCATGGAGCCAGGAGGACCAATCCATGAGCACAGGGCANCTCGGAGTCCCGGGCG 
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GGCATTNTCCGGCGTCCTTCTTCTGAAGCTTCTGAAAAGTTGCTGTAGGACTCCTTTTCC 
CAGGTTGGGGTTTCTATCTGCGTTTTAGCTGCTTGNGTTGCCATGTGGATCTCCCCGCGT 
ACCTTGGCCG 
Sequence 3583 

GAGGGAAAGTAGGGCANAAACTGGATACAGTTCTGAGGCACAGCCAGACTTGCTCAGGGT 

GGNCCTGCACAGGCTGCAGCTACCTAGGAACATTCCTTGCAGACCCCGCATTGCCCTTTG 

GGGGTGCCCTGGGATCCCTGNGGTAGTCCAGCTC1TCTNCATTTCCCANCGTGGCCCTGG 

TTGGAAGAAGCAGCTGTCACAGCTGCTGTAMACAGCTGTGTTCCTACAATTGGCCCAGCA 

CCCTGGGGCACGGGAGAAGGGTGGGGACCGNTGCTGCCACTACTCAGGCTGACTGGGGCC 

TGNTCAGATTACGTATGCCCTTTGGTGGNTTAGAGATAATCCAAAATCAGGGTTAGGTTT 

GGGGAAAGAAAATCCTCCCCT 

Sequence 3584 

CCGCGGTGGCGGCCGGGCCCCGGTAGTTGTCTTCTTTCCCAGTCATGTCTTTTGTTAGGC 

TGCTGGTGACAATGATGACATCCTCTGCAATGCAAGACATCTCCTTGATGGTCAAGTAGC 

ACATCCGACGGAGTGTGGGATCATTGGACTGAAAGAGCTTGGTCATGGCAAAGAAGGCCT 

CGGTCGCTTCCGTGGTCCCCAGGTGCTCCCCCTGGTTTATGAGATAAAGAATCTTGGTGA 

GGATGTGGGCACATTTCCGAGGGTTGATGGGAGTTTCATTAAATACACGGGCCTCCTGGA 

GTACGCGGGGAGGCGAATTCTGGGGCTGCTGGATGCTTACTTGAAGACGAGGACTTTTCT 

GGTGGGCGAACGAGTGACATTGGCCGACATCACAGTTGTCTGCACCCTGTTGTGGCTCTA 

TAAGCAGGTTCTAGAGCCTTCTTTCCGCCAGGCCTTTCCCAATACCAACCGCTGGTTCCT 

CACCTGCATTAACCAGCCCCAGTTCCGGGCTGTCTTGGGCGAAAGTGAAACTGTGTGAGA 

AGATGGCCCAATTTGATGCTAAAAAGTTTGCANAGACCCAACCTAAA 

Sequence 3585 

CGGCCCGCCCGGGCAGGTACCAGTGCATTAATTTGGGCAAGGAAAGTGTCATAATTTGAT 

ACTGTATCTGTTTTCCTTCAMGTATAGAGCTTTTGGGGAAGGAAAGTATTGAACTGGGG 

GTTGGTCTGCCTACTGGGCTGACATTAACTACAATTATGGGAAATGCAAMGTTGTTTGG 

ATATGGTAGTGTGTGGTTCTCTTTTGGAA 1 Mill I CAGGTGATTTAATAATAATTTAAA 

ACTACTATAGAAACCGCAGATAAGTTTTTTTCTCTTTGAAAGATAGAGATTAATACAACT 

ACTTAAAAAATATAGTCAATAGGTTACTAAGATATTGCTTAGCGTTAAGTTTTTAACGTA 

ATTTTAATAGCTTAAGATTTTMGAGAAAATATGAAGACTTAGAAGAGTAGCATGAGGAA 

GGAAAAGATAAMGGTTTCTAAAACATGACGGAGGTTGAGATGAAGCTTCTTCATGGAGT 

AAAAAATGTATTTAAAAGAAAATTGAGAGAAAGGACTACAGAGCCCCGAATTAATCCAAT 

AGAAGGGCAATGCTTTTAGATTAAAATGAAGGTGACTTAAACAGCTTAAAAGT 

Sequence 3586 

AGCCTCCACCCGCGGTGGCCGGCCCGNCCGGGCAGGTACAAAAATCCCCAGAGCCACATA 

GACGGGGGTGGTAGTGCGAACCATAGCAATATTGGCATAGGTGCCAATCTGTAGCTCTTC 

CAGAGACTTGGACATGAACTCTGGCTTGGGGAACTCAGTGATAAACAATTTGAGGAACTT 

CAGAATGCGCTTGTGGGTGAGGATGTACATGCCGGCTTCCCTTAAGTTGATTAGATGGGA 

AACCAAACACCGTCTTTCAAGTGTGAGGCTGTGCCATGCTCCCTCAGGGAGGAGGGGCGC 

TGGGTCTGACCAGGTCTGCCTCTGCCCTCACCAGCTGGCAGACGTGAACTGGAGAACAGC 

ACCAGAAAGGCTGGTGGTTGGCAGCTGCTCACGACACAGTTGCTGACCTGCAGCANATCG 

GCTGTGGGTGAGCANGACAGTCAAGGCCCTCACAAGCACTGTTCTGGAGTTTCGTCTGAC 

TAGGTGAACCCTTNTGGCTTTTCGAGGACGCANGGTACCTCGGCGCTTTAAAACtAGTGG 

GATN 

Sequence 3587 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 
ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 
ATTTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTT 
GTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCA 
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CCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGA 

GTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTATG 

CTACTTTCACTGAGCATCCTATGAMTACACCAGTGAGAAATTCCTTGAAATTTGCAAGT 

TGTCTGGG TTC ATGTCTAAGCTTGTTCCAGCTATCCAG AATGCCC ACAAGAATTCAN CTG 

GATCTGGAAGAGGAAAGAAACTGATGGGTGTTAACTGAACCCGTTTTGATTGAAACCTAC 

ACANGGCTGATGTCATTCATTGGAMCCGCAACAMCTTGGCTATTTCCTTTGCCGTGGG 

ANTATTGGTTTTTGAAAATC I 1 1 I I IGGTACCTCGGGCCGTCTAANAACTAATGGGATCC 

CCCCGGCCTGCAGGAATTCNATATCAAAGCTTATCNATACCCGTCCAACCTTAAGGGGGG 

GGCCCGGNA 

Sequence 3588 

CCGCGGTGGCGGCCGAGGTACTTTNTTN I I I I 1 I I I I 1 1 I i CTCTAAAGAAAAAACTTAA 

TGACCTTTCAAAATGAATAGGCCCGATTTATGATAAATGATTAGTTMCAATTGMGTTT 

TTTAG TTTC TTTCTTTC TTG G G AAAC AAAAAATAC TC ACTTTTATTATTAAG C C C ATAAT 

ATTTGGTAGTAATACAAATATTTCCTAGTGGCTCCCATTCTCTAGCAGCAAGTCAAACAG 

CCTCMGAGAAMGTATTAAAATTGTGCTATTATGACATAGCAGATAATCTGGAATAAAA 

CAAATCCCTCTAGGATCATTATATCCTAAMGTTCTTTGCCCCANTCTTAGAACATGGCA 

CTTTTGAGCATGCTAAAATCATGATAAGGACATATTTCTCTTCTCATTTCACACCAATAA 

AGGAGGAAAAAAGCCTTGTCCTATAGGCACCATCAAAATAATCTACCCACCCCCAAC 

Sequence 3589 

CGCCCGGCCAGGTACGCGGGGGAGATGCTGCCACCTAGGTTACTTGTAGTGACCCTATAC 

GGTMCCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAATGCANTGAANTAG 

AAGAGACAGGGATATCCAGAAGGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTAT 

ATGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAAAAAAAAAAAAAAAAAAA 

ANGGTACCT 

Sequence 3590 

CCGCGGTGGCGGCCGCCGGGCAGGTACTTTCATCCATAAAGGCCTGCAGCTGTTTCACTG 

ATCCTTGCAGTTCATCCATCACCAACTCCATACAGTCAAAGACTTTGCTCTGGTTCTGTA 

ATATTTTCTGGTAGTCAGGTTTTGATTAAGMCTTCATTCTGAGAAGACCCAAGATATGT 

CATAGGTTCCACTTTGACCTCAGTMTTTTGGCCTCAGTTGATCCTCTGGACAATATCTC 

TTTAGCCTCCTGCTGGTAGTGAAGCAAGAGCTGATCCCAAGTCTGACGTTCTAAAGAAAA 

CTTTGTTATGTATTCCTTCATCTCAGCCACAGATGCTTCCAMGAAAMTCTGATGCTTT 

TCCATTTGAATCTTCAAAACATTTTTGTAGAAGTTCCATCAGTTTCCAAGTCCGTCTGCA 

AMTGTTTCAATTCTTCAAAMGAGAAAAATGCTTTTGGCTCTAAMCTTTCAAAG 

Sequence 3591 

NCGGCCGCCCGGGCAGGTACGCGGGATAGCAGTCCCAGCCAAGACAGTGCCTCATTTACT 

ACATACCATTTATATTATTATATAGGCTCCTTTCAGAAACCCATGTTCAAATAAGAGATA 

AGATACTGAMCACATAACACCTTCACTAGTTTTTAGTATACAAATATTGAGAAATAGTT 

TGTTATTAACTATCTCATCCAAGAMTGCACGATTCATGTTGTTTCTMTTTTTATATAT 

AATTGACAAAATGAAGAAACTTAACACCATCCTAGATTTTAGCTGCCCAAAAAAAAAAAA 

AAAAAAAAAAANGTCCT 

Sequence 3592 

GTATGACCTCAGCATTCCTAAAGTGCTGGGATAAAGTGTCTCACAAGGGCAAGTGGACCC 

TAGAGAGACAGCCATCCCTGACCTCTGGGCTCCACATCACCTGACTTGCTGCTCCACCAC 

TTTGCTTGTTTCTGGGTGCTCCATCCCAGAGACGAGTGAGTTAGCAATCACTTAGTGTAA 

TCAGCCCAGGATGGAGGGTCTGTGCTGTGGGCCCAAGCCAGGGTTCCCTGTCTGGTGATG 

AAGTAGTGGAGGGTGTGTGGTACCT 

Sequence 3593 

CGAGGTACATCATTTCCAGAGCAGGCACTGGCAGCGAGATAGGGTTGGAGGAGAAGTAGC 
GCCGGGACTTCCGGATGGCAAACTTCTCTGTGGGTAGAGATTTCCCAGCAATCTTGAGCT 
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TCAGGCCTGGACAGCTCGAAATAATTCCACTTCGTCGTCCCCGAACGGCTTGTGGTCCTC 

CTTCCCAAACATGCTGAGGTAGGCGGCCTTCATGTAMTGTAGGTGGCCTTTTTAAGTCA 

GATCATGTCAGTTCCTTCTGGAAATCTGGTTATATTCCATCACACTCAGGAGACATCTCC 

TACAATTTCCTTGACACCTGCAGCACTCCCGCGTACCTGCCCG 

Sequence 3594 

CCGCGGTGGCGGCCCGAGGTACCCCTCCCATTGTGGTAGTCCCAGCAGTTTACCA.^ACAC 

TAGTTCCCCTAGTAGAGCCAGCCTATGAGMGAAAAGCTTTGTTGGCCAAGTTCTTACAT 

TAAT GACTTCATAGTMGAAACTGTTTCAAAGTGAATTAAGGAAATCATCACAGAGAATA 

TTTTCAAACAGATGTTTCTCTTTTTAAAMGTGATAGTMGATGAACTTGTAAAAM 

CCTCTGATGTTAATTGTAGAGTTGTGGGGGACGGCCTATTTTTCTCTACCGTTGGGACTG 

GCAGGGCATTGCCCACCATATACCATAGTGGTCACCAAAAAAGAAACCTCTTCAACTTGA 

AAAGGGAAGTGAACATGAGCCATGAGMGAGATTTAAAAAGACCCCTGTGCTTGCAGNTA 

ATGCCACGTTGATTGGATTATCTGTGCTTCAMGGGCACAGAAATCTTAAAGGGATTTTT 

AGCCCCATGGTAGAATATTGGCCTTTTGGCCCTGC 

Sequence 3595 

CGAGGTACGCGGGGAGTCTAGTTCCTGGTTGCCGGCTGAAATAACCTGAATTCAAGCCAG 

GAAGAAGCAGCAATCTGTCTTCTGGATTAAMCTGMGATCMCCTACTTTCAACTTACT 

AAGAAAGGTATTAAGCGCCTTTCTGAGAGCTCTCAGTGGGCTTCCTAAGCACGTTCACTC 

TGCCTCCTTTAAGTCTTATATTTATGATCAAGATGAAAGGGAGGGATGTATCACTGCACA 

GAGA7TCTACAAGTGGATATATAAAGCTAATMTGTTTTCAGTGAGCTCTTCTCTGGCAA 

CTACTTTCCAAGTGCTTTCAGCATTCCACCTTGAAGCACAACGTTAATMTGCACMTTT 

CTTATGTTCAGCCACAGMTGTTCACGTGTGATTTGGTTACAACATATTGCTCTATCCAG 

GCCTCTMCAAAACTGACTTCTTTCAMTGGACTTGCMGTTGCTGAATTCCAAATCAAC 

ACTTTATCTTAAAAACAGAAAAAGGAGGGGAAGGGTGGAAGGGAGGGGAACAAGAAAGGG 

TACCTGCCNGGGCG 

Sequence 3596 

GTCGAGCGGCCCGCCCGGGCAGGTACCACACACCCTCCACTACTCATCACCAGACAGGGA 

ACCCTGGCTTGGGCCCACAGCACAGACCCTCCATCCTGGGCCGATTACACTAAGTGATTG 

CTAACTCACATGTCTCTGGGATGGAGCACCCAGGAGACAAGCAAAGTGTTGGAGCAGCAA 

GTCAGGTGATGTGGAGCCCAGAGGGCTGGGACAGCTATCTCTCTAGGGTCTACTTGCCCT 

TGTGAGACACTTTATCCCAGCACTTTAGGAGTGCTGAGGTCATGCCAGCCACATCTCATG 

TGCMGATTGCCCAGCAGAGATCAGGTCCAAGAGTTCCTGTTTTAAAAAAGGGGACTTGC 

TTA 

Sequence 3597 

CCGCGGTGGCGGCCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGC 

TGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGAGACAAGGAGAAGG 

CCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATG 

GTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAA 

ATTCCTTGAAATTTGCAAGTTGTCTGGGTTCATGTCTAAGCTTGTTCCAGCTATCCAGAA 

TGCCCACAAGAATTCAACTGGATCTGGAAGAGGAAAGAAACTGATGGTGTTAACTGAACC 

CATTTTGATTGAGACCTACACAGGGCTGATGTCATTCATTGGAAACCCGCAACAAACTTG 

GCTATTCCCTTGCCCGTGGGAAGT 

Sequence 3598 

GGAGCTCACCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTTCTCATTCTTTTCTCCTTT 

ATCTTCACTCTGA I I I I I CTTTTGTCATTCAACGCTTACTCCCTTCCCCATACCTCAGTC 

CTCCAGGTGACACCTGGGCGCTTTTCTGCTGAACAGCATTTCCCACCAACGGCCACAGCA 

GTCGGTCAGTGTCCTCTTGAGCACCTCCTGAAGCTGCAGTGTCATTTGCTGTTTGCCCTA 

AAAAGCCACGCCCAGATTTCTTCTACCGCTGCTTCCCTGATGGGCTAGCCCATTCTGACT 

TGATGTGTACCT 
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Sequence 3599 

CCGGGCAGGTACTCAGTGGATGACGAGTGCTTGGTGAAATTGTTGAAAGGCCTGTGTCTG 

AAATACCTGGGCCGTGTCCAGGAGGCCGAGGAGAATTTTAGGAGCATCTCTGCCAATGAA 

AAGAAGATTAAATATGACCACTACTTGATCCCAAATGCCCTGCTGGAGCTGGCCCTGCTG 

CTTATGGAGCAAGACAGAAACGAAGAGGCCATCAAACTITTGGAATCTGCCAAGCAAAAC 

TACAAGAATTACTCCATGGAGTCAAGGACACACTTTCGAATCCAGGCAGCCACACTCCAA 

GCCAAGTCTTCCCTAGAGAACAGCAGCAGATCCATGGTCTCATCAGTGTCCTTGTAG 

Sequence 3600 

CGGCCGCCCGGGCAGGTACTGAAGTCCTCAGCCTAGTGCCTCCCGATGCGTTATTTTCAC 

ACAAGCGATCGGGTTAACGCCACAGTTCCACATGGCTGGGGTGGCCTCACAATCATGGCG 

GAAGGTGAAGGAGGAGCAGAGGCAGGTCTTACATGGTGGCAGGCAAGAGAGCCTGTGCAG 

GGAAATTCCTTCCCAGGCCACAGAAATGGAAAAGCCAGACACGCTGAAAATGAACTCAGT 

CCTTCAGTTCTGGAGGCTGGAAGTCTGGGGTCAAGGTTGTGGCCAATCCTGTGTCTGGCA 

AGGCTGCTTCCTGGTTTGCAGACACCATTTTTCCTCCCTGTCCTTCANCCCACAGCAGT 

Sequence 3601 

AGGTAC I I 1 1 1 i 1 1 1 I I 1 11 1 11 I 1 I II ill I GGTAACAGATCTAATCTTCCACGTAAGC 

CTCTTTACTGGANAAGAGTAGATGGGAGGGCAGGTGGGATCTATTTAGACAAATCCGATA 

TATTCAGAATGATTCAGTAGGCAAAGAATTCCTCCTGGTAACNAATCTGTGTGTCAAGGT 

CCTTACCTTGGCTCTCACTGTCCTCAGGATTATGGAGGGGAAGATGATGCAGAATTCGCA 

CATGTCTTCTGGATCATGGGGGACTTTGCTGAGTGTGCCTGCCATGGTCCGATCTCTGAT 

TGATAGGAGTGACTGCTGAGATC 

Sequence 3602 

AGGTAC I rTTTTTTTTTTTI I GTCTGGAGAGAGAGAGATGTTTTATTTTTGCCAAAGCAT 

GTGAGAGTAAGTAGCAAACATCATGACACTTCACCCCTCAAATATTTTCCCACGTATCAC 

TTAAGAACAAGGGCATTCATTCTGTTACATGACCACAATACAATCAGCACAGATAGATAT 

TAACATTGAGACAATGCTCTGATGTATTGGGTATCTACTATGCAGTTCATAATCAAATAA 

GGGTTTTGGTAACTTGGGATCCATAAATGGTCTGCAAGGTCAGCCCCTGTAGACCTGCAC 

TTAGTGGGTGCCATTTCCCT 

Sequence 3603 

CCGGGCAGGTAC I 1 I I I M I I I I I I 1 I I I 1 1 I I M H I GGGAGAGAGATGTTTTATTTTT 

GCCAMGCATGTGAGAGTMGTAGCAMCATCATGACACTTCACCCCTCAAATATTTTCC 

CACGTATCACTTAAGAACAAGGGCATTCATTCTGTTACATGACCACAATACAATCAGCAC 

AGATAGATATTAACATTGAGACAATGCTCTGATGTATTGGTATCTACTATGCAGTTCATA 

ATCAAATAAGGGTTTTGGTAACTTGGGATCCATAAATGGTCTGCAAGGTCAGCCCCTGTA 

GACCTGCACTTAGTGGGTGCCATTTCCCTGTC 

Sequence 3604 

AGGTACGCGGGGGGAAAAAACACCTGATGAGGCTGCACCCTTGGTGGAAAGAACACCTGA 

CACCGGCTGAAAGCTTGGTGGAAAAAACACCTGATGAGGCTGCATCCTTGGTGGAGGGAA 

CATCTGACAAAATTCAATGTTTGGAGAAAGCGACATCTGGAAAGTTCGAACAGTCAGCAG 

AAGAAACACCTAGGGAAATTACGAGTCCTGCAAAAGAAACATCTGAGAAATTTACGTGGC 

CAGCAAAAGGAAGACCTAGGAAGATCGCATGGGAGAAAAAAGAAGACACACCTAGGGAAA 

TTATGAGTCCCGCAAAAGAAACATCTGAGAAATTTACGTGGGCAGCAAA 

Sequence 3605 

CCGGGCAGGTACCAGATTTTCACCGCTATGCCTCCTTTCACTCTGGGAGTCTTCCAGAGG 

TCTTGCACTCGGGAGAGCATGCTCAGGTTTCCCCAGCTCTACAAAATCACCCAGAATGCC 

AAAGACTTCAACACAAGGATACTGTGTTGACCAGTGGTCATGCCACTGCCTGTTGATTTG 

TTGAAAATATTGTTTACACGTATGTTCTTGTTACTGATTGTCAGAAAGCTGGTTTTGAGA 

CTGCAGCTTGGACTAAATTCAGTCATCTGGCTGTCTGGGGAAGCATGCTGACCAGTCTGG 

TGTTCTTTGGCATCTACTCAGCCATCTGGTCCACCATTCTCATTGC 
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Sequence 3606 

AGGTACCTGAATGAGCCTGGCCTGGTGCTGTGTCTCCAGACTCGACACAGCCCAGAAAGA 
AACAGGTGAACACACTCCTGAGAGCCGAAACTGGACTCTTGCTGATGGACTCATGGCAAT 
ACTCCAGCCAAAGGCTGCTCCGAGTCCTAGAAGGGCTCGGAACTGTCCTGGGTAAGCCTG 
TC G AC AC AGTAGGGACAG AGAGGGGTTAGTGTCCTCTTTCACAGC ACTAAGCATGTTGTT 
GCC C ACATGCAACAGTTATGAC AAG AACAGCAAGTCC AATTCAG GAAATAC ATGTCTTTA 
TCAAAGAAAATAGATGTGACTGACACAAAGGCTTGCTCCAGAAGCAAAGGCGCAC 
Sequence 3607 

AGGTACGCGGGGAGACCGTGTGTTTCCAAAATGGCGGCAGCGATGGATGTGGATACCCCG 

AGCGGCACCAACAGCGGCGTGGGCAAGAAGCGCTTTGAAGTGAAAAAGTGGAATGCAGTA 

GCCCTCTGGGCCTGGGATATTGTGGTTGATAACTGTGCCATCTGCAGGAATCGAGCGGCC 

GCCCGGGCAGGTACGCGGGTGTTTGTGGTTTAGTCAATGGTCTATGGTAAI I I I I I I ATA 

GCAGTCCCAGCCAAGACAGTGCCTCATTTACTACATACCATTTATATTATTATATAGGCT 

CCTTTCAGAAAACCCATGTTCAAATTAAGAGATAAGATACTGAAAC 

Sequence 3608 

CCGGGCAGGTACTGAACACTACATGCTATGCACTGCTCCAACAI I I I I AAGTGTCCTAAT 

TTATTAAACTCTCACACATTTGAATAGGGAGGTTTCACTGATCAGGAAATAGAGGATCAG 

AGCATTTAAGTAACTAGCTCGAGGTCACAGAGCTAATAAAGGGAAGAGCTAAGATTCAAA 

CTCAGACMTCTAGCTTCAGAACTGATGCTTAATGTGTAAACATCCTTTCTTTCATGTCC 

TTCMCATTCCCATTTCCCATCTGTAACCAAACATATGG'MGGCATAAATTTCTTTTATG 

GGTCCTAACATCACCACAGAGCCCTAGATAAATCTTAAGTAGGTGCTGGAT • 

Sequence 3609 

AGGTACAACTGAAACAAAGAGGAAAGCAGACCCATGGCTCAGGACACATGGGAGTCCCAC 

CAGCTTGAAAAAAATTTCACAGAGCTGGCATTTCCATCAGTGCAGACACGGCTCTGAAAA 

CAGTAATTGCTGCCTTATGGGCATGATACGCCCATTTCTACTCTGACCTCCTCCCAGGAG 

AAACTCTTCTGCAAATGACCACAGTACCTGCCCG 

Sequence 3610 

CCGGGCAGGTACAATCTGGAGCTGGCCTTTCATCATCATCTCTGCAAGACTCATCGCCAA 

AGCATCCTGGCAAAGCTGCCACCCAAGGGGAAGCTTCGGAGCCTGTGTAGCCAGCATGTG 

GAAAGGCTGCAGATCTTCCAGCACCTCCACCCCATCGTGGTCCAAGCCGCTTTCCCTCCA 

CTCTACAAGGAGCTCTTCAGCACTGAAACCGAGTCACCTGTGGGGCTGTCCAAGTGACCT 

GGAAGAGGGACTCCTTGCCTCTCCCTATGGCCTGCTGGCCCACCTCCCTGGACCCCGTTC 

CACCCTCACCCTTTTCCTTTCCCATGAACCCTGGAAGGGTGGTCCCCACCAGC 

Sequence 3611 

COGGGCAGGTACATGACGTTGCTTACAAGTTTGATGTAGGACMTTMTCCTTTTCAAAC 

ACTTCTAATTTAGATTAAATCACTGACCAGCATTAACAAAGAAACTCTATAATAATAGGG 

7TCCCTCCTGGCACCCAAAACTGCCAGGAGAAGCTCCTGCCATCTGCAAACCTGAACTGC 

i ATAAGCAGGTTGGAAAAGGATCAAGTGAACGAATGGAAAGTACCT 

frequence 3612 

AGGTAC I 1 1 I I 1 1 I I I 1 1 I I I I I I I I I I GGAATACTCAAAAAACATATATTTGCATGTGC 

CAGGAAAAGGTAAGGAAAAGGGAGCCTTCATGTGCCAGCCCCTGTGCTGAGCCCTGGAGC 

TGTAAANATGAGTGAGGTGGA I MM I GTCTGTAAAGAGTTTGCCGATGGG ACCAAGTCT 

GTCATCTCTCTTTCCTGGCCTGACCGAGGACAAGCACTTCTTAGTAACCCTGTGAATACC 

GATGCATCCCGTCAGTCAGCAGCTTGTTTGCGGAACACCAGCTGCAGGCCAGGCACCGTG 

CTGCAAGGCGTANATATGTTGCTGAACATGACTCCCTTCCAGATGTTCAGGATTTAACAG 

G 

Sequence 3613 

AGGTACGCGGGGAAAGGGCAGCAGGCCGCCTGCCAGGCAGTGTGTGTCAGAATCCCACAA 
CTCTCTGGGCCTCCTTAGGCCACAGGGTCTGGACTCTATGAACAGTTGGTGCTAGGCATG 
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GGGAAGATAGAAATGTTCCTTGTGGCCTAGATTCAACCAGGAGGCCTCCCTGGAGGAAGT 
GTCCCCTCTGAAGGTGTGAGACTTCCTGCCTATATCTGTGAGCTGGGGCAGGGGAGAGTG 
GAGGCTCTGGACTGGCCCAATTTAACTTCATTTACCACACTTTAAGAAATCTTCCCCAGG 
CTGGGAGCAGTGGTTCACACTTGTAATCCCAGCACTTTGGAAGGTCAAGGTGGGAGGA 
Sequence 3614 

CCGGGCAGGTACTTCCTGTAATTGACAGCCTTGCCTTGTTCCTCATGGCATCATTCAAGG 

TCATCTTAAATGAGAGAGGAGGGAAAGAAAGAAAAAGAMTCATACCGTTATGGTTTTCA 

AATGCATCCAGAGGAMGGCAGGGTTGTTTTTGAAAAGTTTACTTCTATCTGAMGCTTC 

TAGCAAATGAAAGCAAACAGTGCAAAGCTGAACTACAAATAATAAAGCACAATTATGGCA 

GGAATCTGTCCACATTCACCCAAGCTACTGTGCCATTAAAGAGGAAAATAGACAGTTGAG 

CTGCAGGTTAGTCACCAACTGGGAACATGGAAGAACGTCAACAATACATACAGGAT 

Sequence 3615 

AGGT ACAG CTCCCAGGAATCCCCTTCTCAGGCCTGTTTCAGTTCAAACTGGACATATTCC 
TGGGTTTTCAGATTGAAGAAACAAGATTTTGCTACG A I I I I I GTTAACCCTC l I I I I I GC 
AGTCTTTTTGGTTATTTGCTGGTTTGTATCTGTGTTTGAATCAGTTGCAATTT^ 
TGGCATTTTAGCAGCCTATGCTATACTATGAGAAATAGATAI I I I I I AAAGGCACTTTTT 
TCAGCAGCTACCATTTTCCTAAAAGTTTTCAATCAGTAGTATTTCCATCAAAATGCTAAT 
TATTGTTGTTGTTGTTGTGATTAAAGACAGGGTCTCCCTCTGTCACCCCCGGCTGGA 
Sequence 3616 

CCGCGGTGGCGGCCGAGGTAC I I I I 1 1 I I I I II I I I I II I GAGAAGAC ATC ATTACCTGT 

ATATGCCCCCTCTTCTCAGTTCCCTCAATGGACATTGGCATTGATGAGCCCATAAGTACA 

TGATAAGCCTACMGGACAATAGTGAGTTTATATTTTTATGTMTGATTTTAGAACATGG 

GATAATATCCAGAGTGATGGAGGTAAAACTATTCACATTTCCATATTCCAAACTACTTCA 

AGAATCACTAGAGTGCAGAAGNTATAACTCTAAATGCTTGAATGGAGAAATCATCTGTCC 

CAAGAANTACACCCATNTGACTTTNCMTNAATCTATCTCAAAAGGCCTACNATTTAACA 

AC AACC AGCATACMCAAAACTTTAAAAG AACCC N AATG AMGTTTTGTATN AAGGTTG G 

TGTCTTTTTTTCTATACATTTTGMTTGACAAAAAAGCTGCTGGG 

TATTTTATTNTAAACCATTCTTATTTTAT 

Sequence 3617 

AGGTACTGTGTCACTTTATGGGGTTGGTGCTTGCCACACTTCTTACAGAAAGTCCGGCGG 
GTTTTAGGGACGTTAACCATGCTTGCGTCCCCGCGTACGCGGGGGGGGGAATTGGAGGCT 
TCTAGGAGGTGGCACGGTGCACGCCAAGATGGCTGTGTCCACAGAGGAGCTGGAGGCCAC 
GGTTCAGGAAGTCCTGGGGAGACTGAAGAGCCACCAGI I I I I NNAGTCCACATGGGACAC 
TG NT AGCCTTCATTGTTTTCCTCACCTTN ATGGGC ACCGAGCTGCTN NTGCTGCTGCTGG 
TCGMTGGCCACTGNTGTTGCTGCAGCTCCCCCGGNCCCCCANGGAAAAGCCTCAGGAAGG 
AAAGACCCAAGGGAAGTGGATAANTTGGC 
Sequence 3618 

CCGCGGTGGCGGCCGAGGTACCATTGAGTTCAACCACATCCTTACTGATTTTCTGCCTGC 

TACATCTGTCCTGTGCTGACTGAGGGATTTTAAGTCTCCTACTACAGTAGTGGATGCATC 

TC"TCACGGGTCTATCGTTTTTGGCCTCACGTGTTATGATGCTTTGTTGTTAGGTGCATG 

CACATTMGTATTGCTCTGTCTTCTTAGATMTTGACCTCmATGATTATGAAATGCAG 

ACTGTAATTTMCTGAGATGGTTAGAAAAGGCCTTTCAGGGMGTTGGCCAAACTTCTGG 

CAAAGCTGATGGGGCAAAGGTGAAGGCTGACAATTGCCTGCACCAGAAGCAGCCAAAAGG 

TCCCATCACAGTGCAGGTCATCACCACCGGAAACCTCAGCCCTGGCTTTGCATTAGCCCT 

AGAATAAGCCAATCCAAGCCTGACCCACAAATCCTGTGAGATCTAGGCCCATCCCGTCCT 

CTTGGCATGATACTTATCTTCCTTCCCTAA 

Sequence 3619 

CGAGGTACTTCCAAGACTGAGATTTGGGTGCTATAGCCI I I I I GCTTTACAGGAAAGCAA 
TGGAAAATAGAACAATTTCCTTCAGCGCTGGTTGTAGATCCCGCGTACATCCAGGAGCTA 
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TGGAGAAAGAAGCAGTCTGATGTCATGCGCTTTCTTCTGAGGGTCCGCTGCTGGCAGTAC 

CTGCCCG 

Sequence 3620 

CGAGGTACACACAAAGACAAACCTGAACTTAATTTCAAGGAAAACTTAAACCCATGCACA 

AATMTTGGTGAGCCTTCATTTCCCTGACTTCAAGTTTCCATGTGAGGACTCATGCTCTC 

TCCACTTTCTTCTTGGGAGGAGGGAAGATTTACCTAATGGGTAAATTTGGGCAAAGCACA 

TTGAGTGTGCTTGTTTGGCTCTGAGTCTCTTTGCAAACATGTGTCTGCCCACAGTGACAT 

GAGTTTGCGTTGACTGTCATGTCTGCAGGAAGCTGCCTGCTCCTGTGGCCATGTCAAGCA 

ATTCTTTCTTTCMCTGCAACTGTGTGTAAGAGCTTAGTCTGAGAAGAAATGTTCAGAAG 

CTCACTGTGGCTGCACATCTGAGCCATGTCTTCCCATTAGTTGTCATGAGTCAGCAATAA 

AGCGGGTATGTTGATGTCTATCAATCTAATTCCTATGTTCTGAACTCAAGGGAAAGA 

Sequence 3621 

CGAGGTACGCGGGGACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCCAGG 

AACTATTTATAAACATCCTGCAGGAAAATGTGAAGTAGAAGAGACAGGGATATCCCAGAA 

GGTTATGCAAAACATCAAGAGAAGATGAGAGGAGTCTATATGTCAGAATACACATTTCCC 

ACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACATTAAAAAATGACAAGGAAGT 

TAATGGAAGTCAGCAATGTGATGGTGTTTGGAGGTGGAGCCTTCAGAAGGTAATTAATGC 

CCTTGTAAGAAGAGACCAGAGAGCTTGCGCACCTTCTTCCTGCCATATGAGGAGCCAAGA 

AGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCTAGCCATACTGGCATCCT 

CATCTTGGCTTTCCAACTTCCAGMCTGTGAGAAGTATATGTTTGTGGTTTAGTCAATGG 

TCTATGGTAA I I I I T I I ATAG CAGTCCC AGCCC AAGAC AGTGCCTC ATTTACTAC ATACC 

ATTTATATTATTATATAGGCTCCTTTAGAMCCCATGTTCAAATTAGAGATAAGGATCTG 

AAGCACATAACACCTTCCTTGTTTTTAGGTATAC AAAAATTTGAG AAAAAN GTTNG NTAT 

TAACTATCTTATTCCANGAAATGCAGANTTCATGGTGGTTCTAAI Nil I ATATATTAAT 

TGGNCAAAATGAAGGAAACTTNACACCCATTCTTAAATTTANCTTGCCCAAAGAATTGGA 

AAGGATTGAAAAAAAATNT 

Sequence 3622 

GCTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCATAA 

ATGTTGTCTCTTMGAGMTGTCAAGTTTCAGTAAATTTACTTTGAACTTCAAAGTAGTG 

TGCAATAACTTTGATAAATAAGCCGATCTTATGAAATTTGAAAGTTACGTTTATAATTAT 

ATTATAAAATAAMTGTTGAATTTTCTGTTTTMCTTCCCCGATGTCCCATATTGATT^ 

ATTACATTCACAGATTTTTCTTTTCAAAAGTAAGCTTCA 

TATTTCTTGCAATCTATTTAATTGAGACTTTTCTGCTGTTTATCCTAAAATGCAGATA^ 

TGGCTTTCAGCACACTGTTTATAGTATAATAGCTCCTGAAGTGAGTCAGCATTATCAGCT 

AGAGTCTCCATGTCATAACIAATGGTCTCTCTGCTTATACTTTCCTGCTAGTGAGGAGCC 

ACTGAACTCCACAGAGACAGGAGCCTGGCATAGTTGGGGGGTG 

Sequence 3623 

CGAGGTACTTTGTTACAGCAGCCTGAAAGGACTMGACACCGACCTAGTCTCCCTGATGA 

AAMGTTTCTCTCAGACTTCTACCCTTTCCAATGTGGCCAMGCTTTTCATTCCGAAGAA 

GTTTCCTTTCTGAGAACACTCATrGTGTCGNTTGGCTTTCCCCGTCTCTGCTTGACGCAT 

GMCCAAAACAGAGGCAGCCAAAGCAGGGAAAAAAAMTCCTAGGATCAGAGTCCACTCT 

ATGCCCTTTTGAGCTTCAAAAGGAGAAAGAGACAAAAGCCAAAAGCAATGGAGGTCAAGC 

TGCCCGGTACCTGCCCG 

Sequence 3624 

CGAGGTACGCGGGGGGCCCGCTTTCTCTTACTGATAGTAGGATATTTCTGCTTTAGTTAT 

TGTCACCTTAAATATATTTTCAATGTTGAAATCCTCACAGCATGCTTGATGAAATCTAGT 

TTTCAAATTTTCTTAGGTATATTTCTGTCACGTTGGCATGATAACAAATGCAATAACCCA 

AMGACCCCAAAAGCTAGTGTAATCCCTTTTGCAATCCAAGCATGAGGATTCATCTTCAT 

GTTGACAGTGCGTGAATGTTCGGTAGGCTTTGTCAAGCTTGCATACAATAAATTATATAT 
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GTCCCTTTTCTTTTAGGGTCTCCTGTTAAAGATGGTCTTCTGAAGGCTAACTGCGGMTG 
AAAGTTTCTATTCCMCTAAAGCCTTAGMTTGATGGACATGCAAACTTTNAAAGCAGAG 
CCTCCCGAGAAGCCATCTGNCTTTCGAGCCTGNCATTGAAATG 
Sequence 3625 

CGCCCGGGCAGGTACGCGGGGGAGGAGGTGGATTTGAGAGATACCTCCCCTCCTTCTGCT 

CAGCT GCCTT GCAGTMTTAMCTCTTTCTCTGCTGCAACACCCCTACTGTTCTCCGTGT 

ATTGGCTTTTCTGGGCAGCAGAAAAAAGCTTGGCCAGACCACAAGCGGNAATGCAAATGC 

CTTAAMGCTGCAAACCCAGATATCCTCCAGACTCCGTTCGACTTCTTGGCAGAGTTGTC 

TTCAMCTTATGGATGGAGCACCTTCAGMTCAGAGMGCTTTACTCATTTTATGATCTG 

GA GTCAM TATTMCAAACTGACTGAATATAAAAAAGAGGGCCTCAGGCAACTCGTAATG 

ACATTTT NMCA TTTCATGAGAGAAGAAATTNCAGGATGCCTCTCAGCTTGCCACCTGCC 

TTTGACCTTTTTGAAGCCTTTTCAAAAGTGATCTGCA 

Sequence 3626 

CCGCGGTGGCG GCCGA GGTACGTGGCGTAGCATTCCATGTTCAGCTTTGACATTTATTTT 

CTCATATCAACCCI i I 1 IACACGTGAAACACAATCTCGCTCTTGAAGTCTTAACTGCATG 

ATCTGTGAMCCTGTATTTATATTTTCTTCAGTGTATTCTTGTCGTGTGTGTCCTAAACA 

MCCAAAAGAAMCTTTCCAAATCTAAAGTATTCATTCTCCAATTGGAGCAAGAGGAGTC 

AGTTAGATA CTATC A CGGCA TTCATTTGTGGCTGGCTTGTCATATTTACTTATGATTGAT 

AATAAATCTATTTTGCTTTTTAGAGCCTCCCGAGAAGCCATCTGCCTTCGAGCCTGCCAT 

TGAMTGCAAMGTCTGTTCCAAATAAAGCCTTGGAATTGAAGAATGAACAAACATTGAG 

AGCAGGTAMTTTTTTAATGGAACTATGCAMGACCMTAT^ 

TGATGGTCCTTCTATCCCCAATGCTTTATTTrTTTCMCTTTTG 

TAGGGTAATGCCAATACTGGTATTGNTGTTTGGAAAAGCTGGTATTACAAGCACAGTAAT 
TTTTCAATATC I M ill 
Sequence 3627 

TACTTAGGGCGAATTGGA GCTCC CCGCGGTGGCGGCCGAGGTCGCGGGGAGAGCTTTTGG 

TTTGCTGAGGTTTGTCAGATTTTCCAGCTCAGGGCCCAGCCAGCTGGCAGGAAGCAGGAC 

AGAGGTCACTTGAATTCAGACCACATGTCCCTGTTAAATACATTAGCTTTTAAATCAATC 

TTTGTTCAAAGTCCAGTGAGTTGCAAGCCTAATGCTCACCTGCAGAGACAGAATTCCTGA 

GTGAACGAACAGAGCAGCTCCTCTTCATCTCCAGAAATGACCTCCACCTTCAACCCCCGA 

GAATGTAMCTGTCCMGCAAGAAGGGCAAAACTATGGCTTCTTCCTGCGAATTGAGAAG 

GACACCGAGGGCCACCTGGTCCCGGGTGGGTTGAGAAGTGTAGCCCAGCAGAGAAGGCTG 

GCCTTCAAGATGGAGACAGAGTTCTTAGGATCAATGGTGTCTTTGTGGACAAAGAAGAAC 

ATATGCAGGTTGTGGATCTGGTCAGAy\AGAGTGGGAATTCAGTGACTTTACTAGTTCTGG 

ATG 

Sequence 3628 

ACCGTAAGACGTGTTTTGCGACCCGGCACCACCGTAGTTTTACNGCCCGGTGAACTCTGT 
GGAGNGTAGCAAGTTNTTTTAGAAACAAGGAAATGGCTGTGCGTGCCCTCGTGGCATGTC 
ACGTGGCTTGNCACTGCCGGCTGATTGGCT 
Sequence 3629 

ACCGCGGTGGCGGCCCGAGGACT I i II 1 1 1 i I I 1 I I I || It 1 1 H 1 CAATTGANACAGAG 

TCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCATGATGAATGAGTGAATGACTTGAGT 

CAGTGAACGTGAAMTGTTTGGTANATTGTAATCGCCGATATTGTGTAATACGTAAAATA 

CTATTAGG AACTACAGC AATGTCTTATATTCCC AGCTAGGCCATG CACAG GAGGCAGTGT 

GCCACAGCACACTGAAAAGGTGGCCAGCCCTCACTCAGAGCCTGCCTTCACCACCTGCCC 

G 

Sequence 3630 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCGCGGGGGGAAAGCTGATATT 
GCAATTGCTCCATTAACTATTACCCTTGTGAGAGAAGAGGTGATTGACTTCTCAAAGCCC 
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TTCATGAGCCTCGGGATATCTATCATGATCAAGAACCTCAGAAGTCCAACCAGATGTTTT 
CCTTCTTGATCCTTAANCCTATGAGAATCTGGATGTGNCATGG1 I I I 1 GCCTACATTGGG 
GCAGGGTAGTTTAATTCCTGGTCAGAAAATTAANCCCTTACAAGTGGCACACTGANGGAG 
TTGGAAGAATGGAAGMGAACCACAAAGTAGTGGAATCAACTMTGAATTNGGATTTTAA 
TAG 

Sequence 3631 

CCGCGGTGGCGGCCCGAGGTACAGGTGTC illl rGATTGAATGATTTATTTTCCTTTGGG 

TAGATACCCAGTAGTGAGATTGCTAGGTTGATTGTTCATTCTATTTTTAATTCTTTGAGA 

AATCTCTATTCTGTTTTCCATAGGGGTTGAACTAATTTGCATTCCCACCAACAGAGTATA 

AGTGTTCCCTTTTCTCTGCATCCTTGCCMCGTCTATTATTTTATGACTTTCATAATAA^ 

AGCCATTCTGACTGGTATGAGATGGTATCTCATTGTAGTTTTMTTTGCATTTCTCTGAT 

GACTAGTGATGTGGAGC I Mill I AMGTTTGCTGGGCACTTTTGTGTCTTCCCAAAGCA 

TTTTTAAATCTCTAGAATGATCTCACGTAATTGCAGTGCTATATTATTTGGAGTAGAAAT 

TTCTAAAGGCGAATMTGCTCCTTGGACTTAAAGAATGTTTGCTCTAAATTAAGACAGTA 

TATAAAACCCGAAAATGCTTTCCTGTTCAAAAGCATGAAACACTATAATCTCCATGCACC 

ATGMGATATCTTTACACATTCTTCTCGGGGGCTCTGGTTTTTGNTCTGATGTTTGCCCA 

TTACAGGNGAACACCTGGGTAACAACAATTCCTCAACGCTATAAANGGTAAGCCACACGC 

AGTTGCTTGGCATAGGAAAG NTTAAAAAAGATCTG NTG GGAAAATTCCC CATCTTCNCAG 

TTCTGNCCTCCCTGCNAANATTNAATTTCTTTNAAGGATCC 

Sequence 3632 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCTGTAGGCOCGGGTGGTTGGAAT 
TGACCGCTACCCCCGCAAAGTGACAGCTGCCATGGGCAAGAAGAAGATCGCCAAGAGATC 
AAAGATAAAATCTTTTGTGAAAGTGTATAACTACAATCACCTAATGCCCNAAGGTACCTG 
CCCG 

Sequence 3633 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I rTTTTTTTTTTTTTTTTTTTTl ICTCCGAGA 

CGGAGTCTCTCACTGTCACCTGGGCTAGAGTGCAATGGTGCTATCTCTGCTCACTGCAAC 

CTGCGCCTCCCAGGTTCAAGTGATTCTCCGGCTCAGCCTCCTGAGCAGCTGGGATTACAG 

GCGCCTGCCACCACACCTGGCTAA I Illl I GTAi I I I I AGTTAGAGACGGGGTTTCACTA 

CGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCATGATCTGCCCACCTCTGCCTCCCAAA 

GTGCTGGGATTACAGGTGTGAGCCCCTGTGCCCAGCCI 1 1 I I 1 1 ATCTTTTAAAAGTAGC 

CTGTNCCTTCTTTTAAACTAACTCTCATGTNATACAACTAAAAGAAATAAAGTTNCTAAA 

AATATCTTTTCMTCTATATAMTTTAAMGGNGGGNAGGCTTmATAm 

NAAGNAAAAATTAATAAAGCCAATTTTGAAAATTGGTTAATT 

Sequence 3634 

CNCGCGGTGGCGGCCGCCCGGGCAGGTACGCGNCGAGGCATTGAGGCAGCCAGGGCAGGG 
GCTTCTGCTGAGGGGGCAGGCGGAGCTTGAGGAAACCGCAGATAA^! MINI ICTCTTT 
GAAAGATAGAGATTMTACAACTACTTAAAAAATTAGTCAATAGGTTACTAAGATATTGC 
TTAGCCGTTAAGTTTTTAAC G TAATTTTAATAGCTTAAG ATTTTMG^.GAAAATATG AAG 
ACTTAGAANAGTAGCATGAGGAAGGAAAAGATAAAAGGTTTCTAAA/vCATGACGGAGGTT 
GAGATGAAGCTTCTTCATGGAGCAGTGAGTACCNAAGCCGCANTAAAAGTACCTN 
Sequence 3635 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGGCTCAAGGACTCCTCCAGCCTCAGCCTC 

TCGAAGTGCTGGGATCACAGGTGTGAGCCAACGCACCTGGCCACACTTTGTTMTTTTTG 

AGATGTCTGCCAMTACCCMTTACGTATTCATAAGAAATCAGTTTATCTGTCATTCTTC 

MGTAAAAATAATATTTCATGAAAMCCAMTAAAAATAAGGAGAMCATTGGCTAGTTC 

AGCTAACAACTCAAATAATCCATAAGTGCTTCTTGAGACATCTCATCATATCTCAGAATG 

CGGTGAAAACGCMTATGGGTGCTTCCTATTTCTTCACACAGAATATTAAAAAGGTGTAT 

ACTCAAGGATTGAGATGTGATTAAAGTMTTTTTATTCCTTCATCAAAGACATTCCTAAG 
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TNAAAC I rTTTTTTTTTTTTTTTTTTTTr 
Sequence 3636 

CCGGGCAGGTACmGAGCTTATTGTTTTTATTCTGTATTAMTATTT^ 

ACACTAATCACAAACTGAATGACTTGACTTCAAAAGCMCAACCTTAMGGCCGTCATTT 

CATTAGTATTCCTCATTCTGCATCCTGGCTTGAAAAACAGCTCTGTTGAATCACAGTATC 

AGTATTTTCACACGTMGCACATTCGGGCCATTTCCGTGGTTTCTCATGAGCTGTGGTTC 

ACAGACCTCAGCAGGGGCATCGCATGGGACCGGCAAGGGAGGGGCAGGATTTCCGGGACC 

CA CCTAG GGCCCTGGAAi ^ 
CATTTTTCTTNMGM 

TGGTGTTATTTTCTTAMGGAAATTCAAAATTTGGTAAATAA 
Sequence 3637 

CCGCGGTGGCGGCCGAGGTACGCGGGTATCTCGATAGTGAGGAGGATGAGGAGTAGCCAG 
CAGC TCCCAGMCCTCTTCTCCCT TCTTGGCCTAACTCTTCCAGTTAGGATCTAGAACTT 
TGCCI I 11 I I I I I I I I I i U | | | | | IGAGATGGGTTCTCACTATATTGTCCAGGCTAGAG 
TGCAGTGGCTATTCACAGATGCGAACATAGTACCTGCCCG 
Sequence 3638 

CCGGGCAGGTACAACAGAAGGCACTGTTATCCGCATAAATCCATTGATTCTGCATGGCAA 

AAACCTGAGACGTGCAAGTCTATCCMTCCTCTAAAACMCCTTGGGCATTCAGCCTTTG 

GAACATATGGGTTGTCCTTTTCCGTCCMTGATTGTTTT^ 

TAGCTGTTTAAMTATATATTTCAGGGTGACAAAGGTCAAAGGGGAAAGAGTAGATAAAA 

GTGCTGCCTAATGCCAGATGGTAACTGGCACAGCAGGAGGGCCCTAAAAGCAGCCCCTCA 

TTTCATCCCCTTGACCCTCACACTCCTCTTGTGAAATAAAATGCAGCGATGACATTTGGT 

CAGCTGATGTTCTTTCCTTCAGTTGTTGCTGGGCACAGTCC 

Sequence 3639 

ACTATAGGGCGAATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACCCGTATGTGTGATAGG 

GCAGGCAGAGGGCTCCACCTGCTCTGCCCTTGGCAAAGGCCCTTGGCTACACTGGTGAAA 

ATGCCAGCTCGGCTTCTGCACGGATACACTATGTTGTGCCATCCAACTGGAGTGCCGGTT 

GGACCCCAMGGCAGGGCAGGGCAGGGGGAGGGGGTATTTTGAMTGTTTTCAGGAGCTC 

TGTCCAGCTGTCTCTGTCTTCCAGCCCGGAACTGCTGCACAAAGCTACAAGGACACTGAT 

GAGACCATGGATCTGCTGCTGTTCTCTAGGGAAGACTTGGCTTTGGANTGTGGCTGCCTG 

GATTCTAAAGTGTTGTCCTTTGACTCCATGGMGTAATTTCTTNNTAGTTTTGCTTTGGC 
AGATTTCCAAAAAANTT 

Sequence 3640 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTCCAACTGGATGCT 

GCCCTGGTGGCTGAAGGCACACTTCATGATGCTGTCCAGGGTCATCAGGGAGACATGTTG 

AAAGAGCTCCAGACGTGAGTTTTGGGC^TGTGTTCCTCCCATTTGTTCGGCATCATCCG 

AACACTCTCAGACATC ATGGTG ATGMTATTTTCAGAATGCTGATGTTGMGCCAGGTTT 

CACMTCTGGCGG TGCTT TTTCCATTTANAACCATCCAGGGTCACAAGTCCTCGACCAAC 

CCAGGATTCAAGGATTTTGTGGCTMCAGCACTTTTGGGATCTTGTCTT77CAGGGAGAA 

TCTTGGCATAAGTCTGGGGTCATGGACACTGAAGAACATCNTAAAAGGGTC 
Sequence 3641 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCACATGC 

TGGTCAGAGTGGCCATTTGTCCCAAGTCAAAACATAACATGCTACAGTGTGGCAGCAGAG 

AAAAGCAAACATGGCTATTCTCTTGGAGGTAATGCAAACTAGA.TCCTAAACTGTGGAAAC 

CAGTCTGMGATTTCTGGAAAAAAAMGGATGGAAGGCAGAGCTATGCTTTGACCCAGCA 

ATCCTACCCCATGCATCTACCCCAGGGAAAAACACATGCGTCTATCAAAAAGACACATGA 

CACTCACACGTCCACGGGAGTGTTACTGACAGGAATAAAGATGTGGAACTGATCTAGATG 

CCCACCGAACAGAAAACTGGATTTTTAAAAATGGTGGTTCCTATACATCACAGAATGCTA 
AAAAGCCACTTAAGGGAA 
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Sequence 3642 

CTACTATAGGGC GANTTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTTT 
u u M t u rTTTGGAGAATCAAAACTTTTCCGGCTTTATTCTTTGGGAAAACCCCTGGT 

CTGTTTTCATTCCTATTGGTCCAGGCCACCATTCTATGATATGAAGGCCTAAATTAGGAA 
AGCTAGGTGAG 

Sequence 3643 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTAGTAGAGA 
CGGGGTTTCACCGTGTTAGCCAGGATGGTCTTGATCTCCTGACCTTATGATCCGCCTCGG 
CCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCATGCCCGGCTGCTTAAGTTTCTTA 
■ AATACAGAMGGCTATGCTGATACAGTGTTTTTTGCCCTTTAATTGm 
TTTAAAAAGAAACCTAGGGCTTANATAGTCCTATCTTCTATTTGGATGTCAGCAAGGTTA 
AAAAGTGCAATGTCAGGAGTATGTAGCTATAGGTGGCCTGTGGCATTTGCTTACAGAAAC 
NGGCAGAAGGATCCGCAGGCCCAAGAGGCCTTGTCTAAGAACCACACCACCATTACAAAA 
GCTGTGAAAAGGAGGC 
Sequence 3644 

CCGGGCAGGTACGCGGGGGTAATGTTTCACGAGAGGTCTTGGATCACATTAAACACCTCT 

CAGTGGCAATGAGGACCGACTTTGTCATGAGTTACAAGAAMNAGCTTTCMTGMTTTT 

GAACCATCACACTTTAAACATTTGGCACAGGTGAGGGGTCAGAGGCACAGAAACTTTGGT 

GTGTCTGTATCAANACACACAAGGGTGGNGACTGTATCAGGTTGCTGGAATGTCCAATTA 

TGANTACCTNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNATGATTTTT 

NANTAATNGNTTCNGTGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnrnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAC 

AAACACNTTTCTACTTATGGNCAATATNTCNCCNGGGNNCTTTNNGTGGACNCCNTTGNG 

GGAGCCTTAATGMGNGAACTTACTTGACATTTNATTTGCGATCGCNGCTTCANATGCCN 

GCCTTTNCCNGGCCGGNNAAAACCTTGTGGTGCCNNGCCTGCCNTTNATGGAAATGGGTC 

CCAAACGCNTCCCGGGGAAAAAAGGCCGGTTTTTGCAATAATTTGNGAAGCNTTTT^ 

GGCTTTCCTTTGGGTTNAANTTGAACTNCCCTTTGAGNACCTTTGGNAAACCCCCNAAAA 

NAAAGTGGTGGAAGGGNGNAACTTGGGCTTTANTTNTTGGGAAAGNCC I 111 I 1 1 1 AACC 

NCCCCCTTGGATTTAAGGGGTTATNGGGGGTTAAAATGGGTTACCNAMNAAAATTATTT 

Mllll! 

Sequence 3645 

CCGGCGAATTGGATAANTACNANGGTNNCNGCCGCCCGGGCAGGTAC 1 iTTTTTTI 1 NAC 
AGACC N NCTGG NTTTN AAGAAGGAN TTTATCCACTACCAGAAACCGAAAAATGTCTAGC A 
NATCCAAAAAGTGCAAAAAAGTGCATGACTCACTGGAGGACTTCCAAGAACAGCTCGTTT 
AACC 

Sequence 3646 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCAATCAGCTT 

CATTGCTGCATTGCAAAGCCCCGCAATTGCAGTTGTTTTAGCAAGGGATAGATGGAGAAN 

ATGAATGTGCTTTACAAGTCAATTACTACTGATCAGATCATCTTACCAAAGGGTGGAGGA 

AGTGGCCGAGGCCCAAAAGGTCCGCANAGCTGAGGTGGTGGNCCATATCNAATGGGTGGT 

GGTACCTCN 

Sequence 3647 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTCGMGTGGAAGGACTTTCATTTTGAGAAGAT 

ACCATTTGATCCAGCAGAAATGTCCAAATGATATCAGGTCCTCAATCTTCAGCTACAGGG 

AATGAGTAACTTTGAGTGGAGAAGAAACAAACATAGTGGGTATAATCATGGATCGCTTGT 

ACTCCTCAGCAGAGCTGGAGGACAGCAAGGCCAGGACCAGCCCCAGCATGCAGAGCGCTC 

TGGCAGCCATGACCACCGTGGGCTCCGGGACGCAGCTCAGGACTCCCCGCGTACCT 

Sequence 3648 

TGACGCGCGTAATACGACTCACTATTAGGGGCCGAAATTGGGGAAGCCTCCCACCCGCGG 
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GTGGGGCCGGGCCCGCTTCTTAAGAAAACCTAAGGTTGGGGAATCCCCCCCGGGGGCCTT 

GGCMGGGAAATTTCCGAATATTCAAAAGCCTTTAATCCGGAATACCCCGGTCCGAACCC 

NTCCGAAGGGGGGGGGGGGCCCCCCGGGTTACCCCCAAAGCC I I 1 1 I I GGTTTCCCCCTT 

TTMGTNGGAAGGGGGGTTTNAMTTTTGGCCGCCCGCCTTTGGGCCGTTAAATTCAAAT 

G G G GTTC AATTAAG G C CTTG GTTTTTCCCTTG GTG GTTG G AAAAAATTTTGGTTTT AATT 

CCCGGCTTCAACNAAATTTTCCCAACCAACCAAANCAATTTAACCCGNAAAGCCCCCCGG 

GGGGAAAGNCCMTTAAAAAAGGGTTGGTTAAAAAAAGGCCCCTTGGGGGGGGGGTTGGC 

CCCCT TTAAAAA TTGGGMGGGTTGGGMGGNCCNTAAAMCCCT1TCCAACCCAA7TTT 

TTAAAATTTTTTGGGCCCGGGTTTTTTGGGCCCGGCCCTTTCCMCCTTTGG 

CGGGCCNTTTTTTTCCCCCAAMGGTTTTCCGGGGGGGGGAAAAAAAMCCCCCNTTGGG 

GTTTCCCGGTTTGGGCCCCCCAMGGCCCNTO 

AAAMTTTCCCGGGGGGCCCCCCAAAAAACCCGGGCCCGGNCCCGGGGGGGGGGGAAAGG 
AANANGGGGNCCCCCGGGG I I I I I I I I I GGGGCCCGGGTTTAAA1 I II I GGGGGGGGGCC 
CGGGGCCCTTTCCTTTTTTTCCCCCGGGCCC I I I I I I I CCCCTTTTTCCGGGCCCTTTTC 
CAAACCNTTTGGGAAACCCTTTTCCGGGCCT 
Sequence 3649 

CCGGGCAGGTACCTGTCTCTTCTCAAACTAGATGATAAGTGGCCTCAGGGCAGGGACTAC 
ATACTCCTG AGAGCTGCTTG AGCTCGAGGTGTTTATTATC CTCAGTAAGC A 1 rTTTTTT C 
AGGAGCATGACTTGGCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTACCT 
Sequence 3650 

GTATGACCTCAGCATTCCTAAAGTGCTGGGATAAAGTGTCTCACAAGGGCAAGTGGACCC 

TAGAGAGACAGCCATCCCTGACCTCTGGGCTCCACATCACCCGACTTGCTGCTCCACCAC 

TTTGCTTGCTTCCTGGGTGCTCCATCCCAGAGACMGTGAGTTAGCAATCACTTAGTGTA 

ATCAGCCCAGGATGGAGGGTCTGTGCTGTGGGCCCAAGCCAGGGTTCCCTGTCTGGTGAT 

GAGTAGTGGAGGGTGTGTGGTACCT 

Sequence 3651 

CCGCGGTGGCGGCCGAGGTACTCAGTGGCCATGTGGAGCTAGTGGCCACCACACTTGACA 
CCACGGATGCAGA ACTTTTCCATCACTGCAGAAAGCTCTGTTGAACAGTAC I I I I I I I M 
I I I I I I I n I I I 1 1 I CCG CATTAAATCACTGNGGGTCTTAANCAATTCTTCAAAGCN AAC 
TTTGGTTTTTACTTTTGCTTCACTCTCAGCTGTAGGCATNTNTCAGCTTTATGCANAGGA 
CTGANAATGCAAAGCATTGACTCAATCAGGATCCTTTTNTCAGACATAATTAAAAAGGCT 
TTGACAGGTTGAANATTCAGCCCCAGGTTCCCGCGTACCTGCCCGGGCGGCCGCTCTANA 
ACTAGTGGGATCCCCC 
Sequence 3652 

NGGCGGCCGCCCGGGCAGGTTCAGCAGAGCCCNTTTNACCNGAGGNTTTATAACCNGNGC 

TCTTAGTTACCTGGGGNCAACTACAGTCTAAAAATANTACATGGAAAATTCCAGAAANAA 

ACAGTTCATAGGNTTACAGACGNGCGCTGNTCTGAGAAGCATGATGGAATCCCGTATCCC 

CCGGGACATGMTCATTCTTTCATCAAGCGTATCCATGCGTATACACGACCTACCCATTA 

GTCACCTAGCAGTCACCCTCAGNTATCAGATGGAGTATTGCAGTCTCACAGCGCTTGTGN 

ACMGTAACCCTTATTTGACTTAANAATGCTCCCAAAGTGCAAGACNAGCGCTGTCAAAT 

TGGATATGCCAAAAAG 

Sequence 3653 

CTTTTTANGGCGAATTGGAGCTTACCGCGGTGGCGGCCGCCCGGGCAGGTACGCNGGCTT 

GCTGATCGACTTATTGGGGTGACAACAGATATGACTCTGGATGGAATCACCTCCCCAGCT 

GMCTTTTCCACCTGGAGTCCTTGGGMTACCGGACGTGATCTTCTTTTATAGGTCCAAT 

GATGTGACCCAGTCCTGCAGTTCTGGGAGATCAACCACCATCCGCGTCAGGTGCAGTCCA 

CAGAAAACTGTCCCTGGAAGTTTGCTGCTGCCAGGAACGTGCTCAGATGGGACCTGTGAT 

GGCTGCAACTTCCACTTCCTGTGGGAAGAGCGCGGCTGCTTGCCCGCTCTGCTCAGTGGC 

TGACTACCATGCTATCGTCAGCAGCTGTGTGGCTTGGGGATCCAGAAGACTACTTAACGT 
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GTGGCGAGAA 
Sequence 3654 

TATANGGCG AATTGGAGCTTCCCGCGGTGGCGGCCGAGGTACT I 1 1 1 M 1 I I U 1 I I 1 M 

■ 'iitiu iicmniiiriitinitii i n i n ii mm imvmih imii i 

' i i i n i I i I M I I M l 1 I I H I I l I I I I M TN AAAAAAAAAAAATTTTTN NTTTNANTT 

TTTGCAAAAMNANANNGGGAANTTTNAANNAAANGGNGGGGNANCCCCCCCNTTTANNN 

GGGNN GGCCANGGCCCNAANNNNCNNGGGAAAAAAAANGGGGGNNAAAAAANGGNNNNGG 

Mill iNGGGNNNNGGGNCCNTTTTTTTGGGTTTNNNTNCCGNANAAANNANNNNINNNNC 

CAANNGGGGGGGGAGGTTAACCCNTTTTAAAAA 

Sequence 3655 

CCGCGGTGGCGGCCGAGGTACTGGTTTTGAGTATCAGCAGGTGGCCTTTCAGCCCTCCAT 

ACTTACTGAGCCTCCCACTGAGGATCATCTTCTGCAGAATACTTTGTGGCCTGAAGTTCA 

AAAACTATATGGGCACGGTTATGAMTATTTTGTGTTACTTGTAACAGTTCAAAGACTCT 

GCTTGCCTCAGCTTGTAAGGCAGCTAAGAAAGAGCATGCAGCTATCATTCTTTGGAACAC 

TACATCTTGGAAACAGGTGCAGAATTTAGTTTTCCACAGTTTGACAGTCACGCAGATGGC 

CTTCTCACCTMTGAGMGTrCTTACTAGCTGTTTCCAGAGATCGAACCTGGTCATTGTG 

GAAAAAGCAGGATACAATCTCACCTGAGTTCGAGCCAGTTTTTAGTCTTTTTGCCTTCAC 

C 

Sequence 3656 

AG GTACA TAAACCTTGGAATGCCCCTCAGTTGTTTCCCTGAAGGTGGCCAGGTGGTCATT 

ACATTTTCCCAAAGTAAAAGTAAGCAGAAGGAAGATAACCACATATTTGGCAGGCAGGAC 

AMGCATCGACTGAATGTGTCAAATTTTACATTCATGCAATTGGAATTGGGAAGTGTAAA 

AGAAGGATTGTTAAATGTGGGAAGCTTCACAAAAAGGGGCGCAAACTCTGTGTTTATGCT 

TTCAAAGGAGAAACCATCAAGGATGCACTGTGCAAGGATGGCAGATTTCTTTCCTTTCTG 

GAGAATGATGATTGGAAACTCATTGAAAACAATGACACCATTTTAGAAAGCACCCAGCCA 

GTTGATGAATTAGAAGGCAGATACTTTCAGGTTGAG 

Sequence 3657 

TATNGGGCNTTTTTNAGCTCCCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTNNGG 

AGACGCTCGTTNCCNTCGCGCTTCTCCTCGGCCAATTCGCGGAAGAAGTGGCTCACGCCT 

TCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGGCC 

TGCAGGTACCACTACATCATTGCAAATCTGGGATTTACTGATGGAGACCTATTAAAAATC 

CAGTTTGGAGGTGCAAATGTCTCTGGATTTCAGATAGTGGACTATGATGATTCGTTGGTA 

TCTAAATTTATAGAAAGATGGTCAACACTGGAAGAAAAAGAATACCCTGGAGCTCACACA 

ACAACAATTAAGTATACTTCTGCTCTGACCTATGATGCC 

Sequence 3658 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGCTCTCTTCCAC 

AAGCACAACCCTTCTTCAGACGTCCATATACACTTCTCCCAAAATCCTGTGCTTAACACA 

TTGTTACTGAGCACACAGGCGTGCTGGGGCAAGCGACGCCTGGGAATATGAAGCGAACGC 

CACACACTAGAACGCGCCCTGGGAGCTGGCCTCCGGTGGGGGACGCAGGTGTTGAAGACA 

TAACCGCATGACCGAGAAACCTCAGTTTTGCTCTGAAGGCCTTCAACTGCC 

Sequence 3659 

CCGCGGTGGCGGCCGAGGTACTTTN I M 1 I I I J M II 11 ! I I I I 1 1 J I I I I AATAGTGAA 

TAAN ACTTTTATGTTAATTTTTCGGTTATTTTA I 1 11 1 GAAAAATATTTCCCAAAAG AGG 

TAAGGAATATGATGTAGGCATAGGGGTATCCAAGGGANANAATACAGTCNTCGAGCGGCC 

GCCCGGGCAGGTACTTCCTGTAATTGACAGCCTTGCCTTGTTCCTCATGGCATCATTCAA 

GGNCATCCCCTAAATTTTMCACTTTCGATGCCATATACGTTGTAACCTTCCTTATAAGT 

TGCAAAATTCTGTGAATTCTGCGAGGAANATGGGTTMTATTCTGNGCATTCTTTGCCAC 

CTTCATTCGTTTCNCCTCGGCC 

Sequence 3660 
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CCGCGGTGGCGGCCGAGGTACTTATAGGTAAGAGCCAGCTGTAGCTCATGTGGCTGGGTA 

TATTCTTGATACTCCTTTATTCACCAAAGGATTCTTGTTGCCTCAGGAAATGGCTGACCT 

ATGTAAATTACAGTGGTCTGAACTCCCTGCTAGGCACTGGAGGGGTGGGGACCATAATGG 

GTGGGGCTGGACCTAGCATGCCCACCTACAGGTCCCCGCAATGGCAGGTACACTTGAAGC 

CAMTTTCTAAMCTTGTTTTTCTTAAAAM 

ATCAGTGTGTTCACTATGCTTCCACACTAGCCAGTCTTCTCACACTTCTTCTGGTTTCAA 
GTCTCAAGGCCTGACAGACAGAAAGGGCTTGGGGA I I U I I 1 I 1 CTTT 
Sequence 3661 

N CC N G GCAGGTN CCCCC NAANAMTTACAGTGTTTTCTTAATGAACTTGCAAAGGAATAT 

TGCTAAAAACAAACAAAAAAMCTGTTATCGAACTTTCTTTTGTTGC 

CCNTNN TNTCA AGTNTTATCTACCTTCTTCTTTGTCGTTCTCMGNGTTATTGCCATACC 

ATATANATTTTCTAGCCAAATNTNNNNATCTCTNTAAANAAGAAATAACCTCGTTTNTGA 

GNATAT7TTMNCANGTAATGTCNTTC 

NMTTACAANNAATTGCAAGANGTTTATGCTTTGTGNTTNNAATACCCTTCCAAGCAAAT 

TAATAATCCAAAACAAC N7TTTCCCTTC NTTTCCCTTAAATANCNTGGNGNCTTTACCTT 

NGGCCTTTTCCCTTAAGANGGAANCCAANANGTCCTAATTCNCTTCCGCNATGGCCACCA 

CNTTCNTTTNNTTANTCATGTTGCTCNTTCC^ 

ACCGGGCCAAGNTTTNNTTAAAATANGGAAATTGGGGGA 

Sequence 3662 

CTATAGGGCGAATTGGAGCTCCC CGCGGTG G CGGCCGGTAGCTTCTGATAATG ATGATGA 

GAAACAGAATTCTGATGATGAAGAACAACCACAGCTGTCTGATGAAGAGAAAATGCAAAA 

TTCTG ATG ATG AGAGGCC AC AGGC CTCAN ATG AAGAAC ACAGG C ATTCAN ATG ATG AAGA 

GGAACAGGATCATAAATCAGAATCTGCMGAGGCAGTGATAGTGAAGATGAAGTTTTACG 

AATGAAACGCAAGAATGCGATTGCATCTTGATTCAGAAGCGGATNGTTGACACTGAGGTG 

CCAAAAGATAATAGTGGGAACCATGGGATTTATTT 

Sequence 3663 

A7TCGAGCGGCCGCCCGGGCAGGTACTAGACGGGGCTGTAATCCGGTAACTGTATGTATT 

TTAGTTCTCCAGTCCCTGGGAAGGAGACAGGAGAAGGTGGGAGGGAGGAAGGGGCCAGCT 

GAAATGGAAACAGATCCCTGATCCGGGGCGGCCAGTGGAACCCTTCTTGGTGTGCGAGAG 

CCTGTGCATTTCAGAGGCAGCAAAAAAGTAAAAAAAAAAAAAAAAAATTGATCTTTGTTT 

AGATTAACAGACCCCTGACTATGAAGAAGGAAG'GCATCCAGACCAGAAACCGAAAAATGT 

NTAGC AAATCCAAAAAGTGCAAAAAAGTG C ATGACTCACTGGAGG G ACTTNCCCAAN AAC 

AGCTCGTTTAACCCGGCCC 

Sequence 3664 

CCGCGGTGGCGGCCGCCCGGGCACGGTACACAGTCCAATGATGCTAAAAAGCAAGCACAT 

ATGTAAGTCTGCAAAATAACCAGCTGACAGCATGACGACAGGATAAATTCCACACATACC 

ATTACTAACCTTAAATGTAAATGGGCTAAATGCTCCCATTGAAAGACACGGGGCANGCTG 

GGTAAAGAACCAANACCCACTGGAGTATGCCCGTCTTNAAAGCAACCCATCTTATTGTGC 

AGTGCCAATACATAGGGCTCAAAAANTAAAGGGANTTGGGAGAAAAAMTATTTTCAAGC 

CNAATGGGAAAAAA 

Sequence 3665 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTAGTAATTATMCAATGCATTAAAATTTTC 
ATTTCATGTCATAGAGAATCAGTTTTCTTCATGATACA7TATGTTTTACTGAGTGAGTTT 
GTCCCTCCAGAGACCTTTCTGGGAACATGCCTTCTCCAGGGACTGCTTCCTAAGATGCCC 
AGGTTGCTTACCACAGGTCATCTTTGGTCATTTAAAAGCAGTCAAGCTGTNTGGACTGCA 
CCCCTTTCAACTGGAAGTTGGAGCCAAATTCC I I I 1 1 I I I I I AAAACCCTAACCNCCNNC 
CNTTAATAAAAAAAAAAAGGGGGNCNTTTNNGGGNTTGGGGGCCAAAANC iTTTTTTTn 
AMCNCCCTTTNGGGGGGGNGAAAATNNTTTTNGGGGGGNAAAAAAAN rTTTTTTTI 
Sequence 3666 
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CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I N 1 I I 1 1 1 I I i I I I 1 
rTTTTTTTTTTrn NAAAGGNTTTTGTTAAAAACTAANACNCATNTTAGCCTAGGCCTAC 
ACCCGGTCAGGATCATTCATATCACTGTCTTCTGCCTCCACACCTTGTCCCACTGGAAGG 
TNTTCAGGGGNGTGATCGTTAATACTGAGTGTCAAACTTGACTGGACTGAANAATGCANA 
GTNTTGTTCCTGGNGTGTCTGGGAGGGTGTTGCCAAAGGAAATTMCATTTTGANTCACT 
NGACTGGGAAAGGGAAAACCCCCCCTTNATTTTGGGGGGGCCNCCATTTTAATTAAGNTT 
TGCCCNNCCCCCATTTTNNAANANAAAACCCCGGAAGGGAAAAAAAAAATNGGG 
Sequence 3667 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAGCCCAGGTAAGCATACCAAGCAAGCCCC 
CTCACACCTTTTGTCTGAAAAAAGCTTGACTTCTTTGCAGCAATGTCTCGCCCATTCCAG 
CAGCAGTATCTCAGTTAACTTGGTTCCTTTTCCTCCAGGCTCAAAACAAAATCAAACTCT 
TCCTGGGTCAGGGGCTGGATTGATAATCTCATGGACCACTTAGGGTTGTCCTCTTTTTGC 
TAAAATGGGGTCATAAATNGGGGAATTGG I I I I I NTCAAACTTNGGGTTTTGGGCCTTGG 
GGTAAAAGCC I I I I I I I I i AANCGNAATNTTTTAATGGAANTTCCCTTGNCCNNATTGNC 
CNTTGGGCCl I I I I I I I GGNAANATTTGCNTTAATGGGGNAAAAAAAAAAAAAGGGCTT 
Sequence 3668 

CCGGGCAGGTACGCGGGATGGTGCAAATGGTTTCCAACAACAGCGCACTTCAAGGTTACC 

TTCATAATTCTTTTCTGCCAGMCCCAAAAAACAATACTCTTGAGCTACTCAGTGTTCCA 

ATTGTTAAAAATTTCCTGAAATTTTCCTTCATGTATTCAAAGTGAAACATAMGAT^ 

AAGGATGGTTGTGAAAAGTATGGACTTTATAGTATCTAGTGGGCATTTTCATTGAGCCCA 

MTGATAMTTCTGTTTTCCAAGTCTTTW 

NGTCCCTCGGCCGCTCTANAACTAGTGGGATCCCCCGGGCCTGCAGGGAATTCGATATCA 

AGCTTA 

Sequence 3669 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTC 1 I I I 1 1 I II I i I I I I II I 

TTAGAACTTCTTATTGGGGAATAATTTCAAATTAACAGCANTTATGGGAACCAGCACAAA 

GAACATATGATTCGTATGCAGATTCAGCTACTATTAACATACTGNCCATTTCTTTTTCAC 

TTATGCGCCGTGCTCCTGCTCATGCTCTCTCTCATGTATATATGTATGTATGTCTTTTCC 

TGACCTGTTAGTAAGTTGTGCCCATTAAAGCCTTCTTCGCTGTGACACTTTAGTGTGTTT 

CTTCGAAGAAAAAGGGCACTGTCCTGCAGAACCACAACACAATTATCAACTTCAACAAAT 

CTGTAATAACAATGCACTATTCAG ATTCCA 1 1 I I I GTTAATTG AC CC AATCCTT AGTGTT 

TTTTCCCTACAATACAGGATCCAGTTTAGAATTATGTCCGCTCTANAACTAGTGGGATCC 

CCC 

Sequence 3670 

TCACTATAGGGGCGAATTGGAGNTTCCACGCGGTGGCGGCCGCCCGGGCAGGTACGCAAG 

AMGTTCAGAGATGAGACCTTTGGTTGTATTCNNCCTTTGGGACATGGGGGATGTCTTTA 

GTTCAAAGTCACAAATAAATGCAGGTTCTACAATTCAGAGGCTTCATATCCCTGCTGGAG 

TATTACATGTTTATTCAGGATGGACCACTTTTCTTAGCAACAGTTTCTAMCCTTTGCCA 

GGTCTGGGAAGTCTGGCAGGAGAGATTTCTAAGAACCAATCATTCCTGCACACACTTCTT 

GAAGATAATATACATTATTCCCTAGTTATCTCTT 

TATTACMCMTCTTTATTGGAAAACCCCAAGTATTTTGTCTTGAAAAATCAGCAATCCA 
GGTATTAAAAATAGCATGGMTGCCCAATTTTACTTTGATAATTACATGGTAAGTCAAGT 
TTCCGCTCCTGGCTAAAAGCCTTGGATTTTCTTCTGGGGCAGGTCCTAAANGCACAGGGT 
GGCATGGAMGAAAATCATTNCATTTTCATCTCCTNCCCTTAATTTGAATTGGGGGTCT 
Sequence 3671 

ATAGGGCGAATTTTAGCTCCACCGCCNTGGCAGCGGNCNCCCGGGCAGGTACAGGGCTTG 
AGAGCANAGCTGCCCATGTCTGCTTTCCCTCTGAACCCTGACAGCAAACTCTTCTGAAGG 
AGCTTTTCACTCTATCAAGGTATGATAAAACTTGAAAAGAGATGGCTGGATTGAGTGCGG 
NGCGGGGGTGGGTATCAGAGGTAGAGAAGTTCCAAGGTGGCCCCATAACTTTCCTATATC 
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CTTGAAGAATCGTTTTTCATTTGCAGAATNAAA 
Sequence 3672 

CCGCGGNGGCGGCCGAGGGAC \ nTTTTTTTTTTTTTTTTTTI I GCCCTCCCAGATTCTC 

AGTCCTTTGGTGATCACTTMCTGGACMTCAGAAATGAATGAAATCCACTTTTCCAAAC 

AAGACATACCAAGAGAAGCCCGAGCAGCTCACCTGGGTCATTGGCTAACACAGACCCCGC 

GTACCTGCCCG 

Sequence 3673 

A GGTAC I I M I I 1 M 1 1 I M M H U I J M I I 1 H I GGGGGAAAAGAAAAATAAACTATA 

GTTTTTGNCATTMGTTTACTAGTCCAAAATAATGTATANAATTGGTTACCMGTTGGM 

GAATCAGGAATATTTATGATAAMTCATGGCATTCCTCAMNAMCTATTTTCAGNGCTT 

CCT AMG TCTATAGCAAACATGATGTAAATTATAAAACTTATAACTAAAACAACAAATAA 

AAGTTTTCTCAATTA7TTAAG G N AAAAATAAAAACCTTGGACTGCTTCTCTTCAATTAGG 

GGANAC 

Sequence 3674 

CCGCGGTGGCG GCCGA GGTACAATAAAGTCCTTACTTAACATCATCATTAGGTTCTTGGA 

MCTGTGACTTTATTTTTCTTTTCTTTCCTTCCTTTTCI I 1 1 I 1 I I I GGGAGACAGGGTC 

TTGTCATGCTGCCCGGGCTGGTCTCTGATGCCTGGCCTCCAGTGGTCCTCCTACCTAGGC 

CTCCCAAAGCGTTGGGATTACAGGTGTGAGCCACAGCACCTGGCCAAAACTCAAAGTGAA 

AAGTTACCAAGMCCTATCCGTGATGCTAGGTTTTGACGACTTACTGTACCTGCCCGGGC 

GGNCCGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAATTCGAATATCAAGCTTAT 

CGATACNCGTCGACCTCGAGGGGGGGGCCCGGTACCCAACTTTTTGT 

Sequence 3675 

AGGTACACAGCAGTGCTCACCGAGCGAAAGATGAGCAGGGAGGTGAGATTGACGTAGCGC 

ATCAGCGTCCTTCTAAGCAGGCGCCCGTGCTCGTCGCTTCCGTGAACACTGCTAGAGATG 

AGGAACATTAGCCTGTCTGGCCAGGGCAAATTCACAAACTGGTTCCACCATCGGTTCACT 

ACCAGAGTAACATAAAACCCTATTTATTTGGTTTGTAGTCTCCGACAGGAAAGAGAATCT 

TCAAATTTCTGGCTCCCCCGAAAGAGGTGCCTTCAGGTTCGGTGCTGCACGCCCGGTGTC 

ACCCTTCGCTCGTGCACAGTCAGATTTTCTTTTCTGNTCTCTGGCCACTGAGAGGACCCA 

GAAGTGTGACAAGGAACAGAAGG TTAAATG GT 

Sequence 3676 

AGGTACI n I I 1 1 I I I 1 I I I I II I I I I I I I I ATGGAGTCTTTAAGATTATCTCTATGTTG 

GCTGGCTGCGGNGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAAGCGGGAGGCC 

AACACTGGGGCTACCCAAGAGTTGACCAAAGGTGGTATTAATACNTATGCTTACTTATTA 

CCTTGTTCGCACAGTTCTAGTGTAAGTCTTGAAMTCAAGTAAATTTGGAAACATANTTG 

CACAAACTCTNTTATGAGTNACACAGNTAATCCAGAATATTATTTCTAAGTTATTCATAA 

AAAGCCAACCAAAGCCAAAATATTNAATGTTACCCTTGCCCCGGGGCCGGGNCCCGNTTC 

TTTNAAACTAAGTGGGGAATTCCC 

Sequence 3677 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANTGCGGTAGTGA 

CGGGTTACAGTCCCGTGGGCAGCCTCTGCTTCTACTGTCTTGCCATCTGGACAAACCAGC 

ACGCTGGTCATCATGCCGAGAGAGCCATACCCTTGGGCCACAGAGTCCGACTGCACGTCA 

CCATCATAGTTTTTACAGGCCCAGATGAAGCCTCCCTCTGATTTCATAGCTTGGGCCACC 

ATGTCGTCGATGAGCCTATGCTCATACCAGATCTTTTGAGCTTCAAACTGGGACTTGTAC 

CTGCCCGGGCGGCCGCTCTAGAACTAGGTGGATCCCCCGGGCTGCAGGNnnnnnnnnnnn 

nnnnnnnnnnnnTACCCGTCCGACCTCGAGGGGGGGCCCCGGTACCCAGCI I 1 1 IGTTCC 

CTTTAGTGAAGGGTTTAAATTGCG 

Sequence 3678 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCGCCCGGGCAGGTACAATAAGCACACA 
CCCAGAAGAGCTGAAGGCTGAAGACAGAGACGATATGGCAAGAGGCAGTGGCCTGGAATG 
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GGGACTGACCACCCTGCAGAAGTTCAGCCAGGTAGATGTGGGGCAGGGGAACGCTGATGG 

TGGTCTCAGGGGGAAAACTCAGGACCTGCACATAAGTGGATGACCGGAAACAACAATAAA 

CATTGTGAGATCTGGAAACCCTTTTCTCCAACTGGCTGAAGTGGACCCGGGCTCCTGGGA 

AGTAGTTCCTAGTGAGGGAGGCAAGTGTGGGTTCTTCTATATATACATCCAGGTGAGGGG 

GGGAATTCACATTCAGCAGTCTCAAGAGCGACTGTTAGCTTCACACACCTTCTCATGGCC 

CCC 

Sequence 3679 

AGGTACGCGGGGACTGTTGCGACTGGCATCCGCATCCGGCAGATGTAGATGGAACCAAAG 

TCCAGAAGTTACGCGTCACCCTTGCTCTACAGCCAAACATGCAGGACTCTAGTAACCCGC 

GAAATGATGGGATAGCCGTTGCAAATCCTTAAAAGAGTCTTAACGAAATCCTGGCTGACA 

TTGACTTCTCCACTGCAACCATCGAGTTCATTGTCTCCTAAACCTTGCCATGGAGGCCTG 

TGGCACCTGAGCCAGCCATTATCATCACCAGCACTTCCATGAGCTACAAGCTGGACCCAC 

TGCAGTCCTCCTGACACACTGAAATCAGAGCCTGCACACAGAGCAGCAGATGCTTCAATG 

T AAAGGTC ATTTTCC AG G GTC CTTG ACA 

Sequence 3680 

AGG TAC I I I 11 1 M i M 1 1 1 1 1 1 i 1 I' 1 1 1 GCAAATTTACACAGTTTTCTTAGGAATGCTC 

tttci rmrm ictctttggactctacattcctgtagctcctattcttcaagggcacag 

GG GCTTACTTAMCATAAAATTTTAC AACTTAACTGGAAGGGTGCCCTGCTTG CAAATGG 
AAAACTGCCACAGAGCAGACTTAACTGCAGCAAGCAGGTGACAGAGCAAGGAACAGGATC 
C AAAATTATG A I Mil IGNGTGCTACTGMGTAAMGMTTTAAGAAGGTTTTACTCAGG 
GTCATAGGCTGGGTTGGAAGACTGGGAAAAAGGAAATCAGGAGTATCTATATCTA 
Sequence 3681 

CNGAANAATTTGGGGAAGGNCNTNCNCACTCCGGCCGGGGTGGGGGCCGGGGCACCGGAA 
GGGGTTAACCCCCCTGGGGCACCCTTCAAATTAACCCCTGGGAAAAAGGNCCAGGTTGGG 
TTTTCAAACCNCCTTTNAAGNCCAAAAAAANTCCTTNCNCAAAAACCNCAAGGNCGGGCN 
ATNGAAAA CCNAAANTTTTTTTCNTTMCCNATTCCAACAAAACCCGGAAACCCCTGGGG 
I I I I I I T i' NNAAAAMTTTNCCAAGGCCTTNNTTCAAAAAGGGGGCNCCAAAAMTTTTC 
rTT TTTI I CNTTAAAAAATTTTTAAAACCCCCAAAAAGGAAANACCTTTTTCAATTTCNN 
AGNTTTTCCCACCANGNNMGAAAAAGGGGGTTTTATNTCCCCAACCCNCCCNNGGGGGG 
GGGGGGGTA I I I I I I I I I I GGGGGGGGGGGGGAATTNTNCTNAAAAAAANAAANAANGNT 
TNTTTTTTTn I AANAAAAAAAGGGGGGNGGGGGGGGGGNGTN 1 1 I l l 1 1 I I NGGGGNGA 
ANAANCN I rTTTTm I NGGGGGGNNNCCGCCCNNGNNTNNANAAAACCNCCCCCCCNNT 
rTTTTTTTTTTTTTI INTTGNGNGNNAAAAAATTTTTNCCCNGNGGGTGGGGGGGGGGGN 
GNAA^AAAACCCCCTC , I I I I I I I I 1 1 1 1 1 I TGTGGGGGGGGTGGTTTTNTNNTAACANCC 
CNCCCCCNTChn"GGNGGGGGGGGGGGGTTTTTNNTTCTTGNGGGGGNGGGNNNNCCNTTN 
NTCGGTNNGGGGGGGGGGGGNAAAAAAAAAAAAAAAAATTTNCCCCCGNGGGGGG 
Sequsnce 3682 

NGCGAGGTACGCGGGTGTGCAGCCACAGTGAGCTTCTGAACATTTCTTCTCAGACTAAGC 
TCT TACACACAGTNGCAGTNGAAAGAAAGAATTGCTTGNCATGGCCACAGGAGCAN GCNG 
CTTCCTGCANACATGACTGTCAACNCAAAACNTCATGTTNACTGGAGGGCNAGGACACAA 
TGGTTTTN N CN ACAGGAGTACCTCAN GAGGCACAATACN AN GTCACACTCAAATTGGTGG 
CTTTTTGCCNCAAAAATATTACCCNCNTTTAGGGTNANAGATTCTTTCCCCCTTCCCTGN 
CCCAAAGTAAMGGAAANMGGTGGGAAATAAAGCAAGGCNNATTTGMGTTACACTTTC 
ACCCAATTGGGGAANAAACCTNTTGNAAAANGTNGCTAAGGGNGGGCAANAAANTTNGAA 
AAGGGGGGCCTTTCAAACNCCAAAA I ITTTAATTTTn I GGGNTTGGCCCAATTTGGGGN 
GGCI Mil I AAAAAGGTTATTTNTTNCCCCTTTGGGAAAAA 
Sequence 3683 

AGCAAAAGGCCAGGAACCCGTAAAAAGGCCCGCGTTGGCTGGCG I ' I 1 1 1 1 CCATAAGGCC 
TCCGCCCCCCCTGACGAGCATTCACAAAAAATCGAACCGCTTCAAAGNTCAGGAAGGNTG 



Page 569 of 63 5 



WO 01/51628 



PCT/US01/00798 



Table I 



GNCGAA 
Sequence 3684 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGATGG 

GGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAGATGATCCACCCGCCT 

CAGCCTCTCAAAGTGTTGAGATTACAGGCTTGAGCCATTGCGCCAGCCCTACCTTTGTTT 

TCTT*\ATAGTAGGAAAAATATGCCTTTATTTCTGTTATGATTTTTCAGAGAGGTAGAAGA 

TMTAGATGCAAACTCTTTGGAANGGAAAGCTTGTACCCTTCGGCCGCTCTAAAAACTAA 

GGTGGGATCCCCCCNGGGCTTGNAAGGGAAATTNCMTATTCAAAGCCTTTATTCGAATA 

CCCGTTCGAACCCTTTNAAGGGGGGGGGGCCCCCGGGTACCCCAANCI I I 1 I I GGTTTCC 

C 

Sequence 3685 

CTAC7TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I II 1 1 I I I I I M I J 1 

I I I I I I I I I GAATCTTAAATGTG CTCTACTAAGTGAAAAACCCATATACATG ATTCATGG 

GCTACATATCACATGATTCCATTCATATGATACTTTGGAAAAATTAAAACTATAGGAATG 

MCAGACTAGGGCCGGGGTGTCCAATCTTTTGGCTTCCCTGGGCCACACTGGAANAANAA 

TTGTGCCTTGGGCCACACATAAAATATACTAACACTAACAATAGCTCTTGAGCAAAAAAA 

GTCCATGTNTAAATCTCATAAGGTGTTAANAAAAGTTTACGMTTTGTGTTGGGCCACAT 

TCAAAGCCCATCCTGGGCCCGCATGCAGCCCCACGGGGCTGTGGGGNTTGGGACAAAGCC 

TTGGGAC 

Sequence 3686 

CCGGGCAGGTACGCGGGGGGAGGCAGGTTGGAGCCGCTGCCGTCGCCATGACCCGCGGTA 

ACCAGCGTGAGCCCGCCCGNCAGAAGAATATGAAAAAGCGGANCGACTNGGTTAAGGGAA 

AGCCNCCCGAGATGACGGGGCATTTCTGGCTGCACGCNNCGCAAGTCAGAAGGGGACATN 

GNTATNATNCATTGTCAGCACCANAGCCAGNAAGNAGAGGCNAANACGGACCAAAAGGAA 

GGGAAGTGGAAACACCAATNNTACGNCNTTNTGGTGGGGCNTATGCGGTTGNTACCCAAA 

CCCCCTTTTCNTTGNCCCCCTTTNCGNTNCCTGGTTGGTTGGCCCTTGNGAAANNCCCAA 

GGNGCCCCCCACCNCAAANGGCCTTTCAGGNCGGTTTTTTACCNTTTCCCTTNGTTAAGN 

GTGGGCCTTNCACCCAAGGGNNTTCTCNCCAANTCCGAGCCNCCGNAATTNGGGGNCCAA 

NTTTTNCCCCCCTTTTT 

Sequence 3687 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAAGAACGGCGGTAACAGTT 

ATTGGCAAAMGCATGAAAAGAGAMGCACTTTGAMTTTATTACTAGCTTGCTACCCAC 

GATGAAATCAACAACCTGTATCTGGTATCAGGCCGGGAGACAGATGAGGCGAGAGGAGGA 

GGAGGAGOAGGAGAAGGCTCTGGGCTCCTCTGCAAAAATAAAAATAAAAAAATAAATAAA 

ATTTTAA/sMTMTAAAMTTCACTATATACACATATAAAGAAATAAAAAGAAGTCTCAG 

TTGCCAAAAAAAAAAAAGAAAAAAAAAGTACCT 

Sequence 3688 

AGGTACGCGGGTATTATGGAGATGCTCTGTTCCATCCAGGGAGGTGCAGGTGAAAAGAGG 

GCAGTTCATCCTCCTCACACTTAGGGCAGGGAGCAGCCATGCAGGGGCANGCATCAGNCA 

GTCTTTGCCATTGTTGGACAAAAATCCCACCTTCTCNTNGCTTGCAANTATGGCTCTTAN 

TGCGGNAMTGTTTGCCATGAACCTTTAAMNTNTTTTCTTNTTNTGNTNAANANATTGG 

GGG 

Sequence 3689 

AGTTGCAACAATTCATCTTTATTTCTTATTTTC^ 

TNACCCCAGGTATATTNGGGATAGTNGGCTCCTCCTGGGNNAGGATGGCTGGGCNCNTTT 
CNCCNGGGGGGGGNN I I I I I I I I I I I GGGGGGGGGGGGGGGGGGGGGGNTTTTNAAAANN 
NNCCCNAAAAAAAANNCCCCNCTNNNNTGGNNGCCNATAACCCNANNNGGGAACANNAAA 
AAANATNAAANTTTNNNAAAAAANANACANNANAGTNCCCCCCCCCCNNCTANCNGNGGN 
AGNAGAAAAAAAATAATCCCCCCCCCCCCNNAGNNAAAAAAAAAAAAMGNGGTCTTTTT 
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TCTTTNNAAAAAANAAATNGNGAAAANCCCCCC 
Sequence 3690 

GAAACACTAC TATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC1 111! IT I 
1 1 1 1 I I I I 1 I 1 GANTCCAANTCCCCANGANGGCTTTA 1 1111 I NTTTTCAACANNCTGNT 
CTGCGGNTTCCTTGGCTNTTTTTGCCCNTATNCCCAAAAACCNGGCNTTGGCACNGGCCA 
TACNGAA/^.NTAACNAANGNTTTGAAANTCTTNTNTTCCTNAANGANGANTNNAGCTTTNT 
NCTTNTTANAAANNTTCCNGACNGGCATNANCNGNCCNGGCANCTGGGNGGCCANTTTCA 
ATTN7TNCCCNNGTACCTNCCCGGGCGGGGCGGTNTAAAACTANTGGGAACCCCCGGGCT 
GCAANGAATTCCGAAATTCAAGCCTTAATCGANACCGTCNACCTTNNAAAGGGGGGG 
Sequence 3691 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTTTTI 

TTTTTTCTCCACCANAAACACGTGCACTTTATTGAATGCCATTGTANAAAAGTGCGTGAA 

GATAAAGGGCTGATACAGGACTGGGCTCCGGGGGCAGGGCGAGGTGGAGTATGTGGGATT 

CAGGTCATGGGCANAGCTCCTGGCCTGGATGATGCCTCCTGATCTATCGATAGGCTTGGA 

AGATCAACACCCGGATGATGATGANCANAATGGTCATGAGGATGTCCACAATCAGGGCTC 

AAATCCCGCGTACCTGCCCGGGCGGCCCGCTCTAAAACTAGTGGGATCCCCCGGGCTTGC 

AGGMTTTCGATNTCAAGCTTATCCGATACCGTTCCAACCTTTNNAGGGGGGGGGCCC 

Sequence 3692 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCCGGGCGTG 

TGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAGTTAGGCTACTCGAGGGTCTGG 

GACCCACTTGAGGAGGCAGTCTGTCCGTTCTCAGATCTCCAGCTGCGTGCTGGGTGCTGG 

GAGAACCACTACTCTCCCCGCGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGTCATAGCTG 

TTTTCCTGTGTGGAAATTGTTATCCGCTCAACAAATTTCCACAC 

Sequence 3693 

TGAGGTTTCTCACACTCATGAAAATGAAAATTATCTCTTACATGAAAATTGCATGTTGAA 

AAAGGAAATTGCCATGCTAAAACTGGAAATAGC CAC ACTGAAACACC AATACCAG GAAAA 

GGAAAATAAATACTTTGAGGACATTAAGATTTTAAAAGAAAAGAATGCTGAACTTCAGAT 

GACCCTAAAACTGAAAGAGGAATCATTAACTAAAAGGGCATCTCAATATAGTGGGCAGCT 

TAMGTTCTGATAGCTGAGAACACAATGCTCACTTCTAAATTGAAGGAAAAACAAGACAA 

AGAAATACTAGAGGCAGAAATTGAATCACACCATCCTAGACTGGCTTCTGCTGTACAAGA 

CCATGATCAAATTGTGACATCAAGAAAAAGTCAAGAACCTNCTTTCCACATTGCANGAGA 

TGCCTTGTTTGCAAAGA 

Sequence 3694 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGTTCGCAT 

CTGTGAATAGTCACTGCACTCTAGCCTGGACAATATAGTGAGAACCCATCTCAAAAAAAA 

AAAAAAAAAAAAAAAGGCAAAGTTCTAGATCCTAACTGGAAGAGTTAGGCCAAGAAGGAA 

GAAGAGGTTCTGGGAGCTGCTGGCTACTTCCTNATCCTCCTACTATCGAGATACCCGCGT 

ACCTGCCCG 

Sequence 3695 

NGGCGGCCGGCCGGGCAGGTACGCGGGAGCCCCTCATGGGAGCCGGACTCCCTGANACTC 
AAGAAAATGCACCCNTCCTGCGTTCCTCCTGCCTGGGGTGACCCATAAGGAACACCCCCN 
CCTTCTCAACAACCACAGAATATA/^ATGCCAAGGGGACCACCAACATGGGCATGGGGGCC 
TGTTATGCCC A I I I I l AAGGGAAAACACANGTAAAATTGGCTCCAGNAACTGAGCTGGGG 
GCAGCTGCTCTTTCTGTCCATGACTTTTGAGGCGGGATCTCGAGCGTCTGTCTNACNGGC 
CTTCTGTCTGCTGTCCTCTCGCANGTTTCATACATTGAATCCAGGATTGCAGCCCTGAGG 
GCCCNCAGGGCTNACGGGGAAGCCCTCGGGAAAAGCCCCGGTGGGAAGTCAAACCTC 
Sequence 3696 
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NTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGCATATAAMTGCAGTTCTGA 

TCCTGTCACCTCTCTGCTTAAAACTCTTCANAGGTTCACCATCGCTTCCAGAAAAAGATC 

AATACTCCTGACGTGGCAACCTGTCGAGACTCAGGACCGCTAGGAAATCANCTCTTGTTT 

ACCCCTCTATCCATTACACTACGATCCAGCTGGGTTTAAGGCATCACTTTCCTCTGAGCA 

ATCCTCCCTGATGTCATGAGGCAN7TGCTGAGTGCCTTCTCTGGACCCTCTGTTTACTTG 

CCTGTTAAATCCTGAACATCTGGAAGGGAGTCGTGTTCAANCAAACATATCTACGCCTTG 

CAGCACGGTGCCTGGCCTGCAGCTGGTGT 

Sequence 3697 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGAGTGGGATGGTCAG 

GACCTGGGCCATTGCAACCAAAATGGGGACT7CCTGGGTAGGGAGGTCACTCCCTCTACT 

CACTGAGCTAGGATTAGGGAGGGTTATTGCCCCAACCATTGCAANGGGAGGTGGAGGGAC 

AGGCTCAGCCTTCTCATTGTCTAAATGAGGCCTAAATGTGTNAAGTGCNATTNCTNCTTT 

TGTGTNCCTGCCCGGGCGGCCGGCTCTAGAACTAGATG 

Sequence 3698 

CCGCGGTGGCGGCCGAGGTACGCGGGATTTATAAACATCCTGCAGGAAAATGANTCTATA 

TGTCAGAATACACATTTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAGAGAAAAACA 

TTAAAAAATGACMGGAAGTTAATGGAAGCAATGCGATGGTGTTTGGAGGTGGAGCCTTC 

ANAAAGGAAATTMTGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCACCTTCTTCCTGC 

CATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCCTCACTAGAAGCT 

AGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGAGAAGTATATGTT 

TGTGGTTTAAGTCAATGGTCTATGGTAAA I 1 1 I 1 I I I ATTAGCAGTCCCAGCCAAAGACA 

GTGCCTCATTTACCTACCATACCATTTATATTATTTATA 

Sequence 3699 

ACTATAGGGCGAATTGGATTTCCCCGCGGTGGCGGCCGAGGTACCCTCTTTCCCTTCCCA 

CCCCCCAATTAMGGCAAACMTNTCACTTTGCTCTTGCTTAACCTAGATTGTGCTTCAA 

AMCTATTAAAATGTAAMGACTTMCAAAAAAACAAAAAGACGTTTAACAGATGTCAAA 

MGCTCCTTAGTGTTTGAAAATTMTGCTTTAAACAAAAGACMCATATTTTATATNAAA 

CANGTTTGAAGAGCCCTISIAATTGCAGCATTCTGTAACATAAACAAACAAAAAGCTGG 

Sequence 3700 

GCGGTGGCNGCCGCCCGGGCAGGTACTTTN I 1 1 1 I 1 H 1 1 llll I I H fl NACTGGNGGA 

MTGACACAMCTGACMGTATTTATTTAGCCTNATTCCCATNACTCACCACTGCCACCC 

AAAAAAGTGTCACCTGAGTATTTATTGCATGCANCCCCAACACTAAAGAAAGATCAGAAA 

TTTmNTTCCCGTGGGTGGAAMTAAAGGGCTNTTTGCCCACGTNTCCAGGCmcm 

GGAAAGCATTTTGGGAATCATTCCANA 

Sequence 3701 

CTCGCCCGGGCAGGTAC i ! I i I ! I ! I lAATl I I 1 1 1 I I 1 1 H I I GGTATTTTTAGTAGAG 

ACAGGTTTTGCTATGTTGGCTATGTTGGCCATGTTGACCAGGCTGTTCTAAAACTCCTGA 

CCTCAGGTGATCTGCCCACCTCAGCCTCCCACAGTGCTGGGGGACTTGCANCTTGTGNAG 

CCACCCGCGCCTTGGNNCCAGAC1 IIIIMI 11 ATGKAAGAGGAAAAAACCANNATTTAC 

TTATGNGANTTTACCCAATGTTTTAACNAl'GNAAGTGTTGTTAGTAGTCAATAAAGTTCT 

TGGAMGNGGAGATTTGCCGCATAACCCANTTATTGGGACCATTGTTGNGACCACNCTNG 

GGAMGAGNGACCNTTTTTTANCGCCATGGAGTTTTTACCATNMNAGTTCCCGAANAAG 

NCNCCNNTTAAAANACNAANGCCACCATT 

Sequence 3702 

CCGCGGTGGCGGCCGAGGTACTTAACTAGGGTTAGCCGAGGTCAGGAGGGTGGATGCCTT 

ATTATGGGATTAGTGCCCTTCTAAGAGGGACAGTAGAGGGCTTGTTCATGCTCTCTCTCT 

CCATGCACAAGAAAGAGCACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCTGACCAT 

GCTGGCACCGTGATTGCAGACTTCCAGCCTCCAGGATTGTGAGAAGATAAATATCTGTGG 

TTTAAGTCACCAAGTCTGTGGTGTGTTTTATGGCAGCACAAGGTAACCCAGATGATGCTT 
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CTCACCACTGGGTGGTCCAGCCTTTCATGGAAGCTGTCAGCTCCTCAGCCTTGCAAAAAA 
TGCTTGCTGGTCACTCTCTTTGGTCCTCTCAGACGCCTTTCTTAGTGGTGCCTCTACCAG 
GCCCTTGACTTCATCCTTTTC 
Sequence 3703 

CACTACTTAGGGCG.AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAGCACAGCAC 

MGCCACGCTACAAAAACGTGGCCAGACTCTTGTTTACGTCAGTCCCTGACCACATTTCT 

AGTCAGTGGGTGAAGTCTTTCAACCAGGGTCTCTGGCTACCTTGACTGCTGTTGTCTGGC 

TGACAGAGGTCTCAGGCTTCCCTGAGTCAGGGCTGCCAGGGGGAGGACCAGATTGTCATC 

TTTGCTTTTTGGGCGACCCAGCCATTTCAGCCTTAGGGCTTCAGAGTGTCTGAGGTGACC 

AGGGGCTGAAGTGAACCCCCAGCACAGCACAGCTGCTCTATAAAAACGTGGCCAGACTTT 

TTrTTTAAAGCAAGTCCCTGTGCTTGTTCCTCCTGACTAGGTMGACTCCTCAACTTGCC 

TCCAGCCACATCTTATAGGTGTGTTCAGATTGGCAACAGGTTCGT 

Sequence 3704 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACNCGGGGGTAGT 

MGCCATGGCGGCCATGAGTTTNTTGCGGCGGGTTTCGGTTACTGCGGTGGCAGNTCTGN 

CTGGCCGGCCCCTTGGCACTCGCCTCGGATTTGGGGGCTTCCTCACTCGTGGCTTTCCGA 

ANGCTGCTTGCTNCTGTTCGACACAANTTGGANACCATGGGAAAAAGACTATTTGTCATC 

ANACCTTCTAGATTCTATGACAGGCG 1 1 I I I I GAAGTTATTTGGGATTCTACATTGCATT 

NACTGGGGATTTCCAGTANCMTCTTTCATAACTCCNGGTGMTGTATTTTCATTGGTCA 

AGGCNTGGAACTAGCAGAAGATTTCCCAGGAAAGGCTATGTCCCANAAACACCTGGGGAA 

TATTTATAAGGCATCCCATNTTCAANCATGGGATTTGCC 

Sequence 3705 

CCCCGCGGTGGCGGCCGTGGCGGACCCCGCGTACI I 1 1 I I I II i I I I I I N I I I I I I IAA 
ANTGTTTTCTGGGTGMNTTTATTCTGTTTTCACATCTAGGTTGTTGGGGAGAGTGATAG 
ACAAANTTCTGG 
Sequence 3706 

CCGCGGTGGCGCCGCCCGGGCAGGTACGCGGGGACACCCTGATCTTCACTGCAGAATGTA 
MGGTTTCAACGTCTTGCTTTAATAAATCACTTGCTCTCCAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAGTACCT 
Sequence 3707 

AGGTACTGGGCTGTGGCACAGCCTTCANAGACAACACNGGTGTATCTCTGATCCTGCCCT 

ACCTAGGCTTNAAGGGGCTTCTGTCTGGAGAAATGCCCNTCAAAGGGTNACCAGTAGGTG 

TTGGCATTGAANTGGCCAGTTNTTAGCATNTACMGCCCATCACCCNAGNATTNMTTTT 

TGGCCCTGNCCCNTTNNGGGCCCCCCNTTANANGGTTTATTAGAACTNGNTTATTTCAAA 

TCCAATNNGTAGGNCCCCNTTCCCNCAGCCTTGGGGTNTGTNNCTTTAAGGGAAATAAGN 

CCCCCCANCCNNNGGNNNANTGGTAAAGGGGGGTATGNCCCCTTTNTTTTCTAANTGGGN 

NGGCNATCNTTTGGGGAANMTTNCTTTTTNNTGNNTTGTCCCCNAANTNGCGNNGGTCC 

NTTTTTT 

Sequence 3708 

CATTACAAAACAAGTTATGTAGOTAGATATTGGGTTGAATCTTTTGATATTTTAAGAMC 
TATCCCACNGATCAGAGAGTrGATAGACCGTTATTAAACATAGTTGATGANTTATCTGGG 
NAGATCAGTCTTTTNCCAMCTTCAAAATTCCAMTCTGNTGGANTAATGTGGAAMTTT 
ANTTTTATCTAANCCAATGTGCTTACACACAAAACCTTACTGNTGTCTTCAGGGCCTGAG 
TTTTTAAAGNNGNCAVBCTl TTGCAATGGTTTTGNTNATTGATTNNGNGAAAATNTTATT 
AN NTAAATTTTCTTAACTTTN CAG I 1 1 I I 1 1 I CCCCNTTTNAATTNNCNTTTAAATTTCC 
TTTNCCANAAAAAMCCCCCCTTATTTNNGGTAAAAAA^ 
NNGGGGNCNNGGTTNCCCTAAATNAAAAAAAAATGGGG I I I I I I I I 
Sequence 3709 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGGCTGCATGCCAGCCAGACACCCA 
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GTCTTGCAAGACTGTCATTGAAAATCTCCGTTTTGCTGTTCTCCGGGTCTCTGCGTCCAG 

TCTTTGTGTTTGGACGGACCTGCCGGGCCATCTTTCTGCAAGAAGATAAAGGAAGACCAG 

GAGTGCCTGCCGAACTCCTATGGAGGAAGTCTAGGAGAGGAAGGGGACAGGGAGGAAGAT 

GGTGTCTGCAAACCAGGAAGCAGCCTTGCCAGACACAGGATTGGCCACAACCTTGACCCC 

AGACTTCCAGCCTCCAGAACTCTCCTACTGCACTGAGAAGCACGTGTTCTCCATTTCCCT 

GGGGGAGACCATTGTATTGGGCAGTTTGGAACAAAACACCATGGACTGGGAGGCTTACAC 

AACAGAAATT 

Sequence 3710 

TATCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGCTGTG 

GCACAGCCTTCGAGACAACACGGGTGTATCTCTGATCCTGCCCTACCTAGGCTTCAAGGG 

GCTTCTGTCTGGAGAAATGCCCTCAAAGGGTAACCAGTAGGTGTGGCATTGAATTGGCCA 

GTGTTAGCATCTACAGCCATCACCGAGATGAAGTTTTCCCTTGCCCGAAGGGCCCCCTAA 

GGTTATAGACGGTATCAGCAGGGAGGCCGTCCCGCTGGTTTTCTAGGAGAGCCCACAGGA 

GGMGGGAGCCTCTTCAGGGGCACTGGAATCTTTTGTGCCAGGGGCTTCTTTTCATCANA 

GCCGGGGAGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnTCAAGCTTATCGATACCCGTCGACCTTCGAGGGGGGGGGCCCCGGTACCCC 

ANCTTTT 

Sequence 3711 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGTAAACAG 

GAGGATCCTTGCTCTCACGGAGCTTACATTCTAGCAGGAGGACAATATTAATGTTTATAG 

GAAAATGATGAGTTTATGACAMGGAAGTAGATAGTGTTTTACAAGAGCATAGAGTAGGG 

AAGCTAATCCAGCACAGGGAGGTCACAGAGACCATCCCTAAGGAAGTGGAGTTTAAACTG 

AGAGAAGCMGTGCTTAMCTGAAGGATGTGTTGAAGAAGAAGGGAGAGTAGAACAATTT 

GGGCAGAGGGAACCTTATAGACCCTAAGGTGGGAAGGTTCAAAGAACTGAAAGAGAGCTA 

GAACAGCTGGAGCCGTTCTCCGGTGTAAAGAGGGAGTCAAAGAGATAAGATTAAAGGATG 

TGAAGATTAMGATCTTGGCGGCATTCAGGGATTGGGCACTTCTTACAAGAAAATCA 

Sequence 3712 

CCGCGGTGGCGGCCGAGGTACGCGGGATCCTCGGAAGAGCGCTCCCAGGAGAAACAAGCT 

TGACCACTATGCTATCATCAAGTTTCCGCTGACCACTGAGTCTGCCATGAAGAAGATAGA 

AGACMCMCACACTTGTGTTCATTGTGGATGTTAAAAGCCAACAAAGCACCANGATTAA 

ACAGGCTTGTGMGAAAGCTTGTATTGACATTGATTGTNACCNAAAGTCNACACCCTTGA 

TTTCCGGCCTTGMTGGGGGAGAAAGAAAGGCATTAATGGTTTCCNANCTGGGCTTCCTG 

GANTTANCCGATGNCTITTTGGGATGTTTNCC^ 

XAAAAACCTGGGAGTTCCCANGCTTGNCCCTTAMTTTTCGTGGAMTTATTACTAATTA 

TTAATTATTAAT 

Sequence 3713 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAACACATC 

CMGCTTMGACGGTGAGGTCAGCTTCACATTCTCAGGAACTCTCCTTCTTTGGCCAGGA 

TTGCTACTCTGCAGTCACTGGGAGCAACTGGACTCTCCGGATTGACCAAGTTCATCCTGG 

GCTCCATTGGGTCTGCCATTGCGGCTGTCATTGCGAGGTTCTACTAGCTCCCTGCCCCTC 

GCCCTGCAGAGAAGAGAACCATGCCAGGGGAGAAGGCACCCAGCCATCCTGACCCAGCGA 

GGAGCCAACTATCCCAAATATACCTGGGGTGAAATATACCAAATTCTGCATCTCCAGAGG 

AAAATAAGAMTAAAGATGMTTGTTGCMCTCTTAGAAAAAAAAAAAAAAAAAA 

Sequence 3714 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTACTCAGGT 

TCTCCATATATTTCTGTAGTGCTTTATGCTTCCAGAGCATCTTGGTGTCTTGATGGCAAT 

ATCTCCTGGACCAATTCTTACTTMCAGATGMGMTGCAATTCCTGTTTTGTCTAAAAA 

CTTGCAGCCTTGTATCAAGATGTATCTGTCTAAATTGATGAGAATGGAGCTGGATGTGAC 

ACTGGAGGTAACCATTAAGAGATGGATGTCTAAAAAGCCAAACACACAGGTGACGATTCA 
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TCTCAGCATCTGAAATACAATAAACACAGTCTTTAAAAAAAAATCAGCTCTAAAGATACA 
TGCATAATAAACCTGGAAGGAACATCAAGAAATCTTGTTTCATCTACTGCCTCAGCCAGC 
TCCACAGTCTTAAAATTCAACTGCCGGCTGGCAGCAGTGGCTCACGCCAG 
Sequence 3715 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCAGGA 

GACGCTCGTAGCCCTNGCGCTTCTCCTCGGCCAATTCGCGGAAGAAGTGGCTCACGCCTT 

CCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCAGAGAGAGGTAGGTGTAGGAGGCCT 

GCAGGTACCACTACATCATTGCAAATCTGGGATTTACTGATGGAGACCTATTAAAAATCC 

AGTTTGGAGGTGCAAATGTCTCTGGATTTCAGATAGTGGACTATGATGATTCGTTGGTAT 

CTAAATTTATAG AAAGATGGTCAACACTG G AAGAAAAAGAATACCC TGGAGCTC ACACAA 

CAACAATTAAGTATACTTCTGCTCTGACCTATGATGCCGTTCAAGTGATGACTGMGCCT 

TCCGCACCTAAGGAAGCAAAGAATTGAAATCTCCCNAAGGGGGA 

Sequence 3716 

GCTCCCCGCQGTGGCGGCCGAGGTACGCGGGGCTCTTGATGCATAAAAACAGCTGGGCTC 

CCTTGGAGACAGAGCGCCATGGGAAACCGGGTCTGCTGCGGAGGAAGCTGGAGCTGCCCA 

TCAACTTTCCAGAAGAAAAAGAAAACAGGAAGCCAAACAAGACGGACACTGAAGCCGCAG 

CCACAACAGCTGCAGCAGAATCTCCCAAAGGGCCATGAAACAACAGGACATACGTATGAA 

CGGGTGTTACAGCAGCAAGGGTCTCAAGAGAGGAGTCCAGGCCTCATGTCGGAAGACAGC 

AACTTACATTATGCTGACATTCAAGTGTGCAGCCGTCCCCATGCCCGGGAAGTGAAACAC 

GTGCATTTAGAAAACGCTACAGAGTATGCGACCCTTCGCTTCCCCCAGGCCACACCTCGC 

TATGACAGCAAGAACGGGACCCTGGTGTGAGCGCTTG 

Sequence 3717 

AGGTACGCNGGGAAGCGTTTACTTTGAAAAMTTANNTTGTTCAAAGCAGGCCCGAGCCG 

CCTGGATACCGCAGCTAGGAAAGCGAATGATAAGGGAAAAGTTCTCAGGGAATTGAAGTG 

TTGTTGCTATGGTGACGTCCTTTTGCTGTGAATAAAGGTGCTCTTTGCAGCATAAAAAAA 

AAAAAAAAAAAAAAAAAAANGTACCTGCCCG 

Sequence 3718 

CCGGGCAGGTACCTCTTTGATGAATGAGCCTGGCAGACCCCGAATCGCATTTAAGTGAAA 

GCCATAGCCATTTTCACCTTTAGCCAGCCTGCAGAGTTTAGGCTTATGATCTACTTCCTC 

TGTAGTATCTGGTGGACTTGAGACTTCCAACNTNACTTGAGGAANTGGGGAGCTNGGGGC 

NCTCCTTGACATGAGCCATTGGGGCAGNNCTTGACTTTGATAGTNGAANAAATGGAGAAA 

AATGAGCCAGTCTGTACCT 

Sequence 3719 ' 
CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTi Ml I M I I ! I ll 1 1 I 
ITTTTTTTTI IGGI Mill IGNI Mil IGGGGGI Mill I GGCAGCCACAGGAGTTTTAG 
CCMTTCAGATGCCTTGCTCCCCACAATTTGGAACATTCC 

GAAGAGATGGGAGAGAAGTGAAACAGCAACAATAAACCCCAAACATAAACAAAAAGAGTT 

AAGCAAAACAAACAAATGCGCAATTCATGTGATTACTGAGTGTTCTAATGGTAAGGAGAA 

ATTAAAAAGCAGCTGGTGGGGTCATCTTAMTTTTAGTCCATTTA/vGGGAAAAATTTT^ 

AANAACAAACTCTAATTTCAGCTACTTACCCTGGGAAAATAAAGGGCTTCAAGGGCCTGG 

GTGAATTTGTTTCTTNTGGCCCANTCCTTTAANATGGCTTGGGA/AAAAA 

Sequence 3720 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGACTGA 

TATTACACCTCAAGGTGTGGCTATGAGAGCTGGAGTTCTGGC'fGATGATCACTTGATTGA 

AGTGAATGGAGAGAATGTAGAGGATGCCAGCCATGAGGAAGTGGTTGAAAAGGTGAAGAA 

GTCAGGAAGCCGTGTCATGTTCCTGCTGGTGGACAAAGAAACTGACAAGCGTCATGTTGA 

GCAGAAGATACAATTCAAMGGATAAACAGCCAGTTTGAAACTGTTACCCCACCAGCCCC 

GMTTGTGGGAGATGAANNAAAGGGAAAGCCAATGGGCCTATTGGGTTTTCTATCTTGAG 

GGGCCAGGGCTTCAANAAACAGAAAAAGGGTCAAAATCATTCAAANGGACCATTAGGATT 
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TCTTGGGAAAG 
Sequence 3721 

CCGCGGTGGCGGCCGCCCGGGCAGGTACl Mil I GAGCCCACTGTGTGCTGCCTGNNTTT 
ATAAGACATTGAATTTTGTTAATAGAATGT^ 

TTTTGTTCACTGTGGAATTGCTAG C AC CTAN AATACTGCTTG ACAG AC AGTAGGC AGTCT 
GTAMTTTATTTGTTTGTTTGTTTGTTTGT^ 

GATGGANTTTCACTNTTGTCGCCCTGGCTGGAGTGCTAGCGGCACAATCTCAGTTNACTT 

CAACCTCCGCCTNCCGGATTCAAGTGATTCTCCTGCCTNGGCCTCTA GAGTAGC TTGGGG 

ACTACCAGGCCGTTGCACCACCCACGCCCTGGGCTMTTTTTTGGTANTTTTTTAAGTAA 

AAGGGCGGGGGT 

Sequence 3722 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGGTGGTGAAGCTGTAGCCGC 
GCTCGGTGAGGATCTTCATGAGGTAGTCAGTCAGGTCCCGGCCAGCCAGGTCCAGCCCGC 
GTACCT 
Sequence 3723 

TGGCGGCCGAGGTACATCCAAGATTAGGTCATAAAAAGTCTGTGGCAGCTACTCGGGAGG 
CTGAGGCAGGAGAATTGCTTGCACCCAGGAGGTGGAGGTTGCAGTGAGCTGAGACCACGC 
CACTGCACTCAAGCCTGGGCTATAAGAGCAAAATTCTGTCTCAAAAAAAAGTCCGTGGCT 
TTTMTCTTGGGGGCTCTGCCTCTTNTGGC I 1 I I I GCTCAGTTACTCTGAAAAAAAGCCA 
GGTGTCACACTGTAAGCTGTCCTACAGAAGAGGCCCCATGTCACCAAAGAAACTTAT 
Sequence 3724 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGTCTCTGTCTGG 

GCCTCGGCCAGGGTGCCGAGGGCCAGCATGGACACCAGGACCAGGGCGCAGATCACCTTG 

TTCTCCATGGTGGCCATTGCCTCCTCTCTGCTCCAAAGGCGACCCCGAGTCCCCCGCGTA 

CCTGCCCG 

Sequence 3725 

GCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGTCCAGGGCCGGTGGCACCCCTTCACCAA 

CAAGCCGACCACGTTCCGCGGGCTACGCCCCCGAGAGGTTCAGGATCCTGCCCCAGCCCA 

TGTGCAAGCACAGTGAAGAGTTGTCCCACCAACTGCAGGCCCAGGCTTTTGGACTGTTAC 

TNCCGGTAAMGGTNGGTTCTTTCCCCTTTNGGGATTCCAAAGCCCAGGCAAAATGGGAA 

CCCCATCAATNGGGCAAGTTTGACAAGANGGTTNTGCTTTGGGNATAATTGAANAANCTG 

CCCTGTTTTCTTTTTCNANGNGGCCNTTGNTTTTTTGGGGGGGCCNGTGGACCTT^ 

GAAAACCCNACTTCAAI l l 11 1 1 I NTTCCAAAAGTTGGGCNATTCTCCTTTAAAAAACCC 

TGGANGNATGGMNGGAAMGANCNTTTCAAAGGGGT7TTTTNACCAGGGGNCCCCTTTN 

NNI ! I ( I I I I AAA 

Sequence 3726 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGGA 

GGGGTAGACATGGAAAGAAAGGCAGATCATGATTAAAGGTTACAGAAGGAGAAGTAGCAA 

GGCTTTMTCCTAGAGCCCATCCTCCCAGAGAATAAATGGTAAAAATGTTTGCCTTTGTG 

CCTTGGGMGCGCTAGGCTCCTGCTGCTCTTTCATCAAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 3727 

CGGCCGCCCGGGCAGGTACCACTCAGAAGCCTTCACTGTCAACTTCGGGGACACCGAAGA 

GGCCAAGAAACAGATCAACGATTACGTGGAGAAGGGTACATT GGAC ATCCTGGACTGATT 

CTGTAATTTTCGTATC \ \ 1 1 1 1 CTCCCATGTTCCCTTTATTTGTCTTTTGGTTCTGCTTT 

TTGGGAGATTTTCTTAACTTGATCTACCTGAGGCTCAGTCCTCATCTGTAAAATGCTGAT 

AAAAGGACCTACCACCTAGCGTTGCTGTGAGGTGTCAGTGAAATAAAATGTAAAATGCTT 

AGCACAGTGAACTTTAATAAATGG 

Sequence 3728 
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AGGTACGCGGGTCAACTAAAAGAGGTATGGGAAGAGACTGATGGATTGGATCCTAATGAC 

TTTTGACCCCAAGACATTTTTCAAATTTACATGATGTCAATANGTTGATGGATTCCTGGA 

T 

Sequence 3729 

TACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGTT 

GTGACTAGGAAAGGCATGGGACGACATGTGTCCCAGGTGAGGTATGTGCCACAGGCAGTT 

ACGGATGTGTTGGAGGAGGAAATTCCAGGAAGGGGCAAGATCTGGAACGAAGGCCTGGAG 

GGAGGTTCCCCTGAAGGATGCTAAAGGAATACGCTTGAATCCATAGGTGCCATCGTGGTT 

GAGACMGTGCAAAAAATGCTATTAATATCGAAGAGCTTTTTCAAGGAATCAGCCGCCAG 

ATCCCACCCTTGGACCCCCATGAAAATGGAAACAATGGAACAATCAAAGTTGAGAAGCCA 

ACCATGCAAGCCAGCCGCCGGTGCTGTTGACCCAAGGGCCGTGGTCCACGGTACCTGCCC 

G 

Sequence 3730 

CCGCGGTGGCCGGCCGCCCGGGCAGGTACAGGCCCCACCCCCCGTCCAAAGGTCTTGCGT 

TGTAAATGCCACCACCATTGTCCAGAAGACTCAGTCAACAATATTTGCAGCACAGACGGA 

TATTGTTTCACGATGATAGAAGAGGATGACTCTGGGTTGCCTGTGGTCACTTCTGGTTGC 

CTAGGACTAGAAGGCTCAGATTTTCAGTGTCGGGACACTCCCATTCCCCATCAAAGAAGA 

TCAATTGAATGCTG C ACAGAAAGGAACGAATGTAATAAAGACCTACAC CCTAC ACTGCCT 

CCATTGAAAAACAGAGATTTTGTTGATGGACCTATACACCACAGGGCTTTACTTATATCT 

GTGACTGTCTGTAGTTTGCTCTTGGTCCTTATCATATTATTTTGTTACTTCCGGTATAAA 

AGACAAGAAACCAGACCTCGATACAGCATTGG 

Sequence 3731 

CCGCGGTGGCGGCCGAGGTACGCGGGGCCTTCCACCATGTGAGGACACAGCAAGAAGGGG 

CTGACTCTGAACCAGGAAGCTGTGCTGTGCTCACCAGACACTTTGATCTTGGACTTCTCA 

GCCTCCAGAACTATGGGGTTCTGCTCTTGTCACCCAGACTGGAGTGAAGTGGCACAATCT 

CGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGATTCCCCTGCCTCAGCCTCCANAG 

TAGCTGGACTGCAGGCGCACGCCACCACTCCTGGCTAATTTTTGTATTTTTAGTAGAGAC 

GGGGTTTCACCATGTTGGCCAAGATGTTCTTGATCTTCTCACCTTGTGATCCGCCCACCT 

NAACCTCCCAAAGTGCTGAGAGACTAAGACACCTTTCTAGACAGAGAGGAGGCCGATGGC 

AGACATTCTCAGATAGGTTTGTAGCT 

Sequence 3732 

CCGGGCAGGTACNCGGGGAGGCATTGAGGCNGCCAGCGCATGGGCTTCTGCTGAGGGGGC 
AAGCGGAGCTTGAGGAAACCGNMGATMNTTTTTTTCTCTTTNAAAGATANAGATTAAT 
NCAN CTACTTAAN AAAATATAGTCAAATCCGTTACTAAG ATATTGCTTTAGCG N N AAGTT 
TTTAACCGTAATTTTAATAGG C TTAAG ATTTTAACCA 
Sequence 3733 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTGTTGTG 

ATCGGAACGTGTCGATCCCCTCTTCTCATCACTGCTGCTCCAACTGGATTTATTACTCCG 

GGAATGGTCTGAGGGGGAAAACCAATGTGTTTAGCGTGCCTGCCCACCTGCGCCTGAGCA 

CAACTATCCTGCAAATCTGACCTGCCCCTCCTGCACAGGAAACCACCTTCCCCTCCCAAT 

TGATGGTTCAAACACTGCCACCGCTGACTGCCCTGCATCTGTGGGTCTGTAGAACAGAAA 

GGTAGMCAACTTATTTTTTAGGATTTMCGACAACCGGTTGAAAAAACGGTAGGGGTGT 

CATGCTCACAGAGAATAAAGATTT 

Sequence 3734 

CCGGGCAGGTACTATCAATGAAGCCTTACCAAAGGACGTGTCTGAGGATCCAGCTACCAA 

GTCAGGATTGTTACCCCCACCACCTGCGCTCCCTCCAAGACCTTGTCCATCACAGTCTGA 

ACAAGTGTCGGAGGCCGAGTTACTCCCACAAGCTTGAGCAGAAGCCCCATCCCAGGCTTG 

CAGAAAGTAGTCCAGCAAAGAAGGATGTACCTCGGCCCGCTCTANNAACTAAGTGGGATC 

CCCCGGGCTGCAAGGAATTCGATATTAMGCTTTATTNGANANCCGTTNAAACCTTTNAA 



Page 577 of 635 



WO 01/51628 



PCT/US01/00798 



Table 1 

GGGGGGGG 
Sequence 3735 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I T! I I I I I M I II 1 1 I I I I AAATGTTTTTGTT 

AAAAACTAAGACACATNTTAGCCTAGGCCTACACCCGGTCAGGATCATTCATATCACTGT 

CTTCTGCCTCCACACCTTGTCCCACTGGAAGGTCTTCAGGGGTGNGATCNTTAATACTGA 

GTGTCAAACTTGACTTGGACTGAAGAATGCANAGTATTGTTCCTGGTGTGTCTGTGAGGG 

TGTTGCCAAAGGAGATTAACATTTGAGNTCANNTGGACTGGGANAGGCAAACCCCCCTNA 

ATCTGGGTNGGGCACCATCTAATCAGCTGCCAGCGCAGCTNGAATAAAGCANGAAGGAGA 

ANATGGAANMTGACTTGCANAGTTCTTCCANCCTTNATTTTTTTN 

Sequence 3736 

GGCGGCCGCCCGGGCAGGTACCCCATGTCCTCCCACACCAAAAGAGATCAGTTCTGGCAA 
GAAAGCTCCAACGTGCCTTGATGGTGCTGTTGGTAGGATGCCTTAGGCCAGGCACCGTCC 
CAGCGATGTTCTGCTGGCTGAGGTACCTNGGCCCGNTCTAAAAACTNAGTGGGATCCCCC 
GGGGCTGCAGGGANNTTCNATATCAAAGCTTATCGATNCCCGTNNACNCTCGAGT 
Sequence 3737 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTATGGGATC 

TTAGGAAAAATGAGCCAATCATCAAAGTCAGTGACCATAGTAACAGAATGCATTGTTCTG 

GGTTGGCATGGCATCCTGATGTTGCTACTCAGATGGTCCTTGCCTCCGAGGATGACCGGT 

TACCAGTGATCCCAGAATGTGGGGATCTTCGATTTGCNTTCCTCTCCACTTCCGTGTCCT 

GGAAAAACCATGCNAGGGGG 

Sequence 3738 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACAGCAAC 

TTCCTTGATCCCTGCCACGCACGACTGAACACAGACAGCAGCCGCCTCGCCATGAAGCTG 

CTGATGGTCCTCATGCTGGCGGCCCTCCTCCTGCACTGCTATGCAGATTCTGGCTGCAAA 

CTCCTGGAGGACATTGGTTTGAAAAGACCATCAATTCCGACATATCTATACCTGAATACA 

AAAGAGCTTNTTCAAGAGTTNATAAGACAGGTGATGCCCGCTTGCAG AGGCT ATGGGGAA 

ATTTCAAAGCAGTGTTTTCTCMCCAGTNACATAGMCTNTGAAAAAACTTTTGGACTGA 

TGATGCATACAGTGTACCTGCNCCGGGCCGGCCGCTCTAAGAACTAGT 

Sequence 3739 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTAAGAACCTCACCT 

CCTAAATTTGTTAGGTATCTTGCAATGCGGCCACTGAGATGGAGAGAGGCTGGGAGCTGG 

ACGAACGGCCATCTCTCCTTCCATCTTGGCCTCGGCTCTAAGAACCTAAATCCCTGACTG 

GGATGCGTGCGGGAGTTCAATCCANGTATCGCCTGGCGGTGCTCCCCGGAGGCGCCTGCG 

GGAAAATTCAGCTTTCCCTCTCCACGACAGGAGGCGCTGTTTTCGTGGAAATCCCCCGGC 

TGCGAACCTGGGATCCCTGACCTGGATCAAGTCTCCGAAGCTGGCAGAGTCCATTCTGCA 

TCACCGGTCTTGGGCTTTGAAGAAGCCTAGGAGAAATTCCGCTTCGGCCATCACGCTATG 

AAAAGTGGA 1 I I I I I 1 1 I CTTAAGTCAA I 1 1 11 I 1 1 1 I I 

Sequence 3740 

CCGCCCGGGCAGGTAC 11 1 1 1 1 1 1 1 I I 1 1 I M 1 NTTTTNTTTTTGNCCAACGAAATTATT 

TATTGCTAGTCAGGAGGCCCAGGACCNTCNANANTGAANAGTCNATGGGCCCNTGTGCAT 

GTTGANAGNTACACAGGGAATCGCTTGTTCACCCTTNTNAAGGGNATAAAGNCAGAAANA 

GNGGCCCACCANNANTGGCCTCATNGGGNCCACNTTCCGGGNGGCCACAAGGGGGGCCCC 

AGGGCNTTGGNTGCNNTCAAANANGMCNCTCANGGCCTTTTNNCAANNCTTTTGGCATT 

NTCCCAMGGNTACCCTTTATNTTGGAANANGCCCCCCCCA CNTTG NNTTTTGGGAATGG 

GGCNAAMGCCTrrmANNGGCTTTTTGCTTmGGGGGM 

TTTTGGCTTNCNNATTGGAAMTNAAAAMCATTTGTTTTNTTNGGGGNTT 

NTTAAGGGGGGAANCAAAATTGGTATTTCANTTCCAAGGGGA 

Sequence 374 1 

TTCAGTMCTATTTGAGGAATGAAATNGTNNTTGATATATTGATTTAACTGAGCAAATCC 
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ATAAGTGCCACGCTTGGAGTAAAGNACAGGGAATCTNATKTTTTGGGTNANTTCAAACAA 

GGTAGGANACCATCCACTATCTNTNAAGGACAAAGGAGATTTGAGTGACAAACATGTTCC 

AGACCATCCAATGGTGTTATTCTACTACTTCATCCCAGCTAGTAAAATGGGGCAATAAGA 

ACCAAAAAGCAGGTACTGTCCCTATGTAAGTAGACTTCC 

Sequence 3742 

CCTCTGCCI I I I I IGTTCTACGTGGGCCCTCAATGGATTGGATGACACCTGTCCGCACTG 
GTGAGGGTAAATCTTTACTTGGTC7TCTGATTCAAATGTCAATCTCTTCTGGAAACCCCC 
TCACAGACACACCTAAGAAATAACGCTTCACCAGCTCTCTGGGTAGTCCTTAGCCCAGTC 
AAATTCACATACAAAANAAAAAAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCN 
Sequence 3743 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCTTGGGACGTGCCATGGCTTGGGAGA 
ACTAGTGAGGAACATA7TTCTCCATGCCAGTCCTCTTCCTAAAGGCCCCCGCGTACCT 
Sequence 3744 

CACTACTT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACA t 111111 1 CTTT 

rnrmn icttcgagatggagtcttgcactgttgcccgggctgggttgcggngagccnag 

ATTGNGCCATTGCACTCCANCCTGGGCAACAAGAGCGAAACTCCATCTCAAAAAAAAAAA 

TNAACAAAAAAAAATCNTTGANTTTAGTTNTATGACCTNACAGGGTCAGTATAGGAGTAT 

CACTGTGAAAAGGTTANGGACATTGGTTNTCTNTNNNTTNTCTNNNAGATGGAGCCTGGG 
CAAC 

Sequence 3745 

CGAGGTACT AAGG TGAGATCATTTMTTTTGACTAAA 

GAGATGGAGTTTTGTTCTTGTTGCCCAGGCTGGANTGAGACTCCATCTCAAAAAAAAAAA 
AAAGT AC C AAAATGTTTACCTTN ATAGTTAAAATTAC AAAAAATTCTTTTTG N N1TTAACG 
TNAATTTTATTTTTCCCCANAGTTTTTCTGTMNTGAAAATAATGCCACCCTM 
AT AAT N G ANT AC ANGTGGCCCTTGGCTATTAAN C N ACTCCTNT AACNTTC 
Sequence 3746 

CGCGGTGGCGGCNCGCCCGGNCAGGTACCTGTTGGCTTTCTGCAGAGATGCTTACTTTTC 

CTCTCCTATCAGAGTTTAATGTCCTGGGACTCTGACATTTTGGCAAGTGTTTTCTTAACC 

CGCAGGGCTGTCAGCCTTGATGCAGGATTGTNAAGCCCAAGCATTCTTGNTCATGAAGTT 

TTTCCCATCTTGCCTTTAGACACTCATCACTGCTTCCACCGGTTTGGGAAATTGAAGGGG 

GCGGTAACTTCNTTGGATGGCACCACCAATCTTCCCCTCATGGTCCCTTCATAAGGAGGG 

GTC ACTTGG G GCACTTAGGGTCAATGAATAAAGGGAAAGCTTGG GTATTTCTTTCCC ACT 

AATTACCCTCCCTGGATACCACCANTNTTCCCGTAGGCCAAACCCTTNCCCCAAAAAGGG 

AATTGAAGGGGCCCCAAAAAA 

Sequence 3747 

CACGCTGGTTTTGCATNTTACAGGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGNCAAT 

TCNCGGAAGAAGTGGCTNACGCCTTNCAGAGCCACATCATCGCGGCCGAAATAGAAGCCC 

AGAGAG AGGTAN G NTGTAN NGAGGCCTTGNAN GTACCTNGGGCACGCCTNTAAGAACTAG 

GTGGATCnnnnnnnnnnnnnnnnnnnnnnnnnnnNAGCTTATCGATACCGTNGTACCTTC 

GTAGGNGGCGGGCTCCGGGTACCACANCTTTTTGTTTTCCCTTTTAAGNTGMGNGGGTT 

AATTTTGCNNCGCTNTNGGNCGGTAAATTCCNTGGGTTCf^TTAGGCCTGGTTTTTCCTT 

GCTGCNAGAAAAANTTTGGNTNTMTTCCCNGCNTNCAACNANNTTTTCNCAACGACCTA 

ANNNCATTAACNNTAANCCCTCCGGGGGNAGGCCANTTAAAAAGTTNGTTAAANAGGCCC 
TTG 

Sequence 3748 

CGTTAATCATGGTCATAAGCCTGTTTCCTGTGTGAAAATTGTTATCNCGCTCACMTTTC 
CACACAACATACCGAGCTCGGGAGCATAAANGTTGTAAAAGCCTGGGGGTGCCTAATGGA 
GTTGAGCCTTAACTTCACATTTAMTTGCCGTTGNNGCTTCACTTGCCCGCTTTTrNNCA 
GATCGGNGAAAACCCTTGGNTCCGTTGCCCAAGCNTGCAATTTNAAATGNAAATTCNGGG 
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GCCCAAACCGCCNTCCGNGGGGGANGAAGGGCCGGGTTTmNGCCNGTTAATTTGGGGG 
CCG 

Sequence 3749 

AGGTACGCGGGGGTGCTTCCGGTGCGGAGGTCAGGGACAAGATGGTGCCACCGGTGCAGG 

TCTCTCCGCTCATCAAGCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnNGGTTAATTGCNCGCTTGGCCGNTAATCATGGTTCATAGNCTGA 

TTCNCTGGTGTTGAAAMNTGGTTATCCGNATCACAATTTCCACACMCCATTACGANGC 

CCGGGGAGGCATTAAAAGGTGTTANNAGCCTNGGNGGTGGCCTAA ATTGAAG GTGGANGC 

TAACCCTCACCATTTAAATNTGGCTATTGCNGCTCACTGNCCCCNGCl I I 1 1 1 NCAAGTT 

NCGGGGAAAAACCCCTGGTTNNGATGCCCATGCNTTGGCCATT 

Sequence 3750 

GGGGGGGGGGGGGNNGGTNNTNTGNGCCTCTACfATCCTATNGGGCGAATTGGAGCTCCA 

CCCGCGGTGGCGGCCGAGGTACANAAGGGCGGTCCTGCTGACTTGGCTGGGCTAGAGGAT 

GAGGATGNCATCATTNAAGTGAATGGGGTGAATGTGCTAGAATGAACCCTATGAAGAAGG 

TGGTGGATANAATCCAGAGCAANNTGGGAANAATGTCACACTTTNTANTTNTGTTAAAAA 

GAMNGCCCTMTGATTNMTTTTGNCNANGCCTANTGTMTMNTNCCCCTTAATTTCG 

NTTNTCGCATCACCCTTGGGCCNTNGAATTCNCTACNTGTGTAACAACCCCCTCNTCCCN 

AGNAATTTNNTTAMTANGAAAAGGGGAMTTANGGTTGGGGATGNGAMGGTTNCNTAT 

ANCTCAATTGTAACNTTCGGCC AACC ATTN N NGGCCAAAAAAAAGGAAANCCTGGGGGTC 

CCCCCANCCANNTTAACTNCNTNGNCTCCCTNTGGGTCTGGGNCNCGNCNTTCCTTANGG 

AAAACNTTTATGGTTGGNGACTCCTCCNCCCCNNNGGGGCTNTGCCANATNGNATANTNT 

CTCGGAATATTACCAAANACCTTTNANTCNCNCATTAACCNCCGNTCCAGAAACNCCTTC 

CGTAAGG 
Sequence 3751 

CAAAGATCAAGGCACTTGGGAGAGAGTCAACAATANCATTNTAATACTACAAGCAGACTT 
CTCATGGGGAGGAAGAGAGGGAAGAAAAATGGGCANAGAGGAAAAAGGACATTGTATTTC 

Sequence 3752 

CCGGGCAGGTACAGGTCCACCTGGCTGCTGGTCAGGCCTGCGGACCACCTGTTAATCGTG 
TCCCGGCTCAGTGCAGCGATGACTGTGTTCATCTTCCCCTTGTCCGGAAGGATGAAGAAG 
ACAGTTCCCATTGCCCACGTTAGTTTCATCTTGCACCAG CTTGGN N AGGGGGCAGTCTCC 
GCNNTTCATGGAAGGGTAACTGGATGGTGCTTCGACTTGCNAACATNTATGGGCCNCCTT 

CACACANCAGTTTGTNCTTCGTTCCA 
Sequence 3753 

CCGGGCAGGTACTTTGTCCAGCCCTGGTCATCAGGAATCTGAATTAGGACTAGAGCATNA 

MGCCTTAAGGNAATAAAGCAAAGCATTTGAACMCCTTGTCCTCTGACCATTAACTTCG 

TT7TGAGGACCTCTGCAGAAAGTGNGAAAGTTCCANGGACAGGGGNCAGGGTGTTTNGGG 

Al'CCCCGCTGCNTTCCTTCTATGTTTTGTTCTCNTNTCTCTCAATTCTCACACACCCCCT 

ACNATATCCCGCTCAATTACCAACTTTCAGAAA 

Saquence 3754 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAACTGAAAAATAAGACAACCAATCCCATCAT 
GGTGCCAACCACTCTAGTTGTCTTGGGAAAGTCACTTATTTATCCATCTTGGCTTTCTTG 
CTCAAAAATGATGGATAGCTACACCTNACAGTGAGG ATTCA GTCATGTAAGCCAAAGTGC 
CTTGMGACAGGMGCCAGACATTATAGGAGCATAATTTTTGCCCTGAAATGCATTTGGT 

AT 

Sequence 3755 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTCTCGTAGGACTGCTCTACACTGGGCCTGTG 
TCAATGGCCATGAGGAAGTAGTAACATTTCTGGTAGACAGAAAGTGCCAGCTTGACGTCC 
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TTGATGGCGAACACAGGACACCTCTGATGAAGGCTCTACAATGCCATCAGGAGGCTTGTG 

CAAATATTCTGATAGATTCTGGTGCCGATATAAATCTCGTAGATGTGTATTGCAAACACC 

GCTTTTCCANTTTTCTTGGTTANTAGTGAAGAAT^ 

G N CCC ATGGTTGCMGTCATCGAAGTGCACAACMGMGGAACATTTTGCAGGAACACCT 
GATGAGGCTGCACCCTTGGCGGAAAGAACACCTGACACAGCTGAAAGCTTGGTGGAAAAA 
ACACCTGATGAGGCTGCACCCTTGGTGGAAAGAACACCTGACACGGCTGAAA 
Sequence 3756 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGACTGTAATGAGCTATGATTGCACCACTGCA 

CTCCAGCCTAGGCGACAGAGCTAGATTCCATTTCAAAAAAAAAAGGTGGTGATGGTGGGG 

CCCGGCATGGTGGTTTACACCTGTAATCCCAGCACTTTGGGAGTGCTGAGGTGGGAGAAT 

TGCTTGATCCCAGGAGTTCGAGACCAGCCTGGGCAACATTGAAAGACCCCGTCTCTTAAA 

AAAAAAAAAAAAGAGAGGTAATGGTGGCTCCCTTTCTCTCATCAACACACTCTGGGGGAT 

GAAAAGGATGATAATGTCTATGTGTGTGATAGATTGTAGAACGACTGCCACTGTGTTGAG 

TGAAGAAAAGGAGTATGTCTGCTGAACAGATTAACTTCTGGGTCTCCAATCANTATGTGT 

TGGTGGCCNTANGGGGACTGCATGACTTTAACT 

Sequence 3757 

TGGCGGGGGTCACAGGCAGCCATCATGTTCTTGGCATCGAAGACCTGCTGGGTGAGTTCC 

GGCACTGTGAGAGCTCGATACTGCTGGCTTCCACGGCTGGTGAGAGGGGCAAAGCCAGGC 

ATAAAGAAATGGAGACGTGGGAAGGGGACCATGTTGACTGCCAACTTGCGGAGGTCAGCA 

TTGAGCTGGCCAGGGAAACGGAGGCAGGTGGTGACACCACTCATGGTGGCTGAGACAAGG 

TGGTTCAGATCCCCCCCGCGTACGCNGGGTGGAGMGGGCTGCTTTTTTTCAGTCCTACA 

TTCCTTCAI I I I I I 1 1 i ICATTCTTGAATTCATTGTTTTGTGGGATCTAAGACCCAGGGG 

TC ATTTGAG AG G NTTG AC AGTATCTTTTCTGACC AGTTGCCAC ATG ACTTGCTTGACCCT 

GAGCCTGTGGAAATGGCATAGGG 

Sequence 3758 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCCCCTGCGGGCCTCTCTCCGTCGCC 

ATGGAAACGAAAGCGGCCAAGTAGAGCTCCGTCCTGACGCGCCGCCTCCCGTGGGCTCCG 

GCCGGCTAAGCCGCGGCGGACAACTATGCTGAAAGCCAAGATCCTCTTCGTGGGGCCTTG 

CGAGAGTGGAAAAACTGTTTTGGCCAACTTTCTGACAGAATCTTCTGACATCACTGAATA 

CAGCCCAACCCAAGGAGTGAGGTTTGAGTCCTGCTGGCCGGCCCTGATGAAGGATGCTCA 

TGGAGTGGTGATCGTCTTCAATGCTGACATCCCAAGCCACCGGAAGGAAATGGAGATGTG 

GTATTCCTGCTTTGTCCAACAGCCGTCCTTACAGGACACACAGTGTATGCTAATTGCACA 

CCACAAACCAGGCTCTGGAGATGATAAAGGAAGC 

Sequence 3759 

GGCNGCCCGGGCAGGTACCTGCTGTGTGCTGGACAATGTTCTGGGTGCTGGAGATGTAGT 

CGTGAACAAG AACAAGCCCTGAGCTTACATTCACCAGGGCTAGTGACTGAAG ACAG N CAG 

CAAACAGAGAAACAGATGCATNAACGGGATCATTTCAGAAGCCAGAATTGTTACAGAGAA 

AATAAACCAGGGGTCCAGTGCAGCAAGTGGCATTAGGGTGGCAGGTGTATAACATGGGCC 

AGGGGGGTCAGGACGACCACTCTGAATAGTG 

Sequence 3760 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC l I 1 1 1 I 1 M I 1 1 1 1 1 M 1 1 I M 1 I 

NCI I I 1 I 1 I I I J 1 M I 1 1 M I i I I I I 1 I I I I I I I 1 I I I I j I 1 I I I I GNNTTNNNTGTNTN 

ACTNATTTNATGTNATCGNGGGGNNNCAAAGGCNTTTGGTCTGAAAAAATCAAACTGCTN 

AATAAAATGCCNTAGTAAGTCCAACTATATGGTTTCATGTAGGGGTAAAAGTAACCATTG 

TGGAAGGATTACGATAAAACTGTGTGACAATAAATAGAATAAAAGATGGCAAAACAATCT 

GTNAATTACAAAGTGTTATAAAAATGCATTTGAATTTCTAGGTGTTCTTACCACTTTCCA 

TATTCCTCATTGGCCNCAATGGGGGGGACCCTNCCCTTCCTTAAAGTTGGGCAAGNNAAA 

AAAAAGGGTTTCCNCCCCCNCNTTGGCTTGAAAGGTNTGNGGCNAAAAACCCTTTTCCCC 

AAAC NTTTTTAAAAAAAAATCTTTTATG N N N N ACCCCCCCCCCAAAAT 
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Sequence 3761 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGG GCAG GTAC \ tut iNTTT 
I I I I I I l I I I H H I G NTAAATTTTTAGTATC AGG ATGATAC AGGGTTTTATAG AATG AG 
TTAAGGAGGAGTCCCTCTTTCTCATTTCAGGGCTTATTTGACTGGTAGGNTTNTAATTAC 
TGCTTCAATTTCATTACTTGCNTTCTGTTCNATTCNM TATTT CTTGTTCCTTCCTTG^ 
C AACTTTN GTTC AN GTCTTCCTTGTTCMTTCCTTGTGTTTTCC ATT AATTTATTC ATTN 
CCTCTAAGTTCGN GTAAAATAAGAATATGC7TTATAGTA GCTTC N TGAGGGATC N CTTTG 
TATTrCTTGGNGGGGGATCTATTTGGNGMTGTTCCACCTTTTTATTAAA 
Sequence 3762 

CCGCGGTGGCGGCCGAGGTACTGGACAGAGCTAGGAAAGCAAACCCATTTGCTTCTTCCT 

GCAGGAAACCCCTTGAGGTTAAGACCCCACAATCACATGAGGATGGAGTGGCTCACCCTC 

AGTCAACAGGCCAGACTCAAGGGGTTCCTGGCAAGTCATTTCTCCCTCAGGCCTTCGGTT 

TCCTCACCTACAAGATGAGAGGGCTGGACCAGATGGAAATTCAGGGGGTAAGGGGGATGT 

CCTCGCTCAGCCCACCCCNACCCCCACGGGACCCTGGAGCCTCCATCCCAGTTCCCACCA 

CGCGCCTGCTCCACAAATCCTACCCAAGGTGCTGTCATCTATATGATCTCAGTATTTTTA 

AACATTCAGCTTCCAGCCCATATGGTGGTTCATGC 

Sequence 3763 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC rTTTTTTTTTTTTI 
I lit } M I T \ GGCAGCTCAACCCTTCTTTAATGTCATCCAGGGAGGGGGCCAG GGAT GGA 
NGGGAGGGGTTGAGNAGCCANAGGCANANTATTTTrGGGTGGGATTCACCACTTTTCCCA 

TGAA 

Sequence 3764 

CTATAGGGCGAATTGGAGCTCCCCGCGGNGGCGGNCCGAGGGACTTTTAAGNAAAGGGCG 

GGGTTTNACTATGTTGGTCAGGCTGGTCTCAAACTCCTAACCTCGCGATCTGCC CGCC TT 

GGCCTCCCGAAGTGCTGGMTTACAAGCATGAGCCACCATACCCGGCCTGATACTTTTGC 

TTTTCTTATTGAGGTATAAAATGTMGCACCTTAAAATTCCCTTCCCTTATATGAACACT 

GCATTTTGCTGGATTTTTCAGACACTTMTTTCAACAACCAAGTGAATMCTCTTAACAT 

GGAAATTAGAGCTTGGTCCCAGTGACTCAGAGCTAAGGCTAATATTGAGCCTGCAGAAGA 

AATTTAATAACGGCCTTGTTAGTTTTTTCTGGGGAGTCTCTCCTGCAGATNTCCCAGCCT 

TCTCACTGAAGCCATGGNAGAAGCCTATT 

Sequence 3765 

GCGGCCGACGTACANATGGGGTCTTGCTATGTTGCCCAAGCTGGTCTTAAACTCCTGGCC 

TCAAGCAATCCTTCTGCCTTGGCCCCCCAAAGTGCTGGGATTGTGGGCATGAGCTGCTGT 

GCCCAGCCTCCATGTTTTMTATCMCTCTCACTCCTGAATTCANTTGCTTTGCCCAAGA 

TAGGAGTTCTCTGATGCANTAAATTATTGGGCTCTTTTAGGGTAAGMGTTTGTGTNTTT 

GTCTGGCCACATCTTGACTAGGTATTGCCTACTCTGAAGA 

Sequence 3766 

TTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGGTGTGCCCTGGTGCATAAA 

TAGNTACTCAGCTGTGCTGGCACAGTCAGAAGCTTGGACCGCATCCTAGCCGCCGACTCA 

CACAAGGCAGGTGGGTGAGGAAATCCANAGTTGCCATGGAGAAAAATTNCAGTGTCAAGC 

ATTCTTTGCTCCTTNGGGGCCCOTCTCCTACACTCTTGGCCCAGGAGATCCACAGTCAAA 

CCTGGNAGCCCATAAAAGGGACACCAAAGGGACTTCTCGNACCCAAAACTGC 

Sequence 3767 

CCGGGCAGGTACTACTCGATT'3TCAACGTCAAGGAGTCGCAGGTCGCCTGGTTCTAGGAA 

TAATGGGGGAAGTATGTAGGAGTTGAAGATTAGTCCGCCGTAGTCGGTGTACGCCGGGGA 

TTCCCATGGCTGGCCAGAGGAGGAACGCTTTGTGTTCTCATCGGAGCTGCATGGGAAGTC 

TGCATACAGCAAANTGACCTGCATGCCTCACCTTATGGAAAGGATGGTGGGCTCTGNCCT 

CCTGTGGCTGGCCTTGGTNTCCTG 

Sequence 3768 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACAGCTAGGCTCAGCAGCAAAAGTGAAGAATAAG 
AATGAAAGGTGAAAAGCTGACAGATGCCACGCAGAGCAAGGACTTCCATGAAGGTCCCGT 
GTCCTGTGT GCCAAA CACAGTCAACATTCCATCATTTTCTCATTGCTCTTAAACGAGCAA 
MCCAAATTCTTTTTTCTATTTAMGAMTATTAAATCTTTA 

GTGATCAGGTAAGGCAAAGATAAGTCCACCATCACTGGGGACAGCAGCAACATACAGACT 
TCAGAAATAGCCCTGAATCATCAGTAACATTCTATCGTGGTCCCGCGTACCTCGGCCGCT 
CTAG 

Sequence 3769 

AGGTACAGGTTTGGGCCCAGAGGTCTGACAGGAAGGGCAGTAGGAGAGGTGGTCGTAGAT 

GATGGGCCAGAGGGCGCAGGGCCTAGCAGGCCATGGAAAGGACTCTAGGTTTTACGCTAG 

GTAAAAT GAGGAGGCACTGCAGGGCTTTGAGCAGCAGAGTAACACAGTCTAACTTACTTT 

TTTTTTTAATGGATCCTACCAGCTGCTAAGTTAAGAATAATTTCCAGTGAGGGCTTTTGT 

AGTAAATTAGAGATGATGCTGGTTTGTCAATCTCCCCTTCCAGCAGAGGAATGTGTGGGT 

AAGAGAGTCACAATGTTTCTGAACAAAGCCATAGCTCCTAAGTGGTGAAGCTG 

Sequence 3770 

CACAAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAANCAAAACCCTGCAGAACGGC 

TGCCTAATTTACAGCAACCATGAGGCCACTTAAGGATGCAGCAAGAAGGAGCCATCTGCA 

ATCCAGGMGAMTTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACTTCT 

AGCCTCGAGAACTTACAAATGGTGATGATCATCANGTCAAGGATAGTCTGGAGCAATTGA 

GATGTCACTTTACATGGGAGTTATTCATTGAT 

Sequence 3771 

AGGTACI ITTTTTTTTTTTTTTTTTTTn I GCCCAACATCAACCCTTCTTTAATGTCATC 

CANGGAGGGGGCCAGGGATGGAGGNNAGGGGNTGAGCAGCTAGAGGNATCTAACTTACCT 

TCCANTTCAAACATTNTNNCAGGAACTAAANNNTATACTAAATCGTATGACGTNCCAAGN 

ACNNANCNAACGGGTTTANATAAAANGCACAGCATAANGN 

Sequence 3772 

ACAAAGAGTCACCGCAGAAACACTTNTGGCACACCTTCCATAGCAGTGTCTGGAACCTNT 

CTCTCCTCAGATCAGAGTCGATCTGAGTTAGATTTGAGTGAGTCATGTTACAGAAGACTC 

ATN AGGATACTG NTAAGC ATTAAAGAAAGCANTNCTTGTCCCTG GGG GCTTTTAN G AC AN 

TGGACTTNCTTGGMGAAGACTGAAGAAAGCATTTGATGCCTTAGTGTCCTTNGCAGNNT 

ATCGTACAAACACTCACCGTNNTGGCAAGTGGCCTA 

Sequence 3773 

CCGGGCAGGTACCCTGGCCTAGATGCCAGTAGCATATTCCCCAGTTGTCAAAACCAAATA 

GTTCTCCCAGCAATGCAAATTATCACCTCATGTGTGCGTGGTATT 

Sequence 3774 

TCACCAATTCCCCATCTTTATCTTGGTMTTTTAGTTCAACATTGCTCAGCTGATGGCTC 

ATfAAGCCAAGAACTCTCCCAGGTGTCAGTTACCATTTTCAGACAAGACACCTTCTGGGG 

GTGC 

Sequence 3775 

T/vCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGGGACA 

ACAGGGTTTGTGCTCCTATGAG AATTTAATG CTGCTGCTGATCTG ACAG G AGGTGGAGCT 

OAGGCAGCAATGCCAGCAATGGGGAGCTGCTGTAAATACAGATGAAGTTTCACTCACTCG 

CCCACCACTAACCTTCTGCTGTGTGGCCCATTTCCTAACAGGCCATGGGTTGGTACATCA 

GTTTACTTCGGTAGCCAGTGATTGTTATAGTTAATTCTGCAAAAAAACAAAACCAGTTAT 

GCAAAAACAAGAGTACCT 

Sequence 3776 

TATCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGC 

AATAACAAAATACAGCAATAAAACAACTGGACACTCCTAGGGGACACCAAAGATAAAGGG 

CCCATTAATCAGGTGGAGGCCAGAGAMCCGGAAGGCTATTTTAAAGTGTTTAATATAAT 
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CATGTAAGGAAAAAAAAATCTCCCCAAAACAATAAAGTCAAGCAATTTCATATCTATATG 

AACTTATTTTCATATCTATATGACTAAAAAATGAAAATGACCTTTGCTGGCATTATTGCT 

ACGCGAATATCCATACTTTCCCCCTTGATATTTAAGGACATGTGGGAGAACTTTATAAAC 

AAGTAAAGTGTGCAGAGAAGGCACTGTTTTGGCACTAGCAAAMGGGACAGTTAATTTCA 

CATGAATCACCGTGAACTTTAATAACCCATGATAT 

Sequence 3777 

AGGTAC 1 11 1 1 11 ) I 1 1 1 I 1 1 M 11 H H H 1 CTCTCAAATACATAGGGGCTTTTCCCCT 

GGGCAGTTNGCATTCCACMTTACTTGTTTAGNGTTTTCCAAGCTCCCAGGGCTGGGCTG 

NGTCTCCGNGCCTGCCCCAGGGGCTTTCATGGGAAANANGGGNTNNNGAAAAGGAGCNNT 

AANGCCNTMNCTTGATTTGNGNGCGAAGCTTTTTAGGCCATCCATCATTCAAAGAGCTG 

GAAAAG 

Sequence 3778 

CCGCGGTGGCGGCCGAGGTACTCAAAGCACGTTGCTGGGAGGCAGTGGACAGACTGCTCA 

GCAACTAACCGCTCCGTGAACTCCACATCGTTCTCAAATTCTGGGAAATGTTCCATCTCA 

ATTCCAACCTATACCAACAGGAGGGAGAGGCCAACTTATCAGAAGGAGGGGAGAACAGGC 

CAATTTCCTGTGCCAGAAATTAAGGCCAGGCCTACCACTCCTATGGCACAGTATTGATGA 

GAGACATCAGTGGAGCTCCCCACCTCCACCCCAGGCTTGGAACTTAGGGAATGACTGGTT 

TTTGTCATTTCACACAACAGCTGAAGGATTCTGTCCCCACCTCGTCCTCTGTGAAGTTCT 

GCTGGACCGTACCTGCCAGAATCTTCTGAAGATGACGTTTGCTTGAACACTGACCCACAG 

GGGTTTGATGGA 

Sequence 3779 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTGGATTTGAGAGATACCTCCCTC 

CTTCTGCTCAGCTGCCTTGCAGTAATTAAACTCTTTCTCTGCTGCAACACCCCTACTGTT 

CTCCGTGTATTGGCTTTTCTGGGCAGCAGGAAGGAAAAGCTGATGCGATGCTCTCAGTGC 

CGCGTCGCCAAATACTGTAGTGCTAAGTGTCAGAAAAAAGCTTGGCCAGACCACAAGCGG 

GAATGCAAATGCCTTAAAAGCTGCAAACCCAGATATCCTCCAGACTCCGTTCGACTTCTT 

GGCAGAGTTGTCTTCAMCTTATGGATGGAGCACCTTCAGAATCAGAGAAGCTTTACTCA 

TTTTATGATCTGGAGTCAAATATTAACAAACTGACTGAAGATAAGAAAGA 

Sequence 3780 

CCGGGCAGGTACAAGAAAAAAACCCCACAGCCTAANTNTCACACGGCTTCTCAGCAGGAG 

TTGGATTAMCTGAGAAAATMTTTTAATATCAAGGGATAATGCACGCAAGATGGGAAAA 

CTTTAAAGCAAACTCTGCAGTGAAAAGTGTTTCCCCCACTCCCTTGGGCGAGGGAAGGGA 

GAAGAAGATAGATAATGAGGAGAACTATAAGGCCCTGGATTCAGGTTTCTGAAACTGNTT 

TTTTTTTCMCCCTNGGAGGACMTCAGCTCTTCCTGMTCCAGAAAAGCACATCAGAM 

TGCCCAATGACAGACCACAGCCAAATATGGCAGGGCAAACTGATTTGGTTCTGCGATTTC 

CATTCTCCTCTCCACATNCGAGAAATGCCTTCAGCTACAGCTAAA 

Sequence 3781 

CCGGGCACGGTACGCGGGTAGAAATGAGCATACAATAATAGG/\ACACTATGAAAAAGAAG 

GGTAATGAGAGGTGGGACCAAGTCTATAAGATATTAAAGGGTAAGTCTCTCCGGCCCGGT 

TTCCCTCGGTGTGCTACTGTGCGCGCCGATCCAGCACCATGGGGAAGCGGGACAATCGGG 

TGGCCTATATGAACCCAATAGCAATGGCGAGATCAAGGGGTCCAATCCAGTCTTCAGGGC 

CAACAATACAGGATTATCTGAATCGACCAAGGCCTACCTGGGAAGAAGTAAAAGAGCAAC 

TAGAAAAGGAAMGAMGGCTCCAAGGCTTTGGCTGAATTTGAAG/\AAAMTGMTGA 

ACTGGAAGAAAGAACTGGAAAAACACAGAGAGAAA 

Sequence 3782 

TCTCCCCGCGGTGGCGGCCGAGGTACCAGACTAGACATGGGCTTCCTTACTATGTCCCGA 
CNCATGACAGATAGGCATCACTATTCAAGAGGACTGGGGCTTGCAACCACAGATTAGCAT 
CTTATTTGCCATTTAGGTCCCAGATTGCTTGAGCCCAGGAGTTCAAGACCAAGCCTGGGC 
AACATAGCAAGACCTCATCTCTACAAAAATAAAAAAAATTAGCCAAGCATCATAGTGCAC 
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GACTGTTGTCCCAACTACTCAGGAGGCTGGGGCAGGACGATTGCTTGAGCCCAGGAGCTT 

AAAGCTGCAGTAAGTGGTGACTTGCCACTGCACTCCAGCCTGGGCAACAAAGCAAGACTC 

TGTCTCAAAAAAGAAGAGAAAGCTGAAAGGGTTAGAGACTCAGCAGATGTTTCTAGATAA 

GACCAGTTTG 

Sequence 3783 

CCATTTGGAATCCGGCAATNGTNTTTNGCNAAGGNCGCGGCCGCnCGGGCAGGTACTCGG 

GTGACCCAAGGTNGACCATTTTAANTTTCTMGTTTTCACCGAGATTAACATGCATTTGT 

TGAAAGACAAACCTCTTTTACCCTACTCCCCCAGCTGCAAATGCTTrCAGGGTTCACATC 

ATGTTGGAACATTTGGTTACAGTGmCCTAMCmGCGGGTAAAMTTGTTCAAGTAG 

GTAAAAATG G AG C AC ATACAAAAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 3784 

CCGGGCAGGTACATGACTGATATTACACCTCAAGGTGTGGCTATGAGAGCTGGAGTTCTG 

GCTGATGATCACTTGATTGAAGTGAATGGAGAGAATGTAGAGGATGCCAGCCATGATAGA 

AGTGGTTGAAAAGGTGAAGAAGTTCAAGGAAAGCCCGTTGGTCCATGGTTCCTTGCTNGG 

TNGGACAAAANGAAAACTGGACAAGCCGTTCATNGTTTGNAGGCANGAAAGTATANCAAA 

TTTCMCAGGGAGTAMNCAGCCCAGNTTTGAAAMCNTGTTTACCCCTCANTCAAACNC 

CCCGGAATTTGTNGGNANGNATTGAAAANAAAAGGGAAAGCAMTTGGGCTTATNGGNTT 

TNCTNATCTTGAAGGGGCCA 

Sequence 3785 

TCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCACAATG 

GCTGAGCACTNACTGACNTTGCTGGTAGTGCCTGCCATCAANAAAGATTATGGGTTCCCA 

GGAAGACTTCACTTCAAGTGTNGGAACACCACCCATGAAAGGGGCTTCAAGTGCTTGTGG 

GCTNCACCAACTATACCGGATTTTGAGGGACTCACCCTACTTCAAAGAAGAACAGTGCCT 

TTCCCCCATTCTGTTTGCAATGACAACCGTCACCAACACAGCCAATGAAACCTGCNCCAA 

GCAAAAGGCTCACCGACCAAAAAAGTAGAAGGGTTTGCTTTCAATCANCTTTTNGTATGA 

ACANTCCGAAACTTAATGCCAGTCCACCCGTGGGGTTGGTGTGGGCATGCTTGGGAATTT 

GGGGGGGTCTCCGAGCTGGGCTTGCCCATGGATTGTTGTCCCATTGTATCTNGTACCTNG 

GGCCGCTCTAAGAAANTTAGGTGGGATTCCC 

Sequence 3786 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGGCATGACAGGCTGGGAGGTGGCTGGTGA 

TCACATTTACACAGCTGCTGGAGCCTCAGACAATGACTTCATGATTCTCACTCTGGTTGT 

GCCAGGATTTAGACCTCCGCAGTCGGTGATGGCAGACACAGAGAATAAAGAGGTGGCCAG 

MTCACATTTGTCTTTGAGACCCTCTGTTCTGTGAACTGTGAGCTCTACTTCATGGTGGG 

TGTGAATTCTAGGACCAACACTCCTGTGGAGACGTGGAAAGGTTCCAAAGGCAAACAAGT 

CCTATACCTACATCATTGAGGAGAACACTACCACGAGCTTCACCTGGGCCTTTCCAGAGG 

ACCACTTTTCATGAGGCAAGCAGGA.V\GTACCTCGGC 

Sequence 3787 

AGATACCAGGCCGTTTCCCCCTGG^AGCTTCCCTCGTGCGCTTCTTCTGTTTCCGAACCC 
TGCCGCTTACCGGGATACCTGTCCCGCCTTTTNTCCCTTCGGGAAAGCCGTTGGG 
Sequence 3788 

CAACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGCTGAAAGC 

GAAGTCCACGTTGGCGGAGGCT/ATCCGAGGTCCACGTGTGTCCCTCGGTCTTGGTTCTC 

CTGGGTCAGATTCTCCTCGTCA^ t GTGGGCTGTTAGGGTGGCAGAGGACAGCAGGGCAGAA 

CCCAGCCGCCAAGAGCCCCAGXGCCAGGAGAGGTAACATTCTCTCCATTCTCAACTCTCA 

GGGAGCTGGATGTAGCTGGTGGATTCCAAAGCCGTCTGTCTGGAGTCCCCGCGTACCTGC 

CCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnGCGCGCTTGGCCGTAATCATGGTCATAAGCTGTTTTCCTGTGTGAA 

Sequence 3789 
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TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATGGTCGGGGTG 

GAGCGGGGCATCTTTACTGTCAGTCACTGGCACTTGTCCACTGTGAAGACCTGATGAACC 

AGAGCATTTCCTCTTCTTGTTCTGCTCACCAGCCAGCTGAGGGAATGTCGATGCTGGCAA 

TGGGCACAGTGAGTCCCTTCAGGACACTCAGGATGCTGTGGAACGGTTCCCGAACATCGC 

CCTTGAAGCTGAAGCCAAAGATGGCATCCACCACCAGCT CATAC AGTTCATCAATCGTCA 

TGGGCTCTGCGGGCAmCCCCAAGGAAAGGGATGTCCATTTTCTGACACTGGGTCACCA 

ATGCAAGTGAAGAGGGGGCTTGTTAGGCCTTTTGGGGTAATAGATGGTTGGCTCGTAGCC 

AAAGAGTTTGAGGTGTCGAGCACAAGACCAGACCATCTTCCTCCATTATTCCCCGG 

Sequence 3790 

TCACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTACGGAGTGA 

CAAAGGGAAAAGTCTGCTTTGAGGCAAAGGTAACCCAGAA TCTCC CAATGAAAGAAGGCT 

GCACAGAGGTCTCTCTCCTTCGAGTTGGGTGGTCTGTTGATTTTTCCCGTCCACAGCTTG 

GTGAAGATGAATTCTCTTACGGTTTCGATGGACGAGGACTCAAGGCAGAAAATG GACAA T 

TTGAGGAATTTGGCCAGACTTTTGGGGAGMTGATGTTATTGGCTGCCTTGCTMTTTTG 

AGACTGAAGAAGTAGAACTTTCCTTCTCGAAGAATGGAGAAGACCTAGGTGTGGCATTCT 

GGATCANCAAGGATTCCCTGGCAGACCGGGCCCTTCTACCCCATGTCCTCTGCAAAAATT 

TGTGTTGTAGAAATTAAACTTCCGGTCAAGAAGGGAGGGAGCCCTTCTT 

Sequence 3791 

CACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCGTTAGCTGAAGGCCAGCAGG 

GACCGCAAAAGTTTTTAGGCCCCTTANGTCTTTAGGTTGGCATACGTATCANGGACAGTA 

ACTACCATGGCTCCCGAAGTTTTGCCNAAACCTCGGATGCGTGGCCTTCTGGCCAGGCCG 

TCTGCGAAATCATATGGCTGTAAGCATTTCCGTGCTATCCCTGGG GGTT GCANGCTTTTG 

TNTAAGTTTCGTGTGGGCTGATCAAAGAMGAAGGCATACGCAGATTTTCTACAGAAACT 

TATGATGTNCATGAAMGATTmGAGGAGATGAAGGAAGGCTGGTATCTTTCAAGAGTG 

TAMGTNATCTTGGGMTATAAAGGAATTTTCTTCAGGTTGAANTTACCCTANGAAAGTT 

TTGGTCCACTGACCTTTGTGGTTTCCCTTGAAACCTATTGACCACCATTGGAAATAATGG 

TGGGGGCCTTAMGNAAAMnAAGmCCCTTCTTGGMTANANATTAAAAACCAMTT 

TAGGCCAA 

Sequence 3792 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC ITTTTrTTTTTTTTTTI 
ITTTTTm i GCGGGGGGAGCATGGGACCTTTATTCGTTAANACATCAGGCTCCAGATAT 
GAACTTTCAGCAGAAGCGCTTGCCGGGAGCAAAGGGACAGAAAAGCTGANATGAACAAGT 
GCCTGGCAGNAATCACAGCCCGGGCAAGGGTGCTCCAAGCCTCGCATCCCCCGGC 
Sequence 3793 

CGAGGTACTTGTGACAGGCAGACGTGATTGCAGCCACGAACACGATGAACTCACTGAAGT 

CCACCTGGGCATCTCCATTGGCGTCCAGGTCCTTGAGCAATTTATCCACGGCATCCTTGT 

CTTTTCCACTCTGCAGGAAGCCTGGTAGCTCCTTCTCCATCAGCACCTTGAGCTCCCCCT 

TGGTCAGGGTCTGCGTGCTGCCCTCGCTGCCCGAATATCGGGAAAAGACGTCTATGATCA 

TGCCCATGGCTGTCTCTAGTTCCGTCATGGTGCTAGATTCAGACCCACCTTCCTCCTGGG 

GGCTGGCAGGGCCGAGAAAATGTCCCCGCGTACGCGGGGGACTCAGATCCCAGCCAGTGG 

ACTTAGCCCCTGGTTGCTCCTCCGATAACTGGGGTGACCTTGGTTAATATTCACCAGCAG 

CCTCCCCCGTTGCCCCTCTGGATCCAC 

Sequence 3794 

CCGGGCAGGTACGCGGGTTCTCATTCTTTTGTCCTTTATCTTCACTCTGAi fl \ iCTTTT 

GTCATTCAACGCTTACTCCCTTCCCCATACCTCAGTCCTCCAGGTGACACCTGGGCGCTT 

TTCTGCCTGAACAGCATTTCCCACCAACGGCCACAGCAGTCGGTCAGTGTCCTCTTGAGC 

ACCTCCTGMGCTGCAGTGTCATTTGCTGTTTGCCCTAAAAAGCCACGCCCAGATTTCTT 

CTACCGCTGCTTCCCTGATGGGCTAGCCCATTCTGACTTGATGTGTACCT 

Sequence 3795 
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GGCG GCCG AG G NACTGTGCTCAMGCTG AAMTGTTTACTGTCTTTCCCCTCACAAAGTT 

TTCAGACTGCTGGCCTAGGGTTTTCAGGGTGGNGGATAGAGACGGTATTAACCAAAACAG 

AAAAATCATCCATTATTATGATGATGGGTCAGAAAGAAAGACGGTATCAATCTTGGAGCA 

CTTAAATGGGAGGTGTCAAAGGACACACAGCAAGGCAGCCCAGTGTGAAATGGAAAGCTC 

ATCCCACTGAAGGGGCGTGGCACTTGGCTATCCCCTCTGCCTANTACCTGCCCG 

Sequence 3796 

ATACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGAGTCCAATTTT 

CCTACCTTGGCTTGTGCATGAGGTTGCCTGGTTGCCACGGTGCCTGCTTTTCTCCCTGTG 

TCACCTCCACTGCCTCCTGAATAACAGTCTCAGGTTTGTCTATGGCCTCTGCAGGGAGCC 

GGGATTCCCAGACAGAGCAGGCCACTTAAATCAGGTGGGACGCCTCTCCCCTCTTGGCCG 

GAGTCCAACTCCACGCAGGCACAGAGATCCCAGATGGGCCCCCAAGTTTGTGAGAAACAC 

A TTCA TCTGTCCAAACCCAAAGAATGGACTCAGAGACACAAAAGAACAATGGAAGAGAGA 

ATTTTAATGGTGATCGTGCAAGAATGGGTGTCTGGTAGGCAGGCCCCGCGTACCTGCCCG 

0 

Sequence 3797 

TTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGNCGAGGNACTCNNTNTNGTTCNTTTNNN 

C NCWTG NCTTNTCTN NN ATG CTG NTN ATGTCATTC ANTGGGTGC ACTAAAGG ATCTNTT 

ATTTTATGTAAAACGTTGGGATTGACAAGATAGATCTGACACTCTGTTAAGTTACCCTCT 

GAAGCTACTG N NTTGN GAAATACTAATGACAGN ATCATCCTGCC AAGCGAAAGAGGCAGG 

CATNAAGCAAGGACAAATTAAAAGGGGGTAAGAGCCTTATCATGATGAGGAGTCTTGNTT 

TGACATCTTGG 

Sequence 3798 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGT 

GGACAGACTGCTTTGCCTGTTGTTGCTCTTCGGAGGCGGCGATCCCCGAAGGCGAGCTGA 

AATACGGCTGCAGGCTACAATTTGCAGCCGACCATTATGGATGACAAGGAGCCGAAGAGG 

TGGCCCACCCTCAGGGACCGCTTGTGCTCGGATGGCTTCTTATTTCCCCAATACCCCATT 

AAACCGTATCATCTGAAGGGGATCCACAGAGCTGTCTTCTATCGTGATCTGGAGGAACTG 

AAGTTCGTTCTGCTCACGCGTTATGACATCAATAAGAGAGACAGGAAGGAAAGGACCGCC 

CTACATTTGGCCTGTGCCACTGGCCAACCGGAAATGGTACCT 

Sequence 3799 

CGAATTGGAGCTTTCCGNGGAGGCGGCCGCCCGGGCANGGTAC I I I M 1 i II I I I 1 I I 11 

TTTTTGCAGTCACATGNTTGGCAGGGACCAGCTCAACCCTTCTTTAATGTNATNCAGGGA 

GGGGGCCANGGATGGAGGGGAGGGGNTGAGGANCNANAGGCACCTATTTNTGGGTGGGAT 

TCACCACTTTTTCCATGAACAGGGGAMCTTGGCTATTTTGGTCAATCATTAATNAAGAC 

AAAGGGTGTTGTTGAACTTGACCTNGNGGGGGATATGACATGGGTATOGNCTCTAAA 

Sequence 3800 

CTCCACCGNGGTGGCGGCCGCCCGGGCNGGTACAGGAAGGGCCATGCTGTTATTACTCTT 

ACACAAGGAGGCAGCCCTCGAGCCACAGGGTCCAGCTGTNGGCTA1AATAGCCTACCGGT 

CTCTG ATGATC ACCATGTTN CTGG AATTCAAGCCAGGAAG AAGCACC ATCTGTCTTCTG 

GATTAAAACTGAANATCAACCTACTTTCAACTTACTAAGAAAGGGGATCATGGACATTGA 

AGCATATCTTGAAAGAATTGGCTATAAGAAGACTANGAACAAATTGGACTTGGANACATT 

AACTG 

Sequence 3801 

AGGGCGAATTGGAGCTCCCCGCGGTGGCAGGCGGCCGCCCGGGCAGGTACAAGGCATGCT 

GCTTTGATGATCTTGTGGCGGGGTGTGCGMGGTAAGGCCTATTGACTTTCTGGGCMGG 

TCTTCAGTGACCATTTCTAGGTCTGTTTGCATTAGGTTCAAA 

TTTAGGTTCCAAGAAAAAATCCATATGGGATAAGAACACTTGAGAATGCGCCGAG I I I I I 
MGAGCACATGAACTGCATCTTCMTGAMGTGGTATCTGGACAGCCTCCTTCAGGTTTC 
TTTG AG AGCTCTG CTCAATTGCTC CATCTTTTCTTTG G CTGTTTTAAGAAGGCCGTTGTT 
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CTAGCTGATAACTATGCTCATGGTTTTTAAATCTTGTATAATAGTGCATAAATCTGTCAA 

GTTCCTGAAATCGTTTGTGTTTTTTC 

Sequence 3802 

CATTGCTTTNGGCTTTTGTCTCTTTCTCC 

NTGATCCNANGATNCC'l Mil ICCCTGCTTTGGCTGCCTCTGGNGAGGGTCATGTGTCAA 
AGCAGANACGGGNNAAAAGCCAAACCGACACAATGANCGTNCTNANAAAGGAAACTTCTT 
CGGAATGAAAAG 
Sequence 3803 

GNGGCGGCCGAGGTNCGCGGGAGAAATAAAATGTGGTCCTAATGCCAGGACTAAAATGGA 
CCTANTGCCAAATTAGTCCMTTGNAGCCTTTGCTTCAGCTGGATCATCCCGTGTTNGTT 
ANGAGCTCAGCTGTGGCCA 
Sequence 3804 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTN I I I I I I I 1 1 III INTCCC 
TAGGCAAGGAGGGATGATNATTTATTAGCTTCTACAGATNACACAATGGGGTGGGGGTGG 
GCTCAAGGTGAG 
Sequence 3805 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGCCTGAGAAG 

GACTTTTCCGAAAAGGCAAMCCTTCACCCACCCCATTGGGAGAAAAACCCAGACGCCAG 

GAAMGMGAAACAACAAAGGCAGGAGAACAGCCACTTTCAGACTTGAAAATGACAAAAC 

CCTCAATTTGAGCCTGAGCCCCCGGCGCGGGGGCTGCTACACTACAGGACACCCAGCATC 

GGCTTTGACTGCAGACTGTTCACCCACACGAGCCCTGTGCTTTTGGTGTAAATAATGTAC 

CTGCCCG 

Sequence 3806 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACGCGGGGGG 

AGACCATTGCTCTATAGCAAGACCAGACTTTGCCCTTCCTCCTCTCAGCCTACTCAACGT 

GMTATAATGAGGATGAAGACCTTTGTMTGACCTTTTCCCACATAATGAATAGCCATTG 

GGAGACACACTTCTGAACACCACCACTGGAAAATCACACATGCTGAAATGGGAGAGTTCC 

CTGACCCCCTTGCAGGATATGTGACAGGAGTGTGGCTCATCTGTTCAGCTGGAGTGCATA 

CTCAAACCCCTTATGAGACAAGGAGTATGCAGACAGGAAGGTGCAGGGAACTGGGGTGAG 

TACCTCGGCCGCTCTAGAACTAGGTGGATCCCCCGGGCCTGCAGGGAATTCGATATCAAG 

CCTTATCGATAC 

Sequence 3807 

CTTAGGGCAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC 1111111111111111111 

TTTTTTTCGCACTACCTCCCCGGGTCGGGAGTGGGTAATTTGCNCNCCTGCTGCCTTCCT 

TGGATGTGGTANCCGTTTNTCAGGCTCCCTCTCCGGAATCGAACCCTGATTCCCCGTCAC 

CCGTGGTCACCATGGTAGGCACGGGCGACTACCATCAAAAGTTGAATAGGGCAGACGTTC 

NAATGGGTCGTCGCCNCCACGGGGGGCGTGCANATCGGCCCCACGNTGTACCTGCCCGGG 

CGGCTCGTTCTANAACTANGGGGATCCCCCGGGCTGCAGGAAATTCGATNTCANGCTTAT 

CNNATACCCGTCNGACCTNGAAGGGGGGGGGC 

Sequence 3808 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTTCCCTTCTGAT 

TTGGTCTAGATACCAGAATCCATTCTCTTCCGTCAAACGGAAGACACAAGGCACCTGAGG 

CTGATCCTTCCCAGAAATTAACTCCAGAGGCTGCCACATCTGGTATGAGCGTCCAAACCC 

AGCATCGACAATGTAGTTCCTGCCATCAATGGTCACCTGCAGGAGAAGGTGAATCATGCC 

AGTGCTGTATTTTTTGGCTGGAGTGCTGTAAACATACCCTCCCAACATCGTGGNCTCAAA 

ACCAATAGTGGTCAGAGCCCAGTACCTGCCCGGGCGGCCGNTCGAGTTAATTGCGCGCTT 

GG 

Sequence 3809 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTNNNI I I 1 I 1 1 I I 1 I 1 I I CT 
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TCTTTTAGGNNNTCCTNTCTTNMTTTNTGGGTTTTCTTNTGCATTCTNNNNANNGTCAA 

AGCTTACCCACAATCTATAANCANNNGAGGANTTACCAGGAACCNGCGTGGTGAGGAGAT 

GCCANAGGCANAGAAGAGAANNAAGTATTCCAAACATGAGGGGATTCCAAAGANAGAAGA 

G NGTCCC AAAC AACATTTGCACAAACCTGATGAGGAGAGAGAATGTG GGGTGGGGATGGA 

TGATGACNACTGAANAAGAAAGCCAGGTCTAGGATAATCAGGTGGCCTTGTACCTGCCCG 

GGCGGCCCGNTCTAAAACTAGTCGGGATCCCCCGGG 

Sequence 3810 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGCCGGTGCGGAG 

GTCAGGGACAAGATGGTGCCACCGGTGCAGGTCTCTCCGCTCATCAAGCT 

Sequence 3811 

ACACTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACMCTTTTTAACA 

TCCTCCAAAAACTTATCCACTAGCTTCAGGCCCTCGCTGAGGAACAGGCCATTGCCGGTG 

GTCAGCTGGAGCTGGCTGTCTGGCTGGTTGAGGGTATGGAGGAGTTCCTGGAAGCCTTCA 

TGGATCTGAGCCTCCGGAAATCTCCGTGAGGTTTGAAATTCAGGCCCTCCAGGATTTCAT 

CGTGAGTGTCAGTCTTGGTCCCCAGGGAGAGCATTGCAAAGGCTGTAGCGATGCTCAGAT 

TCACTGTCCCAGGTCAGTGGTGGTGCCTGAAGCTGAGGAGACAGGGCCCTGTCCTCGTCC 

GTATTTAAGCAGTGGATCCAGAGGGGCAACGGGGGAGGCTGCTGGTGAATATTAACCAAG 

GTCACCCCAGTTATCGGAGGAGCAAACAGGGG 

Sequence 3812 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGANGTACGCGGGAGGCATTGAGG 

C AGCCAG CGCANGGGCTTNTGCTGAGGGGGCAGGCAGGAGCTTGAGGAAACCGCAGATAA 

NTTTTTTTCTCTTTGAMGATATAGATTAATACACCTACTTAAAAAACATAGGCAATAAN 

GTTACTAAGATATTNCTTACCCGTTAAG 

Sequence 3813 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGACCACAGCTCACTGCATCCTCNACCTCCCA 

GGCTCAAGCGATCCTCCCACTTCAGCCTCTGAAGTCTCTCATATGGTGTCCAAGAAATGG 

TGACAAATCTCACAAAGGGACTAGGCTCANCAGGGCTGGAATATTCAGGGAAGGTGTTAA 

AGAAAAAMGAATGAAACTTGAGTTTGGCTTTTTGAAGAAGATGCATAGGACTCCCACAG 

GCAGAGTGAAATAAGGGGCATTTTAGATGGACAGACACACAGACAAAAGCAGAAATGTGG 

GTGGTGTGACTGGGGTTTGGTGAGGGGCTGCTGTGGCTTGGAATGGANGGGCTNCCACAA 

ATAATGGGAAATGGTAAATGAGGGCAAGGTAAGGTTGGACCTGGTGGCCATANCGTCAAG 

GTTTGCCCAGCTTTATT 

Sequence 3814 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACl 1 1 1 1 I I 1 1 1 1 1 1 1 I 

mrnrn ittttat i 1 1 1 1 n i n 1 1 n n 1 1 1 n i n 1 1 u i i igggnggttnaaaa 

NATTTTCACTTTATTATTATGCTTATMTATTATTCCMCANACTGNNTTAAAGGNNGGG 
NTCNCTNACNCAAAACCCCACAGGGGCGAAAAGGCNTCCGGGCCCNATTGCAGGACGTGG 
CCTGTCGGGCCAGGGTCGNTGACATGCNCCCTGGNAGCTCANACNCTGGTNCCCTNAANA 
NAGGCTGNAGGAATNGGGACMGACANATGCCNCCGGGACTTTTAAAAGCTTT 
Sequence 3815 

CTACTTAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I 1 1 1 H 1 1 I 1 1 I M 

irmTrm iatttagttttgtttaggagtacacgtccctagaagcacatagctccagc 

MCAAAGCTAATCTGANAGTMTACCCAAGATGCCTCCTACCTTGCAAATTTCAAGGAAT 
TTCtCACTGGTGGTTTTTCATTANGGATACTCAGfGAAAGTAGCATAAGGAACTTCAGTG 
GACCATGGGTTCCAGCGGGACANAAGAGACTGCTCCTCCGGACTCCCCCAGTAGATCCTA 
AGGCCTTCTCCTTGTCTCTTGTCCAGGGGACATCCCAGGGGAAGGNGAACT 
Sequence 3816 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGTCTAGTTC 
CTGGTTGCCGGCTGAAATAACCTGAATTCAAGCCAGGAAGAAGTAGCAATCTGTCTTCTG 
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GATTAAAACTGMGATCAACCTACTTTCAACTTACTAAGAAAGGTA7TAAGCGCCTTTCT 

GAGAGCCTTTCMNGGGGCTTCCTTAAGCACCGTTCACTCTGNCTNCTTTAAGTNTTATA 

TTTATGATCAAGATGAAAGGGAGGGATGTNTCACTGCACAGAGATTCTACAAGTGGATAT 

ATAAAGCTMTMTGTTTTCAGNGAGCTCTTCTCTGGCAACTACTTTNCMGTGNCTTTC 

AGCNTTCCACCTTrGAAGCACMCGGTTAATAATGCACAATTTCTTTNTNTTCAGCCCAC • 

AAGAA 

Sequence 3817 

AGGTACTTCATTCCACATTCAATCAAAGCAAGTGCAACCAGCACAGGTGCCCTTCCCAAT 

CCTGCAACACAATGCACTGCAACACAGCAACCTGGCTCTTCACGAAATTTGG I I I I IAAC 

AGGTTTAACCAATCATCTACTATCTGATTAGGGGGGTGGGAGCTCCATCATCAAATGGCC 

AATCTAGMCGTGGATTCCTTCTTTTTCAACTGGAGCTTTATCATATGTAGCCCCGCGTA 

TCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAATTTNNATAT 

TCAAAGCCTTATCGATACCCGTCCGACCTCGNAGGGGGGGGCCCCGGTACCCAGCTT 

Sequence 3818 

CGTACTGNANCCTATCCTGTCCACCTTGCGCGCCTGAACNTNAGGGCCGGACCGGCCCGC 

GTAAGCCCCATGCCCCGCACGCCGGGGCA I III i i I I GACCGCACGGCAAGCCGTTGCTG 

GACAMTGGTTTTTTTCGCAMCATmGCCTGTCAGNAAATTATCGTC 

AACAGAATCNGGCCGTGCCACACTGCGCGGCTCCCGCAAGAATATACGGCCGCTCTAGAA 

CTAGTA 

Sequence 3819 

GGGCGTTTTGGAGCTNCCCNCGGTGGCCGGCCGCCCGGGCANGTACTTTN l rTTTTTTTl 

NTTN ITTTTTTTTn i AAAACGN CTCATCGCAGTTTATTAGCCTCTACAGTTTTTAAATT 

TACTAAANCAGGGCCNAAAAGTACAGGTTTNNGTNNAGATTTTTTTGGGAGGCTCACAGG 

CAGCATGACATGACTTTATMGGGAAAAAGGGGCAATNCNANTGGGTTTTTTTAAANAAf 

TTTAANTGGMCTTGMCTTTTCCTGGGCCTCCCAMCAAGGCCAGGGCTTGNGCTTGTN 

AGGGTAAAGCCCTN7TCCATCCAAGGTGGGAAGGGANCACCANGTCCCTCCAGTTTTANA 

CCAGGAGGAAAAAATTTANGGACCCCAAAAAAAGAATGTAMGTTTTNCCCGNCGGTAAC 

CCTTNNGGGCCCGGC I Mil I AGNAAACCTANGNTGGGGAATTCCCCCCCCGGGGGCCTT 

GNTAAGGGGAMTTTTCNGMTANTCCAAAGGCCTTTTNTTNNGGAATAACCCCGNACNN 

NAACCCCTTNCTAGGGGGGG 

Sequence 3820 

AGGTACTGCTTTGAATGTGACTGTTTCCGTTGCCAAACCCAGGACAAGGATGCTGATATG 
CTAACTGGTGATGAGCAAGTATGGAAGGAAGTTCAAGAATCCCTGAAAAAAATTGAAGAA 
CTGAAGGCACACTGGAAGTGGGAGCAAGTTTTTTGGCCATNG N GCCCAGGGCG ATC ATAA 
GCCAGCAATTTCTGAACGGCTTCCCGATATCAACATCTACCAGCTGAAGGTGCTCGACTG 
CGCCATGGATGCCTGCATCAACCTCGGCCTGTTNGGAGGAAGCCTTGGTTTNTATGGTAC 
CTGCCCNGGGCGGGNCCGCTTTTAGAAACTTAGTGGGAATCCCCCGGGGCTTGNAAGGAA 
ATTTCGNNTTTNAAGCCTTANTCNGNATANCCTTCNAACCCTTGAAAGGGGGGGGGCC 
Sequence 3821 

CTTCTGCTGAGGGGGCAGGCGGAGCTTGAGGAAACCGCAGATAAGl I 111 l I CTCTTTGA 

AAGATAGAGATTAATACAACTACTTAAAAAATATAGTCAATAGGTTACTAAGATATTGCT 

TAGCGTTNAN 1 1 M I I AACCGTAATTTTTATTAG CTTTAAG ATTTTAAG AG AAAATATN G 

AAGGACTTAGAAAGAGTATCATGAGGMGGAAMGATAAAAGGTTTCTAAAACATTGACG 

GAGGTTGAGATGAAGCTTCTTCATGGAGTAAAAAATGTATTTTTAAMGNAAAATTGANA 

GAAAGNGACTACANGAGCCCCCGMTTTMTACCAATTAGAAGGGGCACNTGCTTTTTAG 

NATTAAAA 

Sequence 3822 

TCCCNTTTGTGGCGGNCGACGTACGCGGGACCACTTCTCTGNGACACATTGCCTTNTGTT 
TTCTCCAGCATGCGCTTGCTCCAGCTCCTGTTCAGGGCCAGCCCTGCCACCCTGCTCCTG 
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GTTCTCTGCCTGCAGTTGGGGGCCAACAAAGCTCAGGACAACACTCGGAAGATCATAATA 

MGAATTTTGACGTTTCCAANNTCAAGTACCGTCCAAAATGACNGAAGNTNACTGCAGTG 

CTTGCAGGTTC AAGC N AGAATTGAAAG AAATG CATGGGTGGGTTAAAACTTACCTCATTA 

GCAGCATCCCTCTACAAGGTGCATTTAACTATAAGTATACTGCCTGCCTATGTGACCGAC 

AATCCAAAAACCTTCTACTGGGACTTTTTACACCAACAGAACTGTGCAAATTGCANGCCC 

GTCGTTGGATGTTTATTCGGGGAATTAGGGCATCTGCCCCTGAATGAATGGCTGGCTTGT 

AATTCNCCCCAATTCAAAAA 

Sequence 3823 

AGGTAC 1 1 I I I I I I I I I I I I n I I I I i !C I I II I ! I I I I I I I II I I II I M I I M 1 1 I N 

GGCCATCAATTTATTAAAATAAACATNTNTAGCNGGmNMCAATTGTNTTGTNNmG 

TNGTAAAAAGACTTNANAMGANAAGGNGNGGTTTGCANNAATCCCTAACTGGTTTTTCA 

NCANACCCTGCNATTTTNNNGNANCCAAMGGTTTTTNCAAAAAGNTTTAAAANTAAATC 

NNCAAAAGACTGTTTTGTANCCCNCCNNTACCNTTGCCCGGGGCGGGCCCG 1 1 I I IAAAA 

MCNTAGTGGGGATCCCNCCNGGGGCTTGCAGGGAATTCNNATTTTTTCAMNNTTTTAT 

TTNGAATNCCCGTNCNGACCCNTCGGAGGGGGG 

Sequence 3824 

AGCCNAAAAGGGCCAGCCCACAAAGGGCCCNGGNAACCCGGTAANAAAAGGGCCCGCCGT 
TTNNCTAGGGCGTNTTGTTTCCCAATAANGGCCTTNCTCGGCCCCCCCNCTTGTACNGAA 
GNCCATTCCAANCAAANANAAAATTCCGG^CCGGCCNTTCAAAANGGTTACAAGAAAAG 
AGGGTTGGGGCCCGGAAAAAAACCCCCCTGNAACCAAGGGGGNAACNTTAAATTANAAAA 
AGGGAANTTAACCCCCAAGGGGGCCCGGG 1 1 1 1 I i I CNCCCCCCNCCTTNGGGGGAAAAA 
GNCCCTTTCCNCCCCTTCNAAATANNACCGGCCCTTCCTTNCCCCCTTGGI I I I INCNCC 
GGANACCCCCCTTTGGTCCCCGNTCNTTTTAACCCCCGGGGGAANTAACCCCCTTTGGTT 
CCCCCCGGCNCC I I I 1 I I ICNTTCACCCCTTTTTCCGNGGGGGAAAAGGNCCGGTTGGGG 

nccgncci rm iccttt 

Sequence 3825 

gggcgaattggagctccccgcggtggcggccgaggtacgcggggatcggaaatttgaatt 

acctctagaccaggatttgtcagtctctttactattgacattttgggtcagatcattctt 

tcattctttcttggttgaggggttgatattgtttggtgtgtcctcatccaaatctcaaat 

tatagctcccatagttcccatgtgtcatgggagggaacccgggngggagggtaaactgaa 

mcatmgcaaaggtctttcccttgctcctctcatgacaggtgaatttctcatggatatc 

taaatggttttataaagggggagttcccctgcacacgctctctctcttccccgttgccat 

gaaagatgtgaccttttgctcctccttncccatgmttgtgaggctttccccagttcaca 

aagggaaactggtggagttcccattaaaaccci 1 1 I I I ictttgntaaaaaatttacccc 

CA 

Sequence 3826 

AGACGTGTTTGGAGCNCCGCCGCNNCGGNGGGGCNGNCGAACCGGGCNGGTNCNCNGGNC 
TNCNTNANTTNTNTNTNATNNTTATTTATNCCGTTTTACCTTTTnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnGGNNCCNGGGGNNAAAANTTGGTNATCCGCTCACAATTTCCACACAACATAC 
Sequence 3827 

CGCGGTGGCGGCCGAGGTACCTTAMGCAGAATCCATAGTTAGTCGGTGCTCCATGTTTT 

TTTTTGCCTGCCAGTGACACATAAATATCACTATTGCCAAATTCGCTGAAAAACTGCAAA 

TGCCGCGGTTCCTGGAGGCATCACATTACCCGACTTCAAACTATTACTTACAAAGGCCAT 

AGTCACCCAAACAGTAAAGTACCTGCCCG 

Sequence 3828 

CTACTTAGGGCAATTGGAGCTACCNNCCCGGCGGGGCGNGCCGAAGNACTTTTTTTTTTT 
rTTTTTTTTTTTl GAGGGGGAGTCTTGCTCTGTCTCCCAGGCTGGAGTGCAGGGCTGCTA 
TCTTGGCTCACTGCAAGCTCCACCTCCTGGGTTCACACCATTCTCCTGTCTCAGCCTCCC 
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GAGTAGCTGGGACTACAAAGGTGCCCGCCACCACACCCGACTAATTTTTTGTATTTTTAG 

TANANACGGGGTTTCACCATGTTAGCCAGGATGGNGTTGATCTCCTGACCTAATGATCTG 

CCTGTTTC AGCC TCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCATGCCTGGCCTCTC 

ACATGGCCTTTTACTCCTTTGTGGTTGCTGCTCATCTTTCAACTTTCCTTTCTAATTTG 

CAAAATGTTTTCCTATTTCCAGCT 

Sequence 3829 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACGCGGGGAGGCATT 

GAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCAGGCGGAGCTTGAGGAAACCGCAGA 

TAAGI 1 1 I n ICTCTTTGAMGATAGAGATTAATACMCTACTTAAAAAATATAGTCAAT 

A GGTT ACTAAGATATTG CTTAN C CGTTAAGTT7TTTAC C GTAANTTTAAATAGCTTTAAG 

ATTTTAAGNAGAAAMTTATGAAANACTTTANAANGAAGTAGGCATTGGANNGAAAGGGA 

AAMGAATAAAAGGGTTTCTTAAAAACATGACGGAGGTTGAGATGAAGCTTCTTCATGGA 

GTAANAAAAAAAAAAAAAAAAAGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCCGG 

GCTGCAGGAATTCGANATTCAAGCTTATCGATACCCGTCCGACCTTCGAGGGGGGGGGCC 

CCGGATACCCAGCTTTTTGTTTCCCCTTTTT 

Sequence 3830 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGTGGCCATGTGGAGC 

TAGTGGCCACC ACACTTGACACCACG GATGCAGAAC7TTTCCATCACTGCAGAAAGCTCT 

GTTGAACAGAAC I I ! M J I i I i i i I II I M I II < ! I i I I I I I i CCCGCATTANATCACTG 

GGGGTCTTAAGC AATTNTTCAAAGCGAACTTTGG 1 I 11 I AAl 1 1 1 1 GGCTNAACTCTAAG 

GTGGGAGGCATNTTNTTCAGCTTTTATGCAAAAGGGACTTGAAAAATGCAAAGCATTGAC 

TCMTCAGGGATCCTTTTNTCAGACATMTTAAGMGGCTTTGACAGGTTGAANATTCAG 

NCCCCAGGTTCCCGCGTACCTTGCCCGGGCGGCCGTTTTANAACTAGNGGGATCCCCCGG 

GCTTGCAGGGAATTCNNATATCAAGCTTATCCGATNCCCGNCNACCTCGNGGGGGGGGGG 

CCCGGTCCCCCAANC I I I I 1GTTTTCC 

Sequence 3831 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTAGTGGCAACCAC 

GCCCGG G AAAATC C ATAGG ATCATTTC AG AC C GCATTCAACAC AGATACTCATGTAGCCC 

CGACGTTTCTTGTGAAGGGAATTGATGTATTGCTTGAGGGGGTCGATCTCTGGTTCTAAG 

GTGTTTTGTTTGTTTCCATCGTGACTCCCAGAAAATTATNNNANGGGCCAGACCCNATTG 

GCTCATGCCTGTAATCCCAGCCAGTTTTTCTAAAGTTTTAATCTGTCTCCTCCACTAGGC 

CATG AG CCCTAGGAGGAG ATTGTCTCTAG AGGGTATCCCCTGTGCCTGGC AC ATG CATGC 

AAGTTATGATTTATGTTTGATGAATGAATGAATGAATGAATGAATGAATNGAATGGTGGA 

TGGGTCCTTCTTGAGGGCTCTTAGAAAATAGGGGGTTGA 

Sequence 3832 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCTTCCTTTTCA 

GATTCCCTGTCTGCTCAATTAAAGATGTTTGAATCCAAAGGAAGTCAAGGAAGAAAAAGC 

ATGGAAAGGAAGAGAACTGATTCCTACTGAAAATTCAAATTCTATTACCATTCTAACTTT 

NAT^AAAAGTTTGGGATCAAAAMCNACTTNNTTTTCNTGCCCNNGGCTTNTNNTTAGGG 

GGGGGGATTNTTAAAGGGACATTANGGATTGGTGCTCANAAAATGGTTAATCATTGCTGT 

GTQCTAGCCAGGGCCAGCTGGTACCT 

Sequence 3833 

CCGCGGTGGCGGCCGGCCTATTTTATTTTAAGATGGAGTCTTGCTCTGTCGCCCAGGCTG 

GAGCGCAGTGGCACGAGCTCGGCTCACTGTTCTCCAGCCTGGGTGACTGAGAAAGAATCT 

CTCTCAAATTAAAAAAATTAAAAAAAATGTGTGTGGTGGGGGTATCTTGACTTTCCTCC 

TGTGTCTTAGGCCCCCMCTTTTTACATGGGGACCTTTGCANAAGAGAATTCCTTCAAAC 

CAAATACCTTTATGGTGCTTGCCTCCCTGATGGGCTTGGGGTTCTAAACAGCAGTGCCTT 

CCCCATCTGCTCAGTCCTTACTCACCTGGGCACCATCCTCCTGGTACCTnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTNGATATCAAGCTTATCGATACCCGTCGA 
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CCTCGAGGGGGGGGCCCCGGTACCCAGCTTTTTTGTTTCCCTTTTTAA 
Sequence 3834 

AGCTCC CCGC GGTGGCGGCCGCCCGGGCAGGTACAGGCCCGATATTATCTTTACAAGCAT 

GCCTACTTTTATCCTCATTTAAMCGATTTCCAGTTCAGGGCAGTNGTGAGTCATCCGAA 

GAAAATGGAGATGACAGTTCAGAAGAGGAGGAGGAAGAAGAGGAGACTTCAAATGAAGGA 

GAAAACAATGAAGAATTNAANGAAGAATGNAN GACTTTGNGGGCT I'GAGAATACCCACAC 

TTTTCTGCTACAGCACTGGGGCTTATGGGAGAGGGACGCCACCGCCTGGGCACAGGGGTA 

TACAGGGTTAGCTTGCAATCCAGCTTCCCAAGAAGGCTGGGGATATAACAAAACAAAGCT 

ACAAAAGAGAAGGAAAATNTNAATATTNAAAAAATAAANAAGGTTACCTCGGCCGCTCTA 

AGAAACTAGTGGGATCCCCCGGGG 

Sequence 3835 

CCGCGGTGGCGGCCGAGGTACT r n n I i M 1 I ) 1 1 1 J I 1 I I M N 1 1 A I 1 ) H 1 1 I 1 ) i 
' ^ i I I I I i I I I I I I I | | I | | t I I I! I 1 I I 1 I I I I 1 I I II II I 1 I I I I M i I NTTTNGNG 
GGGGGNNNNNGGNCCCTTTTTT^NTAAAMNNNCNGNCCCCAAANNTNAANm 
MCCCNTTTNCCNGGNANAAAGGGNNANAAAAAAANTTTTAAAANAAANATNTNCNNGNA 
AAAAM TACAAAMCCCTGGCAMNGGGGGNTCNTATNAAMCNTTTNATTCCCCNGGCC 
inn iAAAAAANNGNGGGANCCCCCGGGNGNNGGNANNAATTTTANNNNAAMNTTTTT 
TTAAACCCNTCNANCCNTTGNGGGGGG 
Sequence 3836 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGCGGGGACCTCATTC 

ATTTCTACCGGTCTCTAGTAGTGCAGCTTCGGCTGGTGTCATCGGTGTACTTCCTCCGCT 

GCCGCCCCCGCMGGCTTCGCCGTCATCGAGGCCATTTCCAGCGACTTGTCGCACGCTTT 

TCTATATAC7TCGTTCCCCGCCAACCGCAACCANTGACCCCATNNTTNGGGTTATTTCNA 

GTGACCGAGACCCGCGGGCCGCTCTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGTAATCATGnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnCACACAACATTACNNAGCCGGGGAAGCATTAAAAG 

TTGGTAAAAGCCTTGGGGGGGTGCC 

Sequence 3837 

CCGGGCAGGTACCAGCNGAAATCAGGACAGTCCCCTAAGCTACTCACTTACTGGGCNTCT 

ACCCGGGAATCCGGGGTCCCTGACCGATTCANCGGNATGNNGGTCCGGGACAGAAATTNA 

CTCTCACCATCAACAACCTGCAGGCTGAAGATGTGGNAAGTTTATAACTGTCCAGCCAAT 

ATTTMTNANTAACCTTNNTTNTCCACTNTNATGGNTCGGGNNGGGAACCCAMNGGTTT 

GGG AGTAT TCAAAAAGCNTAMCTTGGNGGGGCTTGCGACCCMTACCTTGTTCTTTNCA 

NTNNTTTTCTCCCCCGCTNCAANTCCTNTGAAATGGAANGCCAAGCTTTTNTAAATAANT 

CCTTTGNGAAAAACCTTGGCCCCCTTCNTTGGTTTTGGGTGGGTTGGCCTCCTTGGCCNT 

GGGAAAATTNANAACCTTTTNCGTTMTTTCCNNCNAAGTAAAGTAAGGGGGCCCNAANA 

NATGGATTANNCACCTTrCNNGGGNCCCCGGNATTTCTTTTAGGTAAMCTTTATGGGNT 

GGGGTATTCNCCCCCCCC 

Sequence 3838 

CCGCGGTGGCGGCCGAGGTAC I I I I 1 H I 1 1 C M I t 1 1 M I t M I I I I 1 I GC 1 I 1 IN 1 1 

ATTTTCTTTTAATGCATCAAACMCTGTGGCCAGTGAAAGGAMCAAAACTGGCAGTTTG 

TCCATTTGAATATCAGACCTAGTTTCTTCTTMTTTCCACACTATTTCTCCCATATTCCT 

TAMCTTCTTGGCATCCTTCAATGCCTTACAGCTTNCCCAAGATGCAATAAAAGTCATTG 

TACCAGTATTTCTTACAATATAAGTTATATGCCAATGTTCAAGCA I ITTTTTTTTI 1CAC 

AGCACTAGAG ACCCTGTTAAATAGG GGATATGAGTC AGAATG GCTTATTCACAG ATGGGG 

TCCAGATTNAGTGCCCCGCCGTACCTTGCCCGGGCGGGCCGTTNTANAACTAGTGGGATC 

CCCCGGGCTGCAG 

.Sequence 3839 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACCTCAGAAGCAAACCCAGTTCCTGCACACAGAA 

ACCCCATTCAGGCTCCTACTGCACTGAGAAGCACGTGTTCTCCATTTCCCTGGGGGAGAC 

CATTGTATTGGGCAGTTTGGAACAAAACACCATGGACTGGGAGGCTTACACAACAGAAAT 

TTAmCTTGCTGTTCTAGAAGGCTGGGAMGCTCMGGTGCTGGCTTGCATATTCATTC 

TGAGGCCTCTTCTGATGTGCAGGCAGCTGCCTTCTGACTTGTGCTCACATTGGAGAGAGG 

GAGTCAGCTTTGGTGTCTCTTCTTGTAAGGACACTAACCCCATTCACTAGGGCCCCACCC 

TCATGACCTAATCACCCCCAAAGGTCCTGTCTCCAGNACATATCACATGGGGGCAAGAAG 

CTTNCAATGGTACCTCGGGCCCGCTCTAAGAAACCTAGNTGGGGATCCCCCCGGGG 

Sequence 3840 

CC GCGGT GGCGGCCGAGGTACGCGGGGAAATCAGG1 1 ) > I 11 1 11 1 AAAGTGAAACTTG A 

TGA1 I I ! I AAACAAGTMTTAAAAAAATGTCC AAAAC ACCATGTG GGCC AMC ATTTGTT 

TGAGCCTGGGGGCCACCAGTTTGCGACCACTGCCTTACGTAGTTAACACCCTGAGTATGT 

ATACAGTCATATTTTTTGGTTTNGGATATTGGNAGTGTTANATATACTTGGGGGGCGTTG 

ATATTTGAAGTCATCTTTATCTCTCAGAGTTAAGCTTTATTGTAGMGAAAAAAAAT^ 

AANAAAAAAAAAAAANNAGGTACCTGCCCGGGCGGCCCGCTNTAAAACTAGTTGGATCCC 

CCGGGCTTGCAGGGAATTTNNATATCAAGCTTTATCGATACCCGTCGACCTTCGAGGGGG 

GGGGCCCGGGTANCCCANC 1 I I 1 i IG 

Sequence 3841 

GGCAGGTACGACACCCCTCAAGAAGGTGGGCCCGAAGCTTTCCTGTCTTCCCTGACAGAG 

ACCATAGAAAGGAAGTCCGATGCTGAAGGATGGGCACCGGCCCANGGGAACAACAGAAGC 

CGGNAGATCCGTNCTGGGAACCCCTTTCAGGCTCCATGGAAACATCTTACCTGGGTGCCT 

AGATGGATCMNACAGCATTTGGGGGCCAGCCAACTTTCACCAGGAAGCCAAAAAGGTGT 

CTAAGTCCAACTTAANTTGAAAAAGGTGGGCCAAAGNTATNGGTGTGGAAGCCCAANGAT 

ATGGTCTAAGTGACCNTATTGCCACATACCCCAAAATTTGGGTCAAAAAGTGGTCTGAAG 

CANACCAGCATTTAATTGCCAGNACTGGGGTCAACGAAAGCCANCTCAAATGGAAAATCA 

ATTTATTGAAAGACCACCAAGGTTTGAAAAGTTCANGGGGACNTAAACACCCAAAGAAAG 

0 

Sequence 3842 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGGATCTTGGCTCACTGCAGCCTCCACCTC 

TTGGATTCAAGCAGTTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGAGCCACCATGTCCAG 

CTGAI 1 I 1 1GTATTTTTAGTAMTGAGTTTTCACCATGTTGGCCAGGCTGGGTCTTTGAA 

CTCCTGACCTCAAGTGAGCTGCCTACCTGGCCTCCCAAAGCTCTGGGATTATAGGTGTGA 

GCCACTGTGCCAGGCCTGAGGTTGGGAGAAATTTTGTCCATTTCTTTAGAACCAAAATTG 

GCMCCAGAGAGTATTTGGGATGTTACACAAAATATTTAGTTTCCCTTTCTAGCCTAAAT 

TGGGGTTGTTTATAGCACCCGTCTCTCC.ATTTGAGAAAAATGGTTAGGAT 

Sequence 3843 

CGCGGTGGCTCGCCCGGGCAGGNACACCCTGTTCTTCTCTGGCTTGCCCACCAGCGGCCT 
CTGGACCGTCCAGGCCATGGATCCTAACGOTGCTTATGTGAACATGAGTAACCATCACCG 
GGGCCTGGCTTCAGCCAACGTTTGACTrTNCCTTCAGGCCTTGTTATTANANCACCCTTA 
NGTGGGCCCTTNGAMGTTCCCCCAA/J^^ 

CCCCTNGTTGGAAGCCAAATTCNTCTCCATTTGNGGCCCCTTTTAAGACNNTAATTTGNC 

TTGGTTCACCCNTNGNNNGCCNAACCCCTTTGGNTNGNGGNCTCCAACCCAATCCATCCN 

GGGGGGCCCCCCCAMNNCCTTTTANCTTTCNCAAGGNGGGGCCTCCTTAGNGNGNTTTT 

TTTNAAAAACNCCCTTCNNACCCTTrGGTANNAAAAGNGGATNTCNTTGGGAANTAACCT 

TGNNAANTGMTTCCCCATCCCCNAAGTNGNGGNNTTTTTTTNCCCAANNNCCTAACNCC 

CTTGGGCCANCCCCCANAANCTTTTNC I I I I I I I I GNTCNAAAAAAAAGTTTNCANNGNA 

ACCAACCCCCAAGGGGCTTTTTATNGTANANTATTTGGMCCCCCACTTGNGGGGGTTAA 

A 

Sequence 3844 
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CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACATCACAA 

TGTCATTTTGATATGGAGCGTTTTGTTATTNTTACAAGGCAGAATGGGGTGNAACAGTTG 

MTTAMCTTAGCAATCACGTGCTCANAGCTTTTGCCTGTCAGTTGTGTGTGTCCCTTAT 

AGGCCCTTTCCCCCACAAGCTCTTTGCTGAMGAAGTTTGCCTTGNTTTGTTNNGNTGNT 

TTGTATTTANCCANAGGATGTCAAMTTANTCTTCTCAAAGCTTTGAGTAGAGTAAGTGT 

GGGAATAAGCCANTA! mm I CTGTTTCTGTAACTCAAATGAACGGG I I 1 1 I I 1CCCT 

TGNNTGCCACTTGNCCTAACATGTCCTTAAGGTGTTTAACCTGCCTCTGACCTGGCTTGC 

AATGCATAGGTGAGGAG 

Sequence 3845 

AGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANCAAAACCCACCTGTGTAAAC 
AC AC ACAGC AAAGTG ATGTAAGAAGTTTCCATATAAAG GGCTG C AGTATGG AG AGGTAAT 
GTGCAGGCTGGTTTGCGGCTGTAGGGGCCACCTTGCTGCAGCTCTCCACTGATATGGTAC 
CTGCCCG 
Sequence 3846 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGCCAAAAAGAAGTCAGAAGGAGC 

CAAATCAGGACTGTAAGGTGGATGAATGCCTAATGATTTCCCATCAAAGCTCTCACAAAA 

TTGCCCTTGTTTGATGAGAGGAATAAGCAGAANCATTGTGATGGGGAAGGACTCTCTAAT 

GAAGCTTTCCCAGGTGTTTTTCTGCTAAAGCTTTGGCTTTCTCAAMCACm 

AGCAGATGTTGTTGTTCTTTGGCCCTCCAGAAAACCAACTAGAAAAATTCCT1TAACATT 

CCAAMGACTATTGCCATGAGCTTTGTTCTTGACCAGTCCAGTTTTGCTTTGACTAGACC 

GCTTCCACTTCTTGGTAGCCATTGCTTTGATTGTGCTTTATCTTTAGGATCATACTGGTA 

AAGCCATGTTTTGTCTCCTG 

Sequence 3847 

ATTGGAGCTCCCCGCGGTGGCAGCGGCCGCCCGGGCAGGTACTGGAATGGCCAAGTGGGA 

CTGCTTCAGCTGACCAGGTTCTTTTAMCCGTAGTCATGCTTTCCCACTMCTCTTAMT 

CCTTATGCTTAGAAAATTGAGGATAAGGCTGGGCACAGTGGCTCAGGCCTGTAATCCCAG 

CACTTTGGGAGACCAAGGCGGGTGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCTA 

ACATGGCGAAACCCCGTCTNTACTAAAAATACAAAAAAATAGCTGGGCGTGGTGGCGGGC 

GCCTGTANTCCCAGCTACTCATGAGGCTGAGGCAGGAGGATCACTTGAACATGGGAGGCG 

GAGGTTGCAGTGAGCCAAGATGGCGCCACTGCACTCCAGCCTGGGTGATGGCCGTGAGAC 

TCCATGTCAAAAAAAAA 

Sequence 3848 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTTCCGNGCTT 
CTTCATTTCTTCGATAGCCATAAAATTTTCTAGCTCTCTATCACCATCCTTATTTATTNC 
TGGTTCTTGAGATCTTTTCTCTGGCTGGCCATTTTCACTGCCTTTMGCCTT^ 
TTCCTCTG ATGTC AGCTTTAAGNCTNGCTCTGGGTTG CTGT NTTCAGAAGAGTATTTTGG 
CATNTG1 I n ICTTNGTAGTCAGAAACTAATTCGCAAATACTGTGATACTCTTCACTTTC 
GTTGTCAGGAAACTGCTGATTTTCAGGTGTTCTGCTCTTCCTTTGAGGAATTAATCCATC 
ATNACCATTGCCAGCAGTGACACCATTAGTCAGGTTTTCTAGT 
Sequent 3849 

GGGCGAANTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I II 1 1 I I i 1 1 1 i I I I I I 1 1 1 I I 

TTTGGNCTCCCANGACCTCTTTNCACTNTTAAAAATGACATCCTAAANGGCTNTTATTTA 

TATGGATATNTACTGATATTGACCATA7TCAAATTNAAAACAAANGNTTGAAAATATTCA 

CTA^TTCCTTTAAAAATAGCTCATTNAAMTATGAAAMGATACTTTTACAAATTNANTG 

AGMTATCACTGNTTN ATMTTTTGC AAATC NCTTAATGCCTG GCTTANATG AAAAC N GC 

TGGAGTCCCATANCNGTTTCTGCATTAMNTTGNTGNTATATGANTTTrNTGCTTAAAAC 

CTATGAGGAA 

Sequence 3850 

CCTATCATACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAT 
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CCTGTGAAATCCAGGTGGAGGCTCCCAAACCTCAGTTCTTGACTTCTATGCACTTGCAGG 

CTCAACACCACATGAAAGCTGCCAAGGCCTGGGGCTTGAACCCTCTGAACGCATGGCCTG 

AGTTTATACTGGCCCCTTTCAGTCATGGCTGGAGCAGCTGGGATGCAGGGCACCACCTTT 

CTGGACTGCACACACCATGGGGACCCTTGGCCCGGCCCACAAAACCATGTTTTCCTCCTA 

GGCCTTTGAGCCTGTGATGGGAGGGGCTGCCATGAAGACCTGTGACATGTCCTGGAGACA 

TTTTCCCCATTGTCTAGGGGATTMCATTCAGCTCCTC 

TCCGGCTTGAATTTCTCCTCAGAAAATGGGATTTTCTTTTCTT 

Sequence 3851 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGAGGGCTTCATTGCCAGAAC 
ACATTATATACAGGATGTCAGAGCTACCAGTGTGCTGCTGGGAGAAAATGCTGCAAAATT 
CATCTTTTGGAGGGTGGGGGGAAAACCCAAAAACAACAACAAAAAAACTCTCTTACAGAA 
TTTTCCTTAAC ATTAAAAAAAACTTGTC ATATTTTTCAAAGGC N CATTTG ATACTC AN AA 
TTG CTN AAAGTATATTTAAAG AC ATCTAG CCTTCCATATGTAAAAAATATTTTAG ATAAA 
GACAGTTACAGGACTCAGAATMTATATTGGGTGATTCACTGTATCCTCCATTTTTACAT 
TTAAAGAAATTCANAATACGTGATTAAA \ I I I H M I IT I 
Sequence 3852 

CGACTTGGAGCTCCTTCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATCAAACAAGTC 

CTCCAAMGTGGTTGAMTTGGGCATCAAAGGCAAAAAACTCCTGAGGGATTGCAAGAAT 

CAGCTAATGTGAAAATCTCAGGAACTTTCAGCAGTGGGTGGACTGTGAGGCTGCCTGAGT 

TCCAGTTTTCTTCTCAGAACAAAGAATATCCCGATCGGGAAGATTGCACTACAGAAAAAG 

GCAAAAAGACCACTGTAGAAACAGAAGATTCTTCTGTAGAGAACCCTGAACAGGATCTGT 

TTGTAGAACAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 3853 

GCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGA 

NACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAA 

TGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTT 

CATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTG 

CCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAG 

GATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCAC 

TGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATTCCT 

TGAAATTTGCAAGTTGTCTGGGTTCATGTCTA 

Sequence 3854 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGTGAATTTCTT 

GCACAAGTTGCGGCAGCTGCCTGAGCGATACATGATGAACAGCGTCCTGGAAAACTTCAC 

CATCCTCCAGGTGGTGACAAACAGAGACACCCAGGAACTGCTGCTCTGCACCGCCTATGT 

CTTCGAGGTCTCCACCAGCGAGCGTGGGGCCCAGCATCACATTTACCGCOTGGTCAGGGA 

CTGAAGGGGGACCCCAAAACTGGCTCACCCCCCAGAACACCCTCCTCCCAGGGAGGGTCC 

TCCAGCTCACCTCATGCTTCTTATTTGGGGAGAGGGCTGTGATGTAAAGGGGTTGACCTC 

AGAGGCCTAAGCAACTGGGGCCATCCCCCCCATTAGAGGGGCCCCCAAAACTTGGTTGGA 

GAGCTGAACAGGCTGGGACTGAGGAAAGGATAAACCCCGATATTGGGACCTC 

Sequence 3855 

AGGTAC l nilllllMlHlUllimiim I GGGGN l 1 1 I I ( CTTTTTCTTTGTTN 

TATNGTGCCGGTTTTTNGGTNTTTAGGGGGNTNTTNCNGCCNNCACNNNAGAATTNAGCC 

AACGCCANATGCCTTGCTCCCCACMTTTGGAACATTCCTmGGATTTTGACCCAAGTC 

AAGGGAAANAGATGGGGAGGAGAAAGTTGAAACANGCAACAATAAACCCCAAACNTAAAA 

CAAAAAGAGTTMGCAAAACAAACAAATGCNCAATTCATNTTGATTACTGAGTGTTCTAA 

TGGTAAGGAGAMTTAAMGCAGCTGGNGGGTCATCTTAAAATTTTANTTCATTTAAGGG 

AAAAAATTTTAAGACAAACTNTAATTCAGCTACTTACCTTGGAAATAAAG 

Sequence 3856 
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TTTGGAATCCGGCGAATTGTTTCTGGGCNANGGCCGCGGCCGCCCGGGCAGGTACTTCGG 

G ATTATTTCTTTG GAAAAAAG AG NTTTTATGTNTCTG ATTGGTGAGTGGCAAGAGTTCAA 

CATCGCCAGGTGTGGTGGCTCATGCCTGTAATTCCAGCACTTTGGGAGATTGAGGTGAGT 

GGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCTGTNTCT 

ACCAAAAATACAAAAATTAGCCAGGCGTAGTGGCATGTGCCTGTAATCCCAGCTATTCGG 

GAGGCTAAGGCATGAGAATCACTTGAATCTGGGAGGCGAAGGTTGCCGGTGAGCAGAGAT 

TGCACCTGGGTTGACAGGAGTTGAGACCTTGTCCCAAAAAAAAAAAAAAAAAAANAGTNC 

CTCGGCCGCTCTAGAMCTAGTTGGATCCCCCGGGGCT 

Sequence 3857 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTTGMGTTCAGATGAGATTTCGGACTTTTGA 

GTTGATGCTGAAACAGCTTGAGATTTTTGGGGACTACTGAGAGATGATAATTGTATTGTG 

CAATATGAGAAGGACATGAGATTTGGTGGGCATAGGTGTGAAATGACATTGTTTGGATGT 

GTTTACCCTCCAAATCTCTTGTTGAATGTGATCTTAAACGTTGGTGGTGGGCCTAGTGGA 

AGGTGTTGAATCATGGGGGTGGACTCTTCATAATTTGCTTAGCTCCATCCCCTTGGTGAT 

GAGCMGTCCTTGCTCTGTTGTGTCACATGAGAAGCTGGTTGTTTAAAACAGCCTGCCGN 

TGTTTCCCACTTCTTGCTATTCATTTTCTCTGTTGACACCGTTGATACACTTGGCNCCTC 

ACTT 

Sequence 3858 

CCGCGGACTTTGTNCGCGGGGGTTGATGGAAGGAAGAACTTGNGTGCTTAGACCTGACGC 
TGGGAGGAGATACTGTCACCTAGGTTACTTGTAGGACCCTATACGGCAACCTCCTTTGCC 
AGGAACT 
Sequence 3859 

ATAGGGCTTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCTGCATGCCATACTTAAG 

CAATCTAATTTTNGGTCACCCAGCACGTCCTACTCTGCCCTTCTTCTGCACAGAGCATTT 

TCAGGTCTCGGGGCCCTCCCGCTTTGTTAGGCTTAAAGCAGAATCCATAGTTAGTCGGTG 

CTCCATGI I I I I I I I I G CCTGCC AGTG ACACATAAATATCACTATTGCC AAATTCGCTGA 

AAAACTGCAAATGCCGCGGTTCAGTTCTTCATGGCTGGGGAGGCCTCACAATCACAGCAN 

AAGGTGAAAGGCACATCTCACATGGCAGTAGACAAGAGAACTTGTGCAGGGAAACTCCCC 

TGGAGGCATCACATTACCCGACTTCAAACTATTACTTCCCCGCGTACCTGCCCGGGGCGG 

CCGCTCTAAGAACTAAGTGGGAT 

Sequence 3860 

CCGCGGTGGCGGCCGCCCNGCCAGGTACCCTTGACATCACANGGGGCANCCCACATCAAC 

TGCCATTAAAGAAAAGAAATTTCAGCCCANAATTtCATATCCAGCAAAACTAAGCATCAT 

AAGTNMGGAGAAATMGATCCTTTTCAGACAAGCAAATGCTGAGAGAATTCCGGTATCA 

CCAGATCTACCTTACGAGAGCTGGGGNGAAGGAAGCACTAAATATGGAAAGAAAAGATCA 

TCACCTGCCACTACAAAAACGCTCTGAAGTACCTCNnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNNGATACCGTCCACCTCNANGGGGGGCCC 

NTGTACC CAN CTTT 

Sequence 3861 

GGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAAGC 

TCCGCCTCCCGGGTTCACGCTATTCTCCTGTCTCANCCTCCTGAGTAGCTGGGACTACAG 

GCGCCCGCCACCACGCCCAGCTMTlTmGNGNNGGTTAGTANANACGGGGTTTCACCA 

TGTTAGCCANGACGGTCTGGATCTCCTGACCTCGTAACCTGCCCGGNnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGATACCGNCNACC 

TCNAGnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTTTAAGTGAGGGGTAATTGCCCCC 

CTTGGGCGTAAATCATGGNCATMGCTGTTTTCCTTGNGNGAAATTGTTATCCCGGTCAC 

AATTCCCACACAACATACCNAGCCCGGAANCNTAAAAGTGTAAAGCCTT 

Sequence 3862 

AGGTAC I I M U I I M Ml U i I i I ( NGGGGGGAAGCCTTGTGGTAACCACAAAGCAAAA 
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ACTTACAACAGATACACTAAAAATAAAAAGGGACAAATTAAAATATATATTACCTGAGAA 

AATTGCTTMGCACAAAGGAGGATATTAAAAAAAAAGAGAGAGGAAAAGAGAAGACATTA 

AGAAAGANGGAGAAGAAGAGAGAAGTAAGAAAACAAGTAGAAAGCAAGTAACAAAATGGC 

AGTAGTTAGTTGTTACCTGTTAAAAATAACATTGAATATAAATAGACCGAATTCCCCAAT 

AAAACATATACAGTGGTTGAATGAATTTAAAAGCAAGACTCAACTATATGCTGTCAACAA 

AAACTCACTTTACCTATAAAGACTCACATAGACTAAAAGTAAAGAGATGGAAAAACATAT 

TCAATGCAAATGAAAACCAAAAMGAGCAGAAGTGGTTATACTTATTTCAGATGAAGTAG 

ACTTTAAGCTAAAAACTATAAAAGG AG AC AAAG AAGG N C ACTGN ATAATG ATATTATTAA 

TGATCATATNTAATGATTATAMTAAATTTTTATGTNTAAAAMTTT 

Sequence 3863 

CCGCGGTGGCGGCCGAGGTACTTTGGCCTCTCTGGGATAGAAGTTATTCAGCAGGCACAC 
AACAGAGGCAGTTCCAGATTTCAACTGCNTCATCAGATGGN N GGGAAGATGAAGACAGAT 
GGTGCAGCCNCAGGNTCGTNTTGATTTTCCACCTTGGNCCCTTGGCCCNAACCGTTGGGG 
GGGGG NAGGNATAAATATTTGGNGACAGTNATATGGNTGNCNCATCTTTCAGCCTTGCAG 
GCTGGCTGATGGGGGAG 
Sequence 3864 

TACGACTCACTATAGG GCCGAAATTGGAGCTCCCACCGCGGTGGCGGCCCGAGGTACTTT 

iMiiniiiiiiiii r ggggg 1 1 n 1 1 1 n m 1 1 n 1 1 1 m i n 1 1 m n 1 1 1 n 1 1 1 
1 1 n ii 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 1 1 n i n 1 1 n 1 1 1 1 1 1 1 11 1 inccccgn 

NAAAAAAAAAANCCNNTCNNNNNNNAAANGGGGGGGGAAAAAAAAAAAANAAAANNNNGG 

GGNNGGGGCCCACCCCCCCCCCCCCCTNAAANCCCCCCNNGAAAGGNNAAANNAAAAAAA 

CCCCTTANGCCAAAAAANNNANNGCCCCNANAACCCCCCCCCCNCCCCCCCNNAAAAAAA 

AAAAANCCCCCCCCCCCCNNAAAAAACNAAAAAAANCTNAACANCCCCCCCCCCCCNNGG 

GGGNCCCCCCCNCCCCNNNTTTTNCCCCCCNNAAANAAGGAAAAAAACCCCCCCCCCCNN 

AAAAAANANAAAAANNCNCCCCCCNANAAAAANNNNNCCCCCCCNAAAANCCCNNNAAAA 

AAANANNCCCNNAANAAAAAAANAAAANNCCCCCCGNCCCCCCNANNGNCCCCCCCCCCC 

CCCAANNCCCCNNCCNCCNCCNCCCCCCNNNNCNAAAAAAAAMNCCNCCCCCCCCCCCN 

AAAAAA 

Sequence 3865 

CCGGGCAGGTACG I 1 1 I 1 1 1 1 I I I Ml i ! I 1 1 111 I I J I I GAATTTAAAAAATGTTCTTT 

AANACCCCTCCTTAACGGTTGATGGATTTAGCAAGTAGCAATATAANATGTTCGGATAAA 

TTTAAATTTCACGTAACMTACTTTTTCCCTGTAAGCATGTCCTAAATATTGCATGGGGC 

ATGATTATAATAAAAAAGTTATTATTTATATGAAATTCAAATTTATCTGGAATCCCTGTC 

CTTTATCTACTTTTGTTTTTATGATGATTTTTATGATGATTAAMCA^ 

AAAAATTTATTTTAGTACTTCCTATTGATAAAAAT^ 

TTCAC AAAG CTAAAAATTTANMTAMTTTGAATGGTCAMCCTCMTACCTGTTTTATT 

GTGAATTTTCAGCATTTACCGCTTAATTTTCATTGATCTTTTTGAGACATCCTA 

TTCAGTTCTCTGAATTTCTATTTCATTTCTGTTCTC 

CAAATTCAA I I I 1 1 I ICT 

Sequence 3866 

TCCTCATGGCATCATTCAAGGTCATCCCCTAAATTTTAACACTTTCGATGCCATATACGT 

TGTAACCTTCCTTATMGTTGCAAAATTCTGTGAATTCTGCGAGGAAGATGGGTTAATAT 

TCTGTGCATTCTTTGCCACCTTCATTCGTTTCGCCTCGGCCCTTGACTTGTAACAGAACT 

CAATCAAAGCCACCAGCATTGCCAAACCAAGGCCCCCGGCAAGGATGTAGAATACTCCAG 

CMCGTTGCTCAGACTGAGGGCACTGGTCTTTTCCTAAAGAATGTATATGAGAAGAGGTT 

ATTAGGAAGGCAAACTGGTGAATAGAAGAGAACAGCAATGTCACATAGCATTATCACGGG 

AACAACCTAGTCACACAGMTGCATTTGAATTTGAGAMCMTGGATTTTCTCTTAGACC 

CGCGTACCTGCCCGGGCGGCCGCTCGA 

Sequence 3867 
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CCGGGCAGGTACAAAGGAACCGAGGGGGGACCACCTCTGAGATGTCCTTGACTTTGTCAT 

AGCCTGGGGCATATTGAGCATCTCTCTCACAGCTGCCTTTCTTATCCCCATTCTTGATGT 

AGACCTCCTTCCGAGTCAGCTTTTTCTCCTCCTCAGACACAMCAGAGCTTTGATATCCT 

GTGCAGGGAGCAGCTCTTCCTCTGGACGAATAGACTGCGTCACTTCAGCTGCTCCCACCA 

CTGTCATCTCCCCATTGCCTTTGTCACTCAGTAGATGTCCCCTAAGCCCTTCTAGAGCTA 

GGTTCTGGGCCAGGCATCATCTCTTCCTATTGCACCTCCCTCTCATGCCCCACTTGTCTC 

TTGGGATCTCATCCTTATCCTCTTGCCAAGTATGTCTTACTTTGTTGCTGGCAAGTGGTA 

GTTGGAGGAAGCCTCAAAGCTCGAGTTGTTCCCTCNGTGCAGGGGAGACAAATGGGCCTA 

T 

Sequence 3868 

CGGCCGCCCGGGCAGGTACCTGCCATCCAATACGGTCATTAGATTGGGTCATCTTGATTA 

GATTAGATTAGATTAGATTGTCAACAGATTGGGCCATCCTTACTTTATGATAGGCATCAT 

TTTAGTGTGTTACAATAGTMCAGTATGCAAAAGCAGCATTCAGGAGCCGAAAGATAGTC 

TGAAGTCATTCANAAGTGGTTTGAGGTTTCTG I 1 1 1) 1 GGTGGTTTTTGTTNGN I I 1111 

NTTTTTACACCTTAAGGGAGGATTTAATTTGCTCCCAACTGATTGTCACTTAMTGAA^ 

TTTANNAANGMTAAAANGACNTACTTCTCANCTGCAAATATTATGGAGAATTGGGGCAC 

CCACAGGAATGAANAGAGAAAGCA 

Sequence 3869 

AGGTACCACTCCACCTATATCCCTACCTTTGCATGTCTGTCCAACCTCAAAGTCCAGGTC 

TATACACACGGGTAAGACTAGAGCAGTTCAAGTTTCAGAAAATGAGAAAGAGGAACTGAG 

TTGTGCTGAACCCATACAAAATAMCACATTCTTTGTATAGATTCTTGGAACCTCGAGAG 

GMTTCACCTMCTCATAGGTATTTGATGGTATGAATCCATGGCTGGGCTCGGCTTTTAA 

AMGCCTTATCTGGGATTCCTTCTATGGAACCAAGTTCCATCAAAGCCCATTTAAAAGCC 

TACATTAAAAACAAAATTCTTGCTGCATTGTATACAAATAATGATGTCATGATCAAATAA 

TCAGATGCCATTATCAAGTGGAATTACAAAATGGTATACCCACTCCAAAAAAAAAAAAAA 

AGCTAAATTCTCAGTAGAACATTGTGACTTCATGAGCCCTCCACAGCCTTGGAGCTGAGG 

AGGGAGCACTGGTGAGCAGTAGGTTGAAGAGAAAACTTGGCGCTTAATAATCTATCCATG 

Mill I CATCTAAAAGAGCCTTCTTTTTGGATTACCTTATTCAATTTCCATCAAGGAAAT 

TGTTAGTTCCA 

Sequence 3870 

CCGGGCAGGTACGCGGGTTCCCACGCGGTCTCTGGGCTGGTTGAGCTGCGCCGCTGCAGG 
TAGTGCTGAGTTGCTCCAGTG CCACTGGGTTAGG GTCTCC AC AACCG AGCTGGTCTCAGC 
AAGTGGTGTCCATACATGGGGCTCGAACCTGGGTTGAAGGGTCGCCAGAGCGACGGTTGG 
AGAACGTGGAACTAAGCTGGAGGACACCTGAGTACCT 
Sequence 3871 

CCGGGCAGGTACGCGGGGAGGTCACACAGGAAAGAAGCAGACCACGAAATGAACTTCGGC 

TGTCACCTGCGGTGGGCGCACTAGAGGTCTTTTGAACTCCTTCGGCTACCGTCGCCGCGT 

TCTCGCTGTGCACTCTTATTCTGCGCCTGCGCGCGGCTAC AGCACGGTTC Ci I I I I ICCTT 

TAGTCAGGAAGGACGTTGGTGTTGAGGTTAGCATACGTATCAAGGACAGT/ACTACCATG 

GCTCCCGAAGTTTTGCCAAAACCTCGGATGCGTGGCCTTCTGGCCAGGCGTCTGCGAAAT 

CATATGGCTGTAGCATTCGTGCTATCCCTGGGGGTTGCAGCTTTGTATAV3TTTCGTGTG 

GCTGATCAAAGAAAGAAGGCATACGCAGATTTCTACAGAAACTACGATGTCATGAAAGAT 

TTTGAGGAGATGAGGAAGGCTGGTATCTTTCAGAGTGTAAAGTAATCTTGGAATATAAAG 

MTTTCTTCAGGTTGMTTACCTAGAAGTTTGTCACTGACTTGTGTTCC" GAACTATGAC 

ACATGAATATGTGGGCTMGAAATAGTTCCTCTTGATAAATAAACAATTACCAAAAAANA 

NNNNNNNNNNNNAAAAAAAAG 

Sequence 3872 

AGGTACGCGGGTATACATGAAACCCCCAGCACACGAAGAGGCCAAGGCACCTTCCAGAGG 
CTTTGTGGTGCGGGACGCACCCTGGACCGCCAGCAGCAGTGAGAAGGCTCCTGACATGAG 
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CAGCTCTGAGGAATTTCCCAGCTTTGGGGCTCAGGTGGCTCCCAAGACCCTCCCTTGGGG 

CCCCAAACGATAATGATCAAAAAGAACAGAACCCTCTCCAGCCTGCTGACCCAAACCCAA 

CCACACAATGGTTTGTCTCAATCTGACCCAGCGGCTGGACCCTCCGTAAATTGTTGACGC 

TCTTCCCCCTTCCCGAGGTCCCGCAGGGAGCCTAGCGCCTGGCTGTGTGTGCGGCCGAGG 

TACCTGAATGAGCCTGGCCTGGTGCTGTGTCTCCAGACTCGACACAGCCCAGAAAGAAAC 

AGGTGAACACACTCCTGAGAGCCGAAACTGGACTCTTGCTGATGGACTCATGGCAATACT 

CCAGCCAAAGGCTGCTCCGAGTCCTAGAAGGGCTCGGAACTGTCCTGGGTAAGCCTGTCG 

ACACAGTAGGGACAGAGAGGGGTTAGTGTCCTCTTTCACAGCACTAAGCATGTTGTTGCC 

CACATG 

Sequence 3873 

CCGGGCAGGTAC I M M I I I I I I I I M 1 I M I I I M I I 1 M I I GAAGGTTGAATGCAGGG 

TTTTATTGAGTGGTGGAGGTGGTTCTCTGAAGGATGGATGCAAAGCTGGAAGCGGGGATG 

GCACGGGAAGATAATCTTCCTGCTGGGTGGCCAAACCTNTCCGACCACCCTCGGCTGAAC 

TCCTNTCGGCGTTCAAATGTTCCTCTTCTCTCTTTCTCTGCTGNATCGTTTCACCATTCA 

TCTGCTTGTCTGGGATGGTGAANACAACTGTTTGCTCATTCTAAACGACACGGAATANAG 

GCGTCTCCTCATCTGCTCATCTGCTTCTGGAGCTGCTTGGCAATAGGTGATTGTCTCGCC 

GCTCAGCTATAGGCGATGGTCTCACCACTCGGTGATAGGCGATTGTCCCATCTAGGTCGC 

CAAAATGTGTCTGGAATTGGTGGGTTCTTCGTCTCACTGACTTCAAN 

Sequence 3874 

CCGGGCAGGTACCAAAAAGACTCTCAAAAACCAATACTCCCACGGGCAAGGGAATAGCCA 

AGTTTGTTGCGGTTTCCAATGAATGACATCAGCCCTGTGTAGGTCTCAATCAAAATGGGT 

TCAGTTAACACCATCAGTTTCTTTCCTCTTCCAGATCCAGTTGAATTCTTGTGGGCATTC 

TGGATAGCTGGMCAAGCTTAGACATGMCCCAGACMCTTGCAMTTTCMGGMTTTC 

TCACTGGTGTATTTCATAGGATGCTCAGTGAAAGTAGCATAAGGAACTTCAGTGGACCAT' 

GGGTTCCAGCGGGACAGAAGAGACTGCTCCTCCGGACTCCCCCAGTAGATCCTAAGGCCT 

TCTCCTTGTCTCTTGTCCAGGGATATCCCAGGGAAGGTGAACTTGCCCAGGCAGATGCGA 

TAGACAGCGCTCAGAGGAATCCGCTGCAGCTGCACACAACTCAGCATGATGAAGTCGTAT 

TTGCAGATCAAGAGAGTCTTGTCTGTGACCAGTAGMTTCTCTCCTTCTCATTGTTCCAG 

TGGTCTATCTTTGTCAAGAGCCAGAAGCCTTGAATGGTCTCTCCAGAAGTCTCAGCTACN 

TGACCTTTCAAGTCTTC 

Sequence 3875 

AGGTACACCGTGTTTCTGTTGTGTTCTGTGGTGGTTTGCTCCTCACATGTGAGCATTCAC 

CGTGGCCCTGTG 11 1 1 1 CCACCACACCTCTGCCTGGTGGCATCCTACTGGATGTCTCAGC 

CGAGCCACTCTGTTCTGAAGATGCCCTTGTCCTCCACCCCTTTGCTTTGGGCTCTGAGCT 

TCTTCTGCCTTTCATCTCTGTAGGATGAAGCCCCACCCGCCCTTCAGGATGCCAGCCTTG 

GCAACATTGTGAGACCCTGTTTCTACAAAAACTAAACAAGTGTAGTGGTGCACGCCTGTA 

TTCCCAGCTACTTGGGAGGCTGAGATGGGAAGATCACTTGAGCCTGGGAGGTTGAGGCTG 

CATTGAGCTGAGATCAAGCCACGGCACTCCAGCCTGGGAGACAGAGCGAAACCCCATCTC 

AAAAAAACAAAGAAAGTGGATGCGTGCTGCTCCTGCCATTGGATCATCGCAGCATTTTTC 

TTTTCTCTGATATGCGCCTTTTTCCTCATCGTGGGGGCTCATGTTTTCGTAATTCCCTCT 

CACTACTAGATTGTGAGCTGTGTGAAGGAAGGTTCATAGTTGAGTTGTATGTACCTGCCC 

GGGCGGCCGG 

Sequence 3876 

C C G GGCAGGTAC ACACACATCCTTCTGTGAAACAGTCTCACGG AG ACTCTCAGAATCCCA 
AGAATTTTCTTCAACGTTCTTTTGTTTTGATTCT 

TGTTTGTTCATTCTTCAATTCCAAGGCTTTATTTGGAACAGACTTTTGCAT^ 
AGGCTCGAAGGCAGATGGCTTCTCGGGAGGCTCTGCTTTGAAAGTTTGCATGTCCATCAA 
TTCTAAGGCTTTAGTTGGAATAGAAACTTTCATTCTGCAGGGAGCCTTCAGAAAACCATC 
ATTATCAGGAGACTCTTCTAATTTTCCATTTATT^ 
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CTTGGGTAAACACACATCCTTCTGTGAAACAGTCTCACAGAGACTCTCAGAATCCCAAGA 

ACTT TCTTCATAGTCCTTTTGTTTGGATTCTGATGGGGAGTATCTCATCTGCTCTCAATG 

TTTTGNTCATTCTTCMTTCCAAGGCTTTATTTGGAA 

GGCTCCGAAGGCAGATGGCTTCTCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATC 
Sequence 3877 

AGGTACGCGGGGGGAGGTGGATTTGAGAGATACCTCCCTCCTTCTGCTCAGCTGCCTTGC 

AGTAATTAAACTCTTTCTCTGCTGCAACACCCCTACTGTTCTCCGTGTATTGGCTTTTCT 

GGGCAGCAGGAAGGAAAAGCTGATGCGATGCTCTCAGTGCCGCGTCGCCAAATACTGTAG 

TGCTAAGTGTCAGAAAAAAGCTTGGCCAGACCACAAGCGGGAATGCAAATGCCTTAAAAG 

CTGCAAACCCAGATATCCTCCAGACTCCGTTCGACTTCTTGGCAGAGTTGTCTTCAAACT 

TATGGATGGAGCACCTTCAGAATCAGAGMGCTTTACTCATTTTATGATCTGGAGTCAAA 

TATTAACAAACTGACTGAAGATMGAMGAGGGCCTCAGGCAACTCGTMTGACATTTCA 

ACATTTCATGAGAGAAGAAATACAGGATGCCTCTCAGCTGCCACCTGCCTTTGACCTTTT 

TGAAGCCTTTGTAAAAGTGATCTGCAACTCTTTCACCATCTGTAATNGCGGAGATGCAGG 

AAGTTGGTGTTGG CCTATATCCCAGTATCTCTTTACTC AATC ACAGCTG NGACCCC AACT 

GTTCG 

Sequence 3878 

AGGTACGCGGGCAGGCTGAGACAGTCCTCATATCCTCTTGAGCCAAACTGTTTGGGTCTC 

GTTGCTTCATGGTATGGTCTGGATTTGTGGGAATGGCTTTGCGTGAGAAAGGGGAGGAGA 

GTGGTTACTGCCCTCAGCCGGCTTTGAGGACAGAGCCTGTCCCTCTCATGACAACTCAGT 

GTTGAAGCCCAGTGTCCTCAGCTTCATGTCCAGTGGATGGCAGAAGTTCATGGGGTAGTG 

GCCTCTCAAAGGCTGGGCGCATCCCAAGACAGCCAGCAGGTTGTCTCTGGAAACGACCAG 

AGTTAAGCTCTCGGCTTCTCTGCTGAGGGTGCACCCTTTCCTCTAGATGGTAGTTGTCAC 

GTTATCTTTGAAMCTCTTGGACTGCTCCTGAGGAGGCCCTCTTTTCCAGTAGGAAGTTA 

GATGGGGGTTCTCAGAAAGTGGCTGATTGGAAGGGGACAAGCTTCGTTTCAAGGGGTCTG 

CCGTTCCATCCTGGTTCANAGAAGGCCGAGCGTGGCTTTCTCTAGCCTTGTCACTGTCTC 

CCTGCCTGTNAATCACCACCTTTCCTCCAGAGGAGGAAAATTATCTCCCCCTGCAAAGCC 

CGGT 

Sequence 3879 

CAATTGGAGCCTCCCCGCGGTGGCGGCCGAGGTACNCGGGTCTACATGAAACCCCCAGCN 
CACAANGAGGCCAAGGCACCTTCCAGAGGCTTTGTGGTGCGGGACGCACCCTGGACCGCC 
AGCAAGCANGTGAGAAGGCT 
Sequence 3880 

CCGGGCAGGTAC I I I I II I I I I i I II 1 1 I I i I I I ! I NTGCCCNATTGCATATTTTTATAT 
AAACTGTCTTCATANATTCTTTCTTCTGCTTNAAAMTTCTGCTGTCCATGCTCTGTATT 
G C ATTTT AC ATTTATTG N ATACTTCAGCTCCAAAATTTGTTTG ATTTCTTAAATAATTTC 
AAC TTT TCTTG TC AG GTTTCTACTATTG G GTGTTC ATTTTCC TG ATTTC ATTG AATTGTT 
TCTCTGTTTTTNATTAAAGCTCAATMGATTTC^ 

CANAT(^TTTATATATCTCATTTTCTTTAGGATCAGCTACCANAAGATTACTATGTTCTT 
TTGGNGGNATATCCCATTGGGTTTTATATAC7TCTTATTGNATTATACTGATGNCNGCAT 
ATTTGANGGAACAGTCACCTCTTGCAGACTTTATGGGTTAGTTTCATTGGANAAATATCT 
TTT 

Sequence 3881 

CCGCGGTGGCGGCCGAGGTACACCCTGAAGCAGGTAGTGAAGCTGGCAGTTGGCCACTGG 

TTCCCACAGAAAATTGCTCTAGTGATGCTAGCAAATCCCAAAATTGGGGTTCAATCCAGG 

AGGGTTCCTGGCTTCATAAAGGAAATAATTCAAGAGCAAGCCAAGAGTGTAAAGTGAAGG 

CAAAGCAAGTTTATGAGAGCAAGAGAACAGGAAAATGGCTGCTCCACAGACAGAGCAGGG 

CTACTCCACAGGCAAAGTAGCACCCGTGGATTGTTGGCTAGCTATATTTATAGCTACTCC 
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TTAATTATGTGCTAAATMGAGGTCGGTTATTCATGAACTTTCTAGAAAAAGGTTGGAGA 
GTTCCCAGAACCAGGGGTTCCTTCTCTTTTAAACTACATGAGGTAGCTT.CTGGGCATTGC 
CATAGCAT 
Sequence 3882 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAGTCCAATGATGCTAAAAAGCAAGCACATA 

TGTAAGTCTGCAAAATAACCAGCTGACAGCATGACGACAGGATAAATTCCACACATACCA 

TTACTAACCTTAAATGTAAATGGGCTAAATGCTCCCATTGAAAGACACGGGGCAAGCTGG 

GTAAAG AACCAAGACCCACTGG AGTATGCCGTCTTCAAGCAAC CCATCTC ATGTG C AGTG 

CCATACATAGGCTCAAMTAAAGGAATGGAGAAAAATATTTCAAGCAAATGGAAAACAGA 

AAAMGGTGTTGTACTCCCAGTTTCTGACAAAACAGACTCTACCAATAAAGATAAAAAAA 

GAGMGGACATTACAMGGTGGTCCTGACCTTTGATAAATCTCATTATTGCTTGATACCA 

ACCTGGGCTATTTGTATTGCCCAAACCAATAGGATAATTTGCTGAGGTTGTGGAGCTTCT 

CCCTTTCACAGAGTCCCTGATCTCCGAAMTTTGGTTGAGATGTMGGNTGATTTTGCTG 

TACCTNGGCCGCTCTAGAACTAGTGG 

Sequence 3883 

CCGGGCAGGTACGCGGGGGGCTGCATGCCAGCCAGACACCCAGTCTTGCAAGACTGTCAT 

TGAAMTCTCCGTTTTGCTGTTCTCCGGGTCTCTGCGTCCAGTCTTTGTGTTTGGACGGA 

CCTGCCGGGCCATCTTTCTGCAAGAAGATAAAGGAAGACCAGGAGTGCCTGCCGAACTCC 

TATGG AGGAAGTCTAGGAGAGG AAGGGGACAGGGAG GAAGATG GTGTCTGCAAACCAGG A 

AGCAGCCTTGCCAGACACAGGATTGGCCACAACCTTGACCCCAGACTTCCAGCCTCCAGA 

ACTGAATTTCCCTGCACAGGCTCTCTTGCCTGCCACCATGTAAGACCTGCCTCTGCTCCT 

CCTTCACCTTCCGCCATGATTGTGAGGCCACCCCAGCCATGTGGAACTCTCCTACTGCAC 

TGAGMGCACGTGTTCTCCATTTCCCTGGGGGAGACCATTGTATTGGGCAGTTTGGAACA 

AAACACCATGGACTGGGAGGCTTACACAACAGAMTTTATTTCTTGCTGTTCTAGAGGCT 

GGGAAGCTCAAGGTGCTGGCTGCATATTCATTCT 

Sequence 3884 

CCGGGCAGGTACGCGGGCACTGGTGTGAGGGGTGGTGTGGTCCAAAGGACAGGCTGGATG 

GCGGGTGCATCGGGGTGGGTGTGGTCAGCATGTGTCTGCAATGCCTGTGGGCTCTACTAC 

AAGCTTCACAATATTAACAGACCCCTGACTATGAAGAANGAAGGCATCCAGACCAGAAAC 

CGAAAAATGTCTAGCAMTCCAAAAAGTGCAAAAAAGTGCATGGCTCACTGGAGGACTTC 

CCCAAGAACAGCTCGTTTAACC 

Sequence 3885 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGGAA 

CCGAG GGGTGACCACCTCTGAGATGTCCTTGACTTTGTCATAG CCTGGGGCATATTAGAG 

CATCTCTCTCACAGCTGNCTTTCTTATCCCCATTCTTGATGTAGGACCTCCTTCCGAGTC 

AAGCTTTATCTCCTCCTCGAGACACCANACAGAGCTTTNNTATCCTGTGCAGGGAGCNGC 

TCTTCCTCTGGACGAATAAGACTGCGTCACTTTCAGCTACTCCCACCACTGTCATCTGCC 

CATTGCCTTTGTGACTACAGGANAGTGNCCCCTAAGCCCTTCTAGAGCTTAG 

Sequence 3886 

CGGCCGCCCGGGCAGGTACTCTCATCCACTAGACTACATCTCACAACAAAGCTTTGCCAT 

TCAATACAAACAGAGCACACATGATGCACACGAATAGTCTTGCCAATCAGTGATACTATA 

GATGCTGCAACTGTGCCATCAAAACAGACAAGGAAAACACACATCCCTATAATCACTTAA 

TCTTMGTGTTTCATTMTGTTCTCATTAGGCCATTTACTCTATTTCAGTATGTTGCTGT 

TTCATGCACTATTATGTATGAGTTAGACATGAACACAGACTGAAATAAATATATTAAATG 

CATATACTATATTTAAAATAAAATAAAAACAACAAATCAGTAACTCTGCAGGCATTACCA 

AAACATCTTACCAAATTATGCATCTCATGCATATTTGCATTTACATTCTATTTAAGATGG 

GTCATTCCCACTAACACACTTGTGGTTTGATTTTGCTGTTTGTGTTTGATTT^ 

CACATAAMCCTGCAGCAACTGACCTTACTTCCAGAGTCCTTGGCTCCACATTCACCTTT 

ATCGTCCT 
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Sequence 3887 

CCGGGCAGGTACCCCCGATGCAATGGAGCTCAACATAAAGCAGCTGGATCAGAAAATGAA 
ATGTCTAGTTTACTTGTGATCTAGCTTAMGTATTTGCCCT 

CAAATGGAAATCTATAAAAAACTGACTGCTCTGACTTCAAAGTCACTAAAGATCGAGATA 

AGAAGACTGCTTACCAGCGTGGGAACTACTCTAAAGCTGCTCAATAACGTCAGACACGTC 

CAGAGTGGAGAACTCTGAAAGAGCCTCTGGTGACCTGACACTGGCACTACCCATCAGCAG 

AGAAAAGATGGCCTCCTGTGCCTGAATGTGACAGCCAGGCAACTATGTAAGGCAGTATCA 

GAGMTCTTTATGACATCACAGTAGGAAAGAATTTTTTCAACACGATAAAAAATATGGTG 

ACTCTAATAAATATCTGACTGTATTAAAACAAAGAGTTTC 

TCTMGAGAATGAAAAATGACCTCTGGGTCAAGGTATCAAGACAGAACAGAAGCATCTAA 
CTTCGTTCTCTCCCTATATTCCATTAAAAATAAAACAAAGATTCTrrTCAATAAAA 
Sequence 3888 

AGGTACTTGTGACAGGCAGACGTGATTGCAGCCACGAACACGATGAACTCACTGAAGTCC 

ACCTGGGCATCTCCATTGGCGTCCAGGTCCTTGAGCAATTTATCCACGGCATCCTTGTCT 

TTTCCACTCTTGCAGGAAGCCTGGTAGCTTCCTTCTCCATCANCACCTTGAGCTCCCCCT 

TTGGTCAGGGTCTGCGTGCTGCCCTCGCTGGCCCGAATATCGGGAAAAGGACGTCTATGA 

TCATGCCCATGGCTGTCTCTAGTTCCGTCATGGTGCTAGGATTCAGACCCACCTTCCTCC 

TNGGGGGGCTGGCAGGGGCCGANAAAATGTCCCCGCGTTCCTTGCCCGGGGCGGCCGGTT 

CTAAGAACTAGTNGGGATCCCCCCGGGGCTGCAGGAAATTCGATATCAAAGCTTAATCGA 

ATACCGGTCGACCTTNGAGGGGGGGGCCCCGGTTACCCAAGCTTT 

Sequence 3889 

TCGACGCTCAAGTCAGAGGGTGGCGAAAACCCCGACAGGGACTATTAAAGATACCAGnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCGAnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnCTTTCTCCCTTCGGGAAAGCGTGGNNCTTTCTNATAAGCTTCACGNCT 

TGNAGGTATNTCAAGTNCGGNGNTAGGTCCGNTCCGNTTCCAAGCTTGGGCTTGTGTGCA 

CCAACCCCCCC 

Sequence 3890 

CCGGGCAGGTACCCGGGCGTGTGAGGTGTCAGTCTGCCCCTACTTGGGGGTGCCTCCCAG 

TTAGGCTACTCGAGGGTCTGGGACCCACTTGAGGAAGGCAGTCTGTCCGTTCTCAGATCT 

CCAGCTGCGTGCTGGGTACTGGGAGAACCACTACTCCCCGCGTACCTCGGCCGNTCTAGA 

ACTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNNCCGACC 

TCNGAGGGGGGGCCCCGGTACCCAAGCTTTTAGTTTCCCTTTMTGAAGGTTAATTGCCN 

CGCTTTGGCCGTAATCATNGTCANTAGCTTGTTTCCTG N GIG AAAATTGTTAT 

Sequence 3891 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAGGCATTGAGACAGCCAGCGCAGGGGC 

CTTCTGCTGAGGGGGCAGGCGGAGCCTTGAGGAAACCGCAGATAANG ) Hill I CTCTTT 

GAMGATAGAGAT TAATA CAACTACTTTAAAAAATATAGTCAATAGGTTACTAAGATATT 

GCTTAGCGTTAANTTTTTMCGTAATTTTAATTAGCTTAAGATTTTNAGA 

AGACTTAAGAAGAGTAGCATGAGGAAGGAAAAGATAAAAAGGNTTTCTAAAACATGACCG 

GGAGGTTNGAGATTGNAGCTTCrrCATGGGAGGTAAAAMTTGTATTTAAAMGAAAATT 

GNAGAGAAAGGGACTACAGAGCCCCCGAATTAATACCAATAGAAGGGCANTGCTTTTNAN 

ATTAAAMTGMGGTGACTTAMCAGCTTAMGTTTTAGTTTAAAAGTTGTAGGTGATTA 

AAATAATTTGMGGGCGATCTTTTTAAAAAMGATTAAACCGMGGGGATTAAAAGACCT 

TGAAATNCCATGACCCNGGAGAATTGNCGTCATTTAAGCCTAANTTAACCCATTTACTAA 

ACCCAGNCGAAATGGAAAGATTANTGGGGGTGGTAGGGNTGAANCATTTTG 

Sequence 3892 

CCGGGCAGGTACTGGGCCTTACAGTAACCTGAAATTCCTGATTCTTAGTCAACCTGAAAT 

TTACAAGTAAGGTTGGGTCATTAATCTTAACCATGATGCTCACCAATATTGGATAGCACC 

TNGAGGAAATCCTAGCATTTTGTGCTTAGAGTTGGACTCTGAAGAAAATAGCCATTCCCT 
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CGTGCTGCGACAGTTCCATTCCTANAGCAGNCTTACAACTAGGTCTAAGGCTGGCCCTAG 

TGAGAAGGCCTGMGTGAGAAGCAAGGGCTTCTTTGAAATTGCANGGAGAAGCANGTCAA 

ACCTGNGCCAAGGGTTTCCCCGGTCAAAGNACCTCGGCCCCTCTAGAACTAAGTGGATCC 

CCCCGGGGCTGCAGGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnCAGCTTTTGTTCCCTTTTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnTGTTTCCTGNGNGAAATTTNNnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nNAAGCCGGGGAGCNTAAANTGTAAAGCCTGNGGGTGCCTAANGAAGTGAGCTAACTCAC 

AT 

Sequence 3893 

AGGTACCCTGGCCACTCCTTTCCTTTTGGCTGGCCAATGTCTCCTCTGTAGGCTCCAGAA 

GGCTCTCAGGGATGTAGGCCGGCCTCCCGCAGGGTTGAGTTTGCAATGGGAACAAAGACA 

AGCTGTGGTCCCATAGCACCCTCATCTGGTGGACATCCTTGCTACTGACAGTCAAAAGAA 

AGCCTTCCCAGATGAMTTTTAGTCCTTCTGCGCAGCCATGCTCTTCTTTCAGCAAAAGA 

GCCCCCGCGTACCTGCCCGGGCNGGNCGCTCTAGAACTAAGTGGATCCnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnGCTTATTCGGTACCCGTCNACCTCGAGGGGGGGGGCCCGGNAC 

CCAGCCT 

Sequence 3694 

CCGGGCAGGTACGCGGGGGGTTTTATAAGGGGGAGTTTCCCTGCACAAGCTGCCTTCTCC 

TGTCTGCCACCACGTGAGATGTGCCTTTCACCTTCCACTATGATTGTGAGGCCTCACCAG 

CCATGTGGMCTAGTTCAAGATTTCAACATTTTAAGATAATCTGAAAAAMTTGGAATGT 

GCCCCCAGCTGCCAATACTCTCTCAACACTGTTTATCAAAGGAACCAATATGGAAGGAAA 

AACATTGCTTTGAAGGGAAGATGGATGGATCCCCTAGTAAACTACAGCTTGAGGTGCAAT 

TTCCTAAAGTTCAAATGCCCGAAAAAAAAAAAAAAAAAAAAAAGTACCT 

Sequence 3895 

CCGGGCAGGTACATGGGCTGGGATTGATGGGAAGGGACAGATGTAGTTGGCAACCATGGT 

TAGCATCGGATGCCCATCCCAATGGCCATGAATGTGCCAAAGGTGCCCGCCACTCTGCAT 

CATGGTTTTCCCAATGCCCGCCCATCAGCTCTCGACCCCGCATTCCGATCCTGAGACAGG 

AAAAGGTGCCGAAGAGCGCCCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGGCGTAATCATnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCACACAACATACGAGCCCGGGAGCATA 

AAAGTGTAnnnnnnnnnnnnnnnnnnnnnnnNGAGCTNAACTCACATTAATTTGCGTTTG 

GGCTCACTTGCCCCGCTTTTCCAGTCCGGGGAAACCTTGNCGTGNCCAGCTGGCATTTAA 

TGAATCGGNCAMCCCCCGGGGGAGAGGCGGGTTTGCGTATTGGCGCTCTTCCGNTTCCT 

CGCTTNACTGGNTTCGCTGCGCTC 
Sequence 3896 

AGGTACTAGCTGCTTGTTACAAACAAAATATTTGTGTTCCCCCAAAATTCCTATGAAGTT 

CTGACCTGCMTGTGATAGTATTTGGAGGTAGGGCCTTTAGAAGGTAATTAGGGCTAGAT 

GAGATCATGTGGGTGGGGCACCCATGATGGAATTAGTGTCCTTATAAAAAGAGGAAGAGC 

TGGTGTiCTCTGTCTACCAACTGAGGACACAGGGAGAAGGCTGCCGTCTGCAGGCCAGGA 

AAAGAGCCCTCCCCAGAGCTCAACCATGCTGGCACCATAATCTCAACTCCTAGCCTCCAG 

AAG N A7GAAAN ATAAATGTCTGTTG NTTAAGCCCCC AGTCCTATGGTATTTTACTATG AC 

AGCTCAAGGCAGACTAAGATTGCTAGTANCCTGCCCGGGCGGCCGCTCTAGAACTNGTA 

Sequence 3897 

CCGGGCAGGTACGCGGGGGCCCTTGCTCACTAGAAGGAGCCCCCTGACCCTTTCTTCCAA 

AAATACTTTTTGTCTTTGTCTTCGTTTCTGCATTCATCCCCCTTCGTTCACTCCCATM 

AACCAACAGCGACAACTGGCACCTAGGACAGGGACGTGAGTGAAGGTCTGCTGGAGCAGA 

GAAACTGACACTGACAAGAAGAACGAGAATGAGAAACCCTGCGACGAGTCTGCTGACAGC 

TGATATAAGTCTGCAAAATCAGCTGGACCACCAAACGGCAATTGAGAGCTGCACAACCTG 
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CMmCCTTTATTCMTTAATGCAAAAACAAAAATGGGGAGATGTTACAGGTCGAAAGA 

ATGAAGGTCGTGACCAACTCAGTATACCACTGAAGGCTATATGAGCAAACAGAAAACTGT 

TCTCATGAAAGCAGGATGTTGGAAAACTAACAACTGTGTCTGCCGCCAGAAGGGTTGCTG 

AGGGCAGTCATGCCCCAAGCGCAGTGTTCCTTGTGATTATCTATAGTAACATCTGAAGCC 

TGTTGNACCTCGGC 

Sequence 3898 

AGGTACTTT l I ) M ) I ) H M 1 I 1 1 1) II I I ITTTC l I II I I 1 I I 1 1 I I I I I 1 1 1 I I 1 I I 

TM n I I II I I I I I I I I I I n I I I I 111 1 I I I I I AAAGCCNNAAAAAACTGCCCNGGGGG 

GNANCAGGGACCCTTNATTNNNAAAAANATNAGNCCCNAAANAAAAANTTTAAAAAAAAC 

CCCTNGCCGGGANNAAAGGNANAAAAAACCTNAAAAAAACAGGNCCNGGNAAAAAAAANA 

CCCGGGNAGGGGGGTNCCNANCCNCNAATCCCCNGNCCGNTNAAAAANAANGGNACCCCC 

CGGNCNGNNGNAATTNNANANAAGCNTNATCAANCCCCCCANCCNNNNGGGGGGCCCCGG 

CCCCNANNTTTNGTNCCNTNAANNNNGGGTAAANNNCCCCCNNGGCNAAAANAGGGNAAA 

ANCTNTTNCCNGGGNAAAANGGNNATCCCCNNAAAANCCCNNAAAAAAAACAACCCGGNA 

ANAAAAAGGGAAAACCCGG 

Sequence 3899 

ATACCTCACCGTGGCTGCTGTCTTCCGTGGTCGGATGTCCATGAAGGAGGTCGATGAGCA 

GATGCTTAAC GTGCAGAACAAGAAC AGCAG CTACTTTGTGGAATGGATCCCCAATAATGT 

CAAGACAGCCGTCTTGTGACATCCCACCTCGTGGCCTCAAGATGGCAGTCACCTTCATTG 

GGCAATAGCACAGCCATCCAGGAGCTCTTCAAGCGCATCTCGGAGCAGTTCACTGCCATG 

TTCCGCCGGAAGGCCTTCCTCCACTTGGTACCTCGGCCGCTCTAGAACTAnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGATACCCGTCGACCTCGAGGGnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnTAGTGAGGGGTTAATTGCGCGCCTTGGNCGTAA 

TCATGGTTCATAGCNTGTTTTCCTGNTGTGAMTTTGTTATCCCGNTCACAANTTCCACA 

CCAACATACCAAGCCCGGGNAGCATAAAAGTGNTAAAGCCCTGGGGGTGCCCTAATNGAG 

TTGAATTNAACTCACAATTAAATTTGGCGTTTGNGCTTCACTTGCCCCGCTTTTCC 

Sequence 3900 

AGGTACACTTCACGCCGGCAAAGCAGGGATTGGACAGGCATCCATCTGAGACAAGGAGAG 

AGAGACAGTCACAGTAAATGGTTGGTCTAAGCTGCCATACTGQCATGCTGGGCACTTAAC 

ACAGTGTAAGATATTATAGGGTATAGGGAACCGATAACTTGTTGGGATTATGTTGTCCAC 

CTGCATCTTAG AGCAATCAAATCGCTC AG G ACTAACCAAGCTGG AATCTCCTCG AATG CT 

GACTGCAGCATTCAGTAAAAGAGCAAAGCTCCTGGGTCGTTTCATCCTGGGCTGCGTGGC 

TCACCAATTGGACAGTCCTGCTTGTTGCAGATCTGGTTTTCTGTTACATCACCAACGCAG 

TCCTTGCCTCCAAACTGGGGTGGGGGGGTTGGTTGCAGAGACGACTACGTTTCTGTACCT 

GCCCGGGC 

Sequence 3901 

CCGGGCAGGTACGCGGGGAAAACATCAAAGTGGCAAATAAAGAAGCCCAAAGTATCAACA 
TTAAGAAAGCAACAAAATAAACAGCTTCTGAAAACCCCAAAATGTGTGTGTGTGTGTGTG 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTGGGTGAAGNGNG 
Sequence 3902 

CCGGGCAGGTACTGTGTCTGATGCTCCACACAGAAGGATGAATGGGACCAAAGGTCCTAC 

TCTAAACCATTCACAGAGGAGTAGTGAGACCTATTAGAAAGGATTTGAMCAAAAGCTTT 

CAGGTGATCTGCAACTCTTTCACCATCTGTAATGCGGAGATGCAGGAAGTTGGTGTTGGC 

CTATATCCCAGTATCTCTTTGCTCAATCACAGCTGTGACCCCAACTGTTCGATTGTGTTC 

AATGGGCCCCACCTCTTACTGCGAGCAGTCCGAGACATCGAGGTGGGAGAGGAGCTCACC 

ATCTGCTACCTGGATATGCTGATGACCAGTGAGGAGCGCCGGAAGCAGCTGAGGGACCAG 

TACCT 

Sequence 3903 

TGCGAGCCCTCTGCCAGATCACTGATAGCACCATGCTGCAGGCTATTGAGCGCTACATGA 
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AACAAGCCATTGTGGACAAGGTGCCCAGTGTCTCCAGCTCTGCCCTCGTGTCTTCCTTGC 

ACCTGCTGAAGTGCAGCTTTGACGTGGTCAAGCCGCTGGGTGAATGAGGCTCAGGAGGCA 

GCATCCAGTGATAACATCATGGTCCAGTACCTCGGCCGCTnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnTCCCTTTAATGAGAGGTTAATTGCGCGCTTTGGCGTAATCATT 

GGTCATAAGCTGTTTCCTGNGNGAAAATTGNTATCCNGTTCACAATTCCACACAACATAN 

CNAACCCGGGAGCATAAAGTGTAAAAGCCTGGGG 

Sequence 3904 

AGGTACGCGGGGATCCAGAATACATTTCCAACAAGAGCACTGGCCMGTCAGCTTCTTCT 

GAGAGAGTCTCTAGAAGACATGATGCTACACTCAGCTTTGGGTCTCTGCCTCTTACTCGT 

CACAGTTTCTTCCAACCTTGCCATTGCAATAAAAAAGGAAAAGAGGCCTCCTCAGACACT 

CTCAAGAGGATGGGGAGATGACATCACTTGGGTACAATACTTGGGCCGAAATCTGTCAGG 

TCAGCCCAACTTTCCTTGTCGTGTCAATGCTGTGCCTCGTCCTATACCGGAGAAAAAATG 

GTTCGTGGAGCCTGCGGTATTGTTTGCCTGGGTGGMTTTTTACCTTTTGGTTCAAT^ 

TATTG AAATGTATTTCATCTC CAC NTCTTTCTTGGGCATATAAGATCTATTATGTCTATG 

GCTTCATGATGCCTGGTGCTGGTTATTCCTGTGCATTGNGACTGTCTGTGNGACTATTGT 

GTGCACATNTTTTCTACTAAATGCANAAAGATTACCCGGGTGGGCMTGGGACA ANNTT T 

TCTCTCTGCTTGCATCAACTGCAATCTATGTNTACATGGTNTTCCTTTTACTACTATTTT 

TNAAAACAAA 

Sequence 3905 

TG CAGGGTGAACTCCCCTTCCTTCTCAATCAGCCACTTGTTTTGAACAAACTTCTG CAGC 

ACCTGCTCCGCTTCCTTCTTCCTCATCTTCTTGCCTTTAAGTTGATCAACCAGGTTCAAT 

ATGTTTGTGGAAGACGCAAAGCCGGTTTCTGAGTCMTAATCAGTTCCAGAGCCTTTCTA 

MCAMTCCAGTTCATTCTCTGCAAAATCCGTAGCCATTTTGGAMTTGAAGTTGTAGCA 

AGATTCACCAACGCATAAATGGGTCTCCCATCATCTTCCGTGACTCCTCTCTTTATCTCA 

ATATACAAGGGACTCCAAGACACTGTTAATGTTGTTGATGAAGTCCTCCAACTTATCTAC 

GGTGGCATTTGCCGGTCATGGACCCTGTAGCAGTGCGTCTGCAAGCGCTTCACGTCCCAT 

TCCTCTAGCACGCCATGGGTCATCAGCAACTGGAGGAAGCGCCGGTGGACATCAAGTCAT 

GACGNCCCATTCTCCTTTGTGCTGCCCTGCATGTGGGAACGAACAGGGAGCCCAAGGNGC 

ATCCCANGCCGNCCCCCGCGTCCTCGGGCCGNTCTAGAACTAAGTGGATCCCCCCGGG 

Sequence 3906 

aggtac i i n i m n i m m u i m n t cc i uii ni iimmn ittttttt 
m n n m i n n n i i m i i m i i i m i i i i i n m i i i i i u n m <n ttttnn 

NN TTTTTTTTTTTl I NTNNNNNNNN I 1 I I 1 11 III I NNNNN1 1 1 1 I M 1 I 1 NGGGGNAAA 

AAAAAAAAAN N AN AAAN N N AAAAAAAAAMN/\MNTTTTAAAAAAANCCCC nn cn n n n n n 

GGGAAAANCCCCNNNAAAAANNNNNCNCNTTTNGGGGGGGGGGGGNCCCCCCCCCCCCCC 

NAMNTTTAANNNNNNCNNNAAAAAAAANCCGGGNNNNCCCCCCCNAAAANNNCCCNAAA 

ANNGGNNNNNNCNNNAAAAAAAAANNCCNTTTTGGNNNNGGGGNAAACNCCCCCCCCCCC 

CNGGGG 

Sequence 3907 

NNN 1 I I II I 1 1 1 1 I I M N 11 1 1 1 M 1 I GGGG 1 M 1 11 1 1 1 N 1 11 1 11 I 1 1 M 1 1 NTTTTT 
rTTTTTTTTI NTTNTTNTTTTN nTTTTTTl rNGGGGGGGCCCGGGGGGGAAAAANTTNT 
TTTTNTTTTTC AN AAAN NAAAAATTTTNGGGG GAAA AAAAAAATNAN AAAAAAAAA ATTT 
rTTTTTTTTTTI [ 'GGGGGGTTTTTNGNGGGGCCCNC M 1 1 I I 1 1 1 I NTTCTTCNTCTTTT 
TGGGGGGGGGGAAAAAAAAANAAMNAAANNTCTTTNNGGCGCNNAAAAAAANGAGGGGG 
GCNCTTTTTATTmNTCTCCNGGGGGGGGGGNTTTTT^ 

CCCCCNCCCCGCNNNGGNNAAANTTNNTNAANNTN TN1 111)1) 1 1 NTCCCCCC CCCCCC 
CCCCCCGGGGGGGGGGGGGGGGNCCCCCCCCTCI I I I L I I I I I I I I 1 1 I I I I I NTAANAN 
GAAAAAAAANNCCCCCCCCCCCNNAAAAAAAAAAAAAAAAA 
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Sequence 3908 

GCGCGTAATACCGACTCACTATAGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCC 

GGGCAGGTAC I I I I M I I ( I I I I I I I 1 1 II I I M I 1 IGGI 1 1 1 I I M 1 1 I 1 I 11 ICTGGG 

TNAAGTTTATTCTGTTTTCACATCTAGGTTGNTGGGGAGAGTGATAGACAAAGAACNGGA 

TTCTGGGCATCGTCGNCGCATGCTTGAAATCCTACTNGNGAGGANAAAANAAGANGAANA 

AAAAGAAGCNAAGGANTAGAANNNANAANGNGACNANGAAAACANNTANNAAAACCANAN" 

GAACNNAAAANANNAANAANANCANANNNAAACCNCNANANNNTAANAAAACATAANANG 

G N AG ANC N AN C N AAN AAN AAN AN N ANAN AATAAAAN AATAAAAAAAAAAN AN AN AAC N N C 

AAAAANAGAAAAAGAGAGAANNACAAAATAAAAGANCAANANAACCAANAAACAANANAG 

ACNANAAGANGAANNCAANAANCNNAAAACNNNANACMATNNCANANAANNNANAAAAAN 

AAAAAMCANANAAAAANANAAANANAMNNCAAAAANANAANANANAAAAANACANAAA 

AGANNANAAANAAATNAAAAAAAAAA 

Sequence 3909 

TCGCCACCAACGCGCTCTCCCAACCCTCGGAGTCAGTTCCAAAACCAAACCAAAAAGGAG 

GAGAGCAGACAAGGCTTCAGCTCTGCGACTGAGAACAGGGAGGCGGCACGCCTTGGGGAA 

CCGAGGGCCCGAGGCCATACCGCGCTAATACAGGCCAGTGTCGGCGTAAACTTTCTCCGT 

TACCGGCCCGNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNAnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnNNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnNnnnnnnnnnnnnnnnnnnnnnnnnrvinnnnnnnnnnnnnnGTAAAAT 

TGTTATCnnnnnnnnnnnnnnnnnnnnnnnTACGAGCCGnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnTAATGAAGTGAGCTNACTTCANATTTAATTGCCGTTGCGCTCACTTGCC 

CGCTTTTC 

Sequence 3910 

AGGTACATCTTTCCAAACTGGACATCAAGGAATTGCTACACAGAANAACCACATCCAGGA 

TAGAAAGGACCAGCCAGAGCTCGTTCAGTNGTGTATTTNAGAATCATCAGGGAGGAGCCC 

GGTAGCAGAATTATCCCGTTAACACTGANACCACATTTCACCTGTTACTTAI I I I ICTAA 

ATGGAGTAGGAGAGAAAGATGCTAAAATGTGATCTTGGGAGGAGTCCCCCAGTCCCCCGA 

AAAAG AAG AAG AAAAAAAAAAN G N AG AACTCC AAACCCCTGTGGTCATGGCAATTNCCCA 

TCANAACTGCCCATGTTAAAATAAACACAAGCTGCTTAAAAGTGAATGTTAAGAANGGAA 

GGCACTGGTTAGTGGGTGGTCCTGACCAATCCCCTGGAGCTGGANAGGGGTTTTTCCNAC 

NGGCCTTTGGTTTAATGTGTCTGTAAACTNCMCAAACCTNNCGTTTTCCCCTGAGGTAA 

TTCTGC AAAAAAC ATCAGTTNTTCAGTCCCCAGAN NACCATTTCCNTATTAAAC CTN ACC 

CCCCGAAATCATTTCMTNGTAACATTTTGCCNTTA1 I 1 1 1AGA 

Sequence 391 1 

TGTTGGAGOTTTCAGMCGTTCCTCTTCTTGTGTAGACACGAGATACCGCTGTAACTTTA 

CTCCATCACCAGAAACCTGTGCCTTGAGTCTAGTCTCTGCTCAGCCCTCGAGAGCATGGT 

TGGATTAACiGAAATAGTCAAAGGCAAGCAAGACAAACTCCCTAGGTTGTCTTGGTCATGC 

GTClTTT^/iAAAGGGAAAGAAGAAAAACCACCACAACAGATAAAAGCACCGAGGCCTGGG 

CTTGTGTCAGGAGCCAGAGAGGTCGTCCTTCCTTGAAATCCCGCGTACCT 

Sequence '3912 

TGGAGGGCCTGGCTGGGAGGTTTCACCCAGTGAGCAGAAATGTATCAGGGTCCCACTTAA 

AGAAGCAGTCTCGCTATGATCTGGCAAATCCACTGTACCT 

Sequence 3913 

TAGTCJTCACAAACACAGAGGGCTCCGCACTCCGCATGTAGGTCCACATTTTATCAAACA 

CTGCAATTTTAGATCTCCTGAAAAACTCTTTAGTGGAGCCAGAGTCTAATGTTCCATAGG 

CMTTTCTGTTTGCTTAGAAAGATCCTCAGCACTTTCGATGGGAGACACCATCCTCTCTA 

CAGTCAGGAAGGCAGCTAAGTTAGCCGTGTAGGAGGAGATTATGATCAGGGTAAAGAACC 

ACCACACACCTCCAACAATGCGCCCAGAGAGGGGATCTAATAACAAGTATGAAATGGATT 

TGTAAAGTGAAATGAATGACCTAATAAAAAAACAAGTTAGCAGTATTATGCAAATATTGA 
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CTTATAGGGGMTATTTMTTTTATATTTTTCAATGTATCTACTAGTTAATGAMCTGGA 

AATCATTAAACATTTTGGGGCAAGGAGTACAGCCGGAAACCTCGCAGCCGNTCCTTGCAG 

TCCAC 

Sequence 3914 

AGGTACCATAGCCAAATCTGGGACAAGCGAGTTTTTAAACAAAATGACTGAAGCACAGGA 

AGATGGCCAGTCAACTTCTGMTTGATTGGCCAGTTTGGTGTCGGTTTCTATTCCGCCTT 

CCTTGTAGCAGATAAGGTTATTGTCAC7TCAAAACACAACAACGATACCCAGCACATCTG 

GGAGTCTGACTCCMTGAATTTTCTGTAATTGCTGACCCAAGAGGAAACACTCTAGGACG 

GGGAACGACAATTACGTGAGTATGACCAATTCCTTATAAGAATTAATAGTCATGGTGAGG 

ATCTTGACTCTCCAGTTCTATTCTCTGAGCAAATGACTTATTCTGGGCCTTMTTTTCCT 

GCCTTATAAAATGAGGAAACAGMCTGTTGTCACTTGAAGAAATGTTTGAGGACCTGCAA 

AGGGCCAGGCACAAAGGTTTTGTGATTGCTGGTTGTACCTGCCCGGGC 

Sequence 3915 

CCGGGCAGGTACATCATTTCCAC3AGCAGGCACTGGCGGCGAGATAGGGTTGGAGGAGAAG 

TAGCGCCGGGACTTCCGGATGGCAAACTTCTCTGTGGGTAGAGATTTCCCAGCAATCTTG 

AGCTTCAGGCCTGGCACAGCTCGAAATAATTCCACTTCGTCGTCCCCGAACGGCTTGTGG 

TCCTCCTTCCCAAACATGCTGAGGTAGGCGGCCTTCATGTAMTGTAGGTGGCCTTTTTA 

AGTCAGATCATGTCAGTTCCTTCTGGAAATCTGGTTATATTCCATCACACTCAGGAGACA 

TCTCCTACAATTTCCTTGACACCTGCAGCACTCCAGCCACACGACGGCCTCAGGGCCCCG 

CGTACCT 

Sequence 3916 

AGGTACGCGGGGAACACATCCAAGCTTAAGACGGTGAGGTCAGCTTCACATTCTCAGGAA 

CTCTCCTTCTTTGGGGAGCCTCAGATGGGAAGGGACTCGAGCCCCACCTGTCCCTGGACT 

CTGGAATGCCACGGAATTAACCCGAGCAGGCATGGAGGCCTCTGCTCTCACCTCATCAGC 

AGTGACCAGTGTGGCCAAAGTGGTCAGGGTGGCCTCTGGCTCTGCCGTAGTTTTGCCCCT 

GGGGGCTCACCGTGTGGGGGAGCAAATCATGTATATCCAGTGCTGTGAGTGGCTCCAGTC 

ATGGAGGAGCAAGGATGAGTTCTGCCTGGAAGAATCTGGGAAGGCTTCCTGGAGGAGGGA 

ACAATGGCATGGACCTTGMGAGTCAGAAGCTTTCMTTCATTCCATTCAAGCATTGCTC 

TCATGTGGCATTCAAGCATTCTATAGTGCTTGCCGTGACTCAGGCGCACAGTGCAAAAGG 

MGCACATCTTTCTCTGCCATGAGGACTTATTAGTGTCTGMGAGCTTTTTCTGGACTAT 

AGGAGAAAGTCATGGNCTCCCTCACTAATAAACACTG 

Sequence 3917 

AGGTACGAGATGCCACGGCACGMGTCTACGTTCTCCTGATCCGAMCATCTTTTTGAAA 

ATATCAATCATTGGCATTCTTTGTTACTATTGGCTCAACACCGTGGCCCTGTCTGGTGAA 

GAGTGTTGGGAAACCCTCATCGACCAGGACATCTACCGGCTCCTTCTGATGGA1TTTGTG 

TTCTCTTTAGTCMTTCCTTCCTGGGGGAGTTTCTGAGGAGAATCATTGGGATGCAACTG 

ATCACAAGTCTTGGCCTTCAGGAGTTTGACATTGCCAGGAACGTTCTAGAACTGATCTAT 

GCACAAACTCTGGTGTGGATTGGCATCTTCTTCTGTCCCCTGCTGCCCTrTATCCAAATG 

ATTATGCTmCATCATGTTCTACTCCAAAAATATCAGCCTGATGATGMTTTCCA GCCT 

CCGAGCAMGCCTGGCGGGCCTCACAGATGATGACTTTCTTCATCTTCTTGCTCTTTTTC 

CCATCCTTCACCGGGGTC 

Sequence 3918 

CGGCCGCCCGGGCAGGTACAGTCCACTAGCACGGCAGCTATAGTGTGGGCATCTAAGAGT 

GCCCTGTAAGCATGTGTGGCCAGGCTAGGGCCATGGGAGAGGCAAGCAGACTAAGGAGTG 

CTGCCATCAGACCAGCCCCATCTCATGTGCAAGACTGCCCAGCCTCCAGAGATCAGGTCT 

CAGAGGAGAACTCTCTCAAAAGTGAACCCCTGGTCCGCCTCAGACACTCTGAAGCCCTAA 

GGCTGAAATGCCTGGGTCGCCCAAACAGCAAAGATGACAATCTGGTCCTCCCCCTGGGAG 

CTCTGACTCAGTGAGGCCTGAGACCTCTGTCGGCCAGAGAACAGCAGTCAAGGTAGCCAG 

A 
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Sequence.3919 

CCGCGGTGGCGGCCGAGGTAC'I I i I I I I I I i { I AACAGTTAAGAATGTGTGACGTTATTT 

ATGACCAGAAAGTTCAGATTGAGTCTAAAAATCACAGTAGAGGCTAACCCGTATTATTAT 

TATTTAAACACAATTAAACATTTAAAGATGAGTTCTGGGGGAGTAATTACTCAGATGTTA 

ACACTAGTGCCCATCAAAGAAGGATTTTTAAAAGATACATTTTCACATTTCATT^ 

AAMTCTTGATTGTGCT ACTCT ATTAATACCATGAATAATCTTAAGTAGCAGGTAAGCAA 

ATAAMTACATAATTGTATTTTGGCTTATTCTGTGTAAGTTTACAMTCTGTTTACCTTA 

TTC AAC AAC AAAAT ATTT ATT 

Sequence 3920 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATGCCAGGAAGATGAAT 

GTG CGTTAATGTTGCTG G AAC ATG GCACTGATCCAAACATTCCAGATGAGTATGGAAATA 

CCACTCTACACTACGCTATCTACAATGAAGATAAATTAATGGCCAAAGCACTGCTCTTAT 

ACGGTGCTGATATCGAATCAAAAAACAAGCATGGCCTCACACCACTGCTACTTGGTGTAC 

GCGGGGAAATATCGCCATCGGCATTTCCCCCTACGTTNTTTGGGTGAGTGAATTTCAGCC 

TGAGAGAGGAATAGTMGATACAGTGAAMTGTGMTAGGATCATGTGTTCTTTTAATTT 

GMTTATGTATTCTTTTTCTTCTAT7TTCCAAACTC 

Sequence 3921 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTAAACTTTCTGAATAACT 

GAGACATGTCTTAMTTTTGGGAGTTCACMGGGAMCMGGGTAAGAATTAAACTTCAC 

TMTATTTCTCACCCTCCA MGC CACTrCMCCACTGCCTTTCTCATTTCGGTTAATAGC 

MCTCCAATCTTCCAGCAGTTTTAGGCTGCAAMCCTTAGCTCCTCTCTGTCTTACACTT 

CACATCCAATCTGTCACCTATTCAAAGTATCCCCCAAATCGAGCAATTTCTCACCATCTC 

TTACTAATACACCTCAGGTCCAAACCACCAGTATACCTCCCTAGAGTACCTGCCCGGGCG 

GCCGCTCTAGAACTAGTTGGATCCCCCGGGCTGCAGGGAATTCGATTATCAAGCTTATCG 

ATACCCGTCGACCTCGAGGGGGGGGCCCGGTACCCANCTTTTTGTTNC 

Sequence 3922 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAGAACAAACGGAAGCCTAAAGATGGACC 

ACAAACCTTTGGATATCCAGGATAACTGAGTTAAAMCCTACCACACGGTTAATAAAATA 

ATGATGTCCACTGAGTCCATGACCACAAAGCAAATCACCAGCAGCAGGATGGCATGGGCG 

GCCCTTTTCGCTGGGGGGAGGGACTTGGAGCAGAGGCTGGGGCTGTGAAGGTGATGGGAT 

CACCTCTGATGGTTAAACAAAGAGTACCTGCCCGGACCTGCCCG 

Sequence 3923 

CCGCCCGGGCAGGTACANAMGAGGTTTGTCTGACTTAAAGTGATTTGTCATCTTTCTTT 

TCCACTCTCCTAATCATGATCTGTTCCCCCTOCTGGCCCAATAACTTAATGCTTGNAAAC 

TGAGGTCCACCCATTTATGGMGNNCTCTTTTAGTCCACAGGAGNTCCCTTNAACACCCC 

TTTNTTCTTGTTCCCTGGTNGACAAATTGGTATTCTGGNAANMTTANNCCATNNTTTGA 

AMNCCCNCCfTTTTNANTAGNGGGTCGGTlCCCNAANGTGNTGNACCTANAAATTGNGA 

GTTNAGGGCCAAGATTGAACACCATGTCCAiNITTTAAAAAANGNGGATATANAGNAAANTA 

NN AAAAAN ATTN NATN N AACAAACN N AGGT AN AAATATTC NGTTTNTTTTAAAGG N ANTC 

NNNATCNTNNTN ANNN AGGGCGAACACACGCAATTNCNG l ill I I I I I NAATTNAAAAAA 

AAMGNTGGNGATTTTNTTCTCANAATT77NACCMCCCAATTAANTGGCCCCCMTTM 

TTNTTrNTCAACCTTGGGAAAMTTCCACCCANNGNTGCCMNTACMTTTTTNTAAATT 

TTNGGGTAAAANAAATTNGGCCCAACANANCCTTTTTTTAA 

Sequence 3924 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGCTCCACCA 

CGGTGCAGGACAGCTCCTCGTCCCGGAAGTAAGGTATAGTCAGGTGATGCAAACTCATCA 

TGGGCACCATTACCCACTTTTTCTTGCTCAAGTAGAACCTTGACTGATGAGTATCTTGGG 

GGTCAAAGGGCATCCCCCATTTGGCTTTAAAGAAGATGTAATTCACCAGGACCATCATTG 

TCTGCGAGTCAAGGTCCTTGATCAAGATCTGTGATTTTCCCCCTAGTTCCATTCTTCACG 
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TAGTCGTTGATGAGCTTCTTAGCTGCAGCTGAGTCCTGAAAGTCACCCGCGTACCTGCCC 
GGGCGGCCCGCTCTAGAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnGATACCCGTCGACCTCGGAGGGGGGGGCCCCGGTACCCAGCTTTTTGTTTCCCTT 
T 

Sequence 3925 

CGAGGACNCGGGGCCTTTTCACAAGATGGCGCCGAAAGCGAATAAGGAAGCTCCTGCCCC 

TNCTAAANCTGAANCCAAAGCGAAGGCTTTAAAGGCCAAGAAGGCANTGTNGAAAGGTGT 

CCACATNCACANAAAGAAGAAGATCCGTACGTCACCCACCTTANTGTTNGNNGGATCTTN 

AGNANTAANGTGGAATCCCCNCGGTGCTTGTAAGCGNAATTCCTTATATCAANNGCNTNA 

TANNATNACCNGTTCMCACTTAGAGCGGGGGGGACCCCNGNTACCCCNAGNTTTTTTNG 

GTTTTNCNNTNTTATGTNNMGGGGTNTTATATATTGGCCGACANCCTTTGNACNTTTAN 

ATnTCATNGGGNNNGCMTTAGGNATGGTTTGTCNCCTTGGTGGGGTGGAAAAAAATGTC 

GTNTGATTNNCCGGCGTTCAACNAAAATCTCCCCAANNAACCAAANCCAANTAACCCNAA 

GTACCCNGCGNGAAAGTCCAATTANNAAAANGTTGGTTAAAAAAGAACCCATTGGGNGGN 

TTTNGCCCCTTAAAAT 

Sequence 3926 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCAGCTGGAGGTGCCTCA 

GAAGGTGCATTCTGCTTCCTGCAGGGGCTTGAAACACCAAGGCACTCCAGGGATCCTGGA 

GTCAAAGCAGCAGTCCCGGTTGTTGCACTCCTTGGGGGTGACATGGGGGTAGCCGCAGTC 

CACCCTGTCCTTGGCTGGCACGGCACACTTGGTTTTGCAAGACAGGCCCACGTACCTCGG 

CCGCTCTAGAACTAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnCGTAATCATGGTCATAGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnCACACAACATANCGAAGCCCGGGGAGCATTAAAAAGTGTAAAAGCCTGGGGGT 

TGCCTA 

Sequence 3927 

CCGCGGTGGCGGCCGAGGTACAGAAGATGATTGACCTGGAGACACCATCCACCCCGATTT 

CTTCTCACAACTTGATCAAAAATGGCCTCTAAGCCTAAGTCCACGGCATCCCCACAATGG 

ATGTTMGGTTCTCAAAGGGAACAGCTCGGATCTGGTGTTGAAGAATGTCAGTTAATGTT 

TCCMGTCCAATTTGTTCCTAGACTTCTTATAGCCAATTCTTTCAAGATATGCTTCAATG 

TCCATGATCCCCTTTCTTAGTMGTTGAAAGTAGGTTGATCTTCAGTTTTAATCCAGAAG 

ACAGATTGCTGCTTCTTCCTGGCTTGAATTCAGGTTATTTCAGCCGGCAACCAGGNAACT 

AGGACTGNAATGCAATCCTCCCACATCCAAGGTCCCCCCCGCGTACCTGCCCGGGGCGGC 

CCGCTCTAAGAACTAGTGGGGATCCCCCGG 

Sequence 3928 

ATTGGAGCTCCCCGCGGTGGCGCTTTGCACACACGTCATCAACAGACAGCAGTCAACAAG 

CTGCCTAGGGCCTCTGCAGAGCCAGATCAACCCCACAGCCTGTGAGCCAGGCCTTAAGCT 

GCCTATTTGTGCTGAAGCGGGCTGCAGAACGACCCGTGTTATCCACCCTGAGCACCACCA 

ACACTAAGCACAAATGACAACACCATCTCACCTGAACCCTGTCCAACTCACAGCTGCCTC 

ACTGGCCTTTGCCACTGCCCTCTCTGATGGAGCAAGGGTCTGGCCTTCCAAGCAAATGTC 

AGCCTCAGATTCAAAAGACACATAGGAAGGAGCAACACCCAACACACTGAAAACTCCTGC 

CCGATCCACAAAGACCCCAGTGCAAGCGATGGGCAGCATCAGATTGATGGGAAACCACCA 

CCCTTrrCATTTTAGAATTTTT 

Sequence 3929 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI rTTTTTTTTTTTTTI I I 
nTTTTTTI i GCACGCCCCAAAGGGGCAAAAAGAGACTTTAATTAGGGGAGGGAGGATCC 
ACCANAATCAGAAAAGGGACAGCTAGCGTGGGAGCAGAGGAGCCAGAACAGGCAGGAGGA 
GGGCCCGGCCAGGMGCTNTNGANGGACTTAACCTTTGCCCACCTTTGGCACAAGGCACT 
TGACACTGACGGACAAGGCGAAACAGCGGCCCCTNTCAACTGGGAGGGCACCCAATGGCC 
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CCCCCGCGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 
nnnnnnnnnnnnnnnnnnnnnnnnnnnCGACCTCGAGGGGGGGGCCCCGGTACCCAGCTT 
TTTGTTCCCTTTAGT 
Sequence 3930 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTTGGGGATCGCAAAAACCCAG 

TCTGTCGAGAAATGTCCATGGCGCTTTTATCGAACCTTGCCCAAGGGGACGCACTAGCAG 

CAAGGGCCATAGCTGTGCAGAAAGGAAGCATTGGAAACTTGATAAGCTTCCTAGAGGATG 

GGGTCACGATGGCCCAAGTACCCTGCCCGGGCGGCCGCTCCTAGAAACTAGTGGGATCCC 

CCGGGCTGCAGGN 

Sequence 3931 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGCCCGGGCAGGTACGCGGGGGCA 

ACCTGCTTGGGTCCCTTTCCACACCGCGGAAGCTTTGTTCTTTTGCTCTTTGCMTAAAT 

CTTGCTGCTGCTCACTCTTTGGGTCCACACTGCCTTTATGAGCTGTAACACCGCGAAGGT 

CCACAGCTTCACTCCTGNGCCATCGAGAACCACGANCCCCCACCANAAAGGAAGAATCTG 

TGMCACATCCGAACATCANMGGAACAMCTCTAGACATGCCGCCTTTAAGAACTGCAA 

CACTCACTGCNAGTGTTCGCGGCTTCATTCTNGAAGTCAGTGGGGACGAATAACCCACCA 

ATTCCAGACACATTT 

Sequence 3932 

CCCTTTTGGAAAACCCGAATAGGAGTTCCCCGNGGTGGCGGCCGAGACGGGTGGATCACN 

TGAGGTCAGCAGCTCTAGACTAGCCTANTTTNCATGGCAAAATCCCATCTCTACTAAAAA 

TACAGCAATTAGCCGAGCATGGNGTCACNGTGCCTGTAANCCTAGCTACTGGGGAGGCTG 

ANGCAGGANAATCGCTTGAACCCCGGGAANGNGGGAAGGATTGCANNTGAGCCNAGGACA 

ATTGCCACTTGCACTCCAGCCTGGGAANACANAGCTANGACTCCATTCTCAAAGAAAAAA 

AAGAAGAAAGAAGAAAGCTAACTCAGCTACCTTCGGCCGCTCTAnnnnnnnnnnnnnnnn 

nnnnnnnnnnAGGAATTCNNATATCAAAGCTTATTCGATAACCCGTCAGAACCTCCGAGG 

GGGGGGCC 

Sequence 3933 

CCGGGCAGGTACGCGGGATAGGAAAATMCCAAGCACAAAGTTATATTCCATCCTTTTTA 
CTGAl I I I I I I ICTTCTATCTGTTCCATCTGCTGTATTCATTTCTCCAATCTCATGTCCA 
TTTTGGTGTGGGAGTCGGGGTAGGGGGTACTCTTGTCAAAAGGCCCCNATTGGGTGGCCA 
TTGTTGTTGTTTTGGCTTMGCCTTCMCTTTGGTCCCCATTGGAAAAAAATTAI I I I I I 
TMTTG AATTATTTTAAAAAAG G AAAAAAATTC N I 1 1 I I I I GGAAAAAATTGGGGGCTTG 
GNTTTTTTTTT1 I AAAAGGGNAAAGGAAGGAAN A 1 1 1 II 1 AATNGNTGGGGCCTTTTCCT 

tcc ai rrmrm ia aaaaattccccccccttccaantaaaggggttggttgggmcctt 

TAAGGGTTCCNTNCC I II I II I AATTCCAAGGNCCAACCAACCCAACCNTTTTTAAAANA 
AAAAAAAAI" 11 I I I I I I I I I AAAATTTMTTTTTGGNTGCCTTAATTTTTAAAAAAAAAA 
AAANTTAAGGGGGGMCCCAAAAAAMCCCTTTTTGGGGGNAAAGNAMGGNTTMTTTG 
GGGGGAACCCAANAACCC I I 1 I I I I GGGAATTNAATTTTTGGGCC 
Sequence 3934 

AGGTACI I I I I) I □ I I I I 1 1 II I I I I I I I I I I I CGGGCAGCTAAAATCTAGGATGGNGTT 

AAGTTTCTTCATmGNCMTTATATATAAAAAATTAGAAACMCATGMTC 

TTG G ATG AN ATAGTTAATAACAAACTATGTTCTC AATATTTG NTATTACCCTA/ AAAAAC 

CTNTGNTTNAAAGGGGTTGNTTNAATTGGTTGNTTTCTCCAANCAAATTCCHTTAATCCT 

CCTTNAATTNTNTNGAANACCAATTGGGGGGG I 1 1 I I I CTTTGGAAN ANAGGGGGAAGGC 

CCCCNTTNCTTACTTAAANTTACAACTTANTTANAANANNTGGNGC. P NAATNGTTCACGG 

CTNAAAANATTGGNANGGGGGCCCMCCCTTGGTTTNTTTTTGGGGCCCTTNGGGGGGAA 

NCCTTTGGTCCTTANNTTAAAAAAAAAAAAAAAANATTTTAACCCCCAATTTANGG 

CCCCCANNTTTTTGGGAACCTTTAAMNAACCCCCCACCNANAAAANCCAATTTT^^ 

CCCI I I I rCCTTTNCAACCNMGNTTTTTNCTTTGGGGNAAAAMNGGTTTTTGGNGGAA 
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AAAAAAGNCCCCCCANMNATMTTTGGTMGGGGGTMTTGGCCCCCAMGTCTTTAAT 

TTGGGGGCCTT 

Sequence 3935 

GGCGGCCGAGGTACCCTGNACCTTCTCAGTTCGCTGGCACATAAGGCTCCGCCAATAANT 

NNTGGTTCTCTCTGTCACACACACACACACACACACACACACACACACACACACACACAC 

AATGATTGGAGGGCTATATGATCCAGCATTAGCTTCCTGGTGNGCCCAAGCCATGCNTTG 

ATCGGGGAA1 I I 11 I 1 1 1 I ATTATTATTATTTTTTAGCTG TAGCT GAAGGCATTTN TCGN 

ATGTGGAGAGGAGAATGGAAATCGCAAGMCCAMTCATTTTTGCCCTGCCATATTTTGG 

CTGTNGGTCCTGTCATTGGGCMTTTCTGGATGNTGCCTTTTCTTGGATTCAAGGGAAAG 

MGCTGMTTTGTCCTCCCGAGGGGTTGAAAAAAAAAAAMNAACAGGTGTTTCAAGGAA 

AACNCTTGAAATTCCCAGGGGGCC 

Sequence 3936 

TATACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGACGGG 

GAGGAAGAGGGTTGGTTAMGCATCCCTCTAAGCAGTTTTCTGCTGTCCCTTCTTTCCAA 

TCAGAGATTTGTGGATGTGAGGGATCACACCACCCCCAGCTATGGTAGCCTTGATAAGAG 

AATCCAACTCTTCATCACCACGGATTGCNAGCTGCAAAGTGACGCCGGAGTGATACGCTT 

TACTTTGAGACCCGCGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTGGCCGTAATCATGGGTCATAA 

GCCTGTTTCCTGTGTGGAAATTGTTATCC 

Sequence 3937 

GCTCCACCGCGGTGGCGGCCGAGGTACTTTN II I! I I I I 1 1 1 1 1 1 I 1 1 1 1 1 1 NTAAATAT 

TGAGACGGANCNTTGCGCTGTCACCGAGGCTGGAGTGCACTGGCACTGNNTNGGCTCACT 

GNAACCTTTGCCTCCCGGGCTCAAGCGATTNTCCTGC CTCAA CCTCCCAAGTAGCTGGGA 

TTACAGGCATGTGCCACCATGCCCAGCCMTTTTTGCATTTTTAGTAGAGGNGAGGTTTC 

AGCATGGTTGGCCAAGGCCJTGGTCCCGGAACCCCCCGGACCTTTGTCATCCTCCCACCC 

TTTGGGCCTTCCCAAAAGNTGGCTGGGGGATTAACAAGGGCGCCGGAGCCCAACNCACGG 

GGNCCGGGGC 
Sequence 3938 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTAGGATGAAGCCTTAAG 

CAGTGGAMmCAGTTATmCCACAGTATTCCATTTTGGAGGAmGGGGTGTTTACT 

TTTTAMTTCTTGMCAACTTAACCTCCATGAGGCTTTGTGAAGTCAGCTGTGACCACCC 

TCCTCTTACTGTGTTCTCAGTATTCATTCACTTCCAGGGAAGAATGACAGCCACAGGGAG 

ATGGTGGTGGGCAAGAATGAGAGTCCCAGGATCCAGATTTAGCCTCAGATCTTCCCCATT 

CAGGAAGGGTTTTCCATTTAACAAGAGCACTAGTATGAAAACATTAGGGACAAATCTCCC 

ATGTCTTTGAAATTCGGATTCTCCTCTTGAGATCCCCTTCCTCACCTGCCMTCMCTTT 

ATMGGCCACMGTGGTCACTGGTTTTCCTTCCACAGGTTTTGAGGNTCTTCAGCTTTCC 

TTA 

Sequence 3939 

NCGCGGTGGCGGCNCGCCCGGGCAGGTACf f U 1 I I I I 1 1 1 1 H II I I H I I M 1 1 I 1 1 I 

TANCAGTCGGNTTCGCAAGCNNTGGTTTCCTGGATCTCACANMCNGTTGGGNCTTTTNA 

ACCTGATAGCACNTTNCAATNGCGTTTAGTTATCC AGTGCTG NGCNNNNGNANAAATACC 

ATTCNTCNATTCATTNATTTANCTCNTATCMN^ 

GGNGAAANNTTATTTTTTGTGNATTC^ 

NTTNCCCCCTTCGGCCCNCNACMNACAGTATTTTTMTTANCAAGCNNCTCNTTGTTNC 

NNAAANMNMNNCANACTCCCTTATTNCCTNTTGTGGNTATNATNTTrCTAGTTANCAN 

NCATTGNNNGCCGTNTTCTNANATTNACTNTTCAGACTCTTGTTTCCNNGTTAACACGNG 

ATATAATACTTnNTNCAATTTTTT 

Sequence 3940 
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CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGATGGAAAATG Mill 

GGTGATCTGGCTGCTGCTTAAAGCCAGTTTTCCCTAAGAACTCCAAAGGCTAAAGTCTAC 

TAGGGGCAGAGTGTGAGGATAGATTTCTAATCAGAGAAAAGTGGCCTCCAGGAGCTTTCA 

TTTATGTCTTCTCCAGACCAGGGTTTCCCTGGTTATCTTCCTTTAATCCCCTTTCAACCA 

ACAGGTGAAGTTCTTCCAGCCCACAGAGGTAGTAATATCATCTTTTCTATCTCCTCCTCT 

CCTTTGGCCATGTAATGMGCAAAATATTATTTATTTAGCCCAGGCTTGAGAGCCACTGT 

TTGTGGGACAGTCTTCATCTAGATTCCATACCCTGGCCTAGGCCGAGGTAAGGGCTCTCT 

GGTTATTG CC AGGATGG GA 

Sequence 3941 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGA 

CCCGAGATCTAATCTCTGTGCTCTGTGTATTGGCGACGAGCAGGGTGAGAATAAGTGCGT 

GCCCAACAGCAACGAGGGATACTACGGCTACACTGGGGCTTTCCGGTGCCTGGCTGAGAA 

TGCTGGAGACGTTGCATTTGTGAAAAGMTGTTCACTTGTCTTTGCAGAAACACTGATGG 

AMTAACMTGAGGCATGGGCTAAGGATTTGAAGCTGGCAGACTTTGCGCTGCTGTGCCT 

CGATGGCAAACGGAAGCCTGTGACTGAQGCTAGAAGCTGCCATCTTGCCATGGCCCCGAA 

TCATGCCGTGGTGTCTCGGATGGGATMGGTGGGAACGCCTGGAAACAAGGGTGTTGCTC 

CACCCGGACAGGGCTAAATTTGGGGAGNAAAT 

Sequence 3942 

CCGCGGTGGCGGCCGCCCGGGCACGGTACACGTAAAAGCTTGGCTAAGCAGACGCATCAG 

GAGCAAACCATTTGTGTTCACCAATCTGTCTCGGCAAGTCCCAGGGCCACTATCTTGCTA 

GAAGCCACTGTGTGCTTTTGAAAGCATGAMTTAGTTTTATACTGAGCACATGGTTTCTC 

GTTTAAGCAGCTC I I I I 1 IATGGTAATTGGTCAATTTATGGTTCACTGCCAGGATTTCGG 

TGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

AGGGTTMTTGCCGCGCTTGGCGTAMTCATGGTCATTAGCTGTTTCCTGTGTGAAATTT 

G 

Sequence 3943 

CGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGGTAC"! rTTTTTTTTTI 

I I 1 I I I I I I I I I I I ICAGGGGGAGCATGGGACCTTTATTCGTTAANACATCAGGCTCCAN 

ATATGAACTTTCAGCAAAAGCGCTTGCCGGGAGCAAAGGGACANAAAAGCTGANATGAAC 

AGTGCCTGGCAGCAATCACAGCCGGGCAAGGGGTGCTNCNAAGCCTCGCATCCCCnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnTAGNCTGTTTCCTGNGNGAAAATTNGTNATTCCGCTCA 

CAATTTCCACACAACATACCGAGCCCGGGG 

Sequence 3944 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTCTTTCTC I I I 1 1'CTTGGTTGCTGTGAAG 

GAAGTCAGCTGCCATGCTGTGAGCTTCCATTGAAGACAGAAGTCTTCCTTCTAAATGCCA 

CGTGAGTGAGCATGGAAGTCAAGCCATGCAATGACTGTCACTCTGGCTAGCACCTTGATT 

GCAATCTTTTTGGGNAGGCCTTGGTCCAAAGTCACCCTGCAAAACCAATGCCCAGAATTA 

CTGACATATTTATTTCMGCTAATAMTTATAGGCGTAATTTGTAATATGCAGAAATAGA 

TMCTATTACGTAGATCAAATGAAAAGATTTTCCAATGTGTAGAAACTACATAMTTCTT 

TTCTTACAGGACAATCTTATTCTAATCMTAACATTTGCTTTTAAGGCAGATAATCCTCC 

AAGGTT 

Sequence 3945 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAGA 
ACAGTTAAAGTGTCTTCTGGATGAATGCATACTTAAACAAAAATCCATCATTAMCTTTC 
TTCAGAAAGAAAAAAGGAAGACATTGAGGACGTAACACCTGTGTTCCCCCAGCTTTCCAG 
GTCCATCATCTCTAAATTGGCTAAATGAATCAGAAACAAAGGTCCAGAAAACTGAGATAG 
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AAGATGCAGATATGCTTGAGAGTGAAGAATGTGAAGCTTCTAAAGGCTACTATCTCACTA 

AAGCCTTGACTGGACATAACATGTCAGAAGCTCTTGTCACJGAAGCA GAGAA TATGAAAT 

GCCTTCAATTTTTCCMGNGACGTTATTATTTAGTGACACAAMGACTATTTTATGTCGA 

AGACTTCTTGCCATT 

Sequence 3946 

CTGATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGTTGAAGGCCTTCAGAGCAAAACTGAG 

GTTTCTCGGTCATGCGGTTATGTCTTCAACACCTGCGTCCCCCACCGGAGGCCAGCTCCC 

AGGGCGCGTTCTAGTGTGTGGCGTTCGCTTCATATTCCCAGGCGTCGCTTGCCCCAGCAC 

GCCTGTGTGCTCAGTMCMTGTGTTAAGCACAGGATTTTGGGAGAAGTGTATATGGACG 

TCTGAAGAAGGGTTGTGCTTGTGGAAGAGCCCCGCGTACCT 

Sequence 3947 

CCGGGCAGGTNCGCNGGGAGATCANTGATAGCACTAGATACTCATTGGAGCATGAACCCT 

GNNGTGAACTGCTCATCTGAGGGATCTANGTTGTGTGCTTCATATGAGAATCTAATGCCT 

GATGATATGTCACTG N CTCACTTTGCCCCCAGATGAGACCATCTAGTTGCAG AAAAATAA 

GCTCAGAGTTTCCACGGATTCTACATTATGATTTGATCTTCAACAAAGCTGATACTGGGG 

AAAGGACACACTCTTCAATAAATGGTGCTGGGAAAATTGGATAGCCACATCCAGAAAAAT 

GAAACTGGACCACTATCTCACTATATACAAGAATCAAGTCAAAGTAGATTAAAAACTTAA 

ACAGMGACCTGAMCTATAAAMTACTTGAAGAAAACCTAGGGAAAAATTTCCTGGCCT 

TTTGTCTAGGCAMGAATTTATGACTAACACCTCAAAAGCCACAAGCAACACAAATAAAA 

ATAGACAAATGGGACTTAATTAACTTAGAAAGCTTCTGAGCAGCAGAAGAAATAATCAGC 

AGMTGMGAGACACCTNCTGMTGGAAAAAMTATTTGCNAACTGTTCATCTAATAGTG 

GCTAATACCTAGAATTACCAAAAC 

Sequence 3948 

CGAGGTACGCGGGGCTTAGACCTGACGCTGGGAGGAGATGCTGCCACCTAGGTTACTTGT 

AGGACCCTATACGGCAACCTCCTTTGCCAGGAACTATTTATAAACATCCTGCAGGAAAAT 

GAGTCTATATGTCAGAATACACATiTCCCACCTTGCCCAACAGTAGAAAAACATAAGAAG 

AGAAAAACATTAAAAMTGACMGGAAGTTAATGGAAGTCAGCAATGTGATGGTGTTTGG 

AGGTGGAGCCTTCAGAAGGTAATTAATGCCCTTGTAAGAAGAGGCCAGAGAGCTTGCGCA 

CCTTCTTCCTGCCATGTGAGGAGCCAAGAAGCCGGCTGTCTGCAACCTGCAAGAGGACCC 

TCACTAGAAGCTAGCCATACTGGCATCCTCATCTTGGCTTTCCAACTTCCAGAACTGTGA 

GAAGTATATGTTTGTGGTTTAGTCAATGGTCTATGGTAAI Milt I ATAGCAGTCCCAGC 

CAAGACAGTGCCTCATTTACTATATACCATTTATATTATTA 

Sequence 3949 

AGGTACTTTTTAA I rTTTTTCTTTTTm I CTTGCCCTTTCCATTAGTTGTATTTTTNAT 

TT ATTTTT ATTTTT A I 1111111 IAGAGATGGANTCTCACTATGTTGCTCAGGCTGGCCT 

TGAACTCCTGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCTAGTAGCTGGGACTTTAAG 

TGTACACCACTGNGCCTGCTTTGAATCCTTTACGMGAGAAAAAAAAAATTAAANAAANC 

CTTTANATTTATCCAATGTTTACTACTGGGATTGNTTAAAGNGAG GCCCC TCCAACACCA 

GGGGGTTAATTCCTGNGATTGNNAAAGGGGCTACTTCCMGGCATTTTCNTGCAGGCAGC 

CCCTTGGGAGGGCNCCTGAAAGCTGGTAAAGTCTGAAATTAGGGATGTGANCCTCGGGGT 

TACTGAGTAAGGTAAAATTGCNTCCACCATTGTTTGTGATACCTTAGGGAATTGCTTGGA 

CCTGGTGACAAGGGCTCCTGTTCAATAATGGTGTTGGGGANANAAAAAACCTGATTATAG 

ACCGAAAAAGTAGGANTTGAGGTGAGGTNAAAGAAGTNCTNGGGGTGAAAATGTCCCCTT 

TCCCCCT 

Sequence 3950 

TAGTCCTCACAAACACAGAGGGCTCCGCACTCCGCATGTAGGTCCACATTTTATCAAACA 
CTGCAATTTTAGATCTCCTGAAAAACTCTTTAGTGGAGCCAGAGTCTAATGTTCCATAAG 
CAATTTCTGTTTGCTTAGAAAGATCCTCAGCACTTTCGATGGGAGACACCATCCTCTCTA 
CAGTCAGGAAGGCAGCTAAGTTAGCCGTGTAGGAGGAGATTATGATCAGGGTAAAGAACC 
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ACCACACACCTCCAACAATGCGCCCAGAGAGGGATCTAATAACAAGTATGAAATGGATTT 
GTAAAGTGAAATGAATGACCTAATAAAAAAACAAGTTAGCAGTATTATGCAAATATTGAC 
TTATAGGGGAATATTTAATTTTATATTTTTCMTGTATCTACTAGTTAATGAMCTGGM 
ATCATTAAACATTTTG GGCAAGGAGTACCT 
Sequence 3951 

CCGGGCAGGTGCAAACAGCTGAGACAGATCTGCGANGATCACATCNGAGCACAGATNNAT 
CNATTCANAGAGGATNCATNGGATAGCGNTCTTTTTTTAAAGAAGATTGATNCGATGCTG 
GC AAAACCATTGCAN AC AAATGATCATGATCAGGAGC A I Mill I GTTGTCTGGATAGAA 
CTTACGTTCTTCAGAATTCAATGCTNCCCTCCATTCGGGGACATGGGACTGGAGTTATTN 
A 

Sequence 3952 

TTGGAGCTCCCCGCGGTGGCGGCCGCTGNCCCACNAGAGAACACGGCGAGGTTNGTNAGT 
GTGANATGAAANTTATCCATGCTGTCCNTNAAACTGAGTGTGCGGACACTGAGGAANGCT 
TNGANCAATTGAGTTTTNATCACACGACTTNTTTGTAATGCTTGTACCTCTG 
Sequence 3953 

GGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGACACATGGGTGGGA 

AATTGCAATAAAAAGACGGCCCACAGCAGGCTGCATTCCCATGGCTGGCCAGAGGAGGAA 

CGCTTTGTGTTCTCATCGGAGCTGCATGGGAAGTCTGCATACAGCAAAGTGACCTGCATG 

CCTCACCTTATGGAAAGGATGGTGGGCTCTGGCCTCCTGTGGCTGGCCTTGGTCTCCTGC 

ATTCTGACCCAGGCATCTGCAGTGCAGCGAGACCCATCCACTGTGGAGGACAAGTGTGAG 

AAGGCCTGCCGCCCCGAGGAGGAGTGCCTTGCCCTCAACAGCACCTGGGGCTGTTTCTGC 

AGACAGGACCTCAATAGTTCTGATGTCCACAGTTTGCAGCCTCAGCTAGACTGTGGGCCC 

AGGGAGATCAAGGTGAGGGTCGACAAATGTTTGCTGGGAGGCCTGGGTTTGGGGGAGGAG 

GT 

Sequence 3954 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATGTACANCTANTNNATCNCATGNGNTN 

CTGGAATTTGATNTAANNCTTATNNATACACGNGGACCTTGANGNGGGGCCCGGTACCTA 

CATCANGTGCCATNAATGGAGGGTNAATCAGTCGCTTGNCATANTNATGGTGNATACTGT 

GGGCANTGCCCTTGACTTCTCTGAGCCTCAGTTTCCTTATGTGAAAGTTGCTGGAACCAA 

AATGGAGTCACTTATGCCAAACTCTAATAAAATGGAGTCGGGGGGCCACATAGAAGCCCT 

CACACACACATGCCCGTAACAGGATTTACCACAAGACACGCCTGCATGTAGACCANACAC 

AGGGCGTATGGAAAGCACGTCCTCAAGACTGTAGTATTCCAGATGAGCTGCAGATGCTTA 

CCTACCA 

Sequence 3955 

ANGTNCNCTGGGNATTTAANTATAANTATACTGCCTGCCTATGNGACGACAATCCAAAAA 
CCTTCTACTGGGACTTTTACACCAACAGMCTGTGCAAATTGCAGCCGTNNTTGATGTTA 
TTCGGGAATTAGGCATCTGCCCTGATGATGCTGCTGTAATCCCCATCAAAAACAACCGGT 
TTTATACTATTGAAATCCTAAAGGTANAATAATGGAAAGCCCTGTCTGTTTGCCACACCC 
ANGTGATTTCCTCTAAAGAMCTTGGCTGGAATTTCTGCTGTGGGCTATAAAATAAACTT 
CTTAACATGCAGTTNNNNNNNNNNNAANNNNATTNN rTTTTTTTTTTTTI I NNNNNNNNC 
CNNGNGNGNNANANAANAAAAMTTAANNCNCCCCCNNNNGGNANAAAAAAAAANAATTT 
TTTTTTTNGNCTNNNGGGGGGGGGGGGGGGGGGGGNN l ) 1 i H H 1 1 1 1 1 1 
Sequence 3956 

AGGTACACACCCCCATGCTACACACACTCCTTCATCTCCTTCTCAGTGTTTTCATACCCC 

AGCCGCATTACACACACCAGGCACATCTTCTCTTTCACAGAGCTTATCAGGATGGTATCT 

CCTTTTGGCTTAAACCMCCACCTTGTGTTTATAAATAACCCCCCCACCCCAGTGCCTGA 

ATTCATCCACATCTTCTGCCTTCATAGGCAGCCTCCCCTTGGCTCCAGTTCCAGCAGGAC 

AGGCCTTCCTCTGAAAGGAAGCCTGGGGCAGTGAGGCCACACTAGTCAAGCGTTCGGCCT 

CCCTACACCAATCCCAGCTGGGCTCCTAGTTGCCTANCACCTTTGCACTCAAGAAAGTTA 
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TATTGNTTTTCTCTCTCTNGCTCATTTCTTGCCAGTATTCTGCCTTTTCATMCATTTAT 
GCCAACAAGATATCTGGTCACCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTA 
Sequence 3957 

NGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCTGTAATTGACAGCCTTGCCTTG 

TTCCTCATGGCATCATTCAAGGTCATCTTAAATGAGAGAGGAGGGAAA GAAAG AAAAAGA 

MTCATACGTTATGGTTTrCAMTGCATCCAGAGGAMGGCAGGGTTGTTTTTGAAAAGT 

TTACTTCTATCTGAAAGCTTCTAGCAAATGAAAGCAAACAGTGCAAAGCTGAACTACAAA 

TAATAAAGCACAATTATGGCAGGAATCTGTCCACATTCACCCAAGCTACTGTGCCATTAA 

AGAGGAAAATAGACAGTTGAGCTGCATGTTAGTCACCAACTGGGAACATGGAAGAACGTC 

CCAATACATACAGGATGATCCATTGGCCACCTTCCTTATTTAAAAGAGCTTAAGGACACA 

ACMGMNGCTNAACGTGCACTAAAMGTTTGTATTAGAGCCTCGTTCCTACCCCCTAAA 

TTTTAACACTTTCGATGCCATATACGTTGTMCCTTCCTTATMGTTGCAAAATTCTGTG 

MTTCTGCGAGGMGATGGGTTMTATTCTGTGCATTCTTTGCCCCTTCTTCGTTTCGCC 

TCGCCCTTGACTTGTACAGAACTCAATCAAAGCCCCACATTGGC^4AACCA 

Sequence 3958 

NATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTGAGTAGCGAAGGTATAG 

AGTTTTCATTGAMTMTTTGGAAGAAGAGCAATTGGAT AGAATG TCAAGACATAACAGA 

GTCMGGGAMGCATTATTTTGATTTTATATCTGAGTATGTTTTTAGCCAGAAGAGGAGA 

AGAMGAGGTTAAAMGAAAAMCAGAATTTATAGATGCACGATATTACTTCTGAACAAG 

AGAMGAAACTGGTAGGAAMTTTATTTTAAAAAGTGTTGAAGGGAAAGAATCAAGACCA 

CAGATCCAGATCCGGAGATTATTTTGCTAAAGAATAGCAATTGTGAGGCATGAAGTGGGA 

GGGGGGAAGAAGCTATGMCTTM77TTGAGGTTTCTGAGAAGGAAACTTGAGTGAATTC 

ACTTCAGATGCATTTGGMTGTTTGCACTCCAGAAGATGAGTTTGTGTGTGCTCTGGAGA 

GTATTGGAAGAAGGAGGTATTACTAGATTTGGCGGCTCCCACAGTGACTCATTACTCTTC 

TCTGTTACTTTCAGGATTCATAGAGAGATGTTTrGTTGATATTATTTATTTAAGTGAGAT 

AAATTTGAATATGAATCCATTGGC I I 11 11 IGT 

Sequence 3959 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGMGCTTTTTTGCCACAAAGAAG 

ATTCTTACCAAGAGTGGGCTTTGTGGAAACAGCTGGTACGCGGGGGCAGACGCGCGGGCG 
GGMGATGGCGGCTGGGTTCAAAACCGTGGAACCTCTGGAGTATTACAGGAGATTTCTGA 
AAGAGAACTGCCGTCCTGATGGAAGAGAACTTGGTGAATTCAGAACCACAACTGTCAACA 
TCGGTTCAATTAGTACCTGCCCG 

Sequence 3960 

cgcccgggcaggtacgcgggcaactattgcaatgaaaaatttcaacaaj rm lAATGAA 

AGGATTCTGA/.GGAGGMCMGMCmATCAAAMGMGGTTTAGGTGTTAATGAAGTG 

CATTATGTGGATAATCAGGACTGTATAGATTTAATTGAAGCCAAATTAGTGGGAATACTG 

GATATTTTGGATGMGAAAATCGCCTTCCCCACCAAGTGATCAACACTTTACATCTGCAG 

TTCACCAAAACJCACAAGGATCATTTTCGACTCACTATTCCCAGAAAATCTAAGCTGGCAG 

TTCATAGGMTATCAGAGACGACGAAGGCTTCATTATCAGGCATTTTGCGGGGGCAGTGT 

GCTATGAAACAACCCAGTTTGTGGAGAAAAATMTGATGCTTTACATATGTCTCTTGAAT 

CCTTAATAT'GTGAATCCAGAAATAAGTTTATACGGGMTTATTTGAATCATCCCCAAATT 

ACAAAAAAAAAAAAAA 

Sequence 3961 

ATTGGAGCTCCCCGCGGTGGCCGGCCCGCCCGGGCAGGTACGCGGGATAGCATTGACTTT 
TACACACATTCACACMGAAAAAM^ 

TTTCCAAGGGAAGAGGGCAAAAAGTGGCCTGGGCTCTGTG GTGTGCGT GTTCCGTGGCGG 
AGAGAAGAAAATGGGAAAGACATCTCACTGGTGCTTTTC TCTTT TGTTTTAGT GCCCCC C 
GCCCCCATCCCTATAATATCTGTAACTACTCCTAAAAAGGTrTTGATTCAGGCl mill 
TGGTTTCATTTTGTTTTTTTAN G AAAAAGAAN AAN AN N N AAAAAAAAAAAAANGGGCCCC 
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CCNNCCCNTTNAAAAANNGGGGGCCCCCCGGNNNNGGNNANTr 1 I I 1 \ NAAANTTTTTTT 

NNCCCCCCCCCCCC 

Sequence 3962 

NTGGATCCCCNG NG NTGC AGGAATTNCATATCAANCTTATN N ATACCN ACCACCTATANG 

GGGGGCCNNGTNCCCANCTTNNGNTCCNTNTAGNGANGGTTAATTNCGCNCTGGGCGTAA 

TNATGCGNCATANCATGCTCACTGTGTGACTGCAGCTCAATCTCAAGACCCTGAAGGGTG 

CGCCGCAGGTCAGTAACCTCGGACCTGCTCATCTGGAGCTGCTCCGTGTGGCCAGCGACC 

TCCCGGTTCAATTCTTCAGTCCGGCTGGTGAACCAGGCTTCAGCATCCTTCCGGTTCTGC 

TCGGCCATGACCTCATATTGGCTTCGCATGTCACTCAGGATCTTGGCGAGATCGGTGCCC 

GGAGCGGAATCCACCTTCACCCGCGTACCTGCCCG 

Sequence 3963 

NGCTGNCTGCTGGGGCCAAGATATTATTTTTMCTAGCTGCTGNGCTTTTATCACTTTGA 
ACCTAGAATTTAAATCCCTCTGCCCAGATTNATTTTTTAACAACCAACATTGGNAGGCAG 
GGGAACAAAGACTATTCCCCAGAGAAGATCACTTGGTTAAAGGTTAAAAAAAAAAAAAAA 

0 

Sequence 3964 

TAAAACTTGCAGCAATGTGAATTGGGCCAGAAGATCAGAGTGTAATATGTGTAATACTCC 

AMGTATGCTAAATTAGAAGAMGAACAGGATATGGTGGTGGTTTTAATGAAAAAAAAAA 

AAAAAAAAAAAAAAANCCCCCCNNCCCCNTTAAAAAANTNGNNCCCCCCNNNNCCNNNNN 

ANTTNNNNNNNNCNTTTTTNNNCCCCCCCCCCCCNGGGGGG 

Sequence 3965 

AGGTACACATCAAGTCAGAATGGGCTAGCCCATCAGGGAAGCAGCGGTAGAAGAAATCTG 

GGCGTGGCTTTTTAGGGCAAACAGCAAATGACACTGCAGCTTCAGGAGGTGCTCAAGAGG 

ACACTGACCGACTGCTGTGGCCGTTGGTGGGAAATGCTGTTCAGGCAGAAAAGCGCCCAG 

GTGTCACCTGGAGGACTGAGGTATGGGGAAGGGAGTAAGCGTTGAATGACAAAAGAAAAA 

TCAGAGTGAAGATAAAGGAGAAAAGAATGAGAACCCGCGTACCTGCCCG 

Sequence 3966 

CGCCCGGGCAGGTACAGTGGCACAATCATAGCTTAGTGCAGCCTCATTCTCTTGGGCACA 

AGCGCTCCTCAGCCTCCCTCCTCAGCCTCCCAGTTGCTGGAACTTACAGGTGCACCACCA 

TAACCAGCTAGTTTTAl I I 1 I I GTAGAGACAGAAGTCTCAC I ! I I I I i CCTAGACTTGTC 

TGAAATAGACCATTCTATTTGCTAGAATGGTCTCAAACCATCCTTCTGCCTCAGCCTCCC 

AMTTGCTGGAATTTCAGGCATAATTCATCATGCCTAGCCATTTCi 1111 1 CTGCTGCAT 

TGTAGAGTTAGAACCAATTTCATTTGTCCTGCACTGAGAATGTGCATTCCCAAGCAGATC 

ATGGTTGAGCTCAACACACCTGCAACTTAAGCAGCCTCTTTGAAGGGTGCACAGTTACTA 

ACTGTCAAGCGTTGCACMGCACCTTTAGCANGTATGTTCAGMCTTTACATTTCCACCC 

I I I 1 I IAAACAGCTTACMGTMCCAGAAGAACGGGGANTTAAACTTTTGGACATGCACA 

CTCATTGAAAACCCAATGGATTTTGGGAMGCCCAAATTGGTGGAAAAAAAAAAAGGGGG 

GGGATTCTGACTACTAAGTA I Nit I ACAAAATCTTGAATGGGCNGTTNGCANGGGTTTT 

GGTTTAATTTTGGTGG 

Sequence 3967 

NCNGGNANGTACGCTNCCCANGNNNAGCGTGNNNGCANTNTAAGCCNNNTACTTACATCC 

CACC^4AGNGGNNNGNCCCCNTG^iNN^ICTTCTNNGATAATTGNGGNNAGNTTNNTTNATAT 

CTACCAGNAANCNTNNNCNTNGCNTNCTCTGNATCCACTGCTTAAATACGGACGAGGACA 

GGGCCCTGTCTCCTCAGCTTCAGGCACCACCACTGACCTGGGACAGTGAATCNACAATGC 

CGTCTTCTGTCTCGTGGGGCATNCTCCTGCTGGCAGGCCTGTGCTGCCTGGTCCCTGTCT 

CCCTGGCTGAGGATCCCCAGGGAGATGCTGCCCANAAGACAGATACNTCCCACCATGATC 

AGGATCACCCAACCTTNAACAAGATCACCCCCCA 

Sequence 3968 

GCNNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGAAGTAGTCCTATTTCAC 
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CCCAACTACAACATTAATCGAGCGGCCGCCCGGGCAGGTACTTGGTAAAGATGGTAAAGA 

AAAACCAAAGGAAGAGCGGCCAAATAAAGATGAAAAAGCAAGAGAGACCANGGAAAGAGC 

GCCGAAGTCTGACAAAGAGAAAGAAAAATTCAAGAAGGAAGAAAAAGCTAAAGATGAGAA 

ATTTAAGACCACTGTCCCCAACGCAGAATCAAAATCAACTCANGAAAGGGAAAGAGAGAA 

GGAGCCATCCAGAGAAAGAGATATNGCAAAGGAAATGAAATCAAAGGAAAATGTTTAAGG 

AG G AGAAAAAACACCCN GTTTCTGGGTC 
Sequence 3969 

TAGTCCTCACAMCACAGAGGGCTCCGCACTCCGCATGTAGGTCCACATTTTATCAAACA 

CTGCMTTTTAGATCTCCTGAAAAACTCTTTAGTGGAGCCAGAGTCTAATGTTCCATAAG 

CMTTTCTGTTTGCTTAGAAAGATCCTCAGCACTTTCGATGGGAGACACCATCCTCTCTA 

CAGTCAGGAAGGCAGCTAAGTTAGCCGTGTAGGAGGAGATTATGATCAGGGTAAAGAACC 

ACCACACACCTCCAACAATGCGCCCAGAGAGGGATCTAATAACAAGTATGAAATGGATTT 

GTAMGTGAMTGAATGACCTMTAAAAAMCMGTTAGCAGTATTATGCAAATATTGAC 

TTATAGGGGMTATTTAATTTTATATTTTTCAATGTATCTACTAGTTAATGA 

ATCATTAAACATTTTGG 
Sequence 3970 

CGGGGCAGGTACATCATTTCCAGAGCAGGCACTGGCAGCGAGGTAGGGTTGGAGGAGAAG 

TAGCGCCGGGACTTCCGGATGGCAMCTTCTCTGTGGGTAGAGATTTCCCAGCAATCTTG 

AGCTTCAGGCCTGGCACAGCTCGAAATAATTCCACTTCGTCGTCCCCGAACGGCTTGTGG 

TCCTCCTTCCCAAACATGCTGAGGTAGGCGGCCTTCATGTAAATGTAGGTGGCCTTGGAC 

CANCAGTTCTCCTTGCTGAGCAGGTCGGCGTANAAGTAGGACATCTTCCACTGGCCCTTG 

TAGGTGAAGCAGCAGTTCCGGGCTGGAAGACAGAGACAGCTGGACAGAGCTCCTGAAAAC 

ATTTTAAAATACCCCCCCC 
Sequence 3971 

CCGGGCAGGTACGCGGGGAAGAAGAAGAAGAAAGGCCAGGAGCAGTGGCTCATGCCTGTA 

ATCCCAGCAGTTTGGGAGGCTGAGGCGGGTGGATCGCCTGAGGTCAGGAGTTCGAGACAA 

GTCTGACCAACATGGTGAAACCCTGTCTCTGCTAAAAATACACAAAAAAATTAGTCAGGC 

CTGGGAGCAGGCGCCTGTAGTCCCAACAACTTGGGAGGCTGAGACAGGAGAATAGCTTGA 

ACCTGGGAGACGGAGGTTGCAGTGAGCTTAGATTGTGCCACTGCACTCCAGCCTGGGCAA 

TAGAGTGAGGCTCTGTCTCAAAAAAAGAAAAAAGAAAGAAAATTAMGATAATTCTATGT 

TAACTCCTTAMTGTAATTCCTAGTTCTTCTATCACCTTTAGCCGACTCTTCTACCTTCA 

GAGTGGACACTGCTGTTATCAAGCANANAGTGCCGATCTTACTCAAGTACCT 

Sequence 3972 

ATGTACACCTAGCTCGATNGTCCACACTTTNAGTCNTGAANCNNNAACNCNNCCGATCCC 

NACCAACANTGGAAGGAGNGNGCCNANCACNCONGATCATNGNTANNNANCANANTGTNT 

ACNTNGCCCNCTTNAAGAACTGNTNNACTTACTS4GGAGTGTGCGCGGCTTCATTCTTGAA 

GTCAGTGAGACGAAGMCCCACCAACTCCAGACACATTTTGGCGACCTAGATGGGACAAT 

CGCCTATCACCGAGTGGTGAGACCATCGCCTATAGCTGAGCGGCGAGGCAATCACCTATC 

GCCAAGCAGCTCCAGAAGCAGATGAGCAGATGAGGAGACGCCTCTATTCCGTGTCGTTTA 

GAATGAGCAAACAGTTGTCTTCACCATCCCAGACAAGCAGATGAATGGTGAAACGATACA 

GCAGAGAAAGAGAGAAGAGGAACA 

Sequence 3973 _ 

TGTCCCCCCTGGTGGGGGATTGGGGATCTGTGTTTAGCCATTTATATCTACTTTAGCTGT 

TAAAGAGGTCCAAATGAAAATCAGGTGATrGTGGAACCATGGGGACTTGGGGGTGGGGCA 

GAGGTGGGAACATTTGTATCANTTGAGTCANCTTGGTGGCTCCCTGTGGAGCCAGGGCTG 

AGCCTTGTCACGCGCACTCNCCAATTAAGAGATGGACCANCCAGCAGTCAAGTGCATTCT 

CCAGTCCTTGCAAGAAGGATCAGCCCTTTCTGTGCCAGCCTCNATCGCCTTGTGCTTTGG 

TCTCTTTTTCTCCCCCCCGCCTGGATCCTGCCTCGCGCGGGCCGTCCTGTTGCTGAGACT 

CGGGGTACCT 
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Sequence 3974 

TTGGAGCTCCACCGCGGTGGCCCGCCCGGGCAGGTACCTCTGGCCACCCTCCCGAAGGAC 

CTTTGTMGGCCAGGCAAAMTGGCTTGCTAGGGGACCCAGTGATCTCCCAGGGCTTTTT 

CCMCTGCTTCCTCTTCCCTATATTTTGCTTGGCTCTCTAAATCGACTCAGCTCCAGGTA 

AGATCAAAATCTTCTCTTITMTCTMGCCTTCAGCTTCCCCAGTGGGGGTGTGTGTTCA 

GGGGCAGATAATCTCCCTTTCCCACTTCCACAGTTTGGGCACTCACAGTATTTGGGGTGT 

CTCCTGGGTC CTGC AGGAGCAATCTGCTTCCTTCAGGGGTCTATGGATCTTTTCAGGTTT 

GCTGCAGTCATTTTCTGCAGGATTTATTCCAGTCATTCTGGAGCAAAAATGCACGAAGCA 

AGCCTCCACACGTTGCTCTGTCCATCTGAATCGGAGCTGCAATCTTGTCTTACTTCTAGT 

TCGCCATAATCCACTGCCCTCGAAGGTTTCTT 

Sequence 3975 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGTTCAGTCCAGTAAGGTGGGACA 

ACCCTTAAGTGGGGGCTTCCTGGTTGTAGTAGATAAGAGACAAMGGTTGCATTCTTTTG 

AGTTCTTGATCAGCCTTCCACTGAACACACAATTTAGTCTGGCTCAGTGAGTCTGCATTT 

TTACAGAAGCGATAGGGGAAAGGAAGCAATCAGCTATGCAGGTGAGCCTCAGGGGGATGA 

CTGTGMTAGAATGGGAGGCAGCTTTGCCCTMGCAGTTCCCAGCTTGACTTTTTCCTTT 

AGCTTGGTGATTGTGGAGTCTCMGATTTGTTTTCTTTCCACAATCTGAAATTCAAAGTT 

CCCTGGGTATCCTGTATTTTCATTTGTACACTTTGGCGCCCTAGTCCCAGAGTCCTGGAG 

CGGCATAC 

Sequence 3976 

CGAGGTACTGGCTTTGGTGCCTTGTCTACCAGGCAGTTGAGTTGGTTACTGCTGGTGCCT 

TTGTGGGCAAMCTAGGTTGTAAMGCAGCCAAAAMTTTCCAGATGGGAGAGTTTCATA 

GCGGGCATATTTTGGCTTGTGAATTGACTGTTATGTCTCAAGGTAGTCTCTTGGTGGGAG 

AGAGTTCTGCTCTGTAGCGTCTAACTAAGAGCACATAGTTAGATGAACTTGCCCTGTTGG 

G AGTGTCTG GTG AAGGG GAGATAAAAG ATTATAATTGCATTTCTAAAGAGCTAAGTAGG A 

AGCANAGAAAAGGAGAGAGMGTGMGACAGAATMTTTTAAATAATGAAGAGAGAAATG 

AAGMGGAATAATTTTCCTGCTTTTCCATTG 

Sequence 3977 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTCGGCCCTCATTATTC 

ATGGATTCTGTATTTGCAAATTCGCTGACTTACTGACGTTTATTTGTAACCTTCGAGTCA 

ACACTCACGGTGCTTTCTCAGTCCTTTGCAGACGTGTGGAATGGAAAAAMTTTGAGTTA 

TATGACGTATATGTTCCCAGCTGAGGCTGAGCAAGGCTCACTTCTCCTTGCAGCCCTCAG 

ACTATAAACAAoTGTCCCTCTTGCTATCTACTTCGTGTTATGATTTTTGCATTTTCATAA 

TCCCTGTTGATGATTTTGCTGTTTAAAATGGCCCCTAAGCACGGTCCTGAAGTACCTGCC 

CGGGCGGCCGCTCTAGAACTA 

Sequence 3978 

ATGTAC^.CANNGCNCATTAATCTNTCNNGTGTGNCATGTATCAATCATNTATAAGATCCC 
AGGCATGGGCCNNGNTGANCTACTATNTCAGCTTACTTGACTTCTNAACAGGNACATGCC 
CCNTNCTNGC'TTANTTCANTTAACTNGATNCNGTANATGCATGACCATTTGATCGCACC 
ACTCCTTCAGGAATCCAGGACTTGTCCACACACCGTTCCATTCCCCGCGTACCTGCCCG 
Sequence 3970 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGTGGCAACCTGCTTGGGTCCCTT 

TCCACACCGCGGAAGCTTTGTTCTTTTGCTCTTTGCAATAAATCTTGCTGCTGCTCACTC 

TTTCG GTCCAC ACTGCCTTTATGAGCTG TAACACCGCG AAGGTCC ACAGCTTCACTCCTG 

AGCCAGCGAGACCACGAACCCCACCAGAAGGAAGAATCTGTGAACACATCCGAACATCAG 

AAGGAACAAACTCTAGACATGCCGCCTTTAAGAACTGCAACACTCACTGCGAGTGTCCGC 

GGCTTCATTCTTGAAGTCAGTGAGACGMGMCCCACCAATTCCAGACACATTTTGGCGA 

CCTAGATGGGACAATCGCCTATCACCGAGTGGTGAGACCATCGCCTATAGCTGAGCGGCG 

AGACAATCACCTATCGCCAAGCAACAAGCAGATGAATGGTGAAACGATACAGCNGAGAAA 
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GAGAGAAAGAGGAACATTTGAACGCCGAGAGGAGTTCAACCCAAGGTGGTCCGAGAAGGT 

TGGCCNCCCAACAGGMGAATATCTTCCGTGCCATNCCCGCTTTCAGCTNTTGCTTCCTT 

CCTTAAGAAGAACCACCTCCCCACTCAATN AAANCCTTG CATT CAACCTTCCANN N N NNA 

ANNNNNNNNNNNNNNNAAANTNCCCGCCGGGGGGGGCTTTTNAAAAAAAGNGGAACCCCC 

CGGGGGGGGGGMNTTNNTNNNANNTTTTTTNCCCCCCCCCCC 

Sequence 3980 

CCGCGGTGGCGGCCGNCCGGNCNGGTACACCACACAATGGTGACACAGNCTANTGTGTGG 

NCAGNGTGTGCCACNTTTTNTGCACTATACCCNNGTNTGATGGCTNGATGTNNACTNNTG 

AAANTGTNTCTCAGGGNTNTTCNTTATCNACAGAAACANAANNGAATTACNNNTNCTANT 

TCCTGTCTCANACTGNNNCTTTATGACTNTAAATANATAAANNATCAG ATNG NNAGCATN 

N ATATTTTN NCTTAGCN CCTATTGC N AAATGC ATTATCACTACTG ACNTTTTCTTTCTAT 

GCCCTGNTCCCAAANANAAACTTANCACTTACATGCTCNCCACTAGCAGAAAGGGATTNN 

CAGGGACCAGAATCTCAGAGAAAAGGAGGGTTCATTCAAACATACTGGTGCTGAATCCTT 

G CAG CTGAAMTTATCTTGC AAANTTATATGCTGGCCCTTC ATTTAAAATG 

Sequence 3981 

GCGGCCGAGGTACCATCGGACCCCTTTCGCTTGCTATTCTGTCCTGTCTTTGCTTAGAGT 
TCGGGGGCTAAATACTGGGCACCTGTCGGCCAGTTAAAAGTGACTAGCGCGGCTGCCGGA 
CTAMGACMGGGTGTCAGGCTTTCTGGGAMGGGCTTTCTAACAACTCCCGACTCTTCG 
GTGTTGAGGCCATTGGTTTGCCTAGMCCAGCTTTTGCTTTTCCTGTACCTGCCCG 
Sequence 3982 

CGGCCCGCCCGGGCAGGTACAAGCCTTGAACATCGTCCTGCTTCCCAGTGGGTTCAGACC 

TCACCTTTCAGGGAGCGACCTGGGCAAAGACAGAGAAGCTCCCAGAAGGAGAGATTGATC 

CGTGTCTGTTTGTAGGACGGAGAAACCGCTTGG GTAACTTGTTCAAG ATATGATGCATGT 

TGCTTTCTAAGAAAGCCCTGTATTTTGTGATTGCC I I I I I I I I I I I I I i AAGATGCTTTC 

ATTTTGCCAAAATAAMCAGATMTGTGGATGGTTTMGGGTTATAGTATTATAGTTTAA 

ATMTAACMCAAAATTCTTCCCAGGAACTCTGCTGGAAGGTAAATTAAMTACTTGTTT 

TTCCATTGGTAAAATATTGTTGCACTCTGTGAACCAAAAGACAGTCTAAGTTGGAGGACA 

TANAACGGAACTCAT 

Sequence 3983 

GACACAGACACACATGAGGAGTGGGTTTAGGAGTGGAAAGTTTAATA GATA AAGAAGAGA 

GTGAGAAAACTTCCTCATGTTGAGAAAGACAGTTGCTCAAGAGAGGGTTTTCTGGGTTTG 

GGGCMGATTTGATTGATTTCATACANAGGCTTGAGGAGGCGGTGATTGATATACATAGG 

GCCCGGGGATTGGTTTAACCAGGTATACTACTTACATAGCCGGTGAAAAGACTGGCCCTC 

CCACCCTMCCTTTTATTATGCAAATGCATCTTCTACCTGGCTGTCACCATGACATTTGC 

ACACGTGGCTTCACTTCACCAGTGCCATGTNACCCACACACCTGGTGACAAGGAA/^AGGG 

AGCAGGAATCACCATATTGAATGTGCCTGGCTTCCAAGTA 

Sequence 3984 

CGGTGGCGGCCCGCCCGGGCAGGTAC 1 1 1 1 M I M 1 1 1 1 1 I 1 1 I I i I I I I I I GAAAAAAT 
NAAATTTATTGCCTTGTGGGGGGAAGGGATGTGGTCACCTGNCTACTCCATTCACACCTA 

CCTGTGGCTTCTCCAGTGATGGACCANATACTCAAGGAG 

Sequence 3985 

CCGNGGTGGCGGCCGAGGTACGCGGGGGTTGTGACTAGGAAAGGCATGGGACGACATGTG 

TCCCAGGTGAGGTATGTGCCACAGGCAGTTACGGATGTGTTGGAGGAGGAAATTCCAGGA 

AGGGGCAAGATCTGGAACGAAGGCTGGAGGGAGGTTCCCCTNAAGGATGCTAAGGAATAC 

GCTGAATCCATAGGTGCCATCGTGGTTGAGACAAGTGCAAAAAATGCTATTAATATCGAA 

GAGCTTTTTCAAGGAATCAGCCGCCAGATCCCACCCTTGGACCCCCATGAAAATGGAAAC 

AATGGAACAATCAAAGTTGANAAGCCAACCATGCAAGCCAGCCGCCGGTGCTNNTGACCC 

AAGGGCCGTGGTCCACGGTACCTGCCCG 

Sequence 3986 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGTTTGGGCCCAGAGGCCTGACAGGAAGGGC 
AGTAGGAGAGGTGGTCGTAGATGATGGGCCAGAGGGCGCAGGGCCTAGCAGGCCATGGAA 
AGGACTCTAGGTTTTACGCTAGGTAAAATGAGGAGGCACTGCAGGGCTTTGAGCAGCAGA 
GTAACACAGTCTAACTTAC I I I I I 1 I I M I AATGGATCCTACCAGCTGCTAAGTTAAGAA 
TAATTTCCAGTGAGGGCTTTTGTAGTAAATTATAGATGATGCTNGTTTGTCAATCT 
Sequence 3987 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCATGTTGCCGT 

GTGAGCCCTGTGAAAGGGGAGCATATAAGGAGGTTATCCACATACTGTAGAAGTTATGCC 

CACCTCAAGAGATTGCTCAGTTAGA I I I I 1 GCTAGAGCTTGTCTGAATAAATGTGGGCAA 

TTTCTAAACCCCTGAGTTAGGACTGACCAGGATGAAGTTAATAGATAAAGATTTAGGTAG 

CTTTCCCAGGAGAAACAGGGCTATTAAAGGGAAAGATGAATTCANAGATCATGTGAATAT 

TAAGCAGACACCCATCTTGGAAAGTATATTTTTGC AC C AAAG AGGTGTGG GATATTTAGA 

CATTCATTACCCAGGGACTGATGGGAGAATGGTAATTGGTCCCTTAAATAATATAAAGGA 

GTGTGMTCTTTTCTTTTGGAGGGAAGTGGTGCCATTTGCCCCCATTACCTGGCAGGATT 

CCGGAGGAGAGTTGTTTAGAAMGATTAGCCAGAGTANGCAGCTCTTTAAACCCAAAGGG 

AAATTTATMTTTTTACTTGCCACCTCCAGMGTTGCCCT^ 

TGANGTCTGATTGGAAACCTGGCCAGACCGTANAGCTTCTTCACTTCAAGGNCATCGGGG 
GTGGGGATTCTGCCTAAGGACCCTTCAACCCTTAAGGCAGCCCCCGTTTCC 
Sequence 3988 

CCGCGGTGGCGGCCGAGGTACTNNNTTTNAGNTTNTTTNNI I I 1 1 I I I GGGAAAGGGTAG 

TTCTTTTCATTAGATATTGAGGTTCAATAGACAAAATTTATTTCAGAGGTA 

Sequence 3989 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATGCCACTTATGGCATGCTCCG 
ANAAGGTCCANGAGAGCTGAGCAGTCAAGATGTGTGACTTNACCGAAGACCAGACCGCAN 
AGTTCAAGGAGGCCTTNCAGNTGTTTGACCGAACAGGTGATGGCAAGATCCTGTACCT 
Sequence 3990 

TAGGCGAATGGACTCCCCGCGGTGGCGGCCGATGTACNCGGGGGGAGAGTAGTGGTTCTC 

GCAGCACCCAGCACGCAGCTGGAGATCTGAGAACGGACAGACTGCCTCCTNANGNGGGTC 

CCAGACCCTNGAGTANCCTAACTGGNAGGCACCCCCAAGTAGGGGCAGACTGACACCTCA 

CACGCCCGGGTACCTGCCCGGGCGGCCGCTCGACTCTAGAACTAGTGGATCnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnN 

TGTTTCCTGTGGTGAAATTGTTATTCNNCTCACAATTNCCACACAACAATACCGAAGC 

Sequence 3991 

CGATTGNACTCCACCGCGTGGCGGNCCNTGTACGCGGGGGCCCTTGCTCACTAGAAGGAG 

CCCCCTGACCCCTTCTTCCAAAAATACTTTTTGTCTTTGTCTTCGNGTTCTGCATTCATC 

CCCCTTCGTTCACTCNCATAACAACCAACAGCGACAACTGGCACCTAGGACAGGGACGTG 

AGTG AAGGTCTGCTGG AGCAGAGAAACTGACAC TTGACAAN GAAGAACCGAGAATGAGAA 

ACC CTGCG ACCGAGTCTGCTTAC NAGCTTAATATAAGTCTTG CAAAAATN AAGCTTGG AC , 

CACCCMCNGGCCAATTTNAGMGCTTGCANCMCCTGGCAATTACCTTTATTCAATTAA 

TGCAAAAACCAAAAAATNGGGGAAGMTGTTANCMGGCCCGAAAAGAATGAAAGGGTCC 

GTGACCCMCTTCAATANTCCCNCTTGGMGGGNNTTTTTGAAGNCAAANCAGAAAAATT 

TGGTTNTTAATGGAAMGCCAGGATGGTTGGAAAAAACTTAACCANCTTGNTGGTCTTGC 

CCCCCAGAAAGGGGNTTGCTTGAAGGGGCCANGTCATTGCCCCCAAAGCCGCAGGNGGTN 

N CCTTNGGGATTANTCTAATTAGTAAACCA"! TTTGAAGCCTTGTTNGTANCNTGGCCCCG 

GGCCGGGNCGNTTTTAGAAACTAATGNGGMTNCNCCNNGGGCTTGGCAAGGAATTTCGA 

TTTTCAAAGCTTTAATTNGATACCCGGNCNNAACCTTCAANGGG 

Sequence 3992 

CCGCGGTGGCGGCCCGAGGTACCTGAGCTCCCAGTCTAGGGATTACCACATGGAANAGTG 
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ACNGATCAT6AGTGGAAGGGANATGAGTTAATACAAATGGAAAACANGTTTCA7TGTATG 

AAAAAGGAAAGTGAATTAATGCTTTCGAAAGAGACATTAAANTAGATNTCTGGGCAATAT 

GAAACTGATAGTCAATCTCGTCAGATTCTGAANGCATCTGTAGATAAGCTTCCAATGAAA 

AACAAGGCAAGTGACAATAGGACAACTAGAAGTAAGCTATAATTATAGCAAAGAANGCAA 

GAGCTMCATAMCTTGCCTATTGTCATTGTTGMGTCATANAACTACTCCTTTCAACTA 

CTCTTGTCATCAAGAGGCTTCCTGATGCGATCATCTGTGGTGAGGATAGCTTTTCGAGCC 

CCATCCCACTTTCCAGGGCCCTTGTACCTGCCCG 

Sequence 3993 

AN ACTAAGCTCTTAC AC ACAGTTGC NG NTG AAAG AGGGAATTGCTTG ACATGGCCAC ANG 

ANNNTGCAGCTTCCTGCAGACATGACANTCAACGCAAACTCATGTCACTGNGGGCAGACA 

CATGTTTGCAMGAGACTNAGAGCCANACAAGCACACTCAATGTGCTTTGCCCAAATTTA 

CCCATTAGGTAAATCTTCCCTNCTCCCAAGAAGAAAGTGGAGAGAGCATGAGTCCTCACA 

TGGAMCTTGMGTCAGGGAMNGGGAGGCTCACCMTTATTTGTGCATGGGTTTAAGTT 

TTCCTTG AAATTAAAGNTCAN GTGTGTCTTTG NGTTTACCA 

Sequence 3994 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTN luiiiiiiimn 

TTTTTTTCCATAAC AAAGCTCAN GTGTTTATTAAGAATTAA^ 

TAC AGTTCATTTCACAGTAAACTAAACAAAAC 1 rTTTTTTTTTl 1 1 CTTTTCGGGTTTTT 

TTTTTTTTTTTNCA 

Sequence 3995 

CGCCCGGGCAGGTACTAACAGTCAGGAACAAATAGAACACATAGAGAGGACAGTGGGAGG 

TCCCCCTGGCTAAAGCAGAGGGTGTATTGAGAGAAGGAAA CCTTT GCCTCAATGGATCGT 

AGCAMCTGAGAMGGACCAATAACACCTTGCTTGAGAMGTTTTGAATGTGATTCCTTT 

GGTGACCACMGCANGGAGGTCCTTGAAAGTCCCTTGTNCTTTCAGATGCAAAACAAGGA 

GGCTTGTNCCCATTACAAGACTATCAAGGGAAAAAATGCACTGTTTATCAAAAAGATATG 

AATGTTCTTTGACAGGCAAAACCCTGCTCAAAATGAAATGGCANTGCCANGGAGATGATA 

TTCAGTGTGATGCAACTTGTGAAGTACCTTNGGCCCGCTTCTANAACTAAAGTGGGATCC 

CCCCGGGCTTGCMGGAAATTTCGATANTCAAAGCTTATTCGATACCGTCCAACCTTCGA 

AGGGGGGGC 

Sequence 3996 

CGAGGTACTTTA 1 rTTTTTTTTTTTTTTTTTTTTI I CC1 rTTTTTTTTTTTTTTTTTTn 
1 M 1 1 1 1 1 I 11 1 I I 1 1 U 1 1 1 U I 11 1 11 I 1 I M 1 I 1 1 i NTAAAGCCNNAAAAAANTTTC 
NTTTNGGGAMCTTTGMNNTTNAAAAAAAAAAAAAAAMNNNNCCNNGNGNAANTTNGN 

nnaaan i nmm i caaaaanaamgngcccnaaamtttantttnaaaantccnnttct 

NCAATAAAAANANGGNAAAAAMTCTNTAAAMNNAAATCCCCCGGACAAAAAAAAAAAN 
TGGGANGGGGNNCCCCCCTACNANAAATTCCCCCNCCCCTTNAAAAAAANNGGNNCCCCC 
NGGGGGGGGAAMNMCNCAAAAAATrTTTTTCCTTTANCNNAGGNTTATGGGGGNNNCC • 
CCCNNNNNNNN 1 1 1 11 H 1 1 NNTTTTNAATGNGGN NAAAA AAATCCCCCCCGCCNAAAAA 
NTCTAAAAAANANANTTTATCCCC NGAAAAAAAAMTTTTAACTTCC AAN N ATNTCCN N A 
AAAAAAAAAMCNCTCCCTGAAAAATNMNAGGNNCNNCCCCCGGGGCCCNTTTNANAAN 
NGCAAAAACCCTTTTTTTNNNGNTTTANTTAAAAAAANCCNNTAAAAAA 
CGNCTCCCCATTTTTIMTANAAAAACCCCCCNCCCCCAAAAMN^ 
ACCCCTTNTTNAMTTTNTAAAAAAAAAAAAANA 

TTAANNCANTAAAAAAAGNCTTTTN NCNTTTCCCCN AAANGGGCGG N ANCCCC AAAAAAA 
A 

Sequence 3997 

CTCCACCGCGGTGGCGGCCCGAGGTACCCAGCGTCTCTTGATTTCATCTTTATGTTGATG 

TGTATTGGGTGTTGCGCTTCCACTTATAAATGAGAACCTGCAGTGTTTGGTCTTCTGTTC 

TTGCCTCAGTTGCTTAGCAGAGTGGCCTCCAGTTCCACCCATGTTCCTACCCGAGGGCAT 
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GATTTTGTTTCTGTC 

ACCACATTTTTAAAAATCCAGTTTTCTGTTCGTGGGCATCTAGATCMGTTCCACATCTT 
TATrCCTGTCAGTAACACTCCCGTGGACGTGTGMGTGTCACGTGTCTTTTTGATAGACG 
CATGTGTTTTTC 
Sequence 3998 

CCGCGGTGGCGGCCGAGGTAC I 1 I I I I I 1 I I I M I I I I I i I I I GGGCACAGAAAACTGTT 

TTMTTAGGGTAACAGAAAGCAGCTGCAATACAGAGTGATGAATGTGAACTTTGAAAGTA 

AATTTGGTGGGTTTCAGGATTTCTGAGGAGGGMGTCAAACAGTTCTAAAGAGTTGGCAA 

GGCCMTTGAGAGGGTGGTMGCTATCCTAAGCCCTTTCTTCTACATTAACTCTGCTTTC 

CAGTTAGTCTTCAGGTGCTTTAGAATTTGAGATCCTTTTTrCTCATTGNGTGGCCATGGA 

C TAG AAGTATTAGC ACTACCTG GAAACTTGTTCAAAATGC AGAATTCTGGGCCCC AAAAC 

AGACCTACTAAATAAAAGTCTCCATTTTAAAGAGATTGCTAMTGATTCATMCTTGTTA 

AAGTTTGAGAGGCATTACCTTAGATTCTGNCTTCTTCTTACCTTTCAAACCCAGCTGAAA 

GTGGGGTTGTATTTCCCTTTACTTT 

Sequence 3999 

CTATA GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGAC 1 rTTTTTTTTTTTTTTn 
TTTTTGCTAAATTTTTAGTATCAGGATGATACAGGTTTTATAGAATGAGTTMGGAGGA 
TCCCTCTTTCTCATTTCAGGGCTTATTTGACTGGTAGGI I I 1 1 AAT ACTG CTTC AATTTC 
ATTACTTGCTTCTGTTCTATTCAATATTTCTG NTCCTTCCTTG NTC AACTTTG NTC AGTC 
TTCCTTGNTCAATTCCTTGNGTTTCCATTAATTTATTCATTTCCTCTAGTTTG 
ANATATGCTTATAGTAGCTTCTGAGGATCTTTTGTATTTCTGNGGGATCTATTGNGATGT 
CACCTTTATTAATTTCTMTTGGGCTTGGNTGGANCTTCTTTC I I I i 1 1 CTTTGNTAATC 
TAGCTAGGGGTCTATCAATCTTGNTTATCCTTTCAAAAAACCCACTTTTTATTTT 
Sequence 4000 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCCATTTGCCTTCCTGGTTAAAGATTGG 

TGCCTGGCCAGAAATGCCTGTGAAACAGCTGTGGTTTAAGCTTCAACTGTAAAAGACAGA 

CGGAATCCTGCTGCATCCANAGCTNGAAGTTCTGANGTTCCACTGAAATCACAATGGAAA 

GTCTTGACTTGACTGGTCACAGTAATGAAANGCAGTAATAGAAATAAGGATCATTCAGCA 

TAAGGAGAAGGGGTTG 

Sequence 4001 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTTGCTAAAACGAAAACAGGTAAAAAC 

AATGAACAGACGGTTCACGTTAAGAGGAATACGAAAAGCCAATAACAATACTGACATAAC 

AATCTTATTTGTAATGAAAGAAATACAAATTn*AAAAATGACATACCATTTTT^ 

ATTGGC AAAAATTAAAC C AAAAAGAATACCTAC AA'iTG ACCAAGG AGTATTCTGGCAGG G 

GGAGGGAAGTCACATACATATAGTACCTGCTTTTGCCGCTTCTGGTTTTTGCAGACATCC 

ACTACTCCCCAGCTGATTACACCAACTTGAATGAAACGACTTCTCTTGTGAACTATCAAG 

GGGCCGCCAGAATCACCTCTGCAAGTATTGGGGTCAGCATAGGGACTCACTCCTCCAGTA 

CCTGCCCG 

Sequence 4002 

CGAGGTACACATCAAACACTTCATTGCCTAAATGCAGGGACATGCTTCCATCTGACCACT 

TGACTATCCGAGCATTGCTTTCTTTAATTTCATTTCCTTCTTCATCTCGGCGTATCCTCC 

ATCTTATAGTATTTTCTACCTTTMTmMCCTGGTTCTACCTTCTTC 

CTTCATCTTCAAATTCATCTTCATAATACTGAGGATCAAAAGGTCTGGGCTCTACACTGA 

GAAAGTTGGGCAGTTTAACAAAATATAAGTCG rTTCCTAAATCAGTGTTTACTTTGGGTA 

TTTCTACTTCTATTCTGGTCTCAGGAA1TGGGCTCCTCTTCCTGTTGATCCTGAGGCAAT 

CCATTTTCATCAACAGGCTGTNCTGGAGTAGGTGGTTTGTCTTCTCCATCACTCCCTGAA 

GAGAAATCATCTGGCACCTCCAAATAAATCC^TGGTTCCACTATTATCTTT 

Sequence 4003 

TTTATTGGTTGGGAAAGGGAGAAAGCTGTGGTCAGCCCAAGAGGGAATACANAGTCCCTA 
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AAAAGGGGAGGGCAGGTGGCTGGAACCAGATGCAGGGCCAGGCANAAAACTTTNTNTCCT 
CACTGCTCAGCCTGGGTGG N GGCTGGAGCTCAAN AAATTGGGA 
Sequence 4004 

AGGTAC I (I I I I 1 1 1 I i I I T M M I I I I I I GTTTGAGACAGAGTTTCACTCATTGCCCAG 
GCTGGAGTGCAATGGCGCAACTGTGGCTCACTGCAACCTCCACCTACTGGGTTCAAGCAA 
TTCTC CT GCTAGC CTCTGGAGGAGCTGGCATTACAGGTGCCCGCCACCATGCCCAGCTAA 
TTTTTGNATTTTTAAGTAAAGACAGGGTTCCACT^^ 

TTGACCCCANGNGAANCAACCNGCTTGGGCTCCAAANGGNTGGGANTACAAGGGNGGANC 
CNCCCANNCCCNANNCNNCAAACANGTTAAANTCACNGGGANGCCNTGGGCCCCTTTTGA 
AAGCCNGCTTTNTN I MINI I GGGGGGG 
Sequence 4005 

CNGGGCAGGTACTTTA I i I I I I I I I i II I N II I II GAGACGGAGTCTCGCTCTGTCGCC 

CAGGCTAGAGTGCAGTGGCACAATCTTGGCTCACTGCAAGCTCTGCCTCGCGGGTTCACG 

CCATC CTCCTG CCTCAGCCTCCTGAGTAGCTGAGACTACAGGTGCCCGCCACCACGCCCG 

GCTAATTrTTTATATTTTTAGTAN AGACGGGGTTTCACCGTGTTAGC C AG G ATGGTCTCG 

ATCTCCTGACCTTGTGATCCGCCGCCTNGGCCTCCCAAAGTGCTGGGATTACAGGCGTGA 

GCCACCGCACCCGGCTGGAGGAGTATCTTANAAAGGCCTGGACTGGGTCCTCCCCAATNC 

CTCMGGGGAAAAGCAGAGACTAAATCAGCATCACCCCACTTACGTGGGTGANANATAGG 

GGCGGGTGCACCCCACCTNTTCACCTATAACTNCCGCGTACCTTGGCCGGTTTANAACTA 

AGTGGGATTCCCCCGGCTTGNAGGAATTCCAAATCAAGCCTTATCGATACCCGGCNAACC 

TCGAAGGGGGGGGGC 

Sequence 4006 

AGGTACTCTCTTTCTCTTTTTCTTGGTTGCTGTGAAGGAAGTCAGCTGCCATGCTGTGAG 

CTTCCATTGAAGACAGAAGTCTTCCTTCTAAATGCCACGTGAGTGAGCATGGAAGTCAAG 

CCATGCAATGCTGTCACTCTGGCTAGCACCTTGATTGCAATCTTCTGAGAGGCCCTGGTC 

CAAAGTCACCCTGCAAAACCAATGCCCAGAATTACTGACATATTTATTTCAAGCTAATAA 

ATTATAGGCGTAATTTGTAATATGCAGAAATAGATAACTATTACCGTAGATCAAATGAAA 

MGATTTCCCATGTGCAGAMCCTACATAAATTCTTTTNCTTACAGGACANTCTTATTCT 

AATCMTAACATTTGCTTTTMNGCAGATMTCCTCCAAGTTTTCTAATGATATCTGAAA 

CTATTMCTGATTCTGTGAATTATGAAATCTGAAAAGGAATTGGAAGTTGCTAAAAATCT 

ATCATTTGCATTGACCAGTGTGAAGCACAAGTGGGAATGAGAATGCGTGCCCTGACACCA 

AAGAAAAAATAAGTGACTGGAAAGCTGAANAATCACCGGCTTCAGTGACATGGAACCCAG 

TNGATTNGATTTTTTGGACNMGTATCGGGTGACTTTGANGGTGGCAAGAAAACCACACT 

TTAAGAACAATGTCCAAAAAGGGGG 

Sequence 4007 

CCGCGGTGGCGGCCGAGGTACTGGGAGTGATCACTAACACCATAGTAATGTCTAATATTC 

ACAGGCAGATCTGCTTGGGGAAGCTAGTTATGTGAAAGGCAAATAGAGTCATACAGTAGC 

TCAAMGGCMCCATMTTCTCTTTGGTGCAGGTCTTGGGAGCGTGATCTAGATTACACT 

GCACCATTCCCAAGTTAATCCCCTGAAAACTTACTCTCAACTGGAGCAAATGAACTTTGG 

TCCCAAATATCCATCTTTTCAGTAGCGTTAATTATGCTCTGTTTCCAACTGCATTTCCTT 

TCCAATTGAATTAAAGTGTGGCCTCG \ I J I I AGTCATTTAAAATTGTTTTCTAAGTAATT 

GCTGCCTCTATTATGGCACTTfcMTTTTGCACTGTCTTTTGAGATTCMGAAAAATTTCT 

ATTCI I I I I I 1 IGCATCCAATTGTGCCTGAACTTTTAAAATATGTAAATGCTGCCATGTT 

CCAA 

Sequence 4008 

CGCGGCCGCCCGGGC AGGAAC I HUM I I NTATNCANCTCTTTTTGGNAGAGANNTNGT 
CTCACTATGCTGCCCAGGCTGGNTTCAAACTCCTGGGCTCAAGTGATCCTCCCACCTTGA 
CCTTCCAAAGTGCTGGGATTACAAGTGTGAACCACTGCTCCCAGCCTGGGGAAGATTTCT 
TAAAGTGTGGTAAATGAAGTTAAATTGGGCCAAGTCCAGAGCCTCCACTCTNCCCTGCCC 
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CAGCTCACAGANATAGGCAGGAAGTCTCACACCTTCAGAGGGGACACTTCCTCCAGGGAG 

GCCTCCTGGTTGAATCTAGGCCACAAGGAACATTTCTATCTTCCCCATGCCTAGCACCAA 

CTGTTCANAGAGTCCANACCCTGTGGCCTAAGGAGGCCCAAAGAGTTGGGGGGATTCTGA 

CACACACTGNCTGGCAGGCGGCCTGCTG 

Sequence 4009 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGT 

GTTATTGAAAATTCAGCAAMCATAGGTGTGATTGCAGCATTTACAGTTGCAGTCCTTGC 

TGCGGGTATCTCCTTTCATTACTTCAGTGATTAGGGTGAGGCACAAAGAGTTTCTTGATC 

ATCCAGAGAACATTGACAGACAATTATGAATAATAAAGATGTTAACAATCCATCTGTATT 

TAAMCACTAGCAGCCAGATCTGCTGCCATGATGCCTATTTGGTGTGTTTCTGATTAAAA 

TGAMTCACAAGCTGCCTTGTTTAGCCTGCTTTACATTGTAGGTGGCCCGCATTTCCAGA 

AATAMCGTTATGCATCTAGATGGAMGCTGCATGTAACAAAATCATTATTATCTATTTT 

TAAAAGCTTCAAAAATGATGGGGATATGATCATAGATTTTAGTCTTACTAATCTGAATCA 

CATATTAATCAGGGACATTTAAAAACTTTAACCAGAGGCTTGAAGGGCTCACACCGTATA 

ATCCTAAATGCCTTTGGAGAGGCTTGAAGGTAGGANCCTTCACCCCGGGGCTTGGGGAAG 

GGAAGTTGGGAGACCCAGCCTGGGATGAACATAAATNGANAATCTTGCCTCTACTTNGGA 

AAAAAA 

Sequence 4010 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCCGCCCGGGCAGGTACGCGGGAGTCC 

AAAGGCATCCCGCCTTCTGTGCAGACTCACAAGTCCCTGTGGACGGAATTCTTGAAGTGT 

AGCGCCGCTCAGTCCTTCCACCGGAAGTGTCCGATCGGAATCAGCCCTGTCCGAGAGATG 

GCTACAGAACCAAAGAAAGCTGCAGCCCAGAACTCTCCAGAGGATGAAGGACTTCTGATA 

GTGAAGATAGAAGAGGAAGMTTTATCCATGGGCAGGACACTTGCTTACAGAGAAGTGAA 

CTCCTTAAGCAGGAGCTCTGCAGGCAGC I I I I I AGGCAGTTCTGCTACCAGGATTCTCCT 

GGACCTCGCGAGGCACTGAGCCGNCTCCGGGAGCTCTGCTGTCAGTGGCTGAAAGCCAGA 

GATCCATACCAAGGAACAGATTCTGGAACTGCTGGTGCTAGAGCAAGTCCTGACCATCCT 

GCNAGGAGATTTGCANGCCTGGGTACCCTCGGN 

Sequence 401 1 

CCGCGGTGGCGGCCCGAGGTACGCAGGGAGGAAAAGGTCCCGGAGAGCTGAGCAGTCAAG 

ATGTGTGACTTCACCGAAGACCAGACCGCAGAGTTCAAGGAGGCCTTCCAGCTGTTTGAC 

CGAACAGGTGATGGCAAGATCCTGTACATCTGGAACGGCCACGCCGTGATTGGGAAGCAG 

CCGAAACTCACGGATGGGATACTTGAGATTATCACTAAGGCTGAAGAGATGCTGGAGAAA 

GGCCCAGAGAACGAGTACCTGCCCG 

Sequence 4012 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTN rTTTTTTTTTTTTTTTTTTI I GGGGTA 

CCCANTCTGGAATCATTGGTGGGACAGAGTTAGGAGANMCCAAAAAMTTTATCTTC'rG 

GGATATTGTAGGATGGCTGAGATGGTGGTTAGGTNAAAGTTNTTTCCAGAATCCTGMCT 

GGMCCAAGTTTTATTTTCCCTCAAAAAAGG GTG AG AGACC ATATTACTATATTAG CTA \ 

GTTNTGANATTCATAGATGCTTATTTTAGGTCMCTAAAATCCTAATCACCATTTACMC 

ATCTCTCTATGAANAAACATCCTCCAAATTTGAAACAAATAACTTATAAACATTTTGTGA 

TCTMTCTGTTCATAMTTANTTATTGCTCTGCTCTGGCATAATAAGTTAATAAAAATTC 

AAAGCTGACAGCAA I I I I I GGCATAAGGGGTATCTATAAGTTTNTGCCCCTAAGTT T TTC 

TATTAAAAAAATNTTTGAGTTACTANCCCAACTAACCTGGGANGCTTAGGCTGCANTGAA 

CCCAAAATTGCACCCCTTGCCCTNCAACCTGGGGCAACANANCCAGGANCCTGTCTCAAA 

AAAAAAAAAAA 

Sequence 4013 

TGNAGCTCCCCGCGGTGGCGGACTACAGAGCCCCGAATTAATACCAATAGAAGGGCAATG 
CTTTTAGATTAAAATGAAGGTGACTTAAACAGCTTAAAGTT^ * 
GATTAAAATAATTTGMGGCGATCTTTTAAAMGAGATTAAACCGAAGGTGATTAAAAGA 
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CCTTGAAATCCATGACGCAGGGAGAATTGCGTCATTTAAAGCCTAGTTAACGCATTTACT 

AAACGCAGACGAAAATGGAAAGATTAATTGGGAGTGGTAGGATGAAACAATTTGGAGAAG 

ATAGAAGTTTGAAAGTGGAAAACTGGAAGACAGAAGTACGGNAAAAAAAAANCANAGTNA 

MNNNNAAGTACCTCGGCCGCTCTANAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnCTTATCGATNCCGNCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnGTTAATTGCGCGCTTTGGCCGTAATCATGGNCATAAGCTGNTTCCTG 

TGTGGAAATTGTNATTNCGNTCACAATTCCACCAAAACATACNNANCCGGGAGCCTAAAG 

GNTAAMGCCTG GGGN GCCCTAATGAGNNGANCTTAACTTACANTTAAATTGGGTTGGGC 

TNACTTGCCCGCTTTTCAAGTTCGGGAAAACCTNTTGNNGCCCANCTGCNNTTAATGNAA 

TCNGCCCCACNCCCCGGGGAGAGGGGGGNTTNCGTTTTGGGGGTTTT 

Sequence 4014 

CCGCGGTGGCGGCCCGAGGTAC (TTTTTTTTTTTTTTTTn I CTCTAAAGAAAAAACTTA 

ATGA CCTTTCAAMTGMTAGGCCCGATTTATGATAAATGATTAGTTAACMTTGAAGTT 

TTTTAGTTTCTTTCTTTCTTGGGAMCAAAAAATACTCACTTTTAT^ 

TATTTGGTAGTMTACAAATATTTCCTAGTGGCTCCCATTCTCTAGCAGCAAGTCAAACA 

GCCCTCMGAGAAMGTATTAAAATTGNGCTATTATGACATAGCAGATAATCTGGAATAA 

MC AMT CCCTNTAGGATCATTATATCCTAAMGTTCTTTGCCCCAGNCTTAGAACATGG 

CACTTTTGAGCATGCTAAAATCATGATAAGGACATATTTCTCTTCTCATTTCACACCAAT 

AAAGGAGGAAA 

Sequence 4015 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGATCTTTGCTCACCAC 

TACCTCCGCCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTAGGATT 

ACAGGTGTGCGCCACCATGCCCGGCTAGTTTn"GCATTTTTAGTAGAGACAGGGTTTCAC 

CAGGTTGGCCAGGCTGGTCCTGAACTCCTGACCTCAAGTGATCTGCCTTGCCTGGGCTTT 

CCAMGTGCTGGGATTACAAMTATATGTTCGCTATTGCTAGGCAAAGCTGTAAAANTTT 

GCAGAGCGTCCAAACATTANTTTNGGCTCAATAAAATATTTGCTGGAATCATATGTAAGG 

GAAC 

Sequence 4016 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTATTT 

TGAGGAATTACGAAAGGTGCTAGTTAAGGTGGATGAATATCATTCAGTGCATCAGAAGCT 

CAGTGCTGATATGGCTGATCATTCTAATTTGATCCGAAGTTTGCTGGTCGGAGCTGAGGA 

TGCTCGTCTGATGAGGGACATGAAAACAATGAAGAGTCGTTTNTATGGAACTCTATGACC 

TTAATTAN GAGACTTTGCTAAAATGGGATATAAAAATTCGCTGTAACAATCAC ACAG N AG 

CTTGTTGGGAAMCCTCCAAAAGCANTTTAAATTCMGCAAATTTTCAAAMGNMG 

CGGGTCCGTNTTGCCNGGGGTTTGGGAAAAAAACCCC 

Sequence 4017 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGAGGACCATCCT 
ACCAGTGGGGCCACGTCTCMCGGTTTATCCTTCACTCATTTTTCTGCACGTCATCCTGA 
GGTAAACCAATTTTAAATGTGTTTGTTTTCAGTAMCCAGCTATGACMTTTATACTAM 
CAMTTGMCTAGMCCCATTTGAATAGGTTTTGA 

TAATACAAAAGGCCCTGACTTGGTGCTCAATCCGTCAAAGCCTGCAATGCATTGAAAAAA 

AACTTGGCCCAGGCNCCACCCCGCNGTNACCCTTGCCCCCGGGNCCGGGNNCGCTTCTTA 

ANNAAACTAAG GTGGG GGATTCCCCNCCTCGGGNGTCNTGCNAAGGGGAAATTTCCGGAA 

TTATTCAMGGCTTTTAATTCNGGATTNACCCCGNTCCCGNATNCCTTCNAANGGGGGGG 
G 

Sequence 4018 

CGGGAAGCGTGGCGCTTTCTTCATAGCTTCACCGCTGTAGGTATCTCAGTTCGGGTGTAG 
GGTCGTTTC N GCTCCAAGCTGGGGCTTGTGGTGGC ACTAAAAN 
Sequence 4019 
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CTACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA O I I 1 1 1 1 1 
n rTTTTTTTTTTTTTTI I GGTTTGTTTTGTTTTTGTTTTTGTTTTGCANCCCNAGGAGT 
TTTAGCCAATTCAAATGCCTTGCTNCCCACMTTTGGAACATTCCTTTGGATTTGACCM 
GTCAGGAAGAGATGGGAGAAAAGTGAANCAACAAATNAAACCCCCAAACATAAAACAAAA 
AANGAGTTTAAGCAAAAACCAAACCAAATTGCACAMTTTCATTATTGATTAM 
AGTNGTTTCTAAATTGGGTAMGGGGAGGGAAATTTTAAAAMNGCCANGCCTTGGGTTG 
GGGGTTAAAATTCCTTTNAAAANTTTITrTANGTNNCNTTTAAAMCAAAAM 
Sequence 4020 

CCGCGGTGGCGGCCGAGGTACACTTTCACTGAGTCCAAAATTAGGAGCAAAAATACAAAG 

GTTCACATTGGTCTTAGGTAGGATACAGGCGAAAAAGGATTGAGCTGTTTAGCTCAGAAA 

CAACAAAAACAAAGTTTCTAAAGCACCCTAAATTATATAAAAATCAAGACCATGGACGGA 

TTGAAACTGGTATTCTGGTGAMTACTTTACTATTTTGTAAAAAGTTTTACAA 

CAMTTAAMGTATCAACCCTCTGAGTTCAAAGCAGCATTTTGAAMTGAACTGGTAGTT 

AATCCTGTACCACCCTTAGAGTTGTAAGTGGGACCTTGGGAATAGGGGATGGATGGGTTC 

CCCCTGGCTGAAATTCTTNGAAACCNTTCTAACTTCCTGGCCTGTCCCCTTCCCCGTAAG 

CACTTGTCTTTGGNMTAACTGGTCCTTTGTAAGAGTGGTGACCGGTAAAAATGA 

Sequence 4021 

GGCCGCCCGG N CAGGTACTTCCTGTAATTGACAGCCTTGCCTTGTTCCTCATGGCATCAT 
TCAAGGTCATCTTAAATGAGAGAGGAGGGAAAGAAAGAAAAAGAAATCATACGTTATGGT 
TTTCAAATGCATCCAGAGGAMGGCAGGGTTGTTTTTGAAMGTTTACTTCTATCTGAAA 
GCTTCTAG CAAAT 
Sequence 4022 

CGAGGTACCTACGAGGGTGAGAACACAGTGCTCTACCTGCAGGTGGCCAGGTTCCTGGTG 

AAGAGCTACCTGCAGACTCAGATGTCCCCTGGCTCCACGCCACAGAGATCTCTCTCTCCA 

TCTGTCGCATATCTCACCGCACCTGACCTGGCCAGGTGTCCAGCCCAGAGGGCAGCCGAC 

TTCCTCTGCCCGGAGCTCTACACCACGGCCTGGGCACATGTGGCAGTAAGGCTCATAAAG 

GACTCAGTGCAGCATTTACAGACCCTGACGCAATCCGGAGCTGACCAGCACGAGGCTTGG 

AACCAGGCCACTGTCATACACCTCCAGGCTGCTAAGGTGCACTGCTACTATGTCACTGTG 

AAGGGTTTTACAGAAGCTCTGGAGAAACTAGAAAATGAACCAGCGATTCAACAGGTGCTC 

AAGCGCCTCTGTGACCTCCATGCCATACATGGAA 

Sequence 4023 

CGAGGTACGCGGGGATCCAGAATACATTTCCAACAAGAGCACTGGCCAAGTCAGCTTCTT 

CTGAGAGAGTCTCTAGAAGACATGATGCTACACTCAGCTTTGGGTCTCTGCCTCTTACTC 

GTCACAGTTTCTTCAACCTTGCCATTGCMTAAAAAAGGAAAAGAGGCCTCCTCAGACAC 

TCTCAAGAGGATGGGGAGATGACATCACTTGGGTACCTGCCCG 

Sequence 4024 

CCGCGGTGGCGGCCGAGGTACTTTN M I H l I I I I I I M I II I II I I NTTAANTANTTTT 

i i MiiiuiiMimiumiiniiiiiiiiim i ii N iniiiMiiinnu 

rTTTTTTTTTTTTTTTTi INGGGGGGGGNNNANGNNNCTTTTTNTNGNAAAACNCCNNNN 

CNAAAAAANNAAATTTAAANAAAAANCCTNNNNGGGGNANNANGNACNAAAAANTTGNNA 

NAAAAACNNCCGGGNNAAAAAACNCNNNGGGNAAGGGGGNNCCNANCNNTTTNCTCCCCG 

GGNTTTTNAAAAANTAAGANCCCCCCCNGGNNGNGNAANNCCNNTNACAACTTTTTAAAC 

CCCCCCANCCCTNNGGGGGGCCCCCCCCCCAI UN i NNNNNTTTNTGGGGGGGTNNTTT 

CCCCCCTGGAAAAANTAAAGGGCCNAATGNTTTCCNGNNGAAAAANNATTNACGGTAAAA 

AATNGCANANAAANNAAGNNCNNNGACAAAAAATNANANAACCNGGGNGNNCCCNNAGGN 

GGCCCCNCNCCAAATNNANGGGNCGCCCCCTCTNCCCCNTTNCNAAANGAAAACACTNAG 

GCCCCTTTTTTNTTAANAAACCCCCCCCCCCNGNGGGANGGCG T I I \ I NTTTTTGGGCCT 

TTTTNNCCTTTTTNCTAAAAAAAATCNTAAAGCNNGGGGGNTTG 

Sequence 4025 
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CCGCGGTGGCGGCCCGAGGTAC I M I ) 1 1 I 1 I M 1 1 1 I I 1 1 i 1 1 1 1 GGGCAGCTAAAATC 

TAGGATGGTGTTAAGTTTCTTCATTTTGTCAATTATATATAAAAAATTAGAAACAACATG 

AATCTGCATTTCTTGGATGAGATAGTMTAACAMCTATTTCTCMTATTTGTATACTAA 

AAACTAGTGAAGGTGTTATGTGTTTCAGTATCTTATCTCTTATTTGMCATGGGTTTCTG 

AAAGGAGCCTATATAATAATATAAATGGTATGTAAGTAAATGAGGCACTGTCTTGGCTGG 

GACTGCTATAAAAAAATTACCATAGACCATTGACTAAACCACAAACATATAC7TCTCACA 

GTTCTGGAAGTTGGAAA 

Sequence 4026 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCAAGTGAACATG 

CCTGCAGATTCATTTGAATCTCCATTGACGCTAGCTGCCTGTGGAGGACATGTTGAATTG 

GCAGCTCTACTTATTGAAAGGGGAGCAAATCTTGAAGAAGTTAATGATGAAGGATACACT 

CCCTTGATGGAAGCTGCCCGGGAAGGACATGAAGAAATGGTGGCACTACTCTTmGCACAA 

GGAGCAMT ATAAA TGCCCAGACAGAAGAAACTCAAGAAACTGCTCTTACTTTGGCTTGC 

TGTGGAGGATTTTCTGAAGTTGCAGACTTTCTTATTAAGGCAGGGGCTGATATAGAACTT 

GGCTGCTCCACACCTCTGATGGAGGCATCTCAGGAGGGACACCTGGAATTGGTTAAATAT 

TTGCTGGCTTCTGGCGCTAATGTGCATGCTACAACAGCAACAGGAGACACAGCCTTAACC 

TATGCTTGTGAAAATGGACATACNGATGTTGCANATGTTTTACTTCAAGCA 

Sequence 4027 

CCGCGGTGGCGGCCCGAGGTACATTGTAGGATGCTTACCAGATCCTGCCCACTAGAGGCC 

AATAGCACCTCATCATCATAGCACCCCTTCCCTCAGTGATAATAAAAAATGTCTCCAGAC 

ATTGTTTAAGTTAATAAATGTGCCCAGAGCCCAGGCACAGTGGCTCAGGCCTGTGATTTT 

AGCTCTTTGGGAGACCGAGGTGGGTGGATCACTTGAGGCCAGGAGTTCCTAGAACGTTCC 

TCTGTCAAAAGACAAMTTACAACMTTTAAAATCTAATTTGGCTGTTATTGGCCATTCT 

AGAATTGGTTGACATCTCATTCTATGAAATACAATGAGTGCTCTTCAATAAGCTGAGCAG 

AGGGGGGTTAGCTTTATTGGCAGAAMGGGCTAMGAAMCAGAAAAAAGAACAAAAAGT 

GATTTGGTTGTTTCAGAAGTTACTnT 

Sequence 4028 

CGCGGTGGCGGCCCGCCCGGGCAGGTACCAACTAACCTCCCGCTCTTTATCTCCCCAACC 

TTCCCACTCCCCAGCCTCTAGTMCCATTATTCTCCTCTACCTGTGAGTTGATTTTGACC 

TCC C ACATAAG AGAATGTGGCAAGTATTTG 11111 CTGTGCCTGGGTTATTTCATTTAAC 

ATACTGTCCTCTGGTTCCATCCTTGTTGTTGCAMCGACAGGATTTTGTTCI I I I I IATG 

ACTGMTAATATTCCATAGCCATATACATACCATGTTTTCTGTATCCATTCATTCGTTGA 

TTGGCAGTAAGGTTGATTCCATATCTTGGCTGTTGTGAATATTACTGCAGTAAACATAGG 

AATACAGATATCTTTTCCATATACTGATTTCCTTTGTTTTGGATATATACCCAGCAAGTG 

GGCTTGCTGGATCATATGGCAGCTCTGATTTTAGGI 1 1 I I AAGGATGGG I I 1 1 III I IGC 

MTGAGAATAAAMTTTTCCATGTATAAGATCMGTGTCTTTGMCTTTGGTTATATCCT 

TTGCCCATTTTTCTATTGAGGTC 1 11111 

Sequence 4029 

GCGATTGGACTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTCTATATAAATCCAA 

GACAAGCAACAAACCCTTGATGATTATTCATCACTTGGATGAGTGCCCACACAGTCAAGC 

TTTAMGAAAGTGTTTGCTGAAMTAMGAAATCCAGAMTTGGCAGAGCAGTTTGTCCT 

CCTCMTCTGGTTTATGAAACAACTGACAAACACCTTTCTCCTGATGGCCAGTATGTCCC 

CAGGATTATGTTTGTTGACCCATCTCTGACAGTTTANAGCCGANTATCACTGGAAGATAT 

TCAAMTCGTCTCTATGCTTACGAACCTGCAGATACMGCTTTGGTGCTTTGACAACCAT 

GAAGAAAGCTCTTCAAGTTrGCTTGAAAGACTGGAATTNGTAMGNAAAAAAAAAATCTT 

CCAMGCCCCTTTTTGTCTGGCCAGGCCCTTTGAGGACCTTTGGAAAACCCAAGAAAGAA 

AGTGNTGAAGAAAGACTGGCTTAGGTGGTNGGAAACCATNAGTGGAACCACCACCTGGAT 

TTNGGGTTATTGGGTTTAAATGGTTCCAACAAAC 1 11111111 I A AGGAAAAAACAAAGG 

TTTTMG N AAAAA7TTTGGGGTTTAAAGGTN GTACATTGGTGGTGAAAAACCAAATATTT 
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GTTGTTCCTTNCCNTTTGGNGAGGCCCCNGNMTTTTTCTTAAAAAAAAAA 
Sequence 4030 

CCGCGGTGGCGGCCCGAGGTACTGGAAAGATGAACTGATCATATGGAGATGTCAGCCTTG 

TAMTCGAAAATAAMTTCTMGCCCCCCAGTCMCTGAACAGACTTTCCTTTTGGCCAA 

AGAGATCCCCCAAAACCTGAAAATAAACTAGTTCAGGAAGGCAGGGGGCAGGGGCGGTTG 

GACATGTCTCATTACACCCTCTTCCCTCTGGAGGTTAGACAAAATTGACTAGTGTTAACA 

TTAAAATAGAGATCTTAAGGACTGACAAAACAGACTCTGTAGCAATAAGACACCAAATTC 

CAACCTG ATTCTAGTGTAAC ATCACATG AC AGATAGTG GGTCCTAAAG GAAATC AAAGTA 

TTTTGCTCCAAAATATAC1TCCCTGACATATTTTGAMTGGGCCTGAAMGCTATCTCTT 

GCGGGGGAA 

Sequence 4031 

CGAGGTACTTAACTAGGGTTAAGCCCGAGGTCAGGAGGGGTGGATGCCTTATTATGGGAT 

TAAGTGCCCTTCTAAGAGGGACAGTAGAGGGCTTGTTCATGCTCTCTCTCTCCATGCACA 

AGAAAGAGCACACAGTGAGAAGGAAAGAGGCCCTCACCAGAACCTGACCATGCTGGCACC 

CGTGATTGCAGACTTCCAGCCTTCAGGATTGTGAGAAAGATAAATATCTGTGGGTTAAAG 

TCACCAAGTCTGTGGTGTGGTTTATGGCACACMGGGMCCCAGMTATGCTTTTCACCA 

CTGGGTGGGCCAGCCTTTCATGGAMCTGTCAGCTTCTTAGCCCTTGCAAAAAAATGCTT 

GCTGGTCACTCT CT1TG GTCCTTTTAAAAGCCGT7TCTTAGTGGNGCCTCTACCAAGCCC 

CTTGACTTCATCCTTTTCTTGTCCCANGNAAGGGTGAGTNAAGGACCTTGNCCCGGCCGG 

CNGCTNTAGAACTAGTGGGATC 

Sequence 4032 

AGGTACTGCCAGAAAAACACTGAGAGAAAMCACTGAAATTGAAAMCACTTTCAATTTA 
TTTGAAAGATAACTTTCAAAGAAATTTTCATMCTCAAATGTGTT^ 

AGTATATCAGAGAATTCCCTGGTAGCAATGGACTTCTCAGGCCAGAAAAGCAGAGTTATA 

GAAAATCCCACTGAAGCCCTTTCAGTTGCGGTTGAAGAAGGACTCGCTTGGAGGAAAAAA 

GGATGTTTACGCCTGGGCACTCACGGTAGCCCCACTGCCTCTTCACAGAGCTCTGCCACA 

AACATGGCTATCCACCGGTCCCAGCCATGGTTTCACCACAAAATTTTCTAGAGATGAGGC 

TCAGCGATTGATTATTCAGCMGGACTTGTGGATGGAGTTTTCTTGGTACCTGCCCGGGC 

GGCCNGCTCGACCTGCCATAACCCAAATACCAAACGCCCCTCTTCGTCTGATCCGTCCTA 

ATCACCACAGNCCTACTTCTCCTATCTCTCCCAGTCCTAGCTGCTGGCTTCACTATACCT 

ACTAAACAGACCGTAACCTTCAACCCCACCTTTTTTCGACCCCGCCGGGAGGAAGAGACC 

CCANTTCTATACCCAACACCTATTCTGATTTTTCGGCNNCCCTGAMGTTTATATNCTTT 

ATCCTACCCGGCTTNNGGAAAAATCCTCCCATATTTGTAACCTTACTACTTCCCGGAAAA 
AAANA 

Sequence 4033 

GTAATACGACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCA 

GGTACGCGGGTGGATTTGAAGGAGTTCGACTTGGATGCCCTGTTGGGTATAGATGACCTG 

GAAACCATGCCAGATGACCTTCTGACCACGTTGGATGACACTTGTGATCTCTTTGCCCCC 

TAGTCCAGGAGACTAATAAGCAGCCCCCCAGACGGTGAACCCAATTGGCCATCTCCCAGA 

AAGTTTAACAAAACCCGACCAGGTTGCCCCCTTCACCTTCTTACAACCTCTTCCCCTTTC 

CCCAGGGGTCCTGTCCTCCACTCCAGATCATTCCTTTAGTTTAGAGCTGGGCAGTGAAAG ' 

TGGATATCACTGAAGGAGATAGGAAGCCAGACTACACTGCTTACGTTGCCATGATCCCTC 

AGTGC ATAAAGG AGG AAGAC ACCC CTTCAGATAATGATAGTGGCATCTGTATGAG CC C AG 

AGTCCTAT CTTG GGGTCTTCTCAGCACAGCCCCTTTTACCANGGGGCTNTTCAAATAGGA 

GCCTTCCATTTTCAGGTGTTCTCTGNGGGTCTGCCCCGTCCCAAACCTTTACGATCCTNC 

TGGAAAGAAGANTGGTAGCANCCAAAAGTAAANGGGTGAGAAACTGGATAANGAAGCTGA 

AAAAAATGGGANCAAAAACAAGACAGCAGCCACTAGGGACCCTCGGCNGTTCTAAAACTA 

GTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTANTCGAATACCGCNAACTNC 

ANGGGGGGGNCCGGGACCCCAA 
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Sequence 4034 

CCGGGCAGGTAC rTTTTTTTTTTTTTTTTl NTGCTGTCTCTCTCTCTCTCTTTTTGACAG 

GGAAMCAGCTATCAGGTTTCTGGAACATTCTTGAATAGCTAATTCCTGGAGTCCTTGTC 

CTCACTTCTTCTCCTCTGTACCTCACTGGCATTTGCTCCMGATCTCCTTTTCAAACAAC 

GTCCTGGCACTGAGCTGAGATTTCCAAATGTTAACTCCCTGGCATCCATCCTGTAGGTGC 

AGAMGGATTCTTCCCTGATTTGCACACAAACCTTTTCATTTCGATGCATATCAGGGTCA 

TCTCGGCTTTGTGCCTTGCCTACAAGGTGGTTTGGAGGCCAT 

Sequence 4035 

AGGTAC I I 1 U 1 11 1 I I I 1 It I I M 1 1 1 1 CTTGGCTGGAGGCGGCTGTTGCTGCGTAGGC 

ACTATGATAGGTGCTGACTGATACACCGGCTGACTTGGGTAAAAAGGCGTTCCTTGCTTC 

TGGTCGCTAGCTGCAGTGACAGGGGTGCTGGCAGCAAGATGAGCGCTCTCCACAGTCCCA 

TAAACCACAGGGCTGTGCTCGGGGACCTGGCTGGGCAGCTGAGGAGGAGTAGGTGTGGAC 

GTGGGTCTTCCTATGGGTGGAGTAGGATTTCTGCTGCCACCTCCAGACATAATCTCCTCT 

GTTATGTCTTTACCTCCCTGGTTTGGATCCCGAATTCT 

Sequence 4036 

GTACGCGGGGGAGGTGAGTCCGGTCCCTTTTGCATCCCTACCCCGACACTGCGGGTTGTC 

ACAACGGCACCCTCCCGCTTTCTCTCTGCCTCGGATTTAGTCNGTGACTGTGTGTCTCCG 

CCGNGGTGCAGCTTCAGGCCTNTCCCGCATCTACTNTCTCACGCTTCCGCTGCGGCCTGA 

GGGAGGGCGGCGGGCANGACCACNGACCGGGGTTGGCATACCGTATNAAGGACAGTAACT 

ACCATGGCTCCCGAAGTTTTGNCAAAACCTNGGATGCGTGGGCCTTNTGGCCCANGCAGT 

CTGCGAAATCATATTGGNCTGN AGCATTTCANG CTATCCCTG • 

Sequence 4037 

AGGTACTTTNTTN II I I I I I i 1 I I II I I I I i GGTAACAGATCTAATCTTCCACGTAAGCC 

TCTTTACTGGANAANAGTAGATGGGAGGGCAGGTGGGATCTATTTAGACAAATCCNATAT 

ATTCANAATGATTCAGTAGGCAAANAATTCCTCCTGGTAACNAATCTGNGTGTCAAGGTC 

CTTACCTTGGCTCTCACTGTCCTCAGGATTATGGANGGGAAGATGATGCAGAATTCNGCA 

CATGTCTTCTGGATCATGGGGGA 

Sequence 4038 

CGGCCGCCCGGGCAGGTACGCGGGGGGTGGAGGGAACGTCTGACAAAATTCAATGTTTGG 

AGAAAGCGACATCTGGAAAGTTCGAACAGTCAGCAGAAGAAACACCTAGGGAAATTACGA 

GTCCTGCAAAAGAAACATCTGAGAAA1TTACGTGGCCAGCAAAAGGAAGACCTAGGAAGA 

TCGCATGGGAGAAAAAAGAAGACACACCTAGGGAAATTATGAGTCCCGCAAAAGAAACAT 

CTGAGAAATTTACGTGGGCAGCAAAAGGAAGACCTAGGAAGATCGCATTGGGAGAAAAAA 

GMCCCCTGTAAAGACTGGATGCGTGGCMGAGTAACATCTMTAAAACTAAAGTTTTGG 

AAAAA 

Sequence 4039 

CCGGGCAGGTACCTGTTGGCTTTCTGCAGAGATGCTTACTTTTCCTCTCCTATCAGAGTT 

TMTGTCCTGGGACTCTGACATTTTGGTMGTGTTTTCTTAACCCGCAGGGCTGTCAGCC 

TTGATGCAGGATTGTGAGCCCAGCATTCTGTCATGAGTTTTCCCATCTGCCTTAGACACT 

CATCACTGCTCCACCGGTTTGGGAATGAGGGGCGTAACTTCTTGATGCACACAATCTCCC 

TCATGTCCTCATAAGAGGGGTCAGTGGGCACTAGGTCATGATAAGGAAGCTGGTATTCTT 

CCACTATACCTCCTGATACACATCTCCTAGCAACCTCCCAAAGGATGAGGC 

Sequence 4040 

CCGGGCAGGTACGCGGGGTGGTGGCTGTGCTGGTGGCGTTCAAGATGTCGACCAAGAATT 

TCCGAGTCAGTGACGGGGACTGGATTTGCCCTGACAAAAAATGTGGAAATGTAAACTTTG 

CTAGAAGAACCAGCTTGTAATCGATGTGGTCGGGAGAAAACAACTGAGGCCAAGATGATG 

AAAGCTGGGGGCACTGAMTAGGAMGACACTTGCAGAAAAGAGCCGAGGCCTATTTAGT 

GCTAATGACTGGCAATGTAAAACTTGCAGCAATGTGAATTGGGCCAGAAGATCAGAGTGT 

AATATGTGTAATACTCCAAAGTATGCTAAATTAGAAGAAAGAACAGGAT 
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Sequence 4041 

GCCGCCCGGGCAGGTACTTAACTAGGGTTAGCCCGAGGTCAGGAGGGTGGATGCCTTATT 

ATNGGGATTAGTGCCCTTCTAAGAGGGACAGTAANAGGGCTTGTTCATGCTCTTTCTCTC 

CATGCACAAGAAAGAGCACACAGTGAGAAGGAAAGAGGCCCTCACCAAAACCTTGACCAT 

GCTGGCNCCGTNATTGCAGACTTCCAGCCTCCAGGGATTGNGAGAAGATAAATATCTGTG 

GTTTMGTCACCAAGTCTGTGGTNTGTTTTTATGGCAAGCACAAGGTAACCCCANATGAT 

GCTTTTTCACCCACTTGGGTNGGTCCANCCCTTTNATTSIGGAANCTGTNAACTTCCTCAAT 

CNCTTGGCAAAAAAATGCTTGC 

Sequence 4042 

AGGTACI I I II I n n I I I I I I I I I I l I ICI I I 1 1 I I I ICM I I I I I I I I I I I 1 1 1 I n I 

! i M I ( M ( I I I M i i [ M i 1 1 [ I [ [ i i I I I I I I I I i I I NTTNNTTTAAAAAN I I II I NN 

NTTTNTNTNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAAACACCCCGGGGGGNNANNGG 

GNCCCNTTTTTNNAAAAAAAACCNNCCCCAAAAANAAMTTTTAAAAAAM 

NAAAAAAAANNAAAAAAAAANCNNAAAAAAAACNNNCCTNAAAAAAA 

Sequence 4043 

CCGGGCAGGTAC I I I I ! I I I I I I 1 1 II M I I I I AAM I I I I 1 1 I I M I I I I I I I I I I I I I 
I I □ I I I I I I n I I I II II 1 1 II I I INI I I I I I I I 1 1 NTTTTANNTNTNTATNTTTTANN 
TTTTAANNANAAAANGNAAAAANNTNAAAAAAAAAAAAA 

I I I II I I I I N AAN AAAAAAAAAAAN AAAAAAAAAN N CAAAAAATTC NTTTN N N NTN NAAA 
AAAANNNNANAAAAAAAATNTTTTrTNTTTTTTAA 
AAAAAAAAAAANCCCCCNNTCTTTTNTNTNNAAAAAA 
Sequence 4044 

CCGGGCAGGTACAACCAGCAACAAAAAAACTCACAAATTCTAATGTCTGGCCCTCAGAAC 

ATCCTTCAGACTAATTTTAGAGATTCTTAAAMTTCAAAAAAGTAATGCAATGCCCAAAA 

CAAAACAAATATATATGGAAGGAAAAGGAGGGGGGAAGCAACACTGTATCATGTACCAGC 

ATCCCCAGCGTCTGGCATTCCATGTTTCTGCTCCTGTGGTCTCCACGGTGCAACAAGCTA 

GCGGTTTACTTGGACCTCTGCCTCATCTTTCTTCTTTTGCGCTTCAGCCTGCGCATTTCG 

CTTTCTTCCTCCACTTTGGCTCTCATGGGCGCAGAGGTTTCCAAAAA 

Sequence 4045 

CCGGGCAGGTACAATTCAGGACACAGGCATGGGCAAATACTTCATGACTAAAATACCAAA 

AAGCMTGGCAACAAAAGTCAAAATTGACAMTGGGATCTAATTAAACTAAAGAGCTTTT 

GC AC AGC AAAAG AAACC ATC ATC AGG GTG AAC AGGCAACCTACAG AATG G G AG AAAATTT 

TTGCAATCTATC CGTCTGACAAATGGCTAATATCCAGAATGATCTCTATTTTA I ITTTTA 

TGACATATTAATCATGTTTGTTCTCTTGTGTCCTCTCTTTCTTTCTCTCGATTTTCCAGT 

GGGTCCACGGTGGTCTATTTGTAGGGTTCATATTTACAAATGGGAGTCA 

Sequence 4046 

aggtaci 1 1 1 m i n n n n i n n i n i iatgaaggttgaatgcagggttttattgag 

TGGTGGAGGTGGTTCTCTGAAGGATGGATGCAGAGCTGGAAGCGGGGATGGCACGGGAAG 

ATAATCTTCCTGCTGGGTGGCCAAACCTCTCCGACCACCCTCGGCTGAACTCCTCTCGGC 

GTTCAAATGTTCCTCTTCTCTCTTTCTCTGCTGTATCG7TTCACCATTCATCTGCTTGTT 

GCTTGGCGATAGGTGATTGTCTCGCCGCTCAGCTATAGGCGATGGTCTCACCACTCGGTG 

ATAGGCGATTGTCCCATCTAGGTCGCCAAAATGTGTCTGGAAT 

Sequence 4047 

CCGGGCAGGTACTCAAGGCAAACCAGCAATTGCCCATCGAGATCTGAAAAGTAAAAACAT 

TCTGGTGAAGAAAAATGGAACTTGCTGTATTGCTGACCTGGGCCTGGCTGTTAAATTTAT 

TAGTGATACAAATGAAGTTGACATACCACCTAACACTCGAGTTGGCACCAAACGCTATAT 

GCCTCCAGAAGTGTTGGACGAGAGCTTGAACAGAAATCACTTCCAGTCTTACATCATGGC 

TGACATGTATAGTTTTGGCCTCATCCTTTGGGAGGTTGCTAGGAGATGTGTATCAGGAGG 

TATAGTGGAAGAATACCAGCTTCCTTATCATGACCTAGTGCC 
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Sequence 4048 

AGGTAC 1 1 II 1 1 I M I M I HI 11 Ml rTGCGATTAAACTACTGTTGTATCATTGGGCTG 
CGACGGGTTACAGGTCCACAAACTGTTTAACTGG AC ATATGTGGAGG N GGTAAATCCCCC 
CATTTTACAGGTCTTTGGGMGTTCTGGGGGCCAGGAMTCCAMTTTGATAATAGGA 
Sequence 4049 

GGCCGCCCGGGCAGGTAC 1 I I I 1 1 I I I 1 I 11 M I 1 I M I 1 1 I I I GGAAGGTTGAATGCAG 

GGTTTTATTGAGTGGTGGAGGTGGTTCTCTGAAGGATGGATGCANAGCTGGAAGCGGGGA 

TGGCACGGGAAGATAATCTTCCTGCTGGGTGGCCAAACCTTTCCGACCACCCTCGGCTGA 

ACTCCTCTCGGCGTTCAAATGTTCCTCTTCTCTCTTTT 

Sequence 4050 

CGAGGTACTTTTTC 1 ITTTTTTTTTTI I GAGACAGAGTCTCGCTCTGTTGCCCAGGCTGG 

AGTGCAGTGGCACAATCTCAGCTCACTGCAACTTCCACCTCCCGGGTTCAAGTGATTCTC 

CTGCCTCAGCCTCCTGAATAGCTGGGACTGCAGGCACCCACTGCCACACCCCGCTAATTT 

TTGTAI I I I I AATGGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTTCTGA 

CCTGAGGCAATCCATCCACCTCGGCCTCCCAAAGTGCTAGGATTACAGAGGTAAGCCACC 

ATGCCTGGCCCCAGTCTCTACTMTTTTCTGTTGATGCTGTAACAAATTACTACAAACGT 

TGTGGCTTAAAACAATATACATTTATTATCTTAACAGTTCTGTAGGGTAGGAATCTGACA 

CAGGTCTTGTTGGCTAAAATCAAGGGACTGGCAAGGCTGCATTCTTTCTGGAGGCTCTAG 

GAGGGAATCCAGGTACCTGCCCG 

Sequence 4051 

GGAGCTCCCCGCGGTGGCGGCCGANGTAC I I I I M II I I I I I NTTTTTNTAACGCCTTTC 

CTACATCAGGTNTATTCAAAACACAMCAAGTATTTCTCTTTNTGTAAGGGCAAATGGTT 

CAAATAATGCGGTACACGAAACATTGACTAATCMGTGCTTTAMTATNAAACANAATTA 

TTCTTTTAAAAAAGCAAAATAATAANGNATATATACAAAAGGGACCTGGANTNTGTAAGC 

TGATTCCAAAANCGAAATAAGTAGAAAATCCATGGTGAAACCTGAACATTCTACCTTTG 

Sequence 4052 

CAGCGAGATAGGGTTGGAGGAGAAGTANCGCCGGGACTTCCGGATGGCAAACTTCTCTGT 

GGGTAGAGATTTCCCAGCAATCTTGAGCTTCAGGCCTGGCACAGCTCGAAATAATTCCAC 

TTCGTCGACCCCGAACGGCTTGTGGTCCTCCTTCCCAAACATGCTGAGGTAGGCGGCCTT 

CATGTAAATGTAGGTGGCC I I I' I " I AAGTC AG ATC ATGTCAGTTCCTTCTGGAAATCTGGN 

TATATTCATCACACTCAGGAGACATCTCCTACAATTTCCTTGACAC 

Sequence 4053 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTrrmCTTTTTCCT 

Mllll I ATTGTTTG i I U I II G UI l M I GGGGG l IMIU I GGCAGCCACAGGAGTrT 

TAGCCAATTCAGATGCCTTGCTCCCCACAATTTGGMCATTCCTTTGGATrTGACCAAGT 

CAGGMGAGATGGGAGAGAAGTGAAACAGCAACAATAAACCCCAAACA" I *AAACAAAAAGA 

GTTAAGCAAAACAAACAMTGCACAATTCATATGATTAACTGAGTGTTCTAATGGTAAGG 

AGGMTTAAMGCAGCTGGTGGGTAATCTTAMTTTTAGTCATTMGG, A AAAATT^ 

ACAAAACTCTAACCCGCGTACCT 

Sequence 4054 

CGCCCGGGCAGGTACCTTCAATTTTTCCCACGGTMTGGMTATMCTATTTATCMTTT 

ATCACTGCAACTGACATAGCCAGGGAAATGTTTAAGAMTGMTAAATAGMGTTTATTC 

CCTGCAGGTAGCGATTG AGTC CAC C AAMTCTTMGCTAAATTTTATGTTGTTTCATGGT 

AGCTGTTATGAAAATGGACCATCTAAGAGAAMTCCATTGTTTCTCAAATTCAAATGCAT 

TCTGTGTGACTAGGTTGTTCCCGTGATAATGCTATGCGACATTGCTGTTCTCTTCTATTC 

ACCAGTTTGCCTTCCTAATAACCTCTTCTCATATACATTCTTTAGGAAAAGACCAGTGCC 

CTCAGTCTGAGCAACGTTGCTGGAGTATTCTACATCCTTGTCGGGGGCCTTTGGTTTGGC 

AATGCTGGTGGGCTTTGATTGAGTTCTGTTACAAGTCAAGGGCCCGAGGCGAAAACGAAT 

GAAGGTGGCAAAGAATGCCCAGAATATTTAACCCATCTTCCTTCGCAGAAATTCACAGAA 
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TTTTTGCCMCTTATAAGGGMGGTTACAACCGTATATGGCATCGAAAGTGGTTAAATTT 
AAGGG 

Sequence 4055 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAGTG 
G GAAC ATACC AGCCTGAATTTGGAAAAAATAATTGTGTTTCTTGCCC AG G AAATACTACG 
ACTGACTTTGATGGCTCCACAAACATAACCCAGTGTAAAAACAGAAGATGTGGAGGGGAG 
CTGGGAGATTTCACTGGGTACCT 
Sequence 4056 

CCGCGGTGGCGGCCGAGGTACCACACCAGCCTCAGAACCCGCTCCACTGGCCCCCCATGA 

GGACCACCAGCAAAGGGAGACCAAAGAGAATGACCCCATGGACAGCCATCAGTCCCAGGA 

ATCCCCAAACCTGGAAAACATAGCAAACCCCCTAGAAGAAAATGTAACGAAAGAATCAAT 

CAGTAGTAAAAAAAAGGAAAAMGGAAACATGTGGACCACGTAGAAAGTTCACTATTTGT 

AGCACCAGGAAGTGTTCAATCCTCAGATGACCTAGAAGAAGACAGTAGCGACTACAGCAT 

TCCTTCCAGGACTAGTCACAGTGACTCCAGCATTTACCTTCGACGACATACTCATAGGTC 

TTCGGMTCGCCCAGAGCTTTGAAATTACTGGCATGCAACAGCTATACTTTTGGATAGAA 

GACAAGCAGATTGGAACCTTATCATTAAGCACAGTCCGATCGGATATCCGGTTTGGGCAA 

TAAGAGATGTAAATAAAGTGAGGATTTTAAAGCCCAAAGGCCTCCTTGCTATAAAAACGC 

ACCCAAAACGCCTT 

Sequence 4057 

CCGCGGTGGCGGCCCGAGGTACTTTC 1 rTTTTTTTTTTTTTTTTn I GCATTTGAGCTAA 

GATTTTMCAATCCTGTGAAAATAGGTMTATAGTCACGTATTTGTTGAATACATTATTT 

CAAAGAACCACAAACACACAGAAAAAATATTTAAAAAGTCCATGGTGGGGGAAATGATCC 

AGAAACAACAGCTCTTNTACTTCCAGCCCTAACCGTCTGGGATGCTAGATGGNTGNGGGC 

AAACCGTGGAAAMGATAAGCTTCTGNTTrrCMGATTTT 

AAATCTAAGTGATGNCTCTGTTTTTGCTTTCTAGCAATTAATATATGACAGCAGTCTTTG 
NGATTTA 

Sequence 4058 

CGGCCCGCCCGGGCAGGTACGCNGGGGGAGCATGAACCCTGTTGTGAACTGCTCATCTGA 

GGGATCTAGGTTGTGTGCTTCATATGAGAATCTAATGCCTGATGATATGTCACTGTCTCA 

CTTTGCCCCCAGATGAGACCATCTAGTTGCAGAAAAATNAGCTCAGAGTTTCCACGGATT 

CTACATTATGATTTGATCTTCAACAAAGCTGACACTGGGGAAAGGACACACTCTTNAATA 

AATGGTGCTGGGAAAATTGGATAGCCACATCCAGAAAAATGAAACTGGACCACTATCTCA 

CTATATACAAGAATCAACTCAAAGTAGATTAAAAACTTAAACAGAAGACCTGAAACTATN 

AAAATACTTGAAGAAAACCTANGGAAAAATTTCCTGGCCTTTTGTCTAGGCGMGAATT^ 

AATGACTAACACCTCANA^.GCCACAAGCAACACAAGATAAAAAATNGACANNATGGGACT 

TAATTAACTTAGAAAGCTTCTTGAGCNGCAGAAGGAAATAATCAGCAGNANTGAAGAGAC 

CACCTTCTTGAATGGAAGi/AAAATATTTGCAAACTGGTCATCTAATAG NGGG NCTAATACC 

TNGGAATTTTNCCAAA 

Sequence 4059 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTTGTTTTTCAAACAATAAAACAGGCCGGGCA 

CCGTGGCTGGCTCACGCCCATAATCCCAACACTTTGGGAGGCCGAGGCGGGCGGATTACC 

TGAGGCCAATATGGTGAATTCTCCTCTCGACTCAAAATACAAAATGATTAGCTGGCCGTG 

GTGGTGTGTGCCTGTAATCCCAGTTACTTGAGAGACTGAGAGGAGAGAATCGCTTGAACC 

CGGGGGGCAGAGGTTGCAGTGAGCCAAGATTGCACCACTGCACTCCAGCCTGGGCAACAA 

GAGCGAAMCTCCATCTTAAAAATAAAATAAATAATAAAACAATGATGAAAACACAACTA 

ACTACAGCACTTCAGAGTTAGTTGACAAATCCACCTTTTCMTCTACATTTCAAAATGTT 

CAGAMGACACCATCCTAGGGGAAGTCAACCAGCAGCAACCTCTCTGCTMTTTTTGTAC 
CT 

Sequence 4060 
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CCGGGCAGGTACGCGGGGCCATTTAGCCTGCCATTGAAATGCAAAAGTCTGTTCCAAATA 

AAGCCTTGGAATTGAAGAATGAACAAACATTGAGAGCAGATGAGATACTCCCATCAGAAT 

CCAMCAAAAGGACTATGMGAAAGTTCTTGGGATTCTGAGAGTCTCTGTGAGACTGTTT 

CACAGAAGGATGTGTGTTTACCCAAGGCTACACATCAAAAAGAAATAGATAAAATAAATG 

GAAAATTAGAAGGTAGATATGCTGCTGAATTTAGAACATTCTCTGCAATGATAAGGTCTC 

CTGTTAAAGATGGTCTTCTGMGGCTMCTGCGGAATGAMGTTTCTATTCCAACTAAAG 

CCTTAGAAATTGATGGACCATGCAAACTTTCAAAGCAGAGNCTCCGAGAAGNCATCTGCC 

TTCGAGCCTTCCCATTGGAAATGCA 

Sequence 4061 

CCGCGGTGGCGGCCCGAGGTACACACTGGCAGAGCACCCGACGGACGGTGGCAGAGGCAA 

AGMGAAGATTGAACAACAGGGAGGCTTCACCTTTGAAAACAAAGGAGTCCTCTCCGCAT 

TTAATTTTGGGACTGTGCCCAGCAACAACTGAAGGAAAGAACATCAGCTGACCAAATGTC 

ATCGCTGCATTTTATTTCACAAGAGGAGTGTGAGGGTCAAGGGGATGAAATGAGGGGCTG 

CTTTTAGGGCCCTCCTGCTGTGCCAGTTACCATCTGGCATTAGGCAGCACTTTTATCTAC 

TCTTTCCCCTTTGACCTTTGTCACCCTGAMTATATATTTTAAACAGCTACTGTAAGTAT 

GAAATGAAAGAAAAACAATCATTGGACGGAAAAGGACAACCCATATGTTCCAAAGGCTGA 

ATGCCCAAGGTTGTTTTAGAGGATTGGATAGACTTGCACCGTCTCANGTTTTTGCCCATG 

CAGAATCAATGGA 

Sequence 4062 

CNAATTGGAGCTCCACCGCGGTGGGCGGCCGNANCTCTGAGGAGCGGCCAAGCAGCAGGA 
CAGGCAGCTTGCCCTGGTGGTCAATTTTGATGAGTGCGATGTCTGCTTTCTCATCCCCCG 
CGTACCTGCCCG 
Sequence 4063 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCTCGAGGTACAGGGTCCCAAAATGCGCTGAC 

TCAGCTTCACCAGCCCATCTCGGATCTCATTGCCAAAAATGTCTCTTCGGTCATGAATGA 

GGATCCAGCCCAACCTTCAGNCAGGAACCAGCCAGCGCTTGGCCAGCCCTTCCACAGGAC 

AGGATGGGGACATCGGTGCTGAGGGTGGCCAGTGGTTCATATTTGCAATCCGAAAACACC 

ATGTCTCCATAGATCTCATCAGCTAAGATGGGGACACACTGCCCGTGCAGGCACTGCCAG 

MTCTTCTGMGATGACCGTTNGCTGAACACTGACCCACAGGGGTTTTGATGGATTATTG 

ACAATGGAGACAAGCTGTCTTTTCATCAA 

Sequence 4064 

MCTCCTCTCGGCGTTCAMTGTTCCTCTTCTCTCTTTCTCTGCTGTATCGTTTCACCAT 
TCATCTGCTTGTCTGGGATGGTGAAGACAACTGTTTGCTCATTCTAAACGACACGGAATA 
GAGGCGTCTCCTCATCTGCTCATCTGCTTCTGGAGCCCGAGGTTTGGGGTTTATATGGGT 
ACCT 

Sequence 4065 

CGAGGTACGACCGCAAAGCCTGACTCTTAACCATTGTGTTCTGTTGCCTTCTCCAAGGGA 

AAGGAGTGCTTCCAGAAAGGCCATTGGAAGTGAGGAGCGTCACCTGATAATAGCTGGCTC 

CTCATCCCACTAGGTTAACAGGCAACAGATGGGGGCCACAGAGTTGGCTGGAGTTAGTCC 

TGGTTGCCAAGGGTAAATGGGGCTGTTACCATGCAATGGATAGAAACTAGGATGCTCCTG 

TGTGTCTCCTTATATCCTACCAGGTGTAGCATGAAGGTTAAGGACAATTCCGTCCTCTTA 

TTTGGAGAGGAGACCAAGGCCCTGTAGGACTAAAGGTTTGGGTGAGCTGAAAGCCAAGGG 

CMTTGGCCCAGCTTGGGGAGGAGAGTTACAAATACATTTGTGACTATCANACCAGAAAC 

AGACTGCGGTGTTCACCTGCGGTTTGGGTACCTGCCC 

Sequence 4066 

CGCGGTGGCGGCCCGAGGTACGCGGGGACTGCACACATGCTAGGGTCCAGGACAGCAGGA 
CCAAGCCAGCAGAAACAGCCTGAGCCCACCGCAGACTGGCCTGGCTATACTGGACAATGC 
CACTCCTCCTGTACCTGCCG 
Sequence 4067 
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GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACTTGAAACCAAATTTCTAAAACT 
TGI n 1 I CTTAAAAAATAGTTGTTGTAACATTAAACCATAACCTAATCAGTGTGTTCACT 
ATGCTTCCACACTAGCCAGTCTTCTCACAC I 1 I I I GGTTTCAAGTCTCAAGGCCTGACAG 
ACAGAAGGGCTTGGAGAI i I H'l I I CTTTACAATTCAGTCTTCAGCAACTTGAGAGCTTT 
CTTCATGTTGTCAAGCAACAGAGCTGTATCTGCAGGTTCGTAAGCATAGAGACGATTTGA 
ATATCTTCCAGTGATATCGGCTCTAACTGTCAGAGATGGGTCAACAAACATAATCCTGGG 
GACCCCGCGTACACCTGGGCACCAGTGACATTCCAGATGTCCTTGGGGGGCGTCACTATG 
GAAGGACCTTGCTCGCAGGTGCCCTTGCTGACCTGGGTGCCCGCCGTACCT 
Sequence 4068 

CCGCGGTGGCGGCCGAGGTACTTCATTTAATTTTTGCCTCTTACAGTCTGATATCCTGTT 

GGGCATTCTGGACAACTTGGACCCAGTCATGTTCTTCAAATGCCCTTCCCCAGAGCCTGC 

CAGCCTGGAGGAGCTCCCAGAGGCTACCCAGAAGGACCCAGTTCCTTACCAGCCTCCCTT 

TCTCTGTCAGTGGGGACCGTCATCAGCCAAGCTGGAAGCCATTAATGAACTAATTCGTTT 

TGACCACATATATACCAAGCCCCTAGTCTTAGAGATACCCTCTGAGACAGAGAGCCAAGC 

TAATGTGGTAGTGAAAATCGAGGAAGCACCTCTCAGCCCCTCAGAGAATGATCACCCTGA 

ATTCATTGTCTCAGTGAAGGAAGAACCTGTAGAAGATGACCTCGTTNCGGAGCTGGGTAT 

CTCAAATCTGCTTTCATCCAGCCACTGCCCAAAGCCATCTTCCTGCCTACTGGGATGCTT 

ACAGTGACTGTGGGATACGGGGGTTCCCTTTCCCCATTCAAGTGACATGTCCTCTCTGNT 

TGGTGNAAACCATTTTTGGGGAGG 
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Sequence 1 

CCGCGGTGGCGGCCGAGGTACAGGAACTTGATAATCATTGTTGTTGTCTTTATCATTACC 

ATCATCTCCATTAGCATCATCAGCATCACAGTTGCAGTATGGGGATCACATTAAGTGATG 

CGGCAAGATCAGGAAAACCATTATCAAGACAATGTCCAGTTTGGATGTGTAAATATTATC 

CTGTGCTTCTTGCCCATTTAGCAMCACTTTCTTTTCAGTGATAAACCTATGACCCCCAT 

ATGGGGCATGTTCATGAAGAAAATATTCATCACACTCATCTCTTCCTTGTTCATAATAAT 

GGTAGGGGACTTTTCTATACCCTTCTGTCTTGCAGTAGGTGTCATTTATCATCCCGTAGA 

CGTGAATGCCATAACAGGCGTCCATGGCCAGAATGAAGGTAAACCACCCTGTGCTGAGAT 

ATGAGCCAGACTGGACTCTGCCTTCCCAGTTTCCTTCTTAAAAACTCCATCACAGTAACT 

CATGCGCTTCTCTGTGGTCACCGTATATTTGGGCATTCGGATAGATCCMCTGTCTTTTT 

CMCATGTTGTAMCGATGCCATTGCCATCTTTCCTCATATTGCGGAAAGGTCCCCAAAT 

AACATAAATAGTAGTATTCGCTTCCTTGAAAAAATAATCANGGTTT 

Sequence 2 

CCGCGGTGGCGGCCGGGAGCCGCGGAAAGAGTAGGGCCCCCCAGACAACGGGGTCCGACG 

AGCAGTTGGGCACGGGCTGAGTCCGCCTGCCGGCAGAATGTCCCTGGCTGCTTATTGCGT 

CATCTGTTGCAGAAGAATAGGAACCTCTACTTCCCCACCAAAAAGTGGCACACACTGGAG 

AGATATCAGAAATATAATAMGTTTACTGGATCACTTATTTTAGGAGGTTCTTTATTTCT 

TACATATGAAGTTCTGGCCCTGAAGAAGGCTGTGACATTAGATACTCAAGTGGTAGAACG 

AGAAAAAATGMGTCATATATATATGTGCACACAGTTTCTTTAGATAAAGGAGAAAATCA 

TGGTATTGCCTGGCAGGCAAGAAAAGAACTTCACAAAGCAGTAAGAAAAGTATTGGCAAC 

ATCAGCCAAGATACTGCGGAATCCATTTGCTGATCCTTTTAGTACCT 

Sequence 3 

CGCCCGGGCAGGTACCCGGCTGGCCAAGGAGAAGGAGCAGAGCCTCACGGCAGAAAAGGA 

TGAGGACGAAGAGAACGATGCCTCTCTGGCCAACTNCTCCACCACCACTTTGGAGGACAA 

AGGCCCTGGCCATGCCACTTTTGGCCGCGAGGCCACCAAATTTGAAGAGGAGGANAAACQ 

TGACAAGGCCTGGGAAGCCAGACCCCCACGAGAGTCCAGCGATGTTCCCCCCACGAAGAG 

AAATAACTGGATCTTTATTGATGAGGAGCAAGCCTTTGGGGTCAAAANGACAGGCCCCGG 

GCCCGGGGCNCGTGGTTTCAGAAAAGTTCACTTTTCGTGGTCNGNCTTCTTGGCGGAAAT 

GGGAACCGGCCTTTTGTGGTGGGGGGGGTCCTGGGGGGCCCCCANCATNTACTGCAACAA 

GTCAAGCGCAACGGGNCGCTTTTAAAACTAGTGGGATNCCCCCGGGCCTGGAAGGAAATT 

CGATATTAAAGCTTATTCGATACCGTCACCCTTNANGGGG 

Sequence 4 

ANTNTATCCCCCNNNCTNCANNMTTCTATATCAANCTTATTTATACCTTCCAAGGGAAN 
GTTNNNCCCNNTACCCANCTTTTTTTCNMGGAATTMNCTTAATTTCCCTCTTNNCTTA 
ATCATNNTCATANCTTNTTTCCTNTTNTCAAATTNTTATCC 
Sequence 5 

CGCGGTGGCGGCCGAACATCTCCAGCAAGCTCCGGTCGATGGCCTGCACCTCTGGCTCTA 
GCTCCGACNCCATCTTCTCTCCTCCTCCTTCCCAGCGCTACCGCCGNCCGGAGAGCTGCA 
GCCTCACCTCNG 
Sequence 6 

GCTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCCTTTTCTTCCAGCCCAGAGCTAGAC 

TTCTGGATGCTGAGCCTGGGAAMTACTGMTTTCTCTCATTTTTCACTTTTAG GAGGG A 

ACGGTTATGGCATATACTTGCAGTGCCATTAGAAGCATTGGCAATCCCTGATCTTTTTGA 

TAGTTTAAGGCTTAGAATCATCAAAAGTTCCAGGTTCATATCCTTCTAAAATGGAAGAAA 

TGGAGGAAGTTCCAAGTGTGGTACCTCGGC 

Sequence 7 

GGAGCCGCGGAAAGAGTAGGGCTCCCCAGACAACGGGGTCCGACGAGCAGTTGGGCACGG 

GCTGAGTCCGCCTGCCGGCAGAATGTCCCTGGCTGCTTATTGCGTCATCTGTTGCAGAAG 

AATAGGAACCTCTACTTCCCCACCAAAAAGTGGCANACACTGGAGAGATATCAGAAATAT 
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AATAAAGTTTACTGGATCACTTATTTTAGGAGGTTCTTTATTTCTTACATATGMGTTCT 

GGCCCTGAAGAAGGCTGTGACATTAGATACTCAAGTGGTAGAACGAGAAAAAATGAAGTC 

ATATATATATGTGCACACAGTTTCTTTAGATAAAGGAGAAAATCATGGTATTGCCTGGCA 

GGCAG 

Sequence 8 

AGGTACTAAAAGGATCAGCAAATGGATTCCGCAGTATCTTGGCTGATGTTGCCAATACTT 

TTCTTACTGCTTTGTGAAGTTCTTTTCTTGCCTGCCAGGCAATACCATGATTTTCTCCT^ 

TATCTAAAGAAACTGTGTGCACATATATATATGACTTCATTTTTTCTCGTTCTACCACTT 

GAGTATCTAATGTCACAGCCTTCTTCAGGGCCAGAACTTCATATGTAAGAAATAAAGAAC 

CTC CTAAA ATAAGTGATCCAGTAAACTTTATTATATTTCTGATATCTCTCCAGTGTGTGC 

CACTTTTTGGTGGGGAAGTAGAGG7TCCTATTCTTCTGCAACAGATGACGCAATAAGCAG 

CCAGGGACATTCTGCC 

Sequence 9 

CCGCGGTGGCGGCCGAGGTACAGGCTGACAGAGAAGATTCCCGAGAGTAAATCATCTTTC 

CAATCCAGAGGAACAAGCATGTCTCTCTGCCAAGATCCATCTAAACTGGAGTGATGTTAG 

CAGACCCAGCTTAGAGTTCTTCTTTCTTTCTTAAGCCCTTTGCTCTGGAGGAAGTTCTCC 

AGCTTCAGCTCAACTCACAGCTTCTCCAAGCATCACCCTGGGAGTTTCCTGAGGGTTTTC 

TCATAAATGAGGGCTGCACATTGCCTGTTCTGCTTCGAAGTATTCAATACCGCTCAGTAT 

TTTAAATGAAGTGATTCTMGATTTGGTTTGGGATCAATAGGAAAGCATATGCAGCCAAC 

CMGATGCAAATGTTTTGAAATGATATGACCAAMTTTTAAGTAGGAAAGTCACCCAAAC 

ACTTCTGCT TTCA CTTAAGTGTCTGGCCCGCAATACTGTAGGAACAAGCATGATCTTGTT 

ACTGTGATATTTTAAATATCCACAGTACCT 

Sequence 10 

AGGTACAGGCTGACAGAGAAGATTCCCGAGAGTAAATCATCTTTCCAATCCAGAGGAACA 
AGCATGTCTCTCTGCCAAGATCCATCTAAACTGGAGTGATGTTAGCAGACCCAGCTTAGA 
GTTCTTCTTTCTTTCTTAAGCCCTTTGCTCTGGAGGAAGNTCTCCAGCTTCAGCTCAACT 
CACAGCTTCTCCAAGCATCACCCTGGGAGTTTCCTGAGGGTTTTCTCATAAATGAGGGC 
Sequence 1 1 

AGGTACGATATACGAAGACTCTGAGCTGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGT 
TGTAAAGCTCATTAAGGCCAACTTTTACTTTC^ 

GGAGGCGCTGATCCATGCTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCA 

AAGTGTATTCTCCATCTGGCCTCAGAGCAGATGCCAAGCCTAATTGGGCCACAAGATCCG 

TGMGGTGGTTTGCTGTTTTCCAGCCAGCTCAATMCTTGTTTGGCAGAATCAGGAATTA 

GGACCTGATCAATCAAATGGATCACACCATTATTTGTCACAATATCCTTTTNGTTCACCA 

TTTTGATTCCATTTACTGTTATACTGTCACCGTCACATCCTATCTCAATTNGTATTTCCT 

TCCAGCGTCTCAAAGACTGCTCCTCCCATMTAGACTCAGAACCTGGANGAGTATTTTAA 

GATGTGGTACCTCGGCCGTTCTAGAACTAGGGGGATCCCCCGG 

Sequence 12 

CCGCGGTGGCGGCCGAGGTACTGTGGATATTTAAAATATCACAGTAACAAGATCATGCTT 

GTTCCTACAGTATTGCGGGCCAGACACTTMGTGAMGCAGAAGTGTTTGGGTGACTTTC 

CTACTTAAAATTTTGGTCATATCATTTCAAMCATTTGCATCTTGGTTGGCTGCATATGC 

TTTCCTATTGATCCCAAACCAMTCTTAGMTCACTTCATTTAAAATACTGAGCGGTATT 

GAATACTTCGAAGCAGAACAGGCAATGTGCAGCCCTCATTTATGAGAAAACCCTCAGGAA 

ACTCCCAGGGTGATGCTTGGAGAAGCTGTGAGTTGAGCTGAAGCTGGAGAACTTCCTCCA 

GAGCAAAGGGCTTAAGAAAGAAAGAAGAACTCTAAGCTGGGTCTGCTAACATCACTCCAG 

TTTAGATGGATCTTGGCAGAGAGACATGCTTGTTCCTCTGGATTGGAAAGATGATTTACT 

CTCGGGAATCTTCTCTGTCAGCCTGTACCTGCCCGGGCGGCCGCCCCGGGCAGGTACACC 

TTGAATGACAATGCTTCGGAGCTNCCCTGTGGGCCATCGACCCTTCCTGCCATTGATGCA 

CCATCCAACCTGCGTTTCTGGNCACCACACCCAAATTCCT 
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Sequence 13 

AGGTACCACATCTTAAATACTCTCCAGTGTTCTGAGTCTATTATGGGAGGAGCAGTCTTT 

GAGACGCTGGAAGGAAATACAATTGAGATAGGATGTGACGGTGACAGTATAACAGTAAAT 

GGAATCAAAATGGTGAACAAAAAGGATATTGTGACAAATAATGGTGTGATCCAT7TGATT 

GATCAGGTCCTAATTCCTGATTCTGCCAAACAAGTTATTGAGCTGGCTGGAAAACAGCAA 

ACCACCTTCACGGATCTTGTGGCCCMTTAGGCTTGGCATCTGCTCTGAGGCCAGATGGA 

GAATACACTTTGCTGGCACCTGTGMTMTGCATTTTCTGATGATACTCTCAGCATGGAT 

CAGCGCCTCCTTAMTTAATTCTGCAGAATCACATATTGAAAGTAAAAGTTGGCCTTAAT 

GAGCTTTACAACGGGCAAATACTGGAAACCATCGGAGGCAAACAGCTCAGAGTCTTCGTA 

TATCGNACCTCNGGCCGCTCTAGAACTAGTG 

Sequence 14 

CCGGGCAGGTACCTTTCCAATTTGGAGCTAGAAATGGACGAGGAGAGTCTTCAGACATTA 

TCTCTGCAGTGTGAGCCAGCAACCAACACATTGCCTAAGAATCCTGGTGACAAAAAGCCT 

GTCAAATCCGAGACCTCTCCAGTAGCTCCAAGGGCAGGGTCACAACAGAAAGCTCAGTCC 

CTGCCACAGCCCCAGCAGCAGCCACCACCAGCACATAAAATCAACCAGGGACTACAGGTT 

CCCGCCGTGTCCCTTTATCCTTCACGGAAGAAAGTGCCCGTAAAGGATCTCCCACCT!TT 

GGCATAAACTCTCCACAAGCTTTAAAAAAAATTCTTTCTTTGTCTGMGMGGAAGTTT^ 

GAACGTCACAAGAAACAGGCTGAAGATACAATATCAAATGCATCTTCGCAGCTTTCTTCT 

CCTCCTACTTCTCCACAGAGTTCTCCAAGGAAAGGCTATACTTTGGC 

Sequence 15 

CCGGGCAGGTACTGAGGMTGAATAGAAAGGCTTCCAGATGTCTAAAAGATTCTTTAAAC 
TA CTGMCTGTTACCTA GGTTMCMCCCTGTTGAGTAmGCTGTTTGTCCAGTTCAGG 
AATTTTTGTTTT GTTTT GTCTATATGTGCGG 
AGAAAAAAAATGTTTTATGGTAGCTTTTACTTTT^ 

AAATTCTTTTCCCCCATCCCCCTCCAMGTCTTGATAGCMGCGTTATTTTGGGGGTAGA 

AACGGTGAAATCTCTAGCCTCT 

Sequence 16 

CA CCGTG GTGGCGGCCGAGGTACATGGATGAAAACTAAAGGCTCGAGTTAATCACATTGT 

AGTTTTTAMTTTCTACAGCCTAGAGCTCACTAGTCACAGGTCTTTTAGGTCCTTCTGGA 

TGTCCCACAGGGTATCTGCACTTTTCTTGAGCTGAGCAACCTCATCATCCTTTAGCTTCT 

GGTTGATAACGCTGGTTAATCCCCGGGCATTGAGGATACATGGAAGGCTCAGGAAGACTT 

CATT CTCAATGCCATACATCCCCTTTACCATTGTTGACACGGGATGAATCCTGGATAGAT 

T T7TC AACATGGATTCAATAAGATCAGCCACACTTAATCCAATAGCCCAGTTGGTATATC 

^^^AGCTTGGTGACTTCATAGGCACTTTCAACCACCATCTTATGCACTTCCTTCCAAT 

TTTCACTATCATTGTCAGTTCCCATTTCTGGATTCAATTCCTGGAGAGAAACACCTGCCA 

CATTCACACCACTCCACACAGCCACACTTGAGCGCCATGTTCCCCCAAAATCCATCCATG 

GCAGCTGCTGGGATGAATGCCAAGTTTTTCAGCCATAAGGGNAGCGAAATCTAGCAGAAT 

CCAGATTACATCCACTTCCAATCACGCGGNGTTTGGGTAATNCACTTAGTTTCCAGGGTA 

GCATACCGTAAAGNAATGTCCACTGGGGTGG 

Sequence 17 

GGTGGCGGCCGAGGTACACAGCCAGTGTGGTTGCCTTGCACGATGATATGGAGAGCCAGC 

CCCTGATTGGAACCCAGTCCACAGCTATTCCTGCACCAACTGACCTGAAGTTCACTCAGG 

TCACACCCACAAGCCTGAGCGCCCAGTGGACACCACCCAATGTTCAGCTCACTGGATATC 

GAGTGCGGGTGACCCCCAAGGAGAAGACCGGACCAATGAAAGAAATCAACCTTGCTCCTG 

ACAGCTCATCCGTGGTTGTATCAGGACTTATGGTGGCCACCAAATATGAAGTGAGTGTCT 

ATGCTCTTAAGGACACTTTGACAAGCAGACCAGCTCAGGGAGTTGTCACCACTCTGGAGA 

ATGTCAGCCCACCAAGAAGGGCTCGTGTGACAGATGCTACTGAGACCACCATCACCATTA 

GCTGGAGAACCAAGACTGAGACGATCACTGGCTTTCA 

Sequence 18 
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AGGTACAGTCCTGATTGCATCATAATTGTGGTTTCCAACCCAGTGGACATTCTTACGTAT 
GTTACCTGGAAACTAAGTGGATTACCCAAACACCGCGTGATTGGAAGTGGATGTAATCTG 
GATTCTGCTA GATT TCGCTACCTTATGGCTGAAAAACTTGGCATTCATCCCAGCAGCTGC 
CATGGATGGATTTTGGGGGAACATGGCGACTCAAGTGTGGCTGTGTGGAGTGGTGTGAAT 
GTGGCAGGTGTTTCTCTCCAGGAATTGAATCCAGAAATGGGAACTGACAATGATAGTGAA 
AATTGGAAGGAAGTGCATAAGATGGTGGTTGAAAGTGCCTATGAAGTCATCAAGCTAAAA 
. GGATATACCAACTGGGCTATTGGATTAAGTGTGGCTGATCTTATTGAATCCATGTTGAAA 
AATCTATCCAGGATTCATCCCGTGTCAACAATGGTAAAGGGGATGTATGGCATTGAGAAT 

GAAGTCTTCCTGAGCCTrCCATGTATCCTCAATGCCCGGGGATTAACCAGCCGTTATNAA 
CCAGAAACTAAAGGAT * 
Sequence 19 

AGGTACTGAGGAATGAATAGAAAGGCTTCCAGATGTCTAAAAGATTCTTTAAACTACTGA 
AC TGTTACCTA GGTTAACAACCCT^ 

TGTT7TG TTTTG TCTATAT GTGC GGCTTTTCAGAAGAAATTTMTCAGTGTGACAGA 

A AAAA TGTTTTATGGTAG CTTTTACTTTTTATGAAAAAAAAATTATITGCCTTTTAMT^ 

CTTTTCCCCCATCCCCCTCCAMGTCTTGATAGCMGCGTTATTTTGGGGGTAGAAACGG 

TGAMTCTCTAGCCTCTTTGTGTTT7TGTTGTTGTTGTTGTTGCTGTTTTATATAATGCA 

TGTATTCACTAAAATAAAATTTAAAAAACTCCTGTCTTGCTAGACAAGGTTGCTGTTGTG 

CAGTGTGCCTGTCACTACTGGNCTGTACCT 

Sequence 20 

CCGGACAGGTACATGAAGCACAAAAAGACTGTCGCTCAGGCTGCTGACAGAGATTCCGGT 

CAGGTTGGCGTAATCAATCCTCTGCTTGACTTTGGCGTCCTCCACGATGACGACGGCGTT 

GGGCGTGAGCAGCAGCTGCCGGGAGCGAGGCTTGTAGCCCTTGCGGTCGTATTTCACAAC 

AGGCACCGCATACTGAATGGGCTCAGAGCCCAAGGCCTGCAGCACTCGGGGGCTGATCTC 
ATCTGTACCT 

Sequence 21 

AGGTACTTGTCCCGGCCTTGCCTCATGTTCTCGTTGTGGATGATGTCATTGTGGGTCCTC 

TTATTCTCATCTCGGGCCTGGGACAGCTCGCTGCTCAGCGTCAGCAGCTGCCGCTGCACA 

CGCTCGTTCTTCTCTGCCTCAGTGATGCGCTTCTCCTCATTGCGGTCATCCCGGATGCCC 

TCACTAGACAGCTCCGCGCTGTAGCCCGTGGGCTCTGCGCCCTCATCCTGCAAGCTCTCC 

TGGACATGGTAGCTCACCGGCTCGTACCTGCCCG 

Sequence 22 

AGGTACGATATACGAAGACTCTGAGCTGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGT 

TGTAMGCTCATTAAGGCCAACTTTTACTTTCAATATGTGATTCTGCAGAATTAATTTAA 

GGAGGCGCTGATCCATGCTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCA 

AAGTGTATTCTCCATCTGGCCTCAGAGCAGATGCCAAGCCTAATTGGGCCACAAGATCCG 

TGAAGGTGGTTTGCTGTTTTCCAGCCAGCTCAATAACTTGTTTGGCAGAATCAGGAATTA 

GGACCTGATCAATCAAATGGATCACACCATTATTTGTCACMTATCCTTTTTGTTCACCA 

TTTTGATTCCATTTACTGNTATACTGNCACCGTCACATCCTATCTCAATTGTATTTCCTT 

CCAGCGTCTCAAAGACTGCTCCTCCCATAATAGACTNAGGAACACTGGAGAGTATTTAAG 

ATGTGGTACCTCGGCCGCTCTAN 

Sequence 23 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGACTTTTTAAATCATGTT 

CCCCCTAAACATGGCTGTTAACCCACTGCATGCAGAAACTTGGATGTCACTGCCTGACAT 

TCACTTCCAGAGAGGACCTATCCCAAATGTGGAATTGACTGCCTATGCCAAGTCCCTGGA 

AAAGGAGCTTCAGTATTGTGGGGCTCATAAAACATGAATCAAGCAATCCAGCCTCATGGG 

MGTCCTGGCACAGTTTTTGTAAAGCCCTTGCACAGCTGGAGAAATGGCATCATTATAAG 

CTATGAGTTGAAATGTTCTGTCAAATGTGTCTCACATCTACACCGTGGCTTGGAGGCTTT 

TATGGGGCCCTGTCCAGGTAGAAAAGAAATGGTATGTAGAGCTTANATTTCCCTATTGTG 
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ACAGAGCCATGGTGTGTTTGTNATAATAAAACCCCGNGAAANAACNNATTTNAGGGNGCT 
TGTACCCTTGGCCG NTNTTAAG AAACCTAAG N GGGATNCCCCCGGGGCTGCC ANGG N NAT 
TCCGAATATCAAAGCCTTNATCTGNNAACCCGCCNNACCCCCTANGGGGGGGGGGCC 
Sequence 24 

CCGCGGTGGCGGCCGAGGTACCGGAGACAGGTGCAGTCCCTCACCTGTGAAGTGGATGCC 

CTTAAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCC 

GTTGAAGCTGCTAACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATG 

AAGGAGGAAATGGCTCGTCACCTTCGTGAATACCAAGACCTGCTCAATGTTAAGATGGCC 

CTTGACATTGAGATTGCCACCTACAGGAAGCTGCTGGAAGGCGAGGAGAGCAGGATTTCT 

CTGCCTCTTCCAMCTTTTCCTCCCTGAACCTGAGGGAAACTAATCTGGATTCACTCCCT 

CTGGTTGATACCCACTCAAAAAGGACA 

Sequence 25 

AGGTACTACTTCTTGGTTAAAAGGCCACTGGTAGAGTCATCTGAGTGTAGAGAATGTCCC 

TTCACTGCTGGAAAAATCCACTGGCTCCCAAGAAAAGAAAATGGTCTGAAGCCTCTGTTG 

TGGCTCTCACAACTCATCTTTCCCTAAGTCATCAAGCTCCACATCGCTGAGGTCAATGTC 

ATCCTCCACGGGAAGCTCGCCATCCCTGCCGTCCCAAGGCTCTCTCTCAACGATGGTAGG 

GAAAGCCCCGCCTCCTACAGGTGCCGTGGAGCCACGCCCAAAAGAGAGCTCCCTGAGAAA 

CTCGTTGATGCCTTGCTCACTGMGGAGCCTTTTAGCAGAGCAAATTTCATCTTGCGTGC 

ATTGATGGCGGCCATGGCGGGGTACCTCGGC 

Sequence 26 

CCTATTCACTATAGGGCGANNTGCANGCGTGGTGGCGGCCCGAGGTACTGTGGATATTTA 

AAATATCACAGTAACAAGATCATGCTTGTTCCTACAGTATTGCGGGCCAGACACTTAAGT 

GAMGCAGAAGTGTTTGGGTGACTTTCCTACTTAAAATTTTGGTCATATCATTTCAAAAC 

ATTTGCATCTTGGTTGGCTGCATATGCTTTCCTATNGATCCCAAACCAAATCTTAGAATC 

ACTTCATTTAAAATACTGAGCGGTATTGAATACTTCGAAGCAGAACAGGCAATGTGCAGC 

CCTCATTTATGAGAAAACCCTCAGGAAACTCCCAGGGTGATGCTTGGGAGGAAGCTGNGA 

GTTGAGCTGGAAGCTGGAGAACTTCCTNCAGGAGGCAGAAGGGCTTAAGAAAAGGAAAGG 

AAGGAACTCTAAGCTTGGGGTCTGCTAACATCACTCCAGTTTAGATGGGATCTTGGGCAG 

AGAGGACCATGGCCTTGTNTCCCTCTGGG 

Sequence 27 

CGAGGTACAGGAACTGTCACCGTGTCAACCTGATGGGGAGATATGGGGACAATAACCACA 

GTCAGGGCGTTAACTGGTTCCACTGGAAGGGCCACGAACACTGAATCCAGTTTGCTGAGA 

TGAAGCTGAGACCAAGCAACTTCAGAAATCTTGAAGGCAGGCGCAAACG GGCA TAAATTC 

CAGGGACCACTGGGTGAGAGAGGAATAAGGCCCAGAGCGAGGAAAGGATTTTACCAAAGC 

ATCAATACMCCAGCCCAACCATCGGTCCACACCTGGGCATTTGGTGAGAGTCAAAGCTG 

ACCATGGATCCCTGGGGCCAACGGCMCAGCATGGGCCTCACCTCCTCTGTGATTTCTTT 

CTTTGCACCAAAGACATCAGTCTCCAACATGTTTCTG 

ATCTCCCAGTGACAACATCGCAATAGTTTTTTACTTCTCTTAGGTGGCTCTGGGAATGGG 

AGAGGGGTAGGATGTACCT 

Sequence 28 

AGGTACTCAGGGGAGGCCAGGAAGGCCTTGAGCTTGGGCCGGGCACTGAGGCGCCCCACA 

TATGCTGAGAGCAGGGGGAACGCATCCAGGCAGCCAGGGGCTAGGACCTCATGGATCAGC 

AGCAAGTCCAGCAGGTTGTAGTCAGCGAAGGAGATCTGGTCTCCCACAATGAAGGTCTTG 

CCTCCCTGGTTCTGGGACAGCAGGGTCTCAAAAGGCTTCAGTTGCCCGGGCAGTGCCTTC 

ACATAGTCATCCTTGCCCGCCTCATAGTTGGTGTAGATGAGGGAGATGTATTTGCAGCGG 

AGGTCCTCCACGCCGTCATTCACCATGTCCACCAGGGCTGCCTCCTGCTGGTCCTTCCCA 

TAGAGCCCAAGGGTGCGGCCCAGGTGACGCAGGATGGTATTGGACTGGTACCT 

Sequence 29 

AGGTACTTGAGCTGTGAGGTCATCGGAATCCCGACACCTGTCCTCATCTGGAACAAGGTA 



Page 5 of 869 



WO 01/51628 



PCT/US01/00798 



Table 2 



AAAAGGGGTCACTATGGAG7TCAAAGGACAGAACTCCTGCCTGG TG ACCGGGACAACCTG 

GCCATTCAGACCCGGGGTGGCCCAGAAAAGCATGAAGTAACTGGCTGGGTGCTGGTATCT 

CCTCTAAGTAAGGAAGATGCTGGAGAATATGAGTGCCATGCATCCAATTCCCAAGGACAG 

GCTTCAGCATCAGCAAAAATTACAGTGG7TGATGCCTTACATGAAATACCAGTGAAAAAA 

GGTGAAGGTGCCGAGCTATAAACCTCCAGAATATTATTAGTCTGCATGGTTAAAAGTAGT 

CATGGATAACTACATTACC TGTTC TTGCCTAATAAGTTTCTTTTAATCCAATCCACTAAC 

ACTTTAGTTATATTCACTGGTTTTACACAGAGAAATACAAAATAAAGATCACACATCAAG 

ACTATCT ACAAAM TTTATTATATATTTACAGAAGAAAAGCATGCATATCATTAAACAAA 

TAAAATAC 1 i M I I ATCATAAAAAAAAAAAAAAAAAAAAACCTCGGCCGCT 

Sequence 30 

AGGTACATGGATGAAAACTAAAGGCTCGAGTTAATCACATTGTAGTTTTTAAATTTCTAC 

AGCCTAGAGCTCACTAGTCACAGGTCTTTTAGGTCCTTCTGGATGTCCCACAGGGTATCT 

GCACTTTTCTTGAGCTGAGCAACCTCATCATCCTTTAGCTTCTGGTTGATAACGCTGGTT 

AATCCCCGGGCATTGAGGATACATGGAAGGCTCAGGAAGACTTCATTCTCAATGCCATAC 

ATCCCCTTTACCATTGTTGACACGGGATGMTCCTGGATAGATTnTCAACATGGATTCA 

ATAAGATCAGCCACACTTAATCCMTAGCCCAGTTGGTATATCCTTTTAGCTTGATGACT 

TCATAGGCACTTTCAACCACCATCTTATGCACTTCCTTCCAATTTTCACTACCATTGTCA 

GTTCCCATTTCTGGATTCAATTCCTGGAGAGAAACACCTGCCACATTCACACCACTCCAC 

ACAGCCACCACTTGAGTCGCCATGTTCCCCCAAAATCCATCCATGGCAGCTGCTGGGATG 
AAT 

Sequence 31 

AGGTACTGCAGGGCCCCTCAAGGAGGATACTACAAAACAGCCCTGGGAACCAGGTGCGAC 

ATTCACTGCCAGAAGGGCTACGAGCTGCATGGCTCTTCCCTACTGATCTGCCAGTCAAAC 

AAACGATGGTCTGACAAGGTCATCTGCAAACAAAAGCGANGTCCTACCCTTGCCATGCCA 

GCAAATGGAGGGTTTAAGTGTGTAGATGGTGCCTACTTTAACTCCCGGTGTGAGTATTAT 

TGTTCACCAGGATACACCGTTGAAAGGGGAGCGGACCGTCACATGTATGGACAACAAGGC 
CTGGA 

Sequence 32 

GTGGTCATCGACGCCTCCACTGCCATTGATGCACCATCCAACCTGCGTTTNCTGGCCACC 

ACACCCAATTCCTTGCTGGTATCATGGCAGCCGCCACGTGCCAGGATTACCGGCTACATC 

ATCAAGTATGAGAAGCCTGGGTCTCCTCCCAGAGAAGTGGTCCCTCGGCCCCGCCCTGGT 

GTCACAGAGGCTACTATTACTGGCCTGGAACCGGGAACCGAATATACAATTTATGTCATT 

GCCCTGAAGAATAATCAGAAGAGCNAGCCCCTGATTGGAAGGAAAAAGACAGGACANGAA 

GCTCTNTNTCAGACAACCATCTCATGGGCCCCATTCCAGGACACTTNTGAGTACCT 
Sequence 33 

AGGTACAGGCTGACAGAGAAGATTCCCGAGAGTAAATCATCTTTCCAATCCAGAGGAACA 

AGCATGTCTCTCTGCCAAGATCCATCTAAACTGGAGTGATGTTAGCAGACCCAGCTTAGA 

GTTCTTCTTTCTTTCTTAAGCCCTTTGCTCTGGAGGAAGTTCTCCAGCTTCAGCTCAACT 

CACAGCTTCTCCAAGCATCACCCTGGGAGTTTCCTGAGGGTTTTCTCATAAATGAGGGCT 

GCACATTGCCTGTTCTGCTTCGAAGTATTCMTACCGCTCAGTATTTTAAATGAAGTGAT 

TCTMGATTTGGTTTGGGATCAATAGGAAAGCATATGCAGCCAACCAAGATGCAAATGTT 

TTGAAATGATATGACCAAAATTTTAAGTAGGGAMGTCACCCAAACACTTCTGCTTTTCA 
CTTAA 

Sequence 34 

CCGCGGTGGCGGCCGAGGTACACGTCCTGATTGCATCATAATTGTGGTTTCCAACCCAGT 

GGACATTCTTACGTATGTTACCTGGAAACTAAGTGGATTACCCAAACACCGCGTGATTGG 

AAGTGGATGTAATCTGGATTCTGCTAGATTTCGCTACCTTATGGCTGAAAAACTTGGCAT 

TCATCCCAGCAGCTGCCATGGATGGATTTTGGGGGAACATGGCGACTCAAGTGTGGCTGN 

GTGGAGTGGTGTGAATGTGGCAGGTGTTTCTCTCCAGGAATTGAATCCAGAAATGGGAAC 
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TGACAATGATAGTGAAAATTGGAAGGAAGTGCATAAGATGGTGGTTGAAAGT 
Sequence 35 

CCGGGCAGGTACATGACCCCTTCATTGGTTGTGCAGATTTCCTCGTGGGCAGCCATGGGG 

CAGAACCCAAACTTCTGGTCGGCATCATAGTTCTGTGTGGTCCCACACCACTTCATGTTG - 

TCTCTTCTGCCCTCAGAAGTGCAATCAGTGTAATTGTGGTTGTTGTATAGGAAGGGGAAG 

TGGCACAAGGCACCATTGGAATTTCCTNCTNGAGTCTGAACCAAAACAGTGTGGNCTGTG 

CANAAAGAGTATTTCTGGTCCTGCTCATAANTCGAAGTTGTGCTGCACCAAAAGATGTNC 

GTCCTGTCGCCCTTCTGTGGTGCAGGAGTAGAACGTCCTGCCATTGTAGGTTGAATGG 

Sequence 36 

AGGTCTAACATCCCTAGCAGCCTTCCTCAGGCACACTGTGACAGTGGAGTGGGCACACGT 

ATCGTTTCAGGCACCTTGGCTGACAGCCCAACGCGGCCACCTGGGTGCATGGACATTTGC 

AGCTGCCACCATGCCAGCACCCTGGCCACAGATTAGCTCAACTGGCCACAGTCTGTGATG 

ACMTCTTCTTGGATGTCCTCCCACTCTTAGAGCCGAAAGATTCTATTTTCTTCACGACG 

TCCATGCCCTCTTTGACGTGACCGAACACAACATGCTTGCCATCCAACCAGTCTGTCTTT 

ATGGTGCAGATGAAGAACTGGGAGCCGTTGGTGTTAGGACCAGCATTAGCCATGGACAGG 

ACACCTGGCCCCACGTGCTTCAGTGCAAAGTTCTCGTCAGGAAAAGCGGCTTCCGTAGAT 

GGACTTCCCGCCTGTGCCATTGTGGTTGGTGAAGTCGCCCGCCTGGCACATGAAGGAANG 

GGATCACCCT GTGGM GGGTGGAGCCTTTGTAGCCGAAGCCCTTTCTTACCAGTGCACAG 

GGCTCTGAAGTTTTTAGCTGTCTTTTGGGACGACATCTGCCTTCAGCTTCAGCACCACGC 

CGGGCCGTTNTAGAACTTAGNGGATCCCCCCGGGCTTGCANGGAATTCGATATCAAGCTT 

TATCGATACCGTCCNACCTCGAGGGGGG 

Sequence 37 

CCGCGGTGGCGGCCGAGGTACAAGC I I 1 i H I 1 H 11 I I 1 I I I I I 1 1 GCTGGTAATATAT 

TGCTGCACTGAGTGTGTGCMTTTTTATTCAAGGTCATCGTGATGCTGAGAAGTTTCGTT 

GATAACCTGTCCATCTCTAGTTTCAACCGTCTTMTCAGMGTGTCCTTTTTGAGTGGGT 

ATCAACCAGAGGGAGTGAATCCAGATTAGTTTCCCTCAGGTTCAGGGAGGAAAAGTTTGG 

AAGAGGCAGAGAAATCCTGCTCTCCTCGCCTTCCAGCAGCTTCCTGTAGGTGGCAATCTC 

AATGTCAAGGGCCATCTTAACATTGAGCAAGTCTTGGTATTCACGAAGGTGACGAGCCAT 

TTCCTCCTTCATATTCTGAATCTCATCCTGCAGGCGGCCAATAGTGTCTTGGTAGTTAGC 

AGCTTNAACGGCAAAGTTCTCTTCCATTTCACGCATNTGGCGTTCCAGGGACTCATTGGG 

TTCCTTTAAGGGCATCCACTTCCAGGTGAGGGACTGCACCTGTCTCCGGTACCTN 

Sequence 38 

AGGTACCTTACAAAAGTTAATGCATTAGACACTTCAGATGTTAACTGCTCTAAACAAAAC 

TCCTAAGTCTGTCCTATGCMTATATATTTTAT^TACATATATATTTTTACATAGAATAC 

CCACAAAGTGCAAGCCAATAATAACATTGCAG>\AAAGTAATACATATCTGCTAGGTGACA 

ATATCAMCMTTCAGGGAATAATTTTACTTTMTTAACATTAAC^ 

ACTTCAMTCAATCATATTTCTGTCATCTNCMCCTAAGATATTTTTTAGATTGTCTCCC 

TATTCTTTG ATTCAAAAGCCAATTACAGAAACTATGAACTTGAC CTAATTCTGG I I ) < IG 

ACM TTATGAGACAGAMTAAAGAAATGCAAGCAGTTCTTTTCTTTGCACACTGACCATT 

TTTTAATTACATCATCCTCTATGATGATGGNGCTTTCACAACTGCAGCTCTCCTGTATGT 

CAAMTCATTCTGGTTTCCAGGTNAATGGACAAAGGAGATTTGCCTTCAGTGTCTAGAAG 

GCAATTTACT7TTCAGGCTGCCTTAATTACC TATAGGTT 

Sequence 39 

CCGCGGTGGCGGCCGAGGTACTCCAGGCCGGGACTCAGGTTATCAAAAGTGCAGGAGCTC 

TGATCAGCATGGACCACTTCTTCCAMGMTTTCCCTGCTGGCCGTTTGTAGGGGTTGTG 

GTAATTCTATAACCAGTAATGTCTGGGGTGGTGCTCCTCTCCCAGGAGACTGTGAGCACT 

CCAGTGTCAGGGTTTGCCTCCAGATGCAAGTTTGTTGGTGGAGACAATGGTGTCACCACT 

TTGTTTACAATTGGCGCATCTCTTTCCTGTCCATCTCTCAGGACTTGGATGGTGTAGACC 

GTATTCTACTCCTGGAGTCAAGCCGGACACAACGATGCTTCCTGAGTCTGAAGTCACTTN 
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TCGTGGTGCCTCTCCTCCCTGGCTTGGTCGTACCTCGGCCGCTCTAGAACTAGTGGGATC 
CCCCGGGCTGCA 

Sequence 40 

CCGCGGTGGCGGCCGAGGTACCGGAGACAGGTGCAGTCCCTCACCTGTGAAGTGGATGCC 

CTTAAAGGAACCAATGAGTCCCTGGAACGCCAGATGCGTGAAATGGAAGAGAACTTTGCC 

GTTGAAGCTGCTAACTACCAAGACACTATTGGCCGCCTGCAGGATGAGATTCAGAATATG 

AAGGAGGAAATGGCTCGTCACCTTCGTGAATACCAAGACCTGCTCMTGTTAAGATGGCC 

CTTGACATTGAGATTGCCACCTACAGGAAGCTGCTGGAAGGCGAGGAGAGCAGGATTTCT 

CTGCCTCTTCCAAACTTTTCCTCCCTGAACCTGAGGGAAACTAATCTGGATTCACTCCCT 

CTGGTTGATACCCACTCAAAAAGGACACTTCTGATTAAGACGGTTGAAACTAGAGATGGA 

CAGGTTATCAACCGAAACTTCTCAGCATCACGATGACCTTGAATAAAAATTGCACACACT 

CAGTGCAGCAATATATTACCAGCCAGAATAAAAAAGAAATCCATATCTTAAAGAAACAGC 

TTTCAAAGTGCCCTTTCTTGCAGTTTTTCMGGAGCGCAAGGAATAGATTTTGGAATAGG 

GAATAAAGCTCTAGTTTCTTAACAAACCCGACCACT7TC7TACAAGGATTTAGAAAAAAA 

AGTTTACCMCNA TAATCT AAGTATNNCCGGAAAAAAANCTTTGNGGCTTAGMTACTTT 

TTTTAAAAAAGGNA I I ! I I GGAATACTTTTTAAAAAAACTGGNNTTTTTT 

Sequence 41 

CCGCGGTGGCGGCCGAGGTACGATATACGAAGACTCTGAGGTGTTTGCCTCCGATGGTTT 

CCAGTATTTGCCCGTTGTAAAGCTCATTMGGCCAACTTTTACTTTCMTATGTGATTCT 

GCAGAATTMTTTTAGGAGGCGCTGATCCATACTGAGAGTATCATCAGAAAATGCATTAT 

TCACAGGTGCCAGCAAAGTGTATTCTCCATCTGGCCTCAGAGCAGATGCCAAGCCTAATT 

GGGCCACMGATCCGTGAAGGTGGTTTGCTGTTTTCCAGCCAGCTCMTMCTTGTTTGG 

CAGA^AGGAATTAGGACCTGATCMTCAAATGGATCACACCAmTTTGTCACMTAT 

CCTTTTTGTTCACCATTTTGATTCCATTTACTGTTATACTGTCACCGTCACATCCTATCT 

CAATTGTATTTCCTTCCAGCCGTCTCAAAGACTGCTCCTCCCATAATAGACTCAGAACAC 
TGGAGAGTATTTAAG ATGTG GTACCT 
Sequence 42 

AGGTACTTCTGTTTAAGCCGGTCCATGAGCAGGCCATTCCAGGGGGCACACAGCACTCCG 

AACTGAGTGAAGGCAAAGGCATTTGTGTAGGTGCTGACTCGTGCCATGTCCCCACCGGCC 

ATGTTGGTCAGCAAGGAGTTGAGAGTGCCAATGAAGAGGTAGTGCCACAACTGTATCACA 

GACAGCCACACCAGGTGCCAGGCAAAGCGCCGAGAGAAAGCGTAGCTCCAGAAGGAGCGG 

AGTTCCTGCTTCTGCCCTGCCCCTGGGGTCTCTTCCTTCGCTGAAAGGAACTCCTTTGAC 

TGTAGCTCCCTGTTTTCATGCTCAGCTGCTTCCTTCTCTTCCTTTGTGGNGCCATTCCCA 

GGGCACAGGCCATAGCTGTAGTTGGGGGGCAGTGGGTATGGGATGTGCCCCCGGGGGCAT 

N.MGAGGAAAAGTGCCNTGCTTCATGCCAGGTTACCTTNGCCGCTCTAGAAACTAGTTGG 

AfCCCCCGGGCTGCAGGGAATTCNATATCAAAGCTTNATCGATACCCGNCCGAACCTTCG 
NAGGGGGG 

Ssquence 43 

^iGGTACAGCTGTCTGCATTGAAAATTCATGCATGGAGAAAGGGAGTAAGCAAGGGAGAAA 

CGGTGCGATTCACATATTCCGCGAGATCATCAAGCCAGCAGAGAAATCCCTCCATGAAAA 

pTTAAAACMGATAAGCGCTTTAGCACCTTCCTCAGCCTACTTGAAGCTGCAGACTTGAA 

AGAGCTCCTGACACAACCTGGAGACTGGACATTATTTGTGCCAACCAATGATGCTTTTAA 

GGGMTGACTAGTGAAGAAAMGAMTTCTGATACGGGACAAAAATGCTCTTCAAAACAT 

CATTCT TTATC ACCTGACACCAGGAGTTTTCATTGGAAAAGGATTTGMCCTTGGTGTTA 

CTAACATTTTAAAGACCACACAAGGAAGCAAAATCTTTCTGAAAGGAAGTAAATGATACA 

CTTCTGGTGAATGAATTGAAATCAAAAGAATCTGACATCATTGACAACAAATGGTGTAAT 

TCATGTTGTAAGATAAACTCCTCTATCCAGCAGACACACCTGTTGGAAAATGATCAACTG 

CTGGNAATNCTTAATAAATTAATCAMTCCATCCAMTTTAAGTTTGTTCGGGGTAGCAC 

CTTCAAANGAAATCCCCGNGACTGTNTNTAAGCCAATTNTTAAAAAATCN 
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Sequence 44 

AGGTACTTGAGCTGTGAGGTCATCGGAATCCCGACACCTGTCCTCATCTGGAACAAGGTA 

AAAAGGGGTCACTATGGAGTTCAAAGGACAGAACTCCTGCCTGGTGACCGGGACAACCTG 

GCCATTCAGACCCGGGGTGGCCCAGAAAAGCATGAAGTAACTGGCTGGGTGCTGGTATCT 

CCTCTAAGTAAGGAAGATGCTGGAGAATATGAGTGCCATGCATCCAATTCCCAAGGACAG 

GCTTCAGCATCAGCAAAAATTACAGTGGTTGATGCCTTACATGAAATACCAGTGAAAAAA 

GGTGAAGGTGCCGAGCTATAAACCTCCAGAATATTATTAGTCTGCATGGTTAAAAGTAGT 

CATGGATMCTACATTACCTGTTCTTGCCTAATMGTTTCTTTTAATCCAATCCACTAAC 

ACTTTAGTNATATTCACTGG NTTTACACCN GAG AAATACAAAATAAAGG ATCACACATC A 

AGACTATCTNCNNGAAAAAAAATNNCNNTAAATNAAAANGGCTTGTCCCTNGGCCGTTCT 

TANAACTAGNG 

Sequence 45 

AGGTACGATATACGAAGACTCTGAGCTGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGT 

TGTAAAGCTCATTAAGGCCAACTTrTACTTTCGATATGTGATTCTGCAGMTTAATTTAA 

GGAGGCGCTGATCCATGCTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCA 

AAGTGTATTCTCCATCTGGCCTCAGAGCAGATGCCAAGCCTAATTGGGCCACAAGATCCG 

TGAAGGTGGTTTGCTGTTTTCCAGCCAGCTCMTAACTTGTTTGGCAGAATCAGGAATTA 

GGAC CTGATCAATCAAATGGATCACACCATTATTTGTCACAATATCCTTTTTGTTCACCA 

TTTTGATTCCATTTACTGTTATACTGTCACCGTCACATCCTATCTCMTTGTATTTCCTT 

CCAGCGTCTCAAAGACTGCTCCTCCCATAATAGACTCAGAACACTGGAGAGTATTTAAGA 

TGTGGTACCTT 

Sequence 46 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACTCAGAAGTGTCCTGGAATGGGGCCCATGAGA 

TGGTTGTCTGAGAGAGAGCTTCTTGTCCTACATTCGGCGGGTATGGTCTTGGCCTATGCC 

TTATGGGGGTGGCCGTTGTGGGCGGTGTGGTCCGCCTAAAACCATGTTCCTCAAAGATCA 

TTTGTTGCCCAACACTGGGTTGCTGACCAGAAGTGCCAGGAAGCTGAATACCATTTCCAG 

TGTCATACCCAGGGTGGGTGACNAAAGGGGTCTTTTGAACTGTGGAAGGAACATCCAAGA 

TCTC TGGTCCATGAAGATTGGGGTGTGGAAGGGTTACCAGTTGGGGAAGCTCGACTGTCT 

TTTTCCTTCCAATCANGGGCTCGCTCTTCTGATTATNCTTNAGGGCCAATGACATNAATT 

GCAATATTCGGTTCCCCGGTTTCCAGGGCCAACCAATANTGGGCCCTCTTGGGACCGCCC 

AAAGGANGGNGTTCCTTGNGGNNCAAATTTrTTNTTNGGAGNGAGCATNCMCCCTTTTG 

GNNCGTTTCTTATGAAACTAAGTGGGATCCCCCCCNGGGCTTGGCAGNGNAATTCCGGAA 

TTTTCCAAGNCTTATTCCATACCCCGGNCNACCCTTCGAGGG 

Sequence 47 

AGGTACGATATACGAAGACTCTGAGCTGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGT 

TGTAAAGCTCATTMGGCCAACTTTTACTTTCMTATGTGATTCTGCAGAATTAATTrAA 

GGAGGCGCTGATCCATGCTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCA 

AAGTGTATTCTCCATCTGGCCTCAGAGCAGATGCCAAGCCTAATTGGGCCACAAGATCCG 

TGAAGGTGGTTTGCTGTTTTCCAGCCAGCTCAATAACTTGTTTGGCAGAATCAGGMTTA 

GGAC CTGATCAATCAAATGGATCACACCATTATTTGTCACAATATCCTTTTTGTTCACCA 

TTTTGATTCCATTTACTGTTATACTGTCACCGTCACATCCTATCTCMTTGTATT;"CCTT 

CCAGCGTCTCAAAGACTGCTCCTCCCATAATAGACTCAGAACACTGGAGAGTATTTAAGA 

TGTGGTACCT 

Sequence 48 

CCGGGCAGGTACAAAGCAGGAGAATGATGCTCCCAGCTGAGCTGCAGGATGGGCGCTGGG 

CTGACTGGAGGGGTAGACGGGGTGGGGTCTGACCCCATTAGCCTTTCCCCATCCAACCTG 

GGCCCCCATAAGCCATTCTCTGGCCCTCTGCACAAGACAGACTCAGCAAATCTGCGAGGT 

ATGGGGATTCTGCCAACTCCCCACCTCGCCTCACCTTCCCTAGGTCTGCCGGGTGACCCA 

ATCCACTGGGTGCAGCCCCACCCCCGCTCAACCCCACATNTGGACAGACACATGGCAAAT 
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ATGGAACTGAAGCCCGGCTGGGCTGGGAGCACATCTTGGTTGTTGTTGGGTTGGAGTCCG 
TC^GCANGTGTCCCAAGGTGGTCAGGCCTNACCCACAGTGGCCAGGTTTGAGACCGTGT 

Sequence 49 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGGAATGAATAG 

AAAGGCTTCCAGATGTCTAAAAGATTCTTTAAACTACTGAACTGTTACCTAGGTTAACAA 

CCCTGTTGAGTATTTGCTGTTTGTCCAGTTCAGGMTTTTTGTTTTGTT^ 

TG CGGCTTirTC AG AAG AAATTTAATC A G TGTG AC AG AAAAAAAAATG TmTATG G TAG C T 

TTTACTTTTTATGAAAAAAAMTTATTTGCCTTTTAAATTCTTTTCCC 

AAAGTCTTGATAGCMGCGTTATTTTGGGGGTAGAMCGGTGAMTCTCTA 

TGu n rGTTGCTGTTGTTGTTGTTGCTTTATATAATGCATGTNTTCCTAAAATAAAATT 
TAAAAAA 

Sequence 50 

CACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGCCCTCGCGCTTCTCCTCGGCCAATT 

CGCGGAAGAAGTGGCTCACGCCTTCCAGAGCCACATCATCGCGGTCGAAATAGAAGCCCA 

GAGAGAGGTAGGTGTAGGAGGCCGNAAAAACGGTGCTGGCAGGTCCCAGAAGCAGGAGAT 

GGCCGAGAAGATGGTCCCGGAGGTTGCAAGCGGAGAGGAAATCGGAGGGCGGTCGGAGGC 

TGGAAGAGAGTCCCCGGATCTGTTCCGTCCAAACACTGTTGAAGCAAGAGACAGACCCGC 
GGACCTGCCCG 

Sequence 51 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACTTCATTGAACTGCCCTA 

CCACGGGGAAAGCATCAGCATGCTGATTGCACTGCCGACTGAGAGCTCCACTCCGCTGTC 

TGCCATCATCCCACACATCAGCACCAAGACCATAGACAGCTGGATGAGCATCATGGTGCC 

CAAGAGGGTGCAGGTGATCCTGCCCAAGTTCACAGCTGTAGCACAAACAGATTTGAAGGA 

GCCGCTGAAAGTTCTTGGCATTACTGACATGTTTGATTCATCAAAGGCAMTTTTGCAAA 

AATAACAAGGTCAGAAAACCTCCATGTTTCTCATATCTTGCAAAAAGCAAAAATTGAAGT 

CAGTGAAGATGGAACCAAAGCTTCAGCAGCAACAACTGCAATTCTCATTGCAAGATCATC 

GCCTCCCTGGTTTATAGTAGACAGACCTTTTCTGTTTTTCATCCGACATAATCCTACAGG 

GTGCTGTGTTATTCATGGGGCAGATAAAACAAACCCTGAAGAGTATACAAAAGAAACCAT 

GCAAAGCAACC GACTA CTTTTGCTTACGMGAAAGACTCCTTTCCTGCATCTTTTCATAG 

GTCTGGTAAATATTTTTGTACCTCGGCCGGCTTCTAGAAACTAGTGGATCCCCCCGGGCT 

TGCANGGAAATTCGANATCAAGGCTTATCGATACCCGGCCGACCTCGANGGGGGGGGGCC 
CGGGTCCCCAAGCTTTTGGTT 

Sequence 52 

AGCTCCCCGCGGTGGCGGCCGAGGTACATCATTTCATGTCATCCTGTTGGCACTGATGAA 

GAACCCTTACAGTTCAGGGTTCCTGGAACTTCTACCAGTGCCACTCTGACAGGCCTCACC 

AGAGGTGCCACCTACAACATCATAGTGGAGGCACTGAAAGACCAGCANAGGCATAAGGTT 

CGGGAAGAGGTTGTTACCGTGGGCAACTCTGTCAACGAAGGCTTGAACCAACCTACGGAT 

GACTCGTGCTTTGACCCCTACACAGTTTCCCATTATGCCGTTGGAGATGAGTGGGAACGA 

ATGTCTGAATCAGGCTTTAAACTGTTGTGCCAGTGCTTAGGCTTTGGAAGTGGTCATTTC 

AGATGTGATTCATCTAGATGGTGCCATGACAATGGTGTGAACTACAAGATTGGAGAGAAG 

TGGGACCGTCAGGGAGAAAATGGCCAGATGATGAGCTGCACATGTCTTGGGAACNGAAAA 

AGGGAGAATTCAAGTGTGACCCTTATGAGGCAACCGTGTTACCGAT 

Sequence 53 

CCGCGGTGGCGGCCGAGGTACAGGCTGACAGAGAAGATTCCCGAGAGTAAATCATCTTTC 

CAATCCAGAGGAACAAGCATGTCTCTCTGCCAAGATCCATCTAAACTGGAGTGATGTTAG 

CAGACCCAGCTTAGAGTTCTTCTTTCTTTCTTAAGCCCTTTGCTCTGGAGGAAGTTCTCC 

AGCTTCAGCTCMCTCACAGCTTCTCCAAGCATCACCCTGGGAGTTTCCTGAGGGTTTTC 

TCATAAATGAGGGCTGCACATTGCCTGTTCTGCTTCGAAGTATTCAATACCGCTCAGTAT 
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TTTAAATGAAGTGATTCTMGATTTGGTTTGGGATCAATAGGAAAGCATATGCAGCCAAC 

CAAGATGCAAATGTTTTGAAATGATATGACCAAAATTTTAAGTAGGAAAGTCACCCAAAC 

ACTTCTGCTTTCACTTAAGTGTCTGGCCCGCAATACTGTAGGAACAAGCATGATCTTGTT 

ACTGTGATATTTTAAATATCCACAGTCCT 

Sequence 54 

AGGTACGATATACGMGACTCTGAGCTGTTTGCCTCCGATGGTTTCCAGTATTTGCCCGT 

TGTAAAGCTCATTAAGGCCAACTTrTACTTTCAATATGTGATTCTGCAGMTTAATTrM 

GGAGGCGCTGATCCATGCTGAGAGTATCATCAGAAAATGCATTATTCACAGGTGCCAGCA 

AAGTGTATTCTCCATCTGGCCTCAGAGCAGATGCCAAGCCTAATTGGGCCACAAGATCCG 

TGAAGGTGGTTTGCTGTTTTCCAGCCAGCTCMTAACTTGTTTGGCAGAATCAGGAATTA 

GGACCTGATCMTCAMTGGATCACACCATTATTTGTCACMTATCCTTTTTGTTCACCA 

TTTTGATTCCATTTACTGTTATACTGTCACCGTCACATCCTATCTCAATTGTATTTCCTT 

CCAGCGTCTCAAAGACTGCTCCTCCCATAATAGACTCAGAACACTGGAGAGTATTTAAGA 
TGTGGTACCT 

Sequence 55 

CCGGGCAGGTACGAGTGGCAGTTGAGGGGATCCCCGTTGGTCAGCGCGAACTGCCCGTGG 

GTCGGCTGGGTCGGCGGCATCCTGCCCCTCTGCACCGCAAAACCATATTTGCCTTGTTCA 

AGGTCCCAAACCGCTTGCATCTTCCTCCTAAACTGGAATAACTCTCCTTTAGTTTGGCGG 

GTGAAATAAATGCTTTGTTGGCCCCCTGAAMGATGTGTCTTTGATAATTAAAGAGAATC 
TCCTACCT 

Sequence 56 

CCGCGGTGGCGGCCGAGGTACAGGCTGACAGAGAAGATTCCCGAGAGTAAATCATCTTTC 

CAATCCAGAGGAACAAGCATGTCTCTCTGCCAAGATCCATCTAAACTGGAGTGATGTTAG 

CAGACCCAGCTTAGAGTTCTTCTTTCTTTCTTAAGCCCTTTGCTCTGGAGGAAGTTCTCC 

ANCTTCAGCTCAACTCACAGCTTCTCCMGCATCACCCTGGGAGTTTCCTGAGGGTTTTC 

TCATAAATGAGGGCTGCACATTGCCTGTTCTGCTTCGAAGTATTCAATACCGCTCAGTAT 

TTTAAATGAAGTG ATTCT MGATTTGGTTTGGGATCAATAGGAAAGCATATGCAGCCAAC 

CAAGATGCAAATGTTTTGAAATGATATGACCAAAATTTTAAGTAGGAAAGTCACCCAAAC 

ACTTCTGCTT TCAC TTAAGTGTCTGGCCCGCAATACTGTAGGAACANGCATTGATCTTGT 

TACTGGGATATTTTAAATATCCACAGTACCT 

Sequence 57 

AGGTACACCTTGAATGACAATGCTCGGAGCTCCCCTGTGGTCATCGACGCCTCCACTGCC 

ATTGATGCACCATCCAACCTGCGTTTCCTGGCCACCACACCCAATTCCTTGCTGGTATCA 

TGGCAGCCGCCACGTGCCAGGATTACCGGCTACATCATCAAGTATGAGAAGCCTGGGTCT 

CCTCCCAGAGAAGTGGTCCCTCGGCCCCGCCCTGGTGTCACAGAGGCTACTATTACTGGC 

CTGGAACCGGGAACCGAATATACAATTTATGTCATTGCCCTGAAGAATAATCAGAAGAGC 

GAGCCCCTGATTGGAAGGAAAAAGACAGTTCAAAAGACCCCTTTCGTCACCCACCCTGGG 

TATGACACTGGAAATGGTATTCAGCTTCCTGGCACTTCTGGTCAGCAACCCAGTGTTGGG 

CAACAAATGAJCTTTGAGGAACATGGTTTTAGGCGGACCACACCGCCCACAACGGGCACC 

CCCATAAGGCATAGGCCAAGACCATACCCGCCGAATGTAGGTGAGGAAATCCAAATT 
Sequence 58 

CCGCGGTGGCGGCCGAGGTACTCAGAAGTGTCCTGGAATGGGGCCCATGAGATGGTTGTC 

TGAGAGAGAGCTTCTTGTCCTACATTCGGCGGGTATGGTGTTGGCCTATGCCTTATGGGG 

GTGGCCGTTGTGGGCGGTGTGGTCCGCCTAAAACCATGTTCCTCAAAGATCATTTGTTGC 

CCAACACTGGGTTGCTGACCAGAANTGCCAGGAAGCTGAATACCATTTCCAGTGTCATAC 

CCAGGGTGGGTGACGAAAGGGGTCTTTTGAACTGTGGAAGGAACATCCAAGATCTCTGGT 

CCATGAAGATTGGGGTGTGGAAGGGTTACCMGTTGGGGMGCTCCGCTGTNTTTTTCCT 

TCCMTCANGGGCTTTGTTCTTCCTGATTATTCTTCAGGGCAATGACATAAAATTTGTNT 

ATTCCNGTTCCCCCGGGTT 
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Sequence 59 

AGGTACACAGTCAGTGTGGTTGCCTTGCACGATGATATGGAGAGCCAGCCCCTGATTGGA 

ACCCAGTCCACAGCTATTCCTGCACCAACTGACCTGAAGTTCACTCAGGTCACACCCACA 

AGCCTGAGCGCCCAGTGGACACCACCCAATGTTCAGCTCACTGGATATCGAGTGCGGGTG 

ACCCCCAAGGAGAAGACCGGACCAATGAAAGAAATCAACCTTGCTCCTGACAGCTCATCC 

GTGGTTGTATCAGGACTTATGGTGGCCACCAAATATGAAGTGAGTGTCTATGCTCTTAAG 

GACACTTTGACAAGCAGACCAGCTCAGGGTGTTGTCACCACTCTGGAGAATGTCAGCCCA 

CCAAGAAGGGCTCGTGTGACAGACGCTACTGAGACCACCATCACCATTAGCTGGAGAACC 

AAGACTGAGACGATCACTGGCTTCCAAGTTGA 

Sequence 60 

AGGT CAAA GATTTATTGAAGCAGMCCMGTTGGTTGGATACTTGCTGGAAAAAAAAAAA 

GCAGTTTTAATAGTATTCAAAATACCTTTTAAAMGTATTCTAGC^ 

AAACTAGATTATGTTGTAAACTTTTTTCTAMTCTTGTAGGAGTGTCGGTTGTTAAGAAC 

TAGAGCTTATTCCTATTCCAAATCTATCTTGCGCTCCTGAAAAACTGCAGAAAGGCACTT 

GAMGCTGTTTCTTTAAG GATATG GGATTCTCTTnTTATTCTTGCTGGTAATATATTGC 

TGCACTGAGTGTGTGCAATTTTTATTCMGGTCATCCGTTGATGCTGAGCMGTTTTCCG 

TTTGATAACCTGTCCCATCCTCTAAGT7TCAACCGGGCTTAMTCAAGMGTGTCCTTTT 

TGAGTGGGGTATCAACCCNGAGGGGAGGTGAMTCCAGATTAGTTTTCCCTCAGGGTTCA 

GGGGGAGGGAAAAAGGTTTT 

Sequence 61 

CCGCGGTGGCGGCCGAGGTACAAGAGTGTTTCATGAAATCCGTTTTTAAAATGAACATCT 

CTGTGTGCCACAGTTCCTAGGACTGGGGCAAGGACACAGTGTCAAGTCTTGTTTTGAGGA 

TGAGTCTCTGAAGAGACAGAATTCCTGCCAGAATGCGCACAGAACATAAGTCAGCCAAGT 

GTGTCGTGCCAGGGATACTTTGACTTTGGTTTGCTGCTGCTGCTAGGGATATTGGGAGGG 

TTATCCTTTCCAGGTTGTAGGAGAGGGTTGTGGGTAAAGGTCTGTCGTAAAGGACCCCTG 

GCTGC TAGCTC CAACT GATTCC GCATGCGTTGTTCACGCTCTCGCAGCTGACGCGTCATT 

TCAGCATTTTTCCAGCCTTTTTTGAAGCTCTCTAGGAAGCCTTCCCGTGGAGGTAAT7TG 

TCCAGGTCATGTACCTGCCCGGGCGGCCGCTCGAGTAATACGGTTATCCACA 

Sequence 62 

AGGTACTCAGAAGTGTCCTGGAATGGGGCCCATGAGATGGTTGTCTGAGAGAGAGCTTCT 

TGTCCTACATTCGGCGGGTATGGTCTTGGCCTATGCCTTATGGGGGTGGCCGTTGTGGGC 

GGTGTGGTCCGCCTAAAACCATGTTCCTCAMGATCATTTGTTGCCCAACACTGGGTTGC 

TGACCAGAAGTGCCAGGAAGCTGAATACCATTTCCAGTGTCATACCCAGGGTGGGTGACG 

AAAGGGGTCTTTTGAACTGTGGAAGGAACATCCAAGATCTCTGGTC3ATGAAGATTGGGG 

TGTGGAAGGGTTACCAGTTGGGGAAGCTCCGTCTGTCTTTTTCCTTCCAATCAGGGGCTC 

GCTCTTCTGATTATTCTTCAGGGCAATGACATAAATTGTNTNTTCGGTTCCCGG 
Sequence 63 

AGGTACTGAGGAATGAATAGAAAGGCTTCCAGATGTCTAAMGArrCTTTAAACTACTGA 
AC TGTTACCTA GGTTAACAACCCTGTTGAGTATTTGCTGTTTGTCCAGTTCAGGAATTTT 
TGTTTTGTTTTGTCTATATGTGCGGCTTTTCAGMGAAATTTAATCANTGTGACAGAAAA 
A AAM TGTTTTATGGTAGCTTTTACTTTrTATG 

CTmCCCCCATCCCCCTCC/WVGTCTTGATAGCAAGCGTTATTiTGGGGGTAGAAACGG 

TGAAATCTCTAGCCTCTTTGTGTTTTTGTTGTTGTTGNTGTTGTTGTTTTATAT 

TGTATTCACTAAAATAAAATTTAAAAAACTCCTGTCTTGCTANACAAGGTTGCTGTTGTG 

CAGTGTGCCTGTCACTACTGGTCTGTACCTCGGC 

Sequence 64 

AGGTACATTTGCCATCTCTGGTTAGTTGATTTCTGTCATGTTCCCTTCACTGATAATCTT 

GGCATTTCATTCTTGAAGAACAAGTGGGGCTCTTTCTCTGCCATGGCCATGCTGCCTTGG 

GCTGTGGTGACTCATGTATGTGTCTGTCCTCTGCACTTGAATGTGAGCTCTGTGTGGGTT 
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TTTCTGTTCCTCTCATTAGCAGGTTCTGCTCCCTGGGCACAGAGGAGGTCATGAGTTGAG 

AGTGGGCTGTGCGTGCATCCTAGAATCTCAGCCTTCTTGGAGCAGCGACTATTATATGCT 

CTGGTGGGCTCTCATCTCCATCCGATTAAGTGTTTCTCTTTGGCTGCCACGCCAGCTGGA 

TGACTGGGGGTTCTCTTCTTACCATCATGAAGCTGTCAACACCCACGGTGGGCCCTCTCA 

GACGGGGGCGGC 

Sequence 65 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGATATCCACATATTTTTG 

AGAAAAATTCCCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCA 

CCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAA 

CAGCACAGCAGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACAC 

GCCTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGG 

TCTGGTTCTTCAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCCTCTCT 

GGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTANGTGAGCGCCG 

AGGCTTTGGCTCCTCCCCAGGTGACCT 

Sequence 66 : 

C ATAAAGTTG TAAAAGCCCTGGGGTGCCTAATG AGTG AAGCTAACTCAC N ATATAATTTG 
CGTTGCGCTCACATGCCNCGCTTTTCCAAGTTCGGGTAAACNCTGTCNNTGCANAGCCTT 
GC AATTAATGG AAATCG N GTTCAAAC AN N CG NCG AGTTG AAG AAGGCC N N ATTATGTTTT 
TAATTTGGGAGTCGC I I I I I I I I CNGCCTTTTNCNTNNGGNTTTNATNATGAAACTTNAG 
NTTNGNGANTTCTCCCCCCTNGGGGNCCTTTANNTANTGAAANATTTCCTNAAATTNTTT 
NAAMGNCTTTTATTCTNGAATTACCCCCNTTTCGGACCCCTTTCGGNAGNGGGGGGGGG 
GGGCCCCCCGGGGGTTACCCNCCANNCC 1 H I M I GGGTTTTCC 
Sequence 67 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAATCATATTCTAAGA 

TTATGTGCAAATATATGTGTAGAAAGAGGCATTTAAAAAAAGAGGCAGGAGATTGAGAAG 

AGACATATTGAATGAGGCACTMTCTTTAGAAAAATGATCTTTTACTGAATCAAGCCATG 

CTCATTCCTTGCTGAATCCMCAGCCCCAAATCTCTTTTCTTATATTTGGCTGTTAATTT 

TTrrCCTTGCACCATCCTAGATTGTTAACMCCATTGCACAGTTGGCTCATGTATCCTTA 

CACTCTGAGTTATAATMGTTGTATTTTTATAAACAAATACCTAAATATTCMGTTTTAA 

GGCGCATGGAAACTGAATCTAGGCAACGGG 

Sequence 68 

CTNCTATAGGGCGAAJTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAAAGAAAACCAT 

7TCTTTTCTCTTTTTCCCCCAGCATCATGCAAGGCAAGGCAACACCACAAATACAATAAC 

TGAATTTACTTTGATTATACTTTAGTTTCTTACACTTAAAATTATm 

TTACTTGGTTATCTAAMTCMCATTGTCGATTTTAAACGGTTATAMGCCAGACTTTAA 

GAAGCATTTAAAACAATTCATGAACTCTGTATGAAAGGAAATAATTGCCTCCAGCTTATT 

CTTTTGCATTTGAAGTAAMGMTTTCCAGTATCATCAGTTTACTAGAAAATAAGACTTA 

GGGGGTCCATATAATTTATTCACATC I I I I I ATACATCGTAAACAAGGATAGGATACTTC 

TGACAGTANCCTATAAAAGA I Mil I AAAAATTTATCC 

Sequence 69 

AGGTACAAACTGTTTTGAAGCCAGTTCATCATAAGGAAGGACAAGAACTAACTGCTTTGC 

TGAATACTCCACATATTCAGGCACTTTTACTGGCCCACGATAAGGTTGCTGAGCAGGAAA 

TGCAGCTAGAGCCCATTACAGATGAGAGAGTTTATGAAAGTATTGGCCAGTATGGAGGAG 

AAACTGTAAAAATAGTTCGTATAGAAAAGGCTCGTGATATTCCGTTGGGTGCTACAGTTC 

GTAATGAAATGGACTCTGTCATCATTAGCCGGATAGTAAAAGGGGGTGCTGCATAGAAAA 

GTGGTCTGTTGCATGAAGGATATGAAGTTTCTAGNAGTATTAATGGCATTTGAAAATT 

Sequence 70 

GTCCTCTG I I I I I AAACTGTAAAGGACTCCAGCGGATTGTTCGAAGTCTCCGATATCCCG 
CGATCTGGTGGCCGCAGAAGAAAGAGGCCGAGTCGGCGGGGACTGGGAACCACCAGTTAG 
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CGGCCTACGGGCAACCTCCCGGACGCGCGCGCCGGGCCGACGTTAAACGTACCT 
Sequence 71 

AGG7ACCCGGCAACAGACCCTTTTTGCTGATATTAGCACCAGAGAAAGCTTACAGCAATC 

TAAGGTCCCAGGCTCTGGGTTCTCAGATGCCAGTCATTAGATTCTCGTGGGTTAACAGAG 

ATGGGAAMGTCAATGGGGAAAATGTGCGAATGATCGATTTCTTCGATTGTCGGATTGAG 

GGTGGCACAATGTCTTAGGCGGCGGCGGGTAAGTGACTCAAGCGAACCTGTTCGTTCCCT 

AGGCGCCCCTGTCTCCAAGAGGCGGCCCCGAAGGTGATCCGCCCCCCAGAGCAGGAAACC 

AGCCCACCAGTCCCAGCAAACTCTCCTCCGAGCCTGGGATGAGGAA 

Sequence 72 

AGGTACAGMTCATATTCTAAGATTATGTGCAAATATATGTGTAGAMGAGGCATTTAAA 

AAAAGAGGCAGGAGATTGAGAAGAGACATATTGMTGAGGCACTAATCTTTAGAAAAATG 

ATCTTTTACTGAATCAAGC CATGC TCATTCCTTGCTGAATCCAACAGCCCCAAATCTCTT 

TTCTTATATTTGGCTGTTAATTTTTTTCCTTGCACCATCCTAGA7TGTTAACAACC 

CACAGTTGGCTCATGTATCCTTACACTCTGAGTTATAATAAGTTGTATTTTTATAAACAA 

ATACCTAAATATTCAAGTTTTAAGGCGCATGGAAACTGAATCTAGGCAACGGG 
Sequence 73 

NGGCAGGTACTTTCCTTAGCATATGTTTTCCTCATATGTTCTGGGGTTTTAGACTGTAGA 

CTAATCTGGAGTTGGAGTCTGTGCCTCTCCACTNTCCACCAACCAGGCGGGCCCATCACA 

TACTGATAGGAAATACCTGAAACCCAGGCCANCAGGCAGCTCGGCCATGGCTGAGAGGCT 

GTGTTTTAGAAACGAGGCTCTTGGGAATGGGGCAGAACTTTTTTNAGGCCTTTTCAA 

CANATGGANCTCCATTGCAATTMTACCCCAGTCTGCTTNTAGACGCCGCTATTTCTTTC 

TTCAATGCTNCTTATGGCTTTGNGTNGCCGTTTCATTTCTAGTACCTCGGCCCNCTCTAA 

AACTAGTGGATCCCCCNGGGCTGCAGGAAATTCATATTAAAGCTTATTGATACCGNCGAA 
CCTNGAAGGGGGG 

Sequence 74 

TTGGAGCTCCCCGCGGTGGCGGCCGGTTCCTTGTGTCTTAAAGCAAACACAGGCACAGTA 

TTCTGGGCAAACCAGATGTTAATAGCTTGACCTCTCTGTCAAAGGAGGACTTGGTTGCTT 

AANACCCTTAAAGCTGTCAAAGATNTAGCTTNTCCAGGATACTATCATCCTCACCCATTT 

CTTATTCTCTTTGCAGGGAGAGATCATCTCTCAAAAAGCTCCCATCAGAGCAACCATTCT 

GGGACTTTGAGC 

Sequence 75 

TTGGAGCTCCCCGGGGTGGCGGCCGAGGTACTTTAGTGTTTAGCAAAAAAGGGACTATGG 

AATGTCAAACANGAACATATATGTTAAACATAAACATGAAGCAGTGCCAGCTGCTGTCTG 

CCATATAGNCATGCTTNGTTAATGTGGCACATCTTGCCATGATGGTGCCCTCATGAAACG 

AAGGATGCTGAAGCAGATGCCATTCTGTTTGCTTTGATCAGAAGCCACTCTNTGGTCAAG 

GMGNATTNGNTNCATGCTTAATTATAACMGTTCTCATCACAAAGATTATTTTCTTTCT 

CCAGATCTGGATTGTNNGCACTCACTTTATATTATNANACAATCACTCAGATTGTTTATG 

TTTTTA TGTTTAGAAATAAGACAGTGATTCAGAGAGTCCCTGTATTTAGATGAAAGAACA 

i I I I l lAAACATATTGACTGTAAACACTGAAAATCATTAATTCTCAACAGTTGAAGGTTT 

CTGACAGGAACGGCCCTTGGAAAT 

Sequence 76 

NCGCCCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGAC 

TGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCANCCCCACGCCCTGTATGGTCGGT 

GTCTAGAGC CTCA CAGCAACTAAGACCAACCCANCTCTCAGAAGAAGGAATGTCAAAATG 

TCATGTTCMTTTTACATTCAGTGCCTGGMTCTTTTCTTCACAATTGAAATGAAATGTG 

CTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAG 

CAAGACCACANACTCAAGACGGACATAATTGGATTTTTTTTGCCATGGCCTGGAAAGAAA 
GGTACCT 

Sequence 77 
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CCGGGCAGGTACTTACCGGGGAGCACAGTGGACTAGGGGATGCCCTTGATAATGACGGTT 

TCCTAGGTGCAAGATCAGAGCATGTGCCTACACCTTGTGCCCTGATGCCGAGGTGGAGAC 

CATTGGGAAATAATTAATAGATATTTTTCTGGGGGGAAATCACAAGTCACTGAGGTGTTT 

GTTTGTTTGTTAGGCGGTGCTAGAGCTTGAAGCCACTCAAGCCAGTTGGGGAGGAAAGGG 

ATGCGGAGGTACCTCN 

Sequence 78 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGCGGGTAAA 

GTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATG 

GTCGGTGTCTANAGCCTCACAGCAACTAANACCAACCCAGCTCTCAGAAGAAGGAATGTC 

AAAATGTCATGTTCAATTTTACATTCAGTGCCTGGMTCTTTTCTTCACAATTGAAATGA 

AATGTGCTGAAGGAGGTGAATCCATGTATTAATCTTCAGCTCACAAAGGAAATACTACAT 

MGAAGCAAGACCACANACTCMGGCGGACATAATTGGATTTnTCGCCATGGCCTGGAA 

AGAAAGGTACCTN 

Sequence 79 

TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACCGGGGAGCACAGTGGACT 

AGGGGATGCCCTTGATAATGACGGTTTCCTAGGTGCAAGATCAGAGCATGTGCCTACACC 

TTGTGCCCTGATGCCGAGGTGGAGACCATTGGGAAATAATTAATAGATAI I I I ICTGGGG 

GGAAATCACAAGTCACTGAGGTGTTTGTTTGTTTGTTAGGCAGTGCTAGAGCTTGAAGCC 

ACTCAAGCCAGTTGGGGAGGAAAGGGATGCGGAGGTACCT 

Sequence 80 

AGGTACCMTTCCCAGCCTGGGCCTTMGAGGTCTTGCTTGCTTCTGTTCCCTTTGGAAC 

TCTGTTACTGCTGTGAGCACGTGAACAGGCTCACTTGGTGGAAGGGGAGCAGCTAAATGG 

AGCATANCCAAGTCATTTTACTTGAGTCTGGCCCTCTCAGACCAGCCGTCAGCCAGCTGA 

CCAAAAATGTGTTAAGACAGAACAGCCAACATCAGCAGAGCCACCTACCTAACCTGTACC 

TGCCCG 

Sequence 81 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAA 

ATCCAATTATGTCCGTCTTGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGA 

GCTGAANATTAATGCATGGATTCACCTCCTTCAGCACATTTCATTTCAATTGTGAANAAA 

ANATTCCAGGCACTGAATGTAAAATTGMCATGACATTTTGACATTTCTTCTTCTGAGAG 

CTGGGTTGGTCTTAGTTGCTGTGAGGCTCTAGACACCGACCATACAGGGCGTGGGGCTGC 

TCCTGGACATGAACATCCTCCGAAGTTCTCCCCAGTCCACTTTACCCGCTCTGGGCTCAC 

CTCTGAATGTCCCCGCGTACCTGCCCG 

Sequence 82 

TCCTGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGACTGGTAGGTGGTTTGGAT 

TCTGATAGGAGAAAACACTGCACACAGCAGGCCAGAGTGGGAATACCTGTATTCTCTATG 

GTAACAGTCAGGTGAATGGAGAAACTGGAGAGATATTTTGACGGTTGTCTTTAAACTCAA 

TGGAGGGCTTGTGTAGCAANAGATTGTAAACTGGCAGCTGCTGGTGGTGTAATGGATTAA 

ATCAGCAGTGCTTATAGAAAAGAATTTGAGTGTTTTTAGATAGGCANATAGTCTCCTGTC 

TTCCCTGTTATCTTGGCTTCCCTTGTTTACGTANCTTATCTACCTCTCATGGAGATATGA 

GTTTGAGATACTTGTAAGGTCTGCCCAGGAGCAGATGTGTACCTGCCCG 

Sequence 83 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTANCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGANAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGTAAATACGACTTCATCATGCTGAGT 

TGTGTGCANCTGCANCGGATTCCTCTNANCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGA 

Sequence 84 

CCGGGCAGGTACCCAGTGATTACTATATCGGATGGTGCAGGTCTCAGTTCTTTCCACCAT 
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Mill CCCCACCATATTTTATATNTTTACATAGGTGAGACAAATGGACTTAGGCCAAAAG 

GTTTACAACTGGAGMCCAAGTTGGMGCAMGAGGTTTTTTTGCAAAGAGGM 

GCAMCAAACCTGGAACTTGGCTTATTGAGTCAGTTTTTCTGACATGTAAGTATGGCTCT 

TTCAGAGCACTAGCACTGGGGCTGTGCTCCCCTGGAGAGGAAGAGCCTTAAGGANGGAAG 

GANGGACCTCTNGAGTCTGTTAAGTNAACTGAGTGTGTTGACAGTTTGAACACACTCCTT 

NTGGCAGNAAAACAAACTGCTTGGCTTATTTCGTATGTTAAATCAAAGGAATTCCTTTAA 
ACTGGCTTTTAA 

Sequence 85 

GAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACCAGACCCGGTGAGAGCC 

CCCATTCCAATGCACCCCCATCTCAGCTTNTGGCCAGAAGACCTGAGCAAGTCCNTCCTT 

CTTCCTGGCCTTGGCCTTNCCATGGGTGGAACCNGGAGGGCTGGCTTTAATCTCCACCAG 

ACTNTTGCCCCGGGACTGTGATGGGCGATTGGCCACTTCTCAATAATCTTCCTGTTTTCC 

TTCCGCCTTCTGTTTGGCTTTAGACAGGACCANTGCATCTATAACACCACCTACCTGAAT 

GTCCAGCGGGAAAATGGAACCATNTCCAGATNCGTGGGAGGCCAATAGCATTTCGCTCAC 
TTGCTGATCCT 

Sequence 86 

AATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACATTCCCGGGCTGGTTGATT 

GGTAAGCTTTCATCTGCTGAGAAGATTCAGAAATCTTTTCCGATTTCAGGCTGAAATTAN 

AACCAATTTCTTGTCTTCCAGCTCTTATTGTGGGCCTTTAtTCTGCAAAGAAGGCTGCAG 

TGCCAGCGGAGCTGACTGCCAGTGCCAAGTGTGTGACTGACATAACGTATGGAGGTAATC 

CACACGTGTAGCTCAGGAAGGCTATTAATAAGAGGTGCTATTTATGCTGGTTTCCTGGGG 

ACTATACCTAAACTGAAAAG GCCAGTTCAAGTACCT 

Sequence 87 

CGCCCGGGCAGGTACGCGGGGCAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGAT 

GTTCATGTCCAGGAGCAGTCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAAC 

TAAGACCAACCCA GCTC TCAGAAGAAGGAATGTCAAAATGTCATGTTCAATTTTACATTC 

AGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATG 

CATTAATCTTCAG CTCACAA AGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGAC 

GGACATAATTGGATTTTTTTGCCATGGCCTGGAAAGAAAGGTACCT 

Sequence 88 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTTGGGAATCTGANTCTCAGGGTTTC 
TGGCCATTTCCTGTTTTAAAAAATCCTCCAGTMGAMTATTGTNGCTGTTTCCTTTCTC 
TTTCTAAG N GTTGATTC AACTTG AGTGTATN ACTTATTGGCTG ATAAATATTGTCCTTTG 
AGGGGAATCGGATCAAACCCAAACAAC 
Sequence 89 

GGGCGAATTGGAG CTCCC CGCGGTGGCGGCCGAGGTACTCATCCTTCCCACCATTGCCAA 
CAmAATATTACCATT TTTAAA CTTTM 

TTTCTATGTTG CATTTCTTTTTTAGTGAAG ATGMC ATTTTTTTC G N AAGTTC ATTAACT 

TTTAAATTTTCTAMCAGCAGTGGATCCMTCAATACTCCTCAAGTNGCATTTAAAATTT 

TAAAAAAACACAGCTATAATCCTACCACTTTGGGAGGCCAAAATGGGAGGACTGCTTGAG 

GCCAGGAGTTCAAGACCAGCCTGGGCAACATAGCAAGGTCCTGTCTCTTAAAAATAAAAA 

AATGTGTCAGATATGGCGGCGTGTGCCTGTAGTCCCAGATACTCAAAAGGCTGAGGCAGG 

AGCATCAACTTGAGCCCAGGAACTTGAAGCTGCAGGTGAGCCAGGGTCACTCCACTACAT 

TCCAGCCTGGGTGACAGAANMNACCCATTTGCAGATTGTATCATGTTTCNAGATTTCTG 

GACTTNATNAAATTTTGTAA 

Sequence 90 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATCCTGCCCTCAAGAANGG 

TTMGGTCCAGTTGGAGGAGACTTAGGAGTTNAACTGACCCAGTTCTGCAACCTTAGGTG 

GCTGTCCTNATCCTGACAGGAGACAGGGGACACACAAAAAACAGTGATTTAGGCGTTAAC 
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AAAGTGGCTTTACAAAGGTGTAGACAGAATATAGGAAATCCAACAAGGCCTGGTGGAGCA 

CCATGAAGC7GGTGACAGCAGGGAACACCCGCGTACCT 

Sequence 91 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTCCTGCTCTTGCTGAAGTGCCCTGCAGTC 

TGTCTGCCANGACAAAAGTTTGGGGCTACACAGGCTGGCCTAACAATGCCACAGGTGCTG 

ATTTGGAGGGCCGAGAGGAGTGATGCGGGCACTTATAAAGAGGAGAGAAAGGAGGAGAGG 

CTCACCCTCACGAGGCTTCTGAGAAGGGGGTGAACTGCAAAAGTGCAGAGGGCAGGAGAG 

ACCTCAGCATCTACCCAGTATGAGGAGTGTATCACAGCTGGGAAGGTGATTCCANAGCAG 

GGGAAAATGCAAGCTCCACTAATACAAATGAGGTGAGGCAACCAGTGCACANNGAGGGGT 

TCCACANGACCCAACAACCTCACAAATCCAACAGACGACCAACAATANCTGGGGACATGC 

TCAAGACCCACAGAGCAAGTGCATGAAGCCAGGGCAAGGGGCAGCAGTGAGGACAAATCT 

CTCCATGANTACCTGCCCG 

Sequence 92 

CCGGGCAGGTACCATATCAGTGGAGAGCTGCAGCAAGGTGGCCCCTACAGCCCGCAAACC 
AGCCTGCACATTACCTCTCCATACTGCAGCCCT1TATATGGAAACTTCTTACATCACTTT 
GCTGTGTGTGTTTACACAGGTGGGTTTTGCTGTACCT 
Sequence 93 

AGGTACTCAAGCCACCTGCCTGTCATGGCCCCCCAAAATGCTGGGATTGCAGGCCACCAT 

AACCGGTCATCGGGGCAACTATCCTTCAGGGCTTCCTCCAGTAGGGCTCCCCAGGTTGGC 

CTTGAGGCCACAATAAAAAATCTGACTTAGCTATACTCCTTCCCTGTCCCCAACCACAGT 

TA TTCTT GTGGATGAGGCAAATGGCAGCAGGACAGCTGGAGTATACCAGAGTTGACTCAT 

TTATTTTATGCATGCTTAGTGTGGCATCCCTGGGGACATGAAATTCCAGTGAAATCCAGT 

CTTTGTGCTGGAGAAGCTCATAGATCTGTGGAGGAGGCAGATGCCCACCCACTCACTGTA 

CCTGCCCG 

Sequence 94 

CGAGGTACAGACAACGGATTGAGCTACTTCCTAACACAGTGACTGGGATTCAGGACATAT 

CCANTAAACGTCTGTTAAGGAANTAAACAAAATAAATAAATAAAACAGGATAGTGGCTGG 

CTGCTCTCTCTGTGTAGACTGGATTAAAGTAGANCAGGAAAACATGTGGGAGGTTACTGA 

AGTGGTGGTATTGGGANTGAGGACNGGGGAGT 

Sequence 95 

GCGMTTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAAAATGCATATTATTATTCTTTT 

NATATACATATAATGAATNCAAANTNAAATANTAACNTATAATANATGCTCTNCACAGAA 

CATGCTGCAGAATAAATAATAAATGGACAATAAATTATCCTGGTCCTGTNTTGATAGAAA 

AGGAAAATAAAGACTAGAGGTTMGGCTTGCCAAATGTCATGTATTTAGTTTGAGMCTT 

GTTGGTAGAGCAAAGGGMTTTCTTCCATAAACTTGAMGTNCATTACACAMTATTTTT 

CTACTTAGTTCCTATAGCATTTCTTTGAGAAAAGTAGATATTACCCCTTAATAGTGAGG 

Sequence 96 

TGGAGCTCCCCCiCGGTGGCGGCCGCCCGGGCAGGTACCTGTTATCATCATTCCTCAAGAG 

CTGCATGTGGCACAGCAGGAAGCCAGGCAAGCTCCTGCCATTAGCACACAACGGCAAAAG 

TGGATGTCTGTGTCAACATTTCTGAAGACCCAGMTGGGCCGTAGATAGGCATGATTTTC 

CTAATGACTTfCGCACACAGGCAAAAAAATAGTAATGATAACTACTAATATTATAGGTGC 

TTCCTGTGGCCAGGGGATGTACCT 

Sequence 97 

AATTGGAGCTCCCCGCGGTGGCGGCCCGACGTNCNCANNTNCTNATCTNTGGTCTNTNCT 

TNAGAGNNAATATGGTTNACCTACNTGNMNCNTNCNCANMNATTTTTNAGAAACCCAC 

TNACATGTGTCAGGCAGTANGmGCCMGGTGCCAGACMNCMGAATTTCAAAGTTGG 

ATGATACAAAGTATAAAACCTNTAATTTTGGCCANGCNCAANGGNTNACNCCTGTAATCC 

CAGCACTTTNGGAGGCCGAGGCGNGCAGATCACCTGAGGTCAAGAGGTCGATNTCAGTCN 

GCCAACATGGCAAAACCCCGTCTNTACTAAAAATATAAAAATTAGCCAGGCATGGTGGTA 
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CCTGCCCG 
Sequence 98 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTACCAGTGACCAGTCATTGTGTCATT 
TCTTTCGAAGTCTGTCAGTGATTGGTCMTGMGATCTG7TACTCAG7TTACCCTTAAAT 
ACCAAGGCATGTAGTTGTGTCCCTCCTTATTTCCCAGTGATGGAAGTTGGTGACAGCTTA 
CAACAAAAGTATGAGCAAAAAAAAATGAGTTGGTCAGCAAAGGTAAATGTGCAAGAAAAA 

Sequence 99 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTGTCTTTATGCTGCGAGATAA 

GTCCTCTCTTGGTTTGAGCTCCCACCTTTTCAGTGMCTCTTACATTTTGGGGGATCTGC- 

TCTTGTAAAGGACATCCTTTCTGGTGAGTATTCTTTTGGTTAATTTTTGGTT^ 

TGTGCATGMTTTAATCTCATTAGGAAACAAGTTAAGTTGAATAGACCAACTAGTGAATT 

AATCCCGTCACCAAA 

Sequence 100 

CCGGGCAGGTACGCGGGGAGTCATCTCAGAGGGCACAGCCTGGAGTGCAGCTCAACAAAC 

ATATGTGTAACAGCCGAGTGCCACCACGTTAAGAACTGGACTTACAAGATGGATAGGACC 

TGGTGGCAGCTGTCCTTGAGGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAG 

AGTACCTN 

Sequence 101 

TTAAGGACAGTTGCGGCTNANGAGAAANGCNNCCCCAGGNTGGCCCTTCATTCCGCAAAC 

CACCTGCACATTACCTCTCCATACTGCAGCCCTTTATATGGAAACTTCTTACATCACTTT 

GCTGTGTGTGTCACACAGGTGGGGTTTTTGCTTGTACCTCGGCCCGCTCTAGAAACTANT 

GGGATCCCC 

Sequence 102 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATACTTTCCTGTGTCTTGCTGTTTTCAC 
TTACTATATCTTGGAGCTATTTCTATATTGCCACATTCAMGCTTTTTAMGCATT^ 
GGATAGCTATTTATAGTCAAGATGTATCACAGTTTGTCTTATTTACCTGTCCCCCCACTG 
ATGAACAGNCTGGTTGCTTCTGTTTGI Mill I CTGTTATTGTAATAAACATCCTTATAG 
T 

Sequence 103 

TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCTCTAGCCTTCACTCACACT 

CCTTTCTATGTTGGTCAAAAGGCAAAAACCTCCTGAGTGACAATTACCCCTTCTCAGGTA 

GCAGCCAAAAAGAGTTAATTGTTTCTCATCAGAATCTGCATGTGGCCCCAGAGCATCATG 

TATAGAACTATTCTGGAGTCCTTATCATCATCATATAACTACGTGTCCACTGTGCATCTC 

CAGAAAAGAGATCCTATTATAGTCATCTTTGCTTCTCAGTTTCCACCACGATACTAACAA 

TAGCAGCCATTCAACAGATGGTTGTGGAATGACTAAAATTCTGTGTTCCTTATTGCATGC 

CTTGCCTTCCAACTAAGCTTTTCAAGAGTATGGATACNGTATTATAAGGTATTTATATAA 

CACCCACACACAGAGACACATAAAGCACCTGGGAATCGCACTTTTCAGACAGAAAATACC 

CAATAMTCTTACCGAATAMTTAATCMTNGAGMTTAAAGAACAATTTTGCCAGAAGC 

ATAGCTGAATCTGGTGCTCTGACAGAAAAATGGATTCAGGTTATGAAAAACCNGTTATCA 

ATGCTTGNATGTGTNGNAACATNCCANTNCATCTAAATGTGGGTACCTGTGACCCTGGGN 

CGGTTTAA 

Sequence 104 

GTGTGTGCATACTCCCCCCATCTCACCCAATATGCTATTTTTGAGCCTCCATCAAGCAGT 

GGGGTCTTTGNTTCCTCCTCTTGMCCCGTGCAGACAGGAATTCTNCCCCGANCGTTTTG 

GATCTTGAGCATTGTGTTTTAAGAGCACAGNTGCAAGATCAAGANCATGCACCTGGCACT 

GCTAGGTGCNCAGCGCCATCANGCCAGCATCTCCCTTCCCATTTTCCTTGTGACCACTGT 

TTAGGCATTGANATTAAGCACTGTATATGTATATGTGTTCATTGAGAGAGGGGATTAAGT 

TTTTTTACTTTTAAATGAAATCMCMTGAAM 
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TTTTAAGCNGTGAAATGMCCMCATGACCGCTGTCGGGTGCATTTTTTAAAAATCC 
Sequence 105 

CCGGGCAGGTACGCGGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGG 

AGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTA 

GAGCCTCACAGCAACTAAGACCAACCCAGCTCTCANAAGAAGGAATGTCAAAATGTCATG 

TTCMTTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAA 

GGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGA 

CCACAGACTCAAGACGGACATAATTGGA I I I I I I I I GCCATGGCCTGGAAAGAAAGGTAC 

CT 

Sequence 106 

AGGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAG 

CCCAAGCGCAGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTC 

GGACCATCCCTTCAGCCCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTA 

CCTGCCCG 

Sequence 107 

AGGTACAMGAMCCCTTTGCAGACAGCCCATTTATAGAGAGAAGCTAAGTATTTATTTA 

CTTGCTACTCAATTTAAAATACGACCATGGCAGGCAACCAACCAGGAGAAGAGCTAAAAG 

TTTGAAAGCACTGCCCTTTTCTCTACTATAGMGTTACAGGGTCAAGCCTTGGAAAATTT 

CCACAAAATAAGGCTTTGCTAMCATATTTACCTTATCAGAAAGATTGTATCTCAAAAAA 

GTTACGTTTTTCTTCAGGCAAGACAAGAAAAACTGCCTGTCCTAATCGACAGTACCTGCC 

CGGGCGGC 

Sequence 108 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGGG 

GGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCC 

TGTATGGTCGGTGTCTAGAGCCTACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGA 

ATGTCAAAATGTCATGTTCAATTTTACATTCAGTGCCTAGAATCTTTTCTTCACAATTGA 

AATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATAC 

TACATAAGGAGCAAGGAACACGCAAGAGATCTACAGCTCTGATCTCCAGGATAGTGAAAT 

GAGGTGGTGAATGATAAGGGTATAATTTAGCCAGCTCAGTGAATAAATAATCATTGCAAA 

TGAANCAGTAGCACCAGACCACAGACTCAAGACGGACCATAATTGGAI I I i I I I IGCCAT 

Sequence 109 

GCGCGTAATACGACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCCGAGGTAC 

ATTTTACTTGCTTGATCTCTGATTCCCCTCACTGGGTCACCCGCACCTCTGAGTGCCCTC 

CTCACCCTGCATGAGTCTTGACACCAGGCCCTGAGCCTTTCCTCTGTGTGGTTGTCCTCA 

CAGTCCTCTGTGTGGGATCCAGAACTAGGCACCAAGCTCCCCACGTAGGAGGGTGCCCTC 

CCCATTCCCTCTGCTGAGGCTCGAACCAGCATGTCAGGCTGCTCCCCCATGTGGACACCT 

GACTCTCTTTGGGCTCTTACTCTACACTGGCTGTACCTGCCCG 

Sequence 110 

AAAAGGGCCTGCTTATTGTTTTGTGCTACCTANCATCCAGCCGCCCCTTTMTTTTCAGG 

TTAGCATGTGTTGTGTGGGTCCCAGTGGACTGGCCTTGTCGTTTCTGCTCACCTCAGGAT 

AGTTATGGAATGTTCATGACATANACTTAGCCAATTCAATGTCCTTGTCCTANATTTTGA 

ATACTGMGGAGGAACAMGTGTAGGGACAGTTGAAGTGTTTACAGTGGTAAGGGCAATG 

ATGTGGANAGTCCAAAGGTGAGGGCTGCAGCAAGACTACAGTAGTGGCAAGTGTCTAATA 

ATAGCAATGCACAATGTGACACACGAACTANACCTTGATCATCTTTCCTGATGTCTGATC 

TGGACTCTGTTGCATCTGGCCTGGTTCTCCAGCCTATANTTGATTCTGCAAGCTCCCTG 

Sequence 1 1 1 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACAGACAGCAGGTAGGTA 
AGCTGATCTGGAAGGTAGGGATAACTTGTATAATGAATGGAATGTTAGGAGTCTGTGATC 
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CTAGCTCTTCAACTTAAGAGTTCTGCAAACCTGGCCGGGCGCAGTGGTTCACGCCTGTAA 
TCCCAGCACTTTGGGAGGCCAAGGTGGGGGCGGTGGATCACCTGAGGTCAG'GAGTTCGAG 
ACCAGCCTGGCCAATATGGCAAAACCCCGTCTGTACCTGCCCG 
Sequence 112 

CCGCGGTGGCGGCCGGTTCCTTGTGTCTTAAAGCAAACACAGGCACAGTATTCTGGGCAA 

ACCAGATGTTAATAGCTTGACCTCTCTGTCAAAGGAGGACTTGGTTGCTTAAGACCCTTA 

AAGCTGTCAAAGTTATAGCTTCTCAGGATACTATCATCCTCACCCATTTCTTATTCTCTT 

TGCAGGGAGAGATCATCTCTCAAAAAGCTCCCATCAGAGCAACCATTCTGGGACTTTGAG 

CAGAATGCTTAAGGCAGTGGTTCTAAAATTCCACACCACCTGTTCTTCCAGTTGTGTCCT 

CTCTGAGCTGCCTCTGACGGTCACATTCCATGGCCATGTGCCACAGCAGACTTGTGATAC 

TGATTTTAGTGTTGTCCTTGGCCATTGGGGGAAGGGAGGAAGGAGAGAGAAGAGAAGGGG 

AGAGAAGAAGANAAATGGGAGTGGG 

Sequence 113 

GCGGCCCGAGGTACTTGGTGAAGTGAGATTCANACTGAGTGGGGTCACAGCACAGGGCAC 

TGTCTTGCCTGGCTTTATCTGAGCCAGTCACACCTCTCCTGGCCACTATCTGTGGTCTAG 

CCCCCTTTGTGCAGAAAGAGAAAGAAGAGCCTTGAGGACCAGCCTAGTCAGCCGGGGGCC 

ATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATT 

CAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTA 

CTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTG 

CAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGGCAAGTTCACCTT 
CCCTGG 

Sequence 114 

CGCGTAATACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCA 

GGTACAGAAAAGCAGAACATGACAAAGCCATTAGAAATACCAAGCATGTCTCCTGGAGGG 

TMGAGGACAGTGAGCCCTTGMGCAAGTTGGGGGGATTGGGAATGATTTTACAGGATGG 

TCAGAGTTCGAAAGATGGGTAGGAATTCAACAGGAGTGAGAGGTAGAACACAAGCTTAGC 

TGAAAGMCAGCATAGAGAAMGACAAGTTATTGGTTACTTGAAGAAGAGTTAGTGAAGA 

GCCTCAATGCCTAGCTAGGCAGCTTTATACCTAATTTCAGTGGGTGATGAGGACTACTAA 

AGGTAACTGAGTAGGGAAGTGACATCATGATAGCTATAATGACTAGGCTGGGGCAGTGGG 

ATATCAGTCCAGTTAGCAGATCAATAATCTGGACCACTTTGAGGAAAAGCAAAGGCATCG 

TAGAGAAMGAACCTTAATTTCAGGCCTTATTTTTAGTTGAAAATGGTCTGAGTTTGGTG 
G 

Sequence 115 

CCGAGGTACTTCCATGANTATTCAMGAAAAACMTTTTTTCCATNTGNTTTNTGANATG 

ACCACCAGTGGTGTGAACTACCTNCTGCCCCCATCCGCAGTCCCAGGATCCTGGGACAGG 

GCTTATGAACACAACCACTGTAGTCAAGCTCACTTGATCCACAGCCTGGCACCCCCACTG 

TCTGGCTAGGGAGCCTCGAATGGGTCCCAAGGCCACCCTGCTCCTCAGTTACATCATCTG 

CATAAGTAGTGGTGGTTGTGAGGAATTCAGGAGCTGCAGCATAAGGGCTCGGCAGGTCCT 

AAGAGCACAGTAAATGCCAGTGATTCTTAAGAGTCTGAGCTCCATTGGAGAGGCAAGTAA 

GCTGAGGNTCAGAGACAGAAAATGACTTGCCCAAGATCACCCANCTGGGGAAGTGACAAG 

GCCAGGGTTGGAGCCCNTGGTTGAGCTGGNTCCCACAGGCCANANCNTAATTCTGCCCTT 

TCCTGGAAAGAACCTCCCACCCNG 

Sequence 116 

CCCTTAGCGTGGTCGCGGCCGAGGTACTACAAAGCATATCTGAAAGGAAAGATTGAGCGC 

ACCAACTATTTGCAGAAACGAGCTGCAGCCATTCAACTACAAGCTGCTTTTAGGAGACTG 

AAAGCTCATAATTTATGTAGACAAATTAGAGCTGCTTGTGTTATTCAGTCATACTGGAGA 

ATGAGACAAGACAGAGTTCGATTTTTAMCCTTAAGAAGACTATTATCAMTTtCAGGCA 

CATGTAAGAAAACATCAACAACGACAGAAATATAAGAAGATGAAGAAAGCAGCTGTTATA 

ATTCAGACTCATTTCCGAGCTTATATTTTTGCCATGAAAGTTCTAGCATCTTACCAGAAA 
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ACACGCTCTGCTGTCATTGTGCTGCAGTCTGCATATAGAGGGATGCAAGCCAGGAAAATG 

TATA1TCACATNCTCCATCTGGTATAAAGATTCAATCATATTATCGNGCTTATGTTTCTA 
AAAAGGAAT 

Sequence 117 

CGCCCGGGCAGGTACTTTGCTACACGGTCGGGGGCCATTGAGACTGCCATGGAAGACTTG 

AAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAG 

ATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTG 

ATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGC 

GCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGA 

CAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGC 

TGGAACCCATGGTCCACTGAAGTTCC 

Sequence 118 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGCGGGTAAAGTGGACTGGGGAGA 

ACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAG 

C CTCAC AGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTC 

AATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGA 

GGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCA 

CAGACTCAAGACGGACATAATTGGA I M I I I 1 1 GCCATGGCCTGGAAAGAAAGGTACCT 

Sequence 119 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTAGAAGAGACAGGGTTTCACCATGGTGGC 

CAGGCTGGTCTCGATGTCTTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGG 

GATTACAGGTGTGAGCCACCACACCCGGCCTGMTTGCACTTTTGATGGCTGAGCCAAAT 

GCTCATTCTATTTAATATTGCACAGCCTGAACCAAAACCCGTAACATTATAAAGGAAGAG 

ATGAGAGCCATTTCAAACTGTGAGAGMGGGAAGACATCAAGCAAAATCTGGGGGTTTTA 

GCCAACAGACTTCAGTCCTGGGGCCTCGCGGCAAAAAAACAC 

Sequence 120 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACACACTAGCTGATAAGACACTGTTGCCCATAA 

TGTCTATTTATTGGATCAGCAATTTATAAGTCCCACATTCTCATGCCACATAGCTCTACA 

CAGCTGCAAAAATATACCATAGCTTGCAGGTGATCATTGGTTTGATAAAAGATATTGAGT 

CGCTCATCTTGTGAAAGTGATCTTTGATATAAGAGGAGCATCAGCGGGGAAGCTCACATG 

TCCCGTGGC TCACA CACCAGAAGGTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTCCAT 

GGCAATGGCl l l l ICAGAGAGGCCTGTTTTGTGGTTAACTGTGTGTACCTCGGC 

Sequence 121 

CCGCGGTGGCGGCCCGCCCGGGCAGGTAC I I I M I I I I I I I I M I I I I I IGTGACAATGC 

CAATATTCCAAGAGCCTGT AATA AAGGCTCTCAGCATAAATGGTCTACTATCTAACACCA 

CCAGMCCCTT TTGGC ATCTTTTGCATCTGGATACrrTTGCTCTATGCATGAGTGGGATG 

TATGGTCATTCCTTTTATGCAATCATGTAGTAGGCAAAACAATGGCTCCCTCAA.^GATGT 

CCACATCTTAATCCCAAGMTCTGTGATTATGTTAGTTTATATTGCAAAGGAGAATTAA.G 

GCTGCAGATTGAATTAAGGTTGCTAATCAGCTTGCTTTTAAATATGGAGAGCATCCTGGA 

TTATCTGTGGGAGCCCAGTGTAATCATGAGGCTACTTAAAAGTGAAAGCGGAAGACAGAA 

NAAGAGTCAAGAGTGAAATGTGACCCCGGAAGAACTTTCANAGGCGCAATGTTGCTGGCC 

CTGAAAAANGAGGAAGGGCTTGCTGANCCCAATAANGNANGTGGGCCTCCGGAAGAATGA 

AAAGMGGATCACCCCAAAAAGCTTCCAAAANGGAMGGCANCCTGGCCACACTTTTGTT 

TTTACCCCAATGGAGAACCTCAGTAGGAACGGTTCAAAAATCCTAAAAAAAGNNAAAATG 

NGTATTTNAAATGGNNGNNNNTNTTTNCCCCCCCCCCCC 

Sequence 122 

AGAATTGGAGCTCCACNCGCGGTGGCGGCCGANGTACATGTCATTTTTTGGTGGTAGCCA 

GTAGGATGGTATTACAGTAGGGAAAAATCCAACATTTTGTTAAACCATTCCCCCTTTCAT 

GGGTTGCCATTTGCAGACTCTGCCTGCATGTGTCAAGGCNACCTTTGGCCCTTTTAAAAA 
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TCATAC AGTAT GTCAAGGMTATAATTCNTTAAAATTATTCTCATCCCCCCATTTTGGAA 

GGAAGNTTTTNGGGGNmNNNNNNNAAAAAAATTTTCCANCCCCCCCAAATNGTTNAM 

AANCNTGGNAACCTNGCCCTNAACCCGGGGGTTTAAAAAANCNAAAAAANNTNNNNNNNN 

NTTTNTTCCAAAGGGCCACCCCCCAMNAAANTTTNCCNNNTTTTTCNTTAAAANANNNN 

N NNAAAA ACCCCCCNNNNGAAAAANTTNAAAATCCCTNAAAAACCCCNNCNNANAAANAN 

Ni J I M I NNNNNNNNNNNNNNNNNNNNNNNNNNNCCCNGNAAAAAAAANNCCCCCNTTTT 

TTTNCNNNNNNNNNNNNGGGGGGGGGGGGNNNCCCCACAAAAAAAAANCCCCCCCNNNTT 
J I I I I I I I J I I 

•Sequence 123 

AGGTACAGTATTGTTGACTGGCTAACAGAGGACCAATTAATAAGCCAAAGAAACGGCTCT 

TTMCAATGAACAT TTCTG CCATCAACTGACAGATC.CCAGGAATAMTGTTTTCCAGTGA 

GGAGACTTCTCTGGTTTTCAGAACACCTCTGGCTGCCCCTGCCCACCCCATAGAAGGGCT 

ATCCCTCCAGGTCAGGTTAGCATCATCACCTAGAGCCAACAAGTCAAGGAGGTGATGGTT 

TGCCTTTGACATCTCTACCCAGACCAGACTCCACTGAGAAGACTCTCCCTTTTTCATCAC 

TGCCCTACCTAGTTAGTTGGTCCTGCCCTGGGGCCAGAGTTTCACTAGGGGCTGAATAGT 

ATACTGTT AGCTC AGGCAACAGATGAACTCTGCCTCCATGCGCAAATACAAAAAGCTATC 

TCTGCTTCTTTTTCACTCACTTMGATTTTGGAAGAATGGCTCCTAAAGCGGGTGATCAT 

CTGCCCTCACGACTTACAGTGCCTCAGCATCACATGCTATTCATTTTGCACAGCAAAACC 

Sequence 124 

CCGCGGTGGCGGCCCGAGGTACTTGGTGAAGTGAGATTCAGACTGAGTGGGGTCACAGCA 

CAGGGCACTGTCTTGCCTGGCTTTATCTGAGCCAGTCACACCTCTCCTGGCCACTATCTG 

TGGTCTAGCCCCCTTTGTGCAGAAAGAGAAAGAAGAGCCTTGAGGACCAGCCTAGTCAGC 

CGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAG 

AGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGA 

GAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGA 

GTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGT 

TCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAAGGATCT 

Sequence 125 

CCGCGGTGGCGGCCGCCGGGCAGGTACCTCTGTGTGGCAGGCGATCAATGGTGTTTCTAT 

CTCAGCCAGGGACTCAGGTGCGAGACTGAGAAGCCCAGGAGGCAAGAGGCACCTGCAGCT 

C TAGA GCCCACTGCAGATTTTCCTTGAGGGCCGTATATCCAGAACCATCCCAGGGCTTGG 

CTTTTGAGAGAAAATGTAGTCTTAAAGAATGAACAAGTGGGCCGGGCGCCGTGGCTCACG 

CCTGTAATCCCAGCACTTTGGGAGGTCACGAGGTCAGAAGATGGAGACCATCCTGGCCAA 

CACAGTGAAGCCCCGTCTCTACTAAAA/\AAAAAAAAAAAAAAAAAAGTACCT 

Sequence 126 

AGGTACTCACTN AN ATN ANATN AGGNAN NCTACATTAGANNTGNGGTNCNC N N NCTTNAG 

NTANTGCNTTNTNTNCCATATANNTG^iTAACAANCTNCCACCAGACACCGATGGNGANG 

AGCCTCATAGCAGATGTGGCAGTGGCCGTCACTTCCCATGACCCGCGTACCTGCCCG 
Sequence 127 

CCGCGGTGGCGGCCGAGGTACAGTGAGGGTGTTCAGAGGGAGGCACAAAGAATAGCTCTG 

AGATTAGGCAATGGAAATGACAAAA/sAGAGATGAATAAATCCGATTTGAATACCAACAAT 

TTGCTCTTCAAACCTCCTGTAGAGAGCCATATACAAAAGAATAAGAAAATTCTTAAATCT 

GCAAMGATTTGCCTCCTGATGCACTTATCATTGAATACAGAGGGAAGTTTATGCTGAGA 

GAACAGTTTGAAGCAAATGGGTATTTCTTTAAAAGACCATACCCTTTTGTGTTATT^^ 

TCTAMTTTCATGGGCTAGAAATGTGTGTTGATGCAAGGACTTTTGGGAATGAGGCTCGA 

TTCATCAGGCGGTCTTGTACCTGCCCG 

Sequence 128 

CGCGGTGGCGGCCCGCCCGGGCAGGTACAACTGACCTTCTGGGCTGGATAATTCTTGGTT 
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GGGGGGACTACTCTGCACATGATAGGGTGTTCATCTCTGGTCTTCACCTATCAGGTGCTG 

ACAGCACCACAGCACCCCAACACCGCATTGTCAACAGACACTACCAAATGTTATCAGGGG 

AGGAGAGTGATTCACCCCTCTTAAGGAACCACTGAAMTGATTTTAAAAATACCTATTAT 

TTTCATCTTTAAGGAATAATTATTGGCCAGGCTCAATGGCACATGCCTGTAATCCCAGCA 

CTTTGGGAGGCTGAGGCAAGCAAATAGCTTAAGGCCAGGAGCTCAAAACCAGCCAGGACA 

ACATAATGAAACCTCGTATGTACCTCGGC 

Sequence 129 

ATCGACTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACA 

AGCAAAACCCACCTGTGTAAACACACACAGCAAAGTGATGTAAGAAGTTTCCATATAAAG 

GGCTGTAGTATGGGGAGGTAATGTGCAGGCTGGTTTGCGGCTGTAGGGGCCACCTTGTGC 

AGCTCTCCACTGATATGGTACCT 

Sequence 130 

CGAGGTACACAGAGTTNCAAATTAACCCCTCCAACAGCCTCTATCAACNTCNTGCACCAN 

TGTGGTAAACCGTTACAACAAATAAACCTAC NTNGAC NGTCCACAGTTTACCTTAGGGTT 

CACTTTNTGGTGTTATACAGTCTATGGGTTTGTATAAACAGATAATGACAGGTTTCCATG 

ATTACAGTATCATACAGAATANATTCACTGCCCTAAAAAATCTGTATGCTTCAGGAGGCC 

GAGACGGGAGAATCGCTTGAGCCCAGGAGTTCAAGACCGGCCTGGGCAACATMCGAGAC 

CCCAGCTCTACAAAAAAAGTGAAACTAACAGCAAAAAAAATTAGCCAGGCATGGTGCCAC 

ACACCTGTACCTGCCCG 

Sequence 131 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGGTAAGTAAAACTCGTG 

TAAGTTTCCGCATAGTTCACACTCATGGGAAACAAAAACTGCGCCTTGTATATCATCATC 

CCCCGACTGTCTGTCACTCACAGAATTCAAGGCACATATGAAGTGCTCAATAAACAAGAG 

CTTAGAGGAAATCAATGAGGCTTAGTGCATAGCCTTATTGCAAACTGTGGTCTCCAATGC 

AAAAACAAAATAAAACAAAAACCCCACAACTAAAAACAAAGTAGGGGATGTTAACAAGAG 

AACAAGGAT ATGAA ACCTATCTCATGAAAGAATAAACATTTCACACAGAAATCATGAATG 

TTGGTATTMTTTTAAATGACATTTGTCAGCCATAGAAAGTGATCTCTAATAACTCACGA 

TAGAGCAGTTTAATGCCCAAATAATTGAAAAAAGAATTGGAGAGGTAACCTAACGGCAGA 

TAAGATTAAGTAAGGAAMTTTCAATCTGTGTGAAGAMGAGAANGCTTTMTTTTTC^ 

CCCTTTGGGACAATTTTGGAACATCANGGCTTTTCCCCAAAACTCGACTTTATTTATCCT 

ATATATATCCCTTANTTANGGATCCCTAAATTTA I ITTTTT C 

Sequence 132 

C CG GGC AGGTACGTTTAAATGGG ATTGTTTGTTTTTC ACTTGTTG A 1 11111 I AAGTTCA 

CCAGATGCACTGTGCTGGGGTTCTGTGATAGTCCCTAATTGCTGTGCACCCTCCCAAGCC 

TGAGAGCAGCAGGAGGGAGGGTTGCGAGACAGCAAAAAGGTGGACTGCCTCTCTCTTTGG 

AAGCTGCATGCCGGAGAAGTGTAGAGCTGCTCCCAGCTGGAGAACTCAGGAGGACTAGGG 

TGGCCTCACTAGCATCCCAGGCTAGTGGGCCTTATCCTACAAGGTTCAGTGGTGGTGAGG 

TCTGCAGTCTATCACTGCTCAGCCCCATGGACTTGGCCCCTTTTCTGGGGAGCGTGCAAG 

AAAACTTGGCCTTCCCAATTGCTGGAGCTGCAGCCCCTGGTTTTGGGGTACCT 

Sequence 133 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAACTTAAACAC 

CACTATTTGTTGAGATGAAAAAAAGCATATATAGGAAGCCTTCAAAATGAAATGGTCAAG 

GGT GAGT TTACACAGATAGATAGATTTAGGTCTCTTCCTTTTGCTTTGTGAAAGCATCTA 

GTGTTTTAGGTGTCAGAGAGGGAGATATCCTTACAMGCAGAGATTATCATTACAGGTTT 

ACATTTCTTACAAAGAGTTTCAAAATAAACAGGTAAATGCCAAAAACATATATTTTGGTG 

ACGGATTMTTCACTAGTTGGTCTATTCMCTTMCTTGTTTCCTAATGAGATTAAATTC 

ATG C AC AAAT AAC C AAACC AAAAATTAAACC AAAAG AAT ACTC ACC AG AAAG G ATGTCCT 

TTACAAGAGCAGATCCCCCAAAATGTAAGAGTTCACTGAAAAGGTGGGAGCTCAAACCAA 

GAGAGGACTTATCTCGCAGCATAAAGACAACTTGTACCTCGGCCCGCTCTAGAACTAAGT 
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G 

Sequence 134 

AGGTACTAACAGTCAGGAACAAATAGAACACATAGAGAGGACAGTGGGAGGTCCCCCTGG 

CTAAAGCAGAGGGTGTGTTGAGAGAAGGAAACCTTTGCCTCAATGGATCGTAGGCAAACT 

GAGAAGGACCATAACACCTTGCTGAGAAGTTTGAATGTGATTCCTTGGTGACCACAGCAG 

GAGGTCCTGAAGTCCCTGTCCTTCAGATGCAAACAAGGAGGCTGTCCCATACAGACTATC 

AGGGAAAAAATGCACTGTTTATCAAAAGATATGAATGTTCTTGACAGGCAAAGCCCTGCT 

CAAATGAAATGGCAGTGCAGGGAGATGATATTCAGTGTGATGCAGCTGTGAGTACCTGCC 
CG 

Sequence 135 

CCGCGGTGGCGGCCGCCCGGGCAGGTACNCGGGACACATTCAGAGGTGAGCCCAGAGCGG 

GTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCT 

GTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGA 

ATGTCAAAATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGA 

AATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATAC 

TACATAAGAAGCAAGACCACAGACTCAAGACGGACATAATTGGA I M I N I I GCCATGGC 

CTGGAAAGAAAGGTACCT 

Sequence 136 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCCTGTAGTCCCAGCTACTCT 

GGAGGCTAAGGCAGGAGAATCGCTTGAATCCGGGAGGTGGAGGTCGCAGTGAGCCGAGAT 

CGGGCCACTGCACTCCAGCCTGGGAGACAGAGCGAGACTCTGTCTCAAGAAAAAAAAAAN 

NAAAAANA AAAAAA NTTGGGGNGNAN mTTTTTI NCCAAAANANTTANGGNCNTNTNCA 

ANTTTNAAGTTTTTNTAGGGTNTCNNTTTTNTAGAMTTTTTTTNAAAA 

TAAACNTAATNATNTAAAATTTTT^TTNTNATNAANCCNNTTTTTANTAANNGNGGAAANT 

TAAACCCCCNTTNN NTTAN N NTAAAAAANTACCAAATTAATTGNGGAANNTANGG NTTNT 

AAANANGGTTTTTTGTANATNNNAAAGTGGNAAANCCAAAACTGAANTTCAGGCCTCCTG 

ACTTTTATCTAGGGCTNTTTTTGCATNATTNGCCTC 

GTGCTTTAAMCCATTGGGCTTTNAAAAATNTTTTTTATTTGGAATTTG 

ACCTNNNNNCNTTTNCTTTTTTTNGG 

AAAAA NNNNNNNNN AAAANNCCCCCGGCNCCTTTNAAAAAANGGA7TCCCCCCCGGGCGG 
GGGNN I I I I I I i I I NAAATTTTTTNCCCCCCCCC 
Sequence 137 

ANGTACTCAAGCCACCTGCCTGTCATGGCCCCCCAAAATGCTGGGATTGCAGGCCACCAT 

AACCGGTCATCGGGGCAACTATCCTTCAGGGCTTCCTCCAGTAGGGCTCCCCAGGTTGGC 

CTTGAGGGCCACAATAAAAAATCTGACTTAGCTATACTCCTTCCCTGTCCCCAACCACAG 

TTA TTCTT GTGGATGAGGCAAATGGCAGCAGGACAGCTGGAGTATACCAGAGTTGACTCA 

TTTATTTTATGCATGCTTAGTGTGGCATCCCTGGGGACATGAAATTCCAGTGAAATCCAG 

TCTTTGTGCTGGAGAAGCTCATAGATCTGTGGAGGAGGCAGATGCCCACCCACTCACTGT 

ACCTGCCCG 

Sequence 138 

CCGCGGTGGCGGCCCGAGGTAC I II I I I I I I I I I I I I M I CCCAGAAAAGGTTTCCAATA 

GAGATTGGAATGGTTTTCCAATAGAGATCTTCTTGGGTCCAATGATGATGTAATACTAAT 

GCTTGTTAATATACTTTACAAAATTGCTCTCTAMGCTCTTGTTTCTATCTGAAATACCC 

MTTAATGACAGTTTGAACTGTAAATAACCTCTACAAGGTATAOAAACTATTGACTGTTA 

ATATACTTTATAAAATTGCTCTCTAAAACTCGTTTCTATCTGAAACATCCACCAATTAAT 

G ATAGT TTG ATCTACMTAACCTCTATAACATATACAAACTATTGGTTTTTTTGTTG N TG 

imii IGTTTGTTTTGGAGATGGAGTCTTGCTCTGTTGTCCAGGCTAGAGTGCAGTGGCA 

TGATCATGGCTCACTGCAACCTCTGCACTCCAGCCTGGGTGACAAAATGAGACCCTGTCT 

CAAGAAAAACAAAACCCAAAAATTTNNACAGCAAAACTCAAAA 
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Sequence 139 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAAGTCCCAGCTACTCAGGAGGTTGAGGCA 

GGGGGATCCCTTGGGACCAGGAGTTTGAGGTTACAGTGAGCTACAATAGCACCACTGCAC 

TCCAGCCTGGGCAACACAGCAAGATCCTGTCTCCTTTTAATGAAAAAAAAAGAGGGGAGG 

GGGGCAGAGGAGACAATAGGTATTTGCCAGAAGAATTACTAAACCACCAGTAAATATGAA 

ATTATGCACAACCTC AGTGGC AATTAAMTGCAAAATTAGAAAGACAACTTTAGGCCTAT 

AAGATTCATAATGATGTTTTTAAAAAAACTTATTGAAAAAGG 

GTATTTTCATCTCTTACTATGTTTTCTTACATATTAATAATAATCCCCCACTTCCTATCC 
TTAGGAMTATCAACTCTTCCACCTTTATTGGGAACTCTAGCATCTCTACTGTAGTCTTT 
TCTACTCCCCAACTCATACCATAATCAGGGGCATTCTCGACGTACCT 
Sequence 140 

GATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTAACTGAAGAATTAAAGGCAAAAAT 

TTMGACTGGAATCAATGGAGAATAGAAAMGTTTTAGCTGGCAAGTGAATATGCTTCAG 

ATAAATCAACTAAGATAGACTGAAATAAGCCAGGGGTAAGAGTAGTTCTTACTAGAATAG 

AAATGCCAGCCGGGCGCCGCGGCTCACGCCTGTAATCCCAGCACTTTGGGAAGCCAAGGC 

GGGTGGATCACGAGGTCAGGAGATTGAGACCACGGTGAAACCCCGTCTCTACTAAAAATA 

CAAAAAATTAGCCGGGCATTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCGGA 

GGCAGGAGAATGGTGTGAACCTGGGAGGCGGAGCTTGCAGTGAGCCGAGATTGTGCCACT 

GCACTCC AGCCT GGGTAACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAANTTTT 

NAAAAANNTTTTNN TGNNG GGGGGGGCNTTTAAAAAANANGGGTCCCCCGGGNNGGGGGG 

AATTNNNTTTAACNTTTTTCCCCCCNCCCCCNNG 

Sequence 141 

CGAGGTACAAGTTGTCT TTATG CTGCGAGATAAGTCCTCTCTTGGTTTGAGCTCCCACCT 

TTTCAGTGAACTCTTACATTTTGGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGA 
GTATTCTTT 

Sequence 142 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAACTTAAACACCACTATTTGTTGAGA 

TGAAAAAAAGCATATATAGGAAGCCTTCAAAATGAAATGGTCAAGGGTGAGTTTACACAG 

ATAGATAGATTTAGGTCTCTTCCTTTTGCTTTGTGAAAGCATCTAGTGTTTAGGTGTCAG 

AGAGGGAGATATCCTTACAAAGCAGAGATTATCATTACAGGmACATTTCTTACAAAGA 

GTTTCAAAATAAACAGGTAAATGCCAAAAACATATATTTTGGAGACGGATTAATTCACTA 

GTTGGTCTATTCAACTTAACTTGTTTCCTAATGAGATTAMTTCATGCNCAAATAACCAA 

ACCAAAAATTAAACCAAAAGAATACTCCCAGAAAGGATGTCCTTTACAAGAGCAGATCCC 

CCAAAATGTAAGAGTTCACTGAAAAGGTGGGAGCTCAAACCAAGAGAGGACTTATCTCGC 

NNCATAAAAACAACTTGTCCTCGCCGCTCTAAAACTANTGGAT 

Sequence 143 

GCTCCCCGCGGTGGCGGCCGAGGTACCTCAATGGTTTGGAATGCAGAAAGTGTAATTAAA 

TTTGAAGCTGAAAAGGAAGGCACCTTGGAGAGTCCTGGCTCTTATTAGCTTTCANGCTAG 

GGACAAACAGATGCCACAGACCATTTGAGGGCTGTTTGTTGGATGATGCTAATTGTTTAC 

TGTGCTTATTATTCTGTGACATCCTTAGAATCACCCAGGTGGGGAGATGTCAGACTAAAG 

TTACCAATATCTGTGCTMGAAAAACTCAACACTTCTATAGTTTAAGTCCATTTTGGTCA 

AAGC AGM GGATTCAMGCCATTCTGAGGTTTATGAACAAGATTAAGAAAGCACCTGGCA 

CTG ATTTTAGCTTGTAAAAGTGG AACC CTCAG AAATTATTTCAG ATTATTTC ATAC N CCT 

AGAAAAAATATATAGAGAAAGGTGTTTACNAAGAAGAAATNATATCAAAAACCAAGTACA 

TTATTC AAAGGTCTATTTC AAATTAATAATC ATTTGAG CATAATTTCTCC CTGTC AAAAA 

C C ACTG GTATTTTATTTTC . 

Sequence 144 

CTCCCCGCGGTGGCGGCCGAGGTACTATGTCCCTCTCAAGTCCTGAATCTAACTTAGGGA 
GTAGCCAGGAGACTATGAGAAGGAACTACTCCAGGGAAAGAACATCCCATGCCCTTTCTG 
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AAACACAAGTGGCMCCATMGACACCACTCTTTATCCTACCTTTTAGCAGACTATGCAC 

AGTCCCAGGAACGAGGCCAGAACTGATAAGCAAGTGTGGCATTGGCTCCAGCCAAGGTTA 

TGGGATTCGGGAACCTCCCCATCCTGGAATTGGGCTGGGAGGGGTTTCACCTGGGAGCCA 

CAGTTTTATGGCCTGGGGTTATATTGCAGTCCAAAGACAAACTGCTTGTGATT 
Sequence 145 

TTTGi ITTTl iGAGACAGAGTCTCGTTCTGTCACCCAGACTGGAGTGCAGTGGTGCGATC 

TCGGCTCACTGCACTCCAGGCTGGGCAACGGTGTGAGACCTGGACTCAAAACACACACAC 

ACACACACACACNCACNCACACACACNAAGCGGGAAAAATTCTGGTGCTTCGCAAAGTCA 

TCACMTACAAATCAAMTAMTTTGTCTTAMGCCACTTAGACAATACTAACCMCTTT 

CACCAGGTGTTATCCATGCACAACCAAAACCTTTTACTGTGAGAGCGAAGCTCTTATGGA 

GTTCTACCTCATCAGTGNTCTACCATTNTAAGATATNAAAGACTAGTNGATTCTATTCAT 

GAAACCATTTTTCTTGGCTTG 

Sequence 146 

GCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAATTTCTAGGAGCAA 

AGTTCTAGGGGCTGTGTCAGGCCAGCTGAGTGAGAGGGTGGTGTGAGGAGGCACAGGCAG 

GTCCTTACTAATCACACAGCATCAGAGCTGCCAACTCGAGTGTCGACAGCCTTTCCTGGG 

CACACCTTCCTGAACTCACTCTGAAGGGAGGGAGGCTATTAGGAAAGAAATATAAGCTGA 

AAATGCCCCCGAAAGGTCCACTCACAGAAAACCAAGAGCTGGCCGCCACCCTGCCAGCAG 

TGGTGACGGGAATCATGCCAGTGACTCANGCANCAGCCACAGCTTCCAAAGTCCTGGAGC 

CGGCCCCGCCCAACACACCGGATGCCACAAAGCACCACCTACCTGAGCATACTGTTCCTT 

GAGTGGACCAGAGAGCCGGAGGACAGCACAGACATCAGCTCCAAGTCTGCTCTTGCANCG 

CCTTCGGTCACGAAAGTCCTTTCCCGGATTTCCTCTTGCCCGCTGAACAAGCAAGTACCC 

TCGGCCCGCTCTTAGAACTAAGTGGATC 

Sequence 147 

CCGCGGTGGCGGCCCGAGGTACACGCCTGTAGTCCTAGCTACTCAGGAGGCTGAGGCAGG 

AGGATGGCTTGAGCTGAGGAGGCAGAGGCTGCAGTGAGCCAACATTGCAACACTGCACCC 

CAGCCTGGGCATTAGAGCCAGACCTAGTCTCAAAAAATAAATAAAATAAAATAAAATAAA 

ATAAAATGTTTAATAATGCCTACTCAGACAAAGAAAGGCTCAGGCAAAAGACTGAGTCTG 

ATCTTCTACATTGAGGAGACCATGTTGATTAGTGTTATATTCAAAAAAACCAATTCCCAA 

GTMTGCATACACAGCTTGAMGAAAGGMGTATTCCAGCTACACCCTCCAAGTTTTCTA 

CATCTCAGTAAACGGCATCGCCACCCACCTAGAAACTGGGCATCATTTGGTTCCTCGTTC 
TCCI I I I I I 1 I I I 

Sequence 148 

CGCGGTGGCGGCCGAGGTACTTTACTCACCCTTCCTCTGACAGAAAAGGATGAAGTCAAG 

GG CCTGGTA GAGGCACCACTAAGAAAGGCATCTGAAAGGACCAAAGAGAGTGACCAGCAA 

GCAi mi I ACAAGGCTGAGGAGCTGACAGCTTCCATGAAAGGCTGGACCACCCAGTGGT 

GAAAAGCATCATCTGGGTTACCTTGTGCTGCCATAAAACACACCACAGACTTGGTGACTT 

AAACCACAGATATTTATCTTCTCACAATCCTGGAGGCTGGAAAGTCTGCAATCACGGTGC 

CAGCATGGTCAGGTTCTGGTGAGGGCCTTTTTCCTTCTCACTGTGTGCTCTTTCTTTGTG 

CATGGAGAGAGAGAGCATGAACAAGCCCTCTACTGTCCCTCTTAAGAANGGCACTTANTC 

CCATAATAAGGCATTCACCCTTCTGACCTTGGCTAACCCTAAGTTNAAGTNCCTTGCCCC 
GGG 

Sequence 149 

CCGGGCAGGTACGCGGNGAGACTGANTGGGGTCACAGCACANGGCACTGTCTTGCCTGGC 

TTTATCTGAGCCAATCACACCTNTCCTGGCCACTATCTGTGGTCTAGCCCCCTTTGTGCA 

TAAAGAGAAAGAAGAGCCTTGAGGACCAGCCTAGTCAGGCTGAAGAAATGTCAACAATTG 

GGAGTTTTGAAGGATTCCAGGCTGTGTCTCTGAAGCAAGAGGGAGATGACCAACCCTCTG 

AGACTGACCACCTATCGATGGAGGAAGAGGACCCGATGCCAAGACAGATTTCAAGGCANT 

CAAAGTGTGACCNAATCAACTCTTTACCCCAATCCTTATCATCAAGCCTTATATCTCACG 
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GAAGTACTTTGCTACA 
Sequence 150 

CTGCCCGCGGTGGCGGCCCGAGGTAC I I I I 1 I I I I ) I M 11 I 1 GTGATGGGGTCTTGCTC 

TGTCGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACTGCAAGCTCTGCCTCCTGG 

GTTCACACCATTCTCCTGCTCAGCCTCCTGAGTAGCTGGGACTACAGGTGCCCACCACCA 

TACCAGGCTA I I 1 1 1 1 1 1 1 1 1 I GTATTTTTAGTAGAGACGGAGTTTCACCGTGTTGGCCA 

GGATGGTCTTGATCTCCTGACCTCGTGATCCGCCACCTCGTCCTCCCAAAGTGCTGGAAT 

TGTAGGCGTMGCCACCGTGCCCNGCCAGAMGCACCATTTTTTTATATACCTTTAAAAA 

AGAAAAATATTGCTMCTAMTTAAGACATGTCTTCTGTTGTAAGACATATTCTGGTTTA 

AGATATGTTAAATGGAAAAAAAACAAACAAACTCAAAACCCATGTACCTGCCCGGGCGGG 
CCGCTTTANAACTAG 

Sequence 151 

CGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAG 

AGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGA 

GAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGA 

GTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGT 

TCACCTTCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GAGNCCNGANGAGCAAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAAGTTCCT 

TATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATTCCTTGAAATTTGC 

Sequence 152 

CCGCGGTGGCGGCNCGAGGTACCACCACCTATCCACACTCTCACACAATCACCCCTTTTG 
AAGTAAAGCCTATATTTTAAAAATAACCATTACAGATTGTGAGTGCAGTCTAGGATCACA 

TAGCAGAACTAAAAGCTAACTGGCCCACCCACAAAGTAAATGTGAGACAGAGGGCTCGGG 

GTAGCCATACTCTAGCCAAGTAGTACCTGCCCG 

Sequence 153 

CGAGGTACTTTTGGAGTATAAGAGAGTCAGTGTTTTCTAAAACAGTTGTTCTTAGAGAAT 

AATAATGATATCTACCATTACTGAAAGATATTTTATTTCACCCTAACCATAACTATATGA 

GGTAGGTGTTGGGTTCTCTAGAGGACCAAATAAGTGACAACCTCATTTTCTACCTGAGAT 

CTGAATCCATGGAMTCTTTTTTAAAAGATTAAAAGATTGAGAGATTAAAAGACTGCTTT 

ATAAATGATGTTAAMGAATGTAGCTTTTTATTTGTTGGCAAATGCACTTCCTAACCCTG 

AACTTCCCCTCCCAGTCACAGACAATGTTGGGCCAAGGTAGACCTCCTTCACAGAAAATC 

CTGCTCTTGCCAAACAAGAGTATTGGAATAACAGAGTGAACTGTGCTTCCTATGAGCAGT 

TTGCTCCAAGTGTAATTTGGAAGGGCAGAGCTACACATCCCTAGTTCCTTATTCTCATCA 

AAAGAATCACTATGCATGAGAGGATAGAGAGTTGCTACAAGTATTTTGCTGCTGGAAGTC 
CCTTT 

Sequence 154 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATTC 
C 

Sequence 155 

GCGGCCGANGTACCTGGMTTCAGGTTACAGGTTTTCCTTTTCAGGGCAGGGTGCCCTCC 

CATCTAACCTCTCTACCCAGTCTCCCCCAAACAAGCTGAAGGCACTTACATTGGCATGTT 

GCTAGGCATAAGGTTACTGCAAGCAGCAACAAAGTCCGCGTATCCACAAAGCTGAGCATG 
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TCTAGCACTTAGACATGCAGACTCCTTGTGTCGCAAAGCCTCTGGGTCACCGGCGGAGGT 

ATCACCTGGCGGGCGCGGGCATGCAGTCGTGGCCAGTACCTGCCCG 
Sequence 156 

CGAGGTACCAGGCTAGCCTTCCTGATGACTGCAGCAGCAAGGCTGCCCATGGGCCACCCC 

AGACAGCCTGCTTAGAATCTCTAGAAGAGCTGCTTGATGTAAGACATTAAGACATTCTTA 

AACGAAGCCAGTCTTCAAGGACTAGAATAAATCCTTAATTCTTCCAATACATAGACATTA 

ACGTAAAGCAACAAGAAACAAGGAAAGTCAAATAGACATAGCACTCCCCAAAGCATATAA 

1^1 CTCT ^ G ^ GATCC/ ^ GATA CAMCTGCCTGACAAATAATTCAAAATAGCTCATT 

•AAAAAATGCTCACTAAACTTCCATAGGATACAGGGAAACAATTCAATGGAATCAGGAAAA 

CCAATAMTTACCAAAAAGAGGATATTTACCAAATTACCAAAATTACCAAATTACAAAAT 

TACCCAAAAGGAGATATTTMCAGACAGATTGAMTmAAMCCAACCAACCAACAACA 

NCNACAACAAAAATCAAACAAGAATTTCTGGGGCAGAAAAATTAGAGGTGGAACGGAAAT 

A^CCCACCCGGNGAAAATATCAAGTTACCAGAATTGGATCAAGTTGGAAAAAAGATT 

TTTTCCATGAATTTGGAAGACAAGTTATTTGGAAAATNTACCCGTCNGGAAGAGGAAAAA 

ATTNAACCANATTTAAAAGGGGANTTAGGAAATTTAT7TTTTAGAAATTTTTAAGGACTT 
TTGGGAAAA 

Sequence 157 

ATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAAGAAGGGCTGAGGCTTCTGTCTGAGT 

TGACCGTGACCTGTGGTTACAGCATTTTAGGACTTCGTAACAGAATGATCCACAGAGGTC 

AGCAGGCCCTTGAAAGAACGCTGAAGGAAAC7TGCATACAAACAGTCTCAAGGAGGCAAA 

CACATGCTTCAAGCGCTGCCCGCGTACCTGCCCG 

Sequence 158 

GCGGCCCGAGGTACACCTGTAATGGCTACCCCGCTGGCTCAGAGCTCTGCCTCACCGCCC 

TGCCCCGGCACCTGCAAGCCAGTGTGCCACTTTTTTGATG7TGTATGTTTTTAATATTCC 

CTGAGACAGACTCCAGAGTTCTGAGTTTTCTTACTTGGGGTGTCAAATTATATTCCAGGA 

ACCMTTTGGGTTTTCCTGGCTTCTGGTTAAAAAGCTGGACAGATTTTGAGTTGTTTGCT 

CTTAGTTCTCTCATAAACTGCACTATTTTTAGGGCACTATrTTGTAGAACACAAGATTTT 

TCAGGAAGTATATTGAAAGGCCTGTACCTGCCCG 

Sequence 159 

GGTGGCNGCCCGCCCGGGCAGGTACTTGGTGAAGTGAGATTCAGACTGAGTGGGGTCACA 

GCACAGGGCACTGTCTTGCCTGGCTTTATCTGAGCCAGTCACACCTCTCCTGGCCACTAT 

CTGTGGTCTAGCCCCCTTTGTGCAGAAAGAGAAAGAAGAGCCTTGAGGACCAGCCTAGTC 

AGGCTGAAGAAATGTCAACAATTGGGAGTTTTGAAGGATTCCAGGCTGTGTCTCTGAAGC 

AAGAGGGAGATGACCAACCCTCTGAGACTGACCACCTATCGATGGAGGAAGAGGACCTGA 

TGCCAAAGACAGATTTCAAAGGCAGTCAGAGTGTGACCCGMTCAAACTTTTTTACCCCA 

AATCCCTTATCAATCGGCCTTATATTTTCAC 

Sequence 160 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGGAAGAACA 

TGGGAGTCTGTATGCAGCACGACGTCTTGTTCAGTTACCTCACTACTAAGGAGCACCTTC 

TCCTATATGGTTCCATCAAAGTTCCTCACTGGACTAAAAAGCAGCTCCACGAGGAAGTAA 

AAAGGACTTTAAAAGATACTGGACTATATAGCCATCGTCATAAGAGAGTTGGAACACTGT 

CAGGAGGCATGAAGAGGAAGTTATCTATATCCATAGCTCTCATTGGTGGATCAAGGGTAG 

TAATTTTGGATGAACCATCTACTGGAGTTGACCCATGTTCTCGCCGAAGTATATGGGATG 

TTATATCCAAGAACAAAACTGCCAGAACAATCATTCTGTCAACGCACCACTTGGACGAGG 

CTGAAGTGCTGAGTGACCGCATCGCCTTCCTGGAACAGGGGTGGGGCTTANGTGCTGTGG 

GGTCCCCCATTTTACCTCAAGGAAGCCTTTGGCGATGGGTATCACCTTACGCTTACCAAG 

AAGNAAGAGTCCAAATTTAAATGCAAATGCNNTATTGTGACACCATGGCCCGNGACAGCA 

ATTGATCCCAATNACATNTTCCCCNAAANCCTTACCTTAANGGGNGGATATTTGGGGGGG 

ANAGCTTGGTTTATTGTNCCTTTGGNCCGCTTTANAACTAAGGGGANCCCCCGGGCTGGA 
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NGGMTTTCNATTTNTAANCTTTTNNGGATNCCGGCGNACTTGGGGGGGG 
Sequence 161 

CGAGGTACATTTGGCTGATACTGCAAAAATGACGGTCCCTGGGTTAACGGGAGGCATGTC 

TGGCTGCTAGGTCATCCCGAGAGGCAGATTGGGGCAGGTTTCTAGGAAGTATAGGCTCGC 

CTTGCGGAGGACCAGGTTGGCACAGTTTGACTGCTGTGCACTAAGGCAGGGCACAGAGCT 

GATGCCCTCAAGGTCAATTCCAGCCACATTTCTCTGTGATTCTATGACTCAAAAGCTTGG 

AAACMCCTCAMGGA7TAAATAAATGGGGGMCAMGTATTTGCTCTCCTCTATTAAGC 

CTTGGCTGGGAAGTCTTTCTTCCACACTCCTCACATGCAAAATGCTGTTTGTGGGAGCAG 

AT TAAC ATTTCAAACCACTTCCTGCMTTAGACGGGMCACCTGGTTTTGCAATAACTCT 

CATTTTCTGTGTGTGCTGAGAGG Mi l l 

Sequence 162 

CCGCGGTGGCGGCCCGAGGTACAGAACTTAAACACCACTATTTGTTGAGATGAAAAAAAG 

CATATATAGGAA GCCTT CAAAATGAAATGGTCAAGGGTGAGTTTACACAGATAGATAGAT 

TTAGGTCTCTTCCTTTTGCTTTGTGAAAGCATCTAGTGTTTAGGTGTCAGAGAGGGAGAT 

ATCCTTACAAAGCAGAGATTATCATTACAGGTTTACATTTCTTACAMGAGTTTCAAAAT 

AMCAGGTNAATGCCAAAAACATATATTTTGGAGACGGATTAATTCACTAGTTGGTCTAT 

TCAACTTAAC TNGCTTCCTAATG AG ATTAAATNC ATGCAC AAATAACC C AGCC N NANC AT 

TAAACCAAAAGAATACTCACCAGAAAGGATGTCCTTTACAAGAGCANATCCCCCAAAATG 

TAAG AAGTTC CTG AAAAGGTG 

Sequence 163 

CCGCGGTGGCGGCCGAGGTACATACTTTCCTGTGTCTTGCTGTTTTCACTTACTATATCT 
TGGAGCTATTTCTATATTGCCACATTCAAAGCTTTTTAMGCATTTTCAGGATAGCTA^ 
TATAGTCAAGATGTAT CACAGTT TTGTCTTATTTACCTGTCCCCCCACTGATGAACAGTC 
TG GTTGCT TC TGTTTGTTTTTTTCTC 

TTAl I I I I AAA I I i I I 1 I I I ATTTTTAAI I I I I I I S I I I AATAGAG AC AAGGTCTCACTG 

TGTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCCTCCTGCCTCAGCCTCCCA 

AAGTQCTAGGATTAC AGACG TGAGCAACCACGACTGGTCAATAGTTGCACTTTCTAATGT 

GTGTTTGTAGGACAGTTTTTCAAGTAGTGTAGGGCATGAAGGTTTATAGTTAGGTAAAAA 

TGGTTGCTAGATTGGTACCTGCCCGGGGCGGGCCGNTCTANAACTAGTGGATCCCCCCGG 

GCTGGNAGGAAATTNGATAT 

Sequence 164 

CCGCGGTGGCGGCCGAGGTACGCGGGTATGTCTCGAGAAAAAAGCCAATACAGTCTCTTT 

TGAGGACCTTGAAAACTAACCTGCTAGGGATATCATACTAGGCCTCTGGGTGCCGCTGTC 

CGCCAATCCGTGGCTGGAGGGAAACAAAGCAATCCACTGACGCAGCAGTCCCCGCAGGCT 

TTTGAGACACACATTAGGAGAGGACTCTGCCCGCTGTCTGTCTCACAATCTGCTCGAAAT 

ATCTTGGTCAGGAATCTCAGGATGTCTAGTGACCCCGAGTGAGGTGCTGCCTTTTATTCC 

TGAGCATCTTTGGAAGCAGGAAGCAGGGAGCAACGGTAGCGAGAGGTATTCGGATCTGAG 

TATCAGCAGGAAGAAGAAACAGAGGAAGAAGTCCCCCATAAGGAAGGCGTGAGGAGCGCA 

NGAAGCGCTGTTTCCCTGCCTGAGATCTTTATTCTACTAGGTC 

Sequence 165 

CCGCGGTGGCGGCCGAGGTACACACAGTTAACCCAAAACAGGCCTCTCTGAAAAAGCCAT 

TGCCATGGACTGCCAGACAGACAATGACAAGACACAAATACCTTCTGGTGTGTGAGCCAC 

GGGACATGTGAGCTTCCCGCTGATGCTCCTCTTATATCAAAGATCACTTTCACAAGATGA 

GCGACTCMTATCTTTTATCAMCCMTGATCACCTGCAAGCTATGGTATATTTTTGCAG 

CTGTGTAGAGCTATGTGGCATGAGAATGTGGGACTTATAAATTGCTGATCCAATAAATAG 

ACATTATGGGCAACAGTGTCTTATCAGCTAGTGTGTACCTGCCCGGGC 

Sequence 166 

CGAGGTACTCAAGCCACCTGCCTGTCATGGCCCCCCAAAATGCTGGGATTGCAGGCCACC 
ATAACCGGTCATCGGGGCAACTATCCTTCAGGGCTTCCTCCAGTAGGGCTCCCCAGGTTG 
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GCCTTGAGGGCCACAATAAAAAATCTGACTTAGCTATACTCCTTCCCTGTCCCCAACCAC 

AGTTATTCTTGTGGATGAGGCAAATGGCAGCAGGACAGCTGGAGTATACCAGAGTTGACT 

CATTTATTTTATGCATGCrrAGTGTGGCATCCCTGGGGACATGAAATTCCAGTGAAATCC 

AAGTCTTTGTGCTGGAGAAGCTCATAGATCTGTGGAGGAGGCAGATGCCCACCCACTCAC 
TGTACCTGCCCG 

Sequence 167 

CCGGGCAGGTACTCTTAAAATACCCCCTTAAAAAAGCTAAMCACCACTTTGGCAAAAAT 

TAAGGCATTTAACAATTCATGTCTTAGTGTATTTCCTGTTATCATTTGCTCAGATATCAT 

MGCTGTGGTTCATTCATCAAT TTCCTA CAGCTTTGAATCTAAGGCTACAAAAGTCCCAC 

TGCTGTTAAATCCTGGAGCAGCTTTTTACACAATTCTACAATCCTCTTGTCTTCAGGTTA 

CCGAGAGCCTATGAAAGCCTGGAAAAGCCCCCAAGGGAGTGCCACTGAACAACTCTTCGC 

CAGAAGTGGGATTTTGCTTGAATATATCAGCCAATCGATAGCATNCACAGAGCAGTCAGG 

GAAGTCTTTGAAACTTGNCTGAATTTACACACAAGATCAGACCCACCGGCTGTTTGGGCT 

TAATGAGACCTTAGAGATCACCTAGTCTGGGGTTTTCATTCTCCATATGAAA 
Sequence 168 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTCACATGCTGAGGGTAA 

GAGGCAGMTTTCTTTTGC7TGTAAGTAACAGGATCTTTTCCAGAGCCTGGGTCTGAAGA 

CCCCAGGTGTAGATGGAACTTCAGTTATGACTGGATCCGTGAGTTCAAAGGATGCCACTC 
CCCGCGTACCTGCCCG 

Sequence 169 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTAGGA 

CTATGGAGAAACAATGGAAGACATTAATTCTGCTTGTCTTGATGGTAAGGGTGGGGATGA 

AAATACAGATCGAATTTGGGGGAGCATGTTAAGGTTAGGGGAATAGAGATAGCTTATATT 

TAGAACAAGTTAGTAGAGACTGTATGATTTTCCAAATAGAGAAGGGTAAAAAGACATTCC 

ATAGAGGAAGAAAATTACATGCAAAACCACAGAGACCTTAGTTTATAAAGAGTAACTGGG 

TATGGCCGGGTGCGGTGGCTCACGCCTGTAATCCCNGGCCTTTGAGAGGCTTGANGAGGG 

GTTGATCATTGANGTCAGGAGATCGAGACCATCCTTGGCTNACACGGTGAAACCCCGTCT 
TTACCTAAAAA 

Sequence 170 

CCGGGCAGGTACGCGGGGGGACTTACAAGATGGATAGGACCTGGTGGCAGCTGTCCTTGA 

GGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAGAGTACCT 

Sequence 171 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCTTGGCATCACATGGTTTGGTTCTGGCAC 

AGCCAAGCCTAGAACCCAACCTTCCTACTGCCTGCTGTGTTTCTTCACCACCCAGTTGCT 

CCTTGGCAAGTGA\CCAATTCAAAGGCTGTCTTACTTGTCCAAGGACAGAGAGAAAGGTG 

CATCCTGGGTGACTGGAAATGGGAGTGGGAGGGAGAGGATGGAGGCAGCAGACATTGAAA 

GTAAGTATTGACTAGACTTGGTGACAGAATGGCTCAACTCGGCTCAATGAAAAAAGGGAG 

TCCAAAAGCCAAACTT TCAACCC TGGTGGGATGGCTATACAGTTAAAGTAAAGTAAAATC 

AGTAGMTGGTGTTGGTTTTTTTGCTTCATGATGAATTTAAAGGGACTGCCTTATNCAGA 

TAGAAATATGTTAAAGATATAGGTAGGAGAGGTAGCTCCATTAAAGATCAAGATTTTGAG 

ATTAATGCCACA/vNAAGGAGAGCTTGAACCCTTGTACCTNGGCCCGTCTAAAACTAATGG 
ATC 

Sequence 172 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACATTGTAGTGGGTGTGA 

TGGAACAGGCTCTCAGCTGTGCATTGGGGTGCTTTAAATTCTCATCAAGGAACTCATCTT 

TTGGAGTAAGATACTGTGTCTCCGGGAGGCCTTTGCTGGAAAGAAAATACATTNGCTGTG 

AAAGTNCATAAAGACAATGAATCCTTAACTAANGTTAATAACAGAGTGGGAGCANGTTCT 

GGCCATGTGTCAGGCGATCATAAGCANCAATTCTGAACGGCTTCCCGATATCAACATCTA 

CCANCTGAAGGTGCTCGACTGCGCCATGGATGCCTGCATCAACCTCGGCCTGTTGGAGGA 
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AGCCTTGTTCTATGGTNCCTGCCCG 
Sequence 173 

CCACCGCGGTGGCGGCCCGCCCGGGCAGGTACTTACTTTGATTCCTCTAGTGCAAGATTA 

TAGTGGGGTTATACCTGAGACTTCAATAMTGTTTGACTAACTAAACTAAAATAGCTTAG 

GGTAAGGACTACTTCCCCAAACGCCCTnTAAACATGTGAGAAAGGGAATCTCCCTGACA 

TACTGGTATGGCCATTTGTAGCMTATACTGAGAGTGACTTGGGTGATTTTCTGGGGTGA 

TCAACCACATTCCATGAGCAGGTTAACTGTGGAAGACACCTGCCCTTGAGCATCGCGTTT 

GGGCCGCATGCGTCMTGGGGAMTTTGTGTTTCCATTCTGCTTCTTGNTTTGCCTTCAC 

MCTTCAAGGGATAGMGCGTATTCCATTITTAGTTMTMTCAAGCTCTCTGGGGCTTC 

CATGTAATGAGTCAAGANACTGATGACTGACTCCACTCTGCTCATTATCATGTCTCCCAA 
AAACAACC 

Sequence 1 74 

AGGTACACACAGTTAACCACAAAACAGGCCTCTCTGAAAAAGCCATTGCCATGGACTGCC 

AGACAGACAATGACAAGACACAAATACCTTCTGGTGTGTGAGCCACGGGACATGTGAGCT 

TCCCCGCTGATGCTCTCTTATATCAAAGATCACTTTCACAAGATGAGCGACTCAATATCT 

TTTATCAAACCMTGGTCACCTGCAAGCTATGGTATATTTTTGCAGCTGTGTAGAGCTAT 

GTGGCATGAGAATGTGGGACTTATAAATTGCTGATCCAATAAATAGACATTATGGGCAAC 

AGTGTCTTATCAGCTAGTGTGTACCTGCCCG 

Sequence 175 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGAC 

TGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGT 

GTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATG 

TCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTG 

CTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAG 

CAAGACCACAGACTCAAGACGGACATAATTGGA I M I N I I GCCATGGCCTGGAAAGAAA 

GGTACCT 

Sequence 176 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGCACAGCACA 

GTATGGCTATCATAGAGGGAAAATTAAACTGAGTCACAAAATGCATTTTATAATAACACA 

GTAATGCATTTCATCCCTGGGTATAAATTGAAAATGTCCTATTACAGAGTGTTTTAAAAA 

TTCACTGGMTTATTGTTCTAGCATAGGTAAGTTTGAATAAAAAGAGTAAAATGAAATCA 

GTCTAGTCTTAAAATTGMCTTTTAAMCAAGAAAMTCATTACMCTACTMGATTATT 

TTTAAAGGTATCACT TAAT GACTTMTTACATCTTGCTATCATTATCTCTATTTTGTATA 

CMTCCATTT(;TGTATTTTACCATMCTGGGAAAATTMGMTATACGTGATCTTTM^ 

AGATCAAAAACCTTCATATCCCATGCTGATTATTCAGCTGCCTCTGAATCCTGTATCTTC 

ACGCTGCTCTCACGTGTGCTGCCTGATAAGTATATGTGAGGTTGTAAGTTATTCCTTACT 

CGCGGTTCCGGGTGCTCTT 

Sequence 177 

ANGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGACAGATAAATA 

CAGTTATTrCCGGCAATCATTAGTCTCAATAAGAAAATGATTCTCTTTTCTAAGATAGTC 

TAAGAAGTGGGAGGCTTCCAAACTAAGTGTCAGGACTTACCAGGTGGTAGGTGGGCCATG 

ATATTATCAATCCCCTCAGCCCCTCTGCACATAGACTGAGGCATCCAGAGTCCCTGGATG 

AGTCTTGCCCATGCTGCCTTGTMTATATTCCAAATATTA^ 

CC AAAGA CAGGGGTCTCTCAAGGTTACCCAGGCAGGCATTGAACTCCTGGGCTCAAATGA 

TT CTTTT ACCTCAGCCTCCTGAGTAGCTGGGATTGAAGGTGCCCATCACCACGCCTGGCC 

ACTTTTGTGTCTTCACATGAAAGCTATTGGAAATTATTGACTAAGGAACCTCATATGATG 

AAATTCTACGTTGAAAAGTTGAGGTAGAAAGTGGGCAGATAAATTTTCCTACTTAAGAAT 

TATAAATCTATAGAGATAAAAGTTTATTATGAATGCTTANGTGTCTGGATTCTTTTGCTG 

TATTATTC ATCTTCTAGTTTC CTG N CTTAATAGTC ACCTTTTAGTG AACCTGTTC AAATN 
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AGGACTTTAATGAATATCCTTATTGGGGTAAATCTCTATTCTCTGCTTCACTNCTTTTAT 

ACCTTAATTTGGGGGGATAAATGCAAAATTTTAAA 

Sequence 178 

AGGTACTCATCTGTGAGAAGTGGAATAATTCAAAGATTCACAATGGCTTTCCTCCCCCAA 

GGTCATCAGTGTTAACTGCCAACTAGATCCCTAGCCCACCCTCTCTGCTCATCCGAGCAG 

CAGATGCCAGGTGGGAGTTGCTTTTATGCACATGGGATGTCACTGTGACTCTGTTCCTGT 

CACTCTGTAACATAAGCTCATTTCATTTTTCTCTAGGTATATAGGTATGGATTTAATTAT 

TTTAGTCATTACATAATAGTCCACAGTTTAGATTGGCCACAGCTAACCATTCTGTATATC 

ATGGACATTCAGGCTCTTATAGTTTTAAGCTGCTTAATTATTCAGACTATGATG 
Sequence 179 

AGGTACAAACCATTGCAGGAGCTGAAACATCAACAGCTACCCATACTACTGCTAATAATT 

CAGGCTCTCTTGGCTCTGAGAACTTCAGCAAAAAAGCTGAAAAGGCTAACCTATAACATG 

TAMTAGGTTCTCATTTATCATCTTGATAACTCCACTTGGCTGATTCAAATACATATGGC 

TACACACCATAAGCTAAAGAAACACCTCCTAAGAGCCTGCTAGATGTCAGGAAAGCAATA 

AAGGGCAAGCAGGGAACCCAAAGTTGGCCCCAGGGTGTCACGAGTCAGGAAAGTAATATA 

AAGAATATGGCATGTAGCTGGGTGCGGTGGCTCACGCCTATAATCCCA 

Sequence 180 

AGGTACATGAAAATGCACAAGACTCTTCCCACCACTGACCTCAGTGTCTGCCCCAGGAAG 

AAGMGAAMGTTTTGGCTCGAAAATGCTCTTCAAGGACACAAGGCGGTCACCAAATTAC 

AGCATGCAGCTTTCAGGGGATGACTAAAAGGATGAGGATGATGACTCATAGCTGGGGACA 

GMGCGMTCCCTCTTGCTGACCTGTGCCCTCCTTTGCACCGGTGTTCTTTCTCAGCCCT 

CCTGGGGTGAGGATGTGACTGTCCCCTGGGAAAGCTTGGCAAGATAAGAGTAGGAGCACA 

MGMGGAGGTAGATAGAGAGTGGTTTGCAGTCGATACTCAGTGGTTTGGGTAGGC 
Sequence 181 

AGGTACGCGGGGGGGTCCCCAGTGGAACATGCAGTTCTTAGGACAGAGCAGGGGATGGGG 

CCAGCTGACCTAGTGAGGAAATTTAGGCTCCTGCATCAGTCTTTCTGCATAAGTAAAGAA 

AGGACAGTTGTGCTAATTGACATTGGCTATGAGGTCATAGCCACAGACAGGATTATGAAA 

GATGCAAATATGCGAAACATGGCACATGACCTCCCTCCCTCAAGCTAGGCCCTTGCTTTG 

TGTGGTTGGGAAATGTTGGGAGTAGCAGAGAGAGGACCTAGGCAAATAGATGGGCATTTC 

CTTGAAGAAATCGGTTTTCCAGTAACATCTGGGTTGTGTTGCAGCTGGT 

Sequence 182 

CCGGGCAGGTACGCGGGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTATGTGTA . 
TGTAAAACAGAGAAAGATAGTGGTCCAGAGGCCAACACACCTGGAATCTATTTGTGAGGG 
TCCCTGCTTCAGTCCACGAATTGGGCCATTCATTTATTTCTCAGTAAGTGAACATCTGGG 
ATTCAGTTTGAAACGCCAATAAGT7TGGATGGGATCGCAAAGCTGTGGTTGGGATGTGGA 

ACTGCTCGCCTGAGGTGGGAGTGAGCGAGGTGAGCGGGGGCTCTTAAACTATTTCTGCAN 

ACTTCANTTCAAATGCAATANCCAGAGGGTTGAGA/v^ 

Sequence 183 

AGGTACGCGGGATATGGTTTGAGATATTATTGCTTGTGTTTACATTTCCACATTTGAGGC 
AAAGTTAGACTCCAAG7TGAATTCATTAAATTTTGA ITCCAACA7TCTGTTCTTCCTTTG 

GGCACAGAATTCCTATGAAATGAAACACTGGGTrTGTTCCAGAACAAGAACAGACTAAC 
CC AAAATTTCTC CTG GGGG AAAAAAAAAG AGTAA AATTAC ATTTCTC ATTGAAAATTGTA 
TTTCTCATTGAGGAAGGGTGTGTTACT7TATTATATAAAATGCCTTATCTTCATTATAAG 

AAAAAAATGCCCTTGCATTTGTCACTGTTGCCA7ACATTAGGCCTGTA 
Sequence 184 

TTCCTCCTTTGTGACAATGGGGACAATAACAGAAACTGCCACACAAGAGGGATGTTAGAG 
AACTGACCAAGCGAATGTAAGTAATGACTTGGAGCAACGTATGGCTAAGAGTAAGTTAGG 
GGTTGTTACGATGACTTTCCTGAGATAAACAGGGAGATCTTTTAGCTGTGGGTCAAGTGC 
ANGAGGACAGTGGAGCTCAAGGCCTAATGGAAACTAACACTAATTGTACCT 
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Sequence 185 

AGGTACMTGGA AACTCA ACGGATTACACTATTTTCTAAAGGGCCACTCATAAAAATGAA 

GTTGGGTTATGG I I I i I M I I ! CCCCCTTCCATTCTAATTTAGGAAGTGGAAAGTCACAG 

CAAGGGCAGGGGCTTACCCACGGTGACACAGCTTAGCACAGCGTCATCATTTCATCCTGT 

GCTTCCCATGGATTCTATCCTCTCAAAGATTAACTMGCATTTTGCTATTTGATCTGGGT 

AGAGGGAAAAGGTGCCCAAGACATGAGAAGATAAAA7TCTTGACAGGAGGAACCAAGGTT 

AAAACATGTAACAAGGTTCACTCACTTAAAAATATATA 

Sequence 186 

AGGTACAGAGCCTGGCACGTAGCAAGTGTTGAAATATCTCTTGGAGAGAGAGAAAAAAAA 

TGMTCTATTAAMTGTCTCTATGATGCACTGCGGAAGCTAGACTTGGTAGATCGAGTTT 

AAGGGMTGAAGAGGGGGTGMTGTTTGCTACTTCCGTATATTTGGAGTATTTGATAAAA 

CAGANNACCANTTTCTCTTCAAACTTCATGTCGGTTTGTAGGCCACCGCTACCAGGGNTT 

AGGGTACCTGCCCG 

Sequence 187 

AGGTACTCAAGCCACCTGCCTGTCATGGCCCCGCAAAATGCTGGGATTGCAGGCCACCAT 

AACCGGTCATCGGGGCAACTATCCTTCAGGGCTTCCTCCAGTAGGGCTCCCCAGGTTGGC 

CTTGAGGGCCACAATAAAAAATCTGACTTAGCTATACTCCTTCCCTGTCCCCAACCACAG 

TTATTCTTGTGGATGAGGCAAATGGCAGCAGGACAGCTGGAGTATACCAGAGTTGACTCA 

TTTATTTTATGCATGCTTAGTGTGGCATCCCTGGGGACATGAAATTCCAGTGAAATCCAG 

TCTTTGTGCTGGAGAAGCTCATAGATCTGTGGAGGAGGCAGATGCCCAC 

Sequence 188 

AGGTACGCGGGAGTAACTAATAGAAAAAACTCTAAAATGCATCGGGAGGGAAACACATTT 

AAAGCTCTGGAGGCATAAAAACCTGCATATAAATTCAGGCCGCACCGATTGATAAGTGTG 

TTACTTTCTCAATTTACTTATTCmCTTMCCTTAGTmCTCATCAGCAMCTATCn 

AATTACATGACTTTGTAATCCATAAAGCTTGCTTCAAGCTCTTAGCATGCACACAGCACA 

TCTGCAAAAGTAAGCATTTGCCACTTTCGGTTCTCAGGATAACATATTCACCCTTCCCTA 

CCTACTAGCAAGCTGTGCTCACAAGATTAATCAGGTAGTGCTAGAGAGAA 

Sequence 189 

AGGTACACGAGCTAAAGGAATCCAATTCACTATCCCAGGGTTGCATCGCAGTGGATTTCA 

TACAGGCCMGAAAAAACAAAMCGTTTTTAAGCCTCCATCCTCAGATCAAAACAATAAA 

C AAAAAATTATGATATCCTTAACAATGTC CTG GGTTAATATTTTATTCCATAAATC AAGT 

TTTCAGMCTCTTTACCCTCTMGAGATACAGATTGAACATGCCTTTATAGGCTCTGAAT 

TTCATAACTTCTAATCTATGCTCATAACCTGGAAAATGTCATCATACTGAATGCCCACAT 

GAAAAGGTACCTGCCCG 

Sequence 190 

AGGTACTTGGACTGTGAGACCCCTGAAGGCACAGACAATGTGGCCCTGGCACTTGGCATG 
CTGTCTAGCACATCACAGACCCTCAATGAATACTATAGGTTGAATTAAATGTAGGACCTA 
AGGGAGTAAAAAGAACAGCGTCCTCCAGCATTTATACAGCTGGAGGCCAGGAATCTTCCC 
TCTGGCTTTGCACTACAGTAMGATATTTACTTATTTTTTTMCAAAGTTT^ 

CTCAGGAGMTACATGTGCATGTTTCTTCTAT/iGGTAAACTCGTGTCATGGCGGTTTGTT 

GTACCTGCCCG 

Sequence 191 

GGGAGGTTGTGGCGGCGGCGCCCTGCTCTrCCCCGCTCGTCTGGGGCrrCTGCTGGCTAG 

TGGGGANMGTGCTCTTCTAGGGGGGCTTiTNTGGCCCTTGNCAGGGGTTTGGAGTTGGA 

GTCACANGGGCCAATAACTCATCACCCCAACACTTNTGAGGTGGCTCCTTGTGCCACNGA 

CTCCAANGAGTCCTTCANCTACTGMCACTNAAAGCTGAGGAGGGCMTTTAGGTTTTNA 

TGCTGTACCANAAATCTTCTGACAATNANTTACTGGAACTTGGAAAAAAGGA 

Sequence 192 

AGGTACACCTGTAATGGCTACCCCGCTGGCTCAGAGCTCTGCCTCACCGCCCTGCCCCGG 
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CACCTGCAAGCCAGTGTGCCACTTTTTTGATGTTGTATGTTTTTAATATTCCCTGAGACA 
GACTCCAGAGTTCTGAGTTTTCTTACTTGGGGTGTCAAATTATATTCCAGGAACCAATTT 
GGGTTTTCCTGGCTTC TGG7T AAAAAGCTGGACAGATTTTGAGTTGTTTGCTCTTAGTTC 

TCTCATAAACTGCACTATTTTTAGGGCACTATTTTGTAGAACACAAGATTTTTCAGGAAG 

TATATTGAAAGGCCTGTACCTGCCCG 

Sequence 193 

AGGTACAAGTTGTCTTTATGCTGCGAGATAAGTCCTCTCTTGGTTTGAGCTCCCACCTTT 

TCAGTGAACTCTTA CATTTT GGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGAGT 

ATTCTTTTGGmMTTmGGTTTGGTTAmGTGCATGAATTTAATCTCAmGG^ 

CAAGTTAAGTTGMTAGACCMCTAGTGAATTAATCCGTCACCAAAATATATGTTTTTGG 

CATTTACCTGTTTATTTTGAAACTCTTTGTAAGAAATGTAAACCTGTAATGATAATCTCT 

GCTTTGTAAGGATATCTCCCTCTCTGACACCTAAAACACTAGATGCnTC 
Sequence 194 

AGGTACTTTAGCTACTCGAACAGCATGCAGACAGGGTAAGAATCACAGAACCTGGCCGAA 

TACACAGCTCATTCAGCATCAACCACAAGGTGGCGCTGCTTCAATGCTGATTCAGATGCA 

CAGCCCTGCTAGAGACGTTTCTCCACCATCAAGGCTACATTTGCTTGTGTCCTTTCCAGG 

CCGGCCAAAAATGCTTTCTCCTTAAGGTTTCAGGTAATACACATATGGCTAACAGTCTAT 

ATTGAGAAAGTAAACCACTGAGGCACAGCAACTCCCTACCCCCATCTTAGGACAGACTCT 

GACTGCACCTGAGAGGACTTATCTCAGCCTCCCTTAGTTAACAAGGCAACAAAG 
Sequence195 

TCGGCTCCCAGCTACTCACAGAGMCGTTTCTGTTTCTGACAGCAGAGTAAGAGAAGAGG 

TGAGMGAGAGACAGCCCATTCTCTTTTGGCCTAATTCCTAAGGAATCATGCCTCTGCCT 

TCGGGCCATTCTCMGTCTTGTTCMGGATAAAATGATTTATTGATGGCCGTAATTAAAA 

AGCAATGCCAACGGAAGCAGCGTCAGCATTTTTCATTTATACCT 

Sequence 196 

CCGGGCAGGTACGCGGGTAGGGATGCAGAACTGGCCACAGTCAGGGCTGGCCAACAAGGG 

ATGGGATGGCMGGGAACCAGTGGGAGACTGTTTACACCTTTGACTTCCTCTTCACAGGT 

CAAGGCAGGACTGTAGCATTAGGTCTCAGAGATGCAATGAACAGGACAAATACACTTCCC 

CTTCATCTGGACCTGAGGCTCTGGGCAAGTCAGTACCT 

Sequence 197 

CCGGGCAGGTACTATGTTCACCCAGGAAACAGCCACTTAGGACACTGGTGAGCTACTCCT 

TAGGGGGTTGTGGAAATAGAGCCCAATCTGGCCTTGACTCCATCGTAGAGCACAATTTAC 

TGAGCAAATTCTAATAAATGTAAATTGAAATTTACATATGATGCTGTTCTTTTCAGAACC 

AAGATTCGTTCGCTTACTGCTTATCAACTCTTCCTCCTGTTTAGGTCTAGTTCTCTCCTG 

TGCAAGATTGCGACAGAAGCTGAGAGGAGCCTTCTGGTGTTTGATCGATACACTCCCTrG 

ACCACAATTTATTCACTGTAGTCACAATGGATAGTCTTATGAAATCTTAAATA 
Sequence 198 

CCGGGCAGGTACGCGGGCTGCACATCTGGTCTCTAACAATGAATCAAGTAATtCTCTCAG 

CTCATGTTGCTACAGCAGCCAGCCGGGACAGCCAAGTGGTTCGGAGAGAAATTGCT^AAG 

CCCTTAGCAAMGCT TTACG ATGCACTCACTCACCATTCAGATGGAATCTCCAGTTGACC 

AGGACCCCGACTGCCTTTTCTGTGAAGACCCCTGTGACTAGCTCAGTCACACCGTCAGTT 

TCCCAMTTTGACAGGCCACCTTCAAACATGCTGCTATGCAGTTTCTGCATCATAGAAAA 

TAAGGAACCAAAGGAAGAAATTCATGTCATGGCGCAATGCACATTTTATC 

Sequence 199 

AGGTACGTGACGGAAGGGAGGCTCACAGGGATGGTGTCTGCGACCCACTCAGCTCTGCCA 

TTTACCATCACCTGACTTTAAGCCAGTGTCTATTTTGTAGGGCTGCTGTGGGGACCAATA 

TGAGCTGAGAGCTCTGCCAGGCACCCTACATGATATACCTGTAAGGGAGCGTCCTTCTGC 

GGCCTGAACCTGAAGCTCTTCAGAGGGACCCATGCTGGGCTCTGAAGAAAGTGACTCACC 

TTGGAGGAGCAAGGGCAGGTCACATTCCATGATGAGAAATTCTCACACCACCCTCCTTCA 
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TATTTGACTCACATCATGCAGCATGGACTTCCACTCCAGTAAATGAGT 
Sequence 200 

CCGGGCAGGTACCAGGCTAGCCTTCCTGATGACTGCAGCAGCAAGGCTGCCCATGGGCCA 

CCCCAGACAGCCTGCTTAGAATCTCTAGAAGAGCTGCTTGATGTAAGACATTAAGACATT 

CTTAAACGAAGCCAGTCTTCMGGACTAGAATAAATCCTTAATTCTTCCAATACATAGAC 

ATTAACGTAAAGCAACAAGAAACAAGGAAAGTCAAATAGACATAGCACTCCCCAAAGCAT 

ATAATAATCTCTAAGTTGATCCAAAGATACAAACTGCCTGACAAATAATTCAAAATAACT 

CATTAAAAAATGCTCACTAAACTTCCATAGGATACAGGGAGACA 

Sequence 201 

AGGTACMGTTGTCTTTATGCTGCGAGATMGTCCTCTCTTGGTTTGAGCTCCCACCTTT 

TCAGTGMCTCTTA CATTTT GGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGAGT 

ATTCTTTTGGTTTAA I I I I I GGTTTGGTTATTTGTGCATGAATTTAATCTCATTAGGAAA 

CAAGTTAAGTTGAATAGACCAACTAGTGAATTAATCCGTCACCAAAATATATGTTTTTGG 

CATTTACCTGTTTATTTTGAAACTCTTTGTMGAAATGTAMCCTGTMTGATAATCT^ 

GCTTTGTMGGATATCTCCCTCTCTGACACCTAAAACACTAGA 

Sequence 202 

CCGGGCAGGTACAGACATTCAGGTTAAATGCTGGCCAGTGGTGGGGGGCNGGGGGTTGTC 

ATTATTTCCCATTACTAAGGMCAACAAATACCACATCTATTTATTTATATATTTAGTGC 

ATTGAGTAAAGATAAGAGAGACCTGCAAAAGAAAAAATATCTAGTGTGTATCAATGACTG 

AATTTGTGTCTACATTTGGGATAGGAAMGTGATMGACATTTTTAATATTCCCAGCACA 

AATTTCCTTCCTGTGTGGTTGTACCT 

Sequence 203 

AGGTACMGTTGTCTTTATGCTGCGAGATAAGTCCTCTCTTGGTTTGAGCTCCCACCTTT 

TCA GTGAA CTCTTACATTTTGGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGAGT 

ATTCTTTTG GTTTAATTTTTG GTTTG GTT ATTTGTG C ATG AATTTAATCTC ATT AG G AAA 

CAAGTTAAGTTGAATAGACCAACTAGTGMTTAATCCGTCACCAAAATATATGTTTTTGG 

CATTTACCTGTTTATTTTGAMCTCTTTGTAAGAAATGTAAACCTGTAATGATAATCTCT 

GCTTTGTAAGGATATCTCCCTCTCTGACACCTAAAACACTAGAT 

Sequence 204 

CCGGGCAGGTACTAGGATTGCAGGCATGAGTCACTACACTCGCCCAGAAATTTGl I I MA 

G AGTAACTAC C ATGCCATTAAC AAAC CTG AC AAAATTAAC AATTTCTTATCTTCAAGAAC 

TCAATTCATATTCAMTTTTTATCATCTCAAAGCTGGTTTGTTCAAACAGGAGTCCAAAG 

TCCCCATACTGCATGTCATTACTGAGACCATACTGGGACTAAGGCAGGGAGGAATGAGCC 

CTAAGTGAGTCTGTAACAAGCGAGAAAGAAGCTCATTCAACTAAAAGAATTAAATTGGAG 

GTCTGGCCCAAGAAAGTGGAAAAATAAACAAAGAAAACTAAAACTATTC 

Sequence 205 

CGGGCGI I I T JGI I I I I I I I I IMCTTAGTTTCTTTGCTTATTCAAGGGAAGAAAATCAG 

GTTTAGTTGGTGTCTATATGAATGATAGACCAATTAAATGCTTTTACTTTCCAGTTGCTG 

GAAGACCTGGGTCAAATTGGTATATAAGGACATGGATTTCTTATGTAACAATAAATAGAA 

AATTAGGACAGACCTCAGGCTGAACAGCAGTATCATTCAGGACTTAAGTATTTTTCTCCT 

GTTATCATTCATAGCATTAACTTCATCCCAAGGATGGTGCCCCCTCTAGTTGTTATAAGA 

TGCCTGCTGACACAGTTATGGCATATTCTTCCTGGTTCAGG 

Sequence 206 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTCCATCTTTG 

GCCATAAGGAGACAATGGTGTCAAGAGGCTAAGGCCATATGATCTGTAGGTGAAAAACCT 

GGGATGAGAATCAAGGCCTCAAACTTTTTCCACTACATCACAGATTTAGGAGCAGTTAGA 

GGGAAGGCTTCAATCTGGAGAACACTAACAGGCTCTAGAGATTATTGGGTCATGGTGGCT 

GGGACAGAAATAAGGATTCACAGAAAGATGTTTGTTGTCMTGTTCTTTTACAGTAAATG 

TTCTTITGAGATTTTGCTTGATAAAAAGGTTGAGCAGAAGTGAAAAAAAI Mill ICTTT 
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TTTTCTTTC I I J I I C7TTTCTTTTTGCTTTT 
Sequence 207 

CCGGGCAGGTACCATTCTCCAGGCACATACAACATGAAATATCATACAGCTGTGAAAGAT 

CTGGTGGAAAGCTGGAATATTGAGGAAGAAACATTTTTTGAAMTTATTTCTCTCTCTAA 

GTCAGTAAATGTTTGATATAATTCTNGAGTGCATGGCTGTAGTCTGAAAGTTTATTGTGC 

CCCTAAAATTAAT ATGGT CAAAAATTCCCTAAANCCNNCCCCTGGTGATGATATCANGGA 

GGGNNGGGGCCCTTTTNGGGGGGNAAAAAA 

Sequence 208 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTNTNTGGGTCACAGACAANACTCTNTTGATCTGNAAATACNACTTCATCATGTTNT 

NNNGNGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGT 

TCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTANGATCTACTGGG 

GGAGTCCNGANGAACAGTCTCTTCTGTCCCCGCTGGAACCC 

Sequence 209 

CCGGGCAGGTACGCGGGGAGACACATTCAGAGGTGAGCCCAGAGGGGGTAAAGTGGACTG 

GGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGT 

CTAGAGC CTCAC AGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTC 

ATGTTCMTTTTACATTCAGTGCCTGGAATCTTTTCTTCACMTTGAAATGAAATGTGCT 

GAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCA 

AGACCACAGACTCAAGACGGACATAATTGGATTTTTTTTGCCATGGCCTGGAAAGAAA 
Sequence 210 

AGGTACGCTGGGAAGATGCTGCCAAGCAGCAGGATCACCAGCTTAGCTGTGTGTGGTGAA 

GTTATGCAAATACCTCCCCCTGTCAACTGTGGCAGCAAAGTCCCCGGAAGGACTTGGACT 

GTCCTGGTTTGATATATGCTCATTCCCCCATCAAGCACTGTGGCCGGGAATCAGGAACTG 

TGACTGGTCTGACCTGGGTCATGTGCCCACCTCTGGCTGGTAGGAATGGTTTTGATTTTG 

TGATCAATCCTGAAGAGGAAGACAGGTGCATGGCAAAGGAAAATGGGATAGTACCTGCCC 
G 

Sequence 211 

CGCCGGGTTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTGGTGGCGAAGCCGCCTTCCC 

CTTTCCCGAAAGGTTTCGGAAGCTTGGTGGTAGCTCTTGAAGATAACGCTTGCGTTAGGG 

CATTACTTGCCNNAAC CNGG NAAAGGGAATGGGMCTTCCCATTGAAANCCAANTTTCTT 

GGANTTGGAACACNAATTTTAAANNAANCCCCATTTTTTTNAA 

Sequence 212 

CCGGGCAGGTACATTTTTTTCAATTTA 

ATTTTAAACA I Mill I AAATAAAAAAGAAAAGCAGATAATTTGTCTACCAAGCCTAGAT 

GGATGTATACTCATAAACAGGTAATACTGTAGTGTGCCATAGGGTGATAAGCGCCATGGA 

GGGGATTAATTTCATGCAGAGCGGGCCTGCAGGAAGTAAAGGAGCGAGCCGTGAAACTGT 

CTGGATTAACGAGTTCTCAGGTAGAGGGAATAGCAAGTGCAAAGTCTCTTGAGCGAATTT 
GCAGGGGGAAGG 

Sequence 213 

AGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAAC 

TTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCC 

TCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAA 

TTTTACATTCAGTGCCTGGMTCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGG 

TGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACA 

G ACTC AAG AC GG AC ATAATTGG AT 

Sequence 214 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACAGTGAAGCTTTGACCTGAGGAGCTCTTCTGT 
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AGGTGAAGAGTGGTTAACTATGGTTCCTGTGTGCCTGGGGCCCCCCACTCCTAGGACTCA 

TCCACCCCCTACCCAGAACTTACTAGTACCT 

Sequence 21 5 

TCAC AGCAACTAAGACCAACCCANCTCTCAAAAGAAGGAATGTCAAAATGTCATGTTCAA 
TTTTACATTCAGTGCCTGGAATCTTTTCTTCACMTTGAAATGAAATGTGCTGAAGGAGG 
TGAATCCATGCATTAATCTTCANCTCACAAAGGAAATACTACATAAGAAGCAAGACCACA 
GACTCAAGACGGACATAATTGGA I I I I ! 1 1 I GCCATGGCCTGGAAAGAA 
Sequence 216 

■ AGGTACGCGGGCGTTTTTG 1 1 I I I I I I I I AACTTAGTTTCTTTGCTTATTCAAGGGAAGA 

AAATCAGGTTTAGTTGGTGTCTATATGAATGATAGACCAATTAAATGCTTTTACTTTCCA 

GTTGCTGGAAGACCTGGGTCAAATTGGTATATAAGGACATGGATTTCTTATGTAACAATA 

AATAGAAMTTAGGACAGACCTCAGGCTGAACAGCAGTATCATTCAGGACTTAAGTATTT 

TTCTCCTGTTATCATTCATAGCATTAACTTCATCCCAAGGATGGTGCCCCCTCTAGTTGT 

TATAAGATGCCTGCTGACACAGTTATGGCATATTCTTCCTGGTTCAGGTTCAGTGGAAGA 

ACAC 

Sequence 217 

CCGGGCAGGTACTCCCTATGTCTCTTGTCCTTCTTGGACAGGGGAGTCAI I I I IATCATA 

AGATGACAGAGGTGCAAGAAGCTAAGAAGAAAATGAAGGACAI I I I I AGGTTAGGCTCTG 

TATGGCCACACTGTTACCTCCACCCTCATGTGCTTGACCAGCGCAGTTCCTGTGGCTGAG 

CCCAAAGCCAAAGGGTGGCGATTACACTGGCTTCCAGGGAAAGACTTCAAAGCCATATAA 

CATAAGGGTGCAGTATTGGGCCAATAATTCAGCCTGCCATACTTCTCAACCCATATAGTT 

TGGTATGATCTGAATCTCCCACGATCTCCATATACTTCACATTAATG 

Sequence 218 

TCGGCTCCCAGCTACTCACAGAGAACGTTTCTGTTTCTGACAGCAGAGTAAGAGAAGAGG 

TGAGAAGAGAGACAGCCCATTCTCTTTTGGCCTAATTCCTAAGGAATCATGCCTCTGCCT 

TCGGGCCATTCTCAAGTCTTGTTCAAGGATAAMTGATTTATTGATGGCCGTAATTAAAA 

AGCMTGCCAACGGAAGCAGCGTCAGCATTTTTCATTTATACCT 

Sequence 219 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAATTATTTTGACTTTTATGTCTGAAAATGGTC 
ATCACAAAATGTTAGGGAAAAAAATAAGAGATGTGGGTTAACATCTCTGGTCTCTGGTAT 
CTTCATAGCCACAGACTTGGTTTATACCTCATTACATCI I I I I I I II I I I I I I I I I I II I 
TTTTTGGAGACAAGAGTTTTGCTCTTGTTGTCCAGGCTGGAGTGCAATGGCATGATCTCA 
GCTCACTGCAACCTCCACCTCCTGGGTTTAAGCGATTCTCCTGCCTCAGCCTTCCGAGTA 
GCTCGGATTACAGGCATTGCGCCACCACGCCCGGCTAAI I I I I I GTATTTTTATTANAGA 
CAGGGTTTCTC 
Sequence 220 

AGGTACCCCTTCATGAGAGGCGAGCACAACTCGACCTCCTATGACTCTGCAGTTATTTAC 

CGTGGTTTCTGGGCAGTCCTGATGCTCCTGGGGGTAGTTGCTGTAGTCATCGCAAGCTTT 

TTGATCATCTGTGCAGCCCCCTTCGCCAGCCATTTTCTCTACAAAGCTGGGGGAGGCTCA 

TATATTGCTGCAGGCATCCTATTTTCATTGGTGGTGATGCTGTATGTCATCTGGGTCCAG 

GCAGTGGCTGACATGGAAAGCTACCGAAACATGAAAATGAAGGACTGCCTGGATTTCACC 

CCTTCTGTTCTGATGGCTGGTCATTTTTCCTGGCCCCAGCTGGGAT 

Sequence 221 

CCGGGCAGGTACTGTCCATCCCTGAGCCGGCCCACMGTGTCTTCTGTTTTCTTTACCCT 

CTCCTTAGCACCAGCTGCCTAACAAAGTCAGCGGGCAGCCAGGGCAAAGAACAGTTTGGG 

CGACTTCAATATGACTCTCTCTTCTGAGCCCCCGATCCAAAAGNAAGGCTCACCCTTAGT 

GAATGTGCCTTTGCCAGAGGGTCTTCCCTAGGGTGAAGGGGAAAAATCATGTTTCACAAT 

TTNAG 

Sequence 222 • 
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TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGAGGGCGTAAGAATGA 

CTACCAGCAATCTGGCTGAAAGAGAGAATGGGATAAGGAAGACTGAAGGAGAAATGATTT 

TAGAGTGGGGAGTCTCAGGAGCTCTATCTCAGATATACCAAGTTTGAGGTGTATTAAGAC 

AACCAGCCAGGTGCAGTAGCTCATGCCTGTAATCCCAGCAAGTTGGGAGGCCAAGGCGGG 

CAGAACCCGAGGTCGGTAGTTCGAGACCAGCCTGGCCAATATGGTGAAATCCCGTCTCTA 

AATTTTGTATCTAAAGATACAAAAATTAGCCAGGTGTAGTGGCACCGTGCCTGTAGTCCT 

AGCTACTCAGGAGACTGAGGCAGAAGAATCACTTGAATCCCGGGGGCGGAGCTT 
Sequence 223 

AGGTACCCCAGTCCCTCCCTCTTAGCCTGGAGCAGCTCCTCAATGTGCTAAGCCTTCCAA 

GGATAAGCTGAGCCCTGGACAGAGGAGGCTCCCCTTCAGGCCAGCACTGCCCATTACTTG 

AAGACAAGGTGAGCCTCAGCCTGATGTTGAAGGCTAGCATTAGGAACCCCGCTCAGGATC 

TCCCATTCTCCAAGATCTGCTTTATCAATAAGAAAGGGGATGCTGCCGGGCCTGGTGGCT 

CATGCCTGTATTCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATTACCTGAGGTGGGGAG 

TTCGAGACCAGCCTGACCAACATGGAGAAACCCCGTCTCTACTGAAAAT 

Sequence 224 

CCGGGCAGGTACTGTCCATCCCTGAGCCGGCCCACMGTGTCTTCTGTTTTCTTTACCCT 

CTCCTTAGCACCAGCTGCCTAACAAAGTCAGCGGGCAGCCAGGGCAAAGAACAGTTTGGG 

CGACTTCAATATGACTCTCTCTTCTGAGCCCCCGATCCAAAAGAAGGCTCACCCTTAGTG 

AATGTGCCTTTGCCAGAGGGTCTTCCCTAGGGTGAAGGGGAAAATCATGTTTCACAATTT 

CAGGGAAGTCTGMTGGTAGGAGCCAGGTTTTCACATTCTGGTCTTTAATGCATAGTGGC 

CATTCATTCCATTAGGCTTCTTATGTCATGACATAA 

Sequence 225 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGG 

CGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGC 

CCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAA 

GGAATGTCAAAATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAAT 

TGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAA 

TACTACATAAGAAGCAAGGACCACAGACTCAAGACGGACATAATTGGA T T I 1 I I I I GCCA 

TGGCCTGGAAAGAAAGGTACCT 

Sequence 226 

CCGGGCAGGTACGCGGGGAGACTGAGTGGGGTCACAGCACAGGGCACTGTCTTGCCTGGC 

TTTATCTGAGCCAATCACACCTCTCCTGGCCACTATCTGTGGTCTAGCCCCCTTTGTGCA 

GAAA GAGA AAGAAGAGCCTTGAGGACCAGCCTAGTCAGGCTGAAGAAATGTCAACAATTG 

GGAGTTTTGAAGGATTCCAGGCTGTGTCTCTGAAGCAAGAGGGAGATGACCAACCCTCTG 

AGACTGACCACCTATCGATGGAGGAAGAGGACCCGATGCCAAGACAGATTTCAAGGCAGT 

CAAGTGTGACCGAATCAACTCTTTACCCCAATCCTTATCATCAGCC ' 

Sequence 227 

AGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAATCCAATTATGTCCGTCTTGAGTCTGT 

GGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATGGATTCACCT 

CCTTCAGC ACATT TCATTTCAATTGTGAAGAAAAGATTCCAGGCACTGAATGTAAAA7TG 

AACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAGTTGCTGTGAGGC 

TCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGAACATCCTCCGAAGTT 

CTCCCCAGTCCACTTTACCCGCTCTGGGCTCACCTCTGAATGTGTCCCGCGTA 

Sequence 228 

AGGTACCCCAAMCCAGGGGCTGCAGCTCCAGCAATTGGGMGGCCMGTTTTCTTGCAC 

GCTCCCCAGAAAAGGGGCCAAGTCCATGGGGCTGAGCAGTGATAGACTGCAGACCTCACC 

ACCACTGAACCTTGTAGGATAAGGCCCACTAGCCTGGGATGCTAGTGAGGCCACCCTAGT 

CCTCCTGAGTTCTCC AGCTG GGAGCAGCTCTACACTTCTCCGGCATGCAGCTCCCAAAGA 

GAGAGGCAGTCCACCTTTTTGCTGTCTCGCAACCCTCCCTCCTTGCTGCTCTCAGGCTTG 



Page 38 of 869 



WO 01/51628 



PCT/USO 1/00798 



Table 2 



GGAGGGTGCACAGCAATTAGGGACTATCACAGAACCCCAACACAGTGCATCTGGTGAA . 
Sequence 229 

CCGGGCAGGTACTTCCTGCAATGGTGGAAATGTTCTCTGTCTACACTGTCCTATATCGTA 

GCCACTCGCCACATGTGGCTATGGAGCTCTTGAAGTGCGGACAGATGGACTGAGAAACTG 

AMTTTAAATCTTACATAATTTTAATCATGAGATAGTGGCTACTTGACAAGATGCTGGAG 

AACAGGCT7TGGAACCAACATGATTCACTGACATGAAGCTCCCACATCTTGGCCTATGGA 

CACAATGAACAATGCATCTACTAGCTGTTTAGGGCAGCCTGTGAACAGAGCACACAGCTA 

TTGCAAACAAACAGACCCGGGTTCAGAGTCAGGA 

Sequence 230 

AGGTACTTGATTCAAAGCCTGGGTCTCCCAGAAGTGAGCTGTGTGGCTTGGGCCACTTTA 

TCTCCTGAGGTCTCAGAGTTrrCATGTATAAAATTGTAGMCTTGTCTTGATGGTTTTGA 

ACATTCTTTCTAATTCTGCACMCAGTCCATTTGTGGGTTAGATGAGCCTTTACCAAACT 

GT ATTTC ATGGAACMTGGTCCCMGAGCTGTTCTGTGAAAAGTAAGTTTCATCATCAAA 

TAATTTTGAGAAATTCTGMTCCTATATGCGTGTTGATTCCTATTTCTTTCCTTCCATTC 

TTTCTTGAGTCCACTCTGATTGGTCCTTAAAG7TGGAGTTTATTTTTATCATTATTGTTA 

TTGTCATGGT 

Sequence 231 

CCGGGCAGGTACTCAAACTATAGCCCCAGGGGAAGACAAACGGTCACAGATGAGGCAGGA 

AGAGTGCCTGGGAGAATTTCCCCATTCACCATGGAGCCAAGTGAAAGACCTCACCATCAG 

CTGCTGCACAGAAATGACACAGCCACCCACACAAGTGGGAGATCGAGGAGGAGGCACGGC 

CAGTTTGTTTGGAATGGACAGGTGTGTTAGGTAGGAGTCCAGCCTTCACACCCTGTGTTT 

CCTTTCTTTTGCTTTGGACCTTCTTTCCTAACTAAAMGAGAGCCAGTGAAGGTTCTTGA 

GAAGGG GATGGCAAAGAGCCATGAG ATTTTATCTTTGTAAACCTCGAAGTGTGG GT 

Sequence 232 

CCGGGCAGGTACGGAAGAACATGGGAGTCTGTATGCAGCACGACGTCTTGTTCAGTTACC 

TCACTACTAAGGAGCACCTTCTCCTATATGGTTCCATCAAAGTTCCTCACTGGACTAAAA 

AGCAGCTNCACCGAGGAAGTAAAANGGACTTTAAAAGATACTGGACTATATAGCCATCGN 

CATAAGAGAGTTGGAACACTGTCAGGAGGCATGAAGANGAAGTTATCTATATCCATAGCT 

CTCATTGGTTGGATCMGGGTAGTAATTTTGGATGAACCATCTACTGGAGTTGACCCATG 

TTCTCGCCGAAGT 

Sequence 233 

TATAGGGCGAATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACTCTGCTATAACATGAGAT 
CTGTCTCTCCAGGCTCTGGCTTCCTCAAGGACAGCTGCCACCAGGTCCTATCCATCTTGT 
AAGTCCAGTTCTTAACGTCCCCGCGTACCTGCCCG 
Sequence 234 

CCGCGGTGGCGGCCGAGGTACTGGAGGTCCGAGCTACCACAGAAGACAACAAAAATCAAT 

ATAAGGTATAGGAAGATTGGAAAGAAAGAAAACAAACCACCTAGGCTGAAATCCCAAACA 

AACTGGACACGCAAATTATTAMGGTGATTTCATAMTGTTATGTCATTTAAGACAGCCA 

TATAAAAGGAATCATTATTCCCATTTTCTCTATGAGGAMTAGAGGCTCAGAAAGGTTTT 

GAGCAACTTGCCTGAAGTTGAAGTCCCCCCAGCATATTAAGTGCTGAAGCCAGAATTCAA 

ACCAATTGATAATTTCAGTGCTCTTCACCGTAGACCACATGGCCTCCCCTNAGTTAATAT 

TCACC 

Sequence 235 

AGGTACTTCCATCAAGTCGAGGGAGTAACGCAAACTAGAATAGTAGCCATCTATTAAGGA 

TAGTTCTGAGGTTTGGAATTTGTMCTGATTTTTAGGAGCTACGACATGAATATCATATC 

AGGTAGAMTACTCAGAGTGATCACCTTTTATGAAAGTTAGTTCTATCTTTCATGTCTAG 

GAAAATATAACAGTGMTGTTTGCTTTTTGTATGTTTTGTCTCCAAGCAGAGGAAACTGA 

CAGTTTTATGTTTTAGGTGCCGAMGTTGTTTTTTGGCTAGGCAG 

CTTGGGATGCTAACTTGAATTTTTGTCTTTAAMTGTTTTAGATT^ 
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Sequence 236 

CCGCGGTGGCGGCCGAGGTACCTACTATGGGACCTAGACAGGCTGTTTCACTTCTCTTAT 

ATCTGTAATATAAGCAGAGTAACTGGAGCCCTTTTGTTCACCTGCAGTTGGATCTCAGTG 

ATTTGGGGAAGGATCTTGGGCCTCCACTTGCAGTGACTATAGTCACTATAGCTTCATCAG 

CAGGGGCGGG AAAT GATGC TGCC TCCAGCTTGAGAGTGATGAGGCCAGCACCTCCTGGGA 

GCCCTCGATMTTTTATCTTCTTTTGCAATCCTTCCCCAAATCTCTTGGTTCCACTCAAA 

ACTAGGAACAGTCCTGCCCACTGTGGCAAGCTGTGGTGCCCACCTCTCTTACAGCTGTCC 

CCACAGGACTACTATAATTATTCACTTTTATCCCATTGCCTGTAATGC 

Sequence 237 

CCGGGCAGGTACTTTTAGAAGAGACAGGGTTTCACCATGGTGGCCAGGCTGGTCTCGATG 

TCTTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGC 

CACCACACCCGGCCTGAATTGCACTmGATGGCTGAGCCAMTGCTCATTCTATTTAAT 

ATTGCACAGCCTGAACCAAAACCCGTAACATTATAAAGGAAGAGATGAGAGCCATTTCAA 

ACTGTGAGAGAAGAGAAGACATCAAGCAAAATCTGGGGGTTTTAGCCAACAGACTTCAGT 

CCTGGGACCTCGCAGCAAAAAAACACCTTCANTTGCCCCAGGGCTTTACAGGG 

Sequence 238 

CCGGGCAGGTACGCGGGTGTCCTTGAGTTTTGAGCTCATGGTTAAAGCTGAGAATTAAGC 

ACTGCCCTTACAACTGCTAGGTAAGCTCAGTAAGGTGTTGCAAGGTGGTGCAAAGGGCCC 

TGGGAGGAGGGTCCCAGTTGCCCACCCTTTGTCCACTGCTTTGTGATGAATTCTAGATCA 

CCTTGTGGTTACCTGCCTATTAGGAGGTGCTGTTGAAAAGCGACTTTAAAAAAATAGGAC 

TTTGGGAGAGTGGGGCA GATC AAATGAAGGGACCAGCAGTGTTCATAATGGTGCTGAAAT 

TTMTTTCAGTGCTGACCTTTTATTTGACATTTTAGATAATGCTATGGATGTGCTCAGCT 

ATTTGTTCTTAACAT 

Sequence 239 

CCGGGCAGGTACGCGGGGAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTC 

ATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAG 

ACCAACTCA GCTC TCAGAAGAAGGAATGTCAAAATGTCATGTTCAATTTTACATTCAGTG . 

CCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATT 

AATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACGGAC 

ATAATTGGA1 I I I I I I I GCCATGGCCTGGAAAGAAAGGT 

Sequence 240 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAGCAATGAGAAGGAGAGA 

ATTCTACTTGGTCACAGACAAGACTCTCTTGATCTGCAMTACGACTTCATCATGCTGAG 

TTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCACCTGCCTGGGCAAGTT 

CACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACC 

Sequence 241 

AGGTACTTMCCATGGGTTGCCATGCTTTTATAGCACATGACTTCACTACTCTGAGCACA 

GCTAAGAGAATTAAAAATCAGCAACTGATGCAAAGGTAAGGTAGACATTTTCAAAATAGC 

CAGAAGCTGATGATGTGACCTGGCAGCTCTGCCCAAAGGGGCATACTACATGAAGTTGTC 

ACTAG GCAA AACCAATCAGGTCCTCTCTTAGGAAGGCTGAATACAGAGTCAACAGACACC 

GTAAGTTTTGCATAAGACAGGAAAAACACAAAATAAAAGCAGTAAGCTAAGAGGCTGAGA 

AGAGATGAGATAGGCAGGTCCTCCGCAGTGGAAGAGG 

Sequence 242 

AGGTACCTCTCATAAACTTTGTTCATTCAGCCACTTGAAATAAGCCACGTGGGTTATGTG 
CCACG CAAA ATTTCAGTAATGGGTTTTGGGGGTGTTTAGATAGCAACAACTTCATAGACT 
GAACCTTTTGG I I I I I I I I i CTTTCTCTCCAACTCAGAGGTTAAAAATTAACCTGAAAAA 
TCAATGCTGCAATATGAATGCGCTGTATGTTTCTTGCATGGACTTGCAGACACTCTGTAA 
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CGGTGTTTCCAGTCTCCAGCAGTTCATATTGGAAGGTGAGCTTTCACATGGACAAGCCCT 

TTGCGTTGCCAAGTAGTAGTGATATTACAAGTTTGAAAGACAT 

Sequence 243 

AGGTACNCCAAAACCANNGGCTGCAGCTCCAGCAATTGGGAAGGCCMGTTTTCTTGCAC 

GCTCCCCAGAAAAGGGGCCAAGTCCATGGGGCTGAGCAGTGATAGACTGCANACCTCACC 

ACCACTGAACCTTGTANGATAAGGCCCACTAGCCTGGGATGCTAGTGAGGCCACCCTAGT 

CCTCCTGAGTTCTCC AGCTG GGAGCAGCTCTACACTTCTCCGGCATGCAGCTCCCAAAGA 

GAGAGGCAGTCCACCTTTTTGCTGTCTCGCAACCCTNCCTCCTGCTGCTCTCANGCTTGG 

GAGGGTGCACAGCAATTAGGGACTATCACANAACCCCAGCACAGTGCATCTGG 
Sequence 244 

CCGCGGTGGCGGCCGAGGTACACAAGTAGAACTGTTAAAACCGTGTGGCTTCCAACTGCT 

CTMTMCTTTAAGATTCACAGTCTTCAACTTTCTTACAGTGGAAGCCATGGGGGCATTG 

TCCTGTTGCCTTCTCTCTCATTCCTCATTAATGAGAAAGATGTTAAGCTCACTTGACGTA 

AGTCCCAGTGAAGTCTTCTCCAGAACTCAGAAGGGCCATAAATAGAGAGACAAAGGGCAG 

GAGGTGGGAGTTATTCACCAGCGTGGAGGAAGeAGGATAAGCCCAGAGGAATGGCCTCTG 

GGAATTATACAAAGAATTTGGAGGCGACTGGGGG 

Sequence 245 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACNCGGGGACATTCANAGGT 

GAGCCCANNTNNGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGC 

AGCCCCACGCCCTGTATGGTCGGTGTCTANAGCCTCACAGCAACTAAGACCAACCCAGCT 

CTCANMGAAGGAATGTCAAMTGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTT 

TCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTC 

A€AMGGAAATACTACATMGMGCAAGACCACAGACTCAAGACGGACATAATTGGATTT 

TTTTTGCCATGGCCTGGAAAAGGAAAGGTACCTCGGCNCGCTTCTAGGAACTAGGTGGGA 
TCCCCCC 

Sequence 246 

AGGTACAATCTGMCAACCCTTTTTATTGCTTACTT^ 

GAAGTATCTCCTAGCACAGGGGATGAGAAGAGGGGAAAGAAGGCAGGGCATCACCAATCC 

ACTATTCCCTTTTACTAATCCTCACAGGAAAAGACCATCCTACAACATCCTCCAAGTCCA 

GCTGGCTATCTGTTCTGCCTCATTTCTTCTATGTTTTATTCCACCAACACGAACAGGCTT 

AGTTTTCCCCAGCCCTGMCAGMGCTCTGJCCTCCTTAGGAGACACCTCATTTTGCAGT 

TTGTTTCCAAGCTTCAACTTCAT 

Sequence 247 

AGGTACGCGGGTGAGAAAAGGCTCTAACATGAGTTGATCTTGAGCCCAATGTTGAACAGC 

TCCAGACCTTACAATTTAAAAAAGTAGTTCAGACCGGGTGTCTTGAGTTCCGCAGTATAC 

CTGGTCCAGTGGACACTTGCAGAATGCCGTTGGCACATGCCACTGGCCATTGAGGGGTCC 

CCAGATGCATAAACAGCCTGATCCTGTAGCTCCCCTTCCCCACACGGATGCATGGAGGCA 

GGTGGCGAACTCTAAAGCCTCACTGAAAGGCAGATTGCTGGACTGTTAGAAGGGGCTCAC 

TCAC CTGCCTCCAAGCGCTTGTCTTCTGTTTACAT 

Sequence 248 

AGGTACGTTTCCTCACAGATATATCAACTATTGTGCATATCTCTAGTTCTGATTCTAGTT 

MTCAATATCAGCAGAAAGCTTATCATTTCATGTCTAAGTGTCTGTGTGGGCATGAAATT 

GTTAGGAATTGTTTTAGTGWMCTGATTCAAAMTTCATTGCTTAAAAACM 

i n u lACAGAAGCATTCAG MCTCCA MGAGATTATMCTCACAGAGMTTATTTTAAA 

MGCAGATAATATCTTATTCCTTTTTTCCACACATCTCCACATACCMCTTAAATCTTCA 

CCAGTAAAGAGGAATCCAGGGGAAATCATAAGAGAAGCATCTTTA 

Sequence 249 

ATCGACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTG 
CCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCT 
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GGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAG 

ACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTTGTGTGCAAGCTGCA 

GCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGAT 

GTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCA 

GTCTNTTCTGNCCCGC 

Sequence 250 

CCGGGCAGGTACTAGAAAAACTCAGAAGCTGGAACGTGGGAATACATCTTCATTTCACAG 

ATGAGGAGGACCTGTCTCAGGAAAAGAGGTGACTCACCCAAGGTTATAGCTGGTTAGTGG 

CAGAACCAGGACCAGAAGCAAGGCCTGCAGCCTCTTGTGTAGGCTTTCCTAGTGCGTGCT 

GCTGACCCTCAAAGCTGTCCTTGGCCAACTCACCACCTGTGTGACACTTACCCCCAGGCA 

GCTGTAGGGAGGAACACAGATGTTCGGGGAAGGCACTTGTGTTACTGTATAAGCACAAAG 

GAAGCTGGGATCTGAGTATGGTGGCCCTTGTCGGTCTC 

Sequence 251 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACACATTC 

AGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCC 

AGGAGTAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAATTAAGACCAAC 

CCAGCTCTCAGMGAAGGAATGTCAAAATGTCATGTTCMTTTTACATTCAGTGCCTGGA 

ATCTTTTCTTCACAATTGAMTGAAATGTGCTGMGGAGGTGAATCCATGCATTAATCTT 

CAG CTCACAA AGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACGGACATAATT 

GGAI I I I I I I GCCATGGCCTGGAAAGAAAGGTACCTTGCCCG 

Sequence 252 

AGGTACTACAGTCACTAMGCTCAGTTTCTTAMTCCATATTTTAATGGGACTCTGCTGG 

ACATTGAATACCACCAGTCCACTTGTCAGGATGTCTGTTCTCATGTGCAGTGTGATGAAG 

CCTCTTTCATGATGTATTGACTTTACTGGACTGAAATATTTTGAAGAGTGTCAGCCCTTT 

GACAAGATCCATCTGACCATTTCTGTTCACAAATGATTTCTCCACACGATTCCACTCAAA 

GCACACTCATTTATATAACACAAAGTAACTTTCCCAAATMTCATCACTTTCAAAAAMG 

ATCGTGAAAATACTGATAAAAAGAATTCTCCCGGCCA 

Sequence 253 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTCA 
GCCTCACTGGCATAGAAAGAAGGTGGAGGAAATGGGTGCTGCTTTCATTTCTGAGGTATA 
AGTGGTGATGTAGATAACTGCTATAGGAAATGCCCACGCATAGAACAAATGGTAAAGAAG 
TTCC AGGTAGTC AGCTGGTTTCAACAATG AATAAAGCAG G ACACCCTG CTTAG AGGTTGA 
AATCAAAATGAAGAGCTTTGGAAAGAGATACCTGCTCTACAGTGTGGTGAGTGCTGGAAT 

GGAGGAGAAAGCAAGAGGAATAGCAATAGAGTCAAGTCTTGAAGGATGACGGCACATAAG 

GAGAAAGTGTTCCCCACACTAAGGAAG 

Sequence 254 

ACTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGATCTGACTTATGT 

TTAGTAAGATCCTGGGCCACTGGGTGCCCTGTAGAGAAGAGACTGAACTGGTCTAAGAGC 

AGAAACTGTGAGAGCAGTTAGGGCACTACTAAAATATGCCAGGTGAGGCATGGTGATTGC 

CTAGACTGGTGTGC^AGCTGTGGATGCAGAAAAAAGTAGCTGGATTGTCGATGGATTCTG 

MGGTGGAGTCAAGMTTTGGTCATGGACACGGGTGTGAAAGAAGGAACCCAAAGGATTC 

TGGCCTGAGCCACTAGAAGACTTGGAGATGCCATCATTTGAGATACAACCAAGGCTTGTG 

GGMGAAGAANAT1TTGGGGTGGGAGTTGAGGTAATTGGGGAAAAAGGGTATTAAGGGAG 

CCTCAATTCGGGA 

Sequence 255 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTCTCAGTTCACTTCCATGTGGGAGGGTCATTT 
GCCATCCATCCAGATTACCTTTTTGATTAGGCTTTTCAGTGTCTATAMTATTTCGGTCT 
TTTCTGATCATTTAATTTCTAATACATGCATATGCTCCTATGTATAATAAATAGACAACT 
TCAMTTTGCAGTTTCTAGATGGTTGGAAAAGGGAMCATTGTGGTGTGTAATTTATCAG 
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C CATC MGATCCTAGATAmGAGATTTTAACTAAGCAAGGTATTAAGATAGACCATGTT 
QTTTTG GCTTC ACAGAATTCATTCATATTGTGCATTACACAATTCAGTGTGCATATTGCA 
CAATTGATTTTATCTGTAAGTTGTCTTTTATCAGT 
Sequence 256 

GCGGCCGAGGTACCCTGGGCAGTAGAATTTCATTGAAAACGCCTNTGGACTNCAACTAAA 
GATGCAGAGAAGCCCAAGTTCCTCCCTCTCTCTTGCTTCTGGAGCTCCTTGTAGTTACAT 
CTATTATAGTATCATTATATCMGTTATMCMTCTGmcCCTTTATCTCCTCTAGTGG 
CCTGGGAACCTCTAAGAGGCAAACACAGTCCCCATACAGGAATGGGATGTCTGCAGAGCT 

GTATAACTCCTCTCTTCTGTCTGMGAATAAAAACAACATTATACAAAGGAAAAAGAA 
Sequence 257 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGTCCTTGAGTTTTGAGCTCATGGTTA 
AAGCTGAGAATTAAGCACTGCCCTTACAACTGCTAGGTAAGCTCAGTAAGGTGTTGCAAG 
GTGGTGCAAAGGGCCCTGGGAGGAGGGTCCCAGTTGCCCACCCTTTGTCCACTGCTTTGT 
GATGAATTCTAGATCACCTTGTGGTTACCTGCCTATTAGGAGGTGCTTGTTGAAAAGCGA 
CTTTAAAAAAATAGGACTTTTGGGAGAGTGGGGCAGATCAAATGAAGGGACCAGCAGTGT 
TCATAATGGTGCTGAAATTTMTTTCAGTGCTTGACCTTTTATTTGACATTTTAGATAAT 
GCTATTGGATG TGC TCAGCTATTTGTTCTTAACCATATTTTCCATCTTTTGGGGGGC 
Sequence 258 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACACGTTAACCACAA 

AACAGGCCTCTCTGAAAAAGCCATTGCCATGGACTGCCAGACAGACAATGACAAGACACA 

AATACCTTCTGGTGTGTGAGCCACGGGACATGTGAGCTTCCCCGCTGATGCTCCTCTTAT 

ATCAAAGATCACTTTCACMGATGAGCGACTCAATATCTTTTATCAAACCAATGATCACC 

TGCAAGCTATGGGTATATTTTTGCAGCTGTGTANAAGCTATTGTGGCATGAGAATGTGGG 

ACTTATAAATTGCTGATCCAATAAATAGACATTATGGGCAACAGTGTCTTATCAGCTAGT 

GTGGTACTAAGGTTTCAGGAACAGNTGTTCTGACCTTACTATCCAACGAGGGAGTAACTG 

Sequence 259 

CCGCGGTGGCGGCCGAGGTACTTCCGTGAGATATAAGGCTGATGATAAGGATTGGGGTAA 

AGAGTTGATTCGGTCACACTTGACTGCCTTGAAATCTGTCTTGGCATCGGGTCCTCTTCC 

TCCATCGATAGGTGGTCAGTCTCAGAGGGTTGGTCATCTCCCTCTTGCTTCAGAGACACA 

AGCCTGGAATCCTTCAAAACTCCCAATTGTTGACATTTCTTCAGCCTGACTAGGCTGGTC 

CTCAAGGCTCTTCTTTCTCT7TCTGCACAAAGGGGGCTAGACCACAGATAGTGGCCAGGA 

GAGGTGTGATTGGCTCAGATAAAGCCAGGCAAGACAGTGCCCTGTGCTGTGACCCCACTC 

AGTCTCCCCGCGTACCTGCCCG 

Sequence 260 

A GGTAC GCGGGTAGGCACCTGCCACCACGCCCGGCTGATTTTTGTATTTTGTAATTTTGT 

ATTTTTAGTAGAGACTAGGTTTCACCATGTTGGCCAGGCTGATCTTGAACTCCTGACATA 

AG GTGA TCCGCCGACCTNCCAAAGTGCTAGGATTACAGGCGCGCCCAGCCCTGTTGTATT 

AGTTTTCTATTGCTTCCATGACAACAMTCACCACAGGCTTGGCAGTTTAAATACCACTG 

ATTATCTCACAA1TCTCTAAGTCAGATGCCTGAGTTCAGCCAGGCCCAGCTGGGTCCTCT 

GCTCANTCTCATGGGGCCAGTGTCTTGAGGTGCTTGGGTTCACCCTTGGAGGCCTTCAAG 

CTTATGCACAGTTGGCTCAATTATTCCTTGCAGCTGTAGGACTGGGTC 

Sequence 261 

AGGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCCAAACAACCATCTAAAAA 

TATACCNTTCCTGNCAGGTGCCAAACTTGCTTTTTCAGTTTCTTGGACAAAAATTCTGGG 

ACCATCCTTGAATTTTCTATTTCCCAGATACACTGCAGTTAATTCACTAACAAATTGTGT 

TATCTGTGCCTTAAAAAACATGCCCAAAGGCCCGGGCCGTGTTGGCTCACGCCTCTAATC 

CTAGCACTTTGGGAGGCAGAGGCGGGTGGATCACGAGTTCAGGAGTTCAAGACCAGCCTG 

GCTTGCACGGTGAAACCCCGTCTCTATTAAAAATACAAAAATTGGCTGGGGCATGGTTGG 
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CGGTCACCTGTAGTCCCCCGGTTACCTGCCCGGGGC 
Sequence 262 

CCGCGGTGGCGGCCGAGGTACGTGGGCTTTACGTAAAGAAAATGTGCCATTTTTTCAAGC 

CAGATATAGCTTCTGTATCAACAMTGGAACACAAATCTTTAGTCTAGGACTTAGAGTTG 

GGGAAAAAGGGAAAAAACAACTATAACACTTAMTGAAATCTTTGTATCTTAGGTTTGTT 

GACCAGAGTATTATATATATATCTAATGCTACTAAAATTATGATGTTATCGAATATTCAA 

G7TAATTCAATAAGTGGTGGTTATCTGTAGTAATCATAGAGAAAGGAACTATTACTATGA 

GGGACCTGCGGGTACCTGCCCG 

Sequence 263 

ACTATTTAGGGCGANTTGGAGCTCTTCGCGGTGGCGGCCGAGGTACATTTCACGGANGTG 

GGACCAAAAGAGGAAAGAACAAAATNTNCTAAGTGCCAGCATGTTGGGAATGCATCATTG 

CAATAAGATTGTGAAACTAATGCTGGCAGGCAAACAACAAAAAACAACAACTGCCACAAA 

CTGCAATATGCTTATCAGGGTCTGACCAAGAAGAAAGGCCCCCGATCTTCCACAGAGGAC 

ACTGGCACCAGTCCCAGGTCCTGCAACTCTTGAATAGCTCCATGTGCCGTGTTCTCTTCA 

NGTACCTGCCCG 

Sequence 264 

CGGCCG CCCGG GCAGGTTCGCGAGGGAAGGAATTACACATTTAGAANNTTATAAGTAAGA 
GGTTGTGTTTTTGTTAAGGGAAAAAGAGAGTATTTTTTGTCT 
TGTTCCAAAAAGAAAAAGAGAAAAAATATAGACAAAAACTGAATAAGA 
Sequence 265 

GCCCGGGCAGGTACCTAATTTTACTCTTAAGCANNTGCGCCTTGTAATTTAGGTTGGTTT 

CATTTCATTTCAACGGGAGAGAGTCCTGCTTCACACCTGTTGCAGTGTGTCCTTCCAGTG 

AACACCCAAGCCTGGGCTACAAGAGCGAGAGTCCATCTCAAAAAAAAAAAGGAAAAAAAA 

ATAAAGAAAAAAAGAGGCAMGCATCCACTGATCCTACATTCGTGTNTTTATCACATTAC 

CTTGTCTCATCTTCAACTTCGGTCTCTCCACATTGAAGGGAAGGTGACATGATGTCTTCT 

TAG N C ATTCAAATCCAATTAACTTTTAATTCTGAAGGTTAAAGTAATG AGG ATATTAAAC 

TAAGGAGATGGATAATATTCAATTTTTATGTTCACTGAA 

Sequence 266 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACGAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGGAGCAGTCTCTT 

CTGTCCCGCTGGAACCCATGGTCCACTGGAAGTTCCTT/vTGGCTACCTTTCACTGGAGCC 

ATCCTATTGAAA 

Sequence 267 

GGGCGNTTGGAGCTCTNCGCGGTGGCGGCCGAGGTACAAGTTGATTATGNAGGTGGCCGG 

GGTNCAGGCTTCATCTTNCTGTGCTCTGAACACACCCCACACCTCCTCTATTTTAGCACA 

TTTCCCAGTTGATTCCAAATGCGTGCTCACTCCTCTTn'CCCCCCACTAGAGAGTAAGCC 

TGATCTTGCTCACTCGCAGTTAAGCTTCTGGACMCTTGTGGCGTATMTAAATTTTGCT 

CMCAAAATTTATTAAATGAAGGAGTATCTCCMmGMGGCAMTmcmGTTATC 

TTGGAAACTGCCATAGGAGCTGTGTCTGAATGTGGTGATGGGTGAAGATCTTGTACCTGC 

CG 

Sequence 268 

TACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTT 
MCAACCCCATGAGGAAATTAAGGCAGGGTGGTTCAGTAATTTGTCAAGGGAAACAAGGA 
AGTAGAGCAGGCAGCAGAGGCGTTCTGGCTCTGAACTACTCTGTTCTGTAGAAATCAGTG 
CAAAAGTCCAAGCTTCAGCATACTTGTTTCATCTTGTGATCACAGGTAG 1 I 1 1 1 I 1 1 
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CTGGGGCAGNGGGAANCTNGCAGCNGCTGGNGANGANCCAAANCCTCNGCGCTCACCTAA 

GCCGCAGGNAGATACACCCAACTGGNANATGANGAAACANCAACCNATANAGGAGAACTA 

ACCCACACAGGATCATTNCNCGAAGGAACNAGGCTGAAGAACCAGACCTGGACTTTCTTA 

NGACAAACTTACTGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATAT 

TTCTCCTGGCTCTACTATCMTACCAMTCATTAGCTGCTGTGCTGTTTTTAGAGCCCTG 

GGAGCCGATCTATGTTTAACACCATNTTACTAACCATTATTGCTATGGTGGTATACACTG 

GCCTATGTCTTTATTCCAAATCCACATTCGCCTGGCTTGGGGAATTT 

Sequence 270 

CCGGGCAGGTACACTGTGTTCCAGACCCTGTGCCAGGCACATGGAAGACACATCTTTGTT 

TATACCCTTCCCATGGTGTCTACATGACACTGCCTGTATTATATTTATCTGTGTTCATAT 

CTTGCACAAGAGGTTTGAMCCAGTGCCCAGGCCTAATTCTTTTCTGGTGGTCCCAAAGC 

ACCCAAAGGCCTTGGGGGGNAAAATTNCCCCNAAAGGGGNAAGGNCCCTTCCNAATTAAG 

N TTG G CTTTT AATTG G AAAAAAAATTTG G AAAATTTTTTG G N AC CGG NTG G NTAATTAAC 

CCNNCCAAAAAGGTCCTTGGGGCCANGGAAGGGGNCNTTGGGGAANGAAAAATGAACCAA 

GGAAAANTCCCTNTTGGCCTAAAAAMCCTTCNAACCANGNGGTAANNTTTNCAATAACC 

CCCCCCAAMGNGGGCTTTNCTTTAGGNCCTTGGNTTCTTCNAATTCCAAAAAAGGGGGC 

CCCAATTCCTTGGGGGGNCCTNGGMGGGGCCAATTTCNAATTTGGGGTTCCMGGTTTT 

CCAANCCTAANGGGGTCANANAAAANGGGAAGGGGGGNCTCAACCCCACNCCAANAGGGG 

TTGGCCTTGNCCTTNGGGTTGGGGCCTTTTNNATTTTTAAGGGGGGGMCCCCTTTTCCC 

CAAA 

Sequence 271 

CCGGGCAGGTACCCAGGGAACAAATGCTACTGGGACTCCACACCTACCTAAGAAGCAGCT 

CTACCCAGACTCCACATGGCTCTCTGTTTTGGTCTGGAGACCCCAGCTGGGGTATCTCCT 

GAGCCCAGGGATTCAAAGGTTCGTGGCAGAAATATGCATCCCACGGGACTCTCACTCACT 

CACCATTTTCTTGTAGGGGGATTCCCCTGGGTCTGTGCCACTCCTGGGTGAATGGCTGAT 

CTGTCTCACTCTTCTCCGTGATCCGAAGGTCACACTATGTCACTGATGAATCCTTATGT 

Sequence 272 

CCGGGCAGGTACACATTTTTCCATTCTATCTCACGAGCTCAAGAGCTTTATACTAACCAG 

CCCAG MGATATTTTCCTTGCTCTAMTAACTGTAATTTTGGGGGTCATTTATACCCCAC 

mm i ATTTCTTTC AAGGACCTCTTACTGCTCATTATAAATGCCATTATC CTTCTCTAC A 

AGCGATCAMTATTATAAGGTCTGACMGTTAGTTTTATGTTTGCATTGTAACTGAAGTC 

CCATCCATCTATCCTTCGGGAGTTTCTGAAAACTCTATATGAACTTCACTCAAAATACAC 

AAGTAGCATGTAAGCGTGCTGATCATATTTTTAAAA1TATGAAG 

Sequence 273 

AGGTACTTGGGAGGGAGCAGGAGGATTGCTTGGGCCCAGAA'GTTTGAGGTTGCAGTGAGC 

TATGATTAAACCACTGCACTCCAGCGTGGGCTACAGAGTGAGAGCCTATCTCAAAAAAAA 

AAAACCAAACCCAAAACAATAAAAAAMTACTTATGTCGCCTATCACATTTGACCTTTTC 

ACTGCAAATTACTCTATATTTGAAATMTTTTATCTTTCTAGTAAAAAAGGGAGATTTCA 

CCATATCTAATGTTACTTAATATAATACTTAATGTTACTTAATGTTAATATAATACTTAA 

TGTTACTGGCTCAGAGACTTTGCCAAAGGATAG/y\ATAATTCATAAAACA 

Sequence 274 

AGGTACAGTGTGGTTGGGAGCAGCTGTGATCCCGCCAAAGACGTGCAGCAGAGACTGGAG 

AGAGCTCGAGCCGCCACCCACTTGTGACCAGCCCTGAGGATGCAGCTGCGTGAAAGAACC 

TGGCAGAAAATAGAAGCCGCCACAGATGCGAATGACGCCTGAACGCTCAGTGTGCTCCCT 

GTCCA CACACAGCCGTGGGCCAAGGAAGCCTTACTGTGCTCGAGGTATTTGAGCCCAGCG 

TTTTTCCAGTGCTGGCTAATTGCTAAGCTGTGCAGACACGGTGGTGACCCCTTGGAAGCC 

AAGGTTGAAAATTAAAATAGGCATGTGGGAAGGAACGGGGTGGCCACGAG 

Sequence 275 
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CCGGGCAGGTACGCGGGGACGTTAAGAACTGGACTTACAAGATGGATAGGACCTGGTGGC 
AGCTGTCCTTGAGGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAGAGTACCT 

Sequence 276 

AGGTACACGCCTGTAGTCCTAGCTACTCAGGAGGCTGAGGCAGGAGGATGGCTTGAGCTG 

AGGAGGCAGAGGCTGCAGTGAGCCAACATTGCAACACTGCACCCCAGCCTGGGCATTAGA 

G C C AGACCTAGTCTCAAAAAATAAATAAAATAAAATAAAATAAAATAAAATGTTTAATAA 

TGCCTACTCAGACAAAGAAAGGCTCAGGCAAAAGACTGAGTCTGATCTTCTACATTGAGG 

AGACCATGTTGATTAGTGTTATATTCAAAAAAACCAATTCCCAAGTAATGCATACACAGC 

TGAAAGAAGGAAGTATTCCAGCTACAC 

Sequence 277 

AGGTACAGAGCCTGGCACGTAGCAAGTGTTGAAATATCTCTTGGAGAGAGAGAAAAAAAA 

TGAATCTATTAAAATGTCTCTATGATGCACTGCGGAAGCTAGACTTGGTAGATCGAGTTT 

MGGGMTGAAGAGGGGGTGAATGTTTGCTACTTCCGTATATTTGGAGTATTTGATAAAA 

CAGAAGACCAGTTTCTCTTCAAACTTCATGTCGGTTTGTAGGCCAGGCTGCCAGGGTTTA 

GGGTACCTGCCCG 

Sequence 278 

AGGTACAGAGGGTGCCAGGCACTGTGGTGCACACCTGTAGTCCCAGCTACTTGGGAGGCT 

TGAGGCGGGCGGATGCTTGAGGCCAGGAGTTTGAGGCTGCAGTGTGCTGTGATTGCTCCT 

GTGAGTAGCCACTGCACTTCAGCCCTGGCAACATAGCAAGACCCTGACTCTAAATATAAA 

AAAAAGTATAGCGGGCACAAATGGGGCGAAGCCAGGGTGGGATGCAGAGTCTAGAGGGAG 

ATGAAGCTGGAGAGGCAGGCAGTGGTCAACATGTGGCCATGGAAAATAGTTTAGGTTTGA 

TTCTGATGGCACTGGGCTGCCATTAGAGGGTTTGAAGCAGGTCAGTGAT 

Sequence 279 

TGTTAAGGGAAACCACAGGACAACAGGTGAGTCACCAAGCAGCAGGACAGCAACAGTGAG 

TTAATCATACAGAGGGCTACCACCCCACTGGGCTGATTTCTTCTGTTTAAAGAACACATT 

CAMTCCAAAGCAAAGAACGAAATAATGTATTTCTCCCCATTTCTCCAAAAGACAATGCT 

GAGCTCATG AC CACATCTCAAAAAAGCTCCTAACCACACCCAG C AGAAC CTC AGCATTGA 

GTATATAATGCTTGGCTGACAATGTGCCCAGGGGAAATACAGTACCT 

Sequence 280 

CCGGGCAGGTACGCGGGTAGGGATGCAGAACTGGCCACAGTCAGGGCTGGCCAACAAGGG 

ATGGGATGGCAAGGGAACCAGTGGGAGACTGTTTACACCTTTGACTTCCTCTTCACAGGT 

CAAGGCAGGACTGTAGCATTAGGTCTCAGAGATGCAATGAACAGGACAAATACACTTCCC 

CTTCATCTGGACCTGAGGCTCTGGGCAAGTCAGTACCT 

Sequence 281 

AGGTACAMGTTGGGAAACAGGTGAGCAAAGAAATCCATGGGTTGAGTTAACAGTGATTT 

CTAACTTCTTGAAGAAAGAGGGCATTTATTTTTCAACACTCTCTTCTTTCCAGCTCACCC 

TAAAATATCTMTTATAAGTTTACGCCATACATTAAAGTGAAATTAAATTGAATATGTTT 

TAAGAAATAAC ATTC CTAATATTCTAGGAAATAAGAGAATTCCTAGGGGCTCCACCATAT 

CAMCCTTTGATTTTACTATCACACAATTTCTTTTCTCCTATTTTCGAACTACCACTGT^ 

TCTAGACAAAATACCAGATGGTTATTTTGTTTCC 

Sequence 282 

AGGTACCAATATTCTGCAGAAAACCAAAACCCCACCTGCATTCCTACTCCTATCATGATG 
TTCTGCCCCCCGGGACCCTTGTTTATNGCTGGTCTCTCCTTTTGAAAGAACGTTACCC'<T 
TTCTTCACCCAGCAAATTCTTACTCAGACTTCAAGACTCAGTTCAGAGTCTGGGGATGAA 
TATTCTACAAATGGCCCCCAGGATTGTCATGGTCCTCACGGCAGGTACCTGCCCG 
Sequence 283 

CCGGGCAGGTACTAGGATTGCAGGCATGAGTCACTACACTCGCCCAGAAATTTGTTTTTA 
GAGTAACTACCATGCCATTAACAAACCTGACAAAATTAACAATTTCTTATCTTCAAGAAC 
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TCAATTCATATTCAAATTTTTATCATCTCAMGCTGGTTTGTTCAAACAGGAGTCCAAAG 

TCCCCATACTGCATGTCATTACTGAGACCATACTGGGACTAAGGCAGGGAGGAATGAGCC 

CTAAGTGAGTCTGTAACAAGCGAGAAAGAAGCTCATTCAACTAAAAGAATTAAATTGGGG 

GTCTGGCCCAAGAAAGTGGAAAAATAAACAAAGAAAACTAAAACT 

Sequence 284 

AGGTACACAGAGACACACACACTGAGACACACATACACAGACACACATACGCTTCTCATA 

GCTGAAAACATGGAAGGCAGTCCATCTTTCTAGACAGTGAGTATCTAATACAAGTCAAGA 

CACAGTAATTACGCTACAGACAAAGGGAGGGAGGATAAAAAGGAAGGTCACTCTGATTGA 

AGAGGAGATGGTTAGTAGGGGGATGGATGCAAAGGCCAGATGTGGGAGAAAGAGACTGGC 

AGGCCTMGGTCMTTCTTATCTTGTCAGTGTGGAGAAAAATGTGTTACCAGCAGCGTTT 

AGTCTAATGATGTGTGAGAGATACACAAAACAAAAAGGGGGGTGAGGGTTTTGGGT 
Sequence 285 

AGGTACTTMGGMGGGCAAGCGTTGGTAGCMGCTTGGCTTTTTAGTGTCCTGGAGCTC 

CCAGGAGAGGAGCAAAGTCTTGTCTAACTAAGGCTTGTTCCTGTGAATATGAAGATGCTT 

CTCTCCAGTCGCTCAGTGGGAAAGTCAAATCCAAGGCTTTGGGGAATGAGAATTCCAACA 

TTAAGGGGTAAGGAGAAGAGAGACTCAGCATTCCATAGGATGCTGGGCTTGAAGATGGCT 

CTCCCTCCTACCACGGCATAGGGATGAGAAGAAGGAGGGGGCACAGGTTAGGTAGGAGCT 

TACTCAGTGCATCTGGAGGATGGAAAACCAATGGCGCACTCCTCCCC 

Sequence 286 

AGGTACAGGAGGGACTTTGTGGAGAATTCAGGGTGGCAGAATAAGCTCCTGAGAATTTAA 

CTTAATCTTATTCTTCTCAAATTTAGCTTCAATGAAGAGAAATGAAAGCTATTGTTTCTC 

ATTC TCACATCTCCCAGAAAGTMCATTGAAGGTAATCTCAGTGTCCTTGAAGTTAGCCT ■ 

TTTrGAGTGGTTAAGTTGAAAAGAAGTTCTATCTTGGTGGGCTCCCCCAGAGGCTAAATG 

GGCCTTTGCATTGGGAGGTCAAAGTCCGGGTTCATGCCATTCTCCCGCCTCAGCCTCCTG 

AGTAGCTGGGACTACAGCCCATGAGGCTACTGAGCTAGAGTTTTTGT 

Sequence 287 

CCGGGCAGGTACCACGGTTTGTCGGTTTGTCGCTGGCCCTGAGTCGCTCCTAGGGATGGG 

GTGAGTGAAATTGATGTGGAGTGGCCCACTCTTGCCGCAGACCTCTGGGAGCCTACCTGT 

GGGAGACCCCATGATTCTCATGAACATCTGAGCTGGCAGGAAGAACTTCCTGGAGAGTTG 

ACAGAGATGGAACTCTAGCCTGCATGGAGCCCAGAGGGTTTGTTGTGGGAACAGTTGCAG 

TGGAGCATGGCTGTGGATATCCATAACACCATACTCCTCTAGGCAGCTCTAGCCTTTGTT 

GGCTGCCAGACCTGGACAATAGAGGGCTGGCTTGCTTGTGGAATGGGACCAGTC 
Sequence 288 

AGGTACAGTTCCCCTGACTGTTGACGACGGAGTCCTGTTTGTGGACCCGCCACCTGGGTT 

TTCTTCCTTGGCATGTGTAGTCCGTGGGTGGGATGGCTGGAGCATCACTGGCTATAACGT 

GTTTCCAGAGCAAGCCTGTCCAGGACAAGAMCTGTGTGTTTTTCCTATGGGAACACAGA 

TTTCCAGGAACAAGCCTGCAGCTTCTCTCGCAGGTCAGAGTGATTTGTCCAATAGCAGTG 

GCTTCAACCAGGTCAAATCACGGATCTGAGAG 

Sequence 289 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACCGTAGCTGAGACTTCTGGAGA 

GACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGa\GAG 

AATTCTACTGGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGA 

GTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGT 

TCACCTTCCCTTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGG 
GGGAGT 

Sequence 290 

AGGTA CTAC ATGTAACAATGACAAAGCAATTCTGTGACAGAATTGTTGTCTGATGGATGC 
CTGAGTTTTCTGATTCTAGATCCCAAATATTTATAGACTGTGAGAACGCACCTTGCTAGC 
ACTGGTCMGCCAATGTTTCTGACTTTITTTATTATCAGTGAAGAGGGCTTTTTGTTATA 
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GTTTTAATTTTCTTTTATTCTAGAAATTGCCTCAGACAGGGATTGATGTCAGGAGCCTTC 

AAGGAGAAAAATATACATGAATGGGAGACCCCAGGTTGCGGGCAACAAGAGTCACAGGTA 

TGGGAGAAACCAGGAAGTCTGTGCCATGTTGACTTTGCTCATCTCATCGTGCTTTGTACC 
TGCCCGGCG 

Sequence 291 

GGGGCCATTGAGACTGCCATGGAAGAC7TGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAG 

AATTCTACTGGTCACAGACAAGACfCTCTTGATCTGCAAATACGACTTCATCATGCTGAG 

TTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTT 

CACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GAGTCCGGAGGAGCAGTCTNTTCTGTCCCGCTGGAACCCAT 

Sequence 292 

AGGTACT^ATGTAACAATGACAAAGCAATTCTGTGACANAATTGTTGTCTGATGGATGC 

CTGAGTTTTCTGATTNTAGATCCCAAATATTTATAGACTGTGAGAACGCACCTTGCTAGC 
ACTGGTCAAGCCAATGNTTCTGACTTTTTTTATT^^ 

NTTCTMTTNTCTTTTATTCTANAMTTGCCTCAGACAGGGATTGATGTNAAGAGCCTTC 

AAGGAGAAAAATATACATNGAATGGNAGACCCCAGGTTGCGGGC 

Sequence 293 

NCCNNCCNNGNACTGCNNCNGATTNNCCTATGATNGTGNNNNCAATNAATNANACTAACN 

NATNTTNCNTCATTTTNNNTCTNAGGGTGNGNATATAANCCCTTAATTNTTATGGGCTCA 

ANNNNAGNNAGNTNGNCANNNNNNTATNTAAAAGNNCTGTAGGGGACATNACCCAATTCC 

CTAGCCCTGCAAGGATGTCTATAGCAAAGACTGCCAGTTGCCTATCCAATACCCCTTCTG 

CCATCTTCTGCCTTTTATGGCTCACAGCTGCCCAGTCCCTCATAGCTAGATGTGACCTTG 

TGACTAAGTTCTGGCCTATGAGAAGTAAGACGTATCTCATGGTGTTTTGAGAAAT 
Sequence 294 

CCGGGCAGGTACATGGAACCTTATGGGACACAAACAGGAGAGTCAGGCAATGTGAAAAGA 

CTTCTACACTATTGTTATTGTGTTTCCAAAAAAAGTAAAAAATAGATGGATAAAGTGGAA 

AGACAACAGAGAGATAGTTTGGCTAGCTGGGTGAGTTCAGTTTTTCTCCCACAACTTAAC 

CTAAGCAAAAGCTTTTAAGAAATTCAGACTGGGAAGGCTGGGTGCGGTGGCTCACGCCTG 

TAATCCCAGCACTTTTGGGGAGGCCAAGGCAGGTGGTGGATCACCTGAGGTCAGGAGTTC 

AAGACCAACCTGACCAACATGGCGAAACCCCATCTCTACTAAAAATACAAAAA 
Sequence 295 

AGGTACI I I I I I I I I I I I I | | | l | | | | | | | j | | | AGGGGAGACAGGGTTTCACCATGTTG 

CCCAGGCTGGTCTTGAACCCCTGATCTCAAGTGATTCGCCTGCCTCAAACTCCCANAGTG 

CTGGGATTACAGGCATGANCCACCGTGACTGGCCAGTATNTTTATTTTAAAGATGANACA 

AGTAACTTGCTCAGGATTGCTCACGGAGTAAGTAGGGAATGTGGGATTTGAATTCAGGCA 

GTCTGATTCCAGANTCATTCTCTCTTGCTCTGTATGACTGTGTGTTCTGTGACCAATAAA 

TACACACCCCCAACTTGGAGGAAAGGAATATCATTTCCAAGNTGAACTACCC 
Sequence 296 

CCGGGCAGGTACTAGGTGCMCTGGGACAATGCTGTGAATTTTCATGAGGAGCTTTGATG 

GTAATGTAATGGGGAGAAAAAAAGATGTTATTTTAAAAGCATGCTGAAATTGTTTGAAAT 

AGTGGGAAAAAAGCAAACAATAAACTGGCAAAACCAAACTAAAGACTACTATACAACCAT ■ 

TTAAAAGAATGAAGTCACAATATTGATGACATTGGAAGGAGCTCCAAGATTTAGGTGGTG 

GAAGAGGGAAGGCCAAGTTTCAGAAAATGTGCACAATATGATACCACTTATGTTAAAATA 

CAAAGTAAAACTACAAAAACTGAAAAGGCTTGGAATGGTTACCTCT 

Sequence 297 

C C G GGC AG GTACTAAT TTTAC AAA ATATG G AGG CCTTTCCTAGAATAAC AAAATC ATTAT 

AGGTTTTTAGTGGCAGGTTTTTTTGTTTGTTTGGCATG 

AATTTCTTGATCAGGAAACCCAGTTCTAACATTTCT 
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TAGGAGTATTTGGAAGTCAAATGTGTAAGCGACTTCTGTCAGGTTGACTTAAAAGTAGTC 
ACCTCCTATAATAGCGCTGTTTCAGGTGGTTTCTTCCTGGACACCTGGACATTTCCCAGG 
TGTTGTCCCCTCTTACTCTTCAACCCTGCAACCTAATTCTACAAGCTTTGCCC 
Sequence 298 

CCGGGCAGGTACCCTATACCTTATCCCCTCCCACCTCTTGAGCCTGGCCTTTGTGTAGCA 

AATGTTGC7TAAGGGGCAGTGGCTTCTCCCTTCTTCTGCCTTGACCCTGCCCAT1TATAG 

AGT GACCTGCCTATGTATT CCTTGATTTTTGATTCTGATAAAGCACTTGACATTTTAAAC 

ATTCI I I I I I I I I M I I I I ICTTTTTGANACGGAGTGTTGCTCTCTCGCCAGGCTGGAGT 

GCTGTGGCGTGATCTTGGCTCACTGCAGCCTCCGCCTCCCGGGTTCAAGTGATTCTCCTG 

CGTCGGCCTCCCGAGTAGCTGGGACTACAGGTGCACACCACCATGCCCGGCTAATT 

Sequence 299 

CCGGGCAGGTACTTCCTCTACTTTCTGATAAGCTTTAAAAGAAAAAAACTAAACTAAATA 

TTTCAAAATCAATGGACTTCMGTAATCCAAGAGAATTTGGAAAACATTTCTCTAAAAAC 

TAATCCAGAACACAAGGTGTCAGTTTGGAAATAACGGAAGCATATTTCGTTTTAAAAATC 

TAAAACATTTTAAAGACAAAMTTCAAGTTAGCATCCCAGTGATTAACTCTGGAACTGCC 

TAGCCAAAAACAACTTTCGGCACCTAAAACATAAAACTGTCAGTTTCCTCTGCTTGGAGA 

CAAAACATACAAAMGCAAACATTCACTGTTATATTTTCCTAGACATGAAAGA 

Sequence 300 

CACACTAGCTGATMGACACTGTTGCCCGTAATGTCTATTTATTGGATCACNCAATTTAT 

AAGTCCCACATTCTCATGCCACATAGCTCTACACAGCTGCAAAAATATACCATANCTTGC 

AGGTNATCATTGGTTTGATAAAAGATATTGAGTCGCTCATCTTGTGAAAGTGATCTTTGA 

TATAAGAGGAGCATNAGCGGGGAAGCTCACATGTCCCGTGGCTCACACACCANAAGGNAT 

TTGTGTCTTGTCATTGTCTGTCTGGCAGTCCATGGCAATGGCTTTTTCAGAGAGGCCTGT 

TTTGNGGTTAACTGNGTGTACCTCNGCCGAG 

Sequence 301 

CAACAAGATTGAGAATCCGGCAGAGCTGAACCAGTGACAGATTCGCAAGGTGGACTATTT 

GGTGGCACGACCCTGGTTGTGTTTGCCACTGCTGAGCTTTCCTTGATTTCTGCTCATTTT 

CAGATCTGGTTCTCCCATGTTCTTGCTGATTCTGAGAGCTGCCCAACCTACTTCAAtAAA 

TTCCTTTTCTGCATAAGACAGCCAGAGTCAGTTTCTTGGTTACAGACAAGACTTCCAACT 

CATAGTCACCCAAAATATACCCATCCTAACTCCCGCTGTAATAATTAGCCACATTCATTT 

GTTTAMTAAGCAAAGATCACTCAMGTTCCTGAAAACTCCATTATTTAATGACTGTGTC 

TATCAATACCAAACTTCCTAATTCAGAAGGACAAACAAAAGAACAAAAGAATAACAAATA 

CNATCAAMGGTTAGGTCANMGTGCCGGATTTNGAAAGMTANATGAAATATGCATTTT 

C ATTTTC AG N ATTTTTATTTATTTTTTT 

Sequence 302 

GGTGGCGGCCGAGGTACNNTTTTCTTTGTTTATTTCAAGAGGGTAAATTGGGCAATCTAC 
ACAAGTTTGGGGTCCTTGCMTCMTCMGACCCGAAGGGAGTAGCGGTTTCTCAGTTTT 
ATTACTCCTGTAAATCCTTCTTCTAGTTTATTTCTGGTC7TGCAGAAATATTTTTCCTAC 
CAGCTGCATTGGACATGGTGAAG I i I I I I I GTAACTTTTGAAGATTTTCAGTTTTCTGCA 
TTAAATAATACTTTGGAGTATATACAA I I I I I I I GTTCTACATCATATATCCACACTGCA 
CATAGATATTTTCTGGCAGMCAATATCCCATAAAAMTCTTMGCTTCTTTAGTTAGAA 
TACCATAACCTTCATATGCTACATGGTAACATTACCCGCGTACCT 
Sequence 303 

AGGTACCTCCGCATCCCTTTCCTCCCCAACTGGCTTGAGTGGCTTCAAGCTCTAGCACTG 

CCTAACAAACAAACAMCACCTNAGTGACTTGTGATTTCCCCCCAGAAAAATATCTATTA 

ATTATTTCCCAATGGTCTCCACCTCGGCATCAGGGCACAAGGTGTAGGCACATGCTCTGA 

TCTTGCACCTAGGAAACCGTCATTATCAAGGGCATCCCCTAGTCCACTGTGCTCCCCGGT 

AAGTACCTGCCCG 

Sequence 304 
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AGGTACTTTAGCTACTCGAACAGCATGCAGACAGGGTAAGAATCACAGAACCTGGCCGAA 

TACACAGCTCATTCAGCATCAACCACAAGGTGGCGCTGCTTCAATGCTGATTCAGATGCA 

CAGCCCTGCTAGAGACGTTTCTCCACCATCAAGGCTACATTTGCTTGTGTCCTTTCCAGG 

CCGGCCAAAAATGCTTTCTCCTTAAGGTTTCAGGTAATACACATATGGCTAACAGTCTAT 

ATTGAGAAAGTAAACCACTGAGGCACAGCAACTCCCTACCCCCATCTTAGGACAGACTCT 

GACTGCACCTGAGAGGACTTATCTCAGCCTCCCTTAGTTAACAAGGCAACAAAGTGGGGC 
AGAAAGG 

Sequence 305 

AGGTACTCACATGCTGAGGGTMGAGGCAGMTTTCTTTTGCTTGTAAGTAACAGGATCT 
TTTCCAGAGCCTGGGTCTGAAGACCCCAGGTGTAGATGGAACTTCAGTTATGACTGGATC 
CGTGAGTTCAAAGGATGCCACTCCCCGCGTACCTGCCCG 
Sequence 306 

AGGTACAGTTCCCCTGACTGTTGACGACNGAGTCCTGTTTGTGGACCCGCCACCTGGGTT 
TTCTTCCTTGGCATGTGTAGTCCGTGGGTGGGATGGCTGGAGCATCACTGGCTATAACCT 
GTTTCCAGAGCAAGCCTGTCCAGGACAAGAAAeTGTGTG ) Willi I A TGGGAACACANA 

TTTNCAGGAACAAGCCTGCANCTTCTCTCGCANGTCANAGTGATTTGTCCAATANCAGTG 

GCTTCAACCAGGTCAAATCACGGATCTGANAG 

Sequence 307 

CCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGG 

GAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCT 

AGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCAT 

GTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGA 

AGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAG 

ACCACAGACTCAAGACGGACATAATTGGATTTTTTTTGCCATGGCCTAGAAAG 

Sequence 308 

AGGTACCTGTCTC ATTCTG GCAACTTGTCACATTTCTCAATCCAAACAATATCTTTAAGG 

TATTAAACCCTATGTTTTTGGCTGAAGAAACTGACCTCAGAAATGACAAGCAATGTATCT 

AAAGAGCAGTAAGTAGGCCGGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGAAG 

GCTGAAGCGGACAGATCACCTGAGGTCGGGAGTTCGAGACCAGCCTGACCAACATGCAGA 

AACCCCATCTCTACTAAAAATACAAAATTAGCTAGGTGTGGTGGCAGGCACCTGCAGTCT 

CAGCTACTCAGGAACCTGAAGTAGGAGAATTGCTTGAACCTGGGAGGCGGAGGTTGCAGT 

GAGCTGAGATCACGCCACTGCACTCCAGTCTGGGCGACAGAGCAAGACTCTGTCTCAAAA 

AAAAAACAAAAACAAAAACAAAAACAAAAACAAAAAAAAAAAACGGTAAGTAGCAAATCT 

GAGATTTGAATTGAGAATTGACTCTAAAAGCCCATGCTGCTTCTAAAACAGAACCTTCTG 

CACCTTCTCTCTATCCCTAATACTTCTTTTCCACC 

Sequence 309 

CCGGGCAGGTACGCGGGGACGTTAAGAACTGGACTTACAAGATGGATAGGACCTGGTGGC 
AGCTGTCCTTGAGGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAGAGTACCT 

Sequence 310 

AGGTACGCGGGGGGAATGGCTTGTGCCTTACCCCATTCTGATCGGTCACTGATGCAGGAA 

ACAGGGAATGGCGTGCCTTACCCCATTCTGATCAGTCACTGATGCAGGAAACAGGGAATG 

GCTTGTGCCTTACCCCATTCTGATCAGTCACTGATGCAGGTAACAGGGAATGGCGTGCCT 

TACCCCATTCTGATCAGTCACTGATGCAGGTAACAGGGAATGGCGTGCCTTACCCCATTC 

TGATCAGTCACTGATGCAGGTAACAGGGAATGGCGTGCCTTACCCCATTCTGATCAGTCA 

CTGATGCAGGTAACGGAATGGGCGTGCCCTAGGCAAGGCTTTGCTTTCAGATGAAGCAGG 

GCTTAAATTTGAATGACATGAGGGCTATTTTATGCCTTCCTGGGACTCCAGATGGCGCCT 

CATAGGATCTCACACAAGTGAATCATTTATATCATGGGTTCGTCATGTTTTTAGCATGTT 

AACAGATATGCAAAGTGTAATGGAGAATCAATGAAAAGAAAAATAATGTATGAATGGAAA 
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TATTAAGAAAGG 
Sequence 31 1 

CCGGGCAGGTACAGATCCCAAGGCCATTGTTCGCATAGACAGTGTGCACACCTGGTCAGT 

CAAACAGCCTTGCCCGTGCCTGTGCCTCTCCTGTGGGGCCTCTGAGCCTGGACTCTTCAC 

CCTAGCTCAACCACTCCTCTGTCTTTTTCTTCACTGCTGTTTGCTTATTTCAAAGGGAGA 

AAATCTGTTGGTCACAGTGGTCACCACCGTATCCACTGGCCAGTCTTTCTATGCCACCTG 

CAGTCTGCTGGCTGATTGTCCTTGAGTTACATATCCACTCCAAGCAAACATACACTCCAA 

TCACCCATGGCCGAGATGTTGGGTTCACACTTGGCACTAGTGAGATTTCTTATGGGAGTG 

GAT 

Sequence 312 

AGGTACTGCCACAAATTTGCTATCTGGTAACATATGAATGGTTTACTCTCAACAAGTCTA 

CCGAGACATATTTCCTGTCTTGTTAAGACACTTTAAGAGAAGAGTCTAGTAATTACACAT 

GTAATGACTTTGCAGTGGCTCACGCCTGCAATCCCAGCACTTTGGGAGGCTGAGGGGCGG 

TGGATCATGAGGTCAAGAGATCGAGACCATCCTGGTCAACATGGTGAAACCCCGTCTCTA 

CTAAAAATACAAAAAGTAGCTGGGTGTGGTGGTTCATGCCTGTAATCCCAGCTAGTTTGT 

AGGCTGAGGCAGGAGAACTGCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCGAGATCG 

TG 

Sequence 313 

CCGGGCAGGTACGCGGGGAGAGCTCAGGACCTCTGAGAAGAATGGAGCCCTCCTGGCTTC 

AGGAACTCATGGCTCACCCCTTCTTGCTGCTGATCCTCCTCTGCATGTCTCTGCTGCTGT 

TTCAGGTAATCAGGTTGTGCCAGAGGAGGAGATGGACGATCAGAGCCATGCACCTGTTTC 

CTGCACCCCCTGCGCACTGGTTCTATGGCCACAAGGAGTCTTACCCAGTAAAAGAGTTTG 

AGGTGTATCCTGAGCTGATGGAAAAATACCTATGTGCCGTTCCCTTGTGGGTTGGACCCT 

TTACGATGTTCTTCAATATCCATGACCCAGACTATGTCAAGATTCTCCTGAAAAGACAAG 

GTAAA 

Sequence 314 

AGGTACTGCATMCTAAMTTGACTTTTTTTAAAAGCCAGAACTATGTCACAGTTGGTGC 

TTATAATTGTTTCAAAATTATACCGACTTTATTTAATTCCATGCAACGCATTTCCACATA 

GAAATGGAATAGCAAACTCCTACTCAATAGCAAACAATAAAATAAAATAAAATAAAGTAT 

GTMGCTTTTTCCTCTTCTTAGACACTGTCCCCAGAATGGAAAAGTTATAAAGAAAGGTA 

GGTAAAAATCCTGCAGCTTACACATTTTTCCACTGAGAAGGCAAATCCTTACTCCTCCAG 

AAACTAAAATATTAGACCCCAAACAGTGTAGGGGGAAGAGGGGTAGGAAGTTGAAAAAAA 

GAAAAAA 

Sequence 315 

CCGCGGNGGCGGCCGACGCTAGATTNTTTGAGNGGCAAAANAANCCNCTCACATCATGAG 
TGAGTCTTCTAACTCGGACAATGCNTTCAACAAATTCTAAGACCAAATCCCGGAGGCACT 
GGCTCCAAAAAAAAAAAAAAAAAAAGGACCTCGGCCGCTCTAGAACTAGT 
Sequence 316 

CCGGGCAGGTCCGGGCAGGTACTTTGTTGGGAATTAAGGATGATGATGCAGATCTGAGfC 

AGGCTCTTTGTCTGGCCATCTCCGTGTCAGAGATCCTTCAAGCGAATCAGCTACAAGGGG 

AAGGAGTCCGGTTCTTTGTGGTGGATTGCCGTCCTGCANAACAATATAATGCTGGGCATT 

TATCAACTGCTTTCCACTTAGATTCAGACCTGATGCTTCAGAATCCATCTGAGTTTGCAC 

AGTCAGTAAAATCCTTGCTGGAAGCACAGAAGCAGTCCATTGAGTCTGGCTCCATAGCTG 

GTGGGGAGCACCTCTGTTTTATGGGCAGTGGCAGGGAGGAAGAANACATGTATATGAACA 

TGGTCCT 

Sequence 317 

AGGTACMTGCCCTTGGAGGGAGAGGGGCTTCATGGAAAATGGTTGCCATGATAGGATTT 
GGGGAGAACCTCTATGTTTGAGGGAGTTTGGGTCAGGACCTATCCGTGGAAGCAAAAACT 
AGTGCCCAAATATTCAAAACATAGTTTGGTGAAAGATCACCTATGATTCTGAACGAAATA 
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GTGTAGATAMGACATCTGGAAGTTTTCCTATTGAAAAAAAGAAAGTGTGAAATCAGATA 

CTTTTGTTTTTAAGGCTCTCTTTCTTAAATTTCCTAACTTGGCCAAAATGTCAATGGTAC 
CTGCCCG 

Sequence 318 

CCGGGCAGGTACCATCGCCCTTTCCTTATCTATTGCAAACCCAGAGTAGGGAGACAGGGA 

ACCAGCAATGTCAGAAGGAGCCGATATTTTGCATGGAGCTAAGGCTTCTGCTCATCATTT 

CTCCTCTAGGAAGCAGGAAGCCATTGAAATCCATCCCTAATGTATCCCAAATGTCCACTT 

GCTGAGGGCAGGATTGCACAAGCAACAATCTTTTGCTCATGAAGCTTAGGGAGCACTGTC 

CCAGGGTATCAATTGGCCCAAAGCTGGTCTCTTCCTACAGACCATGCGGCAAATCAGAAC 

ATAAGAGGACTTGAGGCCCACAGCCCTGAGTTTCAATTCCAGTTCTGTATTTTAGGTCTG 

TGTTCGTAATTAGTATCATAATTAACTGAGAAGGTTATTTAGATTTATTGAGCCTCTTTT 

TTGTAAAATGGACTATTCATATACAACTGCAGAAAAACCAAATCCCCCCAAAAAACCCTT 

GCTACTTAATACAAAAACCTAACTGCAGGCACTTCTACTTTGCGTGGTTGACAGAATAAC 

CCATCACTATCACCACCAAAACAAACA 

Sequence 319 

CCGGGCAGGTACTTAGGACTATGGAGAAACAATGGAAGACATTAATTCTGCTTGTCTTGA 

TGGTAAGGGTGGGGATGAAAATACAGATCGAATTTGGGGGAGCATGTTAAGGTTAGGGGA 

ATAGAGATAGCTTATATTTAGAACAAGTTAGTAGAGACTGTATGATTTTCCAAATAGAGA 

AGGGTAAAAAGACATTCCATAGAGGAAGAAAATTACATGCAAAACCACAGAGACCTTAGT 

TTATAAAGAGTAACTGGGTATGGCCGGGTGCGGTGGCTCACGCCTGTAATCCCGGCACTT 

TGAGAGGCTGAGGAGGGTTGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAACACG 
GTGAAACCCGT " " " 

Sequence 320 

AGGTACACTTTAATATCCCAAAGTGGGCTTGGCAGCTCATCTGCGCAGAGATTCCAGCAT 

TCTCTTTAATACCACCACGGATCTGGGGTGTTTGTTTTGGTGGTGATAGTGATGGGTTAT 

TCTGTCMCCACGCAAAGJAGAAGTGCCTGCAGTTAGGTTTTTGTATTAAGTAGCAAGGG 

i I I U iGGGGGGATTTGGTTTTTCTGCAGTTGTATATGAATAGTCCATTTTACAAAAAAG 

AGGCTCAATAAATCTAAATAACCTTCTCAGTTAATTATGATACTAATTACNAACACAGAC 

CTAAAATACANAACTGGAATTGAAACTCANGGCTGNGGGCCTCAAGTCCTNTTAT 
Sequence 321 

AGGTACCTGAGGCTCTGTGCCAGTGGGGATGACGGGGGCTGATGTCTCTTCCCATGGGCT 

CCTATGGATTTGCCCTTCAGATGACCTGGTAGGGCTCTGAGTGACTGACTTAAAGACCTT 

CGTAACGCCAAAGTGGCMGATTTTGGAGACAGATTTGTGTCTCCTCATCCATGATTTTT 

GACTTTMTCTTTTCTGAGAAGACCAGAAATCAGAGAGTTTGTCTTCTCAGCCTCTCTGT 

GCAAGGATCTGCTGAGCTCAAGCTTCCCTTTTCAGCGTCAGCTTTTTAAGTTGAGAATAA 

TAATTAAAGAAAACTCCAATCTCAATTCCAACATCAAGTGTCCATGAAGCT 
Sequence 322 

CCGGGCAGGTACAGTAGTCACTCTTGTCACAGTGGTTATTGTATCAGACGATACAGAACA 

TGTTAATCATCATAGAAAGTTCTGGACAGTGGTGCTCTAGAGTCTCAACATGTCACGTGT 

TCGGGGGATCACTGGATCTAAAGCCATTCCTTATGGGAAATGTTGGCTTTCTTGAAMTA 

CACCGGAGACCAGAGAAGGTCACCATGTGGAGAAGAGGCAGCCTTCACTCTGCCAAGAAT 

GCTGGCCAGGCTTCTCTGTTTTATATCTCATCACAATGAGGGCCATTCTGGCCCTCCAAA 

ATATGGATACCAACCTTTACGCACTGTCAGAAATGGGCCACAATTAAGTGT 

Sequence 323 

AGGTACCAGAGTCCATGTTCTTAAACCATTAATCTACGATGCCTCTCAAGTAAATAGTCA 

ATTAGTGGTAGGTGTGAGGCTGTAGGAAAAAGAGGAGACGAAGGGACAGGAAAGCAGTCC 

TCATTGGAAAACTAGGTTTCGACTGGCACTGTTGGACTGGGCTGCAGTTTAGGTCCTCCC 

ACTGCTGAGTTCCCTCACTTCATCTCCTGGTGCTTCTGGTTTCCCATTTATAAAATGCAA 

GGGAAGGATTCTATTTATGTCACAAGGTGCATCTAAGGAGTAAAAAGTCACGTCATGTTT 
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GACCMTGTTAGTCAGATCMGATTTATTTTAGATAAATTATCTGTTAGT 
Sequence 324 

CCGGGCAGGTACAAGCAGTAGGCTGAACTCATGTGCCAGGCAGAGGTCTGATGAACACCA 

CCGCCAGTCCACCGGCACTGGGGTCAGTGTG7TAGCGGGGATATGTGGTGCTGTTTCCCA 

GAGTGCAAAAAGGACAGGGAGGCCCTGTCACATGAATGCTGAGTCAAATTTATATATATA 

GAAACAAGACATTGCCTGGCAGGGAAGATGAGCCATGTTAGTGTGGGCTGTGCCACCACT 

TCACCATTGCACTGATTACATTGTCGCACAGATAGCTACTGCCCATAATCCTGTCTCCTG 

GCTTMGTAAGTCAGAAATCACTGCTTTATAAATGAATTCCTAAGGTGGTTAGCAAT 

Sequence 325 

CCGGGCAGGTACTMGTTGMGTAMGCTCACCCTTTACTTCCCTACTTGAAAGTTCTAC 
TCTGAGCCTTGACT CTTAGCCACAGTGAGGCATGTTGAAGG7TCCTGCTGGTTTATTCTT 
I I I I I I I II I I I II I AGACAGAGTCTCACTCTGTTGCCCAGGCTGGAGTGCAGTGGCACG 
ATCTCGGTTCACTGCAAGCTCCGCCTCCCGGGTTCATGCCATTCTCCTGCCTCAGCCTCC 
CAAGTGGCTGGGACTGCAGGCACCCGCCACCACGCCTGGCTAAI I I I I I GTATTTTTAGT 
AGAGACAGGATTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGA 
Sequence 326 

AGGTACCTACCCCAGTGTTCCTCCTGAGATGGCCTATGGTTGGATGGTTGAGCACAAACA 

CAGGCACCGTCACAMCACAGAGMCGCCGTTCTTCTGAACAACCCCAGGTTTCTATGGA 

CACTGGCTCTTCTCGATCTGTCCTGGAATCTTTGAAGCGCTATAGATTTGGAAAGGATGC 

TGTTGGAGAGCGATATAAGCATAAGGAAAAGCACCGTTGTCACATGTCCTGCCCTCATCT 

CTCTCCTTCAAAAAGCTTAATAAACAGAGAGGAACAGTGGGTCCACCGAGAGCCTTCAGA 

ATCTAGTCCATTGGCCTTGGGATTGCAGACACCTTTACAGATTGACTGTT 

Sequence 327 

AGGTACTACAAAGATCCTGGGGAGCCTGTAGGTGACACGTATGATTTGCTGCAAGAAATT 

TCACAATTAATAACAATTCTAGGTCTATATGGTTGATTTGTGATAAAAGTCACCTATGGC 

AAAAACAAGAGAGAAGTTGTAGGGAGTAGTTTGGGAAGTGGTAAGGTGAGCAAAGGCAGG 

MTCTTCTGGACACCAGGAATGTAMTTGGGGGAAGAAAAATATTCCAAATTTTCTAGTC 

ATTGAGGAACTGCCAATAACATTAATTATGAATCATCTATGTGAGCTGTAGAATGGTCTT 

TTCTGTGTTTGTTCATCCATTCAATAAATACATACTG 

Sequence 328 

AGGTACGGGGGGACGTTCAGAGGTGAGCCCAGAGGGGGTAAAGTGGACTGGGGAGAACTT 

CGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTC 

ACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAATT 

TTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTG 

AATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGA 

CTCAAGACGGACATAATTGGA I Mill! I GCCATGGCCTGGAAAGAAAGGTAC 

Sequence 329 

CCGGGCAGGTACGCGGGTGCCCGGCCTCAATTAI I I I I CTTACACTAGTTCC AGAACGTT 

ATATGTATATTATATGTGCTCATACA I I I I I ACCTTTCCTAGCCTC ATTTAAAAATGTTC 

CAGCTGGCTCACACCTGTAATCCCAGTTACTCAGGAGGCTGAAGCAGGATAATCGCCTGA 

ACCCAGGAGGCGGAGGTGGCAGTGAGCTCAGATCACACCACTGCACTCCAGCCTGGGTGA 

CAGAGCAAGGCTCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAATTAGTACCT 

Sequence 330 

AGGTACTCACATGCTGAGGGTAAGAGGCAGAATTTCTTTTGCTTGTAAGTAACAGGATCT 
TTTCCAGAGCCCGGGTCTGAAGACCCCAGGTGTAGATGGAACTTCAGTTATGACTGGATC 
CGTGAGTTCAAAGGATGCCACTCCCCGCGTACCTGCCCG 
Sequence 331 

AGGCATGCTGTCTAGCACATCACAGACCCTCAATGAATACTATAGGTTGAATTAAATGTA 
GGACCTAAGGGAGTAAAAAGAACAGCGTCCTCCAGCATTTATGCAGCTGGAGGCCAGGAA 
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TCTT CCCTCTGGCTTTGCACTACAGW 

TTTTAGGCTCANGAGAATACATGTGCATGTTTCTTNTATAGGN 
Sequence 332 

GTACNCGGGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTT 

CGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTC 

GCAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAATT 

TTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTG 

AATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGA 

CTCMGACGGACATAATTGGATTTTTTTGCCATGGCCTGGAAAG 

Sequence 333 

TNGGGCAATTGGAGCCTCCCCGCGGTGGCGGCCGAGGACACACAGTTAACCACAAAACAG 

GCCTCTCTGAAAAAGCCATTGCCATGGACTGCCAGACAGACAATGACAAGACACAAATAC 

CTTCTGGTGTGTGAGCCACGGGACATGTGAGCTTCCCCGCTGATGCTCCTCTTATATCAA 

AGATCACTTTC ACMG ATGAGCGACTCAATATCTTTTATCAAACCAATGATCACCTGCAA 

GCTATGGTATATTTTTGCAGCTGTGTAGAGCTATGTGGCATGAGAATGTGGGACTTATAA 

ATTGCTGATCCAATAAATAGACATTATGGGCAACAGTGTCTTATCAGCTAGTGTGTACCT 
GCCCG 

Sequence 334 

GCNAATTGGAGCCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAGTGTCCCACGCA 

ATGGGTTGGGCCAGGGAAAGCTCAGTGGTTTGGGTCTTACACTCAGCCTTGGGGGAGGGG 

AGGGGACTGGTGGGGAGAGGTGAAGCTGGGTAGAGCTGGACTGGGCAAGACAGCCATTTG 

GTCCCCCAGTAGCAGGCACAAGCACTGGCTCTGACAGGGATCTAGGGACAGTTGCCAAAC 

CCTGTAGAAATGCCTAGGCAAAGATGAGAGCAACTGACATTGCATCAATGTCCCAGCACA 

GGAAGGCAAGGGCAAACCCATTGTTAAAACCAGGCAAACAGTATGGGACCTGCCTCTCTC 

TCTCTCTCCAGAACCATCAGGACTC 

Sequence 335 

CCGGGCAGGTACTCTGCGTTGTTACCACTGCTTTCCCGCGTACATGCTGATATAGGGACA 

GCTTAGAGCATTAGGGAGCTGCTAAGAGAGAAGAGGGCTGGAATAGCAAGCCAAGAGTTA 

CGTGGGCTCAGAGGGAGGAGGAAGCCCTCCCAGGTGGCAGTCTGCGAAATGAGGAAAGGT 

CCCATAGAGAAGATGTCATTTGAGCAGATCTGTTTTGATAGACAGTATTTAGATGCAGCC 

TAAACGAAGAGAACTGCATGGGCAGAGACTCTGAGGCAGGAAAGTACCT 

Sequence 336 

CCGGGCAGGTAC7TTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTCAAAC 

TCCTGACTTCAAGTGATCCGCCCACCTCGACCTCCCAAAGTGCTGGGATTACAGGCATGA 

GCCACCGCAGCTGGCCA^^GGCAGAGCTTCTTATAAGGTGAGAGGAGTAAAAGAGGAGCAT 

GTGTAGGTTGAAAAGGTGCACAGGTGAGTGTGTTCACTGACACACATCTTATCCCCTGAA 

AMTATTTAAAATGWATCJAATACATCCCAGGAAAAAATAAATCAACTAATCAACTCTG 

AAAATCCTTCATCTCTACTTAAAAATGACACCACGCTGACTCTACATAAGATTAAAACAA 
AAT 

Sequence 337 

AGGTACTGGAGGTCCGAGCTACCACAGAAGACAACAAAAATCAATATAAGGTATAGGAAG 

ATTGGAAAGAAAGAAA/\CAAACCACCTAGGCTGAAATCCCAAACAAACTGGACACGCAAA 

TTATTAAAGGTGAT7TCATAAATGTTATGTCATTTAAGACAGCCATATAAAAGGAATCAT 

TATTCCCATTTXTCTATGAGGAAATAGAGGCTCAGAAAGGTTTTGAGCAACTTGCCTGA 

AGTTGAAGTCCCCCCAGCATATTMGTGCTGAAGCCAGAATTCAAACCAATTGATAATTT 

CAGTGCTCTTCACCGTAGACCACATGACCTCCCCTCAGTTAATATTCACCCGTTTTGCCA 
GTG 

Sequence 338 

AGGTACCTCCGCATCCCTTTCCTCCCCAACTGGCTTGAGTGGCTTCAAGCTCTAGCACTG 
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CCTAACAAACAAACAAACACCTCAGTGACTTGTGATTTCCCCCCAGAAAAATATCTATTA 

ATTATTTCCCAATGGTCTCCACCTCGGCATCAGGGCACAAGGTGTAGGCACATGCTCTGA 

TCTTGCACCTAGGAAACCGTCATTATCAAGGGCATCCCCTAGTCCACTGTGCTCCCCGGT 
AAGTACCTGCCCG 

Sequence 339 

AGGTACACAGGTAACCACATTTAGATGGACTGGGATGTTGCCACACATACAAGCATTGAT 

AACTGACTTCTCATTACCTGAATACATTCTTCTGTCAGAGCAACAGACTCAGCTATGCTT 

CTGGCAAAATTGTTCTTAATTCTCTATTGATTMTTTATTCGGTMGTATTTATTGGGTA 

TmCTGTCTGAAMGTGCGATTCCAGGTGCTTTATGTGTCTCTGTGTGTGGGTGTTATA 

TAAATACTTATAATACTGTATCCATACTCTTGAAAAGCTTAGTTGGGAAGGCAAGGCATG 

CAATAAGGAACACAGAATTTTAGTCATTCCACAACCATCTGTTGAATGGC 

Sequence 340 

AGGTACTCTAAGACTCCTAATCCAGGCCAATTCCTAGTGGACCAATAACAAAAAGCCCAA 

TTACCTTCTTCA ACTAC ACTTAAAATCATAAAGCTTCAAAGATGCAATGACCTAATACAA 

CCCACCTTCCAGTTTTGGAAGCACTGTGCTCCATGCAATATTCTGCGATGGTCAGTCTAG 

TATAAAAGGACTTCCTGGAGGGTCTGCTCCAAATACTAATGCAGAGGTCACAAATTGGTG 

GCCCTCAGGCCACATTTGGCCCATTAAAGTATTGTTTTATTTGCTTCATAGTCTTTAAAA 

AAAAAACTCMTTAGTCATCMCATTTACAGAATAAAGATGTCACATAAAMTTTGTGTT 
ACTAGCT 

Sequence 341 

AGGTACTACAATTAMCCGCAGCCCATCTCCAGATATTCAGACCATTTAATCTCCAAATA 

GCAGGCATTTAACACCTCATCTGTGACCTGGGGGTTGACAGAACCACCTAGGACTCAGGA 

CAGATAGCAGGAAMGAGGTAACACTTAGGTAACACCTTCAGCTTGTCAAATTGTATCAC 

TGTTCATCAAAGGGAAACACACACATGTAGCCATGAAGTTTGAAAGGCATCCTGGACTAC 

CTCTGCTGTGGAAGAAGAGGCTTGCCAGGGTTTTCAGAACCCGAATTTGTGCTGTGGCTC 

TGTGTTGTGATTCTAGAGGGTTCTATAGTCCTAGCATCCCAGGGCCTGAGGCTTACTAGA 
CA7TTC 

Sequence 342 

AGGTACTGAGCTGAGCCATGCCAGAGAGCCTTTTATTGCMGTTTCTCTGGACTGAGGAA 

GAAATTTCCTCTGAGCACTCTGAGAAGTAATCTTATATATCAAAATTATATAAGGAAGCA 

CACGGCTAAGAAGATGAAGACCGTGCTGGGCCCAAGCCAGCGTGTCCCATGTGATAATTC 

TCAACMGTCTTTGGGCTCTTTCTTGGATCACATCCCTGTAGGGGAGCTGAATAAGGAGA 

GAAGTGACATTCCTACACTGCTTCCTGGTTTATGAAATGTTCTTACCTACCCTAGATCTT 

CACAACMTCCAGGAAGGAMCGGGCATTACTGCATTGTTTTAAAGGTATGGATACTAG 
Sequence 343 

AGGTACACAAGGGTTGGCCTGAGTCCCTGTGTTGTGAAAGTTGGCCTACAAAATGAGTCC 

ACAGGGACTGTCATGGGTTTTGAGTTTATGGAGGTTTGTCCATTGCTGGAGTCTACTGGG 

ATGAGCCTTGGTCTGCTGAGCAGGTATGGAACCTGAGTCTGGGGAAGTGTGGGGCCATAG 

AGGTCAGACTGAAGGGTAGGGTTATGGGGACTAGCCTAGTACCTGCCCG 

Sequence 344 

AGGTACCTTAAGAA.TTGCTGACACATTTATTACAATTTCAAGGCTACAACTCAAATCCCA 

GGAACACAGGATTGCAGTGACCTACATAATTTGGGCTCAAATACCTTCAAGGCATTCCGA 

AGAGACAGAGAACACCAGAACAAGGGAGAATTGGAGTGGAGGTGGATGCTGAAATTCCAG 

TNTAAAAATCTACTCTGGCTGAGGAATAAGACAAAATGTAAGTGATTTAGGAAAATTATC 

CCAGACAAA TGAGA G GTGAA CTTTGGAAGGGGGTATAAATCAAGATAATAATTTCCATAT 

TTGCTGNTTCTTTTACTTTTTGCAGAAGAAATGC 

ACTGAG 

Sequence 345 

CCGGGCAGGTACGCGGGACCCACTGTGCCCAGCCCTTGAATTTCTCCTAGCCCTTCAGTT 
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CCTCAAGCTTCATGCCTCCCAAGGTTGCACCCCCCATGGCCTGTCCTGCAGTCCTGGGGC 

CCTTCTCTGAGGGGATGAAGGGGGCACTTCCCTGAACAGAGAGAAAGGGAGACCCACACC 

ACTTGGATTAAACTCCCTGGGCCCCAGCTGGCTCTGCAACTTGGCCAAGAAGCCCTATCT 

CCTAGGGCCTCAGTGTCTTAGTCCGTAAMTGGTTCCTGAACTTTTAAAATACAAACACT 

CTTTTGTAGAGTCGAAACACTTTGCAGACCTGCACTCTGTGGCAGTACCTCGGCCGCTCT 
AGAACTAGTG 

Sequence 346 

CCGGGCAGGTACTATCCCATTTTCCTTTGCCATGCACCTGTCTTCCTCTTCAGGATTGAT 
CACAAAATCAAAACCATTCCTACCAGCCAGAGGTGGGCACATGACCCAGGTCAGACCAGT 
CACAGTTCCTGATTCCCGGCCACAGTGCTTGATGGGGGAATGAGCATATATCAMCCAGG 
ACAGTCCAAGTCCTTCCGGGGACTTTGCTGCCACAGTTGACAGGGGGAGGTATTTGCATA 

ACTTCACCACACACAGCTAAGCTGGTGATCCTGCTGCTTGGCAGCATCTTCCCAGCGTAC 

CTCGGCCGCTCTAGAACTAGTGGA 

Sequence 347 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 
ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 
ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 
TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCG 

Sequence 348 

AGGTACCACCATGCCTGACTAATTTTTATTTTTTTAGAGATGGGGTCTTGCTATGTTGCC 

CAGGCTGGTCTTGAACTCCTGGGCTTAAGCAGTCCTCCCACCTCGGCCTCCCAAAGTGCT 

GGGATTACTAATAGGCATGAACCACAACACCCAGCCGGCATATCTGTATTTTGGTTGCAC 

GGAGGCTGCTGCTATAAACCGTGGGCACCAGTGCCCACGAGTCATACATAATTGCTGGCC 

CCCATGGCTGGAAGTATCTGAGGGAACCTCAGGCAAGGCCGTTTCTTTTCTGGAAGCTCC 

AAGTTCTGGGTCCTTCTTAATAAATCTTCTC 

Sequence 349 

ANGTGGNAGAATCACCTGAGCTTGGGAAGTTGAGGTGCAGTGAGCTGANATCGCACTGCT 

GCATTCCANCCAGGGTGATGGGAGTGAGACCGTGTCTTAMCAAATAATAATAATTTGGG 

GGTGATGTANACAGAAATMGGACAGAGGTATGAAAAGGAATAATTATTTAGCATTTTAC 

TAGCCAGAGATTTGGGCAGTCCCAAAAANCANCTTTTCTAATCTCCCTCTCCTTTCCCAA 

AAAACTTTMTCAGATCCNNTAGTATCCTATCATCTTTGGGmTTTACATACTACTTTT 

TGGAGGGGGGATTGNNGGAAAGCTATCGAGATCT 

Sequence 350 

AGGTACTATGTCCCTCTCAAGTCCTGAATCTAACTTAGGGAGTAGCCAGGAGACTATGAG 

AAGGAACTACTCCAGGGAAAGAACATCCCATGCCCTTTCTG/vAACACAAGTGGCAACCAT 

AAGACACCACTCTTTATCCTACCTTTTAGCAGACTATGCACAGTCCCAGGAACGAGGCCA 

GAACTGATAAGCAAGTGTGGCATTTGGCTCCAGCCAAGG7-ATGGGATTCGGGAACCTCC 

CCATCCTTGGAATTGGGCTGGGAGGGGTTTCACCTGGGAGCCACAGTTTTATGGCCTGGG 
GTTATATTGCAAGTCCAAAGACAA 

Sequence 351 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATAGTTGCTCATTGCCTAAAAAAAAAATGTT 

TGC CCGTG TGAGA CATCGAT GCATCAATAGTGCTAAT.ATAATGACAGCTTCAAGCCAC 

AGTATTTTTCCACATTTTTTTAGTTCTATGTTCTCTTAAAACTAGTCTACCTAGGCTGGG 

CGTGGTGGCTCACGCTTGTAATCCCAGTTCTCTGGGAGGCCAANATGGGCGGATCACCTG 

AGGCCAGGAGTTCAAGACCAGCCTCCCCAACATGGTGAAGCCCCATCTCTACCAAAAATA 

GAAAAATTAACCAGGCATGGTGGCGCATGCCTGCAGTCCCAGCTACCAGCAAGGCTGAAG 

CTGGATGATATCTTGAACCCGGGAGGCAGAGGATACAGTGAGCCGAGATTGCGCCACTGC 
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ACTCCAGCCTAGGAGACAGATGGAGACCCTGTCTCAAAAAAAAAAAAAAANGGG 
Sequence 352 

GG CGAA TTGGAGCTCCCCGCGGTGGCGGCCGANGTACATTATTCCTCCTAAGCTCCTGAT 

TTATTTTGCA CCTAAT TATTTCAGMCMGCTTGAGGATTTCTCCTCCTCTTCCAGAACC 

CTGTGACAGC i i I I IGACACTTGAAGGATATTTTCACACACTTGTAGGTCGGCAGTTCTG 

CTCTGAACATTGAGGCTGTGAAAAATCCTATCACTCACGGGTGGCCCTAAACTCTCTCTC 

TTCACAGAAGACTCATTTGMCTGAMTATTTCAAGAAATGAAATTTGACCAGAAAGGGT 

AAAATACCTGACAAAGTGGCAAATAAATGTGTCAGACTAATGGGTTCAGCACCCACATCC 

AATGACTCANGCAGCCACACATGCAATGACAAAGCCATAGAAGGAGAATCCTACTGCCTG 

TCAGCCACTGAGCCACAGGGTGGAACTATGCCAGTAGTACCTGCCCG 

Sequence 353 

CCGCGGTGGCGGCCGAGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTT 

TATCTGTCACTGAGCAAMGGTGMGTTCMTTAGGTCAGTTTTATCACTTCTTTTCCTA 

CACACAAACTATGAGGAGAGATCATTTCTTTTCmCT^ 

GACAGGGTCCCACTCTGTTGCCCAGGTTAGAGTACCTGCCCG 

Sequence 354 

CCGCGGTGGCGGCCCGAGGTACTACGCCAGATGCTGTTTACACATAGAGTTTTGAGCTGT 

TTCTCAAGGAGGGGCATTGGTGGCATTTGGGCAGAATGATTCTTTGATGGGCAGAAAGGT 

CCCACCCTTCGCAGAATGTCTATCACTAAACTGCAATCATTCTCGCCAATCTTTAGGGTG 

GATGGGAAGAGTAATGCCCAAATGAGAGCCCTCCCAGATGTTCAGAGCAGCGGATATGAA 

GTCATACAGCCGGGTTTGGTTTTTATTCCATCCATTCCTCTCCTGGGGACCTTAGACAAG 

TGATAATGCTCTATGCTTCAGTTTCCCCCCTTGTAAAATGGTGGTAATTCCCGCGTACCT 

GCCCGGGCGGC 

Sequence 355 

CCGCGGTGGCGGCCGAGGTACTACGCCAGATGCTGTTTACACATAGAGTTTTGAGCTGTT 

TCTCAAGGAGGGGCATTGGTGGCATTTGGGCAGMTGATTCTTTGATGGGCAGAAAGGTC 

CCACCCTTCGCAGAATGTCTATCACTAAACTGCAATCATTCTCGCCAATCTTTAGGGTGG 

ATGGGAAGAGTAATGCCC AAATG AGAGCCCTCCCAGATGTTCAGAGCAGCGGATATGAAG 

TCATACAGCCGGGTTTGGTTTTTATTCCATCCATTCCTCTCCTGGGGACCTTAGACAAGT 

GATAATGCTCTATGCTTCAGTTTCCCCCCTTGTAAAATGGTGGTAATTCCCGCGTACCTG 
CCCG 

Sequence 356 

CCGGGCAGGTACAGTGAAGCTTTGACCTGAGGAGCTCiTCTGTAGGTGAAGAGTGGTTAA 
CTATGGTTCCTGTGTGCCTGGGGCCCCCCACTCCTAGGACTCATCCACCCCCTACCCAGA 
ACTTACTAGTACCT 
Sequence 357 

ANGTTGGAAACTGACCTGGGGGCCTCANCTTGCTGTGACTGTCACTGCCCATGTGTTCTT 

CCCCATGCCTCACCTTCCTCCTNCAAGTGCGTGAAACATCAATGAACCTTGTGCTTTTGT 

CGTGTGATCTGTAACACCCCATCCTTTACCANACACCCCTGCATCCAACAACGCTATGAA 

CAGANTTCTNCANCAACCTGMTTCTTTGACAAAMGGTGGTTTATAATGTTTGTCTCAA 

AAGAAACATAAACATGTTGNCCAAATGAAAAATCAAATCTTTCNAGTNCTGAGACTCTGG 

CTGTGGAAATCGAAATANCATNTAATGAATAGCACCAAATTAATTNTAGCACCCATTCTT 

TACCTANAAACTGATTTATTTACAAAAAAATAAGCAN'iTTGGTAAAAAA 

Sequence 358 

GGAGCTCCCCGCGGTGGGCGGCCGCCCGGGCAGGTACACATCTGCTCCTGGGCAGACCTT 

ACAAGTATCTCAAACTCATATCTCCATGAGAGGTAGATAAGCTACGTAAACAAGGGAAGC 

CAAGATAACAGGGAAGACAGGAGACTATCTGCCTTCTAAAAACACTCAAATTCTTTTCTA 

TAAGCACTGCTGATTTAATCCATTACACCACCAGCAGCTGCCAGTTTACAATCTCTGCTA 

CACAAGCCCTCCATTGAGTTTAAAGACAACCGTCAAAATATCTCTCCAGTTTCTCCATTC 
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ACCTGACTGTTACCATAGAGAATACAGGTATTCCCACTCTGGCCAGCTGTGTGCAGTGTT 

TTCTCCTATCAGAATCCAAACCACCTACCAGTCCCCGTACCT 

Sequence 359 

ATTGGAGCTCCACCCGCGGTGGGCGGCCGAGGTACCGGANGGGATGCAGCAGTTTGCTCC 

CTGTTCTACTTGACCTTCCTTTAAATATTTGAAACCAGATATCAGCTCACTAATCCACTT 

TCCCAGCTTGCTCTTGTCTAGCTAMCTGTCCCAAGGTCCTTCAACGGTTCCTTGTTTAA 

GTCGCTTTTCAGGCCTTTCACTACCCGGGTTCACTCCAGCTGTGTAGCAACTCTCCCCAC 

AAAATGGAAATGCGATAGCTAGTACCTGCCCG 

Sequence 360 

CCGCGGTGGCGGCCCGAGGTACTTTCAMGACTTTAGAACCAAATGTCATTCAGCTGAAA 

CCATMTGTATTATAAACTACGTCTTCTAGAGTTTTGTTGTCATAGAGTCAAATGAGATG 

CCAAATGCTAGCAATCCTATTTAGGAGGCCTACTATTAGGATTTAAAAGAAGTGGACTGC 

CAGAGATGAATTATGTGACACAGATGCTAACATCTAGTTATTATGAGTGATCCATTCTGT 

CAT GA GGTAGCACTTGAAAAACAACCTGGGACAAAAGAACAGGCCTCTGGCACTACTGAA 

T7TTTGGAGA GAGAA ACCTTT AGAA GTTAGCATGTGGTTCTTTCTGGACATGTTCATTGC 

CTTAGTTCTTCTTTTGCAAATATTTTGTAAATATTCATGMGTG 

AAGGCAGATCTGCAGATCCCCTAAAGAGCATTGTCAGTTATGCTGGGCTGTTCCTACCCT 
ATCCCTCTGTTTACCTGCGGCGGCCTCTCAGAGCCGTGATCGTGGTGGCTTTGTGGNGGT 

Sequence 361 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGGCTGGATCCTTCCCAGCCAGGGTGCCTG 

GGATTATTGGCTTTCAGCAGCCGACTCACGGAGGCCTCCTTTGGGCTAGGCACTGGGACA 

GACCCCTGCCTCCCCAAAGGCATGAGACCCTGGCCTGCCATTTGGGAGCTCAGAGCCTGG 

CTAGAGGCAGGAGAGGTGAGGAGGGAAGATAAGGACACAGATGGTGAATGTGGTGAGAAA 

GGAAGCATGAGGGGGTGGGCATGGTGCATGGGAGAACAGAGGAGGCGATGGAGGAGTCCA 

GGAAGCCTTCCTGGGAGGCGGTCGCCTGAGCTGAGCCTCAAGGAGTATGAGCAGCTTCTC 

AACAGGAGAAGAGAGTGGAGAGCACTTTGGGCAGAAAGTTACA 

Sequence 362 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACTTAATAAATT 

ACTGTTATTATTAATAACTCACCATTATTAGTTCAAATGAATGAACTTATCCAAAGTTTC 

AAAGAGTCTACACAAAAATATTTGCTTTCATGTGCTTGAGTAAAGAATTTAGTATCTAAG 

AAGCATMCACCTTTMCGTTTAAAACAATGGTTAATATAATTTATAGGAAAM^ 

AATATTCTGTTAAAAACAGCAATAAAAGAAAGAAGATGC 

Sequence 363 

GATTGGAGCTCCACCGCGGTGGGCGGCCGAAGTACCTNTNTTTCNNGGCCATGGCAAAAA 

NAMTCCAATNATGNCCNNCTCGAGTCTGTGGTCTTGCTTCTTATGTAGTATTNCCTTTG 

TGAGCTGAANATTAATGCATGGATN NACCTGNTTCAGCACATTTN ATTN NAATTGCNAAN 

AAANNATTCNNGGCNCTGAATGNAAAATTGAACATGACATnTGACATTCCTTNTTCTNA 

CAGCTGGGTTGGNCTTANTTGCTGNGAGGCTNTAGACACCGACCATACAGGGCGTGGGGC 

TGCTCCTGGACATGAACATCCTCCGAAGTTCTCCCCAGTCCACTTTACCCGCTCTGGGCT 

CACCTCTGAATGTCCCCGCGTACCTGCCCG 

Sequence 364 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCACCTTGGTTGAGAACTCCTGACT 

AAAAGAAGGCAGGGCAGAGCAGGGTATGCCATTGATGTTGATTCCTAGCATCCAAGAACC 

ACATCTGGGCATTTGTCTGAAGGAAGAAATTGGGAAGTTTCCATGATAAAAGCTCCAGTG 

GAAATTGTAGACCTTGGGAATGAATATGCTAAACTGAGGTTCTCAACCAAGGGCCAGATC 

ACAACTTCAGAGGCCTGAGGCAGACGTGTATTTTGGAAAAGCTTCCCAGGTGATTCTGAT 

TCAGGTCCCTCCCTCAGTGAGTCATAAGACTGAACAGGAAGACAAGAAAAGTTGCCCCAT 

AATGATGGCTCATGGTTATGGCCTTATGATGGGGATATAGCTGAGGCAGCACACACCTTG 
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GAGCTGGGCTATANGCAGAGCTCTTATTTTAGATGGTGGCAGAAGAAACCACACACAATA 
TGMGGCAGAANAGCAMGGTCACTGAGAGGCCATGTTTTGCCCTTCANTGTTGCCAGGT 
CCTGCTGTGTGTCCT 
Sequence 365 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGGTTTACTCAGTCCTGTTGTGTAATAGGA 

AGCAGGAAAGGGCAAGGGCCCTGGGAACAACCTCAGCCCTGCCACTTACTGGTCATGTAA 

CCCCGAGCAAAATGACCACCTCTGAGCCCCAGTCTCCTCTTCTGTAAATGGGTATACCAC 

CACCTACTCATAGGCTGCGTGATGAGAAATGGAATAATGTCAGGGAGTGCCTGGCATGAC 

GCTGGCACCCTGGAGGCTCCCCAGGAACGGTGGCTACTGCCTTCAGAAAAACTGAATCTA 

CTTCAACTTATTTTACGTAGGCCAAAGCATCCAGTCAGTGAGTAAATCAAGCTGCAAGAT 

TCATTTCTACTTAACTTTTAGGAGGAGTCATCCCATGATACCTCGAATAGGTCTCAAAGC 

ACATGCATGATATAAATGATTCACTGATATGACTCTGGTCAGATGTATGTTCTTATGAAA 

AACCTAAGTTCANGGGTTTACCTMCATACATGGGGATACTCATCTTCCACCTNCCAAAG 

CTTTCAMTCAAGGCCAGAGACTTCCGGTCATCATCACTATTTTT 

Sequence 366 

GAGATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCGGGGTCACTGGACTGAAAGTTC 

CCTMGGCCAAAGGCCATTCCTCTTTGGCTTTCAGTAGTATGCCCTGGCATTTTCTGCAA 

ACTTACCACTTGATGGGTGCTAAMTATGTTTTAAACTGAGTATTGCACCCAACGGATGA 

AAATTTTCTAAAAGTGCTTCTATCAAGAATTAGGAAACCAAAGTATAGAAATAAATGAAG 

CTGATAAGCATAAGTTTGTGATGCTAAATAAAATTACACTGAAGTCCTAGCCACAGATTG 

GCMGGACAGTTTTGTGGGGGCAGCCCCACATGCCATAGTCCTCTGCAGGGAGCCATAAC 

TTAACATCCCCATGTTTAMTTTTAAATTATATACCCAGCCCCTTGTTTTGTCCATTACC 

AGTTGAGMGTGAGGTMTTCATTCTCCTTTCTACTGCTTCTGCCCATGGTGAGAACAGA 

ACTTCCATATACATCTTCTCTGCCCAGCTGGAGGGGGCTGATGGAGGACACAGCTTCTGT 

CATCCACGCCTCCCCTGGGAGTCTCTTTACTGATTGGTCTTCATTTGATCTACATTANGG 

CACATTTACTGATGGATGGAATAMCTGCTTTAACAGAATCATGGTCTNCAAGTTTCACT 

TCACCTTCTGTCCTCANGTGTGGCANGTGCCCTGGCCCGAACCTGCTATTCTNTTCNGCT 

TCCTGGTTCTTGGGGGCTTTTATCTACCGGTCNG 

Sequence 367 

AGGTACAATGGGCTTCATTATTGCCAACTTGGAATAAGAAGTAGCAGAGGCTCAAAATAG 

TAATGTGAATGCTCAATGTTACATAATAATTCAATGGCGAGGCCAAGATTGGGATTCCAA 

TTCTCACATTTCTAGCCACTGCTCTTCTTTTACACAATAAAGATGGGGAAAAAACATTTA 

AGCAAGTTTGACAGGAGAGTGCCCTCTCCACCAACAAAAATTATTGAGGCTCAAAAGTTC 

AGGGATTTGCAGAGACCATTCAAATGGGAAAGGGCAAGTCTCTTCAACATGGTGGTGGAA 

AAACTGGATATCCACATGCAAAAGAATGAAGTTGGACCCTTACCTTATACCATATACAAA 

AATTAATTCAAAATAGATTANAGATCTAAACAGAAGATCTAAAGCTATAAAACTCTTAAA 

ACACAGGAGAAMGCTTCAGTGACNTTGGATTTAGCAATGATTTGTGGATGACACCCAAN 

GCCAGGCCATTAAAGTAAAAATATGTTACTGGGCCCTCCAAMTTAAAANTTTGGGCCCT 

GGAAAAACCACCCGAATTTTAAAGCCCACTCCCTATNGGGGAAAAAANTTTGCCAATCNT 

I I I I I I GGTAAGGGGTTTNTTTCCCNAAA 

Sequence 368 

TGNAGCTCCACCGCGGTGGCGGCCCGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAA 

TCCAATTATGTCCGTCTTGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAG 

CTGAAGATTAATGCATGGATTCACCTCTTCAGCACATTTCATTTCAATTGTGAAGAAAAG 

ATTCCAGGCACTGMTGTAAAATTGMCATGACATTTTGACATTCCTTCTTCTGAGAGCT 

GGGTTGGTCTTAAGTTGCTGTGAGGCTCTAGACACCGACCATACAGGGCGTGGGGCTGCT 

CCTGGACATGAACATCCTCCGAAGCTCTCCCCAGTCCCCCCGCGTACCTGCCCG 

Sequence 369 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACATTTTCTAAAA 
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TCCAAAATTACCCTTAATAAATCTTGGCAAAACACACACTATAGGAGATAAAGACATCCC 

TAAAATTTCACTCTAATTTACTGAAAATAGAAGTCAAATTAAATTCAAATGTTGGCTTAC 

ATTAAAATAGCACAGGATCTTATAATTTACTTTTAATCTTGGAGGAATTCTCATTATTCT 

CTCATCAATTCTTCTTCAAAAAAATAAATTAATAAAAACTTCCTTGTTTGTCACATACAG 

TGTCCTAAACAGAAACATCTGATTTTTCTTTGGGTCAGCAAGAACAACTTAACTTCCAAA 

TTCCTTTATGGCCACTATTTTAGACACTAAGACCAAGAACTCCAAAGAAAGCTATAAAAA 

AAAATGCTACTATATGACTATATAGTAGCATATATATATATAGAAAAAAATATAGTATAT 

GATACCAAATATTATAGTTTACAAAACCACATTCATATACAACAGGGTTCMTTTTTTAA 

ATCACATCACGAGTGGAAAAGAAACCCAAACTATATATTCTANGGAAAAAAGCACCGGTC 

ACAGTGGCTTAAGCCTATAATCCCAGCACTTTGGGAGGGCCAGGCAGACAGATCCTTGAA 

GGCCAGGAAGTTTGANGGNCAGCCCGGGCCAACATTGGGTGGAAACCCCTCTCTTCnCAA 
AAATA 

Sequence 370 

AGGTACTACTGGCACTGAGCCAATGTATGCTATCAAGGAAAGCTTTATCTGTCACTGAGC 

AAAAGGTGAAGTTCAATTAGGTCAGTTTTATCAGTTCTTTTCCTACACACAAACTATGAG 

GAGAGATCATTTCTTTTCTTTCTTTTATTTATTTATTTTTTTTGAG 

TGTTGCCCAGGTTAGAGTACCTGCCCG 

Sequence 371 

ATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAG 

GTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATGG 

ACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCAGAGACAAGACTCTCTTGATCT 

GCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTG 

TCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAG 

GAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGA 

ACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCA 

GTGAGAAATTCCTTGAAATTTGCAAGT 

Sequence 372 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTTGGCACATCTGGAAAATGACGGAGTCTTTC 

AAAAATACTGCTGGGAAGATGACCTGTTTTCACTCTCTCTCTAGGCAGGTTCACATCACC 

AGATGATGCTGGATATGGCTCCACTCCAAGGAAAACAACCTTTTCTTGGTCTTTCAATGG 

CTGTTTATGAGTATACAACAAGTGCTTTCTATATACTGCAAACTCGTTAGACAGACAAGT 

CCCTCTGTCAATCCCACTCTACAGTGCATATGCTTCTGTGTGGTACCT 
Sequence 373 

CGCCCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGGGGGTAAAGTGGACT 

GGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTG 

TCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGT 

CATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGC 

TGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGC 

AAGACCACAGACTCAAGACGGACATAATTGGATTTTTTTTGCCATGGCCTGGAAAGAAAG 
GTACCT 

Sequence 374 

CGCNCGNGCANGNTCGCGNGATTNTNNCNGCCTCATNNTTNNGNAAGTNGCTAGNNNANG 

TGNNTCATGNGGTATAATGNATNAATACCNCCTTTNTTATNNTNACCTACTNNCTNACGA 

CNCCTCTNATTATGNCNTNACNGAGATCGNNNGCATANNNAAGNNNCGCCCCTNATTGCC 

NNNNNNNNGNACNNCCNANAAATNTANAAAGNACATGGGANAANTTATAACTAANTGNTN 

NNCTCACNTTCCTGAATNAAAGNTGACACNCNTGCANNNNAANAAAACACTTGNNNNNNN 

GTNNGACNNCCTTGACATTAATCTNTNATAGGATANGAAAACTCTTCATCTATNCATAAA 

GCNAACATGTCCTNGGC 

Sequence 375 
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GCNNATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTAGGCCCATGGTAGACAGTCAAC 

AAATATTTGTCCAACTACTTAAGAGAAGATAAATTATTTTTCTAAATACAACTTTA 

TGCTCTTGCTTTCATAAACAATACCTCCTAAGAGCAGAGATTAATATGAGTAGCTGAAGT 

CAATTTAATATAGTGGAGTTAACTGCTCTGACTGGATGTTATCTATCAATCAACAATCAT 

AAATTGAGCATCAACTATATATTCAGCATGAATATCCTCATCAATATCTATGTATGTATG 

TATGTATGTATGTATGTATGTATCTATCTATCTATCTATCTATCTATCTATCTATCTATC 

TAATCCATCCATCCATCCACACTCTGAAGATGCTATATATAGAGGCCATGGGAAATACAG 

AANTCTATCTATGGCAGGGCGCTGCCCTCAAGGAAATGGNGGGGGTCCAACAGTATTCAC 

TGNGGTNGAAGGGACCTGCCCGGGCGGGCCGGCCGGGCAGGTACTTNCCCAACCCCTAAC 

CCATCTTANCANAAAGTTCAAAGCTTTTTGTTGC^ 

ATTGTTNNTGANGGCGGTNTAAATGGATCCTAATATGCTCCTNNAGGANTCNTTTANAAC 
ATTCNTT GNTN TTCATANGGN CTTTATGTACTAAGGAAAAATCATTC CTTTNGGGAAA NC 
CAAACCCTTTTGGAANGGGCTGNTTCCCTTGGAAAA 
Sequence 376 

GCGAGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCCTCTACTTTCTGA 

TAAGCTTTAAAAGAAAAAMCTAAACTAMTATTTCAAAATCAATGGACTTCAAGTAATC 

CAAGAGMTTTGGAAAACATTTCTCTAAAAACTAATCCAGAACACMGGTGTCAGTTTGG 

AAATAAC G G AAGC ATATTTC G TTTTAAAAATCTAAAAC ATTTTAAAG AC AAAAATTC AAG 

TTAACATCCCAGTGATTAACTCTGGAACTGCCTAGCCAAAAACAACTTTCGGCACTTAAA 

ACATA AAACT GTCAGTTTCCTCTGCTTGGAGACAAAACATACAAAAAGCAAACATTCACT 

GTTATATTTTCCTAGACATGAAAGATAGAAC 

Sequence 377 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGTAAGAAG 

CCCATTTGGTTTGTGTCTGGGAGGATGGGCAGTGAGGATGAGTGAGGATGAGTTTAATTT 

AGCTTGGTGAGAGGGGGACAAATCCACCTTTTCACGAGAGCTGTTTCTCATCGCTTTTTG 

TCCCCCTTCATCCACAACCCCCGCAGCCAAGGCAGTGTCAGTAAATTTTGGGATGGAAAA 

GCCTTAGGTAGGATTCTTATGTTTTCGAAATGTGACTGATTGGAATTGGGGGAAAGCTGT 

ATTAAMTTGGATATGTGGTAAAAGCTGGATATGTGGTTATTACCCATATCCAGTAAGTG 

AGTCATGTTCCTGTGCTACCATGATCTGCCCGCAGTAACCTAGCAAGACATCAAATCACA 

GTGTGCAGTAGATGGTACCT 

Sequence 378 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCGGGAGGCTGAA 

GGAGGAGAATTGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCAGAGATCATCGCACCA 

CTGCACTCCAGTCTGGGCAATAAGAGTGAAACTGTCTCAAAAAAAAAAAAAGGAAAAATT 

AATTACACCTGAGGGAAATTCCCATGCAACATTTAATGAATGCCTTCAGTGCTGTTATAA 

TGCATTAATTTCTATGAGAAATGTTCTCTCTTTTTTGTGCCTTTTAGGAAAATT 

TGGTGATAGAAGATAAAAACTTGGCTAATTAGGCCAGGCATGGTGGGTCATGCATGTAAT 

GATATGGGAGGCTGGCATATCACTTGAGGCCAGGAGTTCGAGACTAGCCTGTCCAACACA 

GTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGATGTGGTGGTGGATGCCTGT 

AATCCCAGCTACCTGGGAGGCTGAGGCAAGAAGAATCGCTTGAATCCAGGAGGTGGAGGT 

TGAAGTGAGCCGAAATCACGTCACTTGTACCTGCCCG 

Sequence 379 

TAATACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGAAC 

TTAAACACCACTATTTGTTGAGATGAAAAAAAGCATATATAGGAAGCCTTCAAAATGAAA 

TGGTCAAGGGTGAGTTTACACAGATAGATAGATTTAGGTCTCTTCCTTTTGCTTTGTGAA 

AGCATCTAGTGTTTAGGTGTCAGAGAGGGAGATATCCTTACAAAGCAGAGATTATCATTA 

CAGGTTTACATTTCTTACAAAGAGTTTCAAAATAAACAGGTAAATGCCAAAAACATATAT 

TTTGGAGACGGATTAATTCACTAGTTGGTCTATTCAACTTAACTTGTTTCCTAATGAGAT 

TAAATTCATGCACAAATAACCAAACCAAAAATTAAACCAAAAGAATACTCACCAGAAAGG 
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AAACCAAGAGAGGACTTATCTCGCAGCATAAAGACAACTTGTACCT 
Sequence 380 

CGAGGTACAGCTTGGGCCTCCAGCTTTGGGCACTAGAATTGGACCCCTGCACCCCCATCA 

^?5I^^ GATATATGGTCT G T CTGATTCCTGTGGTTGGGATGAGGAAATGAGTCCT 
G^GGTTCTGGAGAGAGAGAGAGGCAGGTCCCATACTGTTTGCCTGGmTAACAATGGG 
TTTGCCCTTGCCTTCCTGTGCTGGGACATTGATGCAATGTCAGTTGCTCTCATCTTTGCC 
Ii? GGAmCTACAGGGmGGCMCT GTCCCTAGATCCCTGTCAGAGCCAGTGCTTCTG 

C rr GC I?rI?? GGG ^ 

AGTCCCCTCCCCTCCCCCAAG GCTGAGTGTAAGACCCAAACCACTGAGCTTT 

CCCTGGCCCAACCCATTGCGTGGGACACTTAAGTACCTGCCCGGGCGGCCGCTCT 
Sequence 381 

^II GG ^ GC ™ CGCGGTGGC ^^^ 
AAA ^J GG ^^ 

TGGGAGACTTTTGCCTTTTGCCCTGCTTACAAACTAGCATCATATTCTAAAATCTCAGGA 

ACTAAAATTACTCATTGTAATTTCTCTAAAATTCAAAGCCTATTGGAATCCCTGCTCTTG 

TTATGTAAMTATAAATATTCGAGACATTTCACACCACATnTCATTATTCAGCATTCCC 

ATCAAGTGCTGGAAGGCCAGTTCGCTATTTTATAATGAACGTNCTGGGTTTGTTCTGAAA 
GGGCANCCTTGTACCT 

Sequence 382 

CCGGGCAGGTACTGTGCTCAGAAATGTGACCAGCAATGTAAACCGACAGAACCAATGGCT 
TTTTTGAAAATAACAAAGTATTTCCGATCCTTTTTTAAGTAGAGAATTTTCTCCCAGCAC 
ATCCTATTTCCAATTGTGCGCAAAAAACCTTCCAGCTTACAAATGTTAATTCAGGGGTTG 
GCAAATAAAGGCCCAGAGGCCAAATCTGGCCCACCACCTGCTCAGTCTGCAAGCTAAGGA 
TGi I I ICTTACATTTTTAMTGGTTGGGGAGAAATAGAAAGAGTATTTTTGAATATGTGA 

^ATTCACGTA^TACCT TG ^ CAGTGTCMTC 
Sequence 383 

GCAGGAN.CCCGGGAATTACCACCATTTTACAAGGGGGGAAANTTNNNTCATAGAGCATTA 

TCACTTGTNTAAGGTCCCCAGGAGAGGAATGGATGGAATAAAAACCAAACCCGTCTGTAT 

GACTTATATCCGCTGCTCTGAACATCTGGGAGGGCTCTCATTTGGGCATTACTCTTCCCA 

TCCACCCTAAAGATTGGCGAGAATGATTGCAGTTTAGTGGTAGACATTCTGCGAAGGGTG 

GGACCTTTCNGCCCATCAAAGAATCATTCTGCCCAAATGCCACCAATGCCCCTCCTTGAG 

AAACAGCTCAAAACTCTATGTGTAAACAGCATCTGGCGTAGTACCTCGGCCGCTCTAGAA 
CTAGTG 

Sequence 384 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAACTTTCTAAACATCCTGTTTTCCCTCA 

AGACAGGGCACTGTTCTTAATAAACTGAACATTCTTTTGCACTGGCCCTGGGGCAAGGGT 

GTCCCAAAAGCTTTGACAGGGGTGCCATTTGGCATAAACAGTGTGGTAATAGGGTGGAGT 

CAGAAAGAAATTCTCTCATCCCTTTTACAACTCCAGGCAATTCTAGAACCAACCCTTGGG 

AACCCAGGGAAAGAACTCTGAAATGATACGCTGGCTGAGGTTNGCCCGTGGCCACACCTC 
AGGGTTAAAGCN 

Sequence 385 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGATCCTCTTCCCAAACCAATCAAAGGCCC 

ATTTATTGAGAGGTAACCAGATAAAATAAAATGGTTGTAAACCAAACACTCCCTGGTGGT 

TCCTCGTAGGAAGCCATGCTTCCCCAGCTTCAAGCCTTTCTATGAATTCTATCCAAAACA 

GGGTTGGATCANATTCTTACTGAGATCCTACTTGACTTAATCCTTGGCTTGGGGTTACTA 

GGGAATTMGACATATTCAMGGAAAAGAAAAGGAAAAAATAAGCAGCTGGAGTTTGTGT 
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GCTTGGGTATGCTTCTATGTATGTATACACAATAGTTTMGCCTGGCTMGACTTTTATG 
A 

Sequence 386 

ATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCCTGATCATATCANGGGCCCCATTTTA 

AAAG GAGA CATTCTGATGAATCAAAGGAAAGATATNCTAGGAGCANAAAMTGTTTTGAA 

AGGATTTTGGAGATGGATCCAAGCAATGTGCMGGAAAACACAATC7TTGTGTTGTTTAT 

TTTGAAGAAAAAGACTTATTAAAAGCTGAAAGATGCCTTCTTGAAACACTGGCATTANCA 

C C AC AT GAAGAATATATTCAGCGCCATTTGAATATAGTCAGGGATAAGATTTCCTCN TCT 

AGTTTTATAGAGCCAATATTCCCAACCAGTAAGATTTCAAGTATGGMGGAAAGAAAATT 

CCAACTGAAAGTGTAAAAGAAATTAGAGGTGAATCCANACAAACACAAATAGTAAAAACA 

AGTGATAATAAAAGTCAGTCTAAATCCAACAAACAATTAGGAAAAAATG 

Sequence 387 

CCGGGCAGGTACTTTGAAACATTCCTGGCAGGCAATGTGTCCAAGAGGGAGAATCTGAGA 

CCCACTTTCCTGAGGTCATCCAATCCCCTCCCATCTTATCAGGGGTATGGTATGGCAGCT 

GTGGCAGAGGGATGGAGGAGAGGAGACACCANAGGGAAAAAAAAAACNCAAATCCTGTTC 

ACCCAGGAGGTCAACATTGGANGGAATNCAGAGGAAAAAGCNCCAGGAGGGTGTACCAGG 

CNTACCCCNANATNAAAAGGGGTTTGGNCAAATTTGGATGTGCAAACAANTCAACCCCAC 

CATGACNTGAAGGAGGCCACCGTNTGAACACAGGGCCTGGAACAATGGGTGTAATGAGTT 

ACTGCCACGTCAAAAAAATTGGTTGCTCCTGTTGATATGGTTTGACTGTGTCCCCCCCAA 

ATCTNATCTTGAATTGNCCT 

Sequence 388 

NATTGGAGCTCCCCCGCGGTGGGCGGCCGCCCGGGCAGGTACACTTGTGAATGTTATGGA 

GCTGTTGATAATGACGGAGAAGTGGCCCCACTACAATTACATCAGGGCCATGGCTTTTGG 

AATATCGGAAGTCACAGAACTGACAATTATAGCTCGTCCATATTATCCTCGGCTCCCTTG 

CTGGAGAAGTCCTTCTTTTTAGCCCCACTCGAGCTCTTGTCTGTCTTGGTCATTGTCTCT 

CCTTCTGAACTTTTGGCTAGATCATTCTGATTAATGACAGAGCCCCTGGAAAGCTTATCA 

TTTAACTCTGGATTAAGGCCGCCTGACTGCACTGCTCCGTGCTGCTTGCCATAATGTTCT 

AGCAGTTTAAGTGAGCTAGATGACTCACAGCTGAAACTACAAAATTTACACCAGTAGTAA 

CTGGTGGCCTCTGTACCT 

Sequence 389 

CCGCGGTGGCGGCCGAGGTAGAAAATACTTTCATCMGMCCAAACGTTTTATTTCCTAT 

TTAGCTGAAGAATTAATACTTTTACAGAAGCAATGAGTGAGCACAAAAAGCTTATTTTCC 

CATTCTCCATGTTCTCTCCTGAAACAATGATCCAAGTGTATTTAGAACAAAGAATCCTAC 

CTTCAAATGAGTGTAGACCCTCTACAGGGGTATCTACCACGTTCCTTCATGTCCCCAAAA 

TAGCTCATAGACCTTGCTAGAAGAGAGTGATGTGATTTTCCTTACCTACCACCAAAATGA 

AAGAGAGAGAGACAGAGAGAGAAAAAAAAAAAAACCCTCCNNANCAAAAAACCCCCNTAA 

ATCCGCNTTNGGTGGNGGGCCCCCNCCCCCNCNTAANTAAANCGGGNNCNNNGGGGCCCC 

C AANCNGAN CNGNNGTNNAAACCNNANTTCAGTTNTTNGGNGGGCAAATTCAAAAGGGNA 

G i i I I I l l l l NAAAATNTCATTTCAATAANANAAAAAACGGTTCNNTACGGTCCCAAACA 

NAACCNTGGGGGAAAGNGGTTTTCCAAAAGGCTTCCTCCCCAAAANTNCTGCCAAACCTG 

GGGCGGGAAAAAGGGMNCTNGGANAGNGAAAANCCNGAMTACTTTTTGACCTMTTTT 

i I I I I 1 1 1 IA 

Sequence 390 

CCGCGGTGGCGGCCGAGGTACCAAAAGAATTCTGTGTTTTAAAACACCATCCAAAAGCAC 

AGCCCAGTGGTGGGCCACCTCCCCACCCTGAAGTTCTGCGTTACATATAACCTTTCAGGC 

ACAAACAAGAAGGGTGTGGGATAGGGTGGAGGAGGCTAGAGATTTGTCTGCAAGGGCTTT 

ACCATCCACTAA AAGC TGTGTGCAGGACAGAAGCATCTTTCTCTCCTCACTTTTTGCACG 

TGTTGAGTATCATTTTGCTGTGTGCCCAGTGGATGTAGCGAGGATGCAGAATCTCCAAGC 

AGCTTAGAGTCAAGAAGGCTGCTTGTCTTCCTTTCTGATTCTTCATGTAGTTCTTCCTGT 
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GTTTCmGCATTCTCMGTCATCmGGATTTTACTCTGCTGTTTATTCCCATATATGC 

AGANTCAGATTACCAGAAGTCTTGNMGACTCTTTTCAGAATATGAAAAGACCTTTNGGT 

AGCAAAAGAAGTGAATTCTCTAAGGGCCGAGGTTGATGTTTTGGAGANTAGTCACCTTNT 

AAAAATGAGGATGGGCCTTTTGNGGACCACNATGAAAGGGGTCCTTTAANCTTACCAAAA 

TCCAGAACTTTAATAGGGTGGAAAGGCTTACCATTNTCTNGAAAGGTAGACNTTTCCTTT 

AATACCAATTNA7TTTTTTTCAGAACTNTTATTAC7TTTAAATTGA 

Sequence 391 

CCGGGCAGGTACGCGGGTAGGGATGCAGAACTGGCCACAGTCAGGGCTGGCCAACAAGGG 

ATGGGATGGCAAGGGAACCAGTGGGAGACTGTTTACACCnTGACTTCCTCTTCACAGGT 

°^ GGGAGGACTGTAGCAm ^^^ 

CTTCATCTGGACCTGAGGCTCTGGGCAAGTCAGTACCT 

Sequence 392 

? GG ^^ GGGGTCAAGAC ^ CAGCA ^ CmGCGGGGGACC ^ 

o^T- GCAAGCCTCAATCACCTGTTCCAGTG AGAAACAGGACTGAGATCAGAAGGAATGA 

GGCAGAGCTGCCCTCGATTGGTTCTCCTCTCCTCCTCACCTGCCCG 
Sequence 393 

o^ G Z ACAmAC ^ GC ^ GATCTCTGATOCCCTCACTGGG T^ 
G ^ GGGG ^ GG ^ GAGGG ^ GGA ^ GAG ^CTTGACACCAGGCCCTGAGCCTTTCCTCTGTGTGGT 

GTGCCCTCCCCATTCCCTCTGCTGAGGCTCGAACCAGCATGTCAGGCTGCTCCCCCATGT 

GGACACCTGACTCTCTTTGGGCTCTTACTCTACACTGGCTGTACCTGCCCG 
Sequence 394 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCACAATTTAAAAAAACGG 

GGAAAAATAGACATCTGACAAAAATAAAAATAAAAGACCAAGAACACATGAGCTGGAATG 

ATTAATATGTTAAACCATGTGCCAAACTCAATGGTGAATGTGAATGTGTATTTTAATCCC 

CTGAGCTAGCCAGAATMGAAMCTTTTCTAAACACATGGACAACTTCTATTTTAAAAGA 

TTCAGGGGAAGTCTAAAAAAGATATGAAAGTTTAAAAGATATAATTTTTTTTTAAAAAAA 

TCAATAGAATCATCATAGTTGATCAGAGTAGGCATCTGACCCTTAGAGAAAAAAAACAAA 

ACAAAAATGTCAACTCTAGAGTTGGAGTAAGCCCTGACAGTCCCCTTCCTGTATTTAGGT 

ANATAAACCACTATCCCCACGCCATGAATGAAGCAGAAGCGCTCANAGTGGACTGCACCA 

TCCAAGACTGGGAGACACTGGGTNGAACAGCGAGAAA rGCACCA 

Sequence 395 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATAAATCAGGGGTCTTGAGACCATCTTTCTT 

CCCCTGGACTGAAGGAACTCAAAGCTATnTCTCCTGGGATTCACAGGAACAAGCTCTGA 

ACAGTTGGGTGCCATTGTGGTCTGTAGACTTTTACAGAAAAAGCTCTACCAGACTGTTGC 

CTTGAAACTGTTAATCTGTTTGTCATTAGTCATTCCACCTGGCTGGTGGTTTTAAAACAC 

TAAATAGTCGTTGGAAAGTTAGGTAGATTTCATTAAAGCAAGACTTGAAAAATAGTTGCT 

TCAGGATCCCCAGGAGGCTTTTGCACTTGGGGTTGGGGTGAGGAATGGATATTCATAAGT 

TCAGTGTTGATATTCTGCTGCTTTTGGGGAGGGGTAGGGAGAGAGAGGGAGCATTAAGCT 

CTCCAGACTCCTGGTGTTTGGGCAGGTCTAGGTCTGGTTTGGGCAG7TGGTTGTAATCTG 

AGTGCCTGGACTTGCCCGGTAAGTTTTGCTGANAGAGAANCCAAATGTTCCTTANACTGG 

AAGANGAAAAGCTGCGTTTCAGGGTCAATTTCATAGGTGCATTTGNTGNTTCTNATCGCT 
ATGGGTTAAAAA 

Sequence 396 

AGGTACGCGGGACTAAI I I I I I I I I I CTTCTACTTAGAATAAAATTCATGTAGTGCAGTA 

TCCAGAGCACATTAACTGAAGTATTTGGAGACTAGATTTAATCTCAGATCTCACCCTTGG 

TAAAATCATTAAGCTTCTGAGCATCAGAGTCTCAAACTTTTCATGAGAGGGGTAAGTGTA 

G JT^t A ^^ GGAGAGAAGGAGGAAGA ^ A GAGTTTCTTGAGCTTCTCCGATGTCGTAGGT 
ACCTGCCCG 



Page 64 of 869 



WO 01/51628 



Table 2 



PCT/US01/00798 



Sequence 397 

CGCCCGGGC AGGTACT I rTTTTTTTTTTTTTTI rGTAGAGACGGGGTTTCACCATCTTGG 
CCAGACTGGTCTTGAACTCTTGACCTCGTGATCCACCCTCCTCGGCCTCCCAAAGTGCTG 
GGATTACAGGCGTGAGCCACATAACAGCAGTTCTCACCAAGCATCTGACAGCCGAAAAAC 
TGAAAGCATTAGAGMTGCTACAGGACAGGCTTTGGCTTAGTAGTGAGTTCTGAGTTCAA 
GCTGAGGC T CTGCACTTATTAGCTAGMCTTTGCTTAATCTCTCACAGGTCTCCTCTAAA 
TGATTAAATGGCGTAACGGAGG TAAGCGG ATTAGCACAATTC CCGGCAAAC AGTAAG CAC 
TCAATTGTTACTACTGAATTGCCAGTTCACAGGCACAACCAAC 
. Sequence 398 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGGCTAGCCTTCCTGATGACTGCAGCAGCAA 

GGCTGCCCATGGGCCACCCCAGACAGCCTGCTTAGAATCTCTAGAAGAGCTGCTTGATGT 

AAGACATTAAGACATTCrTAMCGAAGCCAGTCTTCAAGGACTAGAATAAATCCTTAATT 

CTTCCAATACATAGACATTAACGTAAAGCAACAAGAAACAAGGAAAGTCAAATAGACATA 

GCACTCCCCAAAGCATATAATAATCTCTAAGTTGATCCAAAGATACAAACTGCCTGACAA 

ATAATTCAAAATAACTCA7TAAAAAATGCTCACTAAACTTCCATAGGATACAGGGAAACA 

ATTCAATGGAATCAGGAAAACAATAMTTACCAAAAAGAGATATTTACCAAATTACCAAA 

ATTACCAAATTACAAAATTACCAAAAGAGATATTTMCAGACAGATTGAAATTTTAAAAC 

AAACAAACAACAACAACAACMCAAAAATCAAACAGAATTTCTGGGGCAGAAAATTAGAG 

TGGAACGAAATAAAAMCCACCAGMGAAAATATCAGTAACAGAATTTGATCAAGTAGNA 

AAAAAGGATTTATTCATGAATTTGGAAGACAGTTATTTGAAAAATTTNCCGTCCGGM 
Sequence 399 

CG CCCGGGCAG GTACTAAAAATATAGCAATGTGGGTGGGCTCC AAGG ACTGTTCCTTCAG 

ACCTCAAATCCACATGCTTATAGAAGACCACATAGGTCAGAGCAAACTCTCTACACCTGT 

CTTTGTCAACTGTCACCCAGTTGAAGATCTCTGAAGCTGCATGCCGATTTAGGCCAACAA 

TCAGACTATGTAGCATTTACATTTACATTTGTCTGTGTGTGATAACATGAAGCCAGAAGG 

AAGGAAMTTTGTGGCTGCTGGTTTCCCTGACAAGCAGTTTGAAGGTCAAAAGAAACCAA 

CCCTGGAGGAACTCGAGTTATGACAGAGCAATGTTAAGTGTAGACCTAGGAGGGAAGCCT 
ATAGGGCTAGG 

Sequence 400 

AGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCAC 

GCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAG 

AAGGAATGTCAAAATGTCATGTTCMTTTTACATTCAGTGCCTGGMTCTTTTCTTCACA 

ATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGA 

AATACTACATAAGAAGCANGACCACANACTCAAGACGGACATAATTGGA I HUM I GCC 

CTGGCCTGNAAAGAAAGGTACCTTGCCCG 

Sequence 401 

CGCGGTGGCGGCCCGCCCGGGCAGGTACTGTTGATGAGCACTCCCTACCCACACTCCTTT 

GCTGCTCTCCCTCTCTCCTCCACTCTCCTGGCACTGACATGGGCTAGTCTGCTCTCTCTC 

ATTGGCACAGCAAAATCTATTTGATTCGTGCCACCACCTCTTTTACCTCATTCTTTCAGT 

GGCAGGCAGATACCT 

Sequence 402 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGGCAACTGAGAATCAGAATCCACCCATGGTA 

ACAGCAAAAGCAGCAACATCTAAACGTCGCTGCTAACGTTTACCTCTGATGAAGTTGTCA 

AAGTCATCCTGCCAGAAACCCCAAAAGTTTGAATTACTAATGTGGCGTAACTCACACATG 

AAGCAAAATGGGAGAAAATATCAGTTATTTGAGCATGGAAACTGCTGTATTTGTATATAC 

TCAGCTCMCCAATGCtTTCTGATAATCAGAGGAGGTGAGAAGTAACTTAGAGAAGCACA 

AGTTCAAGCCAACACACTGAACCTTTATCACACCTGCCAGAAAGCATTAGGTAGATACAT 

GTAAGGGTGACTGTGGAGCTCATGGACAGGAACTCCTCAATCTGCGGAAACCCATCAAGA 

CGAGAGAAAGACCACAGACGTATTTGAGGGTTGGGAAAATCCAAGAGGAAAGCCACCGGA 
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CAGAAGAAAAGAAGAAATGGAAATAAAAGTTTCTGGATTTCCTGGTGCTGTTCAAGTTCA 
TTAAAATGG ACTG GGTCTC ATCG G NCTTTTATTTTTTAAAAGG 
Sequence 403 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGATATATTCAAGGTATACCGATGTGATG 

CTAGGTGGGGATGTAGGGTTTGGGAGAGAGGAGACAGCATTGCTTAGATCAGTCACGTTC 

AGAGACCTCTTTACATAATCTTTACTTCCATCACAGGTGCAGATAGGTGGCTTCAGTTCA 

TATCTCTTCAGGAATACACACACAAACTGGTGGAGAAGAGCAAAGGCATGCTGAGCTGGA 

GCCCAGAGCAAGGCCAGACTCACTGTGCAGCTCTTTTCTCCTTTACTCAGAGAGGTCTAA 

GAGAGGTGTGGCTCTTTCATG ATTGG GCAG AATGATAAATGTTCAG ATGCC ATGG ACTTT 

CAGTATGGTTCACAAAGAGTGGAAACCACAAATGATAATCTGTGAA7TCCAAGCTACTGT 

ATTATATGGATATATGAATATAGTTTATGMTATATATTAAACAAAAATAGGGATCATAC 

AATGATGATGTT TTACAGCTT GCTCTTTCTCAATTAACAGTATCTTTCCATGTCAGCACA 

TACAGACCTAACI I I I I I I I I AAGTATTCCA7TATATAGATATTCCACAACCTTCTAAAC 

TA TGCAG NTNACCTGTGAACAACCTGGATTTGNACTGCATGGGGTCCACTTTACACATTG 

G N TTTTTAAAAATAAATATATTTGG AAAAT r 

Sequence 404 

AGCTCCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACACGGGGGGCTGTTGCCATGGATAC 

GCTTTGTGTAGCGGCTATGGGCGCTGTCTTACAACAAAGCCAAGGAATCTCGCTGCTGAG 

GGCAGTTCTGTGCTTTATTATGAAGAATAATGGACGATGATGATGCAAAGCTCAAAGCAG 

AAATAGMGCTGAATTGGATAAACTCANCATTTCCTCCTTGGAAAAAGAAGACAATGAGA 

GTGATGCAAAATCAGAAACCCAGAGTGATGATAGTGATACAGATTCAGTTGAATTACCAG 

AATCAGTTCTTCACTGTATTAACATCATAAAGAACGGGAGTAAAGCTGTTGAAGAGCTCA 

TTCTTCAGGAC CTGGAAG ATCTGATATTTTAAAG 

Sequence 405 

TATACGACTCACTATAGGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAAGG 

TCTGAGGAGAAGGTTGAGGTCTCCCCAAGACTTCTCTAGAACACTTGGTGAGAGGCGCCA 

TGAGGACAGGTGATGCTTGGGCTTATAGCTCTGACTCANGACATCACTGCTCAACAGCCT 

GAAGCCCACATCCAAAGCTCATGTCATGCTGGCAATTGAAGGGGTAGCAGAGCCCAAAAG 

TCCAGAGGGGCTCATCACAGGGAGCAGATGTCTGTNAGCAGGAGAGACAGCGGCTGTAGG 

GATCGATGAGACTTGGTAAGCATGATGTGCTCTTGTCAGCATAGCATCTTTGAGCTCACA 

CAATAATTGATGTATAATTGTCCATTTCAGTAAGGAGTTTCCACATCCACTATCTTCCAC 

GATTCTCACAAGGGCCTTGAGTAAATATTACCTGCATGTTATGGGTGAGAAAGGGGTAGT 

TTAAAAAGCTGAAAGGATTTGGTGAANGCCACATAATGAGTTAACCTGAAATCAAGGCCA 
GACCCCCAGG 

Sequence 406 

CCGCGGTGGCGGCCGAGGTACAGATCAATAAGCAATCTGCACTACTTCGCAGAGCATTGG 

GTATTTATGAGCATGGTGAGGAGGTTATATAGGATTTGGGAAGGAGGGACTCCAAACCAC 

AAAGGTAAATGGTT GGGC CTGCACTGACATTAAGCTTGCTGTGGTTTTTCTGCTGATGGG 

ACCCATGAATGTAGTTTTCCTCCTTTGGTGGTTTTAATGCTGAATGTGTAAAACCCTAGG 

CAGATTGTGTAACATCCCCCAGGCCCAGCTGCTGGTCCTGCCCATTAAAGGGGAACTTCA 

AGGCCCGATTCCTTGGCTGGCAGCTGCACTGTGGAGGGCATCCCAGGAGAAGTTGTTGGG 

AAAGGGT GGTGTT GCTTTGTMGTCGGTTCAAGTTTTTGTAAAGCAAGTCAGGAAGCCCC 

AGTGACC I mm I GAGCTTATACTGGAGCAATTCCAGTTCCTTTTTTTCTTCCAACTTAC 

CTGTGAAAAGGC 

Sequence 407 

CCCGGGCAGGTACATGTTATCTGCCCTTGGCTGTATGNAAAAGCTCAANTTANCAAACAA 
GGAGAGAGATAAAAAGGAGTAAAATATTTTGTTTGGATAAAGCTCTGTCACTGAATGTCT 
TTGAACCCAAAGAAGACACCAGAAAGTCTCCAAACCTCTAATTCATCTGTAAACTGGGAA 
CTCATACTAAGTCATGCTGACTGTAGGAATAAAAAGATGAAGAGGAAGGGGGGGCTCACT 
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AGCAATATCAGGCACTACCCAGAATTAGTGCCAGTTGATCCCTGGTCAGGGTGCATAGGC 
TTGGAGAAGAGTAAATTTGCAGGGCATAATGGACCATGCTCTGAGATGCAATTTTCTTAA 
ACTTAATTGGCTCTGGATGAGTGCATTCCAATGTGACGGAGGAGGTATTTGCTTCCTGGG 
AAGGTTTTCTTTCACMCAGGCAAGAAAAATTGCTCTTTAAMTATACCTCTTCGGGGTT 

G AAAAGACAAAATCCTCATGG CTTAAAATTCTAAATG GGCTAGCC ATC ATC AAAAGTACC 
T 

Sequence 408 

TCCGGCGAATTGGATTTTTACCNGCGGTGNCCGGCCGCCGGGCAGGTACAAATCNTNATG 
. C ATATTTGAC ACA l Mil iGGCmTNTNTTGGGGACCAGCAGACCACCCCGCAGATACA 
ATGC ACGGAA TCTAGCAAACAATAAAGAGCAGTGTGCTCGCGTGGCAGATCGTTCGCTTC 
CTTCCTTTTTCGCCTTCATTTTTC 

ACATAGCAATATAAAATGCCAGAGGATGTTCAAATGGGCGTGTGGAGGAAGAGGGAAAGC 

AGGAATTCTGGGCCAAAGAGCAGATGGCCAGCCAGTGACCTANGAAGGGAAGCTCANGCT 

TTTAAACAATGGCA 

Sequence 409 

CGCGGTGGCGGCCCGACGTACATGTGACGACGTCCAGGATAGTTTTGAAATGAGAAATGA 

GAGCAGCATTTCCTTGGTGTCGTGAAAATATATTATACATTTATTAAATATTGAAGAAGT 

GTGGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAAGCTGAGGCTGGT 

GGATCACCTGAGGTCAGGAGTTCNAGACCAGCCTGGCCAACATAGTGAAACCCCATCTCT 

ACTAAAAATACAAAATTAGCCGGGTGTGGTGGTGCATGCCTGTAATCCCAGCTACTCGGG 

AGGCTGAGGCANGAGAACCGCTTGAACCCANGANGCAGAAGTTGCAGTGAGCCAAGATCG 

CGCCATTGCACTCCANGCTTGGGCAAA 

Sequence 410 

AGGGCGAATTGGAGTTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTCAGGATTCT 

CCGAATGAAGTTAAGGTCCCTTGCCCATCCGGAGAGGCCTCAGCTGTGTCGGTATGATCT 

TGTCCTGATGGAAAACGTTGAAACAGTCTTCCAACCTATTCT7TGCCCAGAGTTGCAGTT 

GTAACTGTTGCCAGGACACATGGCCATAAATAATAGAAAGAAAGCTACAACCACAGGCTG 

TTTGAAAGCTTCACCTCACCTTTCTGCAAGGCAGAAAAAGTATGAAAAAAACCAAGGCTT 

TTTTCAGTAGCGTCCTATGGATGTCACATTGTACCT 

Sequence 41 1 

CCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAACTTAAACACCACT 

ATTTGTTGAGATGAAAAAAAGCATATATAGGAAGCCTTCAGAATGAAATGGTCAAGGGTG 

AGTTTACACAGATAGATAGATTTAGGTCTCTTCCTTTTGCTTTGTGAAAGCATCTAGTGT 

TTTAGGTGTCAGAGAGGGAGATATCCTTACAMGCAGAGATTATCATTACAGGTTTACAT 

TTCTTACAAAGAGTTTCAAAATAAACAGGTAAATGCCAAAAACGTATATTTTGGAGACGG 

ATTAATTCACTAGTTGGTCTATTCAACTTAACTTGTTTCCTAATGAGATTAMTTCATGC 

ACAAATAACCAAACCAAAAATTAAACCAAAAGAATACTCACCAGAAAGGATGTCCTTTAC 

AAGAGCAGATCCCCCAAAATGTAAGAGTTCACTGAAAAGGTGGGAGCTCAAACCAAGAGA 

GGACTTATCTCGCAGCATAAAGACAACTTGTACCT 

Sequence 412 

GGTGGCGGCCCGCCCGGGCAGGTTCCAGTTGAGAAGCTGAATGTTGGTGTTGTGNTTNTG 

ATAAACTCCCAACATGAAAACTGAGGACCACCAGCGTTTTCAGCTCACCTTTGACTCACG 

CTGGCAGCTCCAGGAACACTGAACAGGAGTCCCCACTGAACCTTTGCTGCGTTCCGAAGC 

AGCACAGCCATGGTTAGGCACTGCCAGGCAGCTCTTGGAAACCACAGGGAGGGAAGTTTT 

GGTTCCTTCTTTCTGTCTTTGTTTTTGAGAGGAATGAGCTTGGAATAAA^ 

AATGACTGATTTGCAGGTTAGGTGTAAACATTAAATCCTCATAACANGCTAATGAGAGGN 

GGGAGAAATTATCCCCATTTTACAGATGAAGGAACAGAGGCTCCTAGAGACTAAAGAACT 

TGCTACTCAATGAGGTCAGGATTCACACCTAGGGTGACCTGAGGATAGCTGCTGCAANTG 

CTCACCCTTTCTACCTTCCCAAGAAACAATACGCAAGAGAATTTCTTTCTAAATATTGNG 
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AAAMTCTCTTTMTAAAAACAAAAAAGTGATGCTAGTGGT/W^TAAAA^TN 



Sequence 413 

acttccagccaatccccgcgtacctngggcgctctagaactagtg ^^^acc 

Sequence 414 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGACTAGCCTTCATCfAGTTCAGGACTCCT 

C ^a^S A SI CCTOCATGACTCC ^ GCCTCTGATGNCA ^ 
^ AGAC i^ 

TGAGGTCATTACTGCTTTATGAAACTTCTGTTACATGAACATATATGTGATGAAAAAAA 
Sequence 415 

? CG ^ GGAGCTCCCCGCGGTGG ^ 
AGATCCTGTCTCTGAAAAAATTTmAMTTAAAAAA 

TTATCCAMTGTATTACCCAAATATCTTCAAATGCTAAGAAGTAATTTTTGGATTAAAGG 

S3!IV^ GAGTCTACATGCCTACACTCTC ^ 

CCGGAmCMGTGGMCATTTAMCTGACATCTTCTGATGTTATTTCATCCAAATTACT 

TTGAGATAAAGAAATGTTGCAGACAATTGTACCT 

Sequence 416 

CATCTCCCTGCACTGCCATTTCATTTGAGCAGGGCTTTGCCTGTCAAGAACATTCATATC 
TTTTGATAMCAGTGCATTTTTTCCCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCA 
™ A ^5 GACAGGGACTOAGGACCTCCTGCTGTG GTCACCMGGMTCACATO 
TCTCAGCMGGTGmTGGTCCTTCTCAGTTTGCCTACGATCCATTGAGGCAAAGGTTTC 

CTTCTCTCAATACACCCTCTGCTTTANCCAGGGGGACCTCCCACTGNCCTCTCTATGTGT 

NCTATTTGTTCCTGACTGTTAGTACCT 

Sequence 417 

? CGAA IT GGAGCTCCCCGCGGTGGCGGCCGANGTGNACG GATCACAAGGTCAGGANATGG 

AGACCATCCTGGCTAACGTGGTNAAACCCCATCTCTACTAMTACAAAANAGTACCCGGG 

^ G I GACAGCATGTGCCTGTACTCCCAA NTACTCANGATGCTGANGCAGGACAATGGGGNG 

AACCCGGGAAGCAGANCTTGCANTGAGCAAGAGATNGTGCCCCTGCACTCCAGCCTGGGC 

GAACAGAGCGAGACTATGCCTCAAAACAAAACAAAAACACTANAGGGCAATTTTGCATGC 

AAATTATGCTAAGGCATAATTTTTAGTATTATCTTAGTATTTCTNAANNATGTAGTCNTC 

TATGGATTTACTGAACAAATTTGCCATATCTGCT 

Sequence 418 

AGG I AGA I RCACGGAGGTGGGACCA AMGAGGAMGMCA^ 

A ft G IZ G GG AA "'" GG A "'" G A ^" GG AATAAAATTGTG AAACTAATACTG GC AG GC AAAC AAG AA 

AAAACAACAACTGCCACAAACTGCAATATGCTTATCAGGGGTCTTGACCAAAAAAGAAAG 
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GCCCCCGATCTTCCACAGAGGACACTGGCACCAGTCCCAGGTCCTGCAACTCTTGAATAG 
CTCCATGTG CCGTGTTCTCTTCAGGTAC CTTGCC C G 
Sequence 419 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACGGTAACCACATTTAGATGG 

ACTGGGATGTTGCCACACATACAAGCATTGATAACTGGCTTCTCATTACCTGAATACATT 

CTTCTGTCAGAGCAACAGACTCAGCTATGC7TCTGGCAAAATTGTTCTTAATTCTCTATT 

GATTMTTTATTCGGGTMGTATTTTATTGGGGTATmCTGTCTGAAAAGTGCGATTCC 

AGGTGCTTTATGTGTCTCTGTGTGTGGGTGTTATATAAATACTTATAATACTGTATCCAT 

ACTCTTGAAAAGCTTAAGTTGGGAAGGCMGGCATGCAATAAGGAACACAGAATTTTAGT 

CATTCCACAACCATCTGTTTGAATGGCTGCTATTGTTAGTATCCGGTGGTTGGAAACTGA 

GAAGCAAAAGATGACTATAATAGGGATCTCTTTTCTG 

Sequence 420 

ATAACCCACCCCAGTGAAAAACTCCAGCCCAGTGAACTGGAGGGTGTTCTTTTTGGCCAC 

GGTAAACTGAGGGATTATGATGAAGCTGAAGTTCAGGATGAATGAGAAAATGTTGAACTT 

CAAAAGCCATCTCAGAAAGTTGAAATTAGGAGAGGACGCTGGTTCCAAACTTGCCTCCAA 

TGATCTTCAGCGTCNTTCTNGCCACAGGCTTGATGGAATTCAGAATTTCCCGACAGGCTG 

TTCTTGGATCTCCGATAAGCCCGGGAAATGGANTTCAAACACTGAANTACAANCAMTTG 

AGCNTGAAGGATACGATGGGTGTGATTTGCTTTTTTCTTTGGCAACTATTTNCCTAAGGT 

TTCCTTTTCTCTTCCATGG 

Sequence 421 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCAAGCTTCACAGCCCTGTCTCTGCCATGGAG 

AGCAGGCCCACGCACTGCTGCCTGCTGGAGCAACCGTGCTGGTCACAGGGCCAGAGGCAG 

AGAGCCCTAACAAGCAGTCAGGGCGCTAATGAGGAAGGGGGGGGACAACCTGTCCTCTAC 

AGAAAAATAATGAAGATTAATATGTTCAACAACTCCCATCCAGAATCCAGCAGGAAGTTA 

AACCAAACGACACATGATTCTCTGACGAAGAAATAAGAATACGGTTGACTGG I I I I HA T 

ATTTATTTACCTGAAAGCTTCTAATGTTTCTAACTAGTGACATATTTCAGGCCAGGAATT 

CTGTTTGCCTAGAATCATGATGTCTAAACACTGCACACTTGACCAACTTTGGATCCAAAT 

AACTAGCTTTGGTATGCTCTTG 

Sequence 422 

CCGGGCAGGTACTGGTGACTCACCATTCTGCCAGCAGCCACTTTGCACTGGGCACCAGCA 

CATACCAGCCCCCATCAGGAAATGTGCACAGTCCATCTCATCTTGTTTCATCTTCATTTT 

ATTAATGAGGAACTGAGAGTCCAAGAAGGTTCAATCATTTGCTCAAGGGACAAAGAAGTG 

CATATCAACTCAAGATCTGAGCGACTCCAAAGCCCAAGCTCTGTGCCCCCGATAACACTA 

AAGGGCTCTGGAACCTTCCCTCACTGTGCCAAATTCTGGTCGCCACACTTTAGGAAAGGT 

ACCT 

Sequence 423 

AGGTACCGCGGGACTAAl I I I I I I I I CTTCTACTTAGAATAAAATTCATGTAGTGCAGTA 

TCCAGAGCACATTAACTGAAGTATTTGGAGACTAGATTTAATCTCAGATCTCACCCTTGG 

TAAAATCATTAANCTTCTGAGCATCANTAGTCTCAAAACTTTTTCATTGAAGAAGGGGGT 

AAAGTTGGTANNTTTTTTCCAAAGGGAACAGNAAAAGGGGAGNGGAAAANTACCTTACCA 

GNTTTCTTCCNTTTGNMGGCeTTTTCTTNCCNGAMTTGTTCANTTM 

TTNGNCNCCCGGGGGNCAGGGGCTNCGNNTTCCTTAAGNAAAACNTANGGGTTGGGGATT 

CNCNNCCCCTNGGGGNCCTTNGANAATGGGGANAATTTCCTGNAATTAATTTCCAAAAAG 

GCN I I II I AATTCCGGAATTAACCCCCGG 

Sequence 424 

AGCGGGTAAAGTGGACTGGGGAGAACTTCNGAGGATGTTCATGTCCAGGAGCAGCCCCAC 
GCCCTGTATGGNCGGTGTCTAGAGCCTNACANCAACTAAAGACCAACCCANCTCTCATAA 
GAAAGGAATGTCAAANTGTCATGTTCAATTTTTACATTTCAGTAGCCTGGGAATCTTTT^ 
CTTCACAAATTTGAAAATGAAACTGTGCTTGAAANGGAGGTNAATTNCATTGCATTAATC 
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Sequence 425 

AGGTACCACAGGACACGCCCAGGATGTTGGGTTCCTGGTGCTGACCACAGTGCTATGGTC 

ACTCCACAGTCCAGCACTGCAGTGCACAAAGGGTGCCCTTTGACAAAAATGCTGCAGCAA 

GAGTNTGACCAACTTGAAAAGCANCAANATCAGCAAGATAAAAGATTCCTGAGAAGTNCT 

AAGTAGCCCTCTACGAAGATACAAAACATGCACAC 

Sequence 426 

AGGTACCATAATTCATATATCCTGTTTTTATTTGTATAGGAACCTAGGAGAAAGAGAAGG 

■AAAATTGAAAATAAATTAAGTCAAATGCAATACTTGTGCTAAAAATGTTGTCAAAAGTTG 

AGGCCANCTTCTAGGCAAAATGTTCGGTAGGCANGANTTTGCCTTTTCAGTCTTNTTTTC 
TCAATTTANCCAAAAAGTTTT 

Sequence 427 

CCGGGCAGGTAC I I I 1 1 1 I I I 1 1 1 I I 1 1 1 | I GTGGGCAAAATGAAATCTGAGTGGACCAA 

TGACCTCACCATGTCCAGTGTTTGGTTGCTAGTTTGAGATTTGGAAATGGCTGTTCAGGT 

GAAGAGGTAAATGGATCCAAATGCCTTGTTATAAGGCATTTGACAATGCTTACAGCAAGT 

AN^TGMTAGAGGGCCATTAAATTCTANCCAfAAACATTTCCCGGGGCTACTGGTTAT 

• mi 1 1 GGTTGNAACTTACACTTTTGAGGTTTTTCACAANGAGACAAGGNTNTACAAGGA 
TACCCTACATGT 

Sequence 428 

CCGGGCAGGTACGCGGGGAGTGGCATCCTTTGAACTCACGGATCCAGTCATAACTGAAGT 
TCCATCTACACCTGGGGTCTTCAGACCCAGGCTCTGGAAAAGATCCTGTTACTTACAAGC 
AAAAGAAATTCTGCCTCTTACCCTCAGCATGTGAGTACCT 
Sequence 429 

CCGGGCAGGTACTAGACATCTCTCTGAGGTCATGTCACACTCACCATGCCACCCCATCAT 

GTCCTTATAGATGGGACCCTCCTCTTCTGGTTAACACATCATCTGAAGATAGCCAGAGAG 

CCATCTCTAAGCTGTGCAATGACAAGGAGTCATCCTTCATAGAAGACATCTGCCCCCCAA 

ACAGGTCGTCTATACTTGATGGTGACTTGTTAAGAAGTTCACCCTGTGCTTTATTTCCAA 

CTCAGGACACCTGGAAAAGGTAAAAGAACCACCATAGTTGTACCT 

Sequence 430 

CAGTNTATTTGCTNGAAAAGTGGTAGTAATGCTGAAGAGTTTCAAATTATTTGAGCCTCT 
CTCTCACCTCTCTCAGTTTCCTCATCTTTGAAATGGGAATACTAATACATACCTCAGTTG 
ATTATGAGGGAAAGAGTTAAATGACGTAATATGTGGAAAGGGCTTAGCCAACTGCTTAAT 

AGGCATCGCTCAGTAAATGTTACCTAAGTACTCCAGCCTGGGTGACAGAGCAAAGACTCC 

GCCTCAAAAAAAAAGTGCGAATCCAACCGTCTACC 

Sequence 431 

AGGTACTCTCTGAGCCGAGTGCAGTGGCTCACACCGGTAATCCCAGCAATTTTGGAGGCC 

GAGATGGGCAGATCACTTTTGAAGCANGAGTTTGAGACCAGTNTGGCCAACGTGGNTAAA 

NGCTGTCTNTACTAAAAAGGNNAANNAATNACNCNCNNGGGNCTGTNGCCTNAACTCNNN 

TTGACACTTTNTTNANGGGNGGGT7TNNCGCACTTTGNAATCNTTTNNCGAAACNAAAAA 

NNGNNAGNGGNACCATTNNANATNGCCNNCCGAANATCCCTTTGGAATGAAATNTTTCGN 

GCCAACCCTNNACCCTTNTTGCCAAANTCCTAGGAATTNGGGNGAGCCCNCNAANAAGCA 

GNCCCCNAANGGGGNAACCCTNTGGGMNGNAAANTGTTTNTTTGGGGAAAAAAAAAAA 
Sequence 432 

GCGGTGGCGGCCGAGGTACATACTGAGCTCCTCCNTCACTGGNGCANACATTCACTGCCT 

GTCCTCTTACCTGTCCAGATGGGTGTTAGGGCCCCAAAGAAGATTCAGCTCAGGGCTTTG 

TCTGCAGGATGTTTCTAATGTGGTAGGAGACAAATAACGCCTGACATTCAACCAAGGTGG 

ACAAAGGNNNCGGGCTCAGNGGCTCACGGTTGTATTGCCAGCACTTTGGGAAGCTGTGGC 

AGGTGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGACCAACATGGTGAAACCCCGT 

CTCTACGAAGAACACAAAAAATTAGCTGGCCGTGGTGGTGCACACTTGTAATCCCAGCTA 
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CTTGAGAGACTGAGGCAGGAGTTCNCTTGAACCCAGGAGGCANAGGTTGCAGTGAGCTGA 

GATCGCACCGTTGTACCTGCCCG 

Sequence 433 

CCGCGGTGGGCGGCCCGCCCGGGCAGGTACCGTAAGTTGAAGTCCAGGAGGGTTCCAGAC 

ACAGAGCTTCTGGTGTCTTCTCCCTGTGGAGTCAGGACTCATTACTCTGCTGTCTTTGAT 

GTGTGACAACAGGCACCAACTATCGCCAACTAGAGAAGCTCACCAGAGCTCTGATGTTGG 

AAGTTTTTATTGGGGCCTCAGCATGTAGACATGATTGACTGGTTGGTGGATTAACTGTCT 

ACATGGTTAATCTTANTGTCCAGGTCAGCTGATACTGCGTGACCAAAAGCTCCTACCCCG 

AATCCCTTGAT 

Sequence 434 

TCCCCCGCGGTGGCGGCCGAGGTACACAGCCTGGCACGTAGCAAGTGTTGAAATATCTCT 

TGGAGAGAGAGAAAAAAAATGAATCTATTAAAATGTCTCTATGATGCACTGCGGAAGCTA 

GACTTGGTAGATCGAGTTTAANGGANTGAANAGGGGGTGAATGTTTGCTACTTCCGTATA 

TTTGGAGTATTTGATAAAACAGAAGACCAGTTTCTCTTCAAACTTCATGTCGGNTTGTAG 

GTCAGGCTGCCAGGGTTTAGGGTACCTGCCCG 

Sequence 435 

GGGCAATCTACACAAGTTTGGGGTCCTTGCAATCAATCAAGACCCGAAGGGAGTAGCGGT 

TTCTCAGTTTTATTACTCCTGTAAATCCTTCTTCTAGTTTATTTCTGGTCTTGCAG 

ATTTTTCCTACCAGCTGCATTGGACATGGNGAAG I I I I I I I GTAACTTTTGAAGATT7TC 

AGTTTTCTGCATTAAATAATACTTTGGAGTATATACAATTTTTTTGTTCTAC 

TCCACACTGCACATAGATATTTTCTGGCAGAACAATATCCCATAAAAAATCTTAAGCTTC 

TTTAGTTAGAATACCATAACCTTCATATGCTACATGGTAACATTACCCGCGTACCT 

Sequence 436 

GCGGCCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTT 

CTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGA 

AGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCA 

TGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGG 

GCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCT 

ACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAG 

TTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATTCCTTGAAA 

TTTGCAAGTTGTCTGGGTTCATGTCTAAGCTTGTTCCAGCTATCCAGAATGCCCACAAGA 

ATTCAACTGGATCTGGAAGAAGAAAGAAACTGATGGTGTTAACTGAAC 

Sequence 437 

AGGTACGGTGGGGCACCCAGGTAGTAATNTGCAGGAAGTAGAATTGGCAACAAAGGACAC 

AGAATGAAATGGTGAGATGGCTAGCCGGAAACATAGGGAGAATGGCATCACAAAGGCAAA 

GGGGGGAAAGAATTTCAGTTTAGTGGATAGTCAACCMGGCATTTCACTTAGCAGTCAGG 

AATGAAAAAACGATACTGAATTTGMCATTANGAAAGCTTGGTAAATTTTCAAGAGTNTA 

ATTTCTGCAAAGTTGGAACACAGTGAATTAAAAMGTGCTTAAAAATTGAGGACAATTGA 

AAAG7TNACCAATGTTAAAACCAAACCCAAAAAMTTTTTNAATTTTTA 

TTCATTNGGGGGGGTTTTTTTGGNAAGGCCCCN I iTTTTTl 1 AAAA 

Sequence 438 

TATAGGNCGAATTGGAGCTCCACCCGCGGTGGGCGGCCCGCCCGGGCAGGTACGCGGGTA 

GGGATGCAGAACTGGCCACAGTCAGGGCTGGCCAACAAGGGATGGGATGGCAAGGGAACC 

AGTGG G AG ACTGTTTAC ACCTTTG ACTTC CTCTTC AC AG GTC AAG G C AG G ACTGTAG CAT 

TAGGTCTCAGAGATGCAATGAACAGGACAAATACACTTCCCCTTCATCTGGACCTGAGGC 

TCTGGGCAAGTCAGTACCT 

Sequence 439 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTAAACCTGGGGCTAAATC 
CTGTAAATAATTTCAGCCAAGAGGCGTTTTCCCCTTCTCATTCACCAAAT1TCCTTTCCC 



Page 71 of 869 



WO 01/51628 



PCT/US01/00798 



Table 2 

?r^^^ GTCAC ^ CGGTMGMmCAGTCCCC "^ CCTTGT CTGTAACTGTCAA 

AGACCTGGGCGATCCACAAAGGAGCAGGAGAGGCCTGAGCAGGCAGGGATGGCCAGAACA 

AAAAA ? G J GAGC ^^ 

TACMAAACCCTGGCGTTCTTCATGGCTAGGCAGCTCTACCTTGTTCrnCTTCCCTTCA 
TCAAMGTCMCAATATCCTCTTATGTTTTTCATATAGCTGGCTGCAAGTATATTTTTCA 

Sggg^^^ 

Sequence 440 

S^^ GG ^ GGGGG ^ GGGGGGGGG ^^^^^ < ^^'2GGGAGGGGGTAAAGTGGACTGGGGAGAAC 

£S GAGG A A ^^^ 

TCACAGCAACTAAGGCCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAA 
TTTTACATTCAGTGCCTGGMTCTTTTCTTCACMTTGAAATGAAA^ 

l^l CCATGCA ™ TCTTCAGCT CACAAAGGAAATACTACATAAGAAGCAAGGAACAC 
GC A AA ^ GATCTACAGCTCTGATCTCCA GGATAGTGAAATGAGGTGGTGM 

CAGACTCAAGACGGACATMTTGGATTTTTTTTGCCATGGCCTGGAAAGAAAGGTACCT 
Sequence 441 

i GG ™ TCCGCATCC ^ 

CCTAACAAACAAACAAACACCTNAGTGACTTGTGATTTCCCCCCAGAAAAATATCTATTA 

tZ^]?^^^ gaa ^ gg ^ g ^^ a ^ g ^ ggg ^ a ^ ag ^^ a ^ a aggtgtaggcacatgctctga 

I C ^a^ CCTAGGAAACCGTCA ^ ATCAAGGGCATCCCCTAGTCCACTGTC 
Sequence 442 

T^mAAACCAGGTGAGAAAAATTAAATTATGTATTCTAACAAAGTAATATGTGAGAT 

TTTGCAMTGATTTTATA GAMT ACACAAAATMCTCTTTAGCTTGCTCTGAGCATTTTT 

TTCTTTrCTGATAGCAACTTTTTAACGTTGTGGATCCACAGAACTTACTGCTTTGCT^^ 

TCTTTTGGGGTCATAATTCCTCTCCCCTTGGAGTGTCCACTCCATGCATGTGCACTTAGG 

A I GTGTGGCTGTGTGTGTG ^ GGGAAC CCTCACGGACACATAAGGTTCTATTGTCATCA 

AGTAGAAMCCTATCTCATTATCATTATMTGTCTTCAGATGCTTTCTAAGGTTCACCTC 

TrmMCATTAGAAGTCAGTGAATGCAGCTTTCAmTAATTTTTMTACmAAAAT 

GTTTTTGTATTANCTGCCANAATGCTCAGCAGCAAAAGTTATGACTCACTTCTAGCAAGT 

GTGGTAGTTCTTTGCTTNMGCATTTGGGTTTCATGTAGCTTTTCTTCTTATTTTTTCTT 
TGG 

Sequence 443 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAATTAGTTGCTTAAAACTTTTTGTCATT 
GTTAAAACTGTTTACATGGCTTTTTAAAATGGGGGAAAAACAATAGAAACATGCTGTGGT 
GC AACGAATGAATACAGTGTG ATTAACAAC TTAA I I I I I I I I I I CCTGTTTCAAACATCC 

ACCTATTTTTAGCATTTTGTATACCCATAGGTTAAAMTATTTACATTTTTCTTAGAAAA 
GTAACCTGTTTAAGAACAAAAAAGGCCAAGCGTGGTGGCTCACATCTGTAATTCCAGTAC 

Sequence 444 

t^?^ AGGTA0] 11 1 " I 1 I I I I I 1 1 1 I I I I I I GTAGAGACAGGAGTCTCCCTATGT 

i GCCCAGGC J G ^ C ^ GAACTCCTGGG CTCAAGCAATCCTCCTGCCTTGGCCTCCCAAAG 

^ TGGGA I rGCAGGCGTGAGCCACC ATMTTGGAAAAAGTAACCCCTCACGCATCCATC 

^ G ^ CCCCGC ^ CAAAGCAGGGG TCCTTCCACTGCCCTGAGAAACCGGCAGTGGGAAC 
ACGCTCTACTTCCTGCCTCAAGCCCCTAAAGCCGCCCTCCTCCCTAGCTAAGACCAGTCA 
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TTCTACCTGGGCCCTAATCTACGTCCCACGTCCTGAAGGATCGCATCCAATCATCTGCTC 
TCCTCCACTGTGTAGGCAAGGGTCTTCTTCCACTGGCCCTGCTGGGAGAGATCTCTCCCA 
TCTGAAATGCCCTTTG 
Sequence 445 

CCGCGGTGGCGGCCCGAGGTACCTCTCATAAACTTTGTTCATTCAGCCACTTGAAATAAG 

CCACGTGGGTTATGTGCCACGCAAAATTTCAGTAATGGGTTTTGGGAGTGTTTAGATAGC 

AACAACTTCATAGACTGAACCTTTTGG I I M I I I J CTTTCTCTCCAACTCAGAGGTTAAA 

AATTAACCTGAAAAATCAATGCTGCMTATGAATGCGCTGTATGTTTCTTGCATGGACTT 

GCAGACACTCTGTAACGGTGTTTCCAGTCTCCAGCAGTTCATATTGGAAGGTGAGCTTTC 

ACATGGACAAGCCCTTTGCG7TGCCAAGTAGTAGTGATATTACAAGTTTGAAAGACATTT 

AGATGATAATTCCAAAGGCCCTCTTATCTCCAAATCCATCAGAAGCTACCTATAATTTCA 

GTGCAATGAGTGCTGGGCATAGGTGGCCATTCAAGTGAGTGAGTACCTGCCCG 

Sequence 446 

CGAGGTACTCAGAGGTTTCCTGGCTCCAGCCATTGCTCTCGACCTCACAGCTACCCATTG 

GGCCAAGCCACCTCCAGCTCTCACCTGACTTGCTGCAGTGGTCTCCCAAGAGGTCTCTCC 

ACTCAGTCCCTGCCACCTTCAGCCTGTTCTCAGTGGAGGGGGCCAGAGTGACCCTGTTAA 

AACATGAGACCTTTCACGTCCAACCTCAGCTGAGACTTCCCCATGGTTCCTATCACACTC 

AGGGTCAAAGTGAGAGTCCCTAAGATGAAACACAAGTAGGGAGAATGCCCTTCTCAGCTG 

GACTGGGACAGCTCCAGGGTATCCCTATTGTCCTTGTACCTGCCCG 

Sequence 447 

CCGGGCAGGTACGCGGGACTATAAAAAATAACATGTTCATAGTTTTAAAATTTTAACCAG 

TGAAGAATAGTATAAAACGAAAMTTAGTCTTCTGTCCTGCTAATATTTATGCTCATTCT 

GTTTCTTACATATGCTTACAAAAATGTTCTTGCATATATAMTACACGTCTTTGTCTATT 

TTTTATGTTTATGAAACTAGGATCATACTTAAGATTCTGCTTTTTC 1 Mi l I CAATATAT 

CTTTGACATC I I I I I I I I II I I I I I IGAGGCGGGGTCTCGCCCTGTCACCCAGGTTGGAG 

TGCAGTGATGGGATCCCAACTCACTGCAACCTCTGCCTTCCCAGTTCAAGTGATTCTCCT 

GCCTCANTCTCCCGAGTGGCTTGGACTGCAGATGTGGGCCACCATGCCCTGCTAAATGGT 

TTGTACCT 

Sequence 448 

AGGTACCTGTCTCATTCTGGC/y\CTTGNGACATTTCTCAATCCAAACMTATCTTTAAGG 

TATTAAACCCTATGTTTTTGGCTGAAGAAACTGACCTCAGAAATGACAAGCAATGTATCT 

AAAGAGCAGTAAGTAGGCCGGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGAAG 

GCTGAAGCGGACAGATCACCTGAGGTCGGGAGTTCGAGACCAGCCTGACCAACATGCAGA 

AACCCCATCTCTACTAAAAATACAAAATTAGCTAGGTGTGGTGGCAGGCACCTGCAGTCT 

CAGCTACTCAGGAACCTGAAGTAGGAGAATTGCTTGAACCTGGGAGGCGGAGGTTGCAGT 

GAGCTGAGATCACGCCACTGCACTCCAGTCTGGGCGACAGAGCAAGACTCTGTCTCAAAA 

AAAAAACAAAAACAAAAACAAAAACAAAMCAAAAAAAAAAAACGGNAAGTAGCAAATCT 

GAGATTTGAATTGAGAATTGACTCTAAAAGCCCATGCTG 

Sequence 449 

CCGCGGTGGCGGCCCGAGGTCGTAGACAAGAGGTCCTCCATCAACCTTCTCAGGTGGAAG 

TCTTCTTTTAGAAGGATGTTTGTAAACCCTTTTGCATCAGGMTGACAGTCCAATTCm 

TCTGTCAGTTCTTGTTCCAGAACAATTAGGGTGAGTCATACGGAGTAACCTTGCCATAAA 

AAATGGMTTCGTATGGGAATTGTATGMGGCCTTCTTACTGAAAGTCATTGAGATCCTT 

TAGATGATGTGTTTGTGTATTCTTATGATTTAAATACATGCAGAAATGAAAGGATGCAAC 

GCAGTGCTAGAAAGCAATTTCTCTCTGCGTTTCACTCAGGGTTTCCTGTAGCCACTGCTT 

CTCTCCATCCATGGACATTCTCCCTCTTCTGGTGAGTCGTTCC 

Sequence 450 

CGCCCGGGCAGGTACATCCCACTGAGAACCACTGAGACAGATCCTGTATGGCCTTGTGGT 
CATAAATATGATTTTGTTCTAAGTGTAAGGAAGAAGCCAGTAGAGAGCTATAAACAGAGG 
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T I I 1 1 1 I I 1 1 i 1 1 I 



S^SS^Z CTTGCTCTGTGGTCCAGGCTGGANT GCAATGGTGCGATCTTGGCTNACT 



ACAGGGAATGGC^TGCCTTACCCCATTCTGATCAGTCACTGATGCAGGAAACARfiRAATT 



CAGTCA 



CTGATGCAGGTAACGGAATGGGCGTGCCCTANGCAAGGCTTTGC 
GCTTAAATTTGAATGACATGAGGGCTATTTTATGCCTTCCTGGG 

CATAGGATCTCACACAAGTGMTCATrTATATCATGGGTTCGTCATGTTTTTAGCATGTT 

£a^mga^^ 

Sequence 452 

tcScatgcagS?^ 

GCAcISGCATCTrrT^A^ 

A^wrA^rA^ 
Gtc?^G AG A r^^ 

GACCTTTGTCACCTGAAGCATTCTTAAAATAAGGAGACTGACATTAAACAGGA 
Sequence 453 

Sequence 454 

TTGGTCTGGAGACCCCAGCTGGGGTATCTCCTGAGCCCAGGGATTCAAAGGTTCrTrrrA 

GGGTCTGTGCCACTCCTGGGTGAATGGTTGATCTGTCTCACTCTTCTCCGTGATCCGAAG 
GTCACACTATGTCACTGATGAATCCTTATGTGTC ^< <^ • ^ I laATCCGAAG 

Sequence 455 

CGAG A ?J ACCCTGCTGA ^^ 

AATTAGAAATTATGGGGCCACmGCCATTCrrCACACCTGCMTGMC^ 

GAGTGTTGGCTGTTGGTTGACAGAGGAGCTGCATCCTCCTGGAGGAAACTGAACTCAGCT 
G £ GA ^ GCCACCTGGTCC ^^ 

tcI^ac^c^ 

Sequence 456 
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CCGCGGTGGCGGCCCGAGGTACTTCCATCAAGTCGAGGGAGTAACGCAAACTAGAATAGT 
AGCCATCTATTAAGGATAGTTCTGAGGTTTGGAATTTGTAACTGATTTTTAGGAGCTACG 
ACATGAATATCATATCAGGTAGAMTACTCAGAGTGATCACCTTTTATGAAAGTTAGTTC 
TATCTTTCATGTCTAGGAAAATATAACAGTGMTGTTTGCTTTTTGTATGTTTTGTCTCC 

AAGCAGAGGAAACTGACAGTTTTATGTTTTAGGTGCCGAAAGTTGTTTTTGGCTAGACAG 
TTCCAGAGTTAATCACTGGGATGCTAACTTGM 

TTAAMCGAAATATGCTTCCGTTATTTCCAAACTGACACCTTGTGTTCTGGATTAAGTTT 
TTAGAGA MTGTTTTC CAAA1TCTCTTGGATTACTTGAAGTCCATTGATTTTGAAATATT 
TAGTTTAAGi I I M i ( CTTTTAAAGCTTATCAGAAAGTAGAGGAAGTACCTGCCCGGGC 
Sequence 457 

CCGGGCAGGTACAATGGGTGTTCAAMTTTCTCTTGTAGGGTTGAAAATTTTCAAAATM 
AAAGCTG GGGATAA GATGATTTGGGTGGGTCTGGCCAGAAATAATAAGAGCAGGATTCTC 
AGCTCTCCi I I I I IGGCCACTGTGCTTCTGCAGTGGGACTCAAAAAGAAGTACCT 
Sequence 458 

AGGTACAATGCCCTTGGAGGGAGAGGGGNTTCNTGGAAAATGGTTGCCATGATAGGATTT 
GGGGAGAACCTCTATGTTTGAGGGAGTTTGGGTCAGGACCTATCCGTGGAAGCAAAAACT 
AGTGCCCAAATATTCAAAACATAGTTTGGTGAAAGATCACCTATGATTCTGAACGAAATA 

GTGTA GANNAA GACATCTGGMGTTTTCCTATTGAAAAAAAGAAAGTGTGAAATCAGATA 

CTTTTGTTTTTAAGGCTCTCTTC 

CTGCCCG 

Sequence 459 

CCGCGGTGGCGGCCCGAGGTACTTAATGAAACATTCTCACTTATGTTATGCTACTTGAGA 

CTCAGAACAACTGTGGGAGCTTGGGACAGATGACTTCTCATTTTCCAGATGAAGAAGTTC 

GAATCCAGAGAAGTCTAGGATAAATAAGAAGGAAAGGAAGAAGGAATAGGAAGTGAGAAA 

GAGGGAAAAAGAAAAAAGTGTAAGAGCTAGTAGAGAAGCAGTTAGAGTCCTTCCAAGGTC 

CTGAGCATGTTTAGTTTAGGATGGAGGGAGGGGAGTTCTCAGAAGAGAAAGCAGCAACTC 

CACACCATAGGTTGAGGTGGGTGGGCAGAAGAGGACATGCAGGATATGACTCCTCCTCTC 

CAGTAAGCACTGAGCGACTCTACTGTCCTCTTGCCTTCTTCTGCTCCTTTGCTCTGGGTC 

ACCAGAAGTGGACTCTTGTTCCAGCCTCCCACCCTGACAAGGAGCTCGTAATCATCTCCT 
CCTT 

Sequence 460 

AAACCCAGAGTAGGGAGACAGGGMCCAGCAATGTCAGAAGGAGCCGATATTTTGCATGG 

AGCTAAGGCTTCTGCTCATCATTTCTCCTCTAGGAAGCAGGAAGCCATTGAAATCCATCC 

CTAATGTATCCCAAATGTCCACTTGCTGAGGGCNGGATTGCACAAGCAACAATCTTTTGC 

TCATGAAGCTTAGGGAGCACTGTCCCAGGGTATCAATTGGCCCAAAGCTGGTCTCTTCCT 

ACAGACCATGCGGCAAATCAGAACATAAGAGGACTTGAGGCCCACAGCCCTGAGTTTCAA 

TTCCAGTTCTGTATTTTAG GTCTGTG TTCGTMTTAGTATCATMTTAACTGAGAAGGTT 

ATTTAGATTTATTGAGCCTCTTTmGTAAMTGGACTATTCATATACMCTGCAGAAM 

ACCAAATCCCCCCAAAAAACCCTTGCTACTTAATACAAAAACCTAACTGCAGGCACTTCT 

ACTTTGCGTGGGTTGACAGAATAACCCATCACTATTCACCACCCAAAACAAACACC 
Sequence 461 

CGAATTGGATTTTCCC NCGGTGG CNGCCGCCCGGGCAGGTACGAGCANTATTGCTTATCA 

GAACACACCAACAGTCTTT7TTTAATATTCGCTTTTCCCCTTTGGTTCAGCCGAGCCTGC 

TGACAGATCATTCAAATTCAATTGTTTTATGTCTAAGCCGAACAGCTAATTAAAGTAGGG 

ATGATTAACCTGACTGGCAGTGCCGCTCTCCCTCTCGCGCTCTCTCTCCTTCTCCTTTTC 

TCTCTCTCTCCCTCCTGTTCCAATCAAGTCTTTTTCTCACACACTCTCTCTCTCCGTCTC 

TCCTCTCTCTCCCTCTCTCTCTCTCTCTCCCCCCCCTCTCTCTCTTCCCCCTCTCTCTTC 

CTCTCTCTCAGTCTCTC TCTCCCT CCCTCGGTTGGTGTCTCTCTCTCTCTCTCTCTCTCT 

GTCTCTTGCTCTACTGGTTTTTTTTCCTCTCTCTCTCCTAAACGTGGTCTGCTTGCTGAT 
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TTTTTTTTAAAGAANTATNNGGAACTTGGGAAGANGGGGGACAAAAAAAGGTXm 



Sequence 462 



™ G ~TL GG ^^ 

GGCAGTTTC 

CACTGGGTCAGGAGCACNAGAACAAA^GTGGATC^CCAC " ' """^ ' "^^^^^AGCCT 



I£ AG £^ 



G^NCCTTGGAACTATGTATATACCTCTACTATAGCTATCACATTTTATTGTAACTTTTT 

IS™£ AG ^ 



Sequence 463 

^ACT^ 

A^^TGTCTTTGGCCCAGA TC ™^^^ 
Sequence 464 

^ GCA I^ GTGmAGGTGTCA G A GAGGGAGATATCCTTACAAAGCAGAGATTAT^ 

ATTTTGGAGACNGATTAATTCACTAGTTGGTCTATTCA <j^^aaaaacATAT 
Sequence 465 

o C ^7 A G ^ AGCTCCACCCGCGGTGGCGGCCCGA GAGTCTCCCATGGTGACTGTG 

A^CGG^^ 
Sequence 466 

GCAC^TGCAAAACTGGGCCCMGTAmAAMCATCTCTAAAAMTATCTAG^ 
GGTTTTTATCCCTAGTTTTAACAGTCTGAAAGAGGGTTCGTTACTGAGCAC^^GAAG^G^ 



TCMCCCGGACACAAGGGAGAGCGCGGTTACCCTGGCMTATTGGTCCCGTTGGTrrTrr 
AAGTGCACCTGGTCCTNATGGCCCCGTGGGTNCTGCTGGCAAACATGGAA^ 

NTGACTTTCCCACACTTTGGGATGGTGGAAGAATTGGGAAGG 
Sequence 467 

^ C ^ GCAGGTACCACCACACCCA AAGCCTGGCTCAGAATGMTTCAGCACCCATTCC 

TTGCTTTTTAAACCTCAAAAGTTCCCGACTCAGTGAAAGGCTTCCATTTTGCT^ 
TCTCTGTCTGCATmACTCTGTMm 

Sequence 468 

rrATAGAATCTAGGAGGGCTGTGATTTCTCTGAGTAATGGATTATAGCAGAAGCGGTGCT 
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AAGTCGTATACAAGTCTGGGTAATAAAAGATGATGCAACTTCCACTTTGTTCACTAAAAC 
ATTTACACTTGGAGCTCTGAGCCACTGCATTAGTAGTTTGACTACTATGAGGCCATAGCC 
TTCTGGCAATGCTAAGCTGCACAGAAAGGCCTCAGGTACCTGCCCG 
Sequence 469 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTAATTAGATCTGCCCATCCGTTACTTGGCAG 
AGGAATGGGCTGGAAATGGGAAAGGGTCAACCTTTCACCTGCTGGACACTCAAGGTGAGA 
AGAAAAAGGAAAGGGGCCAAGAACTAACTGTCTGGCTAGACAGTAATCATGGACTCAGGC 
TTTAGGTATTTGATCTTTTAGAATTCCMTTTC^ 

AAATGGTTGGAAGTGGGAGGAACTTCACATTTAAATAGAAAACTATAAATAAGCCAATGA 

AAAGAAAACACTCATMTTATCCAATATMTTTTTTTATAGTTCATAGA^ 

ATATATATCTGCTGCATCCTTCCAMCATATTACTTTTACTCTTAAACCAGCACCACTAC 

TTATTTTCTATMTAGTGTATTCATmCCATGTTTCTGTTCAGAAAGTGTAG 

ATGTATGAACCTATCCTTANGGAAAAATTNCATTCTCTAAACCACATGACAATTCTCAGT 

GGGGATATTCCACTGTTCCAACATGAACAAAATGCCCCC 

Sequence 470 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCCACTCTGTGCC 

AGGCAATGAACAACATGGGCAAATAGCCTACCCTCATGGAGCTTACATTTTAATAGGGAG 

ACAGTCAATAAAGCAAACGAACATGCAAATAGATAATGGACTGTCAGTTGGTGACAAATG 

CCATAGAGTAGGCTMGATGCTATTTTAGAAGGGCTTGGAAAGTCTTCTGCATAGGGTAA 

TATTGGAGCAGAGACTAAATGGTGGAGGAGAGGAATATGGGACTGTTCANAGGAAGAATA 

TCCCTGGTGAGGGAACTTCAAGTATAAACACACTGAGGCAGGAGCATGCCAGATNCAAGG 

ATGACANGAAGGGCAGTGTAACTGGAGCGCANTGAGCAGGGCAGAGAGTGGGTAGATACA 

TGTTTATATATCTGCATCTCTAGAAGTTCAAGGGAGC 

Sequence 471 

CTATAGGGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I H H I U 

TTTTTTTTTTTTTNGGCGATGGAGTCTCGCTCTGTCACCAGGCTGGAGTGTGGTGGCGCA 

ATCTCGGCTCGCTGCAAGCTCCGCCTCCCGGGTTCAAGTGATTCTGGATAATCTGTGCTA 

TCTGATCAAACTGTTAGCTCCAAGATGTAGAAAGAAGAACCTGTGGTATGAATATTAAGT 

CAGCCAAA CAGCT TTACCCTTTGTATGGGAGTGCTTTTTGCTTCCTGTAAATAGCCTTGT 

AAAT GGTTGT TTTACAGAGAATTATTCAGTAAGGAGAAGCCATGGAAAAATTAAGAGGAA 
ATGATTTTTTCCCAAT^ 

mini iGAGANGGAGTCTCGCTCTGTCACCAGGCTGGAGTGNGGGNGGCGCAATCTCGG 
CTCGCTGNAAGCTCCGCTTCCGGGTTTNAAGTGAGCCCAAATCGNGCCATTGGAAGTCTA 
ACCTNGGGCGANAAAGCAAAAAACTTTTTCTTAAAGNAAAAAA 
Sequence 472 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNGGCAGGTACTATATCAGCTCATCCCT 

GGCATGCACAACTGCTCANTCTGGCCAAGGCACCAACTCCCNGAAGGATAATGTGCTGCT 

TCAGCTCATGCCCAGGGAGCATACTCTTTTGGGTGGGCCAGGCACCATTTCCCTGTGATG 

CTGGGANCTGCTTCAGCTTAGGTACCACTGCACTCCAGCCTGTGTGACAGTGAAATTGTG 

TCTCAAAAAATAAATAAATGAATAAAGACACAGAATCTGCAGGATGGCTCAGTCAGTTGG 

AGACCCAANAAAGCCATGCTG 

Sequence 473 

GCTAATTGGAGCTCCCCGCGGTGGCNGCCGCCCGGGCAGGTACGCGGGGAGAGCTCCAGC 

ATTTCATCTAGGGGGAGGTGGAAGTGTGGCTTTTANAGATAAAAGGACAGGGTTGTTTTG 

GGGTGCTCTGAGAAGGGGAAAAGGAGACCAGTGTATAACTTTGAGTAGCTAGAACGGGAA 

GATTTGAGGGGGCTGAATGAATTGATGAGAGTGATTTAACTTAAAGGACTGGGGCATTAT 

CTGTGGGCCCGGGTATACTAGCTTCACTTTTGAGGAAAAGTGAAGCTGGGAAATGTGAGA 

TCTACTCAGTGGGTCCGGGGAAATGTGAGAGGAGTAAGGAAAGTGAAGACACAGATGTGT 

TGGAGGCTCCAGGNTAGCATGGTAMGAGAAAGGGTTTCNTGGATTCTGATTACAGAATG 
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GACCTGTGGGGGGTCTTCANATTGAGGCCTNACANGCATTTAAAAGGAGCAACCTGNNTA 

NAACAGANGTTTGGGGAAAT 

Sequence 474 

CTACTATTTGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGGGGGCCATTGAGACTGCCA 

TGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGC 

TCTTG AC AAAGATAGACC ACTGG AAC AATGAGAAG GAGAGAATTCTACTGG TCAC AGACA 

AGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAACGGA 

TTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCC 

TGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGG 

Sequence 475 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAATGCCCTTGGAGGGAGA 

GGGGCTTCATGGAAAATGGTTGCCATGATAGGATTTGGGGAGAACCTCTATGTTTGAGGG 

AGTTTGGGTCAGGACCTATCCGTGGAAGCAAAAACTANTGCCCAAATATTCAAAACATAG 

TTTGGTGAAAGATCACCTATGATTGTGAACGAAATAGTGTAGATAAAGACATCTGGAAGT 

TTTCCTATTGAAAAAAAGAAAGTGNGAAATCAGATACTTTTGTTTTTAAGGC 
TTAAANTTCCTAA 

Sequence 476 

CTTANGGCG AATTGG AGCTCCCCGCGGTGGCGGCCCGAGGTACCTTTAGTCTTTAAACCC 

TGATTCTTCC l nil IGGTTTAGTCTAATGGATGAAAAACTCAAACTGACTGATGTGGCT 

GTAATCTTGGGCCTGTGAGGTAGAACAGCAGAAAAAAANTAGGAATTTAGAAGAAGAAGA 

CTTGGTTTGAATTTATCTCCTTTAATAGGTGTGTGGCGGCTGGGCACGGTGGCTCACGCC 

TGTGATCNCAGCACTTTGAGAGGCTGAGGCGGGCGGATCATGAGGTCAGGAGACTGAGAC 

CATCCTGGCTGACGCGGTGAAACCCCGTCTCCACTAAAAATACAAAAAATTAGCCAGGCA 

TGGTGGCATGCGCCTGTAGNNCCAAGCTAATCAGGAGGCTGAGGCTGGGANAATCGCTTG 

AACCCANGNGGTGGAGGNAATNGGTGAGGCCCNAGATCGCANCACTACACTTCAGCCTGG 

GGCGACAGTAGGGAGACTTCCTCTTCAAAAAA 

Sequence 477 

AGCTCCACCGCGGTGGCGGCNCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTC 

ACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACC 

ACTGG AAC AATGAGAAGGAGAGAATTCTACTGGTCAC AG ACAAGACTCTCTTGATCTGCA 

AATACGACTTCATCACGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCT 

ATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAG 

AAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACC 

CATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTG 

AGAAATTCCTTGAAATTTGCAAGTTGTCTGGGTTCATGTCTAAGCTTGNTCCAGCTATCC 

AGAATGCCCACAAGAATTCAACTGGATCTGGAAGAGGAAAGAAACTGATGGTGGTAACTG 

AACCCATTTGATTTGAGACCTACCAGGGCTGATGTCATTCATTGGAAACCGCACAAACTT 

GGCTATTNCCTTTCCCGTGGGGAAGTATTGGGTT 

Sequence 478 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTATTGTATAT 
TTCTCACATTCTTTCAGAAAGTTATTAAAAT 

AGAGAGAAGATTGAAATTTATTTTGCAGCTATACACTTGGTTTTGGATTATTCATGTGTA 

TAAGTTATTTGAGCTGCTACCACTTCCTCTGACCTAAGGCTGTGGCTAAAATGGGTTTGG 

AGAAGAAATTCTTGCCTGTACCT 

Sequence 479 

GGCCGAGGTACCTTTCTTTCCAGGCCTGGCAAAAAAAATCCAATTATGTCCGTCTTGAGT 
CTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATGGATTC 
ACCTCCTTCAGC ACATT TCATTTCAATTGTGAAGAAAAGATTCCAGGCACTGAATGTAAA 
ATTGMCATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAANTTGCTGT 
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GAGGCTCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGAACATCCTCCG 

AAGTTCTCCCCAGTCCACTTNACCCGCTCTGGGCTCACCTCTGAATGTCCCCGCNTNCCT 
GCCCNGGCGGCCGCN 

Sequence 480 

TGGCTAAA ACATAT TCCATTTACTTGTCAGGACTTACAATTTCATTTCACGTGCATATAA 

AATTAGTCATTTTTCAMTTCACTATTAAAAATCATATCCAAGGCAGCATTT^ 

ATTMCCCATGTATTATTTTA^ 

CCAATTGTTACTATT7TGTAAAATTCACATTTCTAGCCACATAMTTAACAACATTCCTG 

AAAAACATAAGAATCAGATATAACTCTGCAGTAGGCAATCCTGGGTNCTGACACTGACTT 

TNCTACCTGAGGACAACCACATTCAAACTGCAACCATTTGACAAATGGGGAGTCTCAAAA 

AGTACAGAAGATAGCMTCCAGATCACGGCAAAAAACATGGCTTTTTCTTGCCAATNAGA 

AAAAANAAAAAANTTAAAAAAAAGTACCTGCCCNG 

Sequence 481 

CCGCGGTGGCGGCCGAGGTACGCGGGGGAGTTTCTAAGCCCCCGCCTGCGGTCTGAGGCA 

CCGGCTGAACCATGTCGGAGATCCTGTGCCAGTGGGCTCAACAAGGAGTTGAAGGTGTCC 

CGGACCGTGAGTCCCAAGTCATTTGCAAAGGCATTTTCCAGTGGGCTATCTACTTGGAGA 

AGTTCTACACAAGTTTGAACTTNAGGATGATTTTTCAGMTTTITTGGACANCAA^ 

TCAAGTGCCAAACTTMTAATTTTTCTCGCTTGGAGCCAACACTTACCTTCTGGGTGGTG 

CAAGNTTTGATCAAAATGTGGNCCATGGGCATCATCACAGAAAAGCCTGGGGTGGGCAAC 

AAAGCTGTTATATCAATTGTACCTGCCGGGCGGCCGCTCTTAA 

Sequence 482 

CCGCGGTGGCGGCCCGCCGGNCAGGTACTTTTANAAGAGACAGGGTTTCACCATGGTGGC 

CAGGCTGGTCTCGATGTCTTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGG 

GATTACANGTGTGAGCCACCACACCCGGCCTGMTTGCACTTTTGATGGCTGAGCCAAAT 

GCTCATTCTATTTAATATTGCACAGCCTGGACCAAAACCCGTAACATTATAAAGGAAGAG 

ATGAGAGCCATTTCAAACTGTGAGAGAAGAGAAGACATCAAGCAAAATCTGGGGGTTTTA 

AGCCAACAGACTTCANTCCTGGGGCCTTGCAGCAAAAAAACACCTTCAGTTGCCCCAGGG 

CTTTACAAGGTGTGGCTCGCCTGTTCTGTTGCTACACGCTCAAACCCCTGAGGGGAGGGG 

ANCATGCAGATGGACAGGTGCAGGAGCCTAAGTGGGAGTGTGTTACAATGTGCCCTTTTA 

ACCCTGCTGTCCATGGATGGCTTGAATGTTAATCAAACTN 

Sequence 483 

GCAGGTACCGCGGGTCCCACAGTGCCCCTTGATTGAGAACCCTCTTCCCAGCCAGAGGCA 

GGAAGAGGACGAGGCACATGAGCCTGGTTGGCATCATGCCCTAATCATGCCCTTTGCAGC 

CTTTTACTCCTTCACAGCCCACATGCTCMGGTCGTCTGTCTTTTCTGTCCCTCCCCACC 

TGTTCCAGTTAGATTTCTACTAAAGTAATAGTTAGCATTTTTGAMCAGTGTTGGCTTGT 

TTAAT CCTCA CCACAAACCCATGAGGTGGGTTAAGAGAGATGAAAGAACTTACAAGGTCA 

CACAGTTTTACGCAACTGCTGCCTTTTCTTCTCTTGGCTTCAGGCTTGGGTTCCTTCTCT 

TTGTGTCTGGCTGTACCT 

Sequence 484 

CCGCGGTGGCGGCCGAGGTACCTAGGAAAGGGGTGACAGGTCGGAGTTTGAAACCCGCTT 
GAAGGGAGGAGTCACAGAGCTTGGAAAATGCTAGTGCTAGGCCAGAAATGACTATCACTG 
CTACGAAGTCAGCAGTTATAGTGCCTGGTATTGTCAGTCTAGTCAGACAGCCATTAAAAA 
TAACTTAGGATTTCCAGGTTTCCTTTTCAAM 

ATmAAAAACTAGAATATGTGTCAGTTCCATTTCTACATCTGMTTTAAGGTAAGCCTC 

CACTATCAATTCAGCTTCTACCACGGTATATATTTACTCTCCCTTTTTCACTCACTCAAC 

CACTCTGTTTACAGGCAGGACAAACTGTGGGGGTAGCCAAAGTCCTTCTGTCCTCATACT 

GGGCACCTTCTATTCTATCCTCCAACTTGTCCCTGTGTCTCCAGGCANGCCTTGTACCTG 

CCCGGGCNGGCCGCTCTAGAACTAAGTGGATCCCCCNG 

Sequence 485 
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TTATGCTGCGAGATAAGTCCTCTCTTGG7TTGAGCTCCCACCTTTTCAGTGAACTCTTAC 

ATNNTGGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGAGTATTCTTTTGGTTTAA 

TTTNTGGTNTGGTTATTTGTGCATGAATATAATCTCATTAGGAAACAAGTTAAGTTGAAT 

AGACCAACTAGTGAATTAATCCGTCACCAAAATATATGTTTTTGGCATTTACCTGNTTAT 

TTTGAAACTCTTTGTAAGAAATGTAAACCTGTAATGATAATCTCTGCTTTGTAAGGATAT 

CTNCNTCTCTGACACCTAAAACACTAGATGCTTTCACAAAGCAAAAGGAAGAGACCTAAA 

t !iI a Z™ tctgtgt ^ ctc ^ 

C I 1 1 I I I I CATCTCAACAAAATAGNGGTGTTAAANTNTCTGTACCTCGGCCCGNTCTTAA 

ACTAGTGGATNCCCCGGGNCTGCANGAAATTNGATTTTCAAGCTTATCGATCCCGTCCGA 
NCCTTNGANGGGGGGGCCC 

Sequence 486 

CGGGCAGGTACAGTGAAGCTTTGACCTGAGGAGCTCTTCTGTAGGTGAAGAGTGGTTAAC 

TATGGTTCCTGTGTGCCTGGGGCCCCCCACTCCTAGGACTCATCCACCCCCTACCCAGAA 
CTTACTAGTACCT 

Sequence 487 

CTCCCCCGCGGTGGCGGCCGAGGTACCTTAGTAGAGCAGTTCCAAAAAACCCTGCCAGTC 

ACTGCTGTGACCTGCTCCACAGGACTCAACCTGAAATTGGACATCGCCTCCCAGGAAAAT 

GGTGCTTTAATGGANCAGCTCCACATACATCTTNCAGTCACTCCTACAGCCTTCCCCACT 

ATGCCAGAGCTAAAGCAATAACTGGCATCCCAGGGAAATGATGCATTGACTGCCAGAGCA 

AGTTATATTCCCAGTGCCTGGGCTGAAGCAGTCCCCTGTCTCCTGGGAAAATAGTACCTG 
CCCG 

Sequence 488 

AGTACCTTTCTTTCCAGCCATGGCAAAAAAAATCCAATTATGTCCGTCTTGAGTCTGTGG 

TCTTGCTTCTTATGTNGNATTTCCTTTGTGAGCTGAAGATTAATGCNTGGATTCACCTCC 

TTCAGCACATTTCATTTCAATTGTGAAGAAAAGATTCCAGGCACTGAATGTAAAATTGAA 

CATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAGTTGCTGNGAGGCTC 

TAGACACCGACCATACAGGGCGNGGGGCTGCTCCTGGACATGAACATNCTNCGAAGTTNT 

TCNCAGTCCACTTTACCCCGCTNTCCCNGCGTACCTGCCCCGGGCCGGCCGCTCTAAGAA 

ACTAGTGGATCCCCCAGNGCTGCAAGGAAATTCTANATCAAGCTTGATCGGTACCGTCTA 
CCTANGAGGGGGGGCCCGGT 

Sequence 489 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGCCCTCCAGGGCTTACAGTCTGGTGAAGGC 

ACAGGCAACAGTAAGATGATGTGGTAATTACTGGGAGGGGGCAGGAAAGAGAGAGAGCAA 

AGCAGATTCCAGGAGAACAGGAGGGAAGAAGGCTTGCGGGAATAGACTGGCATATGCAAA 

GGTCTA GAGG AGGAAAGAACCAGGTTTAAGTCAAAGAACTCAAAAATAGTTCAACACNGG 

TCAGACTTTTGGAGTCTGGTGAAGGGACAAGGATGTACCTCGCCCGCTCTAGAACTAGTG 
GGATCCCCG 

Sequence 490 

CGCCCGGGCAGGTACCCCAAAACCAGGGGCTGCAGCTCCAGCAATTGGGAAGGCCAAGTT 

TTCTTGCACGCTCCCCAGAAAAGGGGCCAAGTCCATGGGGCTGAGCAGTGATAGACTGCA 

GACCTCACCACCACTGAACCTTGTAGGATAAGGCCCACTAGCCTGGGATGCTAGTGAGGC 

CACCCTAGTCCTCCTGAGTTCTCCAGCTGGGAGCAGCTCTACACTTCTCCGGCATGCAAG 

CTCCCAAAGAGAGAGGCAAGTCCACCTTTTtGCTGTCTCGCAACCCTCCTCCTGCTGCTC 

TCAGCTTGGGAGGGTGCACAGCAATTAGGGACTATCACAGAACCCCAGCACAGTGCATCT 

GGTGAACTTAAAAAAATCAACAAGTGAAAAACAAACAATCCCA 
Sequence 491 

GTGGCGGCCGCCCGGGCAGGTACCATTATCATTTTTAGTGCAGTAATAAAGAAATTAAAA 
GTCTTAACATTGAGCATTGGACAAAAACACTCTGCTTGGCCTCCCAAGGTGCTGGGATTA 
CAGGCGTGAGCCACCACACCTGGCCTGGGCTCCTGCACTTTAATTCAAGCCCTGTGACTT 
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qp?TI]^^^^^^^^J ®^^^ A ^^^GAMTATMTTACTCTGTCATCTGGAACAA 

ctc<£T ^ M ^ G ^ agagagttcctcantcca ctagtttgactttggccatggtcgtac 

Sequence 492 

J A S AGGGCGMTTGGAGCTCCCC GCGGTGGCGGCCGAGGTACCCCAACGTTTCGCTTC 
2552 CATCAGACCAGAAACAGCCGAG GCAACAGCATCATAACCAGCTCGCTGAGAAAT 

Sequence 493 

ATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAG 
^? GGC ™ GATCTACTGGGGGAGTCCGGAGGA ^^ 

Sequence 494 
AG r C o^ 

a™tctgctgtctttgatgtgtgacmcaggcaccaactatcgccmctagagaagct 

GGT^GG^I™^ 

ga ^ aactgtctacatgg ™ tcttagtgt ccaggtcagctgatactgcgt 

G £ CAAAAGC ^ 

Sequence 495 

r G r AGAA K CA ™^ GmGAGAGATGGTATCTAGGCM ^ ' 

CAGTCCTNTATTAGTCCTTCAGACAATGTGGCACATfCTGTTCATTGTATGAA 

Sequence 496 

^ g ^^ g ^ gcggccgcccggg c a ggtacttancnnnanntggagactcttttggctt 

GC JZT CCmAA ^ GGmATAGAGAAAA TATAACTAAAAGTAATGTCATATGAAAT 

a ^I g ^ gc ^ a ^ aaaaatacata ctgaaaatatgcactgtcaatttcagatactcga 

I GA I^ ATA ^ A ^ GGCATATAGC CTACTTAATTCTGGAAGGTGGCACATGTCTCCA 
^J GATAGAATATA ^ GGC ™^ 

AAu u n , , , i AAATTTCATTCCATCGGGAAAAGTGGAACTATCTTTACACCGGAGNGT 

ACCA II? AmCAACCGG ^ CnNTG N G NTGNAAACCAAAAAGGGMTCTTTGGATTCCT 

^™ A T^ ACCCCAAANAANMNGGNNTGGN ^ AAA N A NGAAAAA^ 

AA^SS^ TCTTTrrTTAC CANNCCCNNAAAATACCGGNCTTTTGGGAAA-nTGGCCA 
AAAANACGCTTCTTTAGGCCAAAAAATNCCAAAGGACATNCCTTGNGGGGTCCTTTTT7T 
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TTNCCTTTAAACCCTCCAAAACTTCCAACANCCANAACCAAGTNGGCCTTTNTTTCAAAC 

CCCCCAANACCCNGGGGGCCCCCCGGGCCC 

Sequence 497 

CCGGGCAGGTACCATTTTAGGCCAAAGCTATCCCATGACTATTGCCTAGAACGGGAGAGG 

TGAACAGAAGAGGTAGGGGCTGTGAATGCTTTCGCAAAACTGACAAGCAGGGCGAAAGGG 

AAACTCTGGAATA7GAGCTTCCAGACTGCACTTGCATCTTGTTTATCTCACGGTTCTCAG 

GGGCCACCACCTTACCTGGCACACCGAATGTGCCTGAGGAATGCTTGATGCACTAAACCA 

CAAATCAGTGCAATGGAATTGCCATCCCCCAAAAGTCCTTTTTTAAGATAAGAACGTATT 

TTCTGMGTGGATTCTGATAGCTATTAGTTTTGGTGGTCTGGAGTATGCTTTTACTCTTT 

CTCCCTGATACTGGCAATAAGAACACATAGGTCCACAGGTTTG 

Sequence 498 

CGGGGCAGGTACTAGCTACTCTGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGC 

AGAGGTTGCAGTGAGCTGAGATCACACCACTGCACTCCAGCCTGGGCGACAGAGAGAGAC 

TCCCTCTCAAAAAACGAAACAATGTTCTTGGCTGGGCCGCAGCAGCNTNACCACCTGTAA 

TCCCAGCAAGGTACCCTTCGGCNCGCTCTTAGTAACTAGATGGGATCCCCCCGGGGNCNT 

GNCAAGGAAATTTTCTGATTATCCAAAGGCTTTAATNCGAATACCCGGTTCGTACNCTTC 

CGNAGGGGGGGGGG GGCCC CCGGGTTACCCCCAGNCTTTTTTGGTTTCCCCTTTTTAANA 

TGGAAGGGGTATAAAATTTTTGGCNGCCGGCCTTTGGGCCGTTAAAAATTCAATGGGGTT 

CAATTMGNCCTTGGNTTTNTCCCCTTGGGTGGTTGGAAAAAAATTTGGGTATAATTCCC 

CGGCCTTTCCAACCAAAMTTTTTCCCCACCNACCCAAA 

Sequence 499 

CCGGGCAGGTACTCCCAGGATTTATTTCATCTATAGCAAGTTGAAGGAATAGCAAAGCAC 

CGTGGCAACAGCCCCAACTTGCAATTGGCTCTTTTTCCTGTGAGGCATTCCCCTTTCAAA 

TGAGCATCCGCATCCAACAATA GCTTT ATTTCAAGCATCCAAGCCCGGAGTCATGCCATC 

GCCTGTGCCTAGGAGACCTGGCTTTTGGAGGTGAAGCCAATGCAAGCCAGTATTTCTGCC 

TGCTGGCTCAGTGCACCAGGCATTGTTCTCAAAGATGTCCAATCATCCTCCTTCTGGGTA 

ACCCTGCTTGCGTATGTGTGTGTGTTTCCCATGATACAATATGTTTGTAACCAGGCATCT 

CTGAATCAATCATCTCAGCTAGCATATTTTTACTTATTGAA 

Sequence 500 

ACTNCTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTCCGATCCCT 

GGTGGAGGTCCCCGCNCCTCTCTGGAGACCAGTGGTCTGGCCCCAGAGGT 
Sequence 501 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAGGACAATAGGGATACCCTGGAGCTGTCCCA 

GTCCAGCTGAGAAGGGCATTCTCCCTACTTGTGTTTCATCTTAGGGACTCTCACTTTGAC 

CCTGAGTGTGATAGGAACCATGGGGAAGTCTCAGCTGAGGTTGGACGTGAAAGGTCTCAT 

GTTTTAACAGGGTCACTCTGGCCCCCTCCACTGAGAACAGGCTGAAGGTGGCAGGGACTG 

AGTGGAGAGACCTCTTGGGAGACCACTGCAGCAAGTCAGGTGAGAGCTGGAGGTGGCTTG 

GCCCAATGGGTAGCTGTGAGGTCGAGAGCAATGGCTGGAGCCAGGAAACCTCTGAGTACC 

TCGGCCGCTCTAGAACTAGTTGGGATCCCCCGGGCTGCAGGGAATTCCGATATCAAGCTT 
A 

Sequence 502 

CCGGGCAGGTACCCTGAGCCTCCTGTGCCATTCTTGCTAGCTGGACAGAAGACATTGCTA 

CTGAGCCAAGAAGACTTGATCTCTGTTACACCGAGGAAGAAGGGAAGGTCTGATGCAATG 

AAACAGCACGTTTGCTTATAATTCTAGATCAGAACCATAATAGTAATTTGCCTACTAACT 

ATGTAACATTGAACAAACTACTGTTTTGTGAGCCTGAAAAGTTGGTACCT 
Sequence 503 

CACCGCGGTGGCGGCNCGCCCGGGCAGGTACGCGGGGGAGGGGGTAAAGTGGACTGGGGA 

GAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAG 

AGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGT 
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TCAATTTTACATTCAAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAA 

GGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGA 

CCACAGACTCAAGACGGACATAATTGGATTTTTTTGCCATGGCCTGGAAAGAAAGGTACC 

Sequence 504 

GGNGGCGGCCCGAGGTACGCGGGGGAGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAG 

TGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGG 

TCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCA 

AAATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAA 

ATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATA 

AGAAGCAAGACCACAGACTCMGACGGACATMTTGGATTTTTTTTGCCATGGCCTGGAA 

AGAAAGGTACCTGCCCG 

Sequence 505 

CGCGGTGGCGGCCCGCCCGGGCAGGTACTCACAGCTACATCACACTGAATATCATCTCCC 

TGCACTGC CATTTC ATTTGAGCAGGGCTTTGCCTGTCAAGAACATTCATATCTTTTGATA 

AACAGTGCATTTTTCCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAGGAC 

AGGGACTTCAGGACCTCCTGCTGTGGTCACCAAGGAATCACATTCAAACTTCTCAACAAG 

GTGTTATGGTCCTTCTCAGTTTGCCTACGATCCATTGAGGCAAAGGTTTCCTTCTCTCAA 

TACACCCTCTGCTTTAGCCAGGGGGACCTCCCACTGNCCTCTCTATGTGTTCTATTTGTT 

CCTGACTGTTAGTACCTCGGCCGCTCTAGAACTA 

Sequence 506 

CCGGGCAGGTACCAACAACTGCCAGGCTGCCAAGCAGTTTGGAGTATTGGAAAAAAACGT 

TCGAGACTGGCGCAAAGTGAAGCCACAGCTTCAAAACGCCCACGCCATGCGGCGGGCATT 

CCGAGGCCCAAGAATGGGAGGTTTGCTCTGGTGGACCAGCGTGTGGCCGAATATGTCAGA 

TACATGCAGGCCAAAGGGGACCCCATCACCCGGGAGGCGATGCAGCTGAAAGCTCTCGAA 

ATNGNCNAGGAAATGAACATTCCANAGAAAGGGTTCAAGGCAAGCTTGGGTTGGTGTCGA 

AGAATGATGAGAAGGTATGACCTGTNTCTNAGGCATAAAGTGCCCGTGCNCCAGCACCTG 

CCCGGAAGACCTGACTGAAGAAAACTCGTCACTTTACCAAACGCAATGNTNCTGGCTNTT 

NCCANGGGCGCATTGAACTATTNANGTAACTTCANATNGGNGAATGCAAGAATGAAAACG 

CCCCATTTGTTTAAGAANGTGCCATTCACGGGTTACTTNTTGGATTANCCAGGGGCCAAA 

AAGGNCTGTCNTTGGGTTCAAGAACACCNAANCCANGGGGAAAAAACTTGA 
Sequence 507 

AGGTACCCCAAAACCAGGGGCTGCAGCTCCTGCAATTGGGAAGGCCAAGTTTTCTTGCAC 

GCTCCCCAGAAAAGGGGCCAAGTCCATGGGGCTGAGCAGTGATAGACTGCAGACCTCACC 

ACCACTGAACCTTGTAGGATAAGGCCCACTAGCCTGGGATGCTAGTGAGGCCACCCTAGT 

CCTCCTGAGTTCTCC AGCTG GGAGCAGCTCTACACTTCTCCGGCATGCAGCTTCCAAAGA 

GAGAGGCAGTCCACCTTTTTGCTGTCTCGCAACCCTCCCTCCTGCTGCTCTCAGGCTTGG 

GAGGGTGCACAGCAATTAGGGACTATCACAGAACCCCAGCACAAGTGCATCTGGTGAACT 

TAAAAAAATCAACAAGTGAAAAACAAACAATCCCATTTAAACGTACCTGCCCGGGCGGCC 

CGCCCGGGCAGGTACTTTGCGCTTACTTTGTAGCCCTTCATCAAGGGTTTT 
Sequence 508 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCCACCTTACAATGTGGGACAA 

GATTAAT TTAGG GAAAACACGTAGCACAGGGCTTCACCCATGGTAAGTATTTCATAMCG 

GTAACTATTTTTAACAATGACAATTCACAGAGTNAATCCTAATAAAAAATGATTACAGAN 

GANGACGCCTCCACCCACAGGAAGGAGCCANCACNGTGGGGAGCTGTNTAACCAGGGGAG 

TGTTCTCACAGCTGACACACGTGGAGTTGTGGAGGTATCAAGCAGAGTTGGAGGAGGAGG 
GGG 

Sequence 509 

CCGCGGTGGCGGCCGAGGTACTGTATAGTGATAAAAGGTTAATTGTATCAAGGTTACACA 
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ATCCTGAAGCTTTACATGCCTTCAAAATACATAAAACAAAAATTGTCCTTAGTAAAATCA 

gaactagacaaacccacaatcacagtgggagatttt™^ 

I^ C 7 G ^ MCCTCTGTCTCCCAGAC ™ GTGA ^CTCCTGCCTCAGCCTCCCAGG 
I G ^ A 7 TACAGGTGTGCGCCACCACGT GCAGCTAATTTTTGTATm 

GTTTTGCCATGTCGGCCAGGCTGGTTTrGAACTCCTGACCTGAGGTGATCCTCTTGCCTC 
GGCCTCCCAAAGCGCTGGGATTAACAGGTGTGAGCCGCTTTGCCTGGCCAGGAGATTTTA 

ATATACCTCTAAAGTAACTGACAAAACAACCACCATGAGTCAGTNAGGATATTGAGGATT 

TGAACCTCATCATGTACCTGCCCG 

Sequence 510 

CGCCCGGGCAGGTACTTGGCACATCTGGAAAATGACGGAGTCTTTCAAAAATACTGCTGG 

GAAGATGACCTGTTTTCACTCTCTCTCTAGGCAGGTTCACATCACCAGATGATGCTGGAT 

ATGGCTCCACTCCAAGGAAAACAACCTTTTCTTGGfCTTTCAATGGCTGTTTATGAGTAT 

ACAACAAGTGCTTTCTATATACTGCAAACTCGTTAGACAGACAAGTCCCTCTGTCAATCC 

CACTCTACAGTGCATATGCTTCTGTGTGGTACCT 

Sequence 511 

CCCGGGCAGGTACAGCTCAGTGCATTTTNCCCAAAGTGAACATAGCAGTGAACCCAGACT 

ACGGACCATAACATTACCAGAAGGCACGAAGTCCCTGGAGTCCTCTTCTAGTCAGTAGCC 

CTCACGAGTAACCAGGATCCAACGTCATCTTGTTCGGCTTTGACTTTTACAGGAATGGAG 

TCACCCAGTTTGTCTTCTTTTGTGCCTGCTTCAGCTTGCTGCGGGTGCTGTAGCTGGCCT 

GCCCTGGTGTGGCCGGGGACTCCCTTCCGTGAATATACCCCATCGTTTCTCAGTCCCTTG 

AACTGTTTCCAGNTGGGGCGATTGTGGATAGTGCTGCTGTGAACAGGCTTAAGCACGTCC 
CTTTTTCTCGAATTTTAT 

Sequence 512 

CCGGGCAGGTCTTCAGCTCCTGGACTCAAGTGGTCCTCCTGCCTCGGCCTCACAAAATGC 

TGGGATTACAGGCATGAACCACCACACCTGATCCTAAGTTGTATTTTTTTTCTGCCATAA 

TTTAi 1 I lATATTTAGATTTTACTATTTTGACGGAGGTATTTCCTAAAAACCGAAGTCCT 

TGTAGATAGGGGCACAATTAATTCATTAGTTAATTAAGACCATTCATGGTTTGATCCTTG 

CCTTCCTCTCCCACCTTATTTCTAACTGTGGTCTCCTTCCTTCTGTCTCTGCTCCTCCCA 

TTACACTGGCCTTCCTTGGGTCCTTTGTACCTCGGCCGCTCTAAGAAACTAGTGGGATCC 

CCCCGGGGCTGCAGGGAAATTTCCGATATCAAGCTTATCGATACCCGTCGACCCTCGAGG 

GGGGGGGCCCGGGACCCCAANCTTTT 

Sequence 513 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACTCACTGAATGGCC 

ACCTATGCCCAGCACTCATTGCACTGAAATTATAGGTAGCTTCTGATGGATTTGGAGATA 

AGAGGGCCTTTGGAATTATCATCTAAATGTCTTTCAAACTTGTAATATCACTACTACTTG 

GCAACGCAAAGGGCTTGTCCATGTGAAAGCTCACCTTCCAATATGAACTGCTGGAGACTG 

GAAACACC GTTACA GAGTGTCTGCAAGTCCATGCAAGAAACATACAGCGCATTCATATTG 

CAGCATTGATTTTTCAGGTTAATTTTTAACCTCTGAGTTGGAGAGAAAGAAAAAAAACCA 

AAAGGTTCAGTCTATGAAGTTGTTGCTATCTAAACACTCCCAAAACCCATTACTGAAATT 

TTGCGTGGCACATAACCCACGTGGCTTATTTCAAGTGGCTGAATGAACAAAGTTTATGAG 
AGGTACCTGCCCG 

Sequence 514 

CG AG GTAC TTTC AAG CTC ATTG CTAGTAAAGATATC AGGCTAATAAC AACTC AG AAACTC 

AAGTTTATTTTCTACACTATCTTAAAAATAAAGTGAAAGCCAACAACTATTGAGTAGCTA 

TGTAGCAGACAGGACTTTACTGACACTACTTCTAAAGAAATCTACAATCTAGGCTGGGCG 

CGGTGGCTCATGCCTGTAATTCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAG 

GTCAGGAGTTCGAGACCAGCCTGGCCAACATGATAAAACCGCGTCTCTACTAAAAATACA 

AAAATTAGCTGGGCATGGTGGCGGGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCG 
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GGAGAATCGCTTGAACCCGGGAGGCGAAAGTTGCAGTGAGCCCGAGATCGAGCCACTGCA 

CTCCATCCTGGGTGACANGGCAAGACTCCGTNCCCCCGCAAAAAAAAAAA 

Sequence 515 

AGGTACAGAGATAAAACTGAGGCTTAGAGAAGGTCAATACCTGGCCCAAACCATATCTGT 
TAAGTGGCAGAGCCAAGATTTGAGCTGTCTGACTTCAGACTACTAAACTATTCTACCTCT 
AATATAACATATATGAAAACAACTATCAAATGGTAAAGTCCTAGGCAAACATCAGATCTC 
ACTATCATTAAAACTTTTCTCTGAATCTTTAC 

GTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAAAGAAATCGAGAC 

CATCCTGGCCAACATTGATGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCCGGG 

TGTGGTGGTGGGCACCTGTTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAANAATTGCT 

TGAACCTAGGAGGCTGAGATTGCAGTGAGCCGAAATCACGCCACCACACTCCAGCCTGGG 

GCAACAAGAGGGAAGACTCCCATCTCAAAAAAAAAAAA 

Sequence 516 

CAGGTACAGGCACTTTCTTCAAAGCTCCTTCTTTTGGTGACATCTGGAGCTTCAGAGGCA 

GTGGGAGATGGAATGGGGGTCAGATCCATGGAGATCATTATTGGCAGCATTATTGGCAGC 

AATTCC TGAGA AACTGAAACAATTGAGAGAGAGATGGCTTACTTAGCTCACTTAGCTTCT 

AAGCCATTTTTCATTTCCTGAMGTAATGGMGCCAGCAGCCTATTCTTAGGATTTAATA 

GGAGATTTGAGTAAAATTTCATTCAGGAAGGTTCTCATTTTTCATTTTTA^ 

TTCCCTGTTCATATAGAGCAGAGCTATTCTTAACAATATTCCATATTTAAAAAGAATCAA 

GTTGTATCCTTGAGATACANATCTCTATCGCTTGGTAATCATCAGCTCTAGCATCTTATT 

TCACTTAAAAGCAGTGCTCACAGAGGCNGTTAGAAGATGAGGATAACTGAAAAAACATTT 

ACGATATNAAGCCCCNACTTTAAGACTGGAAAGTGATGCTATCACTTATGTAAAAGAAAT 

AGCAACCGGCTTAAAAATGTTCAAAGTATTTTAAAATTAGCCTAACAGTTGGAATCATAT 

GAAATNTGCNACCTTTNTACATTTGCCAGAMTCACTC 

TTNGGGTTCCNCC CGGGT TCCTNGGNCCNTTTTAAAACTAANGGATCCCCCGGCCGCNGG 

AATCCAAATTAAGCTTTTTNATCCCGTNGCCCTCAGGGGGGGGCC 

Sequence 517 

CATTGAGACTGCCATGGTTTACTTGAAAGGCCACGTATTTGTTACTTNTNNTAGAGACCA 

TTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTC 

TACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTG 

CGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCT 

TCCCTGGGATGTCCCTGGACAAGAGACAAAGGAGAAGGCCTTAAGGATCTACTGGGGGAG 

TCCCGGAAGGAGCAGTTCTCTTCTGTCCCGCTGGGAAACCCATGGTCCACTGGAAGTTCC 

TTTATGCTACTTTCACTGGAGCCATTCCTATGAAAATACCACCCAGNTGAGAAAATTTCC 
TTTGNAAAATTT 

Sequence 518 

NNATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACTTAGGGCACAGAGAGGTT 

AGCACACAATGGGTCTAAATATACCACTGTTTGGGGATAAATAGCTTCCAGGGATCTGGA 

TGCAGAATAAGCTAAGCCAGAAAGATTCAATGCAGTCACCTCAGAAAGTCAGCCACATGC 

ATCTTGGAGAGGCCTTGTGGTTTCAGGGCCATTCTAAGAACGCAGCTATACTATAGTCAG 

TCCAAGCGGTCATCACATTAAGGAATCTCAATTTGTGAAAACTAGTA7TTCAATAAAAAA 

GGAGAACACTNGTTTTGTTAAAGGAGTGATAACTTCACGTGGATATTTGTCATAGGTTTT 

ATTTAAATGAAAG CTCC CTGACTACATTAAAAATATGATTTATTTAAmAAATGTA^ 

TGCACATATCCACTTTTAAAGAACAAGTTCATTGCTTMGCCAATTAAACTCAGATTGNA 

GAAATATTTCTTATNTGCCMGNCNTCTGGATTAAAATAGATTCCCNGNCCATTAAAGAC 

TAT TTATC TTTTCTTTCAAAAANGAAATTCCACCCTTATATTAACTCATTA^ 

CANTTTTCCATGGAAGGAACCAGAA 

Sequence 519 

ATATAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGAGGTACGCGGGTTTTGTGGAACA 
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GTAAGACTCACTTACAAGNMGTAACATGCTGTGTTTCTGGATGATATTCTTATCTGAGT 

ACAAAGCACGATGAGATGAGCAAAGTTAACATGGCACAGACTTCCTGGTTTCTCCCATAC 

CTGTGGCTCTTGTTGCCTGCAACCTGGGGTCTCCCATTCATGTATATTTCTCTCCTTGAA 

AGCCTCCTGACATCAANTCCCTGTTNTGANGCAATTTTCCCAAATTTAAAGAATTATGAA 

GAAATAAAAAAANGGTNCCNTGNCCCCTGGGCCGGGNCNGGTTTCTTAAAAAACTANNTT 

GGGATTCCCCCCCCNGGNCCTGGNAAGGGNAAATTTNNAATATTTCAAAGNCTTTNTTCG 

GATTTCCCGGTATCNNAACCCTTCAATAGGGGGGGGGG 

Sequence 520 

AGGTACACACAGTTAACCCAAAACAGGCCTCTCTGAAAAAGCCATTGCCATGGACTGCCA 

GACAGACAATGACAAGACACAAATACC7TCTGGTGTGTGAGCCACGGGACATGTGAGCTT 

C C CCGCTGATGCTCCTCTTATATCAAAG ATCACTTTC ACAAGATGAGCGACTC AATATCT 

TTTATCAAACCAATGATCACCTGCAAGCTATGGTATATTTTTGCAGCTGTGTAGAGCTAT 

GTGGCATGAGAATGTGGGACTTATAAAATTGCTGATCCAATAAATAAGACATTATGGGCA 

ACAGTGTCTTATCAGCTAGTGGTGTACCTGCCCG 

Sequence 521 

AGGTACTCAAGCCCCTGCCTGTCATGGCCCCCCAAAATGCTGGGATTGCAGGCCACCATA 
ACCGGTCATCGGGGCAACTATCCTTCAGGGCTTCCTCCAGTAGGGCTCCCCAGGTTGGCC 
TTGAGGGCCACAATAAAAAATCTGACTTAGCTATACTCCTTCCCTGTCCCCAACCACAGT 

TA1 3SIT GTGGATGAGGC ^ TGGCAGCAGGACAG 

TTATTTTATGCATGCTTAAGTGTGGCATCCCTGGGGACATGAAATTCCAGTGAAATCCAG 

TCTTTGTGCTGGAGAAGCTCATAGATCTGTGGAGGAGGCAGATGCCCACCCACTTCACTG 
TACCTGCCCG 

Sequence 522 

CCGCGGTGGCGGCCCGAGGTACAAGCAGTAATTGATTCACTGGCCTTGGACTACTTGCAG 

GTCAGCTTGTCTCACATAACAGGTTGGTATATGTATAACTATCACATAATTATGCATTTT 

AGTAAAAATAATTGTTTAGAACTGGCTTNGGGCAGTTGTGACCTCTAACTGTAATTTCCT 

TGCTTCTTCTGTATGTTCCACCTCTTGTGCTGTGCGCCTAAGCCAAAATCANGGTGCTCT 

TGATAAAAATTCTTCTCAATTTAGGCAGCTCATCAAGATTCCACTTCTTTTTAACTTAAT 

TNTCCCAANGGTTTTCCCAAACTTCTTTTCAGATAAGGGCCCTGCCCTA 
Sequence 523 

CGAGGTACAAGCAGTAATTGATTCACTGGCCTTGGACTACTTGCAGGTCAGCTTGTCTCA 

CATAACAGGTTGGTATATGTATMCTATCACATAATTATGCATTTTAGTAAAAATAATTG 
TT 

Sequence 524 

ACATTATGAAT GAAC A CAGG ATTATGAMTTGTAMGGTTACTTTCTGCTCTCTAATTCC 

TTTACTCGTAATTTTAGTTT TCTTA ACGGTATAACTTGATGCAAATATATACAC 

TAC TAACT TTCACTGAAGTGTTTTCGGGAGGGAGGGGCACTTTACAAGATGTGTTGCCTT 

TAGTTTTTCCGGTAAGGAGACAGGAAGACACAGACAGAGGCTCTTAGGGCACATGGAAAG 

CGCCTGCCCCTGTGCCAAGAACTTAAAGAGAGAGCCGGGGATGGACCCTCCTTGCTGTGG 

CTCCTGAACAGTGCAGCCTCTCTTCTGATGCACTCACCCTGGCGAGGAGAACCGCTTGTG 

TGGCGACTGCTTGGCCCAAGAGCGAGTAGGATTGTTGACTCAACTCTCTTCGTGTCTCCT 

CAGAGGGGAGGAACACCTGGCTTCACCTGTTTACATAACTCTTGGTGATTCTTGTCCCTA 

AGCTCTTCCAGMTGCTCTmGGGCTTTTCTCAAMGGTTTTTAGAAAGAACAAG^ 
TTG 

Sequence 525 

ACCGCGGTGGCGGCCGAGGTACTAGTAAGTTCTGGGTAGGGGGTGGATGAGTCCTATGAG 
TGGGGGGCCCCAGGCACACAGGAACCATAGTTAACCACTCTTCACCTACAGAAGAGCTCC 
TCAGGTCAAAGCTTCACTGTACCTGCCCG 
Sequence 526 
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CGAGGTACAATCCTGCCCTCAAGAAGGTTAAGGTCCAGTTGGAGGAGACTTAGGAGTTCA 

ACTGACCCAGTTCTGCAACCTTAGGTGGCTGTCCTCATCCTGACAGGAGACAGGGGACAC 
ACACAAAACAGTGATTTAf?nrr;TTAArAAAr;Tr!^^T-r-rA^A * /v^^-r^-r-. ~. 



ACACAAAACAGTGATTTAGGCGTTAACAAAGTGGCTTTACAAAGGTGTAGACAGAATATA 

GGAAATCCAACAAGGCCTGGTGGAGCACCATGAAGCTGGTGACAGCAGGGAACAGTGGCC 

TCCTTAGACTTAAGGGGCAAGAGCAGGAAGAGGTTCTGTGGCCCAAAAGGGGTGAGGCAC 

ACAGAGAGGGTCTGGACAGGAGCTCTGGTCACCAGCCTCCAAGGACAAGGAGGGAGGCCC 

GCCTGGGGAATATGTATCTTGACCTCATCTTCTTCCTCCTTCCAGTCTCCTATTAGAATC 
TCCCACCGGTCAACr.nnAArr Ar;AMAr:(~r-A/^A/^i^^o a/^-t^^-^^,^ t ^.„ . _ . 



-w^.vj,^,^, i ono^ i^n i o i ioi I UCAGTCTCCTATTAGAATC 

TCCCACCGGTCAACCCCAACCAGANAGCCAGAGGGCAGTGGGGGTCCATTAATGTGATCC 
ACA7TAGGGGGCACAGGGCANGGATG 
Sequence 527 

CGCCCGGGCAGGTACAGGGTTTACTCAGTCCTGTTGTGTAATAGGAAGCAGGAAAGGGCA 

AGGGCCCTGGGAACAACCTCAGCCCTGCCACTTACTGGTCATGTAACCCCGAGCAAAATG 

ACCACCTCTGAGCCCCAGTCTCCTCTTCTGTAAATGGGTATACCACCACCTACTCATAGG 

CTGCGTGATGAGAAATGGAATAATGTCAGGGAGTGCCTGGCATGACGCTGGCACCCTGGA 

GGCTCCCCAGGMCGGTGGCTACTGCCTTCAGAAAAACTGAATCTACTTCAACTTATTTT 

ACGTAGGTCAAAGCATCCAGTCAGTGAGTAAATCAAGCTGCAAGATTCATTTCTACTTAA 

CTTTTAGGAGGAGTCATCCCATGATACCTCGAATAGGTCTCAAAGCACATGCATGATATA 

AATGATTCACTGATATGACTCTGGTCAGATGTATGTTCTTATGAAA 

Sequence 528 

CGAGGTACACCTGTAATGGCTACCCCGCTGGCTCAGAGCTCTGCCTCACCGCCCTGCCCC 

GGCACCTGCAAGCCAGTGTGCCACTTTTTTGATGTTGTATGTTTTTAATATTCCCTGAGA 

CAGACTCCAGAGTTCTGAGTTTTCTTACTTGGGGTGTCAAATTATATTCCAGGAACCAAT 

TTGGGTTTTCCTGGCTTC TGGTTA AAAAGCTGGACAGATTTTGAGTTGTTTGCTCTTAGT 

TCTCTCATAAACTGCACTATTTTTAGGGCACTATTTTGTAGAACACAAGATTTTTCAGGA 

AGTATATTGAAAGGCCTGTACCTGCCCGGGCGGCCGCCCGGGCAGGTACTTNNNNTTTNT 

NTNTNNTNNNNTTNNNAMGGGNGGNGCTGGTTATTTTTAAAATCACACATTGAATACAC 

ACAACAATCAGATTTCTTACCAAACCCCCMTTTTTTAATAACTGGCTCTATTTANCACC 

AAAAACTCCAGTCTGGNGGGAAGTGCCAGACACAAACTTNACTTTTGGNTNTTGGGCNAN 

GCAATCCATCAAGGGNAATTGGTTANGGTCAAGGACTTGCCCTTTTTTTCCTT 
Sequence 529 

CGCCCGGGCAGGTACATGCGGTTTCTTTGGGCAGGAACACCCTCCCTTCTACCATGGTCT 
GTAGGAATTGAACCTTCCGTGCCGTTTTCAAATGCTGTCTGTGCTCGGAAAATACATAAT 

ATCAAGTCCCTTACTGAACCCCCTACAGATTGTGTTCATAAGGTGCTTCACTGCTGTTGG 

TCTACCTGGCCTACTGTTTGTACCT 

Sequence 530 

CGCCCGGGCAGGTACTGTGGCATGTCACACTACAAGGAGCAGAAAGAACAGCTCCTCTAA 

TATAGAATTGCCAGGGTACACTGGCAGGTCATCATAAGCTGCCCTGGCTCTATAGAGCTC 

CAGCATTCAGACTAGGAACCAATGCACGCGTGCAGCCATGCTTGTCAGTAATAAACTTCC 

CTGGACTTATACTAAAACCGTATTTTATTGATATGTCCAGTATGTTTTAAGAAAACAGTC 

TCAAACTCGGGTCTATCTAATATTACCAGCCACTTAACACTATGTGATAACTTTTTATGC 

TTTCAGCCCTGGATATrCTTTGAAGMTATTTTCTATAAGGAGTTCTAAAATCCTTCGAT 

CCTCATATTGATACAAAAAAATTAAACACATGAACAAAATCTCTGTGCTTCACACAAAGG 

TGAGATCCAACCTGTGGCCCCATACAGTCACCAGTGTCATCAAATAGGCCACTACAGTGT 

ANGGGGATGCAGGGGCAGCTTCAAGCCATGAATGAGACAGCATTACACTCTGTATACACA 

GTGCCCTAGAATATCTGTGTGAAACAGGTGGANGAAAAAAAGATGACTAATTTGTGTGTG 

TGTGTGTGTGTGTGTGTGANAAAANAGAGAGAGAGAGAAAGACCGAGAGGAGGTGAAAGG 

TTNCATTTGGTGGTGGGCATTGNCTTTTNACTTNTTTTTCATAGGCNTNAANAAAATTC 
Sequence 531 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACATGAGTCCCTTCAGGAAGTCATCAGC 



:TTTGT 



Page 87 of 869 



WO 01/51628 



Table 2 



PCT/US01/00798 



TCAGTTTCCTCAGATTAGATAGTAAAACTGAGATTATGAACTATAAAGATGTGTGTAATT 

TATCTGTCAGTGAACTTGACTTTAATAAMGCTTTTTGAAAAAGAACTCTGGGTGGGGTG 

CATTGGCTCACACACATAGTCCCAACTACTGTGGAGGTCAGGGCAGGAGGATCACTGGAG 

CCCAAGAGTCCAAGATCAGCCTGGGCAGGATAGCGAGACCCTGTCTATAGAAAATATTAA 

AAATCAGCTAGGCATGGTGGCTTGCCCTTGCATTCCCTGCCACTTGGGAGGCTGAGGTGG 

GAGGTTCGCTTGAGCCCAGGAGCTCAAGGCTGCAATGGGCTGTGATCGAACCACTGAATT 

CCAACCTGGGTGACAGAGTGAGGCCCTGTCTCAAAAAGAGAACTCTCGATGTCACTGGCT 

TTCCATGTAAGCAGAGCACATCATGTGAGCCCCATTCGT 

Sequence 532 

CGAGGTACAGAACTTAAACACCACTATTTGTTGAGATGAAAAAAAGCATATATAGGAAGC 

CTTCAAAATGAAATGGTCAAGGGTGAGTTTACACAGATAGATAGATTTAGGTCTCTTCCT 

TTTGCTTTGTGAAAGCATCTAGTGTTTAGGTGTCAGAGAGGGAGATATCCTTACAAAGCA 

GAGATTATCATTACAGGTTTAGATTTCTTACAAAGAGTTTCAAAATAAACAGGTAAATGC 

CAAAAACATATATTTTGGAGACGGATTAATTCACTAGTTGGTCTATTCAACTTAACTTGT 

TTCCTAATGAGATTAAATTCATGCACAAATAACCAAACCAAAAATTAAACCAAAAGAATA 

CTCACCAGAAAGGATGTNCTTTACAAAGAGCAGATCCCCCAAAAATGTAAGAGTTCACTG 

AAAAGGTGGGAGCTCAAACCAAGAGAGGACTTATCTCGCAAGCATAAAAGCAACTTGTCC 
TCGGC 

Sequence 533 

CGAGGTACGCGGGATGGCCCAATATTAAATTGGGCCAATGAATGTCTTCTTTTTCAGTGC 

TAAGGGAAAAATAATTTTGGGAGGAAGAGGTAAATACTTAAATACTTAAAACATTTTTCA 

TTAAAATGAACTTTTATTGCATTTTTTTCTTTTTTTAGATTCm 

ACAAGCTCTCACCATGTTGCCCAGGCTGGTCTCGAACTCCTAAGTTCAAGTGATCCACCC 

ACCTTGGCCTCCCAAAGTGCTGGGATTACAGTCCTAAGCTACTGCTCCTGGCCGAGACAT 

ACATTTTTATAATGTTAAGAGTTATTTAAAAAAAAAATTTAAGGCACGTAATGAAGGGTG 

GTTGGTATTCATTACATAAGTGTTTATTCTCTACAAGCGTAGAGAAATGAACACCATCAT 
GAAATGAAATAAAAGTA 

Sequence 534 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAAC 

TTTTGATACAGTCAATGTTTCTCTCCAGTTTCCGCAAATTGTTAACTTTTATCATGAAAC 

ATGAGA7TTGAGGGTGTTTATCCTTTCTTGCATAGAGCACATGAACAGGTGTCAATGCTG 

ATCATAGCTAGAAGTAGAGTGGGTCCCATGTGTAACTCCGGATGGCTAAAGAAAGTGTTC 

TACTTACAAGAGGATGCTTCATTTGACTAGGATTGGCTCTCAACAAGGCCCGTGCACCTT 

AGAAGAAAAAAGTCACTTCTGTCACATAAGAGCTCAACTGATTCACTCAACTTCAAAAGG 

GCACTG AGAATG CCATGCTTTGAAAAACATACAAAACAGTAAGAGGAGCAGAGAAAAGCT 

TGATGTGTTTTTCTCAAATGACACAAAACAATTTGTCTCTACAGGTTTAAGGAAAATCAA 

TTAAAACATAATCTGTAGATCAGGAGTGGTAGCTACTTAAATACCAAAAAAGGAAAATAA 
CCCCAAATNGAAAAGAC 

Sequence 535 

CCGCGGTGGCGGCCCGAGGTACGTGTTTCAAGTCACTGGGCTGACCTGTAACACAGCACA 

ACCTTCATAGCCACTGTATGAAGAAGTAGAAGACCCANACTCTTGCTTTATGTTGGTATC 

AAAAGTCATTTCAGAGTCAGGCTGATCACTCCCAAGTAACCCACTGACTTCTTTACTCCA 

GCTCTCTGTCTGCTGTTGACTCANAATGTTACACTTCATTTTCTCCATTGCTGATATAAT 

CATATCTGCAACATAAAAGTGGGCATTTTCCTTTTCTACATCAACAGGCAGCACAAATAC 

CTCTGGTGAGAAGGAATTCNAAGAAATGGTTCTTCTACTGACTTGAACAGCACCTTCATC 

AGCAGCAGATGTCAGATGGGAAGGCTCAAAAAAAGCACTGTCAGTCTCAGGATATGGGGA 

GGTGGCCAAAATNCCANGGCTTGTGGGCAGAGACACCCGATGGGAGTATTCTGGGCGGAC 
CATGTGTCCTGCCCG 

Sequence 536 
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CCGCGGTGGCGGCCCGCCCGGGCAGGTACTGCCTGAAAGGGCTGTAGGGACTTACCTGTG 

TCACCCAGTGTCAGCACATAGGGGCTGAGTGATGCTGGAAACATAGAATAGAGAGCTTCC 

TCTGGACTGCTCTCTGGCTGCCTGGGACCCAGGCAGACCTCCTGTTTGATCTCTGCTGGG 

GTTGCTGGGAGCAGGAAACACCTACCTCGGGGGCTTTCTCTTGCTAAGGGGAAGGAAGTA 

GAGGTTCTGAAACCTACGAAAGTCTCCTGCCCCTGGGTAGAGGATTACAAGACACCAGGG 

CTATAGAAGCCACTGGTGCTGTTACCTGGTAGGGGACAGGCCTCCCTGCAGTTGCTGGTG 

ATGTGGGCGGACCAGGCCTATAGCTTCTGCTGAGTATGGAGTGGACAACAGGAAGGCCCT 

TATGGGCTGCATTGACACTGTACCT 

Sequence 537 

AGGTACI I N I II I I I I I { i I I I IGATACAGCGAATCACTTGAACCCAGGAGGTGGAGGT 

TGCAGTGAGCCAAGATCGCACCATTGCACTCCAGCCTGGACCACAGAGCAAGACTCCATT 

TCAAAAAAAAAAAAMTAGGAMTTAGATCATGGCCTTCTAGAAATAGGCAATATGCCAT 

TTCCTCTGTTTATAGCTCTCTACTGGCTTCTTCCTTACACTTAGAACAAAATCATATTTA 

TGACCACAAGGCCATACAGGATCTGTCTCAGTGGTTCTCAGTGGGATGTACCTGCCCG 
Sequence 538 

CCGCGGTGGCGGCCCGAGGTACGCGGGGATGGGTGTGAGTGTGCTCTGCCATGAGAGATT 

GTCCTGTTCAGGGCTGGTTCCCACCTCACTCCGTGAGCTGCTGGGATGGGCTCTGGCCAC 

CCATGGNCATGAACTGGAAATAAGCAGGTATGGACAAGGTTCTGACCCTTATTATTGGAA 

CTCAAATCCAGATGGAGTT TGTTG GGACTTCACATCATTGGAAGGTGAATAGAAAAGTAT 

GTCTTAAGAATTTCACCATGTTTTGGAAGAATTGCAGCMGAAAAAAAGMCTCCTTTGT 

TTCCACCCCATTTCTTTTAAGGAAAACCTGGAATATCTGAGGGTGCAAGCTGAGAAAAGT 

CTGCTTATCTGTTCCCTACTCTGGCAATCAAAAAGGATCGAAGTCTACTGTTCTCTTCTC 

CTGGCACTCTTAACTTGCCACATTGCAAACCTTTTATGGCAAAAGTAATGGTCAAGCTAT 

CCTAA CAAGGA CCAGC CCTT ATTAAATAATTCACTGTGGTTGGGAAATGGTGAGGAAGCC 

ATCTTC M I I i GGCTTCTTTTCTCTGAAAAATAAAAATTTTGGCCTAAATTT 

Sequence 539 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATGCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAATTC 

CTTGAAATTTGCAAGTTGGCTGGGTTCATGTCTAAACTTGTTCCAGCTATCCAGAATGCC 
CACAAGA 

Sequence 540 

GGAGCTCCCCGCGGTGGCGGCNCGCCCGGGCAGGTACGCGGGGGGATAGGCAATTAATTT 
CATGGAGAGTAGGAGGTGTGGGTTCAGCTGCTGC 1 I I I 1 I I I I I 1 I CAGTGAGGCATTAT 
CTGAATGATGCTCATTCTCTCTTGAGTCTGGACTTCGTGCAACAAGCAGCAACTGGATAT 
TATTTTAAAATAAGCATAAGCCACCCAAAGATACTGATCTGGGTCCTTCTTTTGGAAGAA 
GAGTTGATTGAGMGTGCCTCTTGGTTAAGGATTAACCACAGGGAAAAATCCAGCAGAAA 

CAGAAGAACTGTGGGTTTCTTACCCCAGCCCTCAAGGAAGCTATGCCGTGAAAGGGGTAC 

CTCGGCCGCTCTAGAACTA 

Sequence 541 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGGCCTATTCCTGGCTTTGAAAAAATGGTTT 

TCG GACA CTAAATCTTTTATTATAATTCAAACACATTATAGTTGCTTCTGGGAGACTTTT 

GCCTTTTGCCCTGCTTACAAACTAGCATCATATTCTAAAATCTCAGGAACTAAAATTACT 

CATTGTAATTTCTCTAAAATTCAAAGCCTATTGGAATCCCTGCTCTTGTTATGTAAAATA 

TAAATATTCGAGACATTTCACACCACATTTTCATTATTCAGCATTCCCATCAAGTGCTGG 
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Mggccagttcgctattttataatgaacgtcctgggtttgttctgaaagggcagccttgt 

ACCT 

Sequence 542 

NGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAATGCAAGAATCA 

AGTGTGTAATCTCTGTGGATTTAACCCTACACCACATTNGANTGAGATTCAAGAATGGCT 

TTGTTTAAATTGCCAAACCCAGAGAGCAATATCAGGACAGCTTGGAGACATACGCAAAAT 

GCCACCTGCACCATCAGGACCCAAAGCATCTCCTATGCCTGTTCCTACAGAATCATCATC 

TCAGAAAACAGCAGTGCCTCCCCAAGTAAAATTAGTGAAAAAGCAAGAACAAGAAGTAAA 

AACGGAAGCTGAAAAAGTCATTCTGGAAAAAGTAAAGGAAACACTATCAATGGAAAAAAT 

TCCTCCTATGGTAACCACAGATCAAAAACAAGAAGAGAGTAAACTAGAGAAAGACAAAGC 

TTCAGCTCTTCAAGAAAAAAAGCCACTCCCTGAAGAAAAAAAACTAATCCCTGA 
Sequence 543 

CCGCGGTGGCGGCCGAGGTACCAGAGTCCATGTTCTTAAACCATTAATCTACGATGCCTC 

TCAAGTAAATAGTCAATAAGTGGTAGGTGTGAGGCTGTAGGAAAAAGAGGAGACGAAGGG 

ACAGGAAAGCAGTCCTCATTGGAAAACTAGGTTTCGACTGGCACTGTTGGACTGGGCTGC 

AGT7TAGGTCCTCCCACTGCTGAGTTCCCTCACTTCATCTCCTGGTGCTTCTGGTTTCCC 

ATTTATAAAATGCAAGGGAAGGATTCTATTTATGTCACAAGGTGCATCTAAGGAGTAAAA 

AGTCACGTCATGTTTGACCAATGTTAGTCAGATCAAGATTTATTTTAGATAAATTATCTG 

TTAAGTTTTAATTACCATGACTCAGATTTTATGTTCACTTTTTGGGCAAAGCTAAATAAA 

ATATATAAATATATAAAATCTGGGCTCCAAC 

Sequence 544 

CCGCGGTGGCGGCCGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAATCCAATTATGT 

CCGTCTTGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAA 

TGCATGGATTCACCTCCTTCAGCACATTTCATTTCAATTGTGAAGAAAAGATTCCAGGCA 

CTGAATGTAAAATTGAACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCT 

TAATTGCTGTGAGGCTCTANACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGA 

ACATCCTCCGAAGTTCTCCCCAGTCCACTTTACCCGCTCTGGGCTCACCTCTGAATGTGT 

CTCCCCCCNCGTACCTGCCCG 

Sequence 545 

CCGCGGTGGCGGCCGAGGTACTCAGAGGTTTCCTGGCTCCAGCCATTGCTCTCGACCTCA 

CAGCTACCCATTGGGCCAAGCCACCTCCAGCTCTCACCTGACTTGCTGCAGTGGTCTCCC 

AAGAGGTCTCTCCACTCAGTCCCTGCCACCTTCAGCCTGTTCTCAGTGGAGGGGGCCAGA 

GTGACCCTGTTAAAACATGAGACCTTTCACGTCCAACCTCAGCTGAGACTTCCCCATGGT 

TCCTATCACACTCAGGGTCAAAGTGAGAGTCCCTAAGATGAAACACAAGTAGGGAGAATG 

CCCTTCTCAGCTGGACTGGGACAGCTCCAGGGTATCCCTATTGTCCTTGTACCTGCCCG 
Sequence 546 

CCGGGCAGGTACTGTTCTAGCTAGGGCTTTCAGGTACGATAAGTTCCATTGGAAGGATAA 

TTAGAGGGAAAAAATGGTTAGAATTTATAGAAAAGTCTTTATTTGAACAGCTGTCATAGA 

AACAAGGAGTA GACAG TCAAGTTNNTACTAAAGGNAGAACTTAAATCAATAANGAAATTA 

TAAGMGGAAGATTTTACCTAMTrrrTAMCMTTTTTTCCTTTTGGCCAGGCACCATG 

GCTCATGCCTATAATCCCAGCAATTTAGGGGGCTGAAACAGGTGGATTACTTNAGTCCAG 
GAGTTTGAGACCAGC 

Sequence 547 

AGGTACT GTGTC ATGCTTTTTACATTGAGCTATGTGCCATTGGGGGTGGGGGGTAAAAGC 

TATTCTTATTTTTAAACTCTTCMCATTTTTTAMGTCTTCAACATGATCTTTATCATGT 

TGTGATGTGACCGCCTAGATTTCCATTTTGAGGCTCTCATATAATNTACTTGACCATGTC 

CCTACTGGTTGACAGGAGACCATCATGGTTCCTATGATTTCTCAAAGTGTGACCCCCAAC 

AAACACCTGTCCTAGAAACACCTGTCCCANAAACACCCAGTACCTGCCCG 
Sequence 548 
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NATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGATGTAGTGAATC 

TCTGCCTGTGTGGACAAAGAGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACT 

GGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTG 

TCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGT 

CATG7TCAATTTTACA7TCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGC 

TGAAGGAGGTGAATCCATGCATTAATCTTCAGCTNACAAAGGAAATACTACATAAGAAGC 

AAGACCACAGACTCAAGACGGACATAATTGGA'I Itllll I GCCATGGCCTGGAAAGAAAG 

GTACCT 

Sequence 549 

NCGCGGTGGCGGCCGAGGTACTGCTGAGAGCACAAAATCATCAATCAAACCGCGGAGAAT 

GCAGGAGGCTGCTCCTACAGATCCTGACTTACCACCAGGTTATGTGCAGAGTCTGATTAG 

ACGAGTTGTAAATAATGTAAACACTTNGNGGATGAAAATAATGCTCATACCTATAAAATA 

CTGTTGGAAAGNATGGATACTCAGTNCCTTTTCCGGTCANATAATTCANCTTTCNTGCCA 

G GAATGGTTACTAC CACGNTAAGGTTG GAAATTTAATGGGG N ATTCCGNGGNCATTGTNC 

ANTGGGNN7TNTNTNTCNTGGCCAAAACCTGNAATTATTNGGGNNTGGCCTTGGAGGAAA 

AAAGGGGTTNATTCCAAAATTNTTTGTTTCNTTGGMGTTGNNNANCNCTTGGGCCCNCC 

GNGNNNGNGGNCNCCNGNTCCCCGGGNGNNCCAAGGGGTTAACCCACCAA 

Sequence 550 

GCGGCCGCCCGGGCAGGTACCGAGCTTCAGTAGCAGTCTAACAGGTTCNNGCTGNCTCTT 

CCAAAAAGCGGGGACTTATCCTCATGGCTTCAAGGTCACCTGCAGTCCTTAGCTGTCCCG 

ATAGACTTNTCTCTCCAAAAGTCCCTATTTTTAGCTGTTTTCACAGGGGCACCCAGAGGC 

AAATATTATACAGAAACTGCCACATGTAGACTGTGATAACCCACACCTGCTGGACGGGAA 

GTTAACCTTTGTATAMGAAGAGGGTTGGCTGAGTCAATTTATGACAAGMCMTTTGGA 

AAAGGTAATGCTGGMCTACTCAAGTCCTTTCTNMGCATCCTCCAGCATTTCAGAAGCA 

CCTACCAATTAATATGTNAATTACAAACCATTCTCACCTGTTCAGGAAAGGGCCC 

Sequence 551 

NNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTGTGGCCTAAAACAGAAGAATGTT 

TAACTGCATGAAGGCAGGGTGCTTTGTATTGCTGGGCTTGGTGTATATTTCTTTGCTATC 

TANTTTAATATATTGAGCTTTACATCTGTGCCAGCCTTGCATGTCCATATACCTTTGGCA 

GGCATTTCTAGTCAGGTGGCATGGGGCAAGGGGTGTGCTACGTTTTAAGTCCCTCATTTC 

TCCAGCCTGTCCAGGTAGTGTCTACGTCTCCAACTCACTCANGAAGGCAGGAGACTTCCA 

GATTCACTCCACTGGTATCAAGAGTTAGGTTCTGGTGAGAGAGCTGGCAGAAGCTTCAAA 

GGACCTTGCGTCTTAGCCCCTC I (Nil I CCTGTCCTTAACAGCAAGTTGTTGCCTCTAA 

TTTTCAGAAAATCGCNGCACATTTNCAGGAAGACCTGAAATGCCNGTGGGACTGCTTC 

Sequence 552 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGGAGAATCACCTGAGCTTGGGAACGTTGA 

GGTGCAGTGAGCTGAGATCGCACTGCTGCATTCCAGCCAGGGTGATGGGAGTGAGACCGT 

GTCTTAAACAAATAATAATAATTTGGGGGTGATGTANACAGAAATAAGGACAGAGGTATG 

AAAAGGGAATAATTATTTAGCATTTTACTANGCCAGAGATTTGGGCAGTCCCAAAAAGCA 

GCTTTTCTAATCTCCCTCTCCTTTCCCAAAAAACTTTAATCAAGATGCTTTAGTATCCTA 

TC ATCTTTGGGTTATTTAC ATACTAC I I ITTGGAGGGGGGATTGTGGAAAGCTATCCGAG 

ATCTTTGAAAGCATTGATGATTAGCAAGATGATTATTCCTGTAAGAAGCCCN 

Sequence 553 

AGGTACTGACTTGCCCAGAGCCTCAGGTCCAGATGAAGGGGAAGTGTATTTGTCCTGTTC 
ATTGCATCTCTGAGACCTAATGCTACAGTCCTGCCTTGACCTGTGAAGAGGAAGTCAAAG 
GTGTAAACAGTCTCCCACTGGTTCCCTTGCCATCCCATCCCTTGTTGGCCAGCCCTGACT 
GTGGCCAGTTCTGCATCCCTACCCGCGTACCTGCCCG 

NATGGTCATAGCTTGTTTCCTGTGTGAAAATTGNTATTCCCGNTCAACAATTNCCACACA 
AACATACGAGCCCGGGAGCISIATAAAGTGTAAAAGCCTGGGGTGCCTTAANGAGTGAAGCT 
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AACTTCACATTAAATTCAGTTGCGCTCACTNGCCGCTTTTCC 
Sequence 554 

GGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCTCTCTTCTACTGTCC 

CAGCTGTCTCTGi mi 1 AGGCAACTTTCTGTGGCTTTTTCAAGAGTGGAGATTTGGCTG 

GAGCAGGATTAGGAAGCCTCCTGCTACACGCTTTCCCTGAGAAAAACCTNCTCTCTGGAA 

GGGGCCTGAGGTGAAAATGATTCAGAAGGCAGCTGAGGAGTTCTGGAAAGAGCCCCAAAG 

ACCAGGCTCTGAGGCCTNCCTGGGCCACTTACTAACTAAATGGGAGAAGATTACTTGAGA 

AAAAGCCTTNACAGTGCCCACCATTGATACCCCATT 

Sequence 555 

CCGCGGTGGCGGCCCGAGGTACTGTTTCACTTTCATCACCAGTAGGTAATTATTACTATT 

ATATAAGTTTCTAAAAGGTTTGTTTTTGCATATTCAAATAAATATGAATTTCCATCTTCT 

TTTACTCAAATGGTGGCATATTATACACACAGTTTTTTTTGAGACAGAGTTTTGTTCTTG 

TTGCCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACCACAACCTCCATCTCCCGAGT 

TCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCATGGTGCCACCA 

CGCCTGGCTAATTTTGTATTTTTAGTAGGGACCAAGTTTCCCTA 

Sequence 556 

CCGCGGTGGCGGCCGCCGGGCAGGTACTTTGAATGCTTCCTCCTGGAAGTGAGCTACTCA 

CATTTTATGAGTCAAAGCAAGTCTCTTAGCCCCACTTCACTTTAATAGGGCAGGTATCAC 

TAGCCCTAATGACCACTTCAAGGGGTAGAATATGAGGCTGTTAATATTAAGGAAACAACT 

CTGAAGTCTAACATTATTGAAACCAGCATTCTGGTAAACACAGACAAACTATCAAAGACA 

ATAATAATAGGATCATAAAAGTTTCAGAAAGCACAGACACGGCTGAAACACTAACACCCA 

ACTGGCAATGGAATCATTCCTTGCCCTGCC 

Sequence 557 

NCGAGGTACATCAGGCAGTCACTGCTCCCGTCATACCGGGCACATTTCACCCACAGGCCT 

GTGTAAACAGTCAGGTTCTTCTCGTTTCTGTTGAATGTGATCAATCGTAATTTTCTCCAG 

TTGGGAAGCAGAGTCCCTGCAAAGAGGCCTGCTACTGAGGCGATTCCACACAGGAAGGAA 

AGGACTGTGGCTGCGTGGACATCCCGACAGCCCATGGCAGGCAAGCTTGTGGAGTACCTG 
CCCG 

Sequence 558 

ANGTNCCANGTNCNAGCCTGGGCCTTAAGAGGTCTTGCTTGCTTCTGTTCCCTTTGGAAC 

TCTGTTACTGCTGTGAGCACGTGAACAGGCTCACTTGGTGGAAGGGGAGCAGCTAAATGG 

AGCAGAGCCAAGTCATTTTACTTGAGTCTGGCCCTCTCAGACCAGCCGTCAGCCAGCTGA 

CCAAAAATGTGTTAAGACAGAACAGCCAACATCAGCAGAGCCACCTACCTAACTTGTACC 
TGCCCG 

Sequence 559 

AGGTACTAGAGAAGACACCAGGATGGCCAGGAGACAATGGCAGCATTAGAAGGCCCTAGT 

TGAACAGMTATATAATTTCATTTAACAGCTGCCGAAGCTCATTACGCTGGCATGAATGA 

TTCTTTACMCCACAAAAAGAAAGAAAAACAGACAGCAGCCTATTTACAGATGCAGTGAG 

AGCTGTCCCTTGCTTTCCAGCTTTCAAGGGAAGCAAAACATGTCTGCATAATAACTTTAA 

CCTATAATTCAACGCTATTTAAAAGTGTAGCTTGCCATAAGCTTACAGGGATATTTTTAT 

ACTACATTTTCAAAACCTCCACAATCTCTCAGCAGTTTCTGACTGGATTGCTTCCAGGCT 

TGCTGCTACCTGCTGCGGGGCCAATCAAGGCAGCAATCTANCCAGCACAGGCATTCTTTT 

NTGGCNCCCACCCTTCTTGGCACTCAAACATTTTCTATCCAAACATACTTTGCATATCTT 

CATGTTTTCAACCTGAGGAAAGTATAGCTCAAATGCCTT 

Sequence 560 

CCGGGCAGGTACCAGCCAGTTTTGAGATATACCTGGTTTGCTTTAATGTTATTAAATTGT 
NNNNGGTGAGTTTTG CAAATG GACAATTGAACTTTGTTCTGCCAAATTCTTTTTTATTGC 
ATTT AAACTTC AN G G N TTTTTATTTCC GGCTTATTATG AATG G ATTAGTC ATTTTGTG AA 
TATMCTCTTCATTGAGTTTAGGGAGAAATANGACGTTAGATCCGAAGATGCTNTTTGTT 
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TAATNACCAGAATGGACAAAATTATACTTCATTTACTATCANGGAGGCATGGGGAGAATA 
GAGGCCTAGGGGTAGCCGTCTTGTGGCTCTTACTGGTTTTGTGNATNTGGTTTTTTTTT 
Sequence 561 

CCGGGCAGGTACACTCAGTGCCTTACACGTGTTGTTTCATTTCATCTTCACAATAGCCTT 

AGGAGATAGGTGCCATCATTTCTACTTTATATATGAGGAAACTGATGTGAGGGGACAATG 

TGTGCCCAGGCCACCCGATCAGTGATTGGTAGGGCTAGAACTAAACCCAAGGCCTGTGAC 

TCCAAATCCCACCTTTTCACAAACCAGAGTGCCTCGACAGCCAAGTCTTCAATACTAATT 

TACTCAGGGTGTTTGTAATAAAATATGAGGGTAAGAGAGGTCAATG I I I I I I I I I I GAAC 

ATTGAGATTCTTCAGAACCACACAACTGTCTAGAGCTATGCT 

Sequence 562 

CCGGGCAGGTACCATTCTCCAGGCACATACAACATGAAATATCATACAGCTGTGAAAGAT 

CTGGTGGAAAGCTGGMTATTGAGGMGAAACATTTTTTGAAAATTATTTCTCTCTCTAA 

GTCAGTAAATGTTTGATATAATTCTGAGTGCATGGCTGTAGTCTGAAAGTTTATGTGCCC 

CTAAAATTAATATGTCAAAATCCTAACACCCATGGTGATGATATCAGGAGGTAGGGCCTT 

TG GGAAAGTAATTAGGTCATTAAATTAATAAGGA.AATCGTATC C ! I I I I I I I I I I IGAGA 

CGTAGTCTCGCTCTGTTGTCCAGGCTGGAGTGCAGTGGCACAATCTTGGCTCACTGCAAC 

CTCCGCCTCCTAGGTTCAAGCGATTCTGCTGTCTCAGGCTGCTGAGTAGTTGGGATTACA 

GGAGCCTGCCACCACGCCCAGCTMGTTTTGTATTTTTAGTAGAGACTGGGTTTCACCAT 

GTTGGCCAGCCT 

Sequence 563 

CCGGGCAGGTACAGCTGGATGTTGACATAGTCAGGTCTGTTCTCTTGCCAAAACTTGTTA 

TGC AACATAC AGAGTAAATCTGC AAACCAACGGAAGGACTGG ATATCTCTG C ATAC CC AA 

ATAAAGTATAGTCTTCTAAGCTTGTATGGTTTCCAGTCATCCAACAGGGTGTTGAGTATT 

GATGCAAATGGAGTTACTCCAATGCCTCCAGCCACGCAGAGGCTGACCTCATAGTTCAGT 

GATTCCTCAAATGGACTTCCAAAAGGACCATCAATATACAGCTTGGGATAATTTCTAGAT 

TGAATGAAGGGCAGAATTTCGGAGTCTTGACTAGATGAAGGCAGTAGTAAATCTCGAAAT 

CGTTCTGTCCAGTCTCCTACTATTTTAAGATGAACCCCAAATGTTGCTrTGGTTTCAGTT 

GGACACATTGTGAGGGTAAATGGATGATTTTCTAATGCAGATACACTGGGACAATGTAGA 

GTAATATACTG ACC AGGTCTTG CTTTAAAAATTTTCTTTTGACC ATTCG GATTTCC ATG 

Sequence 564 

CCGGGCAGGTACGCGGGTATGACGAGCCTGTGGCTTTCAAGCTGTGGACATCTGGCCTAG 

CTAGATTTCTACTTTTGTTTGTTTGTTTGGGTTGTGTTTTCCCCCCA 

GAAAGAGAAATAAGAATTGGCAGAATGTTGCGAAAGCACGGCACATCTTTGGCCAAGTGA 

GGGGTGAGAAGGCCCTGATGTGTCTTCAAGATTCATCTGGGGAACCCCCTAATGCTACTT 

CTTAGAAATGCTTCTGCATTTAGAGCAGCCAGATGAGGTTGTGTTAACTTTTGGGGTTTC 

TGAAGGCCTTTGGCTTCTTCCTCTTGGTCTAAACAGAAGTTCAAAATNNNACATNAAAAA 

AANATNTAAGTACCTNGGCCGCTCTANAACTAGTGGATTCCCCCGGGCTGCA 

Sequence 565 

TCGGCTCCCAGCTACTCACAGAGAACGTTTCTGTTTCTGACAGCAGAGTAAGAGAAGAGG 
TGAGAAGAGAGACAGCCCATTCTCTTTTGGCCTAATTCCTAAGGAATCATGCCTCTGCCT 
TC GGGCCATTCTC AAGTCTTGTTCAAGGATAAAATG ATTTATTG ATGG C CGTAATTAAAA 
AGCAATGCCAACGGAAGCAGCGTCAGCATTTTTCATTTATACCT 
Sequence 566 

AGGTACGCGGGGTGTTTTTGAAAGGGTCTGATAGGAATTGCAATGGTATAGGTTAAGTAA 

GATAAAGTCACATTTACTGAGTGGGCTCTTTGGGGACAGCATACATCAGAGCATATAGTC 

TTCTTGGTCTAGCAAAGAATAATGGTTATGAAAATTAACCATTTAGGGCTAGAGAAGGTC 

TTGGAAATTGTATAGCCAAGCCTCTTCATTTTGTCAGTGAGGGGATTGATTCTCCAGCTG 

ATTTAGGTCTAAATCTGTGACAATGATTTATATACCTATTCATGATCACTTAACTGGTAT 

TATTAATGGCAATTAATATGAGTCAAGCAGTGGTTTAAATGCTTTGCATATATTAACCCA 
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TTACTCCACAACAGCTCCATAGGTAGGTGTATTATTATCCCGCTTGTAAAGATGAAGAAA 

CTAAGGTAAGAGAGTGTAGTTCACT 

Sequence 567 

ANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACAGTTAACCACAAAACAG 
GCCTCTCTGAAAAAGCCATTGCCATGGACTGCCAGACAGACAATGACAAGACACAAATAC 
CTTCTGGTGTGTGAGCCACG(3GACATGTGAGCTTCCCCGCTGATGCTCCTCTTATATCAA 
AGATCACTTTC ACAAGATGAGC GACTCAATATC TTTTATCAAAC C AATGATC ACCTGCAA 

GCTATG GTATATTTTTGC AGC TGTGTAGAGCTATGTGGC ATG AGAATGTGGG ACTTATAA 

ATTGCTGATCCAATAAATAGACATTATGGGCAACAGTGTCTTATCAAGCTAGTGTGTACC 
TGCCCG 

Sequence 568 

TCGATAGAGCGCCTGCGNGCGCNCANNACCTGNCGCACGAACGCCANAACACCNTNGATC 

AACTGTGGGAGAGCTTGTCTCAGATAAGGTCACATCTGAGATACNGGGGGTCANGATTCT 

ACATGTGAGTCTGGGAAGACACTGTTCACCCCTTAACATCTCTGTTCTNCAGGGTCTTNA 

AATATANTTTNGGGTGCAAGGAAAGGCAGCTAAGTACCAAGTTAATTATNAGAAGGACCA 

GTTTTGAGGATTTCAGGAAAAAANAAAAAAAAAAAAAAAAANGTGCGGC 

Sequence 569 

AGGTACCATTTCCCCTTAAGCAAGCAGGATTGCAGGTGGGCTACTGCGCTAAGTTTTTAG 

CAAAAGAAAAGTAATCTGATTTGAATCAGGACTCATGGCAACTCTAAATCACGTATAGTT 

TTTCTCATCAGTTAAATAATAAAATGACAAAACTGTTTCGGTAAAATTCTGATTTTAAAG 

ATAAAATA TTATT ATAATTGATATTGMGAAAGCAAACAACGTTCATTGCAGGACTCCTA 

GTTGCATATTTTCTTGTAGAAGATATTTTCTTGAAC^ 

CCATGTTTCAAGAGTTCCACAGGCTGTACCTGCCCG 

Sequence 570 

AGGTACTGGGGAGAACTCAGAGAAGAAAGGGGATGCACTGCTGGGCCTCATGAGTTGGCT 

TATACTTCACCTTGAGCTAAGCATACTAATACCCAGTCCAGGCAGCATTGAACAGCTCCG 

CGCAGATCTCACCCTCCAAAAAACAGGAAGAAAAAGCCCTCTCACTGCAAGCAAGATCCA 

AAGAGAGAAGAGCAAGGGAGGGGCTGATGTGAGCATGCCCAGATCCATCCCCCACCCATA 

TTCGCCACCAAGGGCGGAGACAGGCAAAGAGGGCCCCTGTCCTGGAAATCCCTCCATGAG 

GAAGCAGTGAAGGATGCAGGGCCCACTCTGTCAGTGTCTAGTGTTCCACCGATCCTACAT 

AGTAATTTTGAAAGACGATAGCATGATGTTCCTGAAAGGAAAAAAAAAGTTTCCTCTATG 

AGGTCATTAACATATATCTTGAGTGACTTTATTGGNCNTCTGGGTCATTTTGACATTTT 
Sequence 571 

AGGTACCATTCTCCTTCAGGCTGGCTCTATGTTCTTCATGGGCAGCGTCTCCACGGTCAT 

GCACCATGCCGCTGGCTTCTGGACCCCGAAGGTGCTATGGAGCCCAGCTCCATTCCACTG 

AGTCCAAAACTCACGAATACAGAGGCCACCAGCACTACTGAAATGCCAGCAGGGTGTGTC 

GGCATTTGGAGGGCCTTCCAGCAAAGCTTTGAACATGCTATAATTGCTCCAATATAACAA 

GTGATGTTCCAATTAATAAGATAATTTACAGCAATCCACAGTCTCTGCCACAGGTGGTTN 

TTTCTTCAAGAGGGCTGTCCCGGGACAGAACTGGGACAATGCTTGAAGCTTCTCATCNTG 

TCCCCTCTGCCCATAAATTAACTCCTGTCTTCATAAGGGANAATAAATNTTAAGGAAAAT 

AAACCAATTTGTGCTCTTNAGGGGGGAAANGGANGGGGAAACAAAACACTTTGGGGAGGT 

TCAAGAAAAAGACCCTTTCCGCTTNAAAAATAGAACNCAAATTTGNTNCCTGGATTTTTA 

CTCCTACATTTGNTTTTTNTTCCCCCC 

Sequence 572 

TCGAGCGGCCGCCCGGGCAGGTACTCACAGAGGAGATGGGCACGAGCTCAAAGCCTAGCA 
TCTTTGATCTTACTGTTCACCTGGTAATGTTAAGAAATTGCCCCTTTGTGCTAATCCTCA 
GCAGCAGAGAGAGCTATGCAGGTTCAATTACTAAAAAATTCTCCCAA7TAATTTTTTGGG 
CATTAGTCTCAAAACTGGTTACTCCGTGTGTTGTTCTAATGGTGTTTCTTCTTTTTCCAC 
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AAGG ATTTC CTG AGATTGGCATAGTCC CCTAATTAATTG AAAAGGACCTAAAAGAATAAT 

TGGAAATATGC TTTTG ATGGTTGTTGTATACAAATGTTCTCAGAAGCTTAGGGAAATGGG 

TGTGATTTCAGATTTTCACATACATAAAAGGTGCAAGGTCTTCTGAGGGAATGGTTTGCA 

TGAGCTTTAATGTATATAGTTCAAAGGACTTACAGATGAATGAATTTGGTTAGTCAAGTA 

ACCCATACTAATTGGACAGATGGAGCAAGACATGGATGAAGAACACCATCCTGCCAATCA 

ANGTCATTTATAATTT 

Sequence 573 

TCGAGCGGCCGCCCGGGCAGGTACCATATCCTGGGCTGGTTAAAAACATACCATCCCTGT 

GATGGTGATGACCATTGACTTCACTC7TCACAAGAATTTATGTCAACGTGTCCTAGACAT 

TTGGTAGTAGAGGTATAGCTGTAGTCCAAACCTCTACCAAGCTATGGTTTCTGAGAGCCC 

TGTATGTTAAGTGCCTACGGAATGCTCAGCATAGTCAAAGACACCTTAGATTCACATTAT 

ACGAGGAGCCACACTGCAGTTATTCAAGATTAAGCACTGGCTGGAGAAACTGGCTGTATG 

AACTCCAGCCTTGGGTGCGCTGTACCTCGGCCGCGACCAC 

Sequence 574 

CGCCCGGGCAGGTACGTCAAGACACTTTTAAACTATACATGACAGTTCCTTGGGGATCTT 

TCTCC AACTC C CTAACCTTAATTATGTCACTCCCAG ATATGCTCTC AAAGTATC CTGTAC 

I I I I I I I I I I GAGATATGGGGAGATGGCAAGGAACAAAAAGAGAGGAAAATGGACAGAAA 

TGAAMGGTGAGGTCACCCMGGCGMCTTAGTCAGCTTGCCMTTTTACCTGGGGTTCA 

GACGTCACTCGATTCTTTCCATTTCCATGTCGTGGATGGGGGGAGGTGCAGGAAAGGGGA 

CACTCATCTGGGACTGGGTAACTGTCAAGCCCAATCTAATCAAAACCACCCTAAAGCACA 

CAAGGGACTGGAGAAGAACACTAAAAATTTAGAACAAGATATTTAGACAAGATGCTTCCT 

AAATTCTGTGAAATTTTCATGAAGTAAGAATTTCCAAACCCAAGNTCATTATCTTCTCTC 

TTTCC 

Sequence 575 

AGGTACTAGTAAGTTCTGGGTAGGGGGTGGATGAGTCCTAGGAGTGGGGGGCCCCAGGCA 
CACAGGAACCATAGTTAACCACTCTTCACCTACAGAAGAGCTCCTCAGGTCAAAGCTTCA 
CTGTACCTGCCCG 
Sequence 576 

CCGCGGTGGCGGCCGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAATCCAATTATGTC 

CGTCTTGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAAT 

GCATGGATTCACCTCCTTCAGCACATTTCATTTCAATTGTGAAGAAAAGATTCCAGGCAC 

TGAATGTAAAATTGAACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTT 

AGTTGCTGTGAGGCTCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGAA 

CATCCTCCGAAGTTCTCCCCAGTCCACTTTACCCGCTCTGCCCCGCGTACCTGCCCGGGC 

GGCCGCTCTAGAACTAAGTGGAT 

Sequence 577 

AGGTACTTTCTGTGCCCCTCTCCTCACTTCCATCCCTGTCACTCCTAAAGGATCTCCCTC 

TCCTGCTAGGATTCTTCAGACCATGATCTGAAAACAACTGATTTAGAGTAAGTAATGCTT 

TGGGTGCTCTTCCAAGAATTCTATCCCACCTTTTACTTTTGTGATACCCAGTTAATCCTT 

TATGACCTAACTTTCCATTACATCAAAGGAGCATAGGGTCCAGAATCTTACAGGAGTGCA 

TGACTTTTCCACAGGCACTTCTGGCTTGCTTGTCAGGCCACCTGAATTAGGTATTGCTTN 

TTTCTCTCATAATCAGCCATATGCTAATCCCTCCCGTTCTCTATTGGACAGACCAGAACC 

TCTAAGGAGGGGCTGAGCTGCTTGCTTGACGAGACACAAAAGCACTGCTACACAGCAAGA 

AGGTTGAACTGTTNGGACACAGGCACTGCGGGGGGG 

Sequence 578 

GTCGACCCCGCGTCCGATTCCAGAAACACATCGCTTTCTTCCTCTGTACTGGAGTGTAGC 
AACACAAAGACACAGCCAACAATTTCACTTGGTGGTCTGATGAACTCTCATAAATGCCTT 
GTTTACAGCATAAGAATGTAAGTACATATTTATATATCTCTCTCTGACAAAATATCCTTT 
CCATATATCTGATGGCCATTAATTTGGACTTAACTAGTTTTGTCCAAAAATGAAATTCCA 
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AGGGCCAGGGAAGAATAAAATGTACTTGAAAAATTTCTTCTAACTGGTATAAATATTAAA 

TTAAACTTATATGCTAATTGGAACAATAATGGCCCTTTGGATGATTCTACATTACAAGCG 

TCAACTGAAATAAGGTCCCCAATTGTCACTGAGAACATCAAAGGGATCTTACTTTGAGGA 

ATAAATATGTGATTATAGTGATTATGTGTGAATTGNGGTTATATTCTTCAATAAAAGAAA 

CATTGCTGGGATTTACAACTCTCCTCTGGAGACTTAAGTCAAATAAAATTAACACATTAT 

TTGAAACACCCACANGGATTCCTGGCAGCCAGAAATAAACCAAATTATTTTTACCCAAAC 

c 

Sequence 579 

CCGGGCAGGTACGCGGGGGGACTTACAAGATGGATAGGACCTGGTGGCAGCTGTCCTTGA 

GGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAGAGTACCT 

Sequence 580 

CCTATAGGGAGTCGACCCCGCGTCCGTTCTAAGCCTGCCCGCCTGGGGCTCTGTAACTAT 

GGCTCCTAACGTGGGGAGCCCACAATCTAGCTATATCATGTTAGCGAGAGCTCTTTGTAT 

ACTGAAACAATAAACTTACCATCTTGATGAGTTTTACATGTAGAATGGCTTAGGTTATAA 

GGTACTGCCACCTCATTCCACTGTTGAGAATTGTCTAATGCAAAGCTCACTTTTTAAACA 

TGGATAATTGGACAGTCATGAGTGGATCATCGAAGGGATAAGCAAAATATATGTCAAACT 

GAGAATTAATATCTATCAAAACAC1TGCTATTATTTGGTAGATGGGAGGTTATTTTATTT 

ttaagaagtagattggctgaatggaaaagcaaaatgtggcataaaataaaatgggatatt 

GTTCAGCAGTGAAAGGAATAAAGTACTGATATGGACTACAACATGGATGAACCTTGAAAA 

cattatggcaaagtgaaagaagccagttacaangccgggcaccatggctcatgcttgtaa 

TCTCAGCACTTTTGGGAGACCGANGTGGGTGAATCACTTGAGGTCANGAGTTCGAGACCA 

gtctggacaaccatggggaaaccccccgtctctactaaaaaaaaaaaaaa 

Sequence 581 

agtcgacccacgcgtccgcagtcaccagggaaacaagcaacatcatcatggcacaggtga 

gcagggacctagagggtaaaccaaagcccaacatgcagagccatcataaagatcttcacc 

cagagggaaagactctgaaatctgaacatctatctataagcaaccgccttttcagtcctt 

taaagamggmttmtmcaaatgttttcaagacatccacaattaaaaccactcatgca 

ccacacacacagcaaccaccctgcacacactgtctcattaaacccagacactgagaaatc 

actcaaggccttctctgcagttcagtaaggctcatgatgatgatgagatctgttttagaa 

agtaaactcttcaggagccaaactatattttctttgcacacttcagttgtgtgagtaaat 

gtgcagcccaactacagtatgattacaaaggacaaccaatgaaacacaaagtctgtaaag 

cggaagmgaccacctggagatttttcttttcaatcagcagagattaattagcctacatt 

tcttagamtgtggactttaccatagacaatatttttcttcccaaggatgaccattctta 

gacngnccaatgctttccaatggtgtggcccccaaaatttttggacccacttgtgtaaaa 

CC i i i i i 
Sequence 582 

gggagtcgaccccgcgtccgtgcatttccaactgaggtaccctgttcatttcactgggac 

tggttaggcagtgggtgcagcacacggagggcaagcagaagaagggtggggcatctcctc 

acccaagaagtccaaggcacgggcaagggagtctccctttccaagccaagagaagctgta 

atctacaaggmcttaaacaaattttcmgaaaaaaacaaacaaccccatcaaaaagtgg 

gtgaaggatatgaacagaatttctcaaaagaagacatttatgcagccaacagacatatga 

aaaaaagctcatcatcactggtcattagagacatgcaaatcaaaaccAcaatgagatacc 

acctcatgccagttagaatggcaatcattaaaaagtctggaaacaacagatgctggagag 

gatgcagagamtcggaatgcttttacactgttggtgggagtgtaaattagttcaaccat 

tatggaagacagtntggcgattcctcaagaattatagaaccagaaatccatttgatccca 

cmtcccattacttggtatatatcccaaaggattattaaattcattccactattaaagac 

ccatgcccnccttatgtttatttgcagcactttttaccaatagccaaagactttgggaac 

ccaccccaatgccccatcaatgattagactggattaaagaaaaaattccncatacnccat 

gggggatactaccgcagncnttaaaaaaagmccgagttctttttcctttgccnggggnc 
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ATTGGGATGAAANCTGGGAAAACCCTTTATTCTTTAGNNNGGCCCCCCCCCCCCCCCCCC 
CC 

Sequence 583 

ACNTTTATAGGGAGTCGACCNACGCGTCCGCGGACGCGTGGGTAAAGAATGGGCAGCCAA 

GCCAGGCGTGGTGGCACACACTTTTAATCCCAGCACTTGGGAGGCAGAGGCAGACGGATT 

TCTGAGTTCGAGGCCAGCCTGGTCTACAGTGAGTTCCAGGACAGCCAAGGCTACACAGAG 

AAACCCTGTCTCGGAAAAAAAAAAAAAAAGAATGGGCTGCCAATCCACAGCCACCCTGAC 

A ACGGG GGGTGGGGGTGGGGTAGCTAAAGCCAATTTGAAGAAAGCCCCTATGAAAGGGCG 

CTTTTTCCTTCAAGGTACAGACCAGTCCAAAATGTACAGACACCCCTCACTATGAACACA 

AACAATCCTGGTGTAA ATTACC TMCTACAGTTGGGAAATCTAACTGGCTTTAAAAGGAA 

AAGAGAGCCTGCCGTCTTTTTGTGGGTGCCATTTTATNTAGATGGATATATNTTTGAGCT 

TTTGGGTCTGTTATGCTT 

Sequence 584 

GGAGTCGACCCCTCGTCCGCGCCCCCGCCCGGCTAATTTTTTGTATTTTTAGTAGAGACG 

GGGTTTCACCGTGTTAACCGGGATGGTCTCGATCTCCTGACCTCGTGATCCGCCCGCCTC 

GGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCGGCTGATTTCATTGTT 

TTTTATGTCTGTGTAGTATTCCATGGTGTGTATGTGCTACATTTTCTTTATCCAATTCAC 

CATTGATGGGTAC7TAGGTTGATACCATGTTTTTGCTGTTGTGAATAGCACTGCAATGAA 

TGTATGAGTGCCTGTGTCTTTTTGGTAGMTGATTTGTTTTCCTTTGTGTATATACCCAG 

TAATGGGATTGCTGGGTCAGTTGGTAGCTCTGTTTTAAGTTCTCCGAGAAATCTCCAAAC 

ACGGTGGAATGTTGGCCTGTAGTTTTCTTTTTTTGGTGTGCCTCTGCCAGATTTTGGTAT 

CAAGG ATGAT GCTGGCTTCATAGAATGAGCTAAAGGAGGANGCCCCTTCTCCATTTTTGG 

TATAAGTTTTCAGTAGGAATTGGTAATGAAGTTCTTCTTTGTAACATCTGGGAANAATTT 

TGGCTGTGAAACTCAATCTGGGCCAGGNGCT 11(1111 I GGGTTGGTAAGG I I I M I I 1 I 

AAAATNACTNGATTCAAGTTTTTGNAAATTTNGGTM 

AN CTTT CNTTCCTGGGTTCNAATCTTTGGAAGGGTTGGGGGGTTTCCANGAAACCNTNTC 
CATTTTCCCNNTAACATTTTCCTTTTCC I rTTTTTTTTTT 
Sequence 585 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACAGACCCATTTAAAAGGTGAAGAAACTAAT 

GGA CATAA GTTTCACAGACTCAAAGTCTTACGGAGAGTTTGGTAAACCCAGATCTCAACT 

CAGATTTTCCAGTTCTGCTMTTTTCCCCATTCCCCTCTGGTGCTGCCTTTTCAACTTGG 

GTTGTCMCAGGCAAATGGGACCAGTGGAAGCATTCAGAGAGAAGATTGATGGTTTTCAG 

ATGTTTCTTTCTCTCCCTGACATGATGTCTTAGGCACAACCACAGTATTTCAAGTGGTTG 

AAAATCAAGCTGAGCTAATGGGGGCTGGAGTGAGGGTGGAAGGGCAGACCCAGAGACTCA 

ATCTACCAGATGCTCATTCTCTTCCAGCACCCACCCTACCCCATCAATTGTTCTGGCACC 

CACCACAAAGCCCTAAGAGCCTTGGTGGAAAAAGTCTGGGAAAATACTGTGGTANGGGCC 

AAT CAAA GTGACGATNGGCCACCACTTANTAAGTTCTNCNATAAACCANGGGCTTCCAAA 

GCATTTTNGAAACCTAAGAAGGGTAAATCATTCCATGCAACCGGACATACCAAACAGATG 

CTCAGAAGCAATTAAGTACCTTGGGCCGCTCTTANNAACTTAGTGGGATCCCCCCGGGGC 

CTGGCAGGGAATTCGANTATTCAANGCTTTATNGATACCCGNTCGANCCTCNAANGGGGG 
GGGG 

Sequence 586 

TAGGGAGTCGACCCCGCGTCCGGCCACCACATATGTGGTAATGTGTTATAAGCAGCAATA 

GGAAATTAAAACAATTAGGAAACTGTGATAGGGACAACCCATGTGAGAAGTGACAGTGGA 

TAAAGAGAAATTGTTGAACTGCAGMGTTTCAAGAAGATAGAATTGCTGGGCACATCAGG 

GTACGCTTAAGTAAATAAAAAAATAAAATAAAAAGAATTGTTGAGTGTGCTGGCTCACAT 

CTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGCGGATCGCTTGAGGCCAGGAGTTCAA 

GACCAGCCTGGGCAACACGGCGAAATCCCATCTCTACAAAAAATTCAAAAATTAGCCAGG 

CATGGTGGTGCACGCTTGTAGTCCCAGCTACTCAGGAGCTGAGGTGGGAAGATCGCTTGA 
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GCCTGGAGGTCAAGGCTGCAGTGAGCTGTGATTGTGCCACTGCACTCCAGCCTTTTTTTT 

GACACAGAGTGAGACTCCATCTCANNNAAAAAAANNNANNNNGGGAANGGAANNNNNNNN 
ANNNNNANNNGTGCCGGCCGCT NN 

Sequence 587 

TTAGGGAGTCGACCCCGCGTCCGGTCAATTCCATGTCCTAAAAGGGAGGTCAGAGAGTTT 
TCTGACAAGTCTTATAGGTTGTCAGCACAATTGTGTCAAGGTACTCAACAGTTCTTGAGC 
ATTTACTTAAACGAACTTATGAAAGACTGACCCATAAGATAAATGGGrrATATCCACTTA 
CAATGAGAGATACCTTTCCAGTGACCTGAAGAAGTGTCTAGAGTACCAAGTGTCCATTAG 
•GAATGATGAGAGGCAATACTGAGATTAGTCTGTGAAAAAATTAAGTAACTAAAATACTAG 
GAATATATGATATGTACATACATCTTCATATATGTTTAAGTTTACAAATATATGTGGTAT 
ATAACAGGGGACTGAT7TAAGTTAGATTTCAAGTTTGAAGGAAACCTAAAGGATAAGCTG 
AAGGGAAAGATAGAGAAAATAGAGGCCTAAACTAAGCCCGGTAGTGGGAGTTAAAAAGAA 
AGGATGAATACAAAATTTCTGCATTAAAATGAGTGGTACCGTTTGTCCCTCAGTATCTGC 
AGAGAATTAGTTCCAGGACCCCCATTTACCAAAACCTG 
Sequence 588 

GTGGTCGCGGCCGAGGTACTATGGCCTGACTGAGAAGGCCTAGATCCTTGAGAAATCGCC 

CCMGGTTTCATAAAGATCAGCTAAGCAACTCATG7TGTCCTCGCCTTCGCAGT7TTTCT 

CATACTGCTTCAATGAATCGMGTATTCTGTTGCCATTGCACTTTTGTCTTTGCCAACAA 

ACTGCCMTAACTCAGCAACTCAGCAAAGTATCCCCTTTTATAAAGGTTTTGAGACACAA 

AGAGATTAAGAAGGCAACCATGCAGCTTCTGTTTACTTCCCTGCTGCTGAAAAAGCCACG 

GGAGTTCATCTGCGCTTCTCCAAGTCACTCTGTCCTGACTTAGCTGCAAGGTGAAATAGT 
TGATTAGCTTTT 

Sequence 589 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATGCTGACATTCTGCAAAGTCTGCA 

TTCGCAAAGCCTGAAATGATTCCAACTGCTGCTGAAGGACTTTCAATGCTGATTGTGGTT 

GGATTNGATTTTNTGATTCTGCTATGCAGCCATATTTTAGCA7TCAGAGAAGCAACGGAG 

CATANCGTGGTGGTGAANACAGGCAGTCCGGAGCCGCCCTGCTGAATGCCCGCTCCATTC 

CTNGCTGNCTGNGTGACCTTANGGAGTTAACCTCTCTGTGCCCGTNTACTCATNTGTTTT 

ACTGNANNNCACTTTGCTTTANCCCCTCNNTAAGAATTTAANGATCTTGGAACGCCNGNA 

ANGGAGAACCGGGAGTCAAACTTTNATCCTTCTTACCCGGTTGCCGCTTTTTATNGTANC 

CACNGGGNTTACCNTGCCCAGGCCGGGAGAAACTGTGCTAACATAANTGGGANCCCCCCN 

GGGGGTGCAGGGGAAATTTCTGATATNCAAAGNTTTANTCGNATACCGNTNCGAACCCTC 
TGAAGGGGGGGGGGGCCCC 

Sequence 590 

CCGGGCAGGTACAAGTTACCTACCCTTTCCTTGCCTCAGTTTCCTCCTTTGTGACAATGG 

GGACAATAACAGAAACTGCCACACAAGAGGGATGTTAGAGAACTGACCAAGCGAAtGTAA 

GTAATGACTTGGAGCAACNTATGGCTAAGAGTAAGTTAGGGGTTGTTACGATGACTTTCC 

TGAGATAAACAGGGAGATCTTTTAGCTGNGGGTCAAGTGCANGAGGGACAGTGGGAGCTC 

AAGGCCTAATGGAAACTAACACTAATTGTACCTNGGGCCGCTCTAGAAACTAGGTGGATC 

CCCCCGGGGCCTGCAGGGAATTTNGATATCAAGCTTATCCGATACCGGTCGACCTCNAGG 

GGGGGGGCCCCGGGGACCCCAAGCTTGTTGTTTCCCTTTTANNTGNAGGGGNTTTAATNT 

GCGGCCGCCTTGNCCGTNAAATCATTGGGTCATTAAGCCTGGNTTTTNCTTGTTNTGGAA 

AAMTTTGTNTATTCTNGCNTCCAACAAATTTCCCACAACCAAAACAATACNGAAGCCCG 

GGGGNAGCCAANTAAAAAGTGGTTAAAAGGCCNTTGGGGGGTTGGCCCCTTAAATNGGAG 

GTTGGANG CCTTA AAACTTTCAACAATTTTAAATTTGGCGGGTTGGCCGCCTTCAACCTG 

GNCCCGGCTTTTTCCNCAAGGTCTGGGGNAAAAAACCATGNTNCGGNGGCCCCANNNTTT 

GCCATTTTAAANTNGGAATTTCGGGCCCAAACCCCCNCCGGGGGGAANNAGGGCCGGGTT 

Sequence 591 
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CCGCGGTGGCGGCCGAGGTACAAGTTGTCTTTATGCTGCGAGATAAGTCCTCTCTTGGTT 

TGAGCTCCCACCTTTTCAGTGAACTCTTACATTTTGGGGGATCTGCTCTTGTAAAGGACA 

TCCTTTCTGGTGAGTATTCTTTTGGTTTAATTTTTGGTTTGGTTATTTG 

AATCTCAT TAGGA AACAAGTTAAGTTGAATAGACCAACTAGTGAATTAATCCGTCACCAA 

AATATATGJ I I I iGGCATTTACCTGTTTATTTTGAAACTCTTTGTAAGAAATGTAAACCT 

GTAATGATAATCTCTGCTTTGTAAGGATATCTCCCTCTCTGACACCTAAAACACTAGATG 

CTTTCA CAAAG CAAAAGGAAGAGACCTAAATCTATCTATCTGTGTAAACTCACCCTTGAC 

CATTTCATTTTGMGGCTTCCTATATATGCTTT^^ 

AAGTTCTGTACCT 

Sequence 592 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAAATATAGCA 

ATGTGGGTGGGCTCCAAGGACTGTTCCTTCAGACCTCAAATCCACATGCTTATAGAAGAC 

CACATAGGTCAGAGCAAACTCTCTACACCTGTCTTTGTCAACTGTCACCCAGTTGAAGAT 

C TCTGAAGCTGCATG CCG ATTTAGGCCAAC AATC AGACTATGTAGC ATTTAC ATTTAC AT 

TTGTCTGTGTGTGATAACATGMGCCAGAAGGAAGGAAMTTTGTGGCTGCTGGTTTCCC 

TGAGAAGCAGTTTGAAGGTCAAAAGAAACCAACCCTGGAGGAACTCGAGTTATGACAGAG 

CAATGTTAAGTGTAGACCTAGGAGGGAAGCCTATAGGGCTAGGATAGGGGTGCTAACAGA 

TAAAGTGGCAAAGCAAGAGAAGAAGGGAAAGACGGCAGATGAGGAAAGGTGCCCTATCCA 

TAGCAGGGATGCAATGCAAAGACAAATCTAAGGTTTCTGTTTCTTCACCACTCCTGGCCT 

AGGTCTGGCTTTGCTATTTCCTAGCAGTGTGACTTGTCCTCGGC 

Sequence 593 

GTGTGTGCCTGTAGTCTCAGTTACTCAGGAGGCTGAGGTGGGGGAATCTCTTGAACCTAG 

GAGTTCAAGGCTGCAGTGAGCTATAATCACACCACTGCCCTGCAGTCTGGGCTTCAGAGT 

TAGATTTCATCTCATGAATGAGGGAATAAAGGATTAATAGAAGTAGGCAAGCAAAAGGGG 

GAATGAAGGGTATTCTAGGCAGAGGGAACAGTAAAGTCTGAGACAAGACGGAGTGGTATG 

TTAGAGATACTACAAGACTTTAAGTACCTGCCCG 

Sequence 594 

AGGTACGCGGGGACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTT 

CGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTC 

ACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAATT 

TTACATTCAGTGCCTAGAATCTT1TCTTCACAATTGAAATGAAATGTGCTGAAGAAGGTG 

AATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGA 

CTCAAGACGGACATAATTGGA I I I I 1 1 1 I GCCATGGCCTGGAAAGAAAGGTACCTGCCCG 

Sequence 595 

AGGTACTTGGTGAAGTGAGATTCAGACTGAGTGGGGTCACAGCACAGGGCACTGTCTTGC 

CTGGCTTTATCTGAGCCAGTCACACCTCTCCTGGCCACTATCTGTGGTCTAGC.CCCCTTT 

GTGCAGAAAGAGAAAGAAGAGCCTTGAGGACCAGCCTAGTCAGCCGGGGGCCATTGAGAC 

TGCCATGGAAGACTTGAAAGGTCACGTAGCTGTACACTTCTGGAGAGACCATTCAAGGCT 

TCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTNTACTGGTCA 

CAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTAGGGTTGAGTTGCAGCG 

CGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCGCCTTCCCTG 

GGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTANGATCTACTGGGGGAGTNCGGAGG 

AGCAATC Mill GTCCCGCTGGAACCCATGGTCACTGAAAGTTCCTTATGCTACTTTCAC 

TGAGCATTCTATGGAAATACACCANNGAGGAAAATTCCTTGA 

Sequence 596 

AGGTACTCAGAGGTTTCCTGGCTCCAGCCATTGCTCTCGACCTCACAGCTACCCATTGGG 
CCAAGCCACCTCCAGCTCTCACCTGACTTGCTGCAGTGGTCTCCCAAGAGGTCTCTCCAC 
TCAGTCCCTGCCACCTTCAGCCTGTTCTCAGTGGAGGGGGCCAGAGTGACCCTGTTAAAA 
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Sequence 597 

J™ GTGCTGGTCACAGGGCCAGAGGCAGAGAGCCCTAA CAAGCAGTCAGGGCGCTAAT 
GAGGA A GGGGGGGA ^^ 

TCCCATCCAGAATCCAGCA GGAAG TTAAACCAAACGACACATGATTCTCTGACGAAGAAA 

TAAGAATACGGTTGACTGGTTTTTTATATTTATTTACCTGAAAGCTTCTAATGTTTCTAA 

G J A ?^ A ^ A ^ A ^^^ AGGG ^ GG ^^^^^^^CCTAGATCATGATGTCTAAACACTGC 

ACACTTGACCAACnTGGGATCCAAATAACTAGCTTTGTATGCTCTTGGGCCCTGCAGTA 
v_/C« I c 

Sequence 598 

GC JrZ G ^ AC I GG ^^ 

A ^ G ^ CAGAGCCGAGACAGTCACTCTGT GGAGAGAGAACAGAGGGCTGGGMCCCAGCC 

AC ^CAGGGMCTGGACTCTMGGACAGCAGCAGAGTCTGCCTTTGCAGCACGTTATAC 

i?I^ ACGTA ™ CC ™ m ^^ 

T^GTTCCTAAACCATGAGCACAAGACTTTCTGGAGGAAAAAGGAAAGCCCCAGGAAGAA 

GCGGAATTTGTCACCCAGGTCAAGCGAGCTTGCCCTGAGCAAAAAC 
Sequence 599 

S GCGG 7 GGCGGC ^ 

2SS!~ TGTGATGGAGAAGTAACAGGGACG ATGAGGACATAGAGCAGGGAAGGGGGCC 

TAGGAAGCTATTCTGGAAGAAGTGAGTTGGTTCCTGAAGTTGAAGCACCTGCAGAGGTGA 

GGAGGTGGGGAGCAGAAGTGCAGGATCCATCTGTGCAAAGAGCCTGGGGCCTCAACAACA 

TGAACTGTTTTAATCAGATTTGCATTTTAGAAATGATAACCCTAGGCCTGGAGCGGTGGC 

TCATGCCTGTAATTCCAACACTACGGGAGGCAGAGGCTCTACTAACAAATCTCTACTAAA 

AATACAAAAATTAGCTGGGTGTGGTGGTGCATGCCTGTAATTCCAGCTACTCAGGAGGCT 

GAAGCAGGAAAATCTCTTAACACAGGAGGCAGAGGACGCAGTGAGCCAAGATCACACCAT 

TGCACTCCAGCCTGGGGTGACAGAGCGAGACTCTGTCTAAAAAAAAAAAAAAAAA 
Sequence 600 

AGGTACGCGGGGGGGATGGGTGTGGAGGCCGCGGAACTTCTGGCGGGCGCAAGCTATATG 

CCTTGTTGATCTTGCCGCATCTTTTTTACTAGATGTCGGCTGAGTGAGTAGCTACAGTTT 

TTAACATGTTTMCTTGCGGCCTTCAAAAATTGTTAGCGGACTTTCTCTGTTGCCTCCTC 

AGTGAGCTCTGCCCTTTCTCGCAACTCCTTGTCGTCCCACCGCGCCTCTGCACCTCAGAG 

GAAGGAGAAGAGTAGCGAGGCCGAGCGGGTTTTTAGATTTCTCTGTGATTACCCCCAGCC 

TTGTCTTTTCTGACTTTACCGTCCCCGCTTGGGGTACCTGCCCG 

Sequence 601 

CCGGGCAGGTACGTAAGTTGAAGTCCAGGAGGGTTCCAGACACAGAGCTTCTGGTGTCTT 

CTCCCTGTGGAGTCAGGACTCATTACTCTGCTGTCTTTGATGTGTGACAACAGGCACCAA 

CTATCGCCAACTAGAGAAGCTCACCAGAGCTCTGATGTTGGAAGTTTTTATTGGGGCCTC 

AGCATGTAGACATGATTGACTGGTTGGTGGATTAACTGTCTACATGGTTAATCTTAGTGT 

CCAGGTCAGCTGATACTGCGTGACCAAAAGCTCCTACCCCGAATCCCTTGATTAGTATTT 

CTGGCATAGCAAGCCCCCATCAGAAACAAAGACATTCCTCTTAGAGGTAATATAGATTAC 

CTTTCAGCACCAGAGGGCAAAGGCCAGACTACTTTTTAGGCAAGTACCT 
Sequence 602 

AGG I ACACGTGCAGCAGAAGGCTGG TAGGATAGTTTCTTCCACATCCATGTGAACCTCAG 

AG ^ CA I CTGACCA ^ GGCC ^ G ^ CT CTTCCAGGTTCTAGCTGGCCAGAGACTTTnTGG 

CCTCATACTAACTGCAAAATTGCACACAGTTGGCGAATACACCACTGGAAGCCACTGTGG 



Page 100 of 869 



WO 01/51628 



PCT7US01/00798 



Table 2 



CTG CATTGTCAC AGTTCACCAATTGTGC ATGTTATTTCCTG CAGTGGG CCCAAGATC ACT 

GACAGGATGCCGTCGGGGTTGGGCTGAAGGTGGAAACAGCACACTAGCTTGCTGTTTGTG 

GCCTCTTTACAACTGCCGGCCCCATCTGAGATCCGGAACCAACACGTTGCTAACCTCAGC 

CCATTGGGGCCAGTGGGGCAGCCCGGCAGTGGAACCATANCTTTTCCACACCTATGTCTA 

TTGGCACCGGTAGCTGCCCGGCCCCCAAGTTTCAACTGNCATTCCCCCTTTGNCATTGTG 

GCCCCCCCAANAANAAAANTTAGGGCCCAAAAAAAAAAAAGCCTGGCCTTTNANGGGNNG 

CCCCAAAAGGGGTNATNTTTNGGGGGGGGNTNAAACTTTTTTGCCCAAAAAGGGGGGGNA 

AAAAAAAAATTTTTTNCCTTNAAAAGGNNC I ITTTTTTTI I NAAACCCNGGGGGGGCANA 

AAAGGGGGCCCCGGGAGANNNN I I I I I I I I NTTNTGNNNNCCCCCCTTTNTTTAATAANA 

ANAAAATNNGTTTNTTTTTTTNCAAAAAACNCGNGGGAAAAN I rTTTTTTC 

Sequence 603 

AATTCTCACATGTGCTTATTTTGAAATTAAACCMTGCAAAAGTTTTTACCCTGAGG 

TGCTAATTCATGAACCTATTCTCTTGACAGTTTTTCCCTCCAATATAACCTATATCTTAT 

TAAATGTCTTTTCTTCTTCTAGATTG 

GCTCCAAATTAGAAAAACGCTGAGATTAGCCAAATATTTGAACACTGTAAGGTATTAATC 

AGTTTAATCTGAAAAAAAAGGTATAAGAACACTTCCTACAGTTCCAGAAAGAGTGAGCTG 

TCCTGGCTGAGGGTTATTTNATTAAGCACCTACATGTNTGGACCACTAATNGGGGNGGGC 

NNTCNCCNNAGGGAAAGNTGAAAAAANNCCNGGGACCCGGGGGNTTTNNNGGTGGGATCC 

CCCCCCTTTTTCAGGGGNANAAAAGTTTITGN^ 

AGGCTTTTCTTNAAAATCCCATGGGNTTTNCNTTGGGGCCAAGGGGGC I I I I I I I I I 1 I N 

NCCCCAAAAAATNTGGGGGGGGGGGAANAAANCCCCNTTAAANNNCGGGGGGGCCTTTTT 

TTNNNNTNGANATTANNNGGCCCAAAGGANNNANGGGANAACCCGNTNTTCTNTNTTCCC 

NATGGGGGGNGGNGNNTTAAAAACNCCTNTNTTGGGGGNGTTTTTAAAAAANCCCCCCNN 

ANAAAANTTTTTNNAANGNGAAAACCTTNGTGNTNTNNTTGANANCCCCNCCCCCNCTTT 

TTTTTTNAAGAGGGGGGGGNNNNCAAAAAAAAAA 

Sequence 604 

CCGGGCAGGTACTGCCTTTTTGGAGCTCCTCACGCATCACCATTTCTCTCTACCTATGAT 

TCCTTCAGCTTTTGTTAATGCAATTGGCTCGC I I I I f GTTTTGTTGAGTCTTGATTTAAT 

TTCCAGGCTTTGCTCTTCCAGCTTTGTCCAGAGACACATCCTGTCCTGTCCTTAAGGACT 

TCTGCAAATCAGTAATCACAGATGTTGCGTGCAGTTCACTGATATCTACTCTCCTCGTAT 

TTTGGGGTGGCTTACTGCATACCCACGAGTMGAACTTCTTTGCAATCAACAGGCAATTT 

GCTGGGGTGCTGGGGTTTGTTCCATTATCAAACAAACTGGGATTCCTACCCCAGTTCCAT 

CACTTACAGGTTGTGTGTGACCTCGGGATGAGTTGCTTAACCTCCTGAGCCTAGNTTTCT 

CTGCTGAAATATANAAATCGGCAGCATNACAAATTTTTTAAAACTCNTTACTACCTGTTN 

ATTCACTTCANGTTTATGAACTGGGGGCCCTGGGGGGTACCCTTCGGCCCGCCTCTANAA 

ACCTAANGTGGGANTTCCCCCCNGGGCCTGCAGGGGAAATTTCGAATATCNAAAGCTTTA 

TTCGNANACCCGNNCNAAACCTCCNAGGGGG 

Sequence 605 

CCGGGCAGGTACACTCCAGCAATCCATTCTTTAGGTCCTGCTTCCTCAGGAGCTTTATAT 

GAGATCTTCTCTTTGAGAAGTGTGTCTTAATTACAAAGTCTCCTTATTCTTTATGCCTTG 

AGTCTTGGTGCTGGTTTTGATTTTTCCTCTGCCTATTCTCCTCTTCTCCTGACTGTTGGC 

CTGCTCTAGGCCTTGAGGTCTAGCTGTGCCATGATTGATAGCTTCACTCTGAGCCCTGAC 

ACTTCACATGGTTGCTGGGATCCACCAACTCTCCCAGGCACTAAGGACTAGCACTTTCTG 

GAAAAACAACAGCATGCAAGACTTTTTAAACCATGGGCTT 

Sequence 606 

CGAGGNACT1 I I I 11 II II I M I I I I II I I I 1 I GGGACGGAGTCTCGCTCTGTCGTCCAG 
GCTGGAGTGCAGTGGCNCGATCTTGGCTCACTGCAAGCTNTGCCTCCCGGGTTCACGCCA 
TTCTCCTACCTCAGCCTCCTGAGTAGCTGGGACCACAGGCGCCCGCCACCATGCCTTTTA 
AAAAATNAGCCAAGGCACAGTGGCATGTGTNTGCAGTCCCAGCCACTCAGGAGGCTGAGG 
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CAGGAAAGGACCTGCTTTGAGCCCANGAANAAGNGGGGGCTNCAATGTCATCCAACTGCN 

ACTCTGGCCTGGGGTGACAANANNTGAGACCCAAGAAAAAGGAAAAGGAAAATGGTTAAT 

GNNGGTTATAATANTTANGATTTACCANCCCNCCCCNTTATTTAGTTTTAAATTAAATTT 

AAATTACCTAAAATTTAAAATAAAATTTAAANTAATTNTAANAANGGNAGGGAACAAGGG 

NNAAAAANNAATGGGGAANGGGGGGAAGGGGGGGAAGAACTTATGNAATTTTAAAAAAAA 
AAA 

Sequence 607 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGA 

CTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGAC 

AAGGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCT 

CTTGATCTGCAAATACGAC7TCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCT 

GAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAA 

GAGACAAGGAGAAGGCCTCAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTC7TCTGTC 

CCGCTGGAACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAA 

ATACACCAGTGAGAAATTCCTTGAAATTTGCAAGTTGNCTTGGGTTCATGTCTAAAGCTT 

GTTCCANCTATCCAGNAATGCCCACAAGAATT 

Sequence 608 

AGGTACATTATTATTACTCACATGCTGTCATGGAAACAGAAGCCACAGTGGTTGGGTCCT 

GTCCAAGG1TAATGGAGGAAGAAGTAGGGAAGCTAGGATTCAGATTCAGATTTCACTATG 

TAMGACCTGTTTCCATTATGCCACATTACCCCTCAGGTTTTCTGAGGTCTAGGTTGCTT 

CCTATCTTTTCCTGTGTGTTCTATCTTCCTTCCATCCTCATGGGCACTCATGGGATACCC 

CCCTCTCTGTATAGGCATCAGAGTCCATTCAAGTTTCCAATGGTCCTGTGGTCTCCACAA 

CCTGGAGGTGCAGATCCATTAGAGAAAGGAGTGCAGGTTNTTGATTGGACATAAGTAACA 

GGCCCAGAATCACATCCCCAAATGTTCAGATCAGCCCCAGNATCNCCAGGGCTTTCCAGC 

CTTGAACCA ATGG GTANGGAACTCTCTGANATGCTANAAACTTAAGCCAGCAACCTGTTN 

~*~y^*"y^~ *"^^^^"n~TTN CC AAG T*C~I~1~N NTTTAATC AAAAQCTTTTTQG G N TT"CQQ"I~I~TTGGC5 N 

TTrTTGGGNTTTAAACTGGNNACCCAGGAACCTTGCCCGGGNCGNGNCNNNTTTTANNAA 
CTAAGTGGGNTCCCCCCGGGGGCNTTGCATGGGAATTNCTGATATTCAAANNCTTTATTC 
GATNCCCCGNCCGAACCTTTTGANGGGGGGG 
Sequence 609 

AGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAATCCAATTATGTCCGTCTTGAGTCTGA 

GGTCTTGCTTCTTATGTAGTATTTCCTTTGNGAGCTGAAGATTAATGCATGGATTCACCT 

CCTTCANCAC ATTT CATTTCAATTGTGAAGAAAAGNATTCCAGGCTCTGAATGTAAAATT 

GMCATGACATTTTGACATTCCTTCTTNTGAGAGCTGGGTTGGTCTTAGTTGCTGGTGAG 

GCTNTTAGTACACCGGACCATACAGGGNCGTGGGGCTGCTCCCTGGGACATGGAACCATT 

NCTCCCGAAGTTTCTCCCCCANTCCACTTTTTAACCNNGCTCCCCCCGGCGNTACCCTTG 

CCCCNNGNNCCGGGCNCCGGCTTTNCTANNAAACTTAAGNTGNGNATNCCCCCCCGGGGN 

CCTGGNNAAGNGGAATTTCCCNNNTTANTTCAAAGGCCNTTTAATTCCNAAATANCNCGG 

GTCGAACNCATCTGGAGGGGGGGGGGGGGGCCCCCG 

Sequence 610 

AGGTACATCTTCCTCCATGAGTGGATCAAGAAATATTGAGATGGCTAAGCTGGAAATTAC 

AAAAATGTTGGCATACTATTATAGCATATGGTTATGAAAATCAATGMTATTTATATTTA 

GTATTATGCTTTAAGTATATATCCAAGCTGATCAATTAATTACACTTTCACTGATGAGAT 

GTCAATTCTACATTTAGCTGAACTCTCATCTGAACTGTGTGGCTTCTCAATGACAGGGCA 

TATTAAAGAACATGATGAATGTTTGTAATGTAATGATATAAATTATTATAATGTGTTGCA 

rrAMGACATATGGTGTAGCATTCTACGTTCAGCTTTTGCATTTATTTTCTCAGTGTCAT 

GATTTGCTCCTGTGATTCCCAGATCAG-mTCTCCTTATCAGTCAGCATATACATATTGG 

TATTAACACTTTTTG CAAAA ATCATATATAAATGGTTGTAGAATGTTGTTGTCATTCTAC 

CAGCAACTGGTTAGTTTTTGNTCANCGATTAGCTTGNTTTTCATTTATTTTAAAGAATTT 
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CAGGGCCCAGTCCGTGGGTGGGCTCACCGCCTTGTTAAATTTNCCAGCCNCCTTTGGGGA 
AGGGCCGACCCCCGGGCCCGGATCCACCAAGGGTTTAGGGAGGATNCCAAGACCCATTTC 
TTGGGCCAANTNCCGGNGNAAAACCCCAATNTTTTACTTTAAAAATTACNAAAAAAAAT 
Sequence 611 

NCCATAGTGGGAGTTGGGGTTGTTTACCCNACACTTCTAAGGGGATTTTCACAATCGGNT 

AANCCCCCGTTAATNCTCACATGGTTATTGCGGCGACCCCCATAGCCCACGGTCCCAAAC 

GTGTGCATNTGAATCATCCGAAGCTCTTTCTAAAGGCAGGCTCCCCTGGCCGCCTCTCCT 

AATCCGACTCGGTACCT 

Sequence 612 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTCTTTTCCTCCTTTC 
ATGCCCGACATATGCCTTTTTTCCTACTTATTGTGTAAAGTTCTTCATTTGTCATGGCCA 
TCGTGAGAAAAATCTGGGGCTTCCCTTTCATCATAAATGACAGG I NUN \ CTGCTTCC 
AAAATCTAGAATTTTCATATTTGTAACACCTGGAGGTAAGGGTCATGGTGTTAATTATAA 

AGAGAGCANAGAGAGTANGCATCTGTAGGAAACATCTNGGGTTTTTTTAATTATTTGCTG 
GAT 

Sequence 613 

AGGTACTGCAACCGTAAGTGGGGTATATGGTTTCACCGAGAATGTGACATCAGAGCAAAC 

ACTAACATCAGCAAAGGAGTATCTCAGAAAAGAGCATTTTAGGCACAGGAAGGAAGAAAT 

ACAAAGGGCTATGGGCCTGTAGCTGGCGTTTTCAAGGGACCACAGAGTGAGGGGAGAAGG 

AGGGACCAGGTGGTGGTGAGAGAACTGGTATCAGAGGAAGTAGGAATCTGTGGGTTACTC 

AAAGGACCTCGGCCTTTACAATGAGTAACATGGCAAAGTTTTTTTAATATTGTTTM 

CAAGAAAGATACTAATTTACACAGTAAGATTTATACTAAAAAAAAACAAACTATGACAGT 

TACTATAAAAGAGTAGCTCACCAGTTCAAGTTGAAATGAATTAAAAGGGGCTGGGCACAG 

TGGTAGAAAATAATTAMCAAAACATAAAAATTCCTGCCAAAAAGMGTTTATGTTATAG 

TGGGAAGACAGCAAATCCTATCCCACAAACNCCCA A I I I I II I I I I l A A 

Sequence 614 

CCGGGCAGGTACTATGGCCTCCACACTCACAGGGTCTCAAGTTGCTATGACTGGAAAACA 
AACACTATCAACAAAAGAAATTATGGGAAAGAAATAGACTTCAAAATCATCATTTTTCTG 
TCAATAGAATACTATGACAATCTTGTCAGGCATGAATATAGGAAGAAGATCGACCGAAAA 
TTACGCACATACATTTTGTTCAGGAAGTTGMGTCCTTT^ 

CCAGAAAGATCAGTTCTCTTTTGGGGTTTATAGTGGACGGCTTATTAAAAGCTACTTGCT 
CTAACATGATTTCTAAGATGTGTATTCTTTGCAGTATCTTTCAACTCACCTTCAGCTGAA 
GTAAAAACATCCCAAATAAGGTCAATTTTCATGTTTTTATTAATAA 
Sequence 615 

AGGTGGGAGGATCACCTGAACCCAAGAGGTTGAGGCTGCAGTGAGCTGTGTCTGTGCCAC 

TGAACTCCAGCCTAGGTGACAGAGTGAGACCCCATCTCAAAAAAATAATACTAATAGTAA 

TAACC AC AGC ACTTAATAAAAATAATAGC AG CTAGTATATATCTAGC CAC ATACTCTAC A 

CTACACACTGTTCTAAACACGTCTCAAATATCAACTTACTGAATCCTCAAAGCAACCCTC 

CGGTTCTCATTCCCATCTTATAGCCGACAAACTGATGCACAGAAAGCTTGAGATGCTTAT 

GAATAAGGAGTGAGGTTGGGATTCTCACCCAGGAGGCGGGCTGCAGTGTCTAAGCTCTTC 

AATGCAGGGCCCTAAAAAAGTGCCCAAGAATATCCACTGAGAGACACCCCANAGCTGCTC 

GACCCTTTTAGGGCCTCAAGGGTGCCTGAGTTGATTTCTGTTGCTTGCAACCCAAGAACC 

TGGATTGGTAAAACAGATCACTTTACTGGGTTTTACACTTTTTTGCCAAACTGGGCCGGG 

ACTTGCTTTCCAGTGCTTGCAATGCNGGGGGCATNTTACCAAATTTTTACAAGTTTGGGA 

GTCAGTGGATNNGGGGCATTCAAGGGCAANCTTCATTTTCCAANAACAAGCCATTTTTTC 

CC 

Sequence 616 

CCGGGCAGGTACTCAATCTGTGAAGTGTGTTCTCACAGTGTCCCACACCTCCCNGNGGGG 
CTCACCTGCCCACAGCAGTGCTCTCTGCTCTTNAGTCCACCTAGATACTCTCTTCTCTTT 
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GATCTTGAAGNCTCCTTATCCTGACCCATCATTCTTGGAACCACTAACCTATAAAGTAAT 

TGCAAATCANACTCCTGTTTCAGGGTCTGTTTCTGAGGATCCTCAACTTCANACACTNTA 

AATTNTTTTCATGGTAATGGTAAGGAAAANGCATTATTTTATCTGAAATATTCATT^ 

CAATCTTTTCATTAATTATCCAATTGTTGMCAGAAAAAAAATTGCTTATTACT^ 

AATGCNCACACATAT TNAATA TCTGTTTAGAATCATAGATTTCTGGCAAGTCAGCGTNTC 

C ATAATN GTAAACANATTTTTCTAACANAAAAAAATAGC TTTAG AGTTTCCCTC ACCCTA 

TTCATCCCTTAATATTTCATACAT^ 

CCCAACTGNAACCTANGGAGGCTTTNAAAATTA 

Sequence 617 

AGGTACCTTCCAGC TCAGT GGATCAATATGACTTTATGAAGCAAAGTAAGGGAAAGAGGT 

CACTAAAAACAACGATTTTATACATGAGMGGTGAAAATTCAAAAACTCCCATTTTTTC^ 

ACATAAATGGCTCATGTCATGTGCTANGAAACATAAAACCCTAGACTATCTTCCATAAAT 

GAAAAGACTAAGGCACAATTCTGTCGTCATGTAATTCAGTCTACTANAAGAGACAGACCC 

AAAAAACAAAGAGCAATGCTACTGAGTGATAAGAGCAGTAAGGAAATGTGCTCTCACGGT 

TGCAGCTGCTATGGGAACATAGGAGAGGAAGGCATTTTTCTCAAGAACAGTGGGTAGGAT 

AATCATAGGAAAATTACAGAGAAAGTGACATTTGAAA 

Sequence 618 

CCGGGCAGGTACTTTTAGAAGAGACAGGGTTTCACCATGGTGGCCAGGCTGGTCTCGATG 

TCTTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGC 

CACCACACCCGGCCTGAATTGCACTTTTGATGGCTGAGCCAAATGCTCATTCTATTTAAT 

ATTGCACAGCCTGAACCAAAACCCGTAACATTATAAAGGAAGAGATGAGAGCCATTTCAA 

ACTGTGAGAGAAGAGAAGACATCAAGCAAMTCTGGGGGTTTTAGCCAACAGACTTCAGT 

CCTGGGGCCTTGCAGCAAAAAAACGCCTCCAGTTGCCCCAGG 

Sequence 619 

CCGGGCAGGTCGGAGGAGGGGAACCAAGGATGCTCATGGAGTCAGCCCTGGAGATTGTCA 

GCAGACATTGTTTCTTTGGGGTAAACAGGAAAAGGAGAGCAGTGGNTGGCACTATGGAGA 

GATGGGGTAGAATGGAGGACAGAAAAAGGTGTTCACATGGAAGGCAAAGACTATCTAANA 

AGCTAAGAGGAACAGAAGTGGGGAAGACAGAAAAATTTGCCGTGTACCTTN 

Sequence 620 

CC GGGCA GGTACCTGCTCAAGATGCCTGGCTGAGAGTGTGTGGGAGTCTGGTTGGAGCCC 

AGATTTTATATATTTATATATTTTATTTAGCTTTGACCAAAAAGTGAACA^ 

GTCATGGTAATTAAAACTAACAGATAATTTATCTAAAATAAATCTTGATCTGACTAACAT 

TGGTCAAAC ATGAC GTGACTTTTTACTCCTTAGATGCACCTTGTGACATAAATAGAATCC 

TTCCCTTGCATTTTATAAATGGGAAACCAGAAGCACCAGGAGATGAAGTGAGGGAACTCA 

NCAGTGGGAGGACCTAAACTGCAGCCCAGTCCAACAGTGCCAGTCGAAACCTAGTTTTCC 

AATGAAGGACTGCTTTNCTGTCCCTTCGTCTCCTCT 

Sequence 621 

CGAG GTACCTGAGAAAGGAAAACAAGGTGAGTCCTATTAATGCTTCCCAGACtGGCCAGA 

TTTTCTCACTGCAGCCAGCCCTATCTCACATGTATTAATGTTATAATTTCACATACTGTA 

TGATGCTATCCTTTGATTAGCGTTTAGTTTGATGGGAACTCCAAACCATTAAAGAAAATT 

CAGGCCAGCCGTGGTGGCGCCGTGCCTGTGGACCCAGCTACTCAGGAGGCTGAAATGGGA 

GGATGGCTCAAACTCGAGCAGTCAAGTGAGCCGTTATTGTGCCACTGCACTCCAGCCTGG 

GCTGTAACAGAGCAAGACCCCCATCTCTAAAGCAGGAAAAAAAAAAAAAAAAAAAAAAGT 

ACCTGCCCG 

Sequence 622 

GGGGCCATNGAGACTGCCATGGAAGACTTGNAAGGTCACGTATGCTGAGACTTCTGGAGA 
GACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAANGAGAG 
AATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAG 
TTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTT 
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TCACCTTCCCTGGGATGTCCCTGGACAAGANACAAGGAGAAGGCCTTAGGATCTACTGGG 
GGA 

Sequence 623 

AGGTACCCCAAAACCAGGGGCTGCAGCTCCAGCAATTGGGAAGGCCAAGTTTTCTTGCAC 

GCTCCCCAGAAAAGGGGCCAAGTCCATGGGGCTGAGCAGTGATAGACTGCAGACCTCACC 

ACCACTGAACCTTGTAGGATAAGGCCCACTAGCCTGGGATGCTAGTGACGCCACCCTAGT 

CCTCCTGAGTTCTCCAGCTGGGAGCAGCCCTACACTTCTCCGGCATGCAGiCTCCCAAAGA 

GAGAGGCAGTCCACCTTTTTGCTGTCTCGCAACCCTGCCTCCTGCTGCTCTCAGGCTTGG 

GAGGGTGCACAGCAATTAGGGACTATCACAGAACCCCAGCACAGNGCATCTGGTGAACTT 

AAAAAA 

Sequence 624 

CCGGGCAGGTACGCGGGTGACATACTTTTGGGCCACATTCTTACAATAATTAACTGGAAG 
GAATTTAAAGMTTCATAGTATTCTATTCCATTTTAAAAAATAATCTAAACAGACAAACA 
TATTCAGTTAAAAGGTGACACTGAGTTAAMGGCTACCTAATAATTTAGGAATTATTTAT 
CCTTCTCTTCTTTGGCCTACTTTTGACATATCT^ 

ATTACATACAAGTAAATGTGTATTGATTAAAGGTGAACAANGTGATTTATCTTATGACTA 

TTGAATTAAGTATTAGAAACTTCATGCTACAAATGCCAAAAAACTAANATACTGGCCAAG 

AANTTTGGAAATACAGGNGGGCCCTGGGNACAGCATGTGTNTGAACTGTGCTGGNCTGCT 

TATTCATGGGAATAAACATG 

Sequence 625 

AGGTACTTTCAAAGACTTTAGAACCAAATGTCATTCAGCTGAAACCATAATGTATTATAA 

ACTACGTCTTCTAGAGCTTTGTTGTCATAGAGTCAAATGAGATGCCAAATGCTAGCAATC 

CTATTTAGGAGGCCTACTATTAGGATTTAAAAGAAGTGGACTGCCAGAGATGAATTATGT 

GACACAGATGCTAACATCTAGTTATTATGAGTGATCCATTCTGTCATGAGGTAGCACTTG 

AAAAACAACCTGGGACAAAAGAACAGGCCTCTGGCACTACTGAATTTTTGGAGAGAGAAA 

CCTTTA GAAG TTAGCATGTGGTTCTTTCTGGACATGTTCATTGCCTTAGTTCTCCTTTTG 

CAAATATTTTGTAAATATTCATGAAGTGAAATGTAGATGTTTGAAGAAGGCAGATCTGCA 

GATCCCCTAAAGAGCATTGTNAGTTATGCTGGGCTGTTCCTACCCTATCCCTCTGNTTAC 

CTGCGGCGGCCTCTCAAMNCGTGATCCGTGGTGGCTTTTGTGGGGGGTTTTAAATGTA 

Sequence 626 

AGGTACAAGGAGACCCTGGGCAAGCAGCTAGCAGCCCCCTTTCACAACCCTGCAGAGCGT 

CCTGGAGGTCCCAGCAGCCACTGTTCTTCCTCCTAAGCTTGCCTGCCATGGCAGCATCCA 

CTCCACCCTTCAGTGCTTGAAGCCAAGTGGTTGTCGCCCCGGAAGGGTCTGCCCTTCTGG 

TTTTAGACTGCTGTGGAGGAGGGAGCCACTTGGAGATACAAAGCCCACCTCTCTAGAATC 

TTCCAGGGTCCCCACAAATTATAAAACTATGGNNACACCAGGAAACCAAGGCA'TnTCAA 

TAACAACCTTTCCTACTGATTTCCAGCTTCATCTTCCAGAANTCCAGNGCACTGCCATTC 

ATACCATNCATTAACTGTTTCTCCTTTCACTAGACCACGAGCAGGGNNTACAGCCACACC 

CGAGGAAACCCATCTCATGANGTNTAACTACTGGACCCGCCCCC 

Sequence 627 

CCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGTAGAATAGGACGTGAGACAAGTGCTCT 

AC TTGAM TAAGCACAGACGTGGMGMGGGATGTGTAGTTTTGTCTGGGGATTCGGCAC 

AGTTTTTATGGAGGATGTGGTCTGTGAAATGGCTGATTTGAAACATGAAGGAGGCTGCTT 

ATGGGATGGGTTATTNTGGGAAGCAAGCAAGTGCTGAGGGCNCCATGAAGGTATGAAGGG 

CC7TGGAGATACCTCGGCCGCTCTAGAACTANGTTGGATCCCCCNGGCTTG 

Sequence 628 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAGGGCCTTGGCCTG 

AGCAG ATGTG TGGCCCTCTTGGAATGAACTGCTACATGACATGCTGGTGGTGTATTTGGT 

CATACCTTTTACCATGTTCAGGCTGTTTGTTTATTTG 

ATGAGACTTTTAATTTCATAATATGCTATGTTTCATAGACTGACTGATGATAAACATAGA 
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CTGATAATCATTTTGCAGATAAACGCAGTCMCMTTGNMTTTAGTTTTTTATAAAAGT 

TATTATTTAGTCAGTAAATTCAAGGCTTGGACAAAAATAGTTCTCACTATGTGTGTGTGC 

CCCTCAGGGTGTAACATACTTTNGGACTCAGTCTTTTTATTGGTTTGTTTTTGTTAAGT^ 

NGATGAAAAGCTCAANTAGAGCAGGCCCCTCAGCAAAAAGTTGTGGATGTAAAAGAAAAT 
TCTTGGGCC 

Sequence 629 

ACGTACCATAGTGGGAACTGCTGGGCCCCTAATGACNAGAGAGATGGATGAATGGACCCC 

ACTCCCATTTCTCTTCTTCTCTCCCCTTCTNTTNTCTCTCCTTCCTCCCTTCCCCCAATG 

GCCAAGGACAACACTAAAATCAGTATCACAAGTCTGCTGTGGCACATGGCCATGGAATGT 

GACCGTCAGAGGCAGCTCAGAGAGGACACAACTGGAAGAACAGGTGGTGTGGAATTTTAG 

AACCACTGCCTTAAGCATTCTGCTCAAAGTCCCAGAATGGNTGCTCTGATGGGAGCTTTT 

TGAGANATGATCTCTCCCTGCAAAGAGAATAANAAATGGGTGAGGATGATAGTATCCTGG 

ANAAGCTNTAACTTTGACAAGCTTTAAGGGGTCTTAAAGCAACCAAGTCCTNCTTTGACA 

GAGAGGTCAAGCTATTTNACATCTGGTTTGCCCAAAAAACTGTGCCTGTGTTTTGCTTTA 

AGACACAAGGGAACCCGCNCGCTCTAGACTAGNGGGATCCCCCGG 

Sequence 630 

AGGTA CCAGG AAGCCCAGCTGTGCTGTAGTTCCTGCTCTTAGATTTCTGTGGGATTCCTC 
ANAGTA I I I I IACAACATCGCCTCATCTTGGGTTTGCCTGATCTGGGCTGTGTCCCCTCT 
ACCATATAAGCTGTAACGAACATACTACAGTAGGCAGATCTCTCTGCTGCTTAAAATAAT 
AAAACAATGGAGGCGACAAAAAAAAAAAA 
Sequence 631 

TAATACGACTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAG 

ACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGC 

TTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTC 

ACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTG 

CAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGG 

ATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAG 

CAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAAGTTCCTTATGCTACTTTCAC 

TGAGCATCCTATGAAAATACACCAGTGAGAAATTCCCT 

Sequence 632 

TACTATAGGGCGAATGGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTNCAGGAGACA 

GGTCAGCAGCTGCGGTCAGCANAGGCACACACGCCAGCCCCTGGAGATGGGTGGCTTGTC 

CTGAGCCAAAGCTGGGCTGGTTTATAAACAGCACCTATGCCTTTTCCATTTGAAT1TTTA 

CAGCTACCACTTCCTCCACGAGATGACTTGCTCTCTGGATTGGAGGTGTGTCTGCATTTC 

AGGAACAAGCAACT7TTNTCTGTGGCTACAGGCTGTACCT . 

Sequence 633 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACAGTTAACCACAAAACAGGCCTCTCTG- 

AAAAAGCCATTGCCATGGACTGCCAGACAGACAATGACAAGACACAAATACCTTCTGGTG 

TGTGAGCCACGGGACATGTGAGCTTCCCGCTGATGCTCCTCTTATATTAAAGATCACTTT 

C ACAAG ATGAGCGACTCAATATCTTTTATCAAACCAATGATCACCTGCAAGCTATGGTAT 

AM I I IGCAGCTGTGTAGAGCTATGTGGCATGAGAATGTGGGACTTATAAATTGCTGATC 

CAATAAATAGACATTATGGGCAACAGTGTCTTATCAGCTAGTGTGTACCTGCCCG 

Sequence 634 

AGGTACAATTGATATAACAGCTTTGTTGCCACCCCAGGCTTTTCTGTGATGATGCCATGG 

GCCACATTCTGATCAAACTGCACACCCAGAAGGTTAAGTGTTGGCTCCAAGCGAGAAAAA 

TTATTAAGTTTGGCACTTGAAACCCTGCTGTCCAAAAATTCTGAAAAATCATCCTGAAGT 

TCAAACTTGTGTAGAACTTCTCCAAGTAGATAGCCACTGGAAAATGCCTTTGCAAATGAC 

TTGGGACTCACGGTCCGGGACACCTTCAACTCCTTGTTGAGCCACTGGCACAGGATCTCC 

GACATGGTTCAGCCGGTGCCTCAGACCGCAGGCGGGGGCTTAGAAACTCGCCAGGAACCA 
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AGCCTGCCAGCCCGCGTCCTCCTGTAGCCAAGGGCAACCCTTTGCTCCCGCGTACCTGCC 

CGGCGGCCGGT 

Sequence 635 

NCGAGGTACAAGGGGT7TTCCTCTCTGTCTCTTCCATCACGGGATGGTGTCTAGGCACAT 

GGTAAGGGCCATCTTGTTCTTGCAGTTGTCACTGGTTGGTTGAGGGCTCTTATATTGTTT 

AATGTCAAGCCAGGTCTTCTGTGGTTATTGAAAATTAAGTGTCTTTTGCAGGTCATTCTC 

ACACCCCTGTTGATGATCTTCACTGTTGCTTCGAGCCATAAGCTTACCTTTCACTGTGAA 

CTCTGGCCCCAACCCCAGACCTACTTCTGTCCACTAATCCTACAGCTTCCCTTGGTACCT 

GCCCG 

Sequence 636 

AGGTACTCTGCTATAACATGAGATCTGTCTCTCCAGGCTCTGGCTTCCTCAAGGACAGCT 

GCCACCAGGTCCTATCCATCTTGTAAGTCCCCCCGCGTACCTGCCCG 

Sequence 637 

TTGGAGCTCCCCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCTACGATCCTCTGATTT 

GGGGATGGGGTTGGCGCAGCGTTGGCTGTTATACTTGGCCACATATTCACATTGCTTGAA 

GGGGGCAGTCTTGTCCTCCAGAACGTGTCTGATCAGAAGGCGTAGCCCGTTGAGTCGCCT 

CTGCTTGCACAGTTTGGGGCTATATGAGCACAAGGGCTTATTGTCAACCTCAGAGAAGTG 

TATATGTTTCCCTTCATACATCACGTGACTCTATTCCTTGTGMCATCAGCTGATTTTAA 

ATCTTCTGCACCATAGCAATGTTAAGAGAACCCCCAAGGATACCGCAATTTGGAATGCCG 

CAAAATCAGGATGAAGCAGATTGTCTTTCAGCTGC I I M 1 I I I 

Sequence 638 

CGAGGTACCAGAGGGGATGTAGCCTGATGCCCTTATCAACAAAGTCAGGGATGGTGGCAC 

ACAAGGTTTGACTACTGCATACACGGTCACAGTGCTACTTCCAGATGGCCTGAATTCCCC 

TGCTCTCTCTGGTGGGGAGAAGGGCTGGCAGAGCCATTAGCATGAGCTTCAGCCAATCCT 

GGCCACTTTGATGCTCCTGGTGCTGACCCAGGGTCCTGGAGGATGGGCTGAGGTGGGGGG 

GTAGAGATGTTCANGGCANCAGCAGCTCCTTTCCATCCACACTGGAACTATTTCAGTATT 

TCACCCCAATTCATCTATTCCCTTGTGCGCTGACTGAACATCAGCCCTGCTCCAGGTCTN 

AGTTTCCCCTTTGTAAAGGGAAAGCTCTGGATTCAGGGGTGGTGAGAGGTCATCATGGTC 

TTGAGATTTCAGGCCTGTAGGCANGGGGGTGAGAGGTTNACTATGANTGCANAATACCAA 

TGCTGGGTANAAGCAAANGAAAA 

Sequence 639 

GGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAATCC 

AATTATGTCCGTCTTGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTG 

AAGATTAATGCATGGATTCACCTCCTTCAGCACATTTCATTTCAATTGTGAAGAAAAGAT 

TCCAGGCACTGAATGTAAAATTGAACATGACATTTTGACATTCCTTCTTCTGAGAGCTGG 

GTTGGTCTTAGTTGCTGTGAGGCTCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCT 

GGACATGAACATCCTCCGAAGTTCTCCCCATCCACTTTACCCCCTCTTGGGCTCACCTNT 

GAATGTCCCCGCGTACCTGCCCG 

Sequence 640 

AGGTACACACTAGCTGATAAGACACTGTTGCCCATAATGTCTATTTATTGGATCAGCAAT 

TTATAAGTCCCACATTCTCATGCCACATAGCTCTACACAGCTGCAAAAATATACCATAGC 

TTGCAGGTGATCATTGGTTTGATAAAAGATATTGAGTCGCTCATCTTGTGAAAGTGATCT 

TTGATATAAGAGGAGCATCAGCGGGGAAGCTCACATGTCCCGTGGCTCACACACCAGAAG 

GTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTCCATGGCMTGGCTTTTTCAGAGAGGC 

CTGTTTTGTGGTTAACTGTGTGTACCTGCCCG 

Sequence 641 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCGTGAGATATAAGGCTGATG 

ATAAGGATTGGGGTAAAGAGTTGATTCGGTCACACTTGACTGCCTTGAAATCTGTCTTGG 

CATCGGGTCCTCTTCCTCCATCGATAGGTGGTCAGTCTCAGAGGGTTGGTCATCTCCCTC 



Page 107 of 869 



WO 01/51628 



PCT/US01/00798 



Table 2 



TTGCTTCAGAGACACAGCCTGGAATCCTTCAAAACTCCCAATTGTTGACATTTCTTCAGC 

CTGACTAGGCTGGTCCTCAAGGCTCTTCTTTCTCTTTCTGCACAAAGGGGGCTAGACCAC 

AGATAGTGGCCAGGAGAGGTGTGACTGGCTCATATAAAGCCAGGCAAGACAGTGCCCTGT 

GCTGTGACCCCACTCAGTCTGAATCTCACTTCACCAAGTACCTGCCCG 

Sequence 642 

TNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCTCATAAACTTTG7TCATTCAGCC 

ACTTGAAATAAGCCACGTGGGTTATGTGCCACGCAAAATTTCAGTAATGGGTTTTGGGAG 

TGTTTAGATAGCAACAACTTCATANACTGMCCTTrTGGTTGTTTTTCTTTCTCTCCAAC 
TCANAGGTTAAAAA 

Sequence 643 

ATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACTTATTGTATATTTCTCACA 

TTCmCAGAAAGTTATTAAMTGGGCTTTACAGTTGCGTTGCTTTTITTTAAAGAGAGA 

AGATTGAAATTTATTTTGCAGCTATACACTTGGTTTTGGATTATTCATGTGTATAAAGTT 

ATTTGAGCTGCTCCACTTCTCTGACCTAAGGCTGTGGCTAAAATGGGTTTGGAGAAAAAA 
TTCTTGCCTGTCCT 

Sequence 644 

AGGTACTAGTAAGTTCTGGGTAGGGGGTGGATGAGTCCTAGGGGTGGGGGGCCCCAGGCA 

CACAGGAACCATAGTTAACCACTCTTCACCTACAGAAGACTCCTCAGGTCAAAGCTTCAC 
TGTACCTGCCCG 

Sequence 645 

GAGCTCCCCGCGGTGGCGGNCGCCCGGGCAGGTACGCGGGGACGTTAAGAACTGGACTTA 
CAAGATGGATAGGACCTGGTGGCAGCTGTCCTTGAGGAAGCCAGAGCCTGGAGAGACAGA 
TCTCATGTTATAGCAGAGTACCTCGGCCGCTCTAG 
Sequence 646 

GAGCTCCCCGCGGTGGCGGCCGAGGTACTGCAATGAAGAAGAGGAGAAAGGTTACTTCAA 

ATCTTGAGAAGATCCATCTAGGCTATCATAAAGATTCTTCANAAGGAAATGTTGCAGTGG 

AGTGTGACCAAGTGACCTATACTCATTCTGCAGGAAGACCAACTCCTGAAGCTCTTCACT 

GTTACCAGGAACTTCCTCCCTCTCCAGATCAGAGAAAGCTTTTAAGTTCTTTGCAGTATA 

ATAAGAATTTGCTAAAATATTTAAATGATGATAGGCAGAAGCAACCATCTTTTTGTGATT 

TACTTATCATAGTGGAAGGAAAAGAATTTAGTGCACATAAAGTAGTCGTTGCTGTCGGCA 

GTAAGTTATTTTCATG CGTGT TTGAGCAAAAATCCAAGCACTGATGTTGTCACCCTGGAT 

CACGTAACACATTCAGTTTTTCAGCATTTGCTTGAATTTCTTTACACATCAGAATTTTTT 
GTGTACCT 

Sequence 647 

CTNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACGCCAGATGCTGTTTACACATAG 

AGTTTTGAGCTGTTTCTCAAGGAGGGGCATTGGTGGCATTTGGGCANAATGATTCTTTGA 

TGGGCAGAAAGGTCCCACCCTTCGCAGAATGTCTATCANTAAACTGCAATCATTCTCGCC 

AATCTTTAGGGTGNATGGGAAGAGTAATGCCCAAATGAGANCCCTCCCANATGTTCATAG 

CAACGGATATGAAGTCATACAGNGCGGGNGTGGTTTTTATTCCATCCATTCCTCTCCTGG 

GGACCTTAGACAAGTGATAATGCTCTATGCTTCAGTTTCCCCCCTTGTAAAATGGTGGTA 

ATTCCCGCGTACCTGCCCG 

Sequence 648 

CCGGGCAGGTACCCAGCATAAAGCCCTGCTCTCAGCAGACACTGATATATGTTTATTGAA 

TGTGTGAATGAATCTAGCTTCATCACATTCGATGCCATATGAGAAAATCATTTTAGGGCA 

ATACAGAGTTTATAAATGAATTAAAATTTGAAATCAAATTTCCAAAGATTTTATTTTTTA 

AAAAATGTCAAATTCCTGTAAAATCTTACCCTTGAAAGTGCCCTTGAAATGCACTTTCCC 
CCAGGTGTACCT 

Sequence 649 

CGCCCGGGCAGGTACGCGGGGATGGTAGGGACTCGGTTGCCGCTTCTCTCTTCTACTGTC 
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CCAGCTGTCTCTGTTTmAGGCMCTTTCTGTGGCTTmCMGAGTGGAGATTTGGCT 

GGAGCAGGATTAGGAAGCCTNCTGCTACACGCTTTCCCTGAGAAAAACCTCCTCTCTGGA 

AGGGGCCTGAGGTGAAAATGATTCAAAAGGCAGCTGAGGAGTTCTGGAAAGAGCCCCAAA 

GACCAGGCTCTGAGGCCTCCCTGGGCCACTTACTAACTAAATGGGAGANGATTACTTGAG 

AAAAAGCCTACANCAGTGCCCANCATGATACCCCGTTGTGAATNACCATTTGGAGGTAGT 

CCGTCCCCACCTGGAGTGGATCAAAGACAACAGAGGCCAACGGGGAAAAAGTTTGAGCTT 

TGCCANGCTGATATTGGCTGCTGAATAAGGGTGATTTTCTCTTGTAAAATGGCTTGAACC 

CCAAAGCTATAGCACAAGCAAGCAGGGTCATCANAAGCCACTCCTTCTTCTTGGAAACCT 

GCAGGGGAAAGGGAACCCC 

Sequence 650 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTAGCTGATAAGACACTGTTGCHCA 

TAATGTCTATTTATTGGATCAGCAATTTATAAGTCCCACATTCTCATGCCACATAGCTCT 

ACACAGCTGCAAAAATATACCATAGCTTGCAGGTGATCATTGGTTTGATAAAAGATATTG 

AGTCGCTCATCTTGTGAAAGTGATCTTTGATATAAGAGGAGCATCAGCGGGGAAGCTCAC 

ATGTCCCGTGGCTCACACACCAGAAGGTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTC 

CATGGCAATGGCTTTTTCAGAGAGGCCTGTTTTGTGGTTAACTGTGTGTACCTGCCCG 

Sequence 651 

GGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAGGGTC 

ACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAAACGGATC 

CATTAAGCCAGCACCMGGAGAACATCTTGACACTTTATGGAATTTCAAAAAGTATGATA 

TGATGGCAATAAAGTGGCTCTTCAGCCAAAAAAAAAAAAAAAAA 

Sequence 652 

TTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACTTTCTCTTTGTCTCTGCCTTCCAGGCAA 

CAAGGGATTTTGGGGTAGTAGTAAGCTCTACAAATATCTTTGACCAGTAAAMCCTTTGC 

AACCTNAAAATTANTGGTTTTGGCTTCCTTTGGGAAAAACCAGTGGAAACCTGCCAAATN 

CTGAACTTAACNAGTAAAGGGTTTGTCTTTTTNAAAATGGTGGGCCTGAAGTTAAGGGTT 

CAATTTTTAANCCTAAAAAAATGAACCACTTTCTTGGTTGGCATTTTGGGGTGAACCTGT 

GCCC ATTTGGTTG AATCTTCCTC CTTTTAATAMCTTGTCTTAAAATTTTC NTTTCTNTG 

ANCACCTGGGAAGGTTACATTTGGGAAAAGTTAAAAAGCCAGGAACCCGCGTACCTTGCC 

CCGGCCGGCCGNTCTANAACTAAGTGGA 

Sequence 653 

GGCAGGTGTGGCTAAAGACTGGACTGTAGAACAGGGAGAGTATCTGGGGTTGTCCAGTGG 

GCTCAATATAGTCACAGTCAGGGAATGTTCTCTGGCCGGAGGCAGGAGAGACATGGCANA 

AGACTAGAGGCTGAGACTGCCCAACCCACCATTGCTGGAGGGGGCCACTTGGAAAGCATG 

AGAAGGAAGGGGGACCCAGTCCTACAGCTGCAAGGAATAATTGAGCCAACTGTGCATAAG 

CTTGAAGGCCTCCAGGGTGAACCCAGCACCTCAGACACTGGCCCCATGAGACTGAGCAGA 

GGACCCAGCTGGGCCTGGCTGAACTCAGGCATCTGACTTANAGAATTGTGAGATAATCAG 

TGGTATTTAMCTGCCAAGCCTGTGGTGATTTGTTGTCATGGAAGCAATANAAAACTAAT 

CAACAGGGCTGGACGCGCCTGTAATCCTAGCACTTTGGGAGGTGGGCGGATCACCTTATG 

TCAGGAGTTCAAGATCAGCCTGGCCAACATGGTGAAACCTANTCTNTACTAAAAATNCAA 

AATTTCAAAATCAAAAATCACCCGGCGTGGTGGCAGGTGCCTACCCGCGTACCTNGGNCG 

CCNGGCAGGTCTTGGGCCCCAAA 

Sequence 654 

CCGGGCAGGTACAGGGTTTGTGAGGCAGTGTGTGGGGGACCCGGCTGGATTTGTATAAAG 

TCCCAAGAAGTGCTCCTTTTCCAGAAGGGCTGCTACAATCCAATCTCAGGAACGTGGACT 

GTTATCTGAGTTCTGCAGTTTCACTTTGTTCCCAGGGCCACCCCCAGTCGCCTCCAAATT 

CTTTGTATAATTCCAGAGGCCATTCCTCTGGGCTTATCCTGCTTCCTCCACGCTGGTGAA 

TAACTCCCACCTCCTGCCCTTTGTCTCTCTATTTATGGCCCTTCTGAGTTCTGGAGAAGA 

CTTCACTGGGACTTACGTCAGTGAGCTTAACATCTTTCTCATTAATGAGGAATGAGAGAG 
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AAGGCAACAGGACAATGCCCCCATGGCTTCCACTGTAAGAAAGTTGAANACTGTGAATCT 

TAAAGTTATTAGAGCAGTTGGAAGCCACACGGTTTTAACAGTTCTACTTGTGTACCT 
Sequence 655 

GATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGGAGAATCACCTGAGCTTGGGAAGTTGA 

GGTGCAGTGAGCTGAGATCGCACTGCTGCATTCCAGCCAGGGTGATGGGAGTGAGACCGT 

GTCTTAAACAAATAATAATAATTTGGGGGTGATGTAGACAGAAATAAGGACAGAGGTATG 

AAAAGGMTAATTATTTAGCATTTTACTAGCCAGAGATTTGGGCAGTCCCAAAAAGCAGC 

TTTTCTAATCTCCCTCTCCTTTCCCAAAAAACTTTAATCAGATCCTTTAGTATCCTATCA 

TCTTTGGGTTATTTACATACTACTTTTTGGAGGGGGGATTGTGGAAAGCTATCGAGATCT 

TTGAAAGCATTGATGATTAGCAAGATGATTATTCCTGTAAGAAGCCTTAANAGGAGAAAT 

GCAAGTGTAAGTTAACACTGGCTTCTTAAATTGGTCAGTGAAAG7TGGAT 
Sequence 656 

CGCGGTGGCGGCCGNCCGGCCANGTACACACTAACTGANNCNACACTGTGGCCNATAANG 

NCTATTTATTGGATCAGCAATTANATAAGTCCCACATTNTNATGCCGTATAGCTCTACAC 

AGCTGCAAAAATNTACCATANCTNGCAGGTGATCATTGGTTNGATAAAAGATATTGAGTC 

NCTCATCTTGTGAAAGTGATCTTTGATATAAGAGGAGCATCACCGGGGAAGCTCACATGT 

CCCGTGGCTCACACACCAAAAGGNATTTGNGTCTTGTCATTGTCTGTCTGGCANNCCATG 

GCMTGGCTTTNTCANAGAGGCCTGTTTTGTGGTTAACTGTGTGTACCTCGGC 
Sequence 657 

GAACCATGGAGCTCAGCGTNCTNCTCTTNCTTGCACTCCTCACAGGCCTATTGCTACTCC 

TGGTTCAGCGTGACCCTAACTCCCATGGCACCCTCCACCAGGGCCCCGNCCTCTGCCCCT 

TTNGGGGAACCTTCTGCANATGGACAGAAAAGGCCTACTCAAATCCTTTCTGAGGGAAGT 

GGATCTGATTGCCG TCGAT CAATAATACCTTTGACCAAGGGACTCCAATCAATTCAGCAT 

^^^^^CTAATGCCTTTTGTTTGTAATACTGTTGCGGGACAATCAAAGACTGGAGAGAT 

CAAAAAAAAAAAAAA 

Sequence 658 

AGGTACCCCCTGACTTGGTGTTATCTGACATTGGCTAATCTCCAACTTATACTGAATACC 

CGAGCATTTCTAAGTGCCCTGGCCTTAGCTGGGGCAGGCTTAGCAACAAGGGTAANACAA 

GAGAAGACGAATAGGCTGGATGCAGTGGCTCATGCCTGTAATTCCACCACTTTGAGAGGC 

CGAGGCTATCAGATTACTTGAGTCCAGGAATCAAAGACCAGCCTGGGGCAGGGCATGGTG 

GTGTGTGCCTATAATCCTAGTGCTTTGGGTGGCTGAGGCAGCTGGATCACCTGAGGTCAG 

G AGTTC NAG ACC AGCCTGG AC AAC ATGGC AAAAC CCTATATCTACTAAATATAC AAAATT 

AGCCGGGTGTGGTGGTGCAGCCTGTANTCCCAGCTACTTGGGAGGCTGAGACAGGANAAT 
TGCTTGAAC 

Sequence 659 

GCTCCACCGCGGTGGCGGCACGAGGTACTTGGCCCTACCCTTTGGCCTCTTTCACCCCTG 

ATGTGGAATCTTCC7TTCTCATATGCGTGACTTTTTCATATCATCTGCTCATCTTGGCGT 

CTCCTTCATCAGAATTCCACGGTTATTTGGGAGGTCTGCTCAAGACTAANTTCAGTGACA 

TTCCCCATTGCCCCCTCCAGCCCTTTCACACACACACAGACATACACACATTCCCTCTGT 

CTCTTTAGGGTTTCAGGTAATCAAATAACTCACTGTCTATGGTGTTCTGCTCTTTCCACC 

GAGTCCTTTGGCTGCTGAATGTGTCTCTCTCCTCTTACGCTGTTCCTACTTTGGCTGGCG 

AATAAGATGAACAGAGCCACTCCCTGGAAAGGACTTCANATTTGGCCTCCGCTGAAACAA 

TGAAGATGAGGGAGAGCAGGAAAGCCAGGCTGGGCGTANAATGTGTGTNACAATGANGCT 

GGMTGAAAAGGGAGATGGGMTTTGCAGTTTGCTCACACCCTCATTCCAAATGTTATCC 

CCGCGTACCTTGCCCG 

Sequence 660 

CTCCCCGCGGTGGCGGCCGAGGTACTTTTTATTATTTTCTTTAGTTTCACCAGTCCTATA 
TCTGGAAGTCTCT TCTAA GCATCANGATAGAGGTCATCTAGTTCAAGATGCTTAAATTCT 
CTCTTCAGTATATCTTTTCAAGAACAAGAGGTGTGACCAATACACAATAGGCAATCAATA 
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TTCAATGAAAATGTTTGCTCAGAATGAGTAAAGACCTCCAGTGATAAGGAATGCATTCCT 

TTCCAAGGGAACCCATTCTA I Mil I GAAGAGTTCACAACATCAAAAAAAAGTTCTTATG 

CTGAGCATGAATCTATCTCCTTTTGACTTGCCCATTGAACTCAATTCTGTGCTTCCAANG 

GAGATACAATTAAAGCTCAATACTTTCACCAAACGACAGCAACTCACAAACCTGAGGGGC 

GGGTTGAGACTACTCTTCCTCACAGTCCAATCTTCAGCCCTCTCCATTCCTCAGTGACAA 

GGGTTTCCTTTCTCC 

Sequence 661 

ATGCAAAAATCGTGTGAAAATCAACGTGGAACAGGAAATGAGGTGCGGGTGTCAACTGGT 
TACAAAGGGTCAGAAGTTGTCTGGTGCCTGACAAGGACACACATCTCATTCATAACATAT 
GGTGGTAATTTAAAAATGAAATGATGTTA I Nil I CCTTCGAATTCATATGTA I HUM 
TCAG GTGGC C ACTTAGTTGTAAGG AC ATAAATACTCATTAAGTTGTTTGGACCTATTTAA 
TAAATGGACATGTTAGGTATTTCACCTGCCCG 
Sequence 662 

AGGTACCAATGCGCGCGCGCACACACACACGAACATGCGCGCATACACAGACGCTCGTGC 

ACACACGCNAACAGGCACATACGCGAACAAACACGGGCGCGCACACACACGAACACGGGC 

ATATGCATACTTGGGCGCACAAGAAGATCTGGCTGGCTCAGTCTGGGTTAAGAATCCCCC 

TTTGGCCTTGGCATCGGGGCCCACCGGGTATCGTGTGTATTGGGAGTAGTTGCCAGAGAT 

GGGGGAATTGTGTTCAGCTCAACTGCTAACCGCAAGTAACTCACCCAATTAGTAAATGAT 

GGAGGAI I I I I CT7TCTTTCCAAAGTTTTGGACTTTTCCTACCACGCTGTGTACCTGCCC 

G 

Sequence 663 

GGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGAGGGTGCCAGGCACTGTGGTGCACACCT 

GTAGTCCCAGCTACTTGGGAGGCTTGAGGCGGGCGGATGCTTGAGGCCAGGAGTTTGAGG 

CTGCAGTGTGCTGTGATTGCTCCTGTGAGTAGCCACTGCACTTCAGCCCTGGCAACATAG 

CAAGACCCTGACTCTAAATATAAAAAAAAGTATAGCGGGCACAAATGGGGCGAAGCCAGG 

GTGGGATGCAGAGTCTAAAGGGAGATGAAGCTGGAGAGGCAGGCAGTGGTCAACATGTGG 

CCATGGAAAATAGTTTAGGTTTGATTCTGATGGCACTGGGCTGCCATTAGAGGGTTTGAA 

GCAGGTCAGTGATGTGATACAACTAAAGTCTTCAACAGATCATTNTGGAGACAGTGCTAA 

CATGGACTGGTGGGCAAAAGAGTGAGAAGCCNAAGCCTTAATCCAGAAGGAAATGAAATA 

TGTCTGAATTCAGGCAGGGCAGTAAGGCTTGGANAGCANGGGCATGGTTAAACAGGACAG 

CAAAGTACCTGCCCGGGCGGC 

Sequence 664 

GATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATTTTTATATGTTTTATTCTTTCAGG 

CTTTGTGAGAGGGGGTTATTTATTTCATTTGCAAAGCAGTGAATTGAGATCCAGGGATTT 

GTCCAAGGTCACATGGCTTGTTCTGTGCCATTCCTAGTCCCTGCTCTTTCTTCCCATCAT 

ACCACACATGTCATTTCTGCCTCTAGCTCACAGCCACAGTCTAGTCCTTTGATAAGAAGC 

AATTCCAGAAATG AAGATTCAAATCCCAG CTATTCTGTTCTACTC AAAATATG G CCC ATG 

GACCAGCAGCACTGAGATCACCAGGGAGCTACTTAGAAACCTGCAGTTTCATACTTGGGA 

GGCTGAGGTGGGAGGATCGCTTGAACCCAGTTTCAGGCTGCAGTGAGTGATGATTGTAAT 

CCTGCACTCCAGCCTGGGTGACAGAGCAAGATCTTGTCTTTAAAAAAAAAAAAAAAA 

Sequence 665 

AGGTACCACCACCTATCCACACTCTCACACAATCACCCCTTTTGAAGTAAAGCCTATATT 

TTAAAAATAACCATTACAGATTGTGAGTGCAGTCTAGGATCACATAGCAGAACTAAAAGC 

TAACTGGCCCACCCACAAAGTAAATGTGAGACAGAGGGCTCGGGGTAGCCATACTCTAGC 

CAAGTAGTACCTGCCCG 

Sequence 666 

GATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTTAGCAATGAACGAGAAAACACTTG 

ATAGATGCTAGTGTCCTCCAGAGCAGCATTCCTCTTGTGTCCCTGAATTAAGATGAAACA 

ATTTGATAGGTCGCATTATTTAGTTGCGTTGCTTTAAAGAAAATAGAAAACTTTT^ 
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^I^ TATMGGGATATATA ^ CT ^ TATAGGGGATATA ^CTGAAMCTTTTCCCA 

C^GT^c^ 

Sequence 667 

AGG 7 ACCTG ^^ 

TCTAGCCTCCTTAAGACAGGTCTTATCAGCTACTTTTCTTTGGCAGGCGAAATCTTGCAA 
ATCCTACAAAAAGCTCACTAAAGCAGCTCTCACATGTAAACACAATCATACTCGACTCTT 

CTAGGAGGGAATGGTACCTGCCCG 
Sequence 668 

?™? GAG ^^ 

ATGAAAAAAAGCATATATAGGAAGCCTTCAAAATGAAATGGTCAAGGGTGAGTTTACACA 

GATAGATAGAmAGGTCTCTTCCTTrTGCTTTGTGAAAGCATCTAGTGTTTAGGTGTCA 

GAGAGGGAGATATCCTTACAMGCAGAGATTATCATTACAGGTTTACATTTCTTACAAAG 

AGTTTCAAAATAAACAGGTAMTGCCAAAAACATATATTTTGGAGACGGATTAATTCACT 

AGTTGGTCTATTCAACTTAACTTGTTTCCTAATGAGATTAAATTCATGCACAAATAACCA 

Sequence 669 . 

C ™ A r GGAGCTCGACCGCGGTGGC CGGGCAGGTACGCGGGGGAACTTCGGAGGATGTTC 

ATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACGGCAACTAAG 

ACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAATTTTACATTCAGTG 

CCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATT 

AATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACGGAC 

ATAATTGGA i iimii IGCCATGGCCTGGAAAGAAAGGTACCTCGGCCGCTCTAGAACTA 

GTGGA 

Sequence 670 

AGGTGTACCACCCAGACCCGGCCAACCGCCCATCCGTCGACGGCCAGCCCATCGTCGCCC 
ATACCCTCCTCCAAAGGGGCTTGCCCGTACAGGGCGTCATGGTGATTGCCAACGGCACAG 
ACCCAGAGCGGGTGTTGTACACGCCGGATGCGCCCGACGGTTTGAGCTTCCGCGAACTGG 
CAGACCAAAACGCACTCAACACCCTGCTGGATAAAAACGAATGGAAGCTCTACACCGCGA 

ACCGGAAGTCGCCAGTCAATAAGGATGACGTCGCCAAGGCAAGGGACGCGCTCAAACAGC 
AAGCCAGCGCTCTGGGCACCAATCCGG ^ 
Sequence 671 

GCNNATTGGAGCTCCCCGCGGTGGCGGCCGTGATTTGCCTTTTAACTGAGATTTGCCCCT 

GATCTTCAGTGAAGTTCACAGATGTTGCCTTATCAGAAAGTTGGTATTTGGGAAAAGTTT 

TCAGAAAATATATCCCCTATATTTAGAATATATATCCCTTATATATAGAGGAAAAAAGTT 

TTCTATTTTCTTTAAAGCAACGCAACTAAATAATGCGACCTATCAAATTGTTTCATCTTA 

ATTCAGGGACACAAGAGGAATGCTGCTCTGGAGGACACTAGCATCTATCAAGTGTTTTCT 
CGTTCATTGCTAAGTGTACCT 

Sequence 672 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAATTAGATCTGCCCATCCGTTA 

CTTGGCAGAGGAATGGGCTGGAAATGGGAAAGGGTCAACCTTTCACCTGCTGGACACTCA 

AGGTGAGAAGAAAAAGGAAAGGGGCCAAGAACTAACTGTCTGGCTAGACAGTAATCATGG 

ACTCAGGCTTTAGGTATTTGATCTTTTAGAATTCCAATTTCTGAAAAATAAAGGTGTTAT 

ATGTAATTAAATGGTTGGAAGTGGGAGGAACTTCACATTTAAATAGAAAACTATAAATAA 

GCCAATGAAAAGAAAACACTCATAATTATCCAATATAATTTTTTTATAGTTCATAGATTA 
AAATAT 

Sequence 673 

AGGTACATTTTACTTGCTTGATCTCTGATTCCCCTCACTGGGTCACCCGCACCTCTGAGT 
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GCCCTCCTCACCCTGCATGAGCCTTGACACCAGGCCCTGAGCCTTTCCTCTGTGTGGTTG 

TCCTCACAGTCCTCTGTGTGGGATCCAGAACTAGGCACCAAGCTCCCCACGTAGGAGGGT 

GCCCTCCCCATTCCCTCTGCTGAGGCTCGAACCAGCATGTCAGGCTGCTCCCCCATGTGG 

ACACCTGACTCTCTTTGGGCTCTTACTCTACACTGGCTGTACCTGCCCG 

Sequence 674 

CCGGGCAGGTACTGTGCMTGACTGGTGAATCACAGTCTCTAAAGGAAGCTTTTTGAGAT 

TTTTACACAGATGGCTATGCAAAGCATAAATAAGTGTGAGGAGGAAGGGAAAAGAAAGGA 

GGGAAAAAATAAATGTGTGGGAGGACAGGCMCTGCAACTTTCATTGTTCTTTATTTTCT 

TGACCATTGAACTMCTACTTAGATAAACCCTACAACTCTTCTCTTCAATTGCCATTACA 

TGTCCACATAGCCTTTTAAAMCTCTAMTTTCACCTCATTCGATTTAGAA 1 rTTTTTTI 

TAACTTTACACCTTGTAAGTGATTTGGGGAATGAGTTTCT 

Sequence 675 

AGGTACATACGAGGTTTCATTATGTTGTCCTGGCTGGTTTTGAGCTCCCGGCCTTAAGCT 

ATTTGCTTGCCTCAGCCTCCCAMGTGCTGGGATTACAGGCATGTGCCATTGAGCCTGGC 

CAATAATTATTCCTTAAAGATGAAAATMTAGGTATTT^ 

CTTAAGAGGGGTGMTCACTCTCCTCCCCTGATMCATTTGGTAGTGTCTGTTGACAATG 
CGGTGTTGGGGTGCTGTGGTGCTGTCAGCACCTGATAGGTGAAGACCAGAGATGAACACC 
CTATCATGTGCAGAGTAGTCCCCCCAACCAAGAATTATCCA 
Sequence 676 

CGGGCAGGTACGCGGGGAGATGTAGTGAATCTCTGCCTGTGTGGACAAAGAGACACATTC 

AGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCC 

AGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAAC 

C C AGCTC TCAG AAGAAGGMTGTCAAAATGTC ATGTTCAATTTTACATTC AGTG CCTGG A 

ATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTT 

CAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGAC 

Sequence 677 

CCGGGCAGGTACAGATGGGGGGGTTCTCACTATGTTGCCCAGGATGGTCTGGAACCCCCA 

TGCTCAAGTGATCCTCCCACCTTGGCCTCCCAAAATGCTGGGATTACAGGTGTGAGCTAC 

CTCACCTGGCCCAGATGACAATTCTTGAAACACACCTTACAATTTCTGCAGTAGTTTCAA 

ATAACTAATCGCATTCCTATCTTCACAACCTTCTGAAATAAGGAGAAAAGTTATCTCCAT 

TATTCCCACTATCCGACATCACTCACCCAGGCAATTCACATACTTGGAACCTAAAATCAA 

ATTTCCTGACTCTTCTTATGACCCTACGG 

Sequence 678 

AGGTACACACAGTTAACCACAAAACAGGCCTCTCTGAAAAAGCCATTGCCATGGACTGCC 

AGACAGACAATGACAAGACACAAATACCTTCTGGTGTGTGAGCGACGGGACATGTGAGCT 

TCCCCGCTGATGCTCCTCTTATATTAAAGATCACTTTCACAAGATGAGCGACTCAATATC 

TTTTATCAAACCAATGATCACCTGCAAGCTATGGTATATTTTTGCAGCTGTGTAGAGCTA 

TGTGGCATGAGAATGTGGGACTTATAAATTGCTGATCCAATAAATAGACATTATGGGCAA 

CAGTGTCTTATCAGCTAGTGTGTACCTGCCCGGGCGGCCGCTCT 

Sequence 679 

GGGGCCATTGANACTGCCATGGAAGACTTGAAAGGTCACCGTANCTGAAACTTCTGGAGA 

GACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGANAG 

AATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAG 

TTGTGTGCAGCTGCANCGGATTCCTNTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTT 

CACCTTCCCTGGGATGTCCCTGGACAAGAGACNAGGAGAAGGCCTTANGATCTACTGGGG 

GAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCC 

Sequence 680 

CCGGGCAGGTACAGCCAGTGTAGAGTAAGAGCCCAAAGAGAGTCAGGTGTCCACATGGGG 
GAGCAGCCTGACATGCTGGTTCGAGCCTCANCAGAGGGAATGGGGAGGGCACCCTCCTAC 
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GTGGGGAGCTTGGTGCCTAGTTCTGGATCCCACACAGAGGACTGTGAGGACAACCACACA 
TAGGAAAGGCTCAGGGCCTGGTGTCAAGGCTCATGCAGGGTGAGGAGGGCACTCAGAGGT 
GCGGTTGACCCATTGAGGGGAATCANGAGATCAAGCAAGTAAAATGTACCT 
Sequence 68,1 . 

CNNNTACCCTGNNNTGCCANANTTGCNNNNTGGACNTATCACNTCTNGTNCNGNTCCANG 

NGGACGNCNANGCTCNCNTACTNNTGTTGANTANTAGGAGAGGGTTNATNCGNATCGCTN 

CNGCNNAGTTCANCGGTATATATTCTNNGTATCNTGAACCATAATAGTAATTTGCCTACT 

AACTATGTAACATTGAACAAACTACTGTTTTGTGAGCCTGAAAAGTTGGTACCT 
Sequence 682 

AGGTACACATTCCCTCTNCTGGAGGTTGTCCCTCAGCATGACGCTGACTGATGTTCATCA 

AGACTTACTGTCACACTGGCTAGTGCTGCCAAGCCATCTAGACTACAACTTATTCTAGAT 

TCACCCTGGAGAGATCTTAGAGGGCATATCTTNTTCACCCAGAGAAGGCAT7TATGCCTT 

ATTGAGGCCTCAGTTTACCCAGAGGAAACAGCTCTTNCCTATGACCCTGANAGGTGCTGG 

TGCTACTCAAGGCACCTGTGTTCTGGTGCACCTGGCCCTTTAACTAAAAGATAATGGGGC 

AAAGGANAGGGGGGGGAGCAAGTGTAAATACCTCATTGTGGAGCAG 

Sequence 683 

CCGGGCAGGTACAGCCAGTGTAGAGTAAGAGCCCAAAGAGAGTCAGGTGTCCACATGGGG 

GAGCAGCCTGACATGCTGGTTCGAGCCTCAGCAGAGGGAATGGGGAGGGCACCCTCCTAC 

GTGGGGAGCTTGGTGCCTAGTTCTGGATCCCACACAGAGGACTGTGAGGACAACCACACA 

GAGGAAAGGCTCAGGGCCTGGTGTCAAGGCTCATGCAGGGTGAGGAGGGCACTCAGAGGT 

GCGGGTGACCCAGTGAGGGGAATCAGAGATCAAGCAAGTAAAATGTACCT 
Sequence 684 

CCGGGCAGGTACACATCTGCTCCTGGGCAGACCTTACAAGTATCTCAAACTCATATCTCC 

ATGAGAGGTAGATAAGCTACGTAAACAAGGGAAGCCAAGATAACAGGGAAGACAGGAGAC 

TATCTGCCTATCTAAAAACACTCAAATTCTTTTCTATAAGCACTGCTGATTTAATCCATT 

ACACCACCAGCAGCTGCCAGTTTACAATCTCTGCTACACAAGCCCTCCATTGAGTTTAAA 

GACAACCGTCAAAATATCTCTCCAGTTTCTCCATTCACCTGACTGTTACCATAGAGAATA 

CAGGTATTCCCACTCTGGCCTGCTGTGTGCAGTGTTTTCTCCTATCAG 

Sequence 685 

AGGTACACACAGTTAACCACAAAACAGGCCTCTCTGAAAAAGCCATTGCCATGGACTGCC 

AGACAGACAATGACAAGACACAAATACCTTCTGGTGTGTGAGCCACGGGACATGTGAGCT 

TCCCCGCTGATGCTCCTCTTATATCAAAGATCACTTTCACAAGATGAGCGACTCAATATC 

TTTTATCAAACCAATGATCACCTGCAAGCTATGGTATATTTTTGCAGCTGTGTAGAGCTA 

TGTGGCATGAGAATGTGGGACTTATAAATTGCTGATCCAATAAATAGACATTATGGGCAA 

CAGTGTCTTATCAGCTAGTGTGTACCTGCCCG 

Sequence 686 

CCGGGCAGGTACCCTAAACCCTGGCAGCCTGGCCTACAAACCGACATGAAGTTTGAAGAG 

AAACTGGTCTTCTGTTTTATCAAATACTCCAAATATACGGAAGTAGCAAACATTCACCCC 

CTCTTCATTCCCT TAAACTCG ATCTACCAAGTCTAGCTTCCGCAGTGCATCATAGAGACA 

TTTTAATAGATTCA I I I I I I I I CTCTCTCTCCAAGAGATATTTCAACACTTGCTACGTGC 

CAGGCTCTGTACCT 

Sequence 687 

CCGGGCAGGTNCACACAGTTAACCACAAAACAGGCCCCTCTGAAAAAGCCATTGCCATGG 
ACTGCCAGACAGACAATGACAAGACACAAATACCTTCTGGTGTGTGAGCCACGGGACATG 
TGGGCTTCCCCGCTGATGCTCCTCTTATATCAAAGATCACTTTCACAAGATGAGCGACTC 
AATATCTTTTATCAAACCAATGATCACCTGCAAGCTATGGTATATTTTTGCAGCTGTGTA 

GAGCTATGTGGCATGAGAATGTGGGACTTATAAATTGCTGGTCCAATAAATAGACATTAT 

GGGCAACAGTGTCTTATCAGCTAG 

Sequence 688 
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CTTTCCAGGCCATGGCAAAAAAAATCCMTTATGTCCGTCTTGAGTCTGTGGTCTTGCTT 

CTTATGTAG NATTTC CTTTGTGAGCTGAANA7TAATGCATG GATTC ACCTCCTTCAGC AC 

ATTTCATTTCAATTGTGMGAAAAGATTCCAGGCACTGAATGTAAAATTGAACATGACAT 

TTTGACATTCCTTC7TCTGAGAGCTGGGTTGGTCTTAGTTGCTGTGAGGCTCTAGACACC 

GACCATACAGGGCGTGGGCTGCTCCTGGACATGAACATCCTCCGAAGTTCTCCCCAGTCC 

ACTTTACCCCCTNrGGGCTCACCTNTGAATGTCCCCGCGTACCTGCCCG 

Sequence 689 

AGGTACCTGCCACCACACCTGGCTAATTTTTGTAT7TTCAGTAGAGATGGGGTTTCACCA 

TGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCGCCCGCCTCGGCCTCCTA 

AAGTGCTGGGATTACAGGTGTGAGCCACCACGACCAGCCAGAGGACACTTTCTTAACAAA 

ATATTCCCmAGGGTAAGATCTTTCCTTAGAAAAGTTATTCATGMTATTTTCTGGT^ 

CTAGCTAAAATATAAG CTTATG ATATTTATCAAMTATTTCAATTTAACTCTGAAG ATAT 

TATCACTTGGAATAAAGTrTGTTAAAAATTTGGAAGAAATGCA 

acagttttgaaaaggagctatattttattc^^ 

AATAGTTATCACTAACTATACTAGATTACTGTACCTGCCC 
Sequence 690 

AGGTACCTTTCTTTCCAGGCCACGGCAAAAAAAATCCAATTATGTCCGTCTTGAGTCTGT 

GGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATGGATTCACCT 

CCTTCAGC ACATT TCATTTCAATTGTGAAGAAMGATTCCAGGCACTGAATGTAAAATTG 

AACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAGTTGCTGTGAGGC 

TCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGAACATCCTCCGAAGTT 

CTCCCCAGTCCACTTTACCCGCTCTGGGCTCACCTCTGAATGTCCCCGCGTACCTGCCCG 

Sequence 691 

CCGGGCAGGTACTCCACAAGCTTGCCTGCCATGGGCTGTCGGGATGTCCACGCAGCCACA 
GTCCTTTCCTTCCTGTGTGGAATCGCCTCAGTAGCAGGCCTCTTTGCAGGGACTCTGCTT 
CCCAACTGGAGAAAATTACGATTGATCACATTCAACAGAAACGAGAAGAACCTGACTGTT 
TACACAGGCCTGTGGGTGAAATGTGCCCGGTATGACGGGAGCAGTGACTGCCTGATGTAC 

Sequence 692 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACATTCAGAGGTGAGCCCAGAGGGGG 

TAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTG 

TATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAA 

TGTCAAAATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAA 

ATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACT 

ACATAAGAAGCAAGACCACAGACTCAAGACGGACATAATTTGGATTTTTTTGCCATGGCC 

TGGGAAAGAAAGGTACCT 

Sequence 693 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTTATCATCATTCCTCAAGAGCAGCATGTG 

GCACAGCAGGAAGCCAGGCAAGCTCCTGCCATTAGCACACAACGGCAAAAGTGGATGTCT 

GTGTCAACATTTCTGAAGACCCAGAATGGGCCGTAGATAGGCATGATTTTCCTAATGACT 

TTCGCACACAGGCAAAAAAATAGTAATGATAACTACTAATATTATAGGTGCTTCCTATGG 
CCAGGGGATGTACCT 

Sequence 694 

CCGCGGTGGCGGCCGAGGTACTGTGCCGAAAGACTTTACAGGTATATCCGGAGCAATAAG 

CCAGTCA CCATC ATTTCGGTCATAAGTCATCCCTCAGATGTCATGGAAATCCGAATGGTC 

AAAGAAAATTTTAAAGCAAGACCTTGGTCAGTATATTACTCTACATTTGTCCCAGTGTAT 

CTTGCATTAGAAAATCATCCATTTACCCTCACAATGTGTCCAACTGAAACCAAAGCAACA 

TTTGGGGTTCATCTTAAAATAGTAGGAGACTGGACAGAACGATTTCGAGATTTACTACTG 



Page 115 of 869 



W ° 01/51628 PCT/US01/00798 

Table 2 



Sequence 695 



:AGGTA 



g a ag / ag ?™^ 

:ttttgaggaaaagt 
3agaggagtaagga 

Sequence 696 



ACTGMGATACAAAAAAAT^^ , c c rAC Tc 



AAAAACAnTGGCTGGGCTTGGCAGNCnTGGGGTT 
Sequence 697 

Lque^ 

? ca ^s g ggca ^ gggcagmtga ^ cmgat gggcagamggtcccac^^ 

g ^S^ c I atcactaaactgcaatca ^ ctcgcc ^ t ^ 

araccamtgagagccctcccagatgttcagagcagcggaw^ 

g^gttmattccatccattcct^^ 

^S G S CGAA ^ GGAGCTCCCCGCGGTGGCGGCCC GCCCGGGCAGGTACCTGGTTTCAGCT 

TGGAGCCAGCGGGGTAAGGAGAGAMGGMGTGAGAAATGAGAGGG^ 

GTTCTTGGAGGGCTGGTGCCAGGCACTGGGGCTTGGCMCAGCTct^rrS 
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CCTT GGTTT GAANCATCTTGAGTCACCTGAGTGGGTGGAATGAGGCTTANTGACAACACT 
ACACTTTTTACAGATTAAAGGATAAAATGGGAAATATTATGAACAACTTATTNCTATNCA 
TTTC AACAGCTTNAGATGAAAATAGNAAAAATTCCTTTAATGATACCACATTACCCAAAG 
TT7TCTC AAAAAAGAAATAAG ATAAGCTGTNTANTCC CTAC NCTTCTTTTAAAAAT 
Sequence 700 

CGCGGTGGCGGCCCGCCCGGGCAGGTACCTAACCCCGAGTGTTAGCTTCCTTTGCATACT 
ACTGTTCTGAGATGTGTTGGTTGAATTATTAGTAAGGATTTGAAGGCTATGATGCATTCA 
ACA CTCAG TGCGGATTATAGATCCAAGMCACAGACTTTCTTAAAGTATGAGGCTTACAG 
GACTTTTTGTTTGGCTMTTAAAGTGATAGTC^ 

AGATGGGGTCTTATTATGTTGCCCAAGGCTGGAGTGCANNGGCTATTCACAAGGCATAAA 
TCTGAGCTCACTGCAGCCCTCAAACTCCTGGGCTCAAGCAATCCCCCTGCCTCAAGCCTC 
TGAGTAGCTGGGGCTGTTTCAAAATTTTMCATTGGTGCTGTMCANACGCACTGTTTAT 
TGATTAGTTGGGGACTGGTGAGAAAATTGTNGAGTCTCTNAAAAACTTAACT 
Sequence 701 

CGCCCGGGCAGGTACACCAACAACAGCCAAGCTCAGAGCCAAATCAGGAAGGCAATACCA 

TTCACAACTGCCACAAAAAGAATAAAATACTTAGGAACACAGCCAGCCAGAGAGGTGAAA 

GACCCCTGCAATGAGAATGCAAAGCACTCTTCAAAGAAATCAGAGATGACACAAACAAAT 

GGAAAAAAAGTCCATCCTCATGGAGAAGAAGATTTAGAGGCTACTGATTAGGAGTTTCAG 

TGGAAAGTGGGGTCTGAAGCCAAATTGCAGAAGTCTGAGGTGTATGTATGCAGTAAGTAA 

AGAG ACAGCAGATAATGAATCTTCTTTAGAACACAGAGCMTTGTTTTCC Al rTTTTTTT 

TTTTCATGGAAGGGAAAGCAAGTGTCTAATATATCATGACAGCCCCATGTTGTTCTCTAA 

ACACTACTATGCATCCTGAATTTTCTTTGAATTTTGCACTCCACTTATT^ 

AGATACTAAATATTGNGTATCCTGGTTTTGCAAACTGGAGAAAAGACTTGGGTGCAGGCA 

GAAAGGTCAGGGAAGAAAAGAAANGTGTTAGGANGGCATGGTTTGAACTGGGTTGCCCTA 

TNGGCAACTGGAGCTGGGCCTCACTGANAGCTTTGGAGGACCCAACAAGAACACACTTNT 

GGGGATCATCCCTTGGAA 

Sequence 702 

CCGCGGTGGCGGCCCGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAATCCAATTATGT 

CCGTCTTGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAA 

TGCATGGATTCACCTCCTTCAGCACATTTCATTTCAATTGTGAAGAAAAGATTCCAGGCA 

CTGAATGTAAAATTGAACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCT 

TAGTTGCTGTGAGGCTCTAGACACCGACCATACAGGGCCGTGGGGCTGCTCCTGGACATG 

AACATCTCCGAAGTTCTNCCCAGTCCACTTTACCCCCTCTGGGGCTCACCTTTGAATGTG 

TNTCCCCCGTACCTGCCCGGGCGGNCCGTGCTCTAGAACTAAGTGGATCCCCCCGGGCTG 
CAG 

Sequence 703 

GGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGAAACCAGTTAAGAGGCTG 

TTATTGCAGCAGTGCAGGACAGATATGATGGAACTCTGAACTTGAGGGATGGATGTCAAG 

ATGGAAAGAAGTGAACACATTTAAGAAATAATGAAAGAGTAGGTAATATCAGTAAGCCTT 

AACAATGGATTGACAGTAGAACTGAGGGAGAAGCAAAGATCAAGCACAGCCCTTAAGCTT 

TAGTCTTCTAAAACTGAATGGACCGTTGAGACAGGGAACATTGGATAAGTTCCAAGTTGG 

AGGAGAAACCATATCTTTAGMGAGGAAAAAMTTCAAATTTATGCATTCATATCm 

CCTCA NAAC ATAGAAAAGGGATATTAGGATATAGAAAAACAAATTACCTACTTCTGAAAG 

TTGCCTTTTAGATTAAGGGAAAAGTTATAGCAGTGACTCTATGACCATTGATTTTTTGG 

GAGCCCTCACTTAAACATTTCTGGCCCTMTCCTNANTTTAATCTATCTTTTGCCCAGTT 

CNTANCAATATTNTCTTTGCAGCANCATGGGGGGGG 

Sequence 704 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACGCGGGTATTTTGGGATGTGG 
ATGCCATTCGCCACACAAAAACACTTGGGATTCATTCCTAGATACCNNAATTCATCCCAA 
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TTTCCCCCCTTTAAAGTCANATCTCACCTACCTGTTTGGTCAGAGAGATGTGTGTTrT a a 
AATCCCCAANGMGGAGGCAAGGACTGTGCCCTCAGATGATTATTGGTGAAGTG^ 

Sequence 705 

^^I^ GAGCTCCCCCGCGG "TGGGCGGCCCGAGGTACACACAGTTAACCACAAAACAG 

CTTCTGGTGTGTGAGCCACGGGACATGTGAGCTTCCCGCTGATGCTCCTCTTATATCAAA 

^I GGT C TA11 ^ GCAGCTGTGTAGAGCT ATGTGGCATGNGAATGTGGGACmTA^ 

Sequence 706 

? A ™ TATAGGGGCGAATNGGAGCTCCCCGCGGTGG cggcccgcccgggcaggtacccc 

^G G ?CG A GC^ 

AC SS? C ^ GG ^ ctcgctactcttctccttcctct gaggtgcagaggcgcggtgggacgac 

Sequence 707 

I^I a ! a ^ gaa I nggagctccccgcggtgg cggccgag 

tctg^ctttgccctctggtgctgaaaggtaatctatattacctctaagaggaatgtcta 
JIgtttctgatgggggcttgctatgcc^^ 

^ggtcacgcagtatcagctgacctggacacta^^^ 

CTCTGGTGAGCTTCTCTAGTTGGCGATAGTTGGTGCCTGTTGTCACACATCAAAGACAGC 
AGA GTAATGAGTCCTGCTTCCACAGGGAGMGA^^ 

NCTGGACTTCAACTTACGTACCTGCCCGGCG ^ ito lb I UTGGAACCCT 

Sequence 708 

TGGGGGGTAAAMGCTATTCTTATTmAAACTCTTCAACATTTTTTAAAGTCTTCAACA 

I GA J™ TCATG ^ GTG ^^ 

TACTTGACCAAGTCCCTACTGGTTGACAGGA 

GTGTGACCCCCMCAAACACAGTGTCCTAGAAACACCTGTCCCAG^ 
Sequence 709 

£a^ GAA £ G ^ 

AGAACTGGACTTACAAGATGGATANGACCTGGTGGCAACTGTCCTTGAGGAAGCCGGAGC 
CTGGAGAGACAGATCTCATGTTATAGCAGAGTACCT uuaaoUHjuAQiC 
Sequence 710 

Sgac^a? GCAG tc^??^^ 

CTGCCCG CCCrrCAGCCGCAGGAAGCG GCAAGGGAAGCTGATGCTGTAGCACCTGGGTAC 
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Sequence 711 

CCGGGCAGGTACAGTGAAGCTTTGACCTGAGGAGCTCTTCTGTAGGTGAAGAGTGGTTAA 

CTATGGTTCCTGTGTGCCTGGGGCCCCCCACTCCTAGGACTCATCCACCCCCTACCCAGA 
ACTTACTAGTACCT 

Sequence 712 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTAGATCTGCAAATACGACTTCATCATGCTTGAG 

TTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTT 

CACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

GAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTG 

Sequence 713 

AGGTACTTTACTTTTTATTTTTGTATCTCAGGCCAAGCCAAGTTGAATGTGTAGAMCAT 
CTGCACAGAAAGAGAGACCCAAACAGAAACCAAAATTAAACTGTAACACTTTTGACCCTA 
TTAACGGCATAACAGTAAGGAAATCTAGCTCCTCGAACCAATATACNGGGGTCGGCAATC 
ATTGTGCCACAAAGTTTAACAATATTTGTAGAGACTATGAAGCCATTrGCTTCATGTCCT 

CTACTGAAATCCAATCTTTCCATTTTATTTCCATCCCTCAATCCAACCTGCTACCTGGTA 

CCTTCAACACTGTGTCTTTGAAGGATTCTGTA 

Sequence 714 

CCGGGCAGGTACGCGGGGCAGAGTCATTAAGATGGGCAGGAGTCAAATCAGATAAGGACA 
TGGGGTAAACTGCAGGCCAGGCCCATAGATTGTTCTGAGAGTCATGGGAGACTATTGGTG 
GGu i iGAGTAGGAGMTCACATTTGCTTTTTGTTTTTCAAAAAGCATTTTGGCCGCTCT 
GCAGGCCTGCCTGTATTAAAGGCTAGAATAGAAGCCAGGAAAGCAATAGTCCAGACAGGA 

GATCATGATGGCGTTGGCTTGGGTGGCAGTAATGGAAGTGGTGAGAAGCAGTCAGGCTCC 

AGATACAGTCAGGCTGAACACAGAACTGGC 

Sequence 715 

AGGTACTAGCCCTCATCTGCAAATCATCCTCTGGGATGGGCTGCATGCCAGATGGTGGCT 
CTTAATTGCTTGAGGGCATTGAGAGAGCTGTGATCTTCAAGACCTACCTCTCCCAGGTGG 
CCTGAAATAGCACCAAGTCTGAAAGCAGGGCTCTGGGCAGTGACTTGGGAAAACAGGTGT 
GGATACCAAGTGTGATGAGGACCCTTGAGCTTAGCTTTGTTCCTGTGGTGGCAAGTGGTG 

GGTCTCACCCCCCAGACAACAGCCTGGACTATGAATAGAGACCACAGCCCCCGAGAACCC 

ACCTGGCCCATTCCTCCTCCTTCCTCCTTCCC 

Sequence 716 

CCGGGCAGGTACGCGGGGGTGGAAAACTATGTCCCTGGCAAACTGTCTGCAATGGGCCTG 

GGATATGAAGATATAGACGAGATTGCTCCTCACATCATCTATTGTTCCATCACAGGGTAT 

GGTCAGACAGGTCCAATTTCTCAGCGAGCTGGTTATGATGCTGTTGCCTCGGCTGTTTCT 

GGTCTGATGCACATCACAGGGCCTGAGGTGGCGTGTTTGTCTCACATAGCTGCAAATTAT 

CTTATTGGTCAAAAGGAAGCGAAACGTTGGGGTACCT 

Sequence 717 

ACGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGAAACCC 

TTTGCAGACAGCTCATTTATAGAGAGMGCTMGTAT1TATTTACTTGCTACTCAATTTA 

AAATACGACCATGGTAGGCAACCAACCAGGAGAAGAGCTAAAAGTTTGAAAGCACTGCCC 

TTTTCTCTACTATAGAAGTTACAGGGTCAAGCCTTGGAAAATTTCCACAAAATAAGGCTT 

TGCTAAACATATTTACCTTATCAGAAAGATTGTATCTCAAAAMGTTACCGTTTTTCTTC 

AGGCAAGACAAGAAAAACTGCCTGTCCTAATCGACAGTACCTGCCCGGGCGGCCGCTCTA 
GAACTAGT 

Sequence 718 

CCGGGCAGGTACAGCCAGTGTAGAGTAAGAGCCCAAAGAGAGTCAGGTGTCCACATGGGG 
GAGCAGCCTGACATGCTGGTTCGAGCCTCAGCAGAGGGAATGGGGAGGGCACCCTCCTAC 
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GTGGGGAGCTTGGTGCCTAGTTCTGGATCCCACACAGAGGACTGTGAGGACAACCACACA 
GAGGAAAGGCTCAGGGCCTGGTGTCAAGGCTCATGCAGGGTGAGGAGGGCACTCAGAGGT 
GCGGGTGACCCAGTGAGGGGAATCAGAGATCAAGCAAGTAAAATGTACCT 
Sequence 719 

CGAATTGGAGCTCCACCCGCGGTGGCGGACCGAGCTGGACTCTGGGGCCTGCGGNGGCTN 

CAGCCTCAACTCAGAGGGCAACAGTGGTAGTGGTGACAGTAGCAGCTATNACGCACCAGC 

TGGCAACTCCTTCCTAGAGGACTGCGAACTCTCCCGGCAGATCGGGGCGCAGCTTAAGCT 

GCTGCCTATGAATGATCAGATACGGGAGCTACAGACCATCATCCGGGACAAGACAGCCAG 

TANAGGTGACTTCATGTTTTCTGCGGATCGTTTGATCAGACTTGTTGTGGAAGAGGGATT 

GAATCAAGCTGCCATATAAAGAATGTATGGTGACCACTCCAACAGGGTACCCT 
Sequence 720 

CCGCGGTGGCGGCCGAGGTACACTTAGCAATGAACGAGAAAACACTTGATAGATGCTAGT 

GTCCTCCAGAGCAGCATTCCTCTTGTGTCCCTGAATTAAGATGAAACAATTTGATAGGTC 

GCATTATTTAGTTGCGTTGCTTTAAAGAAAATA 

GATATATATTCTAAATATAGGGGATATATTTTCTGAAAACTTTTCCCAAATACCAACT^ 

CTGATAAGGCAACATCTGTGAACTTCACTGAAGATCAGAGGCAAATCTCAGTTAAAAGGC 
AAATCA 

Sequence 721 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGAAGGGCTGAGGCTT 

CTGTCTGAGTTGACCGTGACCTGTGGTTACAGCATTTTAGGACTTCGTAACAGAATGATC 

CACAGAGGTCAGCAGGCCCTTGAAAGAACGCTGAAGGAAACTTGCATACAAACAGTCTCA 

AGGAGGCAAACACATGCTTCAGCGCTGCCCGCGTACCTGCCCG 

Sequence 722 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACAATGGGTGTTCAAAATTTCTCTTGTAGGGTT 

GAAAATTTTCAAAATAAAAAGCTGGGGATMGATGATTTGGGTGGGTCTGGCCAGAAATA 

ATAAGAGCAGGATTCTCAGCTCTCCTTTTTTGGCCACCGTGCTTCTGCAGTGGGACTCAA 
AAAGAAGTACCT 

Sequence 723 

AGGTACTCTGCTATAACATGAGATCTGTCTCTCCAGGCTCTGGCTTCCTCAAGGACAGCT 

GCCACCAGGTCCTATCCATCTTGTAAGCCCCCCCGCGTACCTGCCCG 

Sequence 724 

TTATTGGAGCTCCCCGCGGTTTTGGCCCGCCCGGGCAGGTACGCGGGGGGACTTACAAGA 
TGGATAGGACCTGGTGGCAGCTGTNTTTGAGGAAGCCAGAGCCTGGAGAGACAGATCTCA 
TGTTATAGCAGAGTACCTCGGC 
Sequence 725 

CCGGGCAGGTACGCGGGGAGTGGCATCCTTTGAACTCACGGATCCAGTCATAACTGAAGT 
TCCATCTACACCTGGGGTCTTCAGACCCAGGCTCTGGAAAAGATCCTGTTACTTACAAGC 
AAAAGAAATTCTGCCTCTTACCCTCAGCATGTGAGTACCT 
Sequence 726 

AGGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAG 

CCCAAGCGCAGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTC 

GGACCATCCCTTAGCCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTACC 

TGCCCG 

Sequence 727 

CCGCGGTGGCGGCCGAGGTATCGCAGCAAAATGCAAAAATCGTGTGAAAATCAACGTGGA 
ACAGGAAATGAGGTGCGGGTGTCAACTGGTTACAAAGGGTCAGAAGTTGTCTGGTGCCCG 
ACAA GGACACACATCTCATTCATAACATTGGTGGTAATTTAAAAATGAAGTGATGTTATT 
TTTTCCTTCGAATTCATATGTA I I I I I I I I CAGGTGGCCACTTAGTTGTAAGGACATAAA 
TACTCATTAAGTTGTTTGGACCTATTTAATAAATGGACATGTTAGGTATTTCTCCTGCCC 
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Sequence 728 

CGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAATCCAATTATGTCCGTCTTGAGTCT 

GTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATGGATTCAC 

CTCCTTCAGC ACATT TCATTTCAATTGTGAAGAAAAGATTCCAGGCACTGAATGTAAAAT 

TGAACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAGTTGCTGTGAG 

GCTCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGAACATCCTCCGAAG 

TTCTCCCCAGTCCACTTTACCCCCTTCTGGGCTCACCTCTGAATGTCCCCGCGTACCTGC 
CCG 

Sequence 729 

NCGCCCGGGCAGGTACAAAACAGATCCTCAAAGATGGGGTTCCAGGGTTGTTTTAGACAC 

ATCCTTCACTTTCCTTTGTCATAGGTAGTTGTATTGAAATTMCTTTATTGCTATGGGTC 

ACTGTGGATAGTGCACCAAGTGTCTCGGGAACAAATAGCTACTGTCCTTGGTTCTTATGG 

CACGTAGTGATAGCTCCAAAGGCAGTGGAGCAGGAGGCTGCTCCTGGGAGGACTAGAGTA 
CCT 

Sequence 730 

CCGCGGTGGCGGCCGAGGTA CTGTG TCMGCTTTTTACATTGAGCTAGTGCCATTGGGGG 

TGGGGGGTAAAAGCTATTCTTATTTTTAAACTCTTCAACATCTTTTAMGTCTTCM 

GATCTTTATCATGTTGTGATGTGACCGCTAGATTTCCATTTTGAGGCTCTCATATAATTT 

ACTTGACCAGTCCCTACTGGTTGACAGGAGACCATCATGGTTCCTATGATTTCTCAAAGC 

GTGACCCCCAACAAACACCTGTCCTAGAAACACCTGTCCCAGAAACACCCAGTACCTGCC 
CG 

Sequence 731 

ANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCCAACCTTGCAAA 
TGGGAGCTGGCCAGTGGGTTGGGGTGACCAATCAATGAACAAGAGAGGTCTGAGACCTCC 
CTGTCCGTCGGGTCTGAAGGGCTGCGTGGGGGCATGTGGCCTCACCTGTTCTCTAAGGTA 
GAACTGCTCCATAAAGGGCCANGTGTGCAAATCCTGGTCCTGGGATGTGAGTGCTGCTGA 

GCCAAGGTGCACGGAGCATTAGTTCATNCTTCTTGAAACCTGCGGTGGCAATGGTTCTTG 

ACAGGTATTGGGTTA 

Sequence 732 

AGGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAG 

CCCAAGCGCAGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTC 

GGACCATCCCTTCAGCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTACC 
TGCCCG 

Sequence 733 

ACTACTATAGGGCGAATTGTAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCA 

TGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGC 

TCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCCAGACAA 

GACTCTCTTGATCTGCAAATACGACTTCATCACGCTGAGTTGTGTGCAGCTGCAGCGGAT 

TCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCT 

GGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGG 

Sequence 734 

CCGGGCAGGTACGCGGGTCTAGAMCAGTGATTGTTAAAATTGCCTATAAACACACAATA 

TAAACAATATGCCACAGACCCAGAGGCCCCCAGCATAGTTTCTGGMGGTGAAGTCTCAC 

CTCTGCCAGAGTAGAGTCACAGTGGGAATTGAGGCTTGGCATCTGGGTGCCTCTGAAGCT 

CTTCCTGTGGCTCCTGTTAGACAATTGACCCCGAAACACAGAAGATAGAGATTGCACATT 

GCGTTCAAGTAATTGTTGTTTATTTATGTTATTTCTGAAAATAAAAGACAGCACAGGCTG 

CTTATCTTTAAACCACACAGTACCTCGGC 

Sequence 735 
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GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCCAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTT 

GTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCA 

CCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGA 

GTCGAGCGGCCGCCGTCNAGGTACACCTGGTGCCCCN 

Sequence 736 

CGCCCGGGCAGGTACATCTTGGTCCAAGCCACCATCTTCATCTCTTGGTTGGATTAGAGC 

AGTGACTGCCTGTAGATCTCCTTGCTTCTACTCTTGCCACCAACAAAAAAGTCAGTAAAA 
ATGAAGTCAGTA 

Sequence 737 

CCGGGCAGGTACCATGGCGACGCGCGGTGGGGCTGGGGTGGCAATGGCGGTTTGGTCCCT 

CTTATCCGCCCGGGCCGTGACCGCGTTCCTTCTGTTGlTCCTCCCTCGCTTC-rTACAGGC 

CCAGACCTTTCTTTCCCnTCCAGCAGCCGGAGAAGTGCGACAACAACCAGTACCT 
. Sequence 738 

ccgggcaggtacacgccagcctattctcagcaSagctgctagagtgtccttatacccttc 

ATTGACTTAAAATCCCACAGTGGCTCTCCATTTCACTTAGTGAATGCTATCATGTATTCA 

GTGGCCCACAAGGGCTCCATCACCTAGCCCTCTGCTCCCTAGTACCT 
Sequence 739 

CCGCGGTGGCGGCCGAGGTACGCGGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCA 

GCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTC 

TCAGAAGMGGAATGTCAAAATGTCATGTTCMTTTTACATTCAGTGCCTGGAATCTTTT 

CTTCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCA 

CAAAGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACGGACATAATTGGATTTT 

TTTGCCATGGCCTGGAAAGAAAGGTACCTGCCCG 

Sequence 740 

CGCGGTGGCGGCCCGCCCGGGCAGGTACTTGGGTGTGATTCAGGAGTATGCAAGCCTTTC 

TGCCCCACTTTGTTGCCTTGTTAACTAAGGGAGGCTGAGATAAGTCCTCTCAGGTGCAGT 

CAGAGTCTGTCCTAAGATGGGGGTAGGGAGTTGCTGTGCCTCAGTGGTTTACTTTCTCAA 

TATAGACTGTTAGCCATATGTGTATTACCTGAAACCTTAAGGAGAAAGCATTTTTGGCCG 

GCCTGGAAAGGACACAAGCAAATGTAGCCTTGATGGTGGAGAAACGTCTCTAGCAGGGCT 

GTGCATCTGAATCAGCATTGAAGCAGCGCCACCTTGTGGTTGATGCTGAATGAGCTGTGT ' 

ATTCGGCCAGGTTCTGTGATTCTTACCCTGTCTGCATGCTGTTCGAGTAGCTAAAGTACC 

Sequence 741 

CGAGGTACGCGGGGACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAAC 

TTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCC 

TCACAGCAACTAAGACCAACCCGGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAA 

ttttacattcagtgcotggaatcttttcttcacaattgaaatgaaatgtgctgaaggagg 

TGAATCCATGCATTAATCTTCAG CTCACA AAGGAAATACTACATAAGAAGCAAGACCACA 

gactcmgacggacataattggattttttttgccatggcctggaaagaaaggtacctgcc 

CG 

Sequence 742 

ccgcggtggcggccgaggtacaacttgtcataaatggcagtgtgggacatttacattctt 

ttggtctacagttctgagactattttttatgcatgtatgtatttattcagtaaacattta - 

tcctctgctgcacgtcaagcactatgccagacccgataggagagaacccagggtgagtgt 

gacccggcctctgcttggaacgagatgggtgtctctagccaggccaggtgcttaccatgt 

gcctggtgtggggtccatgacaccaagcagggcagacctaggcatcctgttggtgcagct 

ctgtaaaggggacaaagacagacaacaaaccatgactccgagaaagtgcaacaggtgcca 
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CGAGGAAACANAATTCCAGGACAGGAAGGTCAGGGAATGCCTTTTCTGGACCAAGGGGGG 

AAAAAGAACCAGCAGGGGAGGGTATTCTGGCAGAAGGGGCAGTAGCTTGAAGCCTTCAAA 

AGTGGAAGTGTTATTAATTCTTTGGANGGGGGGNAAAGGGAAATGGGAGCTTGGGAAAGG 
GG 

Sequence 743 

CGCCCGGGCAGGTACAAGGACAATAGGGATACCCTGGAGCTGTCCCAGTCCAGCCCGAGA 

AGGGCATTCTCCCTACTTGTGTGTCATCTTAGGGACTCTCACTTTGACCCTGAGTGTGAT 

AGGAACCATGGGGAAGTCTGAGCTGAGGTTGGACGTGAAAGGTCTCATGTTTTAACAGGG 

TCACTCTGGCCCCCTCCACTGAGAACAGGCTGAAGGTGGCAGGGACTGAGTGGAGAGACC 

TCTTGGGAGACCACTGCAGCAAGTCAGGTGAGAGCTGGAGGTGGCTTGGCCCAATGGGTA 

GCTGTGAGGTCGAGAGCAATGGCTGGAGCCAGGAAACCTCTGAGTACCTCGGC 
Sequence 744 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGAGCCTCCTGTGCCATTCTTGCTAGCTGG 
ACAGAAGACATTGCTACTGAGCCAAGAAGACTTGATCTCTCTTACACCGAGGAAGAAGGG 
AAGGTCTGATGCAATGAAACAGCACGTTTGTTATAATTCTAGATCAGAACCATAATAGTA 
AmGCCTACTATGTMCATTGAACAMCTACTGiTrTGTGAGCCTGAAAAGTTGGTACC 

Sequence 745 

CCGGGCAGGTACTGTGTTTCTCCACCTGACCTAGAAAACCTTGATTCCTGAAAAGCCTCA 

CCCTAACTTTGCCACAACCTGGCGATCCATCCATACTGTTCACAGCCCTCAGGAGAGACT 

GGATAATTATGAACACACCTTGCTTTCTTCCATCATTTCCCTTTTCCCCAAGGGAACATA 

GCTGCACTGAGAACACCAATCTCTCAACCACACCCTATCCCAACCCCTGTGGAAAATATA 

GTAGCTACCTCACCAATGGGAGTTGAGGGAATGAAGACTCATGTCTTCTGGGATCTGGGT 

CAAGCCCAGGTCAGCCATTCTTCAGACTTGCACAAGTACCT 

Sequence 746 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGAGCCTCCTGTGCCATTCTTGCTAGCTGG 
ACAGAAGACATTGCTACTGAGCCAAGAAGACTTGATCTCTCTTACACCGAGGAAGAAGGG 
AAGGTCTGATGCAATGAAACAGCACGTTrGTTATAATTCTAGATCAGAACCATAATAGTA 
ATTTGCCTACTATGTMCATTGAACAMCTACTGTTTTGTGAGCCTGAAAAGTTGGTACC 

Sequence 747 

NCGAGGTACTGGGATTACACGTGTGAGCCACCGCACCCAGCAACTGTTCACATTTTTTAA 

AAAACAGCTCTACCTCTGAGTTAAAACTGATTTTTGTCAAATTGTAAACTTTAAAATATT 

AGCCTTTCCCC AGTGCTAT GTGCTCTTTAAGAGCATAGTGCTTAAAATTTGGACTGTAAC 

TAi I I lAAAGGCi I I I I I I iCTTTGCTGCCAtGTTCCAGGTCACGTCTGTTAAGTGTTTT 

TGTTGTTGTCACAGTGTTTTAGAATTATTCTGATGCAAAAAAAAAAAGAATTATTCTCAT 

GCAGTAGCTGAAACCAATGCAGAATATGCTGGTTTTGCAACAATTTCATGTCATGAAATG 

CTTANGCCAATTTTCAGTTAATAGAACTACCTCCATANGTTTAAAAAGTTATGATAAATT 

AGAAGCCATTTGNTCTTTTACATTAATAGGAAGCATTANTTCAAGATAAATTACTTCAGG 
ATTAGGGATTA 

Sequence 748 

CCGGGCAGGTACCACCACCTATCCACACTCTCACACAATCACCCCTTTTGAAGTAAAGCC 

TATATTTTAAAAATAACCATTACAGATTGTGAGTGCAGTCTAGGATCACATAGCAGAACT 

AAAAGCTAACTGGCCCACCCACAAAGTAAATGTGAGACAGAGGGCTCGGGGTAGCCATAC 
TCTAGCCAAGTAGTACCT 

Sequence 749 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACGGAGAATTTTAAATGGGCAGAAGAGAATTT 

AAAGGTGTTATTTACATAAATGTAAAAGCCCCTAAAGTGGAGAAATCTCTCATAAGGAAT 

TTCAAACATTTGCTTGACTCTGGAGGCCTGGCATACTAAGGAATAGGATTCCTCCTAAGA 



Page 123 of 869 



WO 01/51628 PCT/US01/00798 

Table 2 



ATTACACATCTGAAGGCTGTCCTGAAAGACTTTCCATACTCGATGATGCTCTTTTGGGTA 
CCT 

Sequence 750 

CGAGGTACCCGAGTCGGATTAGGAGAGGCGGCCAGGGGAGCCTGCCTTTAGAAAGAGCTT 
CGGATGATTCAGATGCACACGTTTGGGACCGTGGGCTATGGGGGTCGCCGCAATAACCAT 
GTGAGGATTAACGGGGGCTTAACCGATTGTGAAAATCCCCTTAGAAGTGTCGCGTAAACA 
ACCCCAACTCCCACTATGGTGCCGGACCCCAGTGCCTCACTGTACCTGCCCG 
Sequence 751 

CGCGGAGCTCCCCCGCGGTGCGCCGCCCGACNTACATGNNTTCGGNTAATCNGCCAGAAA 

AATTACATTAGGGTAACAGGCTAGAACAGTCTGACTTTTCTTGTTTTTCTATCCCTTGCT 

TTCTTGATTAGAATGAATAGGAGGTGGGTCTGGATATAGCAGCTGGAAACCTGTGTTCCA 

TGAGTGATGGGGAAGAGAGGGAGGGAATAGGTTCCTCTGATTTTTGGCATTTTCTAAGAC 

CTGATGCCCACCTTGTCAGAGAATGCGATGACTACTTTTGTGTTCTTNCTTNTCCCTTTT 

TCTC CCAA TATAAAATNGTTTTCTCTTTCAGM 

CCACTTTTGATGTTGTTAATTTAGCTGAATACCTMGTGAACATTTTGTGTCAAAANCCC 
CTTNGTTTANGAAATNCAGNTGGTCTAGTCACCCTTACATTTTCTGCCTCATAATTGGCC 
CTAAGCCCT 
Sequence 752 

CCGCGGTGGCGGCCCGAGGTACTTCTTTTTGAGTCCCACTGCAGAAGCACAGTGGCCAAA 
AAAGGAGA GCTGA GAATCCTGCTCTTATTATTTCTGGCCAGACCCACCCAAATCATCTTA 
TCCCCAGCl I I I I ATTTTGAAAATTTTCAACCCTACAAGAGAAATTTTGAACACCCATTG 
TACCTGCCCG 
Sequence 753 

CGCCCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGGGGGTAAAGTGGACT 

GGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTG 

TCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGT 

CATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGC 

TGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGC 

AANGACCACAGACTCAAGACGGACATAATTGGA I J I I I I I I GCCATGGCCTGGAAAGAAA 

GGTCCT 

Sequence 754 

CGAGGTACCTCAAGAAAACTTTATCACATGCGTGCCAGGTGCCTGACATGATTCTAGATG 

CTGAGGATGTAGCAGTTAGCCAAAGGGATATAACTCTTGCTCTCATGCAGTTTATATTCT 

AATTTGGATATTCTAAATATTTCATGTAGAATGTGAAATAATACCACAATTAAGATATCA 

CATAAGAAAGATACTTTCAGAGCTTCTCTATTTAATAATTTCATGTAGGAAAAGAAGGTC 

TCCTTAGTGTATCTTATCTTTCAGACCTTTATATGTMTAACATGATTTTTCCTAAm 

GAATTAACAGTTTCTCTAGCATGTAAAGAAAAMCTGGACCTGTGTTTTGGAAATCAAGA 

CCTAAGGTTCAAATCAAGGACCATAAACCNCACCATTAAGGTTAGGGGG 

Sequence 755 

AGGTACCTGCCACCACACCTGGCTAATTTTTGTATTTTCAGTAGAGATGGGGTTTCACCA 

TGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCGCCCGCCTCGGCCTCCTA 

AAGTGCTGGGATTACAGGTGTGAGCCACCACGACCAGCCAGAGGACACTTTCTTAACAAA 

ATATTCCCTTTAGGGTAAGATCTTTCCTTAGAAMGTTATTCATGAATATTT^ 

CTAGCTAAAATATAAGCTTATCATATTTATCAAAATATTTCAATTTAACTCTGAAGATAT 

TATC ACTT GGMTAAAGTTTGTTAAAAATTTGGMGAAATGCACAAGTTATATTTTGAM 

ACAGTTTTGAAAAGGAGCTATATTTTATTCATATAAATATT 

Sequence 756 

CGGGCAGGTACGCGGGGAAAGGCCCTGAGGAAAACAAACCCCAGCTGGGAAGCCTGAGAA. 
CACTTAGCCTTCATGAGTGTCCCCACCATGGCCTGGATGATGCTTCTCCTCGGACTCCTT 
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GCTTATGGA 
Sequence 757 

CGCCCGGGCAGGTACCTTTCTTTCCGGGCCATGGCAAAAAAAATCCAATTATGTCCGTCT 

TGAGTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATG 

GATTCACCTCCTTCAGCACATTTCATTTCAATTGTGAAGAAAAGATTCCAGGCACTGAAT 

GTAAAATTGAACATGACATTTTGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAGTTG 

CTGTGAGGCTCTAGACACCGACCATACAGGGCGTGGGACTGCTCCTGGACATGAACATCC 

TCCGAAGTTCTCCCCAGTCCACTTTACCCGCTCTGGGCTCACCTCTGAATGGGTCTCTTT 

GTCCACACAGGCAGAGATTCGCTACATCTGGAACTTGGCTCCTCCAGCTTGGAAGTAACC 

ATGAAGATGTGGTTGGGGTGGGGTGGGAAACC 

Sequence 758 

CCCCGCGGTGGCGGCCGAGGTACCCAAAAGAGCATCATCGAGTATGGAAAGTCTTTCAGG 

ACACGCCTTCAGATGTGTAATTCTTAGGAGGAATCCTAATTCCTTAGTATGCCAGGCCTC 

CAGAGTCAAGCAAATGTTTGAAATTNCTTATGAGAGATTNCTCCACTTTAGGGGCTTTTA 

CATTTATGTAAATAACACCTn*AAATTCTCTTCTGCCCATTTAAAATTCTCCGTGTCCTG 

CCCG 

Sequence 759 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGGGACTTACAAGATGGATAGGACCTG 
GTGGCAGCTGTCCTTGAGGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAGAG 
TACCT 
Sequence 760 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

AATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGAC 

TTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATC 

TGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTT 

AGGATCTACTGGGGGAGTCCGGA 3GAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCC 

ACTGAAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATCACCAAGTGAGAAATT 

CCTTGAAATTTGCAAGTTGNCTGGGGTCATGTCTAAGCTTGNTCCACTANTCCAGAATGC 

CC C C AAG AATTC AACTGG ATCTTG G AAA 

Sequence 761 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGAGCCCTACCTCCAGAGTCAAGTCCC 

ATGTCCTACC T CAATGCCATGCCTATAAAGGGGGTAGCACTGTGCTTGACACACCATCCA 

ATCTCCTCTCTCTTTGCCTGAAGCTTTAGATTCCCTTTCGATTCAGGGTTCC7TGTTTGT 

GGCTTGGAGTTGGCCTCTGAGGTCTCTTGTGCTTAAAGATTCCATGATTTATGGCATTTT 

ATGTAAACATTGAGGACACACTCAGCTCTGCACAGGACAGANAGGAAGGAGGAGATACAG 

GCTGGCATGGTCTCTAATTCCAAGACGGTAACAATAATGTTAGTGGGGGAGGCAACACAT 

CCAGATCAGCTGTNCTCTGCTTCTTCTGTTTATCCTGAAACACCCTTAGGCCATGACTGC 

AACTCAGCCGACAAGATTCGCTCCTGCATCTGGCTTTGTGAAGGGATTTNIGAA/GGGATG 

G 

Sequence 762 

TATNGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAA 

GACTTGAAAGGTCACGTAGCTGATACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTG 

ACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGCCAGACAAGACTCT 

CTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCT 

GAGCGCTGTCTATCGNATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAA 

GAGACAAGGAGAAGGCCTTANGATCTACTGGGGGAGTTCGGAGGAGCAGTCNCTTTNTGT 

NCCGCTGGACCCATG3TCCACTGAAGTTCCTTATG 

Sequence 763 
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CGAGGTACAAGGCTGCCCTTTCAGAACAAACCCAGGACGTTCATTATAAAATAGCGAACT 

GGCCTTCCAGCACTTGATGGGAATGCTGAATAATGAAAATGTGGTGTGAAATGTCTCGAA 

TATTTATATTTTACAT AACAAGAGCAGGGATTCCAATAG GCTTTGAATTTTAGAG AAATT 

ACAATGAGTAATTTTAGTTCCTGAGATTTTAGAATATGATGCTAGTTTGTAAGCAGGGCA 

AAAGGCAAAAGTCTCCCAGAAGCAACTATMTGTGTTTGAATTATAATAAAAGATTTAGT 

GTCCCAAAACCATTTTTTCAAAGCCAGGAATAGGCCAGGTACCTGCCCG 

Sequence 764 

GGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGA 
TGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAA 
CTAAGACCAACCCAGCTCTCAGAAGMGGMTGTCAAMTGTCATGTTCAATTTTACATT 
CAGTGCCTGGAATCTTTTC ITCACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCAT 
GCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGA 
CGGACATAATTGGA I I 1 I I I I I I GCCATGGCCTGGAAAGAAAGGTACCTGCCCG 
Sequence 765 

AGGTACTCACTAGCCTACACCTATGCAATGTGATATAGAAAGATAGAGTTTCATTTTCAT 

TCAMTTTATATTCCTTTTCTCMTAGTGATGTTGAMTTCACATM 

GATACGTATTCTTCATTTTTGAGCTGTATTGACTCAGTTTTA 

TTCAAAAAACAATTTTAAACACATAGTAAATTGAATCATAAAACCTATAAGTATTACTAG 
CCTTTTMCATTGNTTGATTTCCCCATGCTTGCTTGTGGGTGGAGCAGTTAAATTTTCM 
CATGAGCACAAAGGATACNGTTAGAAAATTCTACTTTAAAAATATTATTCAAGGCCGGGC 
ACAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGTCGAGGCAGGTGG 
Sequence 766 

CCGCGGTGGCGG I I I I I I I ICCCCCTTGTAAATTTCTATACATGTCAAAGGCTACCAACT 

CAM I ! I ATTTAGAGTTTGAGATTTGATTTGAAAAATATACTGCAGCCAGCTATATGAAA 

AACATMGAGGATATTGTTGACTTTTGATGAAGGGAAGAAAGAACAAGGTAGAGCTGCCT 

AGCCATGAAGAACGCCAGGGTTTTTGTAACCCCTGGTGTGCTTTGCACGCAGAAGGCACA 

CACTTAAATTGGCATGGCTCACTTTTTTTACTGTGCCTTTGCTGAGCTTGCTGCAGGCTG 

AGTGCACTGGTAGAGGAGTGCAGGGCTGTGTTCTGGCCATCCCTGCCTGCTCAGGCCTCT 

CCTGCTCCTTTGTGGATCGCCCAGGTCTCTGCTGCTTCCTGGTGCTGNCTCCTGGATCCT 

C7TGACATTTTTCCCGTCTGTAATTTGATTGACAGTTACAGACAAGGAAGGGGACTGAAA 

TTCTTACCGTTTGTGACTTNCTAAAACNGGAAAGGAAATTTGGTGAATGAGAANGGGAAA 

AC 

Sequence 767 

CCGGGCAGGTCCGGGCAGGTACTTTGTTGGGAATTAAGGATGATGATGCAGATCTGAGTC 

AGGCTCTTTGTCTGGCCATCTCCGTGTCAGAGATCCTTCAAGCGAATCAGCTACAAGGGG 

AAGGAGTCCGGTCTTTGTGGTGGATTGCCGTCCTGCAGAACAATATAATGCTGGGCATTT 

ATCAACTGCTTTCCACTTAGATTCAGACCTGATGCTTCAGAATCCATCTGAGTTTGCACA 

GTCAGTAAAATCC 

Sequence 768 

CCGGGCAGGTACCAGAGGAGGAGATGGACGATCAGAGCCATGCACCTGTTTCCTGCACCC 

CCTGCGCACTGGTTCTATGGCCACAAGGAGTCTTACCCAGTAAAAGAGTTTGAGGTGTAT 

CCTGAGCTGATGGAAAAATACCTATGTGCCGTTCCCTTGTGGGTTGGACCCTTTACGATG 

TTCTTCAATATCCATGACCCAGACTATGTCAAGATTCTCCTGAAAAGACAAGATCCCAAA 

AGTGCTGTTAGCCACAAAATCCTTGAATCCTGGGTTGGTCGAGGACTTGTGACCCTGGAT 

GGTTCTAAATGGAAAAAGCACCCGCCAGATTGTGAAACCTGGCTTCAACATCAGCATTCT 

GAAAATATTCATCACCATGATGTCTAAGAGTGTTCGGATGATGCTGAACAAATGGGAGGA 

ACACATTGCCCAAAACTCACGTCTGGAGCTCTTTCAACATGTCTCCCTGATGACCCTGGA 

CAGCATCATGAAGTGTGCCTTCAGCCACCAGNGCAGCATTCAGTTTGGACAAGTACCTT 

Sequence 769 
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CGGGCAGGTACAGCAAAACCCACCTGTGTAAACACACACAGCAAAGTGATGTAAGAAGTT 
TCCATATAAAGGGCTGCAGTATGGAGAGGTAATGTGCAGGCTGGTTTGCGGCTGTAGGGG 
CCACCTTGTGCAGCTCTCCACTGATATGGTACCT 
Sequence 770 

AGGTACAAGTTGTCTTTATGCTGCGAGATAAGTCCTCTCTTGGT7TGAGCTCCCACCTTT 
TCAGTGAACTCTTACATTTTGGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGAGT 
ATTC TTTTG G TT AATTTTTG GTTTGGTTATTTG TG C ATG AATTC AATC TC ATT AG G AAAC 
AAGTTMGTTGAATAGACCAACTAGTGAATTAATCCGTCACCAAAATATATGTTTTTGGC 
ATTTACCTGTTTATTTTGAAACTCTTTC 

CTTTGTAAGGATATCTCCCTCTCTGACACCTAAAACACTAGATGCTTTCACAAAGCAAAA 
GGAAGAGAC CTAAATCTATC TATCTGTGTAAACTCACCCTTGACCATTTC ATTTTG AAGG 
CTTCCTATATATGC 1 Mill ICATCTCAACAAATAAGTGGTGTTTAAGTTCTGTGCCTCG 
GCCCGCTCTAGAACTAGTG 
Sequence 771 

CCGCGGTGGCGGCCGAGGTACGCGGGGACA7TCAGAGGTGAGCCCAGAGCGGGTAAAGTG 

GACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTC 

GGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAA 

ATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAM 

GTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAG 

AAGCAAGACCACAGACTCAAGACGGACATAATTGGAI MUM I GCCATGGCCTGGAA 

Sequence 772 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGAACTTAAACACCACTA 

TTTGTTGAGATGAAAAAAAGCATATATAGGAAGCCTTCAAAATGAAATGGTCAAGGGTGA 

GTTTACACAGATAGATAGATTTAGGTCTCTTCCTTTTGCTTTGTGAAAG 

TTAGGTGTCAGAGAGGGAGATATCCTTACAAAGCAGAGATTATCATTACAGGTTTACATT 

TCTTACAAAGAGTTTCAAAATAMCAGGTAAATGCCAAAAACATATATTTTGGTGACGGA 

TTAATTCACTAGTTGGTCTATTCAACTTAACTTGTTTCCTAATGAGATTAAATTCATGCA 

CAAATAACCAAACCAAAAATTAAACCAAAAGAATACTCACCAGAAAGGATGTCCTTTACA 

AGAGCAGATCCCCCAAAATGTAAGAGTTCACTGAAAAGGTGGGAGCTCAAACCAAGAGAG 

GACTTATCTCGCAGCATAAAGACAACTTGTACCTCGGC 

Sequence 773 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTTCGGAGGATGTTCATGTCCAGGAGCAGCCC 

CACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAG 

AAGAAGGAATGTCAAAATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTC 

ACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAA 

GGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACGGACATAATTGGAl I I I I I I I 

ACCATGGCCTGGAAAGAAAGGTACCTGCCCG 

Sequence 774 

AGGTACGGGAGGTGATATGAGCAAATTTAAGTATGCCTCTTACTCATGTTTCCCTGTCCT 
CCAATTTTCTTCCTCAGAGGCMCCTCTCTAACAC^ 

GCATGAACAAATTACATGTGTATTTGTATATATGCATATATATTTTCTGACTTTTTTCCT 

TCTTTCAAAGAATTATTAGTGATATTCTAAGCTCCAGTAACACACAGGGAAAGGTGGCTT 

GCTAATAG I 111 I I I GTGTAA l MM I GAGACTGGGGCTTGCTCTGTTGCCCAGGCTGGA 

GTGCAGTGGCACGATCACAAGCTCACTGCAGCCTCTATCTTACGGGCTACAGTCAAGCCT 

CCCACCTTGGCCTCCCGAGAAGCTGGGACTACAGGCATGTACCTGCCG 

Sequence 775 

CCGGGCAGGTACTGTGTTTCTCCACCTGACCTAGAAAACCTTGATTCCTGAAAAGCCTCA 
CCCTAACTTTGCCACAACCTGGCGATCCATCCATACTGTTCACAGCCCTCAGGAGAGACT 
GGATAATTATAACACACCTTGCTTTCTTCCATCATTTCCCTTTTCCCCAAGGGAACATAG 
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CTGCACTGAGAACACCAATCTCTCAACCACACCCTATCCCAACCCCTGTGGAAAATATAG 
TAGCTACCTCACCAATGGGAGTTGAGGGAATGAAGACTCATGTCTTCTGGGATCTGGGTC 
AAGCCCAGGTCAGCCATTCTTCAGACTTGCACAGTACCT 
Sequence 776 

AGGTACATCCCCTGGCCATAGGAAGCACCTATAATATTAGTAGTTATCATTACTATTTTT 
TTGCCTGTGTGCGAAAGTCATTAGGAAAATCATGCCTATCTACGGCCCATTCTGGGTCTT 
CAGAAATGTTGACACAGACATCCACTTTTGCCGTTGTGTGCTAATGGCAGGAGCTTGCCT 
GGCTTCCTGCTGTGCCACATGCTGCTCTTGAGGAATGATGATAACAGGTACCTGCCCG 
Sequence 777 

CGAGGTACGCGGGGGAG7TTCTAAGCCCCCGCCTGCGGTCTGAGGCACCGGCTGAACCAT 
GTCGGAGATCCTGTGCCAGTGGCTCAACAAGGAGTTGAAGGTGTCCCGGACCGTGAGTCC 
CAAGTCATTTGCAAAGGCATTTTCCAGTGGCTATCTACTTGGAGAAGTTCTACACAAGTT 
TGAACTTCAGGATGATTTTTCAGAATTTTTC 

TAATTTTTCTCGCTTGGAGCCAACACTTCACCTTCTGGGTGTGCAGTTTGATCAGAATGT 
GGCCCATGGCATCATCACAGAAAAGCCTGGGGTGGCAACAAAGCTGTTATATCAATTGTA 
CCTGCCCG 
Sequence 778 

TAATCAGGTTGTGCCAGAGGAGGAGATGGACGATCAGAGCCATGCACCTGTTTCCTGCAC 

CCCCTGCGCACTGGTTCTATGGCCACAAGGAGTCTTACCCAGTAAAAGAGTTTGAGGTGT 

ATCCTGAGCTGATGGAAAATACCTATGTGCCGTTCCCTTGTGGGTTGGACCCTTTACGAT 

GTTCTTCAATATCCATGACCCAGACTATGTCAAGATTCTCCTGAAAAGACAAGGTAAAAA 

CCAAGAGGGGCTCACAGTACCT 

Sequence 779 

AGGTACGGGAGGTGATATGAGCAAATTTAAGTATGCCTCTTACTCATGTTTCGCTGTCCT 
CCAATTTTCTTCCTCAGAGGCAACCTCTCTAACACA I I 1 1 I GTGTTTAAATAT A 1 H I M 
GCATGAACAAATATACATGTGTATTTGTATATATGCATATATATTTTCTGAC 1 1 I I I I CC 
TTCTTTCAMGAATTATTAGTGATATTCTAAGCTCCAGTAACACACAGGGAAAGGTGGCT 
TGCTAATAG I I II II I GTGTAA 1 1 I I I I G AGACTGGGGCTTGCTCTGTTGCCCAGGCTGG 
AGTGCAGTGGCACGATCACAGCTCACTGCAGCCTCTATCTTACGGGCTACAGTCAGCCTC 
CCACCTTGGCCTCCCGAGAAGCTGGGACTACAGGCATGTACCTGCCG 
Sequence 780 

CGAGGTACACACTAGCTGATAAGACACTGTTGCCCATAATGTCTATTTATTGGATCAGCA 

ATTTATAAGTCCCACATTCTCATGCCACATAGCTCTACACAGCTGCAAAAATATACCATA 

GCTTGCAGGTGATCATTGGTTTGATAAAAGATATTGAGTCGCTCATCTTGTGAAAGTGAT 

CTTTGATATAAGAGGAGCATCAGCGGGGAAGCTCACATGTCCCGTGGCTCACACACCAGA 

AGGTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTCCATGGCAATGGC I I I I I CAGAGAG 

GCCTGTTTTGTGGGTAACTGTGTGTACCTGCCCG 

Sequence 781 

CCGGGCAGGTACCTGCTCAAGATGCCTGGCTGAGAGTGTGTGGGAGTCTGGTTGGAGCCC 

AGATTTTATATATNTNTATATTTTATTTAGCTNTGACCAAAAANTGAACATAAAAACTGN 

GTCATGGTAATTAAAACTAACAGATAATTTATCTAAAATAAATCTTGATCTGACTAACAT 

TGGTCAAACATGACGTGACTTTTTACTCCTTAGATGCACCTTGTGACATAAATAGAATCC 

TTCCCTTGCATTTTATAAATGGGAAACCAGAAGCACCNGGAGATGAATTTGAGGGGAACC 

TNATCAAGTGGGGAGGACCTAAAACTTGCAGCCCAGTCCCAACANTGCCAGGTCGA 

Sequence 782 

AGGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAG 
CCCAAGCGCAGGAAAAAGCGTGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTC 
GGACCATCCCTTACCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTACCT 
GCCCG 
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CGCCTCCATTGTTTTATTATTTTAAGCAGCAGAGAGATCTGCCTACTGTAGTATGTTCGT 
TACAGCTTATATGGTAGAGGGGCCACAGCCCANATCAGGCAAACCCAANATGAGGCGATG 

TTGTAAAAATACTCTGAGGAATCCCACAGAAATNTAAGAGCAGGAACTACAGCACAGCTG 

GGCTTCCTGGTACCT 

Sequence 784 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAATAGATGGCAGGTTCTAAATCAACATCAGAG 
CAATGTTATGGTCCAGGAATGTGACATGGGTTGAATATCTTCTAACTGGACTTAGAAAAC 
CAGAGATCTATAAACATGTTTGTGAATTCCAACTTTCCTCCACCCTGTTTCTGCTACCAG 
GG CACAT TCCACGTTGATCTCTTGTATC^ 

ACATTTTAAACA CTGAT TTTCACAGAGTTACAGTCCCCGTGGCCTCCCCACACCACAACC 

AAACAAATGCGCTTTTACTGCCTCAAGGMGATGTAAGTAATTTACTTGTAAAAGTGAGC 

TGAGCTCTCAGGAGAMTTCAGCCTCATGCTTCGTTCTGATGGATTTCAGTTCTCAATCT 

GTCCGGGACAGGTCAATTAAAGATTAAAGTTGGAATTGAAAGTGAAACCTTCCAAGGCTC 

GTTTAGGTAGGGGCCAGACACCCTCCATCANGCCCAAACAAGGGGATGGGGGAT 

Sequence 785 r 

CGAGGTACCTTTCTTTCCAGGCCATGGCAAAAAAAATCCAATTATGTCCGTCTTGAGTCT 

GTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATGGATTCAC 

CTCCTTCAGC ACATT TCATTTCMTTGTGAAGAAAAGATTCCAGGCACTGAATGTAAAAT 

TGAACATGACATTrrGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAGTTGCTGTGAG 

GCTCTAGACACCGACCATCAGGGCGTGGGGCTGCTCCTGGGCATGAACATCCTCCGAAGT 

TCTCCCCAGTCCACTTTAC 

Sequence 786 

ACGCTAGATCAACTGAGATGCAAAACCCATGTTCACATCATGAGTGAGTCTTCTAACTCG 
GACAATGCAATCAACAAATTCTAAGACCAAATTCCGGTGGCACTGGCTCCAAAAAAAAAA 
AAAAAAAAGTCCT 
Sequence 787 

CCGCGGTGGCGGCCCGAGGTACAGAAACTCTGCCAGCCAGCCATTTTAATGTTGATTTCT 

TTACTTATATTTAMCTTC TAAAT MTAATAACTACCATTACTGGGTGCTTCATTTATGT 

TGGACACTCTGCTAGGTGTTTTGTATGGTATTTCATTTCATCTCCCTGTTGAAAGCTATA 

GATTAGGCAGAGAAGCTCAGAGAGGTAMGTGAATTTCTGAGAACACAAACTTCTAAATG 

GAATTGGGATTTGTGCTTAATTTTMTCTGTCTGACTTTCACAGTCATCTTCAGTATGAA 

CAGAAG AAAATCCCTGAACACCTTTG AAATTTAAAG C AAATTAAAGC AAATACTATGC C A 

TTAGAGTATATACATATGAATATGATTCACTCCCTGTGAACTCTGCTGTATTCTCCTACC 

TCTAATGTGCCTCGCCTCTGCTTCATTGGGAAGTAAAAAACCATTTTACTACCTGGGAAT 

TGTGAAGCTATCTGGCCTTT 

Sequence 788 

CCGGCCCGCCCGGGCAGGTACTGATAGCTGGGCAGTCTACTGAGACTTAACCTTGTGGTT 

AACTACTTGGAAAATACAGAATTGGCTAGTGAGCCACAGACCCATTTGAGGCCAAGCCAT 

ATGGCAAGACCTTTGGGAAATAGGTCATCAAAAAGAGGTAACTATTTATCATGTGTCAGG 

CCATATGCCTTTGGCCACCCCTAGTAATGATGAGGCAGATGCCTTGGCTAAGGTCAGATG 

GTCAGAGTCAGCACCAACACAAGATGTGACCTTGTGGCTACACCGGAAACTGGGACATGC 

AGGGGGTAAACTGATGCAACAATTCAATAAGTGTTGGGGTCTGTCCCTTCCAAGCAAGAC 

ATTTGTGAGGCTTGTCAGAAATGCCTGGCATGTGTTCAGACATATCCTAAAAAGAGGCAG 

CTGCCCGGTGTTATACAACAAGTAACAATAGGGTGAGTGCCCTTGACCAGGTGGGAAAGT 

AGACTACATCGGGCCGCTTGCCAAAGTCGCGAGGGTATACCGCATGCCACTAACGGCTTG 

TAGACATTGGNCACAGGCCTGTTGTTCACCTACCCCTTGCAGGGGTGGGCCAAC 
Sequence 789 

CCGGGCAGGTACCATATCAAGTGGAGAGCTGCAGCAAGGTGGCCCCTACAGCCCGCAAAC 
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CAGCCTGCACATTACCTCTCCATACTGCAGCCCTTTATATGGAAACTTCTTACATCACTT 

TGCTGTGTGTGTTTACCAGGTGGGTTTTGCTGTACCT 

Sequence 790 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACATGTGAGGACCGTCCAGG 

ATAGTTTTGAAATGAGAAATGAGAGCAGCATTTCCTTGGTGTCGTGAAAATATATTATAC 

ATNTATTAAA1 ATTGAAGAAGTANNNGCTGTCTTGGTGGCTCACGCCTGTAATCCCAGCA 

CTTTGGGAAGCTGAGGCTGGTGGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGCCAA 

CATAGTGAAACCCCATCTCTACTAAAAATACAAAATTAGCCGGGTGTGGTGGGTGCAT 
Sequence 791 

CCCCGCGGTGGCGGCCGAGGTACACACAGTTAACCACAAAACAGGCCTCTCTGAAAAAGC 

CATTGCCATGGACTGCCAGACAGACAATGACAAGACACAAATACCTTCTGGTGTGNGANC 

CACNNNACATGTNANCTTCCCGCTGATGCTCCTCTTATATCAAAGATCACTTTCACAAGA 

TGAGCCGACTCAATATCTTTTATCAAACCAATGATCACCTGCAAGCTATGGTATATTTTT 

GCAGCTGTGTAGAGCTATGTGGCATGAGAATGTGGGACTTATAAATTGCTGATCCAATAA 

ATAGACATTATGGGCAACAGTGTCTTATCAGCTAGTGTGTACCTGCCCGGGCCGGCCGCT 
CTAGAACTA - 

Sequence 792 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTGCACAACGTGCAGG 

TTTGTrACATATGTATACAT GTGCCA TGTTGGTGTGCTATTTTTTGACTACTGGAAAGGG 

ATGTGTTNGTAANNAACATCTTTTTTNNNANCANGTTTTCCATGATTGCTAAGTATTGTC 

TCCTCCCTTGAGAGTTTCAGTGGTGTGTTTGCTACATTACTTCAGTCAGTATTTGCAGCA 

AATAACAGGAATAAGCTGAGAGAGAGAGTCACCTTCATCTCAGGGTTATTAATAAAAACA 

TGAAAATTGACCTTATTTGGGATGTTTTTACTTCAGCTGAAGGTGAAGTTGNAAGATACT 

GCAAAGAATACACATCTTAAGAAATCATGTTAGAAGCAAGTAGCNTTTAATAAGCCATCC 

ACTATAAACCCCAAAAGAAAACTGATCTTTCTGGGGAAAAATCAAAGAATGGCCTCAAAG 

GACTTCAACTTCCTGAACAAAATGTATGTGCTAATTTTCGCGATCTTCTTCCTATATTCA 

TGCCTGACAAGATTGCCATTNGTATTCTATTGACAGAAAAATGATGATTTTTGAAAGCTT 

TTCTTTCCCATMTTTCCTTTTGNTGATANGNGNTTGTTTCCCAATCATTAGCAACNTGA 

GACCCTNGTGAGTGGGGGAGGCCATAATANCTTNCCCGGGCCNGCCNTTNTAAAAACCNN 

GGGNACCCCCCGGCTTNNAGGAATTNGANCTNAAAGCTATCNNANCCCCNCGNCCNCGAG 
GGGGG 

Sequence 793 

CCGCGGTGGCGGCCGAGGTACTAACATCAATAAGTCGAGAAAATTATATTAACTGAAAGA 

AAACAAAATAATAGAGAATTTTATTAAACGTATTTCTAATGTTTCTCTTCATGTTTGGAG 

AAAAGCTGCCACATAATTAAAACAATTCTTCCCTGTAAAACTGATTGTCTTCCAATCTCA 

GGAGGTTTACATTAACAGGAATATAGAATAAGAAACAGGCCTATGGCCGGGCTCCGTGGC 

TCACGCCTGTAATCCCAACACTTTGGGATGCCGAGGCGGACGGATCACGAGGTCAGGAAA 

TCCAGACCATCCTGGCTAACACGGNAAAACCTAGTCTCTACTAAAAATACAAAAAAAAAA 

AGAAAGAAAGAAAGAAAAAAAGTCCTGCCCGGGCGGCCCGCTCTAGAACTA 
Sequence 794 

GCGGTGGCGGCCGAGGTACCTTCTCAGCCAGCAGTTAGCCATTGTGGAGGTGCCAGGGGA 

GTTAATGCCCTTAGGAGCAACTCTTGGTCCTTGGAGGCAGGAGTCAGCACCCACTGAACG 

TCAGCATACATTTCCCACCTTCCAGCCCTTCCAAGGGATGATCCTCAGAAGTGTGTTCTG 

CTGGGTTCCTCCAAAGCTCTCAGTGAGGCCCAGCTCCAGTTGCCCATAGTGGCAACCCAG 

CTCAACCATGCCCTCCTAACACCTTTTCTTTCTTCCCTGACCTTTCTGCCTGCACTCAAG 

TCTTTTCTCAATTTGCAAAACCAGGATACACAATATTTAAGTATCTCCCAAACCTAAAAT 

. AAAGTGG AG TG C AAAAATTCAAAG AAAATTCAG GATGC ATAGTAGTGTTTAAAAGAAC AA 

CATGGGGGCTTGTCATGATATATTANACACTTGCTTTNCCTTNCATGAAAAAAAAAAAAA 
TGGAAA 
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Sequence 795 

CCCGCGGTGGCGGCCGCCGGGCAGGTACTGTTCTGGTTCCCACTCAGAGATCTCTCGTAA 

GATGCAATCTATCAGGCAATGTCTACCACACACACATGCGCGAGTGCGCGCGCACACACG 

CACACACACACAGAGGAAGACAGACAAAACCTCAGACAGATTACATCACAGCAAAGTCAG 

ACTAGAAAAACCAGAGGAAACACAAGATACTTCTCATTTCTAACACCAACTCTCCCAAGT 

CACATCACAGAGTCAGCCAACCCAGAGACCACTGGCTCCTGTGGTTATTTGGGTTAAACA 

AGCANCATGATGCTCCCTCTTCAACTCTAACCAACAGAATTAACAATGACCCCTTCTCCA 

ACACATGATTTACACAGCCAACAAAAAGAGCCACAAGGGACAATATGGAGCAGGANTCCT 

CGCCGCTCTAAAACTAAGTGGATCCCCCNGGCTTGCANGAAT 

Sequence 796 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGGGATTGCTCCCTGCATTGTGGGTGGGAGATT 

GGGTTGGGAAATGTAMGCTCTCTTTCCTTACCAAGTTTGTTTAACATGGCNCTTCCCAC 

CCCA CCCCAAC CACATCTTCGTGCTACTTCAAGCTGGGAGGAGCCAANTTCCAAATNTAN 

CCNAA! III! I N NCCCNTGTTGGACAAAGAGAC ACATTNANAGGTTG AGCCCAGAG GGGG 

TNAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCANCCCCACGCCCTG 

TATGGTCGGTGTTCTAGAGCCTCACAGCAACT-AANACCAACCCAACTCTCATAAGAAGGA 

ATGTCAAAATGTCATGTTCAATTTTACATTTCANTGCCTTGGGAAATCTTTTCTT 

TTTGGAAATGGAAAAT 

Sequence 797 

CCGGGCAGGTACCTTTCTTTCCGGGCCATGGCAAAAAAAATCCAATTATGTCCGTCTTGA 

GTCTGTGGTCTTGCTTCTTATGTAGTATTTCCTTTGTGAGCTGAAGATTAATGCATGGAT 

TCACCTCCTTCAGCACATTTCATTTCAATTGTGAAGAAAAGATTCCCNGGCCCTTGAANT 

GTAAAAAATTGAACATTGACATTTrGACATTCCTTCTTCTGAGAGCTGGGTTGGTCTTAG 

TTGCTGTGAGGCTCTAGACACCGACCATACAGGGCGTGGGGCTGCTCCTGGACATGAACA 

TCCTCCGAAGTTCTCCCCAGTTCCACTTTACCCGCTCTGGGCTCACCTCTGAATGTGTCC 

CCGCGTACCT 

Sequence 798 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTGGCATCCTTTGAACTCACGGATC 

CAGTCATAACTGAAGTTCCATCTACACCTGGGGTCTTCAGACCCAGGCTCTGGAAAAGAT 

CCTG TTACT TACAAGCAAAAGAAATTCTGCCTCTTACCCTCAGCATGTGAGTACCTCGGC 

CCGGTTTTAAAACCTAGTNGNATCCCCCCGGGGCTGCAGGAATTCANATATCAAAGCCTT 

ATCGATACNCGTCGACCTCTGAGGGGGGGGCCCGGTNCCCAGCTTTTNGTTCCCTTTAGT 

GAGG 

Sequence 799 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACCCCAAAACCACGGGG 

CTGCAGCTCCAGCAATTGGGAAGGCCAAGTTTTCTTGCACACTCCCCAGAAAAGGGGCCA 

AGTCCATGGGGCTGANCAGTGATNGACTGCAGACCTCACCACCACTGAACCTTGTAGGAT 

AAGGCCCACTNGCCTGGGGATTGNTTNTTGAGGCCCANCCCTAAGTNCeTTCTGAGTTCT 

CCAGCTGGGAGCAGCTCTACACTTNTCCGGCATGCAGCTTTCCAAAGAGAGAGGCAGTCC 

ACCTTT 

Sequence 800 

CCGCGGTGGCGGCCCGCCGGGCAGGTACAGTGGTGCAGTCATAGCTCATTGCAACCTTGA 

ACTTCAGGCTCAAGAGATCCTTTTGCCTTACTCAGCCTCCCAAGTAGCTGGGACTACAGG 

CATGTGCCACCATGCCCAGCTAATTTTCATTTTAA I I II I I GTAGAGACGGAGTTTTGCT 

ATGATGCCCGGGTTGATCTTGAACTCCTGGCTTCAAGTGATCCTTCTGCCTTGGCCTCCC 

AAAGTGCTAGCATTATAGGCATGAGTCACTGTGCCTGGCCTTGAATATTTTTACATTGCG 

TTGTCACAAGTGAGCTGTGTCATTTTGGTATATTTTAATTAAAAGGCAAGGTTTCTACTT 

TGATATTATTTAAAATCCACACCCTTGTATCTTACATCAAAATTTAAGAATCAGTCAGCT 

TTAAATTATTTGCTTTTATTCTTCTCTAGGAGCTGCTGTTTTGAGGATTCCTATTCTGTA 
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TGGGGAAGTTGAAAAAGCTCGAAGAAAGGNCTGTGACTGTTATGGTTTGGATAAAAGTGC 

AGTTCAAGCAACAAGTCAGCCNAACATGGATNACTGGCAGNAANANGTTTCCCCACACAT 

GTCAAAGATNGGGGCCACTGGGTNCCCGGCAGCTAACAAAANAAAAANAATNCTNGTNAN 

AAAGNNTNCCTGGAGTCCNTGTCTTANCGNAAGNTNCACTTTGTCTTTTNNATGCTGGAA 

CmTNACANGCTNTNGCCNTNGGCCGGTNTAANAACCAAGTGGGNNCCCCCGGGCTGCN 

AGGAAATTNGATTCAAATCTTNTNTAANCCCGNNNTACCCTTNAGGGGGG 
Sequence 801 

NGCGGNACGCCCGGGCAGGTACAGTGAAGCTTTGACCTGAGGAGCTCTTCTGTAGGTGAA 

GAGTGGTTAACTATGGTTCCTGTGTGCCTGGGGCCCCCCACTCCTAGGACTCATCCACCC 

CCTACCCAGAACTTACTAGTACCTN 

Sequence 802 

AGCTACTTGGGAGGCTGAGGTGGGATGACTGCTTGAGCCCAGGAGGTTGAGGCTGCACTG 

AGCCACAGTCACGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCTAGTT 

TTGTCCATCTGTCTTCATGGTCCTAGCTGAAAAGTATCTTTTTAGATTCCCTATTTCCTT 

CGACTTCCCTCCCCTAGAATATAAACTCCATGAGCTCCTTGTCCACCCAACTCTCTGCAG 
TCTTCTGTACCT " 

Sequence 803 

AGGTACAAGAAGGGCTGAGGCTTCTGTCTGAGTTGACCGTGACCTGTGGTTACAGCATTT 

TAGGACTTCGTAACAGAATGATCCACAGAGGTCAGCAGGCCCTTGAAAGAACGCTGAAGG 

AAACTTGCATACAAACAGTCTCAAGGAGGCAAACACATGCTTCAGCGCTGCCCGCGTACC 
TGCCCG 

Sequence 804 

GCTCCACCCGCGGTGGCGGCAAGAGTAAAATCAAGGCTGGTAAAAGAACACAGCACGTTT 

CAAGCCTAAGCAGCTTAAGTGCAAAGTGGCTCCGGGTGGAAATGATGAGAGAGCAAGAGC 

ACAAAGAGGACCCTCTAAAACAGAGCCAGTGACCATGTGAGCTGAGGGGGAGGGCTTCTC 
ACAGGGCAAACGGTACCT 

Sequence 805 

CCGGGCAGGTACTGGTGACTCACCATTCTGCCAGCGGCCACTTTGCACTGGGCACCAGCA 

CATACCAGCCCCCATCAGGAAATGTGCACAGTCCATCTCATCTTGTTTCATCTTCATTTT 

ATTAATGAGGAACTGAGAGTCCAAGAAGGTTCAATCATTTGCTCAAGGACAAAGAAGTGC 

ATATCGACTCAAGATCTGAGCGACTCCAAAGCCCAAGCTCTGTGCCCCCGATAACACTAA 

AGGGCTCTGGAACCTTCCCTCACTGTGCCAAATTCTGGTCACCACACTTTAGGAAAGGTA 
CCT 

Sequence 806 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGAGACACATTCAGAGGTGAGC 

CCAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCC 

CCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCACTAAGACCAACCCAGCTCTCAG 

AAGAAGGAATGTCAAAATGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTC 

ACAATTGAAATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAA 

GGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACGGACATAATTGGATTTTTTTT 

GCCATGGCCTGGAAAGAAAGGTACCT 

Sequence 807 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCGAATAGGCAAGAACACAGGCAGTA 

GGGCAGCTTTGACAGCCTCAGTAATTTCCAAGGGTATTTTTTGACAGAAGTATCACAAGT 

CTTAAAGTGATTGCATCGAAAATTTCTGATTCATTTGGGTAAGTGATTATCTTTCCATTT 

GCCAAGGACCTGTTGTATGTATAGTGGTAAAACAATTTGTATATTTATAAGGGAGATATG 

AGGTANAACGTAAGACTTCTTTGGGGAGCCGACAGTACCT 

Sequence 808 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCGAATAGGCAAGAACACAGGCAGTA 
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GGGCAGCTTTGACAGCCTCAGTAATTTCCAAGGGTATTTTTTGACAGAAGTATCACAAGT 

CTTAAAGTGATTGCATCGAAAATTTCTGATTCATTTGGGTAAGTGATTATCTTTCCATTT 

GCCAAGGACCTGTNGTATGTATAGTGGGTAAAACAATTTGTATATTTATAAGGGAGATAT 

GAGGTAAAAACGTAAGACTTCTTTGGGGAGCCGACAGTACCTCGGC 

Sequence 809 

CCGGGCAGGTNCNCNGGGATNGTCACAATGAGATGTCTTGCACCCTGCCACTCTNNCCAT 

TTGNCTGAATTCTANTATCACTCANGTGATCTGGAATCTAATTCTCACCCNNAACTTTAA 

TTGCATGAAAAACTAANATATATAGTNGNTAGCTTTCCACCATTTGCATCTTGCANGAAA 

TGTCNGAAATATATTGGAGTGGC 

Sequence 810 

CCGGGCAGGTACGCGGGGGACTTACAAGATGGATAGGACCTGGTGGCAGCTGTCCTTGAG 

GAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCANAGTACCT 

Sequence 811 

TCCTATCGACTTACTTAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGG 

GACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTTCGGAGGATG 

TTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTCACAGCAACT 

AAGACCMCCCAGCTCTCAGMGAAGGAATGTCAAAATGTCATGTTCMTTTTACATTCA 

GTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTACTGAAGGAGGTGAATCCATGC 

ATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGACTCAAGACG 

GACATAATTGGA I I I I I I I I GCCATGGCCTGGAAAGAAAGGTACCTGCCCG 

Sequence 812 

AGGTACCCCTTTCACGGCATAGCTTCCTTGAGGGCTGGGGTAAGAAACCCACAGTTCTTC 

TGTTTCTGCTGGATTTTTCCCTGTGGTTAATCCTTAACCAAGAGGCACTTCTCAATCAAC 

TCTTCTTCCAAAAGAAGGACCCAGATCAGTATCTTTGGGTGGCTTATGCTTATTTTAAAA 

TAATATCCAGTTGCTGCTTGTTGCACGAAGTCCAGACTCAAGAGAGAATGAGCATCATTC 

AGATAATGCCTCACTGAAAAAAAAAAANCAGCAGNTGAACCCACACCTCCTACTCTCCAT 

G AAATTAATTGCCTATCCCACCTTAAAAAACATGTN ACCTGCCCCCG NTTAC N TGCCC 

Sequence 813 

AGGTACCAACCTGGTCTCCAAAGTCCTCTGGGAATTTCCACAATACCACAGGATCTTCAT 

TTTCAGAGGCATGACATTTCACTTTCAACACCAAGTCAAAGGTTCTGTCTGGATTTGCCC 

TCTGACCAGNTTCTGCTTGTNCTCCCTTATCTTTGTTCCTCAATCTGCCAGACATTGTAN 

AACTTAAAACAGTCCTTGGAGTTCAGAACAAAGAATAATTGANGCAGAGTGTCACCAACA 

GATGGTAATCAGTGGAGTCAAATTTTCTTCAGAGATTCTTTGAGGATTCTCCTTTGGACT 

TCAAGAAAGAGTGCTGTCCCGCGTACCTGCCCG 

Sequence 814 

AGGTACACAGGTAACCACATTTAGATGGACTGGGATGTTGCCACACATACAAGCATTGAT 

AACTGGCTTCTCATTACCTGAATACATTCTTCTGTCAGAGCAACAGACTCAGCTATGCTT 

CTGGCAAAATTGTTCTTAATTCTCTATTGATTAATTTATTCGGTAAGTATTTATTGGGTA 

TTTTCTGTCTGAAAAGTGCGATTCCAGGGGCTTTATGTGTCTCTGGGGTGGGGGTGTTAT 

ATAAATACTTATAATACTGTATCCATACTCTTGAAAAGCTTAGTTGGGAAGGCAAGGCAT 

GCAATAAGGAACACAAGAATTTTAAGTCATTCCACAACCATCTGTTGAATGGCTGCTATT 

GTTAGTATCGTGGGTGGAAACTGGAGAAAGCAAAAGATGACTATMTAGGATCTCTTTTC 

TGGGAGATGCCACAGTGGACACCGTAGTTATATGAATGAATGAATAAAGGGACCTCCCAG 

AATAGTTCTTATACATTGATGCTCTNGGGGCCCCAATGCCAGAATTCTTGATTGAAGAAA 

ACCAAATTAAACCTCTTTTTTTGGCTTGCTTC^ 

CCTCAAGGANGG I I I i M GG 

Sequence 815 

AGGTACAAGGTCTGAGGAGAAGGTTGAGGTCTCCCCAAGACTTCTCTAGAACACTTGGTG 
AGAGGCGCCATGAGGACAGGTGATGCTTGGGCTTATAGCTCTGACTCAGGACATCACTGC 
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TCAACAGCCTGAAGCCCACATCCAAAACTCATGTCATGCTGGCAATTGAAGGGGTAGCAG 

^^ A ^ AAGTCCAGAGGGGCTCAGCAC AGGGAGCAGATGTCTGTGAGCAGGAGAGACAG 

CGGCTGTAGGGATCGATGAGACTTGGTAAGCATGATGTGCTCTTGTCAGCATAGCATCTT 

TGAGCTCACACAATAATTGATGTAGAATTGTCCATTTCAGTAAGGAGTTTCCACATCCAC 

TATCTTCCACGATTCTCACAAGGGCCTTGAGTAAATATTACCTGCATGTTATGGGGTGAG 

AAAGGGGTAGTTTAGAGAGCTGAANGGATTTGGTGAAGGCCACATAATGAGTAAGCCTGA 

AATCAAGGACCAGACCCCCAGGC1TCTAAATGTTAGTCTTGGGCTCCTTGGTTCATGGGA 

CTATTATCATCCCTTGTACCTGCCCGGGGCGGGCCGCTCTAAN 

Sequence 816 

AGGTACTCACATGCTGAGGGTMGAGGCAGMTTTCTTTTGCTTGTAAGTAACAGGATCT 

TTTCCAGAGCCTGGGTCTGMGACCCCAGGTGTAGATGGAACTTCAGTTATGACTGGATC 

CGTGAGTTCAAAGGATGCCACTCCCCGCGTACCTGCCCG 

Sequence 817 

CCGGGCAGGTACGCGGGGGGACTTACAAGATGGATAGGACCTGGTGGCAGCTGTCCTTGA 

GGAAGCCAGAGCCTGGAGAGACAGATCTCATGTTATAGCAGAGTACCT 
Sequence 818 

CCGGGCAGGTACAAACCTTTGCCCTTCTCAAACCTTTGCTAGATAAAATCATATCAGGTT 

TTCACTTACATTTAAATCCTTATTTAGGCATGTGTGCAGCCTTMTrTCCATATTTTAAG 

CTATATTGCAGGTTACTTATAGCAATGCTAGMTTCAGTGAAATATAAATAACACTTTTT 

GAGACCCACTGAAGATTGTAAACAAATGCTTAAAAATTGCAAAAGTGTAAGTAGGCAGAT 

GTTAATAGCAGTAGCAATCATGCATGTATTTCATTAAAATTAAAAATCAGTTTATATGGA 

MTCTGCAAATGTGCTAAAGACTTTAAAGGACATCTTCATCAGAGTTCTTTGTGACTTGG 

AGAGAGAACACCTTTGGAGAAACCACTGTACCT 

Sequence 819 

CCGGGCAGGTACAACTGTGATCTGTCCCAACTTCCTTTTGCTTTCAGACCCCCATCAGCC 

TTTAGCCAGGATCCTTATACATGCAGCAACGAAAGCATCCTATGAAGTCATTCTAATTAC 

GTTACCATCCAGCTCAGAAGTCACCAGTGACTGCCAGCTACCAACACAGACACACTGGCT 

CCTCAGAGCAGAACTCATGCCTAACTTCCACAGGTAGAGGACTAGACAAAGGACATACTA 

GAAATGTGGCTGAATAACAGCACCAGAAAAGCAAGGCAGCTATAATCTCTCCCTAAATGG 

CCTAAAAAACAGTTTCTTTTCTAAATGGCTGCTAGTTTTTTGACGTGCTGCCTACAATAT 

AAAATACTCTCAAAAATACCAAGCTACGCAATATTTGTTCACACCTACCCAGGAACTATT 

TCAGAGGTTAGAAACACTGGTGAAGGAGGGAGAAGGAGGGAGAGACTCATGATAAAGAAC 

AGGTTTCTAAAACCTAAAATAATAACTACNAAAAGAAATAACAAAAACAAATCAGTCTTA 

AGATAAACGTGCCAGTAAGAAAAAAAACAGATAAAATTATGACATATCAAACAGTAA 
Sequence 820 

AGGTACAGAACTNAAACACCACTATTTGTTCGAGATGAAAAAAAGCATATATAGGAAGCC 

TTCAAAATGAAATGGTCAAGGGTGAGTTTACACAGATAGATAGATTTAGGTCTCTTCCTT 

TTGCTTTGTGAAAGCATCTAGTGTTTTAGGTGTCAGAGAGGGAGATATCCTTACAAAGCA 

GAGATTATCATTACAGGTTTACATTTCTTACAAAGAGTTTCAAAATAAACAGGTAAATGC 

CAAAAACATATATTTTGGTGACGGATTAATTCACNTAGTTGGNCTATTCAACTTAACTTT 

GTTTCCTWGGAGATTAAATTCATGCACCAAATAACCCAATCCCAAAAATTTAAACNCA 

TAAAGAAATACTTCACCCAGAAAGNGATGGNTCCTTTTACAAGAAGCAGGANCCCCCCAA 

AAATGTAAAGAGTTCACTTGAANAAGGTGGGGAAGCTCAAAACCAAGGAGGAAGGACCTT 

TATTTTCGCANTCATAAAAGCCCAAACTTTGTTACCNTCGGGCCCGCTTCTTAAGAACCT 

ANNTGGGATTCCCCCCG 

Sequence 821 

AGGTACTCAAGCCACCTGCCTGTCATGGCCCCCCAAAATGCTGGGATTGCAGGCCACCAT 
AACCGGTCATCGGGGCAACTATCCTTCAGGGCTTCCTCCAGTAGGGCTCCCCAGGTTGGC 
CTTGAGGGCCACAATAAAAAATCTGACTTAGCTATACTCCTTCCCTGTCCCCAACCACAG 
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TTA TTCTT GTGGATGAGGCAAATGGCAGCAGGACAGCTGGAGTATACCAGAGTTGACTCA 
TTTATTTTATGCATGCTTAGTGTGGCATCCCTGGGGACATGAAATTCCAGTGAAATCCAG 
TCTTTGTGCTGGAGAAGCTCATAGATCTGTGGAGGAGGCAGATGCCCACCCACTCACTGT 
ACCTGCCCG 

Sequence 822 

CCGGGCAGGTACTGATATGCTGGGCAAGTCTACTGAGGCTTAACCTTGTGGTTAACTACT 

TGGAAAATACAGAATTGGCTAGTGAGCCACAGACCCA7TTGAGGCCAAGCCATATGGCAA 

GACCTTTGGGAAATAGGTCATCAAAAAGAGGTANCTAT7TATCATGTGTCANGCCATATG 

CCTTTGGCCACCCCTAGTAATGATGGGGCAGATGCCTTGGCTAAGGTCAGATGGTCAGAG 

TCAGCACCAACACANGATGTTGACCTTGTGGCTACACCGNAAACTGGGACATGCAGGGGG 

TAAACTGATGCAACAATTCAATAAGTGTTGGGGTCTGTCCCTTCCCAAGCAAGACATTTT 

TGAGGCTTGTCAGAAATGCCTGGCATTGTGTTCAGACATATCCTAA 

Sequence 823 

AGGTACAAAGAAACCC7TTGCAGACAGCTCA7TTATAGAGAGAAGCTAAGTATTTATTTA 

CTTGCTACTCAATTTAAAATACGACCATGGCAGGCAACCAACCAGGAGAAGAGCTAAAAG 

TTTGAAAGCACTGCCCTTTTCTCTACTATAGAAGTTACAGGGTCMGCCTTGGAAMTTT 

CCACAAAATAAGGCTTTGCTAMGATATTTACCTTATCAGAAAGATTGTATCTCAAAAAA 

GTTACGTTTTTCTTCAGGCAAGACAAGAAAAACTGCCTGTCCTAATCGACAGTACCTGCC 

CG 

Sequence 824 

AGGTACCTTTCCTAAAGTGTGGTGACCAGAATTTGGCACAGTGAGGGAAGGTTCCAGAGC 

CCTTTAGTGTTATCGGGGGCACAGAGCTTGGGCTTTGGAGtCGCTCAGATCTTGAGTTGA 

TATGCACTTCTTTGTCCTTGAGCAMTGATTGAACCTTCTTGGACTCTCAGTTCCTCATT 

AATAAAATGAAGATGAAACAAGATGAGATGGACTGTGCACATTTCCTGATGGGGGCTGGT 

ATGTGCTGGTGCCCAGTGCAAAGTGGCCGCTGGCAGAATGGTGAGTCACCAGTACCTGCC 

CG 

Sequence 825 

AGGTACGCGGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTA 

TGGTCGGTGTCTANAGCCTCACAGCAACTAANACCAACCCAGCTCTCANAAGAAGGAATG 

TCAAAATGTCATGTTCMTTTTACATTCAGTGCCTGGMTCTTTTCTTCACM 

GAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTAC 

ATAAGAAGCAAGACCACAGACTCAAGACGGACATAATTGGAI MUM I GCCATGGCCTG 

GAAAGAAAGGTACCTGCCCG 

Sequence 826 

CCGGGCAGGTACAGTGAAGCTTTGACCTGAGGGGCTCTTCTGTAGGTGAAGAGTGGTTAA 
CTATGGTTCCTGTGTGCCTGGGGCCCCCCACTCCTAGGACTCATCCACCCCCTACCCAGA 
ACTTACTAGTACCT 
Sequence 827 

CCGGGCAGGTACTTCATGGTTACCATAGAAAGCAGCAACAGCAGCAGGTGCACGCAGTAA 

GTTACCCATAAGTAGCTACACAGAGAGAAGACATCCATGGGAGAGGCTTGTGTGTGGTAA 

CCAGAACTGCCTGGGTTATAAGTATATGAACTACAAGACTAGGCATCCCTAAGACACAAT 

AAGTGCTTTTTGGAGCAGTGCTGGAGTTCTTGTTCTACACAGGAGAGGGTAAAAGTCAGT 

GCAGTTCAAGTCACTCCTGCCAAAGAATGCTTGGTAGCCATGGGCTAGTGAGACCTGGAG 

AAATGCACATTGCACAACANTTTAAAATGTATTGAAGACCTACTCAATGCTTGTCACTAA 

AGCTGGATTCTGTGGGGAGTATGCAATGTCTAAGGCACTGCCTGGACATTAACTGAATGA 

ACAAATGCATGAATGGATGAACATATTAAGATGGTTAAACACATNCCAAGGCCAGTGGTT 

TCTTCAAAGTGTGGCCCTGGAACCAANCANCATTCAGCTTTCACCTAGAAATTTGTTTNA 

GAAAATGAAAAATC 

Sequence 828 
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JS»^ ^^^^^^^^TCTAAANGNTCAATGANNNANGCTNTT 

tcqSgc^ttc^ nacntnncgangcagacactattta cntangnggatctcggccgc 

Sequence 829 

..aggtacacactgcgtgcaagactctgctgtgtgcagtcagccctgcatcgtttctaacca 

gagggtcacagatactgaacaacc agtgag7tcccctatgtggattgccagaac 
gtgatgcttgcaggatcggtacctgcccg 

Sequence 830 

A^ G ^o™ CGATCCTGCAAGCATCACAGC ™ GCAGACG TC 

TGTGACCCTCCCTCGCTCA TGTCTGTATAATCTACATAATCTACCC 

IJIII ^ TCTCTGCTCTCCTTCCTTCCACTC TTCAGCATCAACATGTGTAGCTCTGGCCT 

SiJS ATGATGAAAGTAAGGCCAGTATAACT GGTTAGAAACGATGCAGGGCTGACTGCAC 

ACAGCAGAGTCTTGCACGCAGTGTGTACCT 

Sequence 831 

AGGTACCATANGTGGNCANNTGATNGNATNGNAAGGGAGGGATCTTTAAANCTTGCCTAA 
GATG7TANNATATANGGNCTGANATTGNAACACAAGCNNTCTGNTTCTAGAGTCCCTGTA 
TCATNTTTGGTATCCTTCACCMGGNMGGATGGTCTCTTTCAGNCTrAGTTTTCTGACA 
C ™r GmTmAAGmAGATGAACAG ^ G ^ G AATGmTNANAT^ 

AACATCTGTTGGAATGACCCTGTCATTTTCCTTTGGTCTGTTAATATAGTAATTAGCAGT 
^^CTCATTTTGMCTGTGCT™^ 

TGGTTTTGTTTTCAAATNTAmAGACTTTTCTCTMCCATACTAC^ 

^AWG^A GGCCCMNG ^ TGATACCAGATGGAAATCG 
Sequence 832 

CCGGGCAGGTACACACTAGCTGATAAGACACTGTTGCCCATAATGTCTATTTATTGGATC 

AGCAATTTATAAGTCCCACATTCTCATGCCACATAGCTCTACACAGCTGCAAAAATATAC 

CATAGCTTGCAGGTGATCATTGGTTTGATAAAAGATATTGAGTCGCTCATCTTGTGAAAG 

TGATCTTTGATATAAGAGGAGCATCAGCGGGGAAGCTCACATGTCCCGTGGCTCACACAC 

CAGAAGGTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTCCATGGCAATGGCTTTTTCAG 

AGAGGCCTGTTTTGTGGTTAACTGTGTGTACCT 

Sequence 833 

GAGAAATGGGGTCTCACTATGTTGCCCANACTGTTCTCATTCTCCTGGGCTCAAGCAGCC 

NACTNTTATTAAAGTAAAACAGGATGGAATAGGTAAAAAATGCCAGCAGATAATCAAAAT 

ATTAGCCTAAGAGCCCATGTGTMTACCACAATTCCTAGCAATAATTTGAGTTGAGCTTT 

CTCATATTTTCCTTTCCATGTAAAAGATACTCATCTNTTCCCCTTGTTTACCATATCCAG 

CCCCACTAGCTGCCTmAGGTTTTTGATTACCACTTAGGATCTGGTATGCGAAATTCCT 

TCTGCCTAAAAAGCCTTTNTCCCCAGTNATCACATTGGNGTCTCCTTGTTGGNTCnTTG 

TTATANAACTTAACCGTCATTTTTCAGAGAGGTTTTCTNTNACTACCCTATCCAAAGTGC 

TCAGTGAATGTTAATTGCTGTCATTTATTAAATGAGATNITCCCTTGNTATTCTTTGGGCT 

GANCANCCTGTATTTCCTTCATAGCATTGACCACATTTTGGAAATTTCTTT 
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Sequence 834 

ANGNACCNNTTCCNAANCTGNGNCTNAAGAGGNNTNNNTNGCTNCTGNACNCTTTGGAAC 

TNTNTNACTGATGTGANCACGTNAACANGCTNACTNNGTGGAAGGGGAGCAGCTAAATGG 

ANCAGANCCAAGTCATTTTACTTGAGTCTGGCCCTNTCANACCAGCCNTCAGCCATCTGA 

CCAAAAATGTGTTAAGACANAACAGCCAACATCAGCAGAGCCACCTACCTAACTTGTACC 

TGCCCG 

Sequence 835 

ccgggcaggtacaagactccacattnttttagtttgatataggnacatttcatgcttcat 
. aggamcaattaatttcatctattaanagaaaccaaatgagaaaacctccaactaaatct 
atcactnattnmcctataagcccagaaaaaaatttagcttcttcttgcctcggtttatc 
tacattagctcctgmcgmtacactggggcagggagamgtcattttctagtcttgttg 
ccatacaaacttgagttcttctccctgmtaacaggagtaacagtccmttttaccagaa 
caccagttctgagagaatcaactgaaggaacaaggtatcacttcagccactgtgtctttc 
taaaagtttctttcagggatgataccttaatttattt^ 
taaaatactaagtatacaccaggttggataaataattggcattcagtaaatacacttatt 
attatctttttmtccatamgatc 

cggtcttgctctgtcacctgggctggagttgcagtggcactccacaccaaaaatggntca 
ctgtactttnacctcctggactcaagcccatccctttacctcgggcggtctagnaactan 
nggatccccccgggcttgggangaanttcgatntnaangcttatttnattcccgtnnacc 
cttggnngggggggggccccggntacccanc i i i i igg 

Sequence 836 

CCGCGGTGGCGGCCGAATTGGACTGGAGTAGCTGCAGCAACTGCGACCTGGCAGCAAGTA 

ACCGGACGAAATGGACAGATGGCCGTGGAGGAGAAGCCTGACCTGACTGAGGGAGAGGAC 

TGATCCGACCAGGAGGGGAGTCCCCCTGAGCCTCTGGGGGGGCTTAGGAGGTGTCCCCTC 

CCGGGTTTCGGCACCAATGAAAGGCAATGGAAGGGCACCATGGATCAATAAGGCTGAGAG 

GTGAGAAAAAGAGGGCAAAAATCTTCTTTATTGAGCCCTCGGGCGAGGTTCACTGGTCCG 

CAGGGGGAGGGCCAGGGAAGTCGC 

Sequence 837 

NCTNACATNTGTAGNGAACAAAGATCAGAATTATAAAATCAAAAATAAAAATGTAGCCCT 

ATTTCCAGGCTGAAGTGAGCTGACAAGCCACATCCACTATGTGCCTGGGGGTCCCTGTCT 

TGGGTCATTCAGNCTCCCAGAACTGTCCACAAACCAGTAGACAGTTTCCTTCTCATCTTG 

GAGAGGTGGGGAGAGCTGGGGTCCTGAGTTGAAAGGCTTGAGCTACTGGCAATGGCTCTG 

ACATGAAAACAAGCCCTATCTCTCTGTACCT 

Sequence 838 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACATTCAGAGGTGAGCCCAGAGGGGG 

TAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTG 

TATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAA 

TGTCAAMTGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCGCAATTGAA 

ATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACT 

ACATAAG AAGCAAG ACCAC AGACTCAAG ACGGAC ATAATTGGA T \ WWW 1 GCCATGGC 

CTGGAAAGAAAGGTACCT 

Sequence 839 

AGGTACGCGGGGACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGAGAACTT 

CGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAGAGCCTC 

ACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGTTCAATT 

TTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAATGAAATGTGCTGAAGGAGGTG 

AATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGACCACAGA 

CTQAAGACGGACATAATTGGAI I I I I I I I GCCATGGCCTGGAAAGAAAGGTACCTGCCCG 
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Sequence 840 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATCCCAGTGATAGGC 

ATGGCTGGTCTCAAGATGGGTCTGGGGCTGGGGGTGTGAGTGCTCAGGGAACTGCCATAC 

AGCTTGAAGAAGCAGAGGTGGGATTCCGGGCTCCAAAGCCTCTTTCCACGCCACTACTTT 

TG CCATC TTCTTAAAAACCATC ATTTTTCACACAATTC ATTAGAAATACAGAATTTCTAC 

CCTATGACACAATTACATGCTTATAATCAATATATTTTGTATATTGTMCCTTTTCTTAT 

ATAT TATTC TGTTGTAAGGTGTAMGGGAAAATTAGAAAGAACAGAACAGGAACGTTCTA 

TGCATT1TACATCCCTTAAAATCTCCTGCAGAATATACAGTGATTACTTAGCACAATGCC 

.AAGTATATGGCACACTCAGTAAGCTATAGTTATTGTTATCTGTGTTATAATATTATTGCA 

ATTTCMCATGATGAAAATTGTATTTGTAAGTGAAGGCAATGTGACCCAGAGGTTACACA 

ACTAGATGTCANGCCCCAGAACTGGCCTGATTCCCCCGCNCATGGTTACCGCACTACACT 

ATGCTGGTATTCCTACCTGTNTNNTTNNACGCAGTACCTGCCCCGGCCGGCCGCTCTA 
Sequence 841 

AGGTACTTTTGCTCTCTGTCTTTGCCAATATTTACTTTTGA 

TTGCTTTTGCTCTGTTTAAAACAGGCTNTTCAGAAAGGCAATAAGGAAGCTGCAAGAATA 

CATGCTGAAAATGCAATCCACCAGAAAAATCAAGCAATTGATTTCTTAAGAATGANTGCT 

AGAGTTGGTGCTGTGNGCAGTCAGAGGTCAAAATGCAGTAACAATGGGCAGNGACTTCTG 

CCTTGATGGNACAAATTCAMCACCAGTTTTGAAAACACTGGGATATTCAAACACCAACA 

AAAATGGGAAAGATAAAAATGGAGGTAGCNTCTTACANCCCTAATAACACCCACATCACC 

TTAACATGTTGGTTTAGTCAAATGGAAAMGAAGAACCTTTGTTTATTTACTTACCTT 

Sequence 842 

CCGGGCAGGTACGCGGGGACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGGGA 

GAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTGTCTAG 

AGC CTCAC AGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCATGT 

TCAATTTTACATTCAGTGCCTGGMTCTTTTCTTCACAATTGAAATGAAATGTGCTGAAG 

GAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAGAC 

CACAGACTCAAGACGGACATAATTGGA'I I I I I I I I GCCATGGCCTGGAAAGAAAGGTACC 

T 

Sequence 843 

GGCACTTTGATCATCTCACTCACTGCAAATGTCATGTGAGGTTTCATGGAAGTATTAAGA 
GATCCAATTAAGAAGAATAGTTCTGAGTCTAAACCAGCCCAGAGTGGCTTCTCTAGGGGA 
AACTCCCCGCTCAGCTGGTACCT 
Sequence 844 

CCGGGCAGGTACTATCCCATTTTCCTTTGCCATGCACCTGTCTTCCTCTTCAGGATTGAT 

CACAAAATCAAAACCATTCCTACCAGCCAGAGGTGGGCACATGACCCAGGTCAGACCAGT 

CACAGTTCCTGATTCCCGGCCACAGTGCTTGATGGGGGAATGAGCATATATCAAACCAGG 

ACAGTCCAAGTCCTTCCGGGGACTTTGCTGCCACAGTTGACAGGGGGAGGTATTTGCATA 

ACTTCACCACACACAGCTAAGCTGGTGATCCTGCTGCTTGGCAGCATCTTCCCAGCGTAC 
CT 

Sequence 845 

GGGGAAGCNTCCCACCCGCGGGTGGGGCCGGGCCCGAAGGTACCCCCAACNTTAAAAAGG 
ACCGTTTAGGGGTCAAAAGGGGTNGTTANCCCCCATTGGAGGGGTGGGGCCAAGGAAAAA 
ATGGGGCCTAACAA I Mil I CTTAACCCCCAANAAAAAACTTACCGAATAGGCCCCTTTA 
TTTGAAAACTTTAAAGGGGGTTCCGAMNGGTGGGG 

TGAAGAAGTTAAGGAAGGTGGCCTTTTMGTTTGGAAACAAGGGCCCAAAAAATNNNANA 

ANATNNAANAANNNTNTATNAAAAGGTTNCCCTTGCCCCCCGGGGGGCCGGGCCCGGCTT 

CTTAAGNAAACCTTAGGTTGGGGAATCCCCCCCCCGGGGGCCTTTGCMGGGGAAAATTT 

TCCGAATTATTCAAAAGNCTT7TAATTCCGGAATTAACCCCGGTCCGNACCCCTTCGNAA 

GNGGGGGGGGGGGGCCCCCCGGGTTACCCCCCAAACNTTTTTTGGTTTCCCCCTTTTTAA 
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GNTTGGMGGGGGTTTTAAAATTTNGGCCGCCCGNCCTTTGGGCCNGTTAAAATTCCAAN 
TGGGGTNCCAATTMGGCCTTGGTTTTTNCC^ 

TANTTCCCCGCCTTCCAACCAAAATTTTTCCCACCCACCAAAACCANTTTACCCGNAAGG 

CCCCCGGGGGGAAGGCCAATTAAAAAAAGTTGGTTAAAAAAAGGCCCCTTGGGGGGGGGT 

TGGCCCCTTAAAATTGNAAGGTGGGGMGGCCTTAAAACCTTCNACCMTTTTAAMTTT 

TGGCCGGTTTGGCCNGGCNTTCAMCTTTGGCCCCCGGCNTTTTTTCCCAAGTTCCNGGG 

GGAAAAAACCCCTTTGTTCCGTTGGCCCCMNCCTTGGCCAATTTTAAMTGGNAAAATT 
CGGGG 

Sequence 846 

CCGGGCAGGTACGCGGGTAGGGATGCAGAACTGGCCACAGTCAGGGCTGGCCAACAAGGG 

ATGGGATGGCAAGGGAACCAGTGGGAGACTGTTTACACCTTTGACTTCCTCTTCACAGGT 

CAAGGCAGGACTGTAGCATTAGGTCTNAGAGATGCAATGAACAGGACAAATACACTTCCC 

CTTCATCTGGACCTGAGGCTCTGGGCAAGTCAGTACCT 

Sequence 847 

AGGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAG 

CCCAAGCGCAGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTC 

GGACCATCCCTTCAGCCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTAC 

CTGCCCG 

Sequence 848 

CCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGCGGGTAAAGTGGACTGGG 

GAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGTATGGTCGGTATCT 

AGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGAATGTCAAAATGTCAT 

GTTCMTTTTACATTCAGTGCCTGGAATCTTTTCTCCACAATTGAAATGAAATGTGCTGA 

AGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTACATAAGAAGCAAG 

ACCACAGACTCMGACGGACATAATTTGGATTTnTTGCCATGGCCTGGAAAGAAAGGTA 

CCTCGGGCCCGCTCT 

Sequence 849 

NNCGGAGNTGNCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAG 

GTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAG 

ACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCT 

GCAAATACGACTTCATCATGCTGAGTTGTGTGCAGNTGCAGCGGATTCCTCTGAGCGCTG 

TCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAG 

GAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGA 

ACCCATGGTCCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCA 

NGGGAGAAATTCCTTGAAAT1TGCAAGTTGTCTGGGTTCATGTCTAAGCTTGTTCCAGCT 

ATCCAGAATGCCCCCAAGAATTTAACTGGGATCTGGAAGAGGAAAAGAAAACTGATGGTG 

TTAACTGAACC 

Sequence 850 

AGGTACACATACAGTTTrTAAACCTAAAGTTNCTGGNGAAACAAGCTTGCTATTACNNAG 

ATNTNTNNTGAATGTCNTAAAGCTATAAATCCACTCATCGNTGTGTTTAAGGAGGAACTG 

AAGCACAATTGTTAAGAACAAGTGAATTAGGTGAATATACATTGACNAAAGGTTGATAAT 

AMGTTNGTCAGGAATTTCAAAAATMTTTAGTGTGACTTGAMTCTTTAAMTCATGTT 

ATATTAANATTAAGTAACACTTTACTGATTTAAATTATTTGGAGTCATTTCTAAGGGAAA 

ATACTGAAATATCAATTGCTTMCMGAMGTTTTAAAATATMCGGTMTTTGGGCATC 

CTGGGTTTTCNCATGTTATGGGAAAAAGCCTTAMCCATTATTTGGGGGCCTGGTTAATT 

TAAAAGGGCCATAAAAAATTATTTTTATNGAAGMTGGTGTTCCATNTGCAAAAATACTA 

ACATGGATGGCACTTCAAAAATGCTTACCTAAATTTTTCACCTAAGAAATTAAGGTTACC 

TAAAGAAGTTAAT 

Sequence 851 
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CCGGGCAGGTACAAGTTACCTACCCTTTCCTTGCCTCAGTTTCCTCCTTTGTGACAATGG 

GGACAATAACAGAAACTGCCACACAAGAGGGATGTTAGAGAACTGACCAAGCGAATGTAA 

GTMTGACTTGGAGCAACGTATGGCTAAGAGTAAGTTAGGGGTTGTTACGATGACTTTCC 

TGAGATAAACAGGGAGATCTTTTAGCTGTGGGTCAAGTGCAGGAGGACAGTGGAGCTCAA 

GGCCTAATGGAAACTAACACTAATTGTACCT 

Sequence 852 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACGTTAAGAACTGGACTTACAAGATG 

GATAGGACCTGGTGGCAGCTGTCCTTGAGGAAGCCAGAGCCTGGAGAGACAGATCTCATG 
TTATAGCAGAGTACC T 

Sequence 853 

CCGGGCAGGTACTTACCGGGGAGCACAGTGGACTAGGGGATGCCCTTGATAATGACGGTT 

TCCTAGGTGCAAGATCAGAGCATGTGCCTACACCTTGTGCCCTGATGCCGAGGTGGAGAC 

CA7TGGGAAATAATTAATAGATATTTTTCTGGGGGGAAATCACAAGTCACTGAGGTGTTT 

GTTTGTTTGTTAGGCAGTGCTAGAGCTTGAAGCCACTCAAGCCAGTTGGGGAGGAAAGGG 
ATGCGGAGGTACCT 

Sequence 854 

AGGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAG 

CCCAAGCGCAGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTC 

GGACCATCCCTTCAGCCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTAC 
CTGCCCG 

Sequence 855 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCAAGCTTCACAGCCCTGTCTCTGCCATGGAG 

AGCAGGCCCACACACTGCTGCCTGCTGGAGCAACCGTGCTGGTCACAGGGCCAGAGGCAG 

AGAGCCCTAACAAGCAGTCAGGGCGCTAATGAGGAAGGGGGGACAACCTGTCCTCTACAG 

AAMTAATGAGATTAATATGTTCAACAACTCCCATCCAGAATCCAGCAGGAAGTTAAACC 

AMCGACACATGATTCTCTGACGAAGAAATAAGAATACGGTTGACTGGTTTTTTATATTT 

ATTTACCTGAAAGCTTCTAATGTTTCTAACTAGTGACATATTTCAGGCCAGGAATTCTGT 

TTGCCTAGATCATGATGTCTAAACACTGCACACTTGACCAACTTTGGATCCAAATAACTA 

GCTTTGTATGCTCTTGGGCCCTGCAGTACCTCG 

Sequence 856 

TCTGAAAAAGCCATTGCCATGGACTGCCAGACAGACAATGACAAGACACAAATACCTTCT 

GGTGTGTGAGCCACGGGACATGNGAGCTTCCCCGCTGATGCTCCTCTTATATCAAAGATC 

ACTTTC ACAAG ATGAGCGACTCAATATCTTTTATCAAACCAATGATCACCTGCAAGCTAT 

GGTATATTTTTGCAGCTGTGTACAGCTATGNGGCATGAGAATGTGGGACTTATAAATTGC 

TGATNCAATAAATAGACATTATGGGCAACAGTGTCTTATCAGCTAGTGTGTACCTGCCCG 

Sequence 857 

CCGGGCAGGTACGCGGGTGACGGTGTGTAACAGGTTCAAATAAAGATTAAAAAGTCAAAC 

GTGCACCAAGGCTCTCCTCTGAGGCCAGGCACAGAGCTAGGAACAAAGGGCAAGTCATGA 

GCTGGCCAATCTGCAATATCGTGAATACCCTAATCATCCTGGTTTATTTACAGGGTGGAG 

CTGTATGATCTGTAAATTCGCTTCTACATTTACATTTCTGTGATTGATCATTCAGCCTAA 

TTAGAGTTTTGTTTGGAATGTCAAAATGTTTATGGGGAAAAGACATCAGGGAGCCAAGGA 

AACCATTGGGGACTNGGGGMGTCTTAAGCCACAAGGAATCTTGCCTCATTAGGCCCCAC 

TTATACCMCCAAAACAGGGAGGGATATTTATGGAGGGGGATTTTCTGTGTCCCAACTTA 

Nl l l M l ICNTNTCCNCCCAAGNCCCTGGGGANNNTACCCTTCNGGGCCCGCTTCNTNAG 
AAAAACCTTAN NGTGGGAATTCCC 
Sequence 858 

CGAGGTACGATGGCCGGGCGCGGTGGCTCACGCCTGTAATGCCAGCACTTTGGAAGCCTG 
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AGGCGGGTGGATCACTTGAAGTCTGGAGTTCAAGACCAGGCTGGCCAACATGGTGAAACC 

CANTCTCTACTAAAAATACAAAACTTAGCTGGGCGTGGTGGCAGGCTCCTGTAATCCCAA 

CTCTCAGGAGCCTGAGACAGCANAATCGCTTGMCCTANGAGGCGGAGGTTTGCAAGTGA 

GCCAAGATTTGTGCCACTTGNACTCCAGGCCTTGGACAACAAGNAGCCGAGGACTACCGT 

CTCAAAAAAAAAAAAAGGGATACCGATTTCCTTATT^^ 

TCCCAAAGGGCCCTTACCTCCTGGAATATCATCACCCATGGGGTGTTTAGGGATTTTTTG 
GACATT ATTA ATTNTTAGGGGCCACATTAAACC7TTTNAAGACTAACAGCCCATGCACTT 

CAGAAATTTTATTATCCAAAACATTTTTTACCTTGGACCTTTAAGGAGGAGGAGGAAAAT 

TAATTTTTTTCAAAAAAAA 

Sequence 859 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCCTGTAGTCCT 

AGCTACTCAGGAGGCTGAGGCAGGAGGATGGCTTGAGCTGAGGAGGCAGAGGCTGCAGTG 

AGCCAACATTGCAACACTGCACCCCAGCCTGGGCATTAGAGCCAGACCTAGTCTCAAAAA 

ATAAATAAAATAAAATAAAATAAAATAAAATGTTTAATAATGCCTACTCAGACAAAGAAA 

GGCTCAGGCAAAAGACTGAGTCTGATCTTCTACATTGAGGAGACCATGTTGATTAGTGTT 

ATATTCAAAAAAACCAATTCCCAAGTAATGCATACACAGCTGAAAGAAGGAAGTATTCCA 

GCTACACCCTCCAAGTTTTCTACATCTCAGTAAACGGCATCGCCACCCACCTAGAAACTG 

GGCATCATTTGGTTCCTCGTTCTCC I I II I II M I II I ! I GAGATGGAGTCTCTGTTGAC 

TCCGGGTTCAAGCCCCGCGTACCTGCCCGGGCG 

Sequence 860 

AGGTACAAGTTGTCT TTATG CTGCGAGATMGTCCTCTCTTGGTTTGAGCTCCCACCTTT 

TCA GTGAA CTCTTA CATTTT GGGGGATCTGCTCTTGTAAAGGACATCCTTTCTGGTGAGT 

ATTCTTTTGGTTTAA I \ I 1 I GGTTTGGTTATTTGTGCATGAATTTAATCTCATTAGGAAA 

CAAGTTAAGTTGA ATAGA CCAACTAGTGAATTAATCCGTCACCAAAATATATGTTTTTGG 

CATTTACCTGTTTAT1TTGAAACTCTTTGTAAGAAATGTAAACCTGTAATGATAATCTCT 

GCTTTGTAAGGATATCTCCCTCTCTGACACCTAAAACACTAGATGCTTTCACAAAGCAAA 

AGGAGGAGACCTAA ATCTATC TATCTGTGTAAACTCACCCTTGACCATTTCATTTTGAAG 

GCTTCCTATATATGCTTTTTTTCATCTCAACAAATANTGGGTGTTTAAAGTTCTGTACCT 

Sequence 861 

AGGTACAACAGTCCTAGGAACTCTTACTGTCTGTAGTGGAGATAGGGAAACAGGTTTATA 

AGGAATTAATAATTTATTGCTCAAGAGTAAGAGTAAATGAATAACTAGGAGATAATGAAT 

GTCCTGGCCATTTATTCTCAAAGAGCATGAATATACAGAATAGGCTGATTTGAGACATTA 

TACTGCTTTCTTCCTGGAAACTGACTTTGGTGGMTGGTCMGCTCTTACTTTAGAAGTC 

ATAAAGAGCTGTGTATGGAGAAGGTAACAGTCGGTATGTTAAAACAACAGATCCGGCCAT 

AGAATAGGTGCCTAATGGTTCTGAAATATAAGCAAACATGCTGGACAACTTTATCATTAT 

TTCTGGCATGGCTGGCTCTGGGGCTCATGTTTGTGATCCCAACTCTCTGGGAGGCCCGAG 

GCAGGGAGATTGCTTGAGCCTAGGAGTTTGAGACCAGCCCGGGCCGACATAA 

Sequence 862 

CCGGGCAGGTACACACTAGCTGATAAGACACTGTTGCCCATAATGTCTATTTATTGGATC 

AGCAATTTATAAGTCCCACATTCTCATGCCACATAGCTCTACACAGCTGCAAAAATATAC 

CATAGCTTGCAGGTGATCATTGGTTTGATAAAAGATATTGAGTCGCTCATCTTGTGAAAG 

TGATCTTTAATATAAGAGGAGCATCAGCGGGGAAGCTCACATGTCCCGTGGCTCACACAC 

CAGAAGGTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTCCATGGCMTGGCTTTTTCAG 

AGAGGCCTGTTTTGTGGTTAACTGTGTGTACCT 

Sequence 863 

AGGTACCAGGTTGGGCTAAGAGATGAAAAACCAGGCTCAAGTTGGGCCAGAGATGGACAA 

TGAGTCACTTCC ACCAG AATTCCCAGCCACATATGCCCACCTCTGGGTCAGAAGAGCCAA 

ACGTATTTATCGCTTTTGCTCAGGGCCAACATAGACACTTGATTCTCTCTCAGGGCTGTG 
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TTrrcCCCCAAATGTATCMTACATCCCTATTTTCTTCCCCCTTTGTnTCCTTTCCCTG 

GTGACCTGCTACCCCACACCCCAGCACACTGATGGTGTTCCCTCCACCACACCGAGA^CC 

CGCCACAAGCTCTCAGCCCTCGTAGAGCACAGGCCCCACCCTGAAAGTTGGCAGAGCACC 

GGGCTATCAAGCTGTGTCATGTGGAGCCAGCMGCCCGGCACAACCGTGGATCCCGGGTA 
CCTGCCCG 

Sequence 864 

CCGGGCAGGTACCTGTCTTTGGGCTTTATGTCATGTAATACTCACCCNAGTGACCTTTAG 

GGGAGATGTCACCGTCGTCGTCGTCATCATCATCGTCATCATCCCGACGTCATCGTTCCT 

ATTGAGAAATGAGACCACTGAGGTTTGGGGAGGTGAACTGACTTNCCAAGTTCTTACAAT 

GAACAGCAACAGGGCTGGGTGGGACATAAATCCATTTNACTTACAGTAAAAGCCAAATTC 

TTTATTATGCTCCACATGATCCTGTCCCTGCATNTCCTGACCTCTTCCCCCATCCCCCTT 

CCGATCACTCACTNCAGCCACACTGGCTTGGCTGTTNCTNAAACACACCAGGCAGCCTTC 

AACTCCAGGGCCTTTGCAAGAGGTCATTCCTTNTGCCTGGAACACTTTTNCCCCAAAACN 

ACCGCATGGNTTACTTCAAGMTGTGCTTCAAATGTCAATNTTTTTAAAAGGGAATTCTT 

TmGTGMCTTTTCCTGTTmMCMmcmMCMCTTGNTCCMN 

AACCTTTTTGGNCCNTTGCCAAATTTCNCAGGGAAGANANNGGGGNAAAAATCGNAAAAN 

NGANAGGTCCCTTTCGGNCCGNTTTTTAGAAACTANNGGGGAATCCCCCCCGGNCCTTGN 

ANGGAATTTCGAAAANTNAANNCTTTATTCNANTNCCCNGTCCNANCCTTAAAANGGGGG 

Sequence 865 

CCGGGCAGGTACTGGGGATTGATTAGCCACAGTCTTTGCCACAGACTCTTTTGGACTGAG 

CCGATAGGTCAAAGGTTAGCTTATGCAGTAGCATGGGCTGTAGGTTTTTTGCACTGATCA 

ACACCGGAACTTTTAGAAGACTTTTAGAAGATTTGTTGCTGCTGTGAGTGACATCAGTTC 

ACACAGCCTGAGAGAGAACTCTCAGTCCATGCTGTCCCTCCGTGGGCCCACTCAAGCGAA 

ATGGGTCGAGCTGCAACACTGTGCGGTGCCAACCGGGAGAAATCTGCCTATCGGGGCTGA 

GAGACATCCTGTAGTGGCTACCCGTCTCCTTTGCCCCAAATGTCAAGTGGAAGAGGTGCT 

TCACATATTTATTTCTTCCTACATGGGCTCATTTATTGTAAGCCCATTGTGGACCCTTGG 

AMGCCMGCTTTTATCTTGATTTTGGAGTGTGATCAANGTGACAGTTGTTCTCCGGATG 
ACAGGGAATCTGACTTTG 

Sequence 866 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 

CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 867 

CCGGGCAGGTACAGACGGGGTTTCACCCGTTGGCCAGGTTGTTCTCAAACTCCTGACCTC 

AAGTAGGCCGTTCGGCTCGGCCTCCCAAAGTGCTGGGACTACAGGCGTAAGCCACTGAGC 

CTGGCCAGACTGAACATTTTAACCATGTCTACTCCTCCCAAATGAACTTATAGATCTAAT 

ACACTCTGGGCATGAATCAGGTTTGAATTCAAGGATCACATTCCTTAAGACATTGACAAC 

CTAACTCAAATAGCAGCCTGCTCTTCCAGCCAGCGCTCATTCTCCTCTTCCCCTGTTTTA 

i i i i I I ATAGCATGTCTCATCATCTGAAATTACCTTTTTGTTCTTTCGTCTCCAGTTTCC 

CCGGTAGAATTTTGTAAATGCCATCAGGGCTGGTGTGCCCTTTTCTGAGTATTCCCAGTA 
CCTCGGCCGCTCTAGAACTA 

Sequence 868 

AGGTACAGCTGTGATTCCAGAACTAAAATCCCCATTTCCTTGCCAGTTGTCAGCTGANAG 

CCCGTCCCAGCTTCTAGGCACTGCTCACATTCTTTGCCTTGTGGTTCCCTTTCTATTTCT 

TTTGTTTTCAAAGCCTGCAATAGCTGGTCAGTTTCTTCTCATGCTTTGAATCTCTCCTGC 

CTCTTGTTCTGTTTTGCATCTCCCCAACAAAACCTTCTGCCTTCCTTTTGCACTrnTGT 
TTGTTTGTTTG I I I I I I I GAGA 

Sequence 869 
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AG GTACI I M H M II M | M j H 1 1 1 | M M H J I 11 I NGGNTTNGNTTNGNTTTNGTT 

NGTTTT1 M IGTAAANACAGAATCTCCCTNTGTTGCCCAGGCTGATTTCANACTCCTGGG 

CTCAGGGGCTCCTNCCACCTCAACCTNNTGAAAGTGCTGGGATTACAGGGGTGAGTCACT 

GTGCCTGGCAAGAANATGAANTTTTNATCCCTNAACTGACANCTNTGCCCTGGAGGCCCA 

NCATGGGC7TCAGGCTGCCTNTGCCCCTGCTNANAGGGCATTNTATTTAGTTCAGCTNTN 

ACCCATTGNGGATAAAGGAGATTCACTTCACATTTTATGTTTTACAATTTTAGTATAAGC 

NCNAATGTTTAAATGATAACAAGGGAAGGAGCTCCCCAAANACTCTGNACCATCTTGATT 

TCTGGGATTAANATTTTANTCTGGGTAAAACAAGGATTTNTATTATTCCCAAANCNTTNT 

TNATAATTNANAGCCCGGGGGGGGn I 1 I I I I AAAAAATACCCTNTTNTNCAAAAATTTN 

CNNTCCMGGGAAAATTTTTTT 

Sequence 870 

CCGGGCAGGTACCANCAAGCCCTGNGGGCCGAGAGTTCTGCCTGGTCCCACCTGATCACT 

CTGTCATGACCTCAGTTCATGTGAACCTTTCNCCCACCTTCGTCTCAAACCCAGCCCTGA 

GCCCCTCCATGCTGGGATGTGTGCAGGGACCCCGTGCAGGCTTCCCAGAGCAGGGGAGCA 

CTCAACTATGCAGAACATGTCAACTCCACAGATACACAGCAGGTCGTGACTGAAATGACT 

TCCCAAGGGAACTGTCCTTGTCATCCTTTTTAAAAACATTAAATGTCAGAAAGACGATAT 

GAGAGCTCAGACTGCGCCCACGTGGGGAACTGAGAGAGAGAGGCCCAGTACCT 

Sequence 871 

CCCTTNNGAGCGGGNGCCCGGANANNAANAAAANATNCTNTT7TTCTGATCAGAACCCTG 

GCTGMCGGTNTTAAMTTCAGGTTCCAAAATGAGCCAGTGGTAAANAAGAAATAGAGTG 

GAAATTTAACACATCCATTTATTATACACTGCAAAATGCAACAGGGACAGATGCCCCATG 

AATATTTCTCCTCA 

Sequence 872 

AGGTACGCGGGAGGGAAGAGTGGCCTTATGGCTACTCAAGCCACGTGTTTACCTAAAATG 

TGCTCCTAGTCGCATAGCTTCTCAGAATAGTCAAAGAAGAAAGGAAAAGAAAGCTGGATG 

TCCCGGGTTTATTTTTTCTGCATCTTTGCAAATATGAGACACAACTCTGTTCCAGGAGCT 

TTCGATGCGCTTCTACAGGTATTTCTTTTTTCTTTGTGTCAAGGATAAATGTGAAGATAG 

CTGTGGAATTGTCCGAACATTTTTGCCTTGAAATTAAGCAATGTTTTATGCCTAATTTCC 

CAAAGGACAAGTTATTTATGCCAGTATTAAAAGCAGCAACTTATATATTCTACGCAAAAC 

AGAAAACCAGAAAGATCTAAATTTTGACAAGTCCCTAGTTTGATGAATT 

Sequence 873 

TCGNGGTGGCGGCCCGCCCGGGCAGGTACNCGGGGGAAAACAAAGAGGAGATTAAATGTG 

TGGGTGGCATGGGAAGGGTGAGTTGGTATGGGTTGGGAAAGGCAGCAACAACTTCATTTC 

CCTGAGCAGTTTCCATTGTAATCAATGTATTGGGTCAGCACCAACCTCTACCTGAGGTAA 

AATATTTAACGCATGTTTCTGTCAGCCTTGGAATATGCTTTGCCCTCTTAGGTTTATATT 

TGTGAGTTTGGGGTCTGGGTTAGGCCCAAAACTAGTGGCCTAGCCTTAGGCCAACCTCTT 

CCATGAGGTGTG I I I I I I I II I I I I CTTTTGACTGAGTCTCCTNTGTTGCCCAGGCTGGA 

ATGCAATGGNGCAATCTCAGCTCACTGCAACCTCTGCCTACCGGGTTCAAGCGATTCTCC 

TGCCTCAGCCTCCTGAGTAGCTGGAATTACAGGCACGCGCCACCATGCCCAGCTAATTT 

Sequence 874 

CCNGGCNGGTNNTNTGCTTTCAATCATTTCGGCAATACTGNTNANCCTTNAACCAAATGG 
AACTTANANCCCATTNAAATTGACGTTTCAATTCAATCTAGTCAAATTGGGGAAAGACAA 
AATC AAG AG AATTAAAACMTTTTTTCTTAAAGTGTC ATTCTTN C ATAAAATATC N TTTA 
TTATrrATTTATTCATTTNTAAAACAGAGTCTTGCTCTGTCGCCCAGGCTG 
Sequence 875 

CCGGGCAGGTACTGTGCTTTCAATCATTTGGCAATACTGCTAAGCCTTTAACCAAATGGA 
ACTTAGAACCCATTAAAATTGACGTTTCAATTCAATCTAGTCAAATTGGGGAAAGACAAA 
ATC AAGAG AATTA AAAC A ATrTTTTCTTAAAGTGTC ATTCTTC C ATAAAATTTCTTTTAT 
TATTTATTTATTCA I I I \ I AAAACAGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTG 
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GTGCGATCTCAGCTCACTGCAACCTCTGCCTCCCAGGTTCAAGCTATTTTTGTGCCTGAG 

CCTCCTGAGTAG7TGGGATTACAGGGGCTATGATGTTGTCAACATGATGGAATAATCACG 

CAACTATGAAAAAAAACGAAGACTATGAGGAAACATTTAAAGGCATATCAGAACTAAAGG 

CTTCATGAGGTGGTGGCTCAAGCCTATAATCCCAACACTTTGGAAGGCCAACATGGAAGA 

TCACTTGAGGGCAGGAGTTCAAGACCAGCCTGTGCCACATACCAAATCTCTGGTTTCACA 
CCAAAAAAATAA 

Sequence 876 

CCGGGCAGGTACGCGGGATAGCATTCAGCTCCCAACAGMCCCTTCATTTTCTTTTTATG 

TAATTTTCTCTTTAAAGAACAACTGGTCAGGTCTAGTGTATGGAGAGCCCTTGTGAGGAC 

AGAGCCC TAATA TCTGGATGATGTTTCTGAATTTAGTTGACATTGTCCACTTGGATGTCT 

AGTGGTCATTTTAGGCAAACATGTCCAAACTGAACTCCTGTCTTCCCTGGACCCTGCCTC 

CTGTGGCCTGGCTGTCTTAGTCAGTGTCAGCCCTCCATGAGCTTAGGCAAAAAGTCCTGG 

GGTTATCTCTGACTTCTCTCTACTTTCCACAAACCATATCAAGCTCTCAGCAAGTCCTAT 

TGGCTCTGCCTTCAAGTGTAGATAGAATCTGCCCTCTGCTTATCCACTGCGTTGCTGTTA 

CTCGGCACAAGTTGTTACCTCCTACTGGATCACAAAGAACTTCCACATTGGACTCCTGCT 

TNCACCTGCCCCTTCAGTCTGTTTTCTGTGCAGCACCCANAGCTGGCCTGTTAAAGCA 
Sequence 877 

AGGTACTTATCTCATAGAGTAATTGTGAGAACGAATGCTTTGAGCACAGTGCCTGCCACA 

GAGTTAGAGCTATAAGCTTCTCTTTGGTCTATGTGGGAGTTTTCTTCCCAGGTGAATTAC 

CAGATC CTTAGGGCC AGAGAATTC CTGAATTCTCTGTGAGTATATAGTGTAGTAAC TTGC 

CTCTAGAAMTACCAACATATATTTATTAAATTATTAAAGATTAGATTGTTTTAGAGTAG 

GTCAGGGGAAATAAACAGAACTTGGGTCATGAGAATAAAACATTTCTTGATATTGATTGA 

TTAATGCACATTGTGGTGGATGAGTCTTTTTGTGMTGTTGTGAACATTTACTTAGATCA 

TTTCTAAAAGGGTGCTAGAGAATCTGCCGGTTCACCCTAATTGATGCCCCCTCTAAGTAG 

CAAAGCANAAGAACCAAAATTCAGTCTCAGCACTTTAAAGGTTTTGATTCCTAAACTTGG 

TGTTGGTCCCCTGCCTGCTTGAACATTGAAACACTTTCCAAAAAAATGA 

Sequence 878 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAACTTTGAATGGAGTAATGGAGCAGTAATGT 

CTGTTTTAATCGCCCTGGCCCCTCTTTGCCTTATGGAGGCACCAGGATGAAGGAGGTGAA 

AGGCTGACTTAAGAGTCAGGTAAACTACCTCCTGCTGTGTTTTAATACTAATGCATTGTG 

ACCTTGAACAAGAACTTGAACAAGAGTTCACATCTGCATAAGAGAGGTCTTATCTACTGG 

TCTCCAAAGGATCTGGGGATATGGAGCAGGATCCAGCCATCACCCTGTAAATTACCTGCC 

ATTCTTAAAAACCCACCCTTACACGTAGCTTGGGAATCAGCAATCTGATTTCATGCGGGA 

GATGCTGGAAACATCTGCTGAAATTAATTCTCCAGTAATTTGGAAAGATCAAAACTCAAA 

CCTTTTCTGACCCTAAGAAAGAGGGTCCTAAGCTCCCTACAGCTGGACTTCCAGATAAGA 

AGGATCTAGTAGTGAGGAAAAAAAACTCTAAAAATACAATTTGGGACTTGGGACCCANGT 

GGCAGTTCANGAATCANGGCTGGAACACAAAAAAGANGTGAAAGGGACACCAGGATCGAC 

GTTTGGAAACAAAACNGTGGCCACCCCCTGNTTCCACTGACAAGGNGATTNCNTGGCTTG 
GCTTGACATT 

Sequence 879 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGATGCCAGAATTGAGAGTCCATAGGA 

CCCGTTTAACGGATCCCA CATTCAGTAG ACGGTTTTTTGTATGCTTCGAGGGAAGGAAGG 

TATTTTCAAATGCAACTTC I I 1 I I I I I i I CTTGCTCTAGTTAGGGCTGATTTTACCACTG 

GGCAAATTGCATACATTTGGATCCATGCCACGCTAAATACTGTTTATTTTGGAGATCTGT 

CAGAGGCMGCACATGGAAGACAAGGGGATTCTGGTTTGAAAAGGCAATTCTCATCCAGA 

CAGATTCAGATGGAGAGGAGCGGTGTGGATTAAAATGCGCCGAAGTGACAGCGTCTCACC 

CTGTCACGTAGGCTGGAGTGCAGTGGTGAGATCACGGCTCACTGCAGCCTCGACTTCATG 

GGCTCAAGCGATTCTCCTGCCTCAGCCTCCTAAGTAGTCGGGATCATAGGCACTTGCCAC 
CACATCCGGCTAATTTnTTGTAGA 
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Sequence 880 

AGGTACCCATGAACATAGTTTTTACTCTTTTTAAAAGCTGTAGGCACTGGAAGGTCAGTG 

TTCAGCCATGACAGTGGAAGGTCAGTGTTCAGCCATGACAAGGCCCAATGTCATGGGCCC 

TCTGTCATCATCCTCTGTCCGTAGTGATCCCCAGACACTTTTGCATGGCTGTTTAGACAG 

AATTTTCCTGCAGCTCTTTTGGGAGTAGCTCTTGAMGmMCCCCTGACAGTTAATGA 

AGAACAGTGAGGCTTAGGAGGCAGGTAGGCCTTTGAGCAAATAACTCTGAGCAACTGGTT 

GCCATTGTCAGTTCTCCTATCAAGTATGCGTTTGATTTTTAAAAGGATTCCCCAAAGTTC 

TATTTAGGACTACAGAGTCTCTAAGAATGTATT 

Sequence 881 

AGGTACTTTGAGAGGCCAAAGCAGGAGGATTACTTGAGCCCAAGACTTTGAAACCAGCCT 

GGGCAACATAGTGAGACCCTGTCTCTACAAAAATTTAAAGATTAGCTGAGTGTGATGGTA 

TGTGCCTTTGGTCCCAGATACTCAGGAGGCTGAGGCGGGAGGATCACTTGAGTCCAGGAG 

GTCAAGACTGCAGTGAGCTGTGTTCGTGCCACTGTACTACAAATCAGAACCCCTCCCCTT 

CGAACTGCTGTTAAACATCTACTAGAGACTATCAGTGTTGTTATTTATAGAATCAGGAAA 

TGCTAGGCTTCTCCTTCTTCTAAGTCCTAGATGCTGAGACTTCAGAMTTCACTGGTTTT 

GTTGAGGTCAGGAGTCCAAGACCAGCCTGGCCAACATTGTGAAACCCCGACTCTACTAAA 

AATACACAAAATTAGCTGGGTNTGGTGGTGGGTGCCTNTAGTCCCACATACTCNGGAGGC 

TNAGGCANGAGAATCACTTGAACCCGGGANGTGGAGATTGCANTGGGCTGAANATCATGC 

CACTTGCACTCCANCCTNGGCAAGAANTGAGAACTCTNTCCCTTCACCCAAAAAAAAAGA 

AAAGGAAATT AATN GGTTTTGTTACCTTATTGMNTNGNGNGGTGAACCGANCAAATT^ 

CCAAGTTTCATTTTGCCCAGCCCTAMNAATGGGCTTTTTNCCCCC 

Sequence 882 

GTGATTGCTTTCCAAGGCCACCGAGGGTATCTGAGACTCCCAGGCCCTGGTTGGTCCTGC 

CTCTTCTCCTTCATAGCGTCCCAGTGTTGTCAGGAGGGCGCTGGTGGTAGCTTGGACCTT 

GTGTGCCAACGCTTCCTCAGGTCTGGGCCTAACAGCCTCCACTGCCTGGGCTCACTCAGG 

GAGCGCCTCATTATTTGGGCAGCCATGGATTCTATCCCAGCCCCACCATCAGTTCAGGGA 

CACAACCTGACTGAAGATGCCAGACATCCTGAGAGTTGGCAGAACACAGGAGGCTATTCT 

GAAGGAGATGCAGTATCACAGCCACAGATGGCACTAGAGGAGGTGTCAGTGTCAGATCCA 

CTGGCAAGCAACCAAGGACAGTCACTCCCAGGATCCTCAAGGGAGCACATGGCACAGTGG 

GAAGTGAGAAGCCAGACCCATGTTCCAAACAGAGAACCTGTTCAGGGCACTGGCTTTCTT 

TTGNCACCCGGAAACGTNTGGACAAGAAACGTTCANTGCCCTGTNGCCACTGTAGAACTG 

GAAAAAAAAAAG 

Sequence 883 

CCGGGCAGGTACGCGGGTAGGGATGCAGAACTGGCCACAGTCAGGGCTGGCCAACAAGGG 

ATGGGATGGCAAGGGAACCAGTGGGAGACTGTTTACACCTTTGACTTCCTCTTCACAGGT 

CAAGGCAGGACTGNAGCATTAGGTCTCAGAGATGCAATGAACAGGACAAATACACTTCCC 

CTTCATCTGGATCTGAGGCTCTGGGCGAGTCAGTACCT 

Sequence 884 

CCGGNCAGGTACAGACAGGGTTTCACCATGTGGGCCAGGCTGGTCAACTCCTGACTTCAA 

CTGATCCCACCTGCCTGGGCCTCCCAAAGTGCTGGAATTACAGGCGTGAGCCACCACGTC 

TGGCTGGAAAATGTTTGTGGTTTATGCCACACTATTTATCTGGGCTTTTGTAGGTTCGTG 

ATCAAGCAGCCAATCATCTGCCTGGGCAACCTCCTACTTAGACCACACCTCCCGCGTACC 
T 

Sequence 885 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 886 

CGAGAAGGTCAAGCACCGGCTTCGTCCAGTCCGATTTCAAGGGCGACGTGCTGGGACGCG 
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^«Z^ CGCCTGMCGCCGGCGTGCGCTGGTCGCA GACGAAGCTCGACATCGACAACTTCA 
Sequence 887 

CGAGGTAC ( I I I I I I I I I I I I I I I I | | | | | | GATGCTAGTTGTGATTTCCTGTAATTCTC 

ATTTGCAAGGGGGTAAAGGGGGATCAATGAGATGATACAGAAAGGAGGAGCTGACAATTC 
TTTTGTCCATCACATCATGACAAATACGTNTCMCAGTCCTGCACTGM 

STTCTCCTTACACGATGAGTTTTC CCCTTCTAGGCAATCCTCCCCAGCTGCAT 

G f r c ; GGc T GccAAA ^ 

AAGTATNATTTGNCATGTCTTAAAACTTTTTAAA «™wu3 
Sequence 888 

G J G ^ GG JIT CC ^^ 

GAAGGAGA Z GCAGTATCACANCCACAGATGGCACTAGAGG ^^ 

AO I GGCAAGC 
Sequence 889 

c ^; g ^ tggcggccgaggtacgcgggactctgtaaatac 

TTTGTGATGCCAAATATTC ATTTGACTAC ATTN N NTTC ATTTCAAAATTTAAAATTTTAC 
TTTrCAAACAAACAAAAGTTGTGATTCAGAGTAATTGGAGGGGAAAAGAAAGCGATCTTT 

I??™ GTCAGGAGATTGGATACCA TCCTGGCTAACATGGTGAAACCTTGTGTCTACTA 

AAAATACAAAAAAATTAGCTGGGCATGGTGGCAGGCGCCTGTAGTCCCAGCTAGTCGGGA 
GGCTGAGG 

Sequence 890 

CCGGGCAGGTACGTTCCAGGAATGGTAGATAGAGCATACATATAGGGCAGGCTGAGAGGC 
Z GGA ^ GGCTGCC ^ GAATGCCAG GCTMGGMTTTGGACTTCCTAAGGAAGCATGGAGT 
GAATTTTAGGCTGGGAAGGGATAGGATCAGATCTATGTTAATATTGCTCACCCCAATTGC 
^ C ^ AG ^ GGATGGGATAGMGGGAATGTATGGA ^ CAGGGA G 

GATTCATGGTAGCAGGGTCTCGAGCAGGGGCTTGCTTGAGCAAGCAGGTCCCGTGGAGCA 

GG J^ GTC ^ AACTCTCG "^ GTC1TTCT G AA TGGGTATCAAAGAGGGGCTTCCTGCANCC 

I^?^I GACCGGGGCTGCCmCmG1TTGTGGGGAGGGA CGCATGCCTGGCAACTTCAT 

AGAGGTCCAGGCCATGGGCATTGCCAGAGGCTGAACANACCATGCTGAAGAAAG 
Sequence 891 

CCGGGCAGGTACTTTTTCTTCTTCTTC I I I I 1 1 I I I I I I I A AAGAGAACATCTGAATGAT 

GTATTCGGAAAAAAACATTTCCCAAGAATATGGANAAAGGGAATTGAACATAATTCCCAG 

TGTAAGAGTGGAATGAGCATGTGGCCCTGTGGGCCTAGTCCCAGTT 
Sequence 892 

CCG r CG i 3 J GGCGGCCGAGGTACGCGGG AGTTACTTMGAACCA 

A ™ GGAG £ rCAAC ^ CTGGCTTC CTACTTACCAAGAGGCAGTAAGTTATAACCTCTGG 

JJaSZ*^ atccactaaccactca ctaggtcacaaagttgatggctctggcttgacccag 

TGAATGAGAAGTGTCCTAGATG' I I I I I | H | | | | | i | | I CGCAAATCTTCATAGGCACAG 
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TTGCCTGTTTAATTTTTTCTTCAGCCCTTCCCTTAGAGGAGGGAGCCTTMGTTACCTTT 

CAAGGTTGGGGGTTAGGAATCTATCATAACTAGTCTAGAGATTTCTCACCAAGGGAAATT 

TTCCTTATCTAAAAGAGGAACTTCAGGTCTCAACCCTGCCAGTCACACCCAATTAATGTC 

CTTCACAAAAATAAACAGCATATGTTCCCTTTCAATTTGAGTTCAAGNGGCCCACACAAA 

AAATTACCCTTTAA 1 MINI 

Sequence 893 

CCGGGCAGGTACCCATGAACATAGTTTTTACTCTTTTTAAAAGCTGTAGGCACTGGAAGG 

TCAGTGTTCAGCCATGACAGTGGAAGGTCAGTGTTCAGCCATGACAAGGCCCAATGTCAT 

GGGCCCTCTGTCATCATCCTCTGTCCGTAGTGATCCCCAGACACTTTTGCATGGCTGTTT 

AGACAGAATTTTCCTGCAGCTCTTTTGGGAGTAGCTC7TGAAAG7TTAACCCCTGACAGT 

TGATGAAGAACAGTGAGGCTTAGGAGGCAGGTAGGCCT7TGAGCAAATAACTCTGAGCAA 

CTGGTTGCCATTGTC AGTTCCCCTACCAAGTATGCGTTTG ATTTTTAAAAG GATTC CCC A 

AAGTTCTATTTAGGACTACAGAGTCTCTAAGAATGTATTTCTGTTGTCAAGGTCAGGAGC 

AAGATCATCTCTGTGGACCTGTCGGTTTGAAGGAGTTGCAAGATATTCTCATCATTCATT 

GATTTATATACTCAAAAMCATATATTTGCATGTGCCAGGAAAAGGTAAGGAAAGGG 

Sequence 894 

GGTGTGGTGGCTTATGGAGTCCCTAACGTTTGCACAAAATGGTTGCTACATTCCACACTC 

CCATGTGCCCTGAGCAACCTCCAGGGAAACTTCAGATACCGTGGGGCTTGGCCTGCCAGC 

TGAGCTGACCCAGCGATCCTTTCTATCACAGCTTTGATCTTCACTGGGTTGGGAGCCTGT 

CTGAATCTAAACATCTGCCTCTCTACTGCCCACCCCTTCGGCATTTCCTAGCTCTCCCTG 

AAAGGTGCAAACATGCATGCAGTTACTCAAACAGAATATTTATCCTTAACTCCCTGTCCC 

TCACAATTTCTTTTAAAACAATCCACTATTGTCTACCCCCGCCCCCGCATCACCCTTTCA 

TCCAGGTGCCCATTCCCTTCTCTCTAGTCCTGCAATACCTAAGCATCGATCTGCCTTTAG 

CACATCCATCCCCGCCTTATTCTCCACGCAGGAGCTGAGTGACCTGCACAAAATCCCCTG 

CTAAATACGGGACCT 

Sequence 895 

CCGGGCAGGTACACCATGTCCAGTGATTCATGTCTGGACAGAGGCTAAGTCTTGGGTCAC 

TCGCTTCACAAAGAGGCATCACTGTTCACTCAGGGGCTCCTACCTGAATCCCAGAGCAGA 

CGGATTCAAATAAATGCACCAACCTCTATAGACCTGCCTTGCAACCTCAAACCTCTTGGG 

GTCAAAAGCAAGAAGAAATGTTTATGATCCTAGATTGTAGGGGAAATAATGGTGATTTTT 

AAAAACACAATTCCCCGCTGGGCGCGGTGGCTCACACCTGTAATCCCAACACTTTGGGAG 

GCTGAGGCAAGGCAGANCATGAGTTCAGGAGATCAAGACCATCCT 

Sequence 896 

AGGTACTGGGCCTCTCTCTCTCAGTTCCCCACGTGGGCGCAGTCTGAGCTCTCATATCGT 

CTTTCTGACATTTAATGTTTTTAAAAAGGATGACAAGGACAGTTCCCTTGGGAAGTCATT 

TCAGTCACGACCTGCTGTGTATCTGTGGAGTTGACATGTTCTGCATAGTTGAGTGCTCCC 

CTGCTCTGGGAAGCCTGCACGGGGTCCCTGCACACATCCCAGCATGGAGGGGCTCAGGGC 

TGGGTTTGAGACGAAGGTGGGCGAAAGGTTCACATGAACTGAGGTCATGACAGAGTGATC 

AGGTGGGACCAGGCAGAACTCTCGGCCCCCAGGGCTTGCTGGTACCTGCCCG 

Sequence 897 

ACCGCGGTGGCGGCCGNCCGGNCAGGTACCAGGGCACTCCAAATATGTAATTTAACACAT 

TCATTAATMCTATGATAAATAGATATTTTATTCCAAGAACAATATCATTAGAGAGTTAG 

ATTGGAGGGCAAGGGGATTMGTATCTTMGTTTATTGTTTCCCGAGTGACCATTTTTAA 

AAGCCTCCTTCCAGGAACTGTGGAATGCCTACCAGGCTCCCTGACTGCTGTCTGGTGAGG 

GTCCCTACTGGTGCTGTGGACAGCACAATACACAATGGGGCACAAGAAAGCCACGCAGCC 

TCCCCCTCCTAGACACATGCTGTGCACCCCCTTGGCTCTGCTTGGAAGGTTCAGCTCTGC 

AAATGGAAGGTTGCAAACATCAGACACCTTCTTTCTGTTTCTGTCTCTCTCCCCTTCTTC 

TCCCCACAGTGTCCAGCCTTGGCCCCTGACTTTTCCTGGGGTTGGGGAGGCCTGTAGGTT 

CAGAAGCTGAGCAGCTGGGTGAAGACCTCCCTTACTCAATGTGCTCTGGCCATAGGGAGC 
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AGACAAGCAAGGTGAGTTTGGGTAGCCCTAGACACATGGAAAAAGAAACAGCTTTCTTAA 
CCTTCACCCCTTCATAAAATGACTTCATTAAAATGTCGCTNGGTTAATAAGGGAAAAANG 
GTAGCACTTCCCTTGNTTANGGNCAAACCCCCCTNAAAAAGGGANTGGTTTTTTTN 
Sequence 898 

AG GTAC ACTGGC TGTG CTGTTTC TAGC ACC ATAAG G C AGTC ATC ATTATTG AG C C AGGGG 

CTGTG CAAA ATTGGGGTGATTGCAGAGCCAAAAAGCGGGCTGGAGGGTGGTTTCTGGAAT 

GCCTGTTTTACTATTCTTCCTGTTCCTTTGCTCTTTCTGCCAACTTCAGAATCCTTCCTT 

TCCTTAAAACATTTAGTGATATCCCTGAATTTATTTAATTGCTACGTGAGTGATTTTTGC 

CCCACAGCACTGGAAGATAATTGGAATGCATTCTAGCTTATGTGCTCTGGATAATATGCA 

ATTTACCAGCCAGCTGGAGAGTAGGTATGCCTTTGTCTCACAATGTCTTGTAATTTTTAA 

AAAGTTTCTCCTATTNGCTGGGAGTCTCTTGNACCTTTCTTACGGCCATCATATATCCTA 

CTAATGGGGGTCCCTTATNGNCTGAAATGTGTANMCACTTCTTTTTGGTAATACTGGAA 

NAAGCTTNCTGGGCCNAATNAGGGGAAGMNACCCGNCTTTTGGTTTTTNGTGAATGGNC 

NCTTGTGCNAGTATGGGCNCCCCAAATTAAAACCCCTACCGGGAANTTGNCCCCCCCCCC 
CCCCCCCCCCC 

Sequence 899 

CCGGGCAGGTACCTACTCAGTTCTGCTGATGGTAGCGTAAACGCCACTATATACATAAAG 

AMTGGGTGTGGTGGTGTCMTAAMCTTTATTTACAGACAGGCAGTGTCCTGACTTGGG 

TCCCATTGGCTGTCGTTTGCTGACCCCTGCAACTTGACGCTGCAAAGTGAACAGCCCAGC 

ACCTCCCAGGCTGCGGTTAGGACCTCAGCCCGTCACCGTCATCTCCTGGGTCTCAGTTTT 

CTCATCTGTAAAGTGGGGCTGAAGCCACCTGCCTCATGTTTCTCCTGGGATTTTTGTAAA 

ATAGTGCACAAAAGTAGACAGAAAGCTCCAAAGGCAGTTTGCAAATAGGAAGACAGAATC 

AAGTCTATCTGTGAAAAACATTTATTNTGAGAATCTAAAATCTGGACAAAAGTACCT 

Sequence 900 

AGGTACCAGGTTGGGCTAAGAGATGAAAAACCAGGCTCAAGTTGGGCCAGAGATGGACAA 

TGAGCCACTTCC ACCAG AATTCCCAGCCACATATGCCCACCTCTGGGTCAGAAGAGCCAA 

ACGT ATTTATCGCTTTTGCTCAGGGCCAACATAGACACTTGATTCTCTCTCAGGGCTGTG 

TTTTCCCCCAAATGTATCAATACATCCCTATTTTCTTCCCCCTTTGTTTT 

GTGACCTGCTACCCCACACCCCAGCACACTGATGGTGTTCCCTCCACCACACCGAGACCC 

CGCCACAAGCTCTCAGCCCTNGTAGAGCACAGGCCCCACCCTGAAAGTTGGCAGAGCACC 

GGGCTATCAAGCTGTGTCATGTGGAGCCAGCAAGCCCGGCACAACCGTG 

Sequence 901 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACACGCATGCCAGCATGGGAT 

GGGAGGCAAGGTCTACTCACACATGCACCAGTAAAGCAATGTGTGTGTGTCAAGGGGGGA 

GGGGAGGTAGTGGTCCATGCCTGAGTAAGTGCCAGCAAAGCAGCAGTGGGGAGGCTGTAG 
TGGGTGANGTTGTAG 

Sequence 902 

GTCTCGAGCGCGCGCGCCATCGGCAGCATCGGCGTGATGCCGATGCCGCCCGCGATCAGC 

ACGGTGCGGGTGCTCCCCGGGTGCAGCGTGAATCCGTTCAGCGGCAGGCTGACCTCGAGC 

GCCATGCCCGGCTCGAATTCGTNATGCAGCACGCGCGAGCCGCCACGCCCGCCTTCCTCG 

CGCTGGACCGNGATGCGCAGCATTCCGCCCTCCTCGCCCACCACGGAATAGGCGTTGCGG 

TGGCGGGCGCCGGCGCGCGAAGTGAATGCGAGTTCGACGT 

Sequence 903 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCTGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 904 

AGGTACCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCAGGAGGCGGAGCT 
TGCAGTGAGCTGAGATCATGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCATC 
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TC AAAAAAAAAAACC AAAAAAAACAGTGCTTGTTGTTCTTTTGTTGTTATTGTTAAAG G A 

TCATTATCTCATTATCAACAACATTGGCAGGTAATTGAGCAACTGTTTGTCTCCTTGTCA 

ATGTGGAGATAGAAAATTGCCTACGAGGTTGGACAAGGTGCTGGAGCAGCGTCAAGGAAG 

AGAGATCGAACAAGAAGACTGCTTTGCCCTTGGACAGCTTTGTTTCTGAAGCACTTTCTC 

ATATTCTCCCATTTAATCATCCCCTCCCTGCCAACTT 

Sequence 905 

AGGTACAAAACTCAAACAAAAGCATATTCTGTGAAAAGCAAGTGTTCTCTACCCCCTGCC 

TTCCCATTTCTCTCCCTGGAGGCAGCCGATGATCCTGGTCCTTCTGTGTCCTGCTAGGAA 

TGTTCTAGCATAAAGCTTTTAGATGCTGTCCGGGACACTAACGTCAGGATTCTGCTTGCT 

GAGAAGGNGTTAGGAGCTGGCTGTCTTTGAGACAGAAMCACTGAGTATGTTTTGAGTTC 

ATGGGAGATGAANATTTGTCCCTGGNGANAATTTTNNTTGGNNTTTGCTTTTGGAGAAGC 

CCNNACCANATCAGAAATTAAACNTCTMCACCANGGGAAACAAAAAAGATTTTGGTTTG 

CCCCGGGGGTGCCTTGTTTTGGGNTCNGTGNTTCTTTCCCTNCATTTTANGGCCNTTGTT 

TTCTTG AATTGGG N G 

Sequence 906 

AGGTACTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGACCAGCCT 

GGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCATGGTGGC 

TCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCGCTTGAACCCAGGA 

GGTGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCATGGGCCGACTAGAGC 

AAGNACTCCATTCTCAANAATAAAGNAAAGNAAAGAAAACAAAGAANAAGNAAAAGCTTA 

TATTTGAACTTCTCTTAATAAAAAGNAANGAMGNAGCAAAGCCTTGGATTGCCACACCA 

AAATTCTAAAAATTTTGGGGCMGTTCGATCAMTTTAAM . 

GCAATCACCAAAAAATTAAATTTCTTTTTACNTTCCCCCCCCCAANAAAAATTC 

AAAAAAAGTTTNCAAAANTTAGGCNAAACCTTTTTTTCCCT^ 

AAAATTGGTTAATTCCAACCCCAAAATTTACNATTTGTTGGTNCCCCCCCAAAACCTTTC 
TT 

Sequence 907 

CCGGGCAGGTACTACAAGATCAAGA I I I I I I I CCTAAAGAGCCATTTGTCTTATTTTAGC 

TTCAAGCCAAGCCAGGGCATCTGAGAAATACCAAGCCTCCGTTGTGATGTGTCGCCATGA 

AAATGTTGGCTGCCCTCTGGATGCAAGTCTGCTTGTGCTGTGCTGTGGCTCAGAGTTAAA 

TTTAGATAAAAATCAGTTAGGAGCTAAGAATATTCCCAGCTTTCCTGACAGGTTGTATCC 

GTCATTATGGGAGGAAAAACAAGGAACTGGCTGCCTGGCGACAGGGAGCGGGCCAGGCTG 

AGTGTGAGGTCAGGCCTCGGCTGGAATCTCACGGACTTGAAAGGACAGAGACGTTTCCTG 

AGATGTCTGCGGCANAATGTGGGAGGCTTNAGCCTTGAGGAAAGTTCAAGTTTGGGAGAG 

AAAGGTTGCCANACCCTTTGAGGTGACCCACACTGGCTTTATAACTTCTAAGAAGGNTCT 

TTTAGGGGAAACCTGCTTTAGMTGAAAACCCCTGAAAACGTCCCTTTTGCT 

Sequence 908 

AGGTACAGTTGTTCAGGGTGAAGATTTTTACAGAATTCAGTAAGGATGGATTATAATATG 

CTGAACATTTGGTTGCTTTTGTAATAATAGAGGTGGACAGCTGAGACAAATCTAGACAAA 

ATCTTACACTGACTGTTTTGTAACGACATACGMTGACCCTGMCTTTTTAAGTAACTT^ 

AGAAATAGGCCGGGCACGGTGCCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGC 

GGGTAGATCACAAGGTCAGGAGTTCGAGACCAGCCTGGCCAACGTGGTGAAACCCCATCT 

CTACTAAAATTACAAAAATTGGCCGGGCGTGGTGCTGCGTGCCTATAATCCCAGCTACTT 

GGGAGGCTGAGGCAGGAGGATTGCTTAAACCTGGGAGGCAAAGGTTCCAGTGAGTTGANA 

TCATGCCACTGCACAACAGCCTAGGTGATAGAGCGAGATCGCGTCTTTGGAAAAAAGGAA 

AGAAAAGGAAANGTTAGCTTGACCACTCTTTNNGAAAATAATACTNGAAAAGGAAAAA 

Sequence 909 

CCGGGCAGGTATCCCTGTTGTCTGTGCTATACTGCAGGGGGATTGGTCAACAGGATGCAT 
TCCTTTCTGTCACAGTTCATTATACTGGTCAGAGTTCTGAGGTCACCAGCTAACCTGAGC 
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ATAATCAG7TCCTTGGAAGACATTGGGTGGAGTGACCTCATAGACTCATAGGAAACCTGA 
AAAGAAGCCAGGGGAGAAACAATTGTTTTGAAGGAGTAGGGAGATTTCCCAAAGCCCAGC 
TGGAATGCTGTTGCCAGGAGGAGGMCCAGTTMGGGGCAGGGACCAGCCTTTTTGGAAA 
TGC TTTGTAAGAATG CCCTAGGGAAGAGGAA 
Sequence 910 

CCGCGGTGGCGGCCGAGGTACGCGGGTCCAGAAGTAATTTATTGGCCTGACACCAGGCTG 

TGTATAAATTCACAAACGTATATAACCTACTCCCAGAGGACCATGTGCATGTGGGAGGGA 

TTGGACAGATTGTTCTATATTTCCTGGGTGACAGGCTGTGGTGGGAAAGGGCTGGGGACC 

CTGAGTGTTTGGCTTGTAGCCTACACCTAACACCAGCTTTTCTTGGTCAGTGCTGGTGAT 

GCCAAGGCAACGAGCGAGCTTGTGGTTCAGCGCTTTGCATGATGTGGGGCTCCCTTAAAG 

AACCAAAGTGTGGGGAGAGTGCCTTCAGGGCAGGGCTGCTGGGAAGTGGCAGGGCGAGGA 

GGCTTCCTTCTGTCCCCATGATTCCCTCACATAGGGTAGTGCTAAGCCGAGGTCCAGAGA 

AGGTAAATGGGAGAAGTCTCATAAAGAGTTGAACTTGTAGGAATCAGGGGTGCCCGTTCA 

GCATGGCAATTGAATCCTTGATCCTCAGGGAGTTAGCTAGATCCGACTCACACCTGCTGG 

ACCGTGGAGAGCCAATGTCCCTCCC 

Sequence 911 

AGGTACAAAGATCTGTCACTTGGTTTCAGATCATAGCTGTATGAGTGAACCTTCAACCCA 

TTAGTGACCTGAAGTAGTCTCATGAGTCCCAGATAAATGTGCACCTACAGTGCTGCCCCT 

CAGTTAAACCCACTCCAGGGCCTTGTTTTTCCCTGTCTTTACCCAGTCCACATGGACCTT 

GCACCTTGGTTTGAGTTMTCTCACTCTCACAGCCTTTATTTCTTTCACATTCAACCCTC 

ATTTGAAGTCACAAACAGGAGCTTCTAACCTCCCAAACTG I M I NI I A ATCTCCGCCTG 

TGACCTCAGGAGTTGCAGCTCTAGGCC7TCTTTTCAGACTGTTGCCTCTCCTTGCATAAT 

GCTG AC ATC ACGTTGTAGTAGTGGTAAACCTGTGCCTG GATTG C ATTTC AATTGTGCTAG 

CCCTAATAGACTAGGAATGCCAATTTCAGAGAGGAGGGACTAGAGCCAAGATTGAGAAAG 
AAACAGGA 

Sequence 912 

AGGTACCCGCCCGGCGCAGCCCCGATGCCTTCCTGTGAATGGGACAGATTGCCATATGGA 

TGAAGACACTGAACAAGCTGTAAAACTCTTGGCGACATGTGCCCGGGGGGATTGTCCCAT 

ATTAACACCACCTGGGCTGTGGCAATTGTGCTCTGTCTGTGCAGTGGCACTGAGGAGCCC 

AACAGGATGTTATTTTAGAGTACCTGCCCG 

Sequence 913 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 914 

GGACCTGTGGGTCAACTAAACCTCTTATCCTGNTAAATTACCCAGGGTTTTCCACTGGCT 

ATGACATACCCAAATGGAAATATCAAGTAAGCAGTTGGATATACAGGTCTTGAGTTCATG 

AGGGAATGAAACCAAANCTGGAGGTAGTAAATCTGGGAACCAACCACAGATAGATGGGAT 

CAACACCAAGCTAATACATACAAGGATCTGCTATTTAAATACCANTTATGAACCATGCTA 

ATTCTATGATCTTG AGAAAA TAACGACCTCACTCCTCCATCTTCTCATCTGAAAAATGGG 

AATGAAAGTAATGC l Mil I AGGGTGGNTGTGAGGATTNCCGCGTNCC 

Sequence 915 

CCGGGCAGGTACCTCCTTTTTAACCTCTAGCTGATCTTGGAATTTCTTTTGTAGTGTATA 

ATTGGGAAACAATAGGGATCAATGCTCTTTCCTGATCTCAAAGAAGCCCACTCTCTAGAG 

AGAAAAGCAAGACCTATMTATTGMGATTTGAGAAAAAGTTAAATTGTGATAAAACATG 

CTGATTTCTCACACTGACTTAGGAAAAGCACCTAAGCCTCAAAAACGAGAATTGTCTTAG 

GAGTTAATGCTGTGGCTATTCTTGTCCTGGCATTTGGGTGAATGCATGAATTCTTTATTT 

CAGTTTATAGAATAAGATAAACTATCGAGAACTTCAGGTTGCTGCATAATTTTGACAGCC 
ATTCCCTGTACCT 
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Sequence 916 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCATGAACATAGTTTTTACTCTTTTTAAAAGC 
TGTAGGCACTGGAAGGTCAGTGTTCAGCCATGACAGTGGAAGGTCAGTGTTCAGCCATGA 
CAAGGCCCAATGTCATGGGCCCTCTGTCATCATCCTCTGTCCGTAGTGATCCCCAGACAC 
TTTTGCATGGCTGTTTAGACAGMTTTTCCT 

TTTAACCCCTGACAGTTGATGAAGAACAGTGAGGCTTAGGAGGCAGATAGGCCTTTGAGC 
AAATAACTCTGAGCAACTGGTTGCCATTGTCAG7TCCCCTACCMGTATGCGTTTGATTT 
TTAAAAGGATTCCCCAAAGTTCTAT7TAGGACTACAGAGTCTCTAAGAATGTATTTCTGT 
TGTCAAGGTCAGGAGCAAGATCATCTCTGTGGACCTGTCGGTTTGAAGGAGTTGCAAGAT 

ATTCTCATCATTCATTGATTTATATACTCAAAAACCATATATTTGCATGTGCCAGGAAAA 

GGTAAGGAA 

Sequence 917 

CCGGGCAGGTACGCGGGTGTCTTCTTCCTTTTTTCTTTTTCATTCTTCTTACACTTG 

AAAATCCAAAAATTATTCTTGGCCCACAGACCATATTTGCCCACCCCTGCTTTAAGGTGA 

TTC TTTCA TGTTTGATACATCCAAAAAATGAGAGCTGATACCTGATTCTCAAACAAATTA 

ATATTTTTGCCTAAATGMTTTCATGG7TTGCCATGGTGGCCCCCATCTGTAATCCCAGC 

ACTTTGGGAGGCCAAGTTGGGGGCAAATCCCCTGAGGTCAGGAGTTCGAGACCAGCCTGG 

ACTACATGGTGAAACCCCATCTCTACTAAAAGAACAAAAATTAGCCAGGTGTGGTGGCAG 

ATGCCTGTAATCCCAGCTACTTGGGAGGCAGAGGCAGGAGAATTGCTTGAACCCGGGAGG 

TGGAAGTTGCAGTGAGCTGTACCT 

Sequence 918 

CTAATTTAATCTCCTTGCCTAATGTTCTTTAGTCCTCCAACAGGTCTGCCTACTTAAACA 

ATTTCTAGTCACAGGATGNGGCTGCCACAGTTTTATCTCCATCTGGCANAACATAACCAA 

CCCGCTTGAGCTCTNAGCTCCTGTTTTATAGCAACTAAAGTTTCCTTGTTCACCCGNTCC 

CCCACGCCACCTCCATGACGTGCATTTCTCGTAAAACTACAGATCAGTAAACTANCTCAG 

GGAACTTGCAGTAATGTTTGACAAAAAAAAAAAAATCAAAAANCAAAAANCTAGTACCCC 

TGNGATGTGTANCAAAATTCATCANACTAGCTTGTNTGTTNTGGCACTGATTGCCAGGCA 

AGCTTCCATTGTGCCTTNATAGCTCATACACNGNGCNCTTTCCCTGTATTNAGGCCCCTT 

GTNGGGTTNTCTGGATCCATGGGCAAAAAACTTGANTAGGGGGNGGGAAANA 

Sequence 919 

ANGTACGCGGGNNAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCCATCTCGGC 

TCACTGCAAGCTCTGCCTCCGGTGTTCAAGCGATTCTCCTGCCTCAGCCTTCTAAGTAGC 

TAGGATTACAGGCGCGCGTCACCACGCTTGGCTAATTTTTGTATTTTTAGTANAGCCGAG 

GTTTCACTAAGTTGGTCAAGGATGGTCTCAAACTTCTGACCTCGTGATCCCGTCCGCCTT 

CGGCCTCCCAAAGTGCCCGGGATTTACAGGCGTTGAAGCCACCGCGCCCGGCCGCTCTAG 

GMCTAGTGGGATCCCCCGGGGCCTGCAAGGNAATTTNNNAATATTCAAGNCTTAATCGG 

ATTACCCGGNCGAACCCTTCGTANGGGGGGGGGGCCCCCGGGTACCCCCAA 

Sequence 920 

CCGGGCAGGTAC I I I I I I I I I I ! I I I I I 1 I i II I I I ) I GCTTGGAAAATTTTCCTCCATC • 

CCTTTATTTTGAGCCTATGTATGTCTTTGCATGTGANATGGGTCTTGTCTTTTTAT^ 

CTTGCCATTTTGTGTCTTTTAATTGGGGCATTCNACCCATTTACATTTAAGGTAATATC 

TTGAATTCMCCCTGTCATCATMTGCTGGCTGGTTATTTTGCANACTTGTCAATGTAGA 

GAGTGAAAATAAGTGAAGGCAGAATCCTGGGAGGTAAGCCAGCCTGAAAATATTGATGAC 

ACTGGATGGCCTTAAGTTTCCATTTTGACTGGCATGTAATCCAGCCAAAGA 

Sequence 921 

TCTGAGCTTCCCTACCTCTTGATTTGTTTGCAGTGCTGCTAGTCCACCCTAATCTCTGCC 
TGCTCAAAACTTCTCTCAATGCAATTCAGAGAGATAGAGTTTAGCACTACAGTGACTTAG 
ATGTATGTTGAATTTGATCTTTAMTAAATCATATGCATCTTAAGGGCACAGATGAATTT 
GATGATTCTTTTGTGACATCTTGCAGTTGATCCTTAACAAATAATTATTAGTAGATAACA 
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ATCAACAACCAAT CCAGTG TTTATTCTTACCTGCCATGTTAATCTCTCCATTTGAATCTA 
CTTGAATTATTCTC r I I I I GTTTCAGTGTTGGAGTGTAATTGTTCCAGCATAA . 
Sequence 922 

NGGCGCTTTTCTTCATAAGCTTCACCGCTGGTANGTATCCTCAAGTTTCCGGTNGTTAGA 

GTTCGGTTTCCGCTTCCCAAAGCCTGGGGCTTGTGGTGCCACCGAAACCCCCCCCCGTTT 

CAAGCCCCGGACCCGCTTGCGCCCTTATTCNCGGGTAAACNTATCCGTCCTTNGAGTTCC 

CAACCCCGGTTAAGGACACGGACCTTATNCGCCCACTGGCAGCCANCCCACCTGGTTAAC 

CAGGGATTTAGCAAGANGCCGAGGGTATGGTAGGCCGGTGCTTACAAGAAGTTTCTTTGA 

ANGTTGGGTGGGGCCCTAAACCTACCGGGCTTNCACCTTAGAAAGNAACCAAGNATTTTG 

GGTTATTTCTGCCGNCTCCTGGNTTGAAMGCCCCNGTTTACCCTTTTCNGGGAAAAAAA 

AGAAGTTTGGGTAAGCCTTCTTTTGATTCCCGGGCCAAAACCAAAACCACCCCGGCTTNG 

GTAAGCCGGGGGGGN ITTTTTTTTG 

Sequence 923 

ATCACCCGTTGGCCAAGGAGTCGCGTGTCAGTCTCAAGGCGTTATCCGAACATACGCTGG 

TCGTTACCGATCAGGTCCACAGTTGGCAGCATGTGTTGGACCTGTTCAGCTTTTACGATC 

TTGCACCCGCCGTTGTGCACCGTGTGCGCACGTTCGAGTTGCAGCGTAGCCTCGTCGCCA 

ATGGGTTTGGCGTGGCGCTGATTTATACC 

Sequence 924 

AGGTACAAATGTTTTCTGGACAATCTGTGCTGTGATTTTGGAAATTGTGTGTGAAAGAGG 

AGAGTAGCTGGGCGTGGTGGCTCGCGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGAGG 

GTGGATCATGAGGTCAGTTCAAGGCCAGCCTGGCCAAGATGGTGAAACCCTGTCTCTACT 

AAAAATAAAAAAAATAGCCAGGCATGGTGGTGGGTGCCTGTAATCCCAGCTACTCGGGAG 

GCTAAGGCAGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGGAGTGAGTTGAGATCACGC 

CACTGCACTTCAGCCTGGGCAACAGAGTGAGACTCTGTCTCAAAAATAAAAACAAACAAA 

CAAAA MGAAG AGAGTGAAAGAATTTGAGTCCTTGCACAGACAATAACTGTTTTTTATGT 

TTAAACll l I I AGTTTCTTATGTTAAACTTTTTATGT7TAAACTTTC AAAAGGTTGAATA 

MTTTCACCTTATGCTTGCATTATMTAGCAAAGTTCCTTGATATGCAAAAGCCACAAAA 

GTAAl l i l l AGCTTAAGTACCTGCCCGGGCGGGCGCTCTAGACTAGTGGATCCCCCGGCT 

GCAGGAATTCCATATTCAAGCCTTATCGATACCCGTCCGANCTCGANGGGGGG 

Sequence 925 

TGCGCCTCGGTCGTTCGGCNTGCAGGCAGAGCCGGTATCAGCTCACTCAAAGGGCGGGTA 

AATACGGGTTTATTCCACAAGANATCANGGGGGGATAACCGCAGGNAANAGNAACATGTG 

GAGCCAA AAAANGG CCANGCCAAAAGGCCAGNNAAACCCGTAAAAAAGGGCCGCCGTTGC 

TTGGCCG I Mill ICCCATTAGGGCTCCGGCCCCCCTGGACCGAGCCATCAATAAAAAAT 

CGGAACGCCTCAAATTCAANAAGGTGGCGAAAACCCCGNNCAGGGACCTATTAAAAGGAT 

ACCCAGGGCGGTTTTCCCCCCTGGGAAGGCTTCCCCTTCGTNGCGNCTCTCCTTGTTTCC 

CGACCCTGCCCGCTTACCCGGGATACCTTGTCCCGCCCTTTTCTCCCCTT 

Sequence 926 

TTTT AI I 1 1 IGCCAAAGCATGTGANAGTAAGTAGCAAACATCATGACACTTCACCCCTCA 

AATATTTTCCCACGTNTCACTTAANAACAAGGGCATTCATTCTGTTACATGACCACAATA 

CANTCAGCACANATAGATATTAACATTGANACAATGCTCTGATGTATTGGTATCTACTAT 

GCAGTTCATAATCAAATAAGGGTTTTGGTAACTTGGGATCCATAAATGGTCTGCAAGGTC 

AGCCCCTGTANACCTGCACTTAGTGGGGGCCATTTCCCTGTCTCANGGGCCTGCTCTACT 

GGNGATATTTTTCACTGGCTCTGTTGCTCACTTANA 

Sequence 927 

CCGGGCAGGTACGCGGGTGGCCGAATCCTGGCTGCTGGAGGCAAGTCAAATCATCTTCAT 
TTGTGGTGCTTGGAAGCTAGGCAGCTCTTTAGAATTATCCAGATGCCCACTAAAGTTCGA 
GCCATTCGCCATCTGGAATTTCTTCCTGATAGTNTNGATGCTGGTTCTAATCAGGTTCTT 
GGAGTACCT 
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CCGGGCAGGTACGCGGGGAAGCTCTTGAGCTCCTCTACCTCTTAGAAAGCACAATTGAAT 
CAGATA TCATA TGAAAGACATACACACTTCATGTAATGCTACCTGCAAGTCTCCCTAGAA 
AAGCAG! l I I iGTAGGTGAAAACAATGAAGCCAGGTAATATTGCAAGGAGGCTGTAATTT 
TA GC AG AC C TACCAACAACACTG ATGTAGGAAG CTC ATTATTTTAATTTCTGG AGCCTTT 
TAA1 n I I 1CTTTAGAMGTGTATAAATMTTGCAGTGTTGCTTTGCTTCCAAAACTGGG 
CAGTGAGTTCAACAACAACGACAACAACAGCCGCAGCTCATCCTGGCCGTCATGGAGTTT 
CTTGAAAGAACGTATCTTGTGAATGATAAAGCTGCCAAGATGTATGCTTTCACACTAGAA 
AGAAGGAGCTGCAAATGAACACTT 
Sequence 929 

AGGTACAAGCTGTTCTCCCGCAGCAGCCTGTGAGTCTTTCTCTTGACCGTCTTGTTCAGG 

AGCTCTTCCGCTCTGAGAAAACCAGCAAGGCCTCTCCAAGCGCACCGAGCAGGAGGCATC 

CCGTGGTTGCACTGGTCAGGATCGCATCAAACAGCAATATTGATAAGCCAGCTGTGATTA 

TCTTCTTTACCTTGGCACCGAGCTGAAGATCTGTTAGAAACCCTCTCCAACCAGGACTTT 

CAGGAGAAACGGTGGCCACCGCCCAGAATAACMGGCCCCAGCAGCAAGGTGTCCTTTTC 

CCAGCAAGTGAAACCTGCCCCCCGCCACCACCTGTCCCCGCGTACCTGCCCG 

Sequence 930 

CCGCGGTGGCGGCCGAGGTACTCCAGAAGAGTGTCCCTGGTCTGAGAACTACTGCTCTAG 

GATGACCCTTCTTGCTGCTGCTCACCAGCCAGCCTTCTGGCCCCATGAGTATTCTTGGTC 

TTATTTAACCTATATCCCCCAAAGGCAAGGATTTAGAGGCTATTTGGTCAGTCTTAGAAA 

ACACTACTCATGCCTAAGAGTACCTGCCCG 

Sequence 931 

GGTACGCGGGGTGACTGCTAAGACTGCCGGGGCTGCTGCCTGCTGCTTCTCGCGCCTGTC 

CTCTCCAAGATGCAGTTTTTTGATGCTGAGGAGCTCCTGGTCGATGAAGAGGATGATGTT 

TTTGGTGAAGGTAAATGCATCTTTCTGGCAGGAGGCTCTCCGGTCTGCTTTGCTTTGNTT 

TGACACCACTGGACACTTTTGCCCAGCTTGGGCAAGGACTTGAGTATTTTGAAATGATAA 

AAACAAAGCATCCTGAGTGAGGTAGTGGCTGCAAATATCAACTTANGCACTGAAGGAATA 

CACAAGGCAAGGGGTGGMCAAGGGGTTGAGTCATTCCCGCAAATACATTTTTTAAAGTC 

ATGACTAAAATAAAGGTTTAANTTTANGCNGGAAAATAGTACCT 

Sequence 932 

AGGTACGCGGGGCACGTCAGAAGGAGAGGCCTCTGCAGACACGGAGCACACGTGTGTTAT 

CAAGCGGGGCTGAGGAGTCTTCAGCTGAACCGTGCACCTGAAGATGCCCATGACTCCATG 

ATCAGATAGAAACTATTGAGCTTGGCACCTGGGGCTCTGGAAATCCAGACACTAAGTTGT 

GATCCTGGCACCACTTCCCTGGAAACCCGTATNTGGACATGCCCACATNCTTCCGGCTTT 

GTGCCTCTTGTCTGGATTTCTTGGGACAGACACAGCAAAGCCTCACGTGTTCTGAGAAGT 

GACCTGGCCTTCCACTGACATTCTTGGCTGAAGTTTTCAACTCAAGATGGTCACCCTGCT 

TATCACACAGAACCAGATGCTTCAGCACCTTGGAGAGCTNCACAAGACCCACAGACCCGG 

GCCCCCCAACCACAAGAAATATAATAAAATACTGCTTTGAGACTCAATTTCANGCACTTA 

GCTACATAAATAAAAGTTTNCAGGAAGACTAAATCTTAGAACCAACAGGGAGATGAGCAC 

TTCCAGAGTTTTCATCAGCTTTCCAAATGATATTTTAAATATGANGAACGTGAGGCCTGG 

CTTGGGCCTGTGGGGGAAMTTTCATTACAGCATCCTTGGTAATAAATGCTTGAAAATTC 

ATGCCAAA 

Sequence 933 

AGGGCGATTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACT1TC I I I I I I i I I I 

rTTTTTTTTTTTTTTl I ATTTCTGGAGCTCTTGCTCTGTAATAAGAGCTAGAGTGGATGG 

GCCTGACCCTAGATGTTTTATGTANATGAGGTCTTGGTTTGAGAGAAACAAGGATTGGGA 

AAAAAGCCTGTATTTTCCAGTCCTAGTGAAACGGGAGAGAAAGAAGGTNTNATNAAGTGG 

AGCAAAAGTCAGACACCAGGGTCCCCACTTAATTTTTAGCCCTTGGTCACCACAGCCCAA 

NAGGTTGGTGGGGAGTTGCTTTANAAACACCCANATGGCCACGGGGAAGCCCANCAACAC 
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TGGGGCTGGTGGAGTGCCTNGGCAGACCCTGGGAGAAGGTCAGCCCTTGGCT 
Sequence 934 

AGGTACTTATCAAAAATACTTTTTACAGTAGCTATTTCAATAGATGACAATCCCTCTATA 

CCCTTCAGACTATTAGATCTCCTAAGGTGTCTCTAAGCATGTAATCTTCAGGGGMGTAT 

TTGCTCTGCCATACGCATGGGGTGCAATCACAGGCTCTATCCAGACTGATTCTATCTGGT 

GACCAGGCAGTGAGATATCACATCAAAGTTGACTCAATCTGGCTTATTAATGGGGAAAGC 

AGAAATACAGTGAAAGCTGTCAGAGACTTTGCATCTTTTGGAAAAACTATCTTACAGTTA 

AACATCTGGAGCGTCTCCAAAAGTGAAGAAGGTCTCTCCCTCCTTGNTGTGCAGCTAAAT 

CATCAGAGCTCTTTCATTGGAGACCAGTTTCTTTTT 

tng mrm GGAAACCAAGGGTCTTTGGCTCTGNTTGCCCAANGCTGGANTGGCCANTG 

GTGCCAATAACCGGGTTACTGGGAGCCTTCCCCCTTNCANGGGTTNAAGCAATCCCTCCC 

ATCTCAANCCTCCGGNGTACTANTGGGGCCNNCAGGCATGCCNCCCACNTANCCTGGCCA 
AANTTTAATAATTAA 

Sequence 935 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 936 

CCGGGCAGGTACTT TCACT CTTGCTC I I M I I 1 1 I I I CTTTTTTTAATCGATATCCTGCT 

GGCCTGTGTCAGTAATTTTCTTGTGCTTTATGACTTCTCAGTAAGAAACAGTGGATGGTC 

TGATGTGTTAAGAAATGCATATGTAAGGCCGGGCGTGGTGGGTCACCCCTGTAATCCCAG 

CACTTTGGGAGGCCGAGGTGGGTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTA 

ACATGGCGAAACCGTGTCTCTACTAAAAATACAAAAAATGAGCCAGGCGTGGTGGCAGGT 

GCCCGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCTGGGAGGCA 

GAGCTTGCAGTGAGCCGAGATCACCCCATTACACTCCAGCCTGGGCCACAGAGCGAGACT 

TAAAAAAGAAAAAAAAAGANGAAGAAGTTCGAGACCAGCCTGGCCAATATGGCGAAACCC 

CATCTCTACTACAAATACAA 

Sequence 937 

GAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGCAGAATCTACCAAGG 

CACTCTGTAGCAAAGCAGGCCTTCCACCTGTCTCGGGGAACCCAGCTGGCCACCTTGCAC 

TCAGCCTCTGCCAGTCACCTCAGGGCTGGCATTTGGGAGTCCTGACTGCACGCCAAAGGT 

TG7TAAGTGGCTTTCTCTGAACTCATTTGAACAGCTGAGTAAACTGTTCTTGCTTTCACA 

AGGAAAAGAACTCCTGAAGTCATTAGAACTCACCAAACCAGNTTCCTTGTTTTTGAAGAA 

AGCAGGTAA 

Sequence 938 

TTGGAGCTCCACCGCGGTGGCGTCCTGGGTGCTTCTGGGTGAGATCAGACCTGCCTTGCT 

TGGAGGGGGGGNGGGAGAAAGCAAGAAAAAAAMNNTTTTNTTTNTATGCCCNGGNCTTC 

CCACCATTGAGAAGCTCCCATCTTCCTNTGCTTTCTTGCAGGGCAAATGTGCCANCAAAA 

CCATGTACCTCGGCCGNT 

Sequence 939 

AGCTCCACCGCGGTGGCTGCCGCCCGGGCAGGTACGCNGGGTGGCGGCNGACNACCCANG 
GGGNNNGNNGGGNCCTATCAAAAAAAAAAAN I I I I I I I II I I I I I CCTNCCATGGTGGAC 
CACCGGGTTGACGGGGAATCAGGGTTCNGATTCCNGANAGGGAGCCTGAGAAACGGCTTA 
CCACATCCAAGGGAANGCACGCAGGCGCGCNAAATNACCCAC 
Sequence 940 

GGGC GAATTGGAGCTCCACCGCTGGTGGCGGCCCGAGGTACATTTTTTGATGAATTAGTG 
ATTTTAGTGCAGAATGAAACCMCCCAAAAAACCAAAACCAAGAGTATGTAGTTGTATTT 
TCTTGGGGTTCTCAGATGAGTTGMGGTAAAGCAATTGTTTGTTGAGGAAACCAGGCTTA 
GAAGGCCTGGGCCTCTGCTTGTCTTTTGCCTGGGCACTGGCCACATGANGAAGAAAAAGG 
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CCTAAGCCTTCCCGANNAAGTNCCCTGTTGCTCCACCTNCGGGTGCCCCCANAAAGAGGG 

CTTGGGGGCCTAGGGGGTCCCCCTTCTTCCCTNGAGGGCNTGACCCCCATATCTTCAATC 

CCAACNTGGCNCAANGCNCCAATNGGGGGNCTTGGGCNTTTAAANATGGGTTCTTGGANA 

ATGGATTCCCCTTAAAAAATTATTTTCTTTTCNACCTTTNGGGAATTTNATNTTTTGCCA 

ANGG7TTCANGTCCCTTCCTTCTNCGTNGTNGGGAAACCTTAAAANCCAATTAAAAAATT 

NTNNTTGGNCCTTTTTCTTGGAAAC >wwvm 

Sequence 941 

ANACCAGCCTGGCCAACATCCTGAAACCCCATCTCTACTAAAAATACACACACACACACA 

CACACACACACACACAAGTTAACCTGGCGTGGTGGCAGGCACCAGTAATTCCCACCACCT 

GGGAGGCTGAGGCATGAGAATTGCTTGAGCCGGGGAGGCAGAGGTTGCAGTGACCAGAGA 

TTGTGCCACTGCACTCTGGCCTGGGCAACAGAGCGAGACCCCGTCTCAAAAAAAAAAAAA 

A1TCTTATTTCATTCCTCATGCTACTCAAGAAGCCATGGGTGGTATGTTTTGATTTACTC 

CACCTTTGGTCTGATGTAAAAGCCACTGATGTCNATGGATAAGCAGTGGTAGGATGGTTT 

GAGTTTGGGGATCTTGGAAACACTTGAAAATTCTTATCTATGCCCCATTTCCTAATAGCG 

ATGGGGAAATGTCTNCTGTACCTCGGGCCCGCTTCTANAACTAAGTGGATCCCCCGGGGC 

CTGCAGGGAATTTCGATATCAAGCTTATCGAATA'CCGTCGANCCTNGAAGGGGGGGGCCC 

c 

Sequence 942 

AGGTACAAATTATTTCCAGGTGCTGATAATTATCACTTGTGGTGTTTGATATTCCGGACT 

TTGCATCTAAATTTATACTTGACAAGTTAMCTTTTCATCGTGTGCATTTTCCCGTAATC 

TCTCAACAGCCAAAGACAAGGATAGACGGATTAGTGATGGGTGAAACCAATTCAACCCAA 

AATGGCAACACTTAATTACMGTCTTGTTGTTTATAATTMCTGTGTAMTCACTMC^ 

GAAAGTTTCTTTTCAAAGTATTTAAAACCMCTCTTTTCCCAGGGGGAm 

TATTATTGCATAGTTMGTATTATTTTGTTTGCTCTATGAACAAGATGTGAAAATGGCAC 

ATmACCGACTATCTTGCTAGCCAAAmGGGAAAAATGGTTCATGGAATTGGCArTTA 

CCTAGCNNCCATTTGGGTGCTTTCTCAAGTAATTTTTTTGGGACCATTTTT^ 

AANATTATGN AAAA GCCCAATGGGGAAAGAACCAAATNCCAACCTACCCCNAATGGACTT 

TTTGAAGTAATTTTCTTGAAMNTGGGGAAAANTGGAAGGGGGTC 

TTTCNGGAAAAA TTAAAA AAAAAAAAAAATGGGGGNCCCCCCAAAGNG I WWW I CNGG 

NAANGCCCNTNC I I I I I I I I I I GGGGCCCAATTTTTTAAAAA 

Sequence 943 

CCGCGGTGGCGGCCGGCTTTTACTATATTTTTAAAATGTATATATGGTGGTATACAGAAC 
AAATGAAACACGAGACTTTTCAAATAATTCAAGGAAGATGACTACCATGCCAGACTGAAG 
ATGATGTTACTGAATCAGCTTATTCAG I rTTTTTTTTTTi I AAAGAAAATTACTTGTATT 

TCAATTTGCCACACGTGGTGGCATACACTTGTAATCCCAGCTACTTGGGACTACAGGCAC 

GCGCCACCGCGCCCGGCCCCGCGTACCT 

Sequence 944 

CGCTTTTCCAGGTCGNGGGAAAACCTGTCGTTGCCAAGCTGCCATTAATGGAAATTCGGG 
CCCAACCGCCGCCGGGGGMNAGGGCNGTTTTGCCGGTATTTGGGGGCGCTTCTTTCCGC 

Sequence 945 

AGGTACGCGGGAAGTCTAAACCTTTTTGACAATACTCACCCTACCCTGGCCAAATACCAT 

CAGAAAAACCCCACCCCACCCCACCCCAAAGAAGCAAATAACAGTGCTCTCACTCCCCAA 

CCCAGTGGTGTGGGCTGGGGAGATCAGAGACCTAAGCACCCATTTCTTTGCCCAGCAAAA 

GCAGACAGCAC7TTGGTTCCCCCTGCTGGGTGGTTTTGGCAGACCAGGCAGGAAGCGTTT 

TCCCTCATTCCACTTCAAGCTGAAGGAAGTTATGCTCTTGCCAGGGTGGTGTCTAGCAGC 

AGTGGGGTCTAGTAGCAGATTGAGCCTCCACCCCAACCCAGAAGCAATGACACTTAACAA 

GGTGACNGTGAGCCAGGGCTAATCCATACTCTACCCTCACCTTACATCAACGAGGCAGAA 

CAAGATGGTTGGAGGTAGGATTCTCAACACATCTTACCTCCATTAATGTCAGTGGGGNCC 
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CAGTGGGAGAACTTGAGCCTTNACTCCCATTCTGGCAAAGTGGATGGGTGGNCAAGGGGG 
Sequence 946 

AGGTACACAGCGTGGTAAGGAAAAGTCCAAAACTTTGGAAAGAAAGAAAAATCCTCCATC 
ATTTACTAATTGGGTGAGTTACTTGCGGTTAGCAGTTGAGCTGAACACAATTCCCCCATC 
TCTGGCAACTACTCCCAATACACACGATACCCGGTGGGCCCCGATGCCAAGGCCAAAGGG 

GGATTCTTAACCCANACTGAGCCAGCCAGATCTTCTTGTGCGCCCAAGTATGCATATGCC 

CGTGTTCGTGTCCCCGCGTACCTGCCCG 

Sequence 947 

CCGGGCAGGTACCCTGCAGAATCTACCAAGGCACTCTGTAGCAAAGCAGGCCTTCCACCT 

GTCTCGGGGAACCCAGCTGGCCACCTTGCACTCAGCCTCTGCCAGTCACCTCAGGGCTGG 

CATTTGGGAGTCCTGACTGCACGCCAAAGGTTGTTAAGTGGCTTTCTCTGAACTCATTTG 

AACAGCTGAGTAAACTGTT CTTGCT TTCACAAGGAAAAGAACTCCTGAAGTCATTAGAAC 

TCACCAAACCAGTTTCCTTGTTTTTGAAGAAAGCAGGTAATGTGGCTTTGCCAGCAAGAA 

GCAGGCCTTGGGCGTCCATCTCCTTCTCAGGTGTGTTTACTCAGGAGAAGGTGACATTTG 

GATAGGGGCAGGTGTGTTTGCTGAACTGAGCTTCAGTCTAATCATTCCGGAAAACTTCCT 

GAGACTTTCTCTGCCCTGTAATTTCCCCACTGTTGATGGAAGTATATTGCAGTCTCATGC 

AGGACTGCGAAGGGAAGGTCACACTCACTAGATCACTTATATACCAAATTCTGGGGAACT 

TTTGGTGCATATTATACTTTAGATCTCTTTTCCTTTAATGATAATCATTGAAAATATTAC 

CATGGAATAAAATCTGGACTTTAGTTMTGGTAACATACCAACGTTGGTTTCTTT 
Sequence 948 

AGGTACACTGGCTGTGCTGTTTCTAGCACCATAAGGCAGTCATCATTATTGAGCCAGGGG 

CTGTGCAAAATTGGGGTGATTGCAGAGCCAAAAAGCGGGCTGGAGGGTGGTTTCTGGAAT 

GCCTGTTTTACTATTCTTCCTGTTCCTTTGCTCTTTCTGCCAACTTCAGAATCCTTCCTT 

TCCTTAAAACATTTAGTGATATCCCTGAATTTATTTAATTGCTACGTGAGTGATTTTT 

CCCACAGCACTGGAAGATAATTGGAATGCATTCTAGCTTATGTGCTCTGGATAATATGCA 

ATTACCAGCCAGCTGGAGAGTAGTATGCCTTTGTCTCACAATGTCTTGTAATTTTAAAAA 

GTNTTCTCCTATTTGCTGGTGTCTCTTGNTACCTTTCTTACGGCATCATATATCTACTNT 

GGGGGTCCTTATGTCTGAATGTGTAGACACTTCTTTTGGTATACTGAATAANCTCCTGGC 

GATGAGGGAAGACACCGCTTTGTTTTAGNGATGTCCTGTGCAGTTGGGCCCCAATAAAAC 

GCTACGGAAATG ATGGGT GATNAGACTTATTTCTTTAGATCTTTAAAATTGGGCCGATNG 

AATAACNNCCANGTTTTTAGAMCCCATANAAATTTTCNAGAAGGMCCTTT^ 

AGACCGGAAANCACCCCTTCCTTTTCCTTTGAACCAGGNCTGTCCCCCNANGTGGNCAAA 

NNTTGGAGCTTNCCCNTGGGGGAACCGGGCCCTAAAATNGGGGTTCCCATNNTTCANTGN 
TTN NC N N ACTTACC AAAAAAAA 
Sequence 949 

CCGGGCAGGTACTAAGTTGAAGTAAAGCTCACCCTTTACTTCCCTACTTGAAAGTTCTAC 
TCTGAGCCTTGACT CTTAGCCACAGTGAGGCATGTTGAAGGTTCCTGCTGGTTTATTCTT 
TTTTTTTTTTTTTTTAGACAGAGTCTCACTCTGTTGCCCAGGCTG 
Sequence 950 

ATCAGGGCGATGTAAAAAGAAGGTCAGACACTGCGGATGGGTGACTTCAATACGCTTCAC 

GCTAATAAA TAATT GATCCAGTGCTGGCAGTTGTAACAGCATCAATAATCGCTGGTGTAA 

GTGTGAGGCTTTTACTGCATCGCCGTTATGCCAGTGTAGGGTTGAACGAAGATAAAAGT 
Sequence 951 

CCGGGCAGGTACTGGGGATTGATTAGCCCAGTCTTTGCCACAGACTCTTTTGGACTGAGT 

CGATAGGTCAAAGGTTAGCTTATGCAGTAGCATGGGCTGTAGGTTTTTTGCACTGATCAA 

CACCGGAACTTTTAGAAGACTTTTAGAAGATTTGTTGCTGCTGTGAGTGACATCAGTTCA 

CACAGCCTGAGAGAGAACTCTCAGTCCATGCTGTCCCTCCGTGGGCTCACTCAAGCGAAA 

TGGGTCGAGCTGCAACACTGTGCGGTGCCAACCGGGAGAAATCTGCCTATCGGGGCTGAG 
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AGACATNCTGTAAGTGGCTGCCCGTCTCCTTTGCCCCAAATGTCAAGTTGGAAGAGGTGC 

TTCACATATTTATTTCTTCCTACATGGGCTCATTTATTTGTAAGCCCATTTGTGACCCTT 

TGGAAAGCCAAAGCTTTTATCTTGATTTTTTGGAGTGTGGAACAAANGGTGAACNAGNTT 

GTTCTTCCNGGATTGACANGGGAATTCTTGGACNTTTTTGGTTTTTGGCT 
Sequence 952 

CAGGTACI rTTTTTTm I CTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTA 

GCCAGGATGGTCTGTATCTCCTGACCTCATGATCCGTCTGCCTCGGCCTCCCACAGTGCT 

GGGATTACAGGCATGAGCCATCGCGCCCGGCCCTCTGGTTTTACTGTTATTGNGCCTCAG 

CTTTTGTTCTGATCCAGGGCATGGCCAGTCAGAAGAATGGACATTCATCCTCCTGTGTCT 

CTATAGGACAGTGTCTAGTCTTCAGCAAGAGAGGAAGTGACGAGGGACTCACAGATGTTA 

TGCAAGTCCACTGTTTCCATATGATTTCTAGTCATGTAACTCCTCCCTACAGCCCAGGGG 

AACATGCAATGCCTTMTTAAAATGTTCTGAGTTAGCTTAAANAAGCTTTTTTATTATTT 

NGAACAANATATGCTTTTGGGAACAACNGAACCTGGGATTTTAAAAAAACTTACCTTTNC 
TATATTTGGGGAATTCTTT 

Sequence 953 

CAAl I I INCTTGTAGTTCTNTTCCTTCANAGATGAGTAGGGAAATGGAGATCAGCACATA 

CCATTTGTAATTATACATGTTATAAAACATTATTTCCAAAGCCATATGTTACAATAACTN 

ATGTCAGATTNCTTACATTTNTATTCACTTTTGATGAGCCCATCTTTATCCAAATCCTTA 

TATTTTCTAAAGAGAATAAATAAGTGGTTACTGAAAATTAATATCACAATAAGTCTTTAA 

TAACAl I I I ATTAATAAGTGCTTAATTACATTTCAGAGTATTCTTAGGTTATTGGCAAGT 

ATCCCTCTTCTTGCATATATGTCTTCAAAATAATTGCAAACCATNATTCATATATATCTG 

TGTTATGCTTNNCGGGGGCTTTCTCCCTAAATTNTATNTGGTGCCTATTTATTGAACTAC 

TTATTATACAGATTGGAGCCATCCCTAATTAGGAAANATCCCAAAATGCTCCAAAATCCC 

AAACTTTTNTGAACACTTGACATGCTCACATTGACCATTGGCAGGNATTTNAAAATTTTG 

Sequence 954 

AGGTACCTTTGGTAGMTTCGGCTGTGAATCCATCTGTTCCTGGGGTTTTTTTGGTCTGC 

TGGAGTTTGCTGGAGGTCCACTCCAGACCCTGTTTCGCTGGGTATCACCAGCAGAGGCTG 

CAGAACAGCAAAGATTGCTGCCTGTTCCTTCCTCTGGAAGGTTCGTCCCAGAGTGGCACC 

CGCTAGATGCCAGCTGGAGCTCTCCTGTATGGGGTGTCTGNCGACCCTTGCTGGGAGATG 

TCTCCCTGNCAAAGAGGCACAGGGNTCANGGAACCATTNGAGGANACAGTCTGTCCCTTA 

GCAGAGCTCAAGCACTGTGCTGGGGAGAATTCTCCTTNTCAGGATCANCTTGCTCTCTTC 

AGAAGCCAGCAAAGCAGGAAAGTTTTAAANCNCGCTANAGCTTGCACCAANCAGNTTTCN 

CCNTTCCCNCAGGNNGCTTCTNTCTNAAGGAAAAANGGGTNNrnTNTATTCTGNANAGC 

ACTrrGGAMNNTNGGCATGCNTGACCTTTTNNTTTTCAANAAAANNNCCNCTGTGNAAT 

NNGNANGACGGGAAATCNT NTAAN NNGGCNNATTTCTGNGANCCACANNACCNNNTTTTT 

TTTAAAATGNGTGNNTTTGNTTTTTNATTCCCCCCCCCCANNNGNAANCCCCCCCCNNTG 

ANANTCTCNTAANAGGGGTNCNTCNCCCAACCCCNNCCNCCCGGGNCNN 1 1 I I 1 1 II INC 

CCCCCCCCCCCCCCCCCCC 

Sequence 955 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCCTGAGGACTTTTT 

ATCTGAATGTGCAAAGGTTGCATCCTTTTGTGAATATCCAGAGTTTGGCTGACCAGGACT 

GGCTGCTGAGCAGAATGGCTTTTAGTCAATGAACCCTGGGCTTCGTATTTACTTAGGCTG 

GGGAGTGGAATTTCTCTCTGGTGACTTTCAGTTAGATGATCTTCTGACCTCCTCTCTAAC 

GGGCTTCCATTGACTTGCTCCTCATTGCTGCCCCTCTGCTGTTTGTTGAGCTGCTCAGCC 

TGAAGTGCCTCTGGGTTAAGGCGCTTTCTTGTTCTCCTCCTAATAATCTGCTCACCGTTG 

i i i IGTTTAATGATGTTTAAAGGCCTGGGAGTCGAGTGAAGCTTCTGGTAGAGGCCACAC 

GCGTTGCATACATATCCGCCATTTGCATTCTTTCGCCAGAGAGAGGTCTTTGTGGTCAGG 

CAATTGGCACAAAAAACACCGGAGCCTCTACGCCTCCGTAACAGGGACTGGGATTCATCC 
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TTGGACTTGTTTTTCTCCCGATGCAAGATACTG CTG GGGG GA 
Sequence 956 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 957 

AGGTACTTATACTGAAATTCTAGTCTAAGAGATGGATTAAAGAGGACAATGTGTMGCTG 

ATACAGTGTGGAAAGATGGGATGGAATGGCCCCCAGGGTGTGCTGATAGAGAAGAAGGAC 

CAGGCTGGGGAGATTGAGTGGAGGAATGAGTGATCATAAATGAGCAAATACTTTGTCCAA 

AAAGCCCCTCAGAGAAGTGCAAGGGACGTAGTTGGGTGGGTGGTTCAGGGAGTAGCTCAA 

GTAATGATTTGAAGCTGMGTCAGMGGMCTAGCCACATTTCTTCTAGAAAGAACTATA 

ATGGAGAGCTCACGCTTAGCATAGAGAGGGGGTAGAAGAGGAAGAGTGTGACCAGAGGAG 

GTTCAGTGGCTGGGGAGAGAACCACGGGTCCAAAATGTTGTGATGCAGGGGACTCAGAGG 

GTTACATCATGCTTTAATGATTGCTACATTGCCTGAGTCTGATACAACAATGAAGGTCAG 

TTATACACTTGAACTGA7TGCTTATCTTAGGACAGTGGGTCTCAACTGGGGGGTGATTTT 

ACCCCCAMGATGACATTTGACMTTATTCTAGAGACCTTTTNGTTTGTCCACCTATTTT 

GGGGGANCCAGGGTTTC 

Sequence 958 

GGAAATACATGAGGCGGTGGGTTTGAMGTCTTTTCTTAATTTGTAGTCAGAGCTTCATA 

ACTGGGTGAGGCAAGAGAACATGTGATCAAGAGTCAATGTGATGACAAAGAGAAATAGAA 

ATCACTATTAAATTACGGTGTTCGAGAATGGTGATGTGATTCCAGTAGAGCTTGAGGGGG 

AAGATGTTACTAGAGATTGAAAGACAAAGGAGAGGTGGGGAG7TGGGGACAGGTTGAAGA 

GTTAGTCTGTATGA TGCTGTGG TTTATGAATGGAGAAAAAATTGTGGCACACTAGAGAAT 

ATCAGCTTACAGGC TTTTTT TTATTCCAACTCATTAATATAGTGTGGAGAAAGATGAGCA 

AGAAATAATTGATTG nil I GAGTAAGAGCTAGTTAAAGAAATTTGGCTTGTAGAGAGTG 

AGAGAAGACTANATACAGAGAGTGGCCCCACGATATTGCTTAGTATTCCAAATTAAGTTT 

TTAG TTGA CCAAAAAGAA ATGG TGCCAGGGTTTTTAGAATTTTAAATCT^ 

TGGGTTTTCATCACCTGNTTTTCTTAACCTAANTTTTATGTCCNTATT^ 

CTTTTGTTGGGGGNTTTTC 

Sequence 959 

CCGCGGTGGCGGCCGAGGTACTGCAAAAGGCAGATGTGAGGCAGGGGTGGAAAGCAGGAA 

CAGGGTTTGAAAGAAGTTCTTAGACCCTCCCGGTTCCTTGCCCTGAGCCCACGCCGTCAC 

AGAACCACCTCTTCTCCATTTCTGCAGGAGATGGTTTAAATTCTCTGGAGATTAAATCAG 

AAACATTCAGGAGTTGGGGAACCTGGGCAGGGCATGATGAAAACAGAGGGATCAAGTAGA 

AGTCCTCAGGCTGACACCCCCAGCTCCGCCCCAGTCATGCCTCCAGGGCCTTGGCATCTC 

ATCTTATCATCCTTGGGTAGGTGATTAAGGTCTCATTAAATCAGGGGGATCCCCTTATAA 

AGTTAAACTTGGTACCAATATCCTATGAGTGAAAGCATTTCATTTTGACCNGCCANNATG 

TNCCCTCCCCGGGGCGGGCCCGTTTTTAAAAACTGGGGGGNTNCCCCCGGGGNNNTGGNG 

GGGNAATTNCNAAATTTAAANGGCTTTTTGGGAANCCCCGGNNNCCCNTTTTNGGG 

Sequence 960 

^^^^[^^5^ ^ GAG TG CTCTTTnrOXTnr ATOT AAATTTG G G AATTG TOTO C TTX ATXTC TTO 

TTATTTTAGAACAAACATTCCAACATTTGAAGTTGCATAATCATAAATTTCM 

GGACAAATATTTACCTGTTCATmGTATTTTTA^ 

CCAGGCTGGTTTTGAACTCCTGATCTCAAGTGATCCACCCGCCCCGGCCTCCTAAGGTGC 

TGGCATTACAGGCATGAGCCGCCA 

Sequence 961 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAGAATGT 
GC ATTTATTTGTG G GG AG G AGAG G AGGTCTAAATAG AC AATAAAC ATATAAATATTAATA 
TTGTGAAAGTGGTATGAAGAAAAATAAAGCATGATAAGGAGAGAGAGGTGTTCCCTGGCT 
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CTCCAGAGGGCTGAGGTGGAAGAAAAGGACTCCTGTTTTAGGTAAGATATCAGANAAGAC 

CTCTCTGATTGACTGAAATGTATGCAAAGGCCTGCATGAAGCCAGGGACCATGCCGTGCT 

GACATCTGGGGGAGGATTATCCTCACAGAGGGGAAAGGAAGTGCAAATGCCGTGAGGTGG 

GAGGACCATGTTTGGTATGTTCAGGGAGGAACGGGGA 

Sequence 962 

CGGTTATCCACAGCAATCAGGGGGATAACGCAGGGAAAGAACATTGTGAGCAANAAGGCN 

CAGCAAAAAGGCCCAGGNAACNCGTAAAAAAAGGCCGCGTTGCCTTGGCCGTTTTTCCCA 

TTAGGCTCCCGGGCCCCCTTGACNGNAGCCATNACAAAAAATCNGACCGCTCNAANTCAN 

GAGGGTGGCCGAAAACCCCNGACAGNGACCTATTAAAAGATACNCAGGGCGTTTCCCCCC 

7TGGGAAGCTTCCCTCGTGCGC7TCTTCCTGTTCNCGACCCCTGCCGNTTACCCGGGATT 

ACCCTGTCCCGCCTTTTCTCTCCTTTCGGGGGAAAGGCCNTGGGCCGCTTTTCNTCAATN 

AGCCTCAACCGCCTGGTANGTTATCTTCAAGTTTCCGGNTGGTTAGGGTTCGGTTTCCGC 

TTNCCAANGCTTGGGGNCCTTGTTGGTNGCCNANCTGAAACCCC 

Sequence 963 

CCGGGCAGGTACTTCAAAGCCCACTTCTTTCCCTTAAACCAACACTTATTCCAAATGACA 

AACTCCCTCAATTACCCGTAACATTAATGTTTATACTCGGCATTTGTAATACAATAAAAC 

ATAAAATGTTTATACTCGGCATTTGTAATACAATTTATATTTTAMGTTTTCTGA 

GATGTATCAGATGTTGGTGCTTATCAGATAAGGAAACAAGGGTCCGAAAGGTTTCTTAAC 

TTCCAAGAGCTGAGCAGGCTGCTAACCTGGTCTGCTCATCTCCTAGTGGGA I fTTTTTTl 

TCTCTTACATGGCATACTTCTCTGCCATCATTTTTAAAACTCTGAGGGGGAAAAGGAGGC 

TAAATGTTAACATGCTGTGACCAGTAAACCAAAGATGCAGTGATTGCTAGTAGGCTCCAT 

CGTCGGCCCCTTGCAGGTGGTGGGGACATATTCAGGGGTGGTGGATATCAGTGAGCACAG 

CCCCTGACTGGTGGCTGCCCATGCAGAGTGTGGGGGTGGGGAGTAGATGGGGGGCTTTTC 

CTAATGCATCTAGCCCCAGTGGCTCTTTGGCTTCCTAG 

Sequence 964 

CGGCTTGCGGGCGAGCCGGGTATNAGGCTCACTCAAAAGGCCGGGTAATACCGGTTATCC 

ACAGTANATCANGGGGGATAACGACAGGGAAANGAAACATGTGGAGCAAAAAGGGCCANG 

CCAANAAGGNCCAAGGNAACCCGTAAAAAAGGCCCGCGTTGCTTGGCGTTTTTTCCATTA 

GGCTTCCGCCCCCCTGACGAAGCCATTCAACAAAAAAATCCNACGCTTCAAGNTCAAGAG 

GTGGCCGAAACCCCGACAGGGACTATANAAGGATACCAGGCGTTTNCCCCTGGA 

Sequence 965 

CCNTTCCCCCAACCCGGNCGGGGGTTTGGGGNCCCGGGGNCCCCCGNAAAGATTNTTTTC 

CCCCCCCCCAACCAACCCCAAAAAGNTTCCGCCTCCGGAAAGGCCCCCGGCNTAAAGGNA 

ACTTTTCAACCCAAAGGACGCCGGTCCNCAATTTCCTTTTTAACCCCACCGCCCCAAAAA 

CCCCCANCCCCCCACCGNNNNGGGNGGGTNAANCCCCTTNCNGGGGGCCCCCGCCNTTCC 

TTTAAGGTAAAAACCTTAAGGGTTGGGGGGAATTNCCCCCNCNCCGGGGGGGCCTTTGGG 

CTAAGGGGGAAAATTTTTCCGGAATTTAATTTC 

ANTTAACCCCCGGTTCCGGGAACCCCCTTCCGGGAAGGGGGGGGGGGGGGGGGCNCCCCC 

GGGGGTTAACCCCCCCAAGGCC I I I M II II I GGTTTTCCCCCCCTTTTTTTNANNATTG 

GGANGGGGGGGGGTNTTTAAAAAATTTTTNGGGCCCGNCNCGGCCTTTTTGGGGGCCCGG 

GTTAAAAAATTCCCMTTTNGGGGGGTTNCCANTTTAANAGGCCCNTTGGGNGGTTANTN 

CCCCCCTTTGGGTCNGGGTTGGGNAAAAAAAAATTNTNGGGGTTTTTAAAANTTTCNCCC 

GGGCNTTTTCCAAACCCAAAANTTTTTTNCCCAAACTAAACCCAAA^ 

TNAAANNGCCCCCCGGGGGGGGGAANGCCAATTTAAAAAAGGGTTNGGGTTANAAAAAAG 
GCCCCTTTNGGGGGGGGGTTTGGCCCCCTTTAAMTTTGGGAAGGTTTNGGAMGCCCT^ 
AAAAACCTTTNCAACCAAATTTTTNAAAAATTTTNGGGCCGNTT^ 
TAACCTTGGGGNCCCCGGGNC I II I 11 I II CCCCAAANGTTCCCGGGGGGGGA 
Sequence 966 

CGGGGNTGGGGGCCGGGGCTCCGGAAGNGGGTAAACCTNTNTNTATGTGTGTCTNTNTCT 
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NTNTCTNTNTNTNTNTNTGTGTGTNTGTGCATACCAAAAAANAAAACCTGGGTNTNTNAA 

TNNTNAATTAACCCAGANGNGTNGGGAGGNTGTNTTTNGGNAAGNGTTCCACCGAGAATT 

GGGGGGNCCTTTTTAATTCCGCGGGTTACGNGAGAATATNTNCNTAGGGNTGAAGACTGC 

AAGNCCNGCCGNGGGCCTAACCCCANGGGNGNCTCCTTCCCCAAAAAACCTTTTNTGTGG 

GGAAACCTTCCGNCCAACGNCGGNACCGAAAGGGGGGGTTCAAAGGCCTTTAACCCNACN 

CCNAGGGGGGGTTCCCCGNGGTCCAATTAACCTTGNGNANTTGTAGGGAAATNGGTACCN 

TTNTTGGMTACCNTACACTTNTCCTTGTTGGGGAAGAAGGGCNCCTTTTAATTCCCCAA 

CCAATTAAATTTTTTCCNTNGGGGGGTNAAAATTAAAGGGGGGCCCCCAACCCCAAGGNG 

GGGGCCCN I J I I I I I IGGNGGGAAGATAATTGGNGGNAACCTTNGGAACCGGNGGGAACT 

TAAAGGCCCNANTTAAAAAAAAATTNCNTTGGGGGGGNCCNCCANCCNGrnTGNNAANC 

CCCAANCCCCAACCNCCCCTTTTTTTAACCCNAGCCGGGGGGGAACCCANCNNGGGGGGA 

ATTNGGTTGTTTTAAACCANCCCCCCAAGGNCCMGAANAAATTNCNGGGNNTTTNCGCN 

TTTTNGGCCCCCGNAANAAAAAAAGNNCCCGGGGAAAAAACCTTTGGGGTCTTTCNCCAA 

AGGCCAAAGGGCTTTTTGGTNAACCNCTTTGGCCCCNCCGGGGGGTCCNGGGGCCNCCGN 

CTTTTTNTTTAANANAAAACCTTANGGGGNGGGGGGAAATACCCCCCCCCCCGG 

Sequence 967 

CCGGGCAGGTACCTTCTCTGCATGACTTAGAAGAAAATTAATTTTGGTTTTTGGACAGTC 

CTGTTGACACTTCTGTTTTGTATTMGTGATGAATTCCATGTTAAAATTATACTGGCCAG 

TTGATAGTTACCMGGAAATGCTAGGCTCCTGATTCCAGTCAGTGGGTTTGGATATCCTA 

ATCTGTGCTCTCCTCTCTTCTAGAATATGAGTGTTCTGCCTGTATTGGGAGCACATTGCT 

CAGTCTTTGAGTTATGCTCAGTTTCTTAATCCAGAGAATAAAATGTGTTTGGGTGCTGAG 

ATTTTAGAGCTTTTCTGTAAGGTCAATACACTGAGAGAAGGCTTAATATTTCTTATCCAT 

TGGTATTTTCCAGAAAACTGTATTTGCTTTTAC I I I I I I I I I I I I i I I GAGACGGAGTTT 

CGCTTTTGTTGGCCCAGGCTGGAGTGCAATGGT 

Sequence 968 

AGGTACGCGGGAGGGCAAAGGGAGAGAAAAGAACTGGACAATAGGCATTACATGGTTTTT 

TGGGTTGTGATGGTGAGCACAGTGATGTGATTATGAGGTATCAGTCAGTTTTGCTATAAA 

GGAATACCTGAGGTTGAGTAATTTATAAAAAGAAGAGGTTTACGGCTGGGCACAGTGGCT 

CATGCCTGTAATCTCAGCACTTTGGGAGTCGGAGGTGGGCAGATCACCTGAGGTCAGGAG 

TTCGAGACCCAGCCTGGCCAACATGGTGAAGCCCCACCACTACTAAAAATTCAAAAAATN 

ANANAAAAAAAAANGTTCCTGCCCGGGC 

Sequence 969 

CANGATGCCTAGGTATTAAAAGCTCTAATTCCGAAGGGGGCTTTAATGTAAAAGAGGTAA 

GAAGGTTAGCCAGAAGCTGAAATGATTAGGTMGAAAGTTAGTTGTTGCTTTGTAGCTAC 

AAATAAAATTTGGTGAGTTCATTAAAAAGATGTTTCCAGCCAGGCGCAGTGGCTCAAGCC 

TGTAATCCCAGCACTTCGGGAGGCCGAGGCAGGAGGATCACGAGGTCAGGAGTTGAAGAC 

CAGCCTAGCCGGGTATGGTGAAACCCTGACTCTACTAAAAATACAAAAATTATCTGTGTG 

TGGTGGTGCTCGCCTGTAGTCCCAGCTACtCGGGAGGCTGAGGCAGAAGAATCGCTTGAA 

CCCAGGAAGCAGAGG 

Sequence 970 

AGGTACTGACTTTTGGAGTTACTTAAGAACCACAGTTCCTAAAACATTTTATTTGGGA 

TCAACTTCTGGCTTCCTACTTACCMGAGGCAGTAAGTTATAACCTCTGGCCTTTGTCTT 

TACATTTTCAACTTTCTGGATCCCTAACCTCTGGTCCCAAGTTACTACGCTTGCCTATAT 

CCACTAACCACTCACTAGGTCACAAAGTTGATGGCTCTGGCTTGACCCAGTGAATGAGAA 

GTGTCCTAGATG rTTTTTTTTTTTTTTTTI I CGCAAATCTTC 

Sequence 971 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGGCAAAGGGAGAGAA 
AAGAACTGGACAATAGGCATTACATGG i I I M I GGGTTGTGATGGTGAGCACAGTGATGT 
GATTATGAGGTATCAGTCAGTTTTGCTATAAAGGAATACCTGAGGTTGAGTAATTTATAA 
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AAAGAAGAGGTTTACGGCTGGGCACAGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAG 
TCGGAGGTGGGCAGATCACCTGAGGTCAGGAGTTCGAGACCCAGCCTGGCCAACATGGTG 
AAGCCCCACCACTACTAAAAATTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGTCCCTG 
CCCG 

Sequence 972 

CCCGGTGGAAGGAGGGGCCCTAGTCNGCAACAGCTAATCCGTTCAGCTTTACCTTCTTTG 

CAGACTTTGGTTGGTGGAGGCTGTGGCAATGGAACANGCTTGANAAACAGGAATGGTAGT 

GCTATTGGCCTTNCAGTCCCACCTATCACAGCCTTAATCACCCCAGGTC CTGTTCG TCAT 

TGCCAAATTCCTGACTTGCCTGTGGATGGGAGCCTACTCTNTGAATTCCN Mil I CATC 

TACCTGTTGGTTGCTATTTTCATTCAGTACCTGCGGCCGCNCTAGAACTANTGNGATCCC 

CCGGGCTTGCANGTAATTTGATATTAAGCTTATCGGATACCCGTTCNACCTTNGANNGGG 

GGGCCCGGGNACCCAGGCTTTTGTTTNCCTTTAATTGAGGGTTAAATTGCTCCCNTTNGG 

NNGTAAATCAATGGNGCATAGNCNTTGTTTTCCTGNGGNTGAAAATANGTTATTNCCGCC 

TTCNCAAATTTNCCNCCACCAAACAATAACGGAAGCNCGGGTGAANCCATNNAAAANTNG 

NAAAAAGCCCTT 

Sequence 973 1 

CCGGGCAGGTACTCATGTCTGTGGAGCAGAGATGCTGACTGGGAATAGAGACGCTGTCGC 

TCGACCACGGGCATGGCCTGCGAGGGCCCTGATAAATGGCATCGGTAGTTTGATGTGGAA 

ACGTGTGCTGGATTTATGCTCTGTATTCATCCCACTTCTCAGGTTCAACTGC7TTCATCA 

ATTGCCTTCATTTGAACTACACCATAAGTTACAAACTCACGTCAATGTAAGGTGTTGTGA 

TTTGCAATCTGAGCAATGATGTTTGACTGTGCCTTGAACCTCAGGATGAAACAAGACATC 

ATTGATGCAGGCAGTCAACTTTCACTGTAATTGACCAGCATATTATTTAGGCTTTGAATT 

AATGGTGTCATTAGAATTTGATGACTATTTTAGACCACGGATACCGTGCTACCTATGAGA 

TATGTTTGGGCATTCTTCTAAGGCAAGCAGTGGAGATATTTATGGCACAATACAGGGACA 

ACAAGGATGTATCCCCCAATTTTTCCTAATTACACTTTAAA I 11 I I I I 1 ATTTCATGTG A 

AAACGTTTGGGGGGAAAACATCTGTATTACATCAAAAGTANGTTTATCTTGTGCCTCAGT 

TGATTCTTGGAAAATCTGCTCTTGGGGTTTGGCC 

Sequence 974 

GGGNACNGANGAAAAACCTGTCATCCAATATGGGAGCTACTAGTCACATGCGCTATATAA 

ATGCAAATTGAAGCTGAATGAAATTGAAAATCTAGTCTCTCAATCATATTAATCATATTT 

CACCTATTGTATAATGCAAATGTGGAACATTTCTATCATCCCAAGCATTCTACCTAACTT 

TGGACAATAGTCACCCCTTCTGGGGAAGAAAGTGAGATTATCCTTAGGGGAGGGAAAGAA 

GGGGTTATTTTAGGGCAAAMTMCTTTMGMTTTACCCTCTGTATGTCACTTATGT^ 

TTTATTCTTTTACCGACAAAAMTGTTTTTCMGTGCCCATTT^ 

AAAAAATCAAAGCCCTACAGGTTGGCATTTATCATGGGGGATGAAAAAGGGAGGGGAAAT 

ATGGGTCAAA 
Sequence 975 

AGGTACC 1 H I I I I I I GTTGTTGGTGAAGTAGAATGTGTGTATGGGGAGGGCA(3GTGGAT 

AGGATATTTTGGAATTTGAAGAGAATGGGAATGTCTGGATAGTGCATGTATAGAATGGGA 

AGTAAACACTGGAGGACCTGGGAGTGTGGNGCTTATTGATGGATCGTTGGGTTTATCCAG 

GATTGGGGTTTTGCCAGATGGCTGTGATGAAAATCTT 

Sequence 976 

AGGTACTCGGCCTACTTCCCTATTTTCAATGCCTAATGATTTTTGCTGCGCCTGAACACA 
TCTGGAGGGAGCCAGGCCCACTGGCTGCCTCTCGTCCCACTCCAGGCCCTCCTACAAAGG 
TGCACTCAGCAAACCCTCCAAGTACCTGCCCG 
Sequence 977 

AGAGACCGGGTCTCACTATGTTGCCCAGGCTGGTCTCGATCTCCTGACCTTGTGATCCGC 

CCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCTGGCCTCCTT 

TGTTTTTAATTAAGGAAAAGGACATCAATATCTTCCATATTGAAGGGAGTGGAAGGCGGG 
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ACTAATTATATATATATTATCTTTTACTTCCATGAi I I H 1 1 CTCCT TTTGGGAGTTCGC 

AAATAGTTTTTTGATGACGGCAAATACTCCTCATTGCTMTTTTTGGTGACAGAGAATAT 

TAACTCTTTCTTCTTTAGAAAATAGGAGGAGGGGAAGCCCTCTCTTCTACTTTGAATTAN 

AGTACCT 

Sequence 978 

CCGGGCAGGTGCGGCAGTGGGGTGTGCAAGCTGGCAGCGTTCTGCACGGGAGGCCCCGTG 

GAGGGGTCATACTGCTGCTGACTCTGCCTCTGTCCAGCGAGCTGCGCGGCCTGCGGCGTG 

GGGGGCATTCCTCCCGCAAAATAAGAAAAGTGCCAAAAGCACTGATTCCCAAGTGTGGCA 

GCTTCTACCAATGAGTCACAAGGAGACACCGTCTAACACCCCAGTGCTTGTCCAAGACCC 

ACCACACTCACACGAGAAGTTGGCCACAACCGAGCACACACAAAAGGAAAGAGAGGCGCt 

TGACCTTTGCAAGTCAGCATCAAGCAAAACAAGCACCAGATCACCACAGGGCACCCCTAC 

AACAGCAGCAATTAGGAAAAACATAGGAGTAAAAATTGTTGAAAGACACTATGACAATGC 

GACGTCCCGCGTACCTT 

Sequence 979 

TCCACCGCGGTGGCGGCCGCCCGGGCTNGTACCTCNNGNACTNAATTGCAGNGGTAAAAA 
AATTGTTTTATACCTGCTGGTAAGCNATTCGAAAAATTGTCTATTTAAAAATGCANGTGC 
ATTTTAAAGAGTTACTATTTGAGGATCTAAAATATACAGAGAAGGCTGCATATTCTACTN 
' TG N ATCTTATN AC CC ATC AC AAAAAGGAAN AGCN CTN C CTAC G AAATN G ATTTTGGTC AC 
CTTTCCNTGGC I I 1 1 I CNTGGAAAGGAAAAGATACTTTTTGGNAGNTCCTTTTTTAGTCC 
CANGNGGTCNCTTGGGGGC I I I I I I I I I I ANGTTTCAGGTTTTGNAGNAAAAANTANCTT 
TTTCCATTTTTAAAAAAAACCCCT 
Sequence 980 

CGCTTACCNGGTATACCTTGTTCCGCCCTTTCTCCCTTCGGGGAAGCGTGGCCGCTTTCT 

CATAGCTTCACGCTTGTAGGGTATTCTCAGTTCGGTGGTAGGTNCGTTTCGCTTCCAAAG 

CTTGGGGCTTGTGTTGCAACGAAACCCCCCCGTTCAAGCCCCGACCCGGCTGGCGCCCTT 

ATCCCGGTTAAACNTAATTCGGTCCTTGGAGTNCCAAACCCCCGGGTTAGAGGAACAACC 

GAACTTNATTCTGCCCACTTGGGCAAGACAAGCCCACCTTGGGTAAACCAAGGGATTTAN 

GCCAAGAAGCCGAAGGGTANTGGTAAGGGCGGGGTGCTTACAAGGAGNTTCCTTTGAAAN 

GTGGGTGGGGNCCTAAACCTACCGGGGTTTACCACCTTNAGNAAAGAAACAAGGTTATTN 

TGGGNTNATAAATNGNNGCCTCCTGGGCTTGGAAANACCCMGGTTTACCCTTTTTGGGG 

NAAAAAAAGAANNNTNGGGTNNGGCNTNCTTNGGATNCCCGGCCAAAAAAAAAAAAAAN 

Sequence 981 

NCTCCNCCGCGGTGGCGGCCGAGGTACCTTGCTCCACCAATCCCAGCTGTCACATCAGCC 

CTATATGCCAGTCTCTGGGATTTTGCTTTTGTGTTTTCTGTGTAAAAACACTGCl I I I 1G 

TTTGGGCTCTGCTTTCCTGTGTTACTGTTTAGACAGTGTCCCCAGACAGAAAAATGGAGA 

ATGGCAAGATTACCTTATGTGTTTCCCCTTCTTTGMGGATCACAGCCTTTTTATGTCTT 

CTGTCTAATGCATGAATAGGGTTGCTTCATGTAGTTTATCCMTTTACAATTGTTTATGG 

CAAAAGGGTAACTTTGATACCTTTTACTCTGTGATGGCTGTAACTAGAAGTCCTATTAAT 

GCCI I I I I AAATTAAAAATAGCACTGCCAATAAAACCTTAATTTCCATACATCAAAAGTC 

AACCAAAGGAAACTAATAAGTAGTAAAATAAAAATGTGGGGCAAGGGTGTGAGCAGACTC 

GGTAATTTCCCATTATTTTAATTTCTTAGAATAAAAA 

Sequence 982 

AGGTACGCCTGGCTGATTTTTGTGTT1TAAGTAGACACAGGGTTTCACCATGTTGGCCAG 

GCTAGTCTCAAACTCCTGACCTCAGATGATCTACCTACCTCCACCTCCCAAAGTGCTGGG 

ATTACAGGCATGAGCCACCACTCCCAGCCTATCCTGTTTCTTATAAAGTTTCTATCTTAG 

AAAAGTGGCATAGAGTGAAGACTTGATMCTTTGTATCTAGGGAAAAAATGACATTTCCA 

TGTTGTCCAGTATAAAACACATATTATTATTGTTATATTCTGATGGGCCAAAAAGACATA 

TATGTTTAAGTTCAGATTCATAGGCACAAGGTATTTATTATGTTTCAAACTCCCAAAGGA 

TTTCATGCAATTTGAGAAATACTACCATGAATAAAGGACAAGAGGAATTCTGGAGAAAGG 
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GTGGTGCAAATGCATACCCTTATCTCCCTCCAATATGAATTTTGATCTGTCCACCAGCAC 
AGCTATCAATGCTAGCAGGCAGGTGGGGAATGGCAGATATGTGGGAACAATTGGCTTTTT 
GGTT7TGATAA 
Sequence 983 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAACTTAATTCAGATTAAGAAAATTGTT 

TCATACTGAGGTMGCGATTGAAAAATTGTCTATTTAAAAATGCAGTGCATTTTAAAGAG 

TTACTATTTGAGGATCTAAAATATACAGAGAAGGCTGCATATTTTACTTTGATCTTATTA 

CCATCCA AMGA AAGCTTACATAGAAATGATTTGTCACTTTCTGCTTTCTGAAGAAGATA 

TATGAGTCTTTTAGCCAGGTCCTGGCTATTTAGTCAGTTGAGAAATACTTTCATTAAAAA 

CCTATTGGTAGAAATTTGCCAGTTTATTGGGCACTTGCTCAAGTAGTTGCTTTTCTGAGA 

GGCAGTGGTGAGACCAGGATGTTCCAGCCCCAAGGAGGGCCCTGTAGCCCATCTCAGCCT 

GGCCATGTCGTGCTATGAGAGGTTCCTCTATCCATTGGTCAAGGAAGGGATCAGGCCCTG 

CAGTCACACTGCCTTTGCCTCTTGCCGACTTTATTCTTGAGCGAGTCACTTGGCAGAGCC 

TCTGGTTTCTCTTAAGG 

Sequence 984 

ACGCGGGCTCCCCACCCCAACATTTTATAAAGAGCAAGTGACTTTTACAGCCTCTGGGTA 

TTGCCCMTAGCGATCTGGAAACAGGCTGCTCTTCGTTTAAATGACTGAACACACGGTTG 

GGGATCTGTTCTGAAAGGTGGAGGGTGCCCGCTTGGGAATAGGGACATTATAAATACTGG 

ACAATTG AAG AC AATATTTCTG AAGTGTAAAATGTTTGCATTG N TATAAAATAAACCC AA 

ACAGGCCCATGGAAGAAATCTCTTCTCCCAAGTCCTTTACTGAAAAAAATGATCCCACAG 

ACTGAGAAAGCCACTTTGACTTAGGTCAGCTTGGAAACCCACTAAATTCTCCAGGTCTCT 

AACAGGANAAAAAAAACTGGTTATAATTGTTACCAGAM 

AGAAATGGGCAGTGTCTGTCAGAGAAACCAACTGATTCCTGATATTATCACAAAGCCAGA 

TACTTGAAGAGCTCTATTAAAATATGACTGAGGCCCTTTGATGACCAGCCCAGNCTGTGG 

CAGATACCCGCACTGAGGAAACTTGTTACTGGGTGNGCANGTGCNATTGCAGTCCCGAGC 

TNCCCCACAGGTGCTTGGGCTGNGCTGNCNCCAGCTTGGATTTTNTTTGGTTGNAAATTG 

CCCTTTAAATNCTTTAAAATAAAAATCTNCCTTC 

AAAAGA 

Sequence 985 ' 

CGCTCAAGTCAGAGGTGGGCGAAACCCCGACAGGGACTATAAAGGATACCAGGCCGTTTC 

CCCCTGGGAAGCNTCCCCTTCGTGCCGCCTCTCCTGTTCCCGACCCTGCCCGCTTACCCG 

GGGATACCCTGTCCCGCCTTTTCTCCCCTTTCGGGGNAAGGCGTGGCCGCCTTTTCTTCA 

TTAGGCTTCACCGCTTGTNAGGGTATTCCTCAAGTTCCGGGTTGGTANGGGTTCGTTTCG 

GCTTCCCAAAGCCTTGGGGCCTTGNTGGTGGCCACCGAAACCCCCCCCCNG 

Sequence 986 

CCGGGCAGGTACACCGATTTTATTGCTAGAATACTCAAGTCAGGACCTGACCACAGGCTG 

AGCACAGATGTTAGTGGCAGGAATGGCTATCTTGTTAACATAGTAATAACACTGAGAAGA 

GGAAGAAAAAACCATGAGGATmAAAACATCTTTTCATCTTCCTGCATAAAAA^ 

TTAAAAAGAAGAACAATGGAATTTCAACGCCTCACTGAGCTCTGATAGAGCGAATGAAAC 

AGTATCGTGAATGGCTGGACAGGGAATACAGTTTCTTGCATTAAACAGGATCGCCTAGTT 

CAGAGGTCATCCGCACCCACATTCCAGCCCCTTCCTGGTATCCCTCTGGGGAATCAAGAC 

CAGGGAAGGCCAGAAAAGAAGAAGCATAGCTCCTATGATCCCCTGGAATCCCTCCACTGA 

TAAAGGCAACTTCCAATGACTCCTCTTCCTTCCACGCATCCCACTCTTTCCCTTTCCTCC 

ACCCAAACCCACAAAGGGAAAGA 

Sequence 987 

TATCAAGTTACCTATGAACCGATGTATTTCGTTCACTTAATACCATTCACTAAATTCAGT 
TATTGACCTAGTCAATAACTGCCCAACTTCACATTTCACATACAGTGAAGAAACTCTTGN 
TTCAGCATTC I i I I M GGGT 
Sequence 988 
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AGGTACATGGCAACTGAGGCAAAGAGCTCTTAAGTTACTTATCGAAGAATATCCAATAAG 

TAGGTGGTAGAGCTAGGATTTACACCCAGCAGTCAGCTTTATTTTCAACCATGCAATGCC 

TCCTTAACAAAGGAGATAGGTTAGGAAAGGCAGATCTGGGCAAGTGTTGTATTATACAAC 

AATAAAAAGGCTGTGGGGTCAAAAGCTATGGCTGTTAAAAGGTTAGGGTAACAGCCAGAA 

GAATCACCC TTAGG AAGGTTAAAATTTAGGACAACATCTATGATCCTTCCCAAACCTTAA 

ATCTTCATGCTTTTAAAAAGAATTTCCTGCCCAGGCGTGGTGGCTCACGCCTGTAATCCC 

AGCACTTTGGGAGGGGGAGGCGGGCGGATCACCTGAGGTCAGGAATTCCAGACCAACCTG 

GCCAACATGGTGAAAACCTATCTCTACTAAAACTACAAAAAAAAAATTAGCCAGATGTGG 

TGGCACTTGCCTGTATCCCAGCTACTCAGCANGCTTGAGGCACATTTGAACCCCGGAGGG 

ANA NGGTGC TGTGGGCTTAACAGCACCCACTTGACTTCANGCTGGGTAACCAAAGCGAGG 

ATTC I ll l n NAAAAAAAAAAAAAAAA 

Sequence 989 

CCNCCGTGGGCGGCCGGCCGGCTGNTNTCCCACACTCATACNCTGTAGCCCCAGGACGGG 

NCAGTTCAGTTGCCATTCCATCCTTGNACNGACNCTTGTCCAAGGNAAAGTGGAAGGGAA 

GGCAAGTCTCTCCTCCCCCCGCTNTAACCCTCAGGGCCCGCCTTCTTAAGCAAACTTATG 

NTGGGGATNCTCCCCCGCGCGTCTNGCTAAGGGAAATTTCGNATTATTCNAGAGGCCTTN 

ANTTCCGGAATACC CCGTT CCGCACTCTTCCNGAGGGCGGGGGGGGGGCCCNCCGNGATA 

AACACCCCAAAGCCTTTTTNTNGNTCTTCCC^ 

AAAAATTNTNGCCG GCCC GNCCTTTTGNGACNGNTTAAAATACCAATTGGGGTACCAATT 

AAGGNCCTTGGTTATTTTNCCCTTTGGTCGGTTGGGAAAAAANTCNCGGGANGTTAANTT 

CNCCGGGCGTTNCCANCCAAAATTTTNCNCCAACCAAACCAAAACCCAATTTANCCCGGA 

AGNNCCCCGGGNGGNAAGGCCAATTAAAAAAAGNTTGGNTTAAAANAANGCCCCCTTGGG 

GNGGGGGTTGGGCCCCCTNAAAATTGGGMGNTTGGGAAAGCCCTTAAAACCTTTCCAAC 

caattttaaaaattttgnccg™ 

TTT TNCCA ANTTTCCGGGGNGAAAAAAACCCNTTGGTTNCCGGGGGCCCAANNCC7TNGC 
CAAI I I I IAAAATTGGG 

Sequence 990 

CCGGGCAGGTACCAAGATTTATCTAGAAGATCTGCAATTTGACAGATCAGATTTCCAATC 

TAATTATCTACTT TTAGG TGACAGTGCTGCCACCAATTTCTCAGCATGCCAAATGGTAAA 

CTATGACAAACTATTTTTATTTTACAATCCTGTGTTTCGGTCCTGATTAATTTAC^ 

TCTTTGAGCTTTTGGTTTCTTCCTCTGTAAATGAAAATTATACTTACCTGAAGGATTGAA 

AGA^CAGAATATATAAAGTATTTGTCTCGCGATATGCTCATTGGAMTATTAGTTCCTG 

TTTTTAACMTGAGCAAGGTATCTAAATAATATCCACAGCCGTGGGAGTGAGGCCAATGC 

AGAGAGTCACCATCAGGGCAATTCCTAGTGGAGTTGCCCAGGCAGGGCCATCCCAAGACC 

CCAGACCAGTAGGGCCACCAGGGTGCAACTCCAGGCTGGGAGACCCAAAGGCATGCAACT 

CCAACACACAAGAAATGCAGCCAGCAATGTGATGAAGTAGGGTTGCCTGGAGCCTTGGGG 

GTCCAACTTCCACCCCAGTGTGTCCAGAAAATGGAGNAAATGGGGAGTTAAAAGACTCCC 

CCCCAACTTTAAGGGGTTNAATGGTTGGCCCTACTGGAGTTTTTTT 

Sequence 991 

AGGTA C MUlllllllll GTTTTTGTTTTTGTTTTGTCCTTTAGTAAAGTTTTACAGT^ 

TTTTTCCTGTAGGTCTTACACTTTTCTTGTTAGGTGTATTCCTGGATTTTT^ 

GTTGTTGTGAATTACATTTTTTTCTTCCATTTTACATTTTAT^ 

TGGTATGTAGGAAGGCTATTGATTTTTATGTTGATCTTACCATTTACTTAGTTTTCAG^ 

GCTTCTCTTG AAGTTT CTAGAAGGATGGCTGTATTTTCTGCAAATGATAGCATTTTGTCA 

TTTCCT TTTCCTTTTT CTGTATGTGTTTACACCCTTACAGG 

TTTTCGi I ll GGAGGACATTTGTTTTGTTTGTCTGGNCTTACTACATTGGTTAG 

GTTTCCAGAANAGGTTCATTACTAAGTTTTGATTTGGGTA 
Sequence 992 

AGGTACTTTANGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGACCAGCCT 
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GGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCATGGTGGC 

TCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCGCTTGAACCCAGGA 

GGTGGAGGTTGCAGNGAGCCNGAGATCGCGCCACTTGCACTCCAGTAATGGGCGACAGGA 

N CC AAGG CTCC ATCTC AAAAATAAN N N AAAG AAAGTAAAC ATG AG N AAAAG N AAAAG CTT 

ATATTNGAACTTCTCTAANAAAAAAGNAAAAAAAAAGGAAAGCCTGATGCCACACAANAT 

NTAANATTTTGGGCAAAGGTCGATCAATTAAAAGGGAT^ 

TAAAATTTCTTTTACTTCCCCCCCCCAAAAAAAATTpAAATTANAAAMGNTTTCAANAT 
AAGCCAACTTTTTTTCCTTTAAATGTGG i I ! I iAAAAA • 
Sequence 993 

CGCNGTGGCGGCCGAGGTACTTTANGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAG 

TTTGAGACCAGCCTGGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAG 

CCGGGCATGGNGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATC 

GCTTGAACCCANGAGGTGGAGGTTGCAGTGAGCCGAGAATCCGCGCCACCTGCACTCCAG 

TNTTGGGGCCGAACAGNAGCAAGGGCTCCATTCTCTAAAATANAGTAACAGGAAAGTAAA 

CCAAAGTAAAAAGTAAAAGCTTTATAATNTNGAAACTTCTCTNNAAGAAAAGGGAAAGAA 

ATAGTAAAAGGCCNTGGATTGCCACACCAAAATCTTAMTTTGGGCAAGCTCGNATCANA 

TTAAAAGGGATAAI'l I I l ATTTTGCATTCNACAAAAAATAAATTTTCCTTTTTACTTCCC 

CNCCCCAAAAAAAAATCMNATTAAANAAMNGGTTTTCCAAAMTTNAGNCNANACTTT 

TTTATCCCTTAAATN GTGGGTGTTTAAAAAANTGGTAN AATTCC AACCC C AAAAATTAN C 

NATTNTTGTTCCCCCCCCAAACGGNTCNTTTNTCCCNAAGNTTTAATTAAATNTCCNTAT 

TTTGGNGGNTAAAAAGGNTGGANGGGTTNTAANCCTNTGNAAAGTNGTAGGGCNNAATTA 

AAGGTMNMGGTTTGNAGGCCTTTCTAAGNANCCCTTGCNCCTTNGGGAANANGGANGC 

CCNTGGTTTTCTTTTTNTTTNANGAAAGANGGTCCTTNAAGGGGAAAAATG 

Sequence 994 

CCGGGCAGGTACGCGGGAAGGAGATCTGCTTGGCCACTGGTGGCTCGGGGTCTGGAGTTC 

CTGTGTAAGGATTCCAGGAGGATCCCAGGGCTCCCTCCCACCCTGTTCCAGTTGATTACA 

GCAAGGGAAAGAGGACACCGCCCGCATCAGCCCATCCAGCATTCGTTCAACTAAGCATAG 

AGTGCCGGGGTCACAAGGACAAATGAGCCCCCCACCCCTGCTCTCATGGAGTTCCCAGTC 

TATCAGGGACCAGCCATTAAGTAGTTCCTGTGGAGTAAGTCACTTAACAGGTGTTCTCGG 

CTCAGGGCCCAGCCCTCAACCCCAGCCTAAGCCCAAGTACCT 

Sequence 995 

CCGCGGTGGCGGCCGCCCGGCAGGTACACGGAGGGTGCCATCAGAACAAAGAGGGGAGAG 

GCCTGCCMGGMGTTTCCCTGGAAATACTCTTGGACTGAGTGTTAAAGGATATGCTATT 

AATGCTATTATTGGTTGTTCTGGAGAGACTAAACTGAATTTATTATCCTTTGCCTAAAAT 

CTACATCTAATTTCACTTCCACTTCCTTATATGTCAAATCAATTCTCTAC rTTTTTTTTi 

im I AATGAGGCAGGGCCTCGCTCTTTCACCCAGGCTGGAATGCAGTGGCATGATTATA 

GCTCACTGTAGCCTCGAACTCTTAGTTGCAAGCAATCCTCCTGCCTCAGCCTCCCAAGTA 

GCTGGGACTACAGGCATGCCCCCACCACACCCGGCTAA I I 1 I I GTATTTTTAGTAGAGAC 

GGGGTTTCACCATGTTGGCCAGG 

Sequence 996 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCACAGGGGTCTGGAATTACACAGAAGA 

CGGGTGACAGCCAAGGTGGATCATGAACGGTGAGAAGTCCAGCAGGTGACAAGGGGAAGG 

GTCTAAAGGGTGGGGGGCACAGCGCAAGCAAAGTCTTGGCAACAAAAGAGCTAATGCATC 

CCAGAAATGGGGCAGGTGGAGTACCT 

Sequence 997 

CCGGGCAGGTACCTTTAATCCTCCACTGACTTTGGAAAAACMCAAAATGCCAGCTTCCA 
CCAGATCATTTCTTCTTAATTAAAAGTTAGCCAGATTCTTTTGGAGCACTCMTTTTCTG 
GTCAGGTCAACAAAAAAGTAATGAACACCACACACACCTCCCTCCCCTGTCTTCCTAGCC 
CCATTCCTGGAGTTATGCAACCCTJGGGGGCGGCATACACTTCAGCTATTTGCTCTGTGT 
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TCATMTTAGATTGTCTAGACAGGAGATGTTGCTGGAGCAATAGGAAAATATTTTACAAG 

ATG7TCCGCAGAACCACACTTATAACAGTGATTAGGACACACTACCAGACGCGAAGCCAG 

CAGAATTGGCCCGGGCCACGAGCCGGTTTCACCCAGAGCAGAAGCCCTTTCAAGCTCTCA 

TTTCAGGAAGAGGTGCCAAATTAGAGCTCTGAGGGCATGGAGCAGCAACTTCTTGGCCTT 

TGGATCCGAGAAAGCAAGGGTGAGATAATAGAATGATGTCTGCGATGCCCTTCTTTCT 
Sequence 998 

CGAGGTACAAGAAGGATTTGTGAGGAAGGGATGTTGGGGAATAAATGAATGCAGGTGAGA 

TTGCTTAGGTMTTGTAGGCTTTAATCTGTATGTGAGGTGATAAGGGCTTAGATGGTTTT 

TC CTAAAGAAGAAATTATGCC AATTACMGATACAGGTTAGGAAGAATGGACCAGAGGAA 

TTAGGCAGTCAGTGAAGATGTGGACTTGATTGTGGATGTGATTGAAAAAAAAATGGTGTT 

GCTTCTGTAGCACCTGAAGGGCTTAGTTGGGAAATGTTCATGTGGGATTATCAGCATGGT 

TGGCTAAAGCTGCCTACTGAAGTATTAAACAGAGGAAGAACAGGAAGCAGATGAGGGGAT 

GTCTTGGGTTATATTCAGATAGAGAATGGAAGGAAAAAGAGGATGTAGGTTAAGAGAGAG 

GGTGAANGAAAATTNNAAGAGTGCNNTNATGGGAGCAGAAAANGGATGAGTGTTTCCCGG 

AACCTATAAATATTACNTGAAACCAAAAAGTTG 

Sequence 999 

GTTAATTGCCGCCGCTTGGCCGTAATCATGGGTCATAAGCTTGTTTCCTTGTTGTGAAAA 

TTTGTTTATCCCGCTCACCMTTTCCACCACCAACNATACCGAAGCCCGGGGGAGCATAA 

NAGGT GTAAA AAGCCTGGGGGGGTGGCCTTAAATGGAAGTGGAAGCCTAAACTTCACCAA 

TTTAAATTTTGCCGGTTTGGCCGCCTCMCTGCCCCGCCTTTTCCMGTCCGGGG 

ACCCTTGTTCCGTGGCCCAAGCCTGCMTTTAAAATTGGAAAATCGGGCCCCAAACCGGC 

CGCCGGGGGGGAGGAAGGGCCCGGGNTTTTGGCCGTTANTTTGGGGCCGGCCTTCCTTTC 

CGGCCTTTCCCCTCNGGGTTCAACNTGGAACCTCGGCTTTGCCGCNTCCGGGTTCGGTTT 

TCCGGGNCTTGCCGGGCNGAGGCCGGGGTAANTTCAAGCCTTCAACTTCCAAAAGGGGCC 

CNGGTTAAGTTACNGGGGNTANTTGCCACCANTAAATNNANGGGGGGTAATTAACCGNCC 

ACGGGAAAAAAANAAACAATTTGTTGGAAGCCAAAAAA 

Sequence 1000 

CCGGGCAGGTACAGGTTACTTGTCAATTGAATCTGTGGCTTTTCCTCTGGTAAACAGTAG 

CTT TC TC GGCAGCTCTGCTTACTTGACTTCTGGTTATATCCCATGAGCCAGGCACAGATC 

TGGTTTTACGGGACAATGCCAAAACAAACATTGTCAGGGTTCTGTTAGGGTCACAAGAAG 

ATTTACTCTATTTCTTGTTCTGTCTGCCTGC7TGAAACAATAGCAGCCA7TAAAAGTTGT 

TATGTGGAAGTCTTGTATTATATACTAGAAATGGGCCTTGTAAGAATGACTTGTATCATA 

TTGTCCACATCCTCATTGAGAATATTAAGAAGACAATGTTGAGAAGACTGACGCCCAGAG 

AAATGTCATTTGCCCAAGGCAAAATCAMTATTTTGGTGGCAGAGACTGCTTAGGAAACA 

ACCGGTCCTTATCTCAAGTCTAACTCCCTTAACATTTGGGGGCATTTCATAATTTGATCT 

TGAGGGTMTAGATATT AGAT GTATTAACTTCTTTTTCCAGCCTTACATAACAAAGTCTT 

CAGAAACAGAAAC CGTATT TTGCGTGGTTTGGTTGTAAGTAAATGCCTAGTATGGAGGTC 

ATGTAATGGATTTC I I I I I AAAATTTTAAAATTTAAAATTAATTGGATTTTTTGGNGGGA 

TANCATTAGNAAGGTATTATTTATTGGGGNTCCTCGGGCCCGNTTCTTAGAAACCTAGGG 

GGGATCCCCCCGGGCCTGCANGGAATTTCGAATATTCAAGCCTTATTCGNATTCCCGGTC 

NANCNTCTANGGGGGGGGCCCCGG 

Sequence 1001 

AGGTACI Mill nTTTTTTTTTI I GAGATGGAGTCTCACTCTGTCGCCCAGGCT 

GGAGTGCAATGGCGCAATCTTGGCTCACTGCAACCTNTACCTCCCAGTTCAAGTGATTCT 

CCCAC CTCAG CCTCTCAAGTAGCTGGGATTACAGGTGCCTACCACCATGCTTGGCTAATT 

TTTGTATTTTTAGTAGAGACGGGGTTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCCCG 

ACCTCAGGTCATCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGCGAGCCAC 

CTTTCCCAGCCCTCTCCCTGGCATCTTAATGCGCATAATAGTGAACACTTGCAAAGGGGC 

T7TACTGTTATCGNGCTACTATCTTGGAAAGGAAGTGGAAAAAGATGCTGCCTAGTTCCA 
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CGATTGGTGTGTGGATCCTCCTAGAAGTCACTGCAGACCTGTTAAAGTGTAAAGAGATGC 

CTAAAATTGAAAAAAAAAAAAGTTGGGGGAAGGGGGCTGGAATATGGGATGAGTCCTCAT 

TCCCAGGAAAAGGCCTCATCCTTTCTTGTGGACTGANCCCANATGTTCCCAGGGGGGCCC 

TTGCCTCCTTCCCTTTNCTGGGGGGGNT 

Sequence 1002 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1003 

GTGGCGGCCCGAGGTACTNTGC I i ! 1 1 H I H M I i M I M I I GGTCTGCCAGGGACCAT 

CAGGCATGGACTCTGATGGAAGCCTGGTGGTGATAAGGTCATCCACAGATCTCATTATGC 

TGTGAGCTACAGCCGACGTGATTATAAAATTCTGAGGTGGAATCCTGAGAGACAGGAGGG 

CTGANCCAAAACCCTTCATGGGGTTGTGCTGCAAGGANTGCCTGCCGTCCANCAGGGCCT 

TGGNGAGCATAAGCAGGACACCTGAGCTGTGATAAATGTCCTGGNGATGACCAGAATCTG 

CCTNTNCTTGGGTAAGGGAGAGGCTACCCCGAGGAGCTGGAAACAAAGACATGGAAGCTA 

GGTAACCTGACGCTAGGCNCTCTCACATCCTGGAATGAGGGAGTCTTCACTGAGCCTATT 

TCTTTAAGG 

Sequence 1004 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGG 

AGTCCGGAGGAGCAGTCTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTAT 

GCTACTTTCCTGAGCATCCTATGAAATACACCAGTGAGAAATTCCTTGAAATTTGCAAAG 

TTTGTCCTGGGGTTCAATGTTCTTAAAGCTTTGGTTTCCAAGCTNTNCCANNAAATGCCC 

ACCAAGAAATTTCAAACCTTGGGATNTTGGGAAAGAAGGNAAAANAAAACCTTGATTGGT 

GGNTNAACCTTGAACCCCCA 

Sequence 1005 

GGGTGCTGGTCGAGAAGCTCACGCCCGAAGCCCTGCTGGGCGCGATCGCCGAGCACCGCG 
CCACCGTGTGCTTCACGGCGCCGACCTTCTACCGCCAGATGGCGGCGCAGGCTGGCCGCT 
TCGACTTGTCGAGCCTGCGCGCCAGC 
Sequence 1006 

AGGTACGCGGGCCTGCCTTGGGGAAGAAGGAAGTGTGTCTCTGTGAACCTCCCACCTGGG 

CCGAAGGGAGGCCACTCTCTCTGCTGCCTCTCCCCAACCTTGGCCTTCTGTGCTCCTGGT 

GAACCTCTCACCCCCTGCCTACAGGCCTCGAATCTCAAGACCATGATGACCTCTGGTCAC 

CCCTGAATCCAGAGCTTTCCTTTTACAAAGGGGAAACTGAGACCTGGAGCAGGGCTGATG 

TTCAGCCAGCGCACAACGGAATGGCCGAATTGGTGGTAAAATACTGAAATAGTTCCAGTG 

TGGATGGAAAGGGGCTGCTGCCCTAAGCATCTCTACTGCCCGCCTCATCCCTTCCTCCAG 

GACCCTGGGTCAGCACCAGGAGCATCAAAGTGGCCAGGATTGGCCGGAGCCCATGCTAAT 

GGCTCTGCCAGCCCTTCTNCCCACCAGAGAGGGCAGGGGGA 

Sequence 1007 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1008 

CCGGGCAGGTACTGTGCTAACCAGGCAACCATTGGTTAGGGAGCTTCCAAAACTCCGTGT 
CCACTCCAGGCCCCGCTCCAGCCATCTGGCCAAGCACACTCAGCTTCAGGACACAATGAG 
CATAGCCAGTCCCTCTTTGATGTCCTTGGTACCT 
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Sequence 1009 

CCGCGGTGGCGGCCGAGGTACTTTT I I I I I I I I I I 1 M I I I I H I M M I I I M I I | | j | 

m i I M I M I I II IN I M |,| IN | M | I | I I MA N I I I M I I I 1 1 H MM I NCCCCC 

C NNNCNGNC CCCCTTNAAAAAAAAAACCCCGGGNTTNNGGNAANNAAAANCCCCTTGGGA 

AN I I I I I i INNNNAGGGGNNCCCCCCGGGNCNTAAANNAANNNNCCCCGGGNNNCCCCCN 

AANNNNTTTNNTNCCCCTTNAAAAAAAAAAAA 

Sequence 1010 

AGGTACAAGTGCACATCACCACACCCAGCTGAGCAGCATTTTTAAAATTTTCTTTTCCTC 

CAAAGCTTTCTTATATCCTGATTTTCCCTCACTATCCTCATGTAATATTTGCATATATTC 

ATACACACACACACATGCAACTGCATTTTATACACATGTGTCTTGGGGTCCCGGAGAGCT 

CAGCGTGCTGGTGGGGTTGGTGGGGCTGGTGTCTCCAGAGTGGTGATGTGAAGAGCAGCA 

CCTCGCTTAGAGGTAGAGCAGGTGTGTGCCATGAGCTCAGGTGATGGCAGTGGAGCTTAC 

AGGAGATGGCTTGTGGCTTTTCCCTCCGTTGGTCTGTCACTCTCAGCTGCCACTGTCTGC 

CACTGTAGGGATTCCACTGTGGGTTCAACTTGTTCTTATGTCAGGCAATGTTGTTTGTGT 

ATACTCATAATAAGCACTGGTTAAAGAGGGAACATTAATGCTGAAATACTAGTGAGGGGT 

CGGGTGTGGTAACGCTTGCCCTGAAATCCCAGCACTTTGGAGGCCAAGGCGGACGGATCA 

ACTTAAGGCCAAGCATGGTGAAAAGCCATCCCTACTAAAAAT 

Sequence 1011 

CCGGGCAGGTACTCATCATTAAGAAATTAAAGATAATCCGTAAATAAAACAATGACAATA 

TAATAGTTTAGAGAGTTAAGATTTAGGAAGTAGTAGATAAAATAATTATCTGTGTTATGC 

AAATATAATCCTTAG AATG C AATATTTG ATTAATAG G AGTTGTC CTATATGTC ATTC ATA 

AACTTGTAGCAGAGAGAAACCTCTGCATAAGCTCAGGTGTTCAAGTATGCAAGAAACACA 

GTTTTCTGTAT7TTGCTCTTGCTAAACTTMTTTGTATTCTGGGCCCTGGTTT 

TGGCAGAAACTCAATCTCAGTCAGCCCTAACTGACTATCCTCCAGGACTCTGTCTACAAA 

ATGCCAGGTGGGGGGCCATCTGGCCCTCAGTGTCATCAGCCACCCCTGCTGGCACCCACT 

GAGACTTCCTGCTCTGTGGTCACTGGTCCTTGTAGGTTGCCACTGCAGCCTCCTTGTCTC 

ATTCCAAGGACACATTAGGCCTTCTGGCTCTATGAGTATTTTTCACTGGAGAAATTTTGC 

TTCTGGCTGGTACCTCG 

Sequence 1012 

CCGGGCAGGTACGCGGGCAGTTCGCAGAGACCTGCTAATTCATAACTTACAGTCTCTTGC 

CACTAATGTCACTGCCAGCAAAGTCACAGTCCCCTCAGCATCCTCCATCTTGAAAGGGTG 

ACGTGCCGCCCTGCCAGCGGAAAGCTTACCTGGCACTGTTGGTTTGTGTATCATCCACAT 

TCTCCCCAGl l l l I CTAMTTAACAATAGGAAGCATAGCTGGTTTAGGAATTCTTTTAGT 

TAGATTAGGTAATTCACAAAACTGGTAGGAAATCGTTCAACCTAAGTTAAAACCAAAACA 

GCCATATCTTCAGGAGACATTTTCTATAAAATTTTATCACTTTTAT^ 

CTGGCMTAATGAAAGGAAATGTCAGAAATGGAMGTTTGTCCCTTTTGTAACTGGCGAG 

AAACTGGGTTTGTGTTTTGCCTAACTTTTTNTCCTGGAAGATTTAA^ 

TCAAAGAAGANGAAATGCTCTANCATCTGCGAANACACTTTGTCTGTCTTCGATNAAACA 

ATAAGTTTAAGGNGGACCGGGNTGCCCNTTANTGNTGAAAANGGTGGNGGGGGGAATANC 

TTTTTTTANGGATTTCGGAATACCAAAAANNNACAGGATTTACTTAGGGGGCCTNNNACT 

GGGGTTNTTTTT 

Sequence 1013 

GGTAGAGACAGGGGTCTCACTATGTTGCGCAGGCTGACCTCAAGCTCCTAGCCTCAAGCA 

ATCCGATCCCCCAACCTCAGCCTCCCAACATGCTGGGATTCCTGGCGTGAGACACCATGC 

CGAGGCTCAATGTCTCCTNTGAAAGGAAAGAAGTGGACAAAGCCATTCTGCANATTCCCA 

ACACTGTCACCGAGGCTGCCCGCACAGCTCTATTATCTGTCCTGAAGCATTGTTAANANA 

CTGCGCTATCGTTCCATTGACAGTTTGAAGTTCAGTGACAGAATTATNTGGAGAGGGCAN 

ACCCTGNGGGATGAAGGCCACACAGCTTCAGTCACAAAAGAAGGGCTATGTGCACCCATC 

ACCAAGACACCCAGTGTCTGTGAGTCTTTGGTACCTGCCCG 
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Sequence 1014 

AGGTACTMTTTTGCTATTAGTTTATATC^ 

CAACAACAAACAATGAAAATTATAACTAAGAGTCACATTCAGTAGTATCTTCTCTTTACA 

AAATCTGMTATAGACMTCTTCAAGGGCTTCGGAAATACAMGAMGATTTCATTGCAA 

TGGATTTTC TCAACCCATTTGAAGTTCCTCCTTGTCTCTTCTGTCATTATTTGTCTG GTC 

TATCCTTAAGATCACTGAATAAACAAACCTATCCACCGATCTTGGCAC i ' I I I I I I I I I I 

TGAGATGGAGTCTCCCTCAGTTGCCCAGGCTGGAGTGCAGTGACCCAATCTCCGCTCACT 

GCTAGCTCTGCCTCCCAGGTTCACACCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGA 

CTACAGGCGCACGCCGCCACAACCGGCTAA I I I I I I I I I I I I I I I I IGGNATTTTTAGTA 

GAAACAGGTTTCACCCGTGTTAGCCCAGGATGGCTCGATCTCCTGACCTTGGTATCCGCC 

CACCCTNGGGCCTTTAAAAGGGCTNGGGATTACANGCGGGGACACACTGNANCCGGGCCT 

GATCTTTGGCNCCTTTTTNAAAACTTTC 

Sequence 1015 

AGGTACAGATATGAAGTCAGGAAATACATGAGGCGGTGGGTTTGAMGTCTTTTCTTAAT 

TTGTAGTCAGAGCTTCATAACTGGGTGAGGCAAGAGAACATGTGATCAAGAGTCAATGTG 

ATGACAAAGAGAAATAGAAATCACTATTAAATTACGGTGTTCGAGAATGGTGATGTGATT 

CCAGTAGAGCTTGAGGGGGAAGATGTTACTAGAGATTGAAAGACAAAGGAGAGGTGGGGA 

GTTGGGGACAGGTTGMGAGTTAGTCTGTATGATGCTGTGGTTTATGAATGGAGAAAAAA 

TTGTGGCACACTAGAGAATATCAGCTTACAGGC I I I M I I I i ATTCCAACTCATTAATAT 

AGTGTGGAGAAAGATGAGCAAGAAATMTTGATTGTTTTTGAGTAAGAGCTAGTTAAAGA 

AATTTGGCTTGTAGAGAGTGAGAGAAGACTAGATACAGAGAGTGGCACACGATATTGCTT 

AGTATTCAAATTAAGTT 

Sequence 1016 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTNTCTTGATCTGCAAATACGACTTCATCATGCTGAGT 

TGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTC 

ACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCCTTANGATCTACTGGGGG 

AGTCCGGAGGAGCAGNCTNTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGNTCCTTAT 

GCTACTTTCACTGAGCATNCTATGAMTACACCAGTGAGAMTTCGTTGAAATTTGCAAG 

TTGTCTGGG 

Sequence 1017 

CCGGGCAGGTACCCTGCAGAATCTACCAAGGCACTCTGTAGCAAAGCAGGCCTTCCACCT 

GTCTCGGGGAACCCAGCTGGCCACCTTGCACTCAGCCTCTGCCAGTCACCTCAGGGCTGG 

CATTTGGGAGTCCTGACTGCACGCCAMGGTTGTTAAGTGGCTTTCTCTGAACTCATTTG 

AACAGCTGAGTAAACTGTTCTTGCTTTCACAAGGAAAAGAACTCCTGAAGTTATTAGAAC 

TCACCAAACCAGTTTCCTTGTTTTTGMGAAAGCAGGTAATGTGGCTTTGCCAGCAAGAA 

GCAGGCCTTGGGCGTCCATCTCCTTCTCAGGTGTGTTTACTCAGGAGAAGGTGACATTTG 

GATAGGGGCAGGTGTGTTTGCTGAACTGAGCTTCAGCCTAATCATTCCGGAAAACTTCCT 

GAGACTTTCTCTGCCCTGTAATTTCCCCACTGTTGATGGAAGTATATTGCAGTCTCATGC 

AGGACTGCGAAGGGAANGTCACACTCACTAGATCACTTATATACCAAAATTCTGGGGAAC 

TTTTG CTGCATATTATACTTTAGAACTCTTTC C ATTAATG ATAATC ATTG AAAATATTAC 

CATGGGAATAAAA 

Sequence 1018 

CCGGGCAGGTACTTTGGGAACAGGGTCTGCCTTACCATAGAAGTGTAAGGACTGTGATGA 
AAGTGAGACTTCTCTCCTCCCCATC I I INI I AACCCTTTCTCCCCCTCCCCAAAAAACT 
GCTGT GAAAAGGACTCAAAGAAAGGGCATAGCTTGAGTTGACCATTTATTCTTGTTTTGT 
TTTTTCCTTCCCATGATCTAAATTAAGGGGGGGGAAAAAGCATAGGTTTAATGGCCAATA 
CCATAAGCATGGACACGAACATACACACATCCCCCCATACACACACCACACACCCCACTA 
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CTCACCCAGCAATTCAAATTGAGGTGCCAACCGCCCAACTGCTGGAACAGAGAGACCCCG 
^^CACTGCGCCTCCCTTACCACTCAAATCTTCCATTCCCACTAAAATTGTGTCCCTA 

Sequence 1019 

NGGGGGGGGNAGNGGGGTGTANAAAAAAAMTNTTTTTTTTTNTTNTNTTCCTCGCCTTG 

AGCAAGGCCAACGAGCCGCCTCGNANTNNNTNTTNTNTANTCGNNGGGCGGGCNTAAGNG 
ACCATTGTGTGGCCGTTTGCCGCTGGACN ' 
Sequence 1020 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCAAGGCCTTTGGAATAATGAACTCCAGT 
TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 
AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 
GATTCATTGACGGATTGATACAGGGAAATGAAATGA I I I I I I I I I I I I I I I GGTTGGGGG 

AGACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTG 

CAGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGAT 

TACAGGTGCACACCACCACACCCAGCTATTTTrrATATTTTTGGTAGAGACGGGGTTTTG 

CCATTTGTCCAGCCTGGTCTCGAACTCCTGGCCTCAAGTGATCTGCCCACCTCACCTNCC 

AAAATGCTGGGATCACAGGCATGAGCCCCCGTCCCAAGCCAATTCATTCATTTTAGAATG 

AATGCCCTGTCCTGGGGGTGAATGGACTAGAGGAGGGCAATACTGGGAGGCTGAGGCGGG 

TTTACCTGTGGAAGTGACCAANTGAGANTTGAACCTGAGGGGTATGGCCM 
Sequence 1021 

CCGGGCAGGTACCCAGCTGAGACTGGNTNNGTCTAATTCCTCCCNNACACTAACCTGCCT 

CTGCCAGGCCAGCTGTGCTCACTGGACAGAAACCACCCCATGCTTTTCACCCTCCTGCAT 

GATGGGGACATGGATTCCAGGGCCCAGCTGTCCTGGGACATCACCACCTCTGACTGACTT 

TCCCTCATCAGCAGAGAGGCCTGCCCAAGGAGGAGTCTCTCCAGTCAGTGGATCAAACGT 

GCCTAGGGTAGGGAGCGAAGTGCCCGCTGCCTCATTCATCACCTGCTGCCAGGGCCCGGT 

GCTCCTTAGGGTCAAATT1TCATTTTCTCMGCATCCTTGGTTTTCTTTCTTTGGGTGCT 

GGAGGGGGTTTGGGCCCTTTTGTCAGTGTCCAGTGGTTTTGTGGGAGTGGGAAAAGGAAA 

ACCAAGAACTTGGTGAGTNGATNCGAATTTTGGACCCTTTTGGGACCAAGACACCCCCCG 

GCGGTTACCCTTNGGGCCCGGCTTTCTTANAAAACTTAAGTNGGGGATTCCCCCCCCGGG 

GGNCTTGGCAAAGGGAAATTTTCNNATTTNTTNCAAAGGCCTTTATTCCGAANTACCCGG 

NTNCGNAACCCTTCGAAAGGGGGGGG 

Sequence 1022 

CGAGGTACCAGACACAGGCATATACTTGTTTCCTACNGAAACAGAAACTATATTATTACA 

ACTCTTGCAGAGTCTACAGAGGAAATTCANAAGTTCCTGAATCAGGGGACTGACTACCAA 

ATTTTCAACTCACAAATAGAATCCCCCACGAACTGCCCTACCTGGAGTTATCCATTAACA 

ATTAAAGAACTGAAGAAAAGCTTTATGGNCTGGAATCTCAAGGCTGCTGTCCATGAGTCT 

AATCCAGCCAGNAACATATGCTTTGTGTGTCCCAGGCATGATAATTTTTGGTTAAGAAAN 

ATTTGGTTTGTGGMTGCCTTTAAAATGGAGGCATTGCTCTTGCCTAAATTTTTACGGCA 

NTCTTCCACTGGAAAACAG CATTATC TNTAMGTTTTGGTNGMTTTTGGAGATTGCAAA 

CTCCCTGATA TAANA ATAACTTTTTTCTTTTAATTGGAGGGAAAGNAAAAACTAAGAAGG 

TTAGGAGACCTTTTTACCAGGAAATTTTCTTATTAAAAAATTACNAAAANATNGNGCCAT 

GGCAAAAAAAGGTAAAGTTGGGTGGACCCCTAAAAAAANGTTCNAACCANCACCCCGGAA 

AAACNTNATNNANGGGGGGGNNTGNCCNANAGNAAAACGGTTACCCCTTCGGGCCCCGNT 

TCTNAANAAACCTAAGGGGGGNATCCCCCCCGGGGGNCCTNGCNAGGGGAAANTTTCCNA 

TTTTTNAAAAGCCTTTANTCGGATTACCCGGTCCGAACCCTTNGGAGGGGGGGGGGGCCC 

Sequence 1023 

CCG CGGTGGCGG CCGCCCGGG ^ 

NAANTTTTTTTmNTTACNGGGATGCTGTTATGGANATTTCACCACAGGCCCANCCAGG 
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CCTNACGTTCTCATATTTAAATATCATTTGGAANCTGATGAAACTCTGGAAGTGCTCATC 

TCCCTG7TGGTTCTAANATTTAGTCTTCCTGGGAACTTTTA7TTATGTAACTAAGTGCTG 

AAAATTGAGTCTCAAAGCMGTATTTTATTATATTTCTTNGNGGTTGGGGGGGGCCCCGG 

GTCTGGTGGGTNCTTNGTGGGAGCTTNTCCAAGGTGTTNGAAANCCNTCTGGTTAGAAAT 

GTCAAGTTGGAAGGCCCAGGGTNACTTNTCNAAAAACACNGTGGAGGGCTTTNGCTTGTN 

GTNCTTGTCCCC 

Sequence 1024 

CCGGGCAGGTACCCGGGATCCACGGTTGTGCCGGGCTTGCTGGCTCCACATGACACAGCT 

TGATAGCCCGGTGCTCTGCCAACTTTCAGGGTGGGGCCTGTGCTCTACGAGGGCTGAGAG 

CTTGTGGCGGGGTCTCGGTGTGGTGGAGGGAACACCATCAGTGTGCTGGGGTGTGGGGTA 

GCAGGTCACCAGGGAAAGGAAAACAAAGGGGGAAGAAAATAGGGATGTATTGATACATTT 

GGGGG AAAACACAGCCCTGAGAGAGAATCAAGTGTCTATGTTGGCCCTGAGCAAAAGCGA 

TAAATACGTTTGGCTCTTCTGACCCAGAGGTGGGCATATGTGGCTGGGAATTCTGGTGGA 

AGTGGCTCATTGTCCATCTCTGGCCCMCTTGAGCCTGGTTTTTCATCTCTTAGCCCAAC 

CTGGTACCT 

Sequence 1025 

CCGGGCAGGTACGCGGGGGTGGAGGAGGGGTGGCTTTGGTCGCTGGACACACATCTCTGC 

CACGGTGACTGACGGAGGCTGTGAAAGGACAATCTGAGTGAAGAAGTGAATGAGGTGCTG 

ATTCCACAGGGACTTCCTGAGATGGGCGGAGTGGCTCCCAGCGGACGGGAATTAGAGACC 

GCCTGAGAACAGGAAACCTACCTCCTCTATTTCTGGGTTCCCAGTGTCCTGCCCAGAGCC 

CATCATGTGGCAGAAGCTCAGGACGCAAATGTCAGGGGCAGAAGACTAAGTCCTTGCTCT 

GGGCAGGGGTCAGGACGTGTGCAGCTCCTTGGCCCCAGAATTGTTTCCCTCCTGCCTCAC 

TCCCTGGTGTAGAATTTGGCCCAGACGCTCCCTGCCATGTGACTTGGCAGGAGCTCCCTC 

TAGANGAGGGGTAGTGATATAGCTGTAAATCTCACGTTGAGG 

Sequence 1026 

CCGGGCAGGTACCCGGGATCCACGGTTGTGCCGGGCTTGCTGGCTCCACATGACACAGCT 

TGATAGCCCGGTGCTCTGCCAACTTTCAGGGTGGGGCCTGTGCTCTACGAGGGCTGAGAG 

CTTGTGGCGGGGTCTCGGTGTGGTGGAGGGAACACCATCAGTGTGCTGGGGTGTGGGGTA 

GCAGGTCACCAGGGAAAGGAAAACAAAGGGGGAAGAAAATAGGGATGTATrGATACATTT 

GGGGGAAAACACAGCCCTGAGAGAGAATCAAGTGTCTATGTTGGCCCTGAGCAAAAGCGA 

TAAATACGTTTGGCTCTTCTGACCCAGAGGTGGGCATATGTGGCTGGGAATTC 

Sequence 1027 

CGAGGTACCAACAACTACTGGTTTGATGCAACCATTTCCCTTTTACGCAAAAAGGTGCAC 

TAAAAGTAGACCACGCATTGTTGTGTATCATTAGGCAAGCAATTTCTCTTCTCTAGGCCT 

CACTTTTCCCATCTGGAMGTAAAAGGGTCCAGTCACATGATTTCTGAAGTTCCTTACAG 

GGTCAGCGTCCTGGTGAGAAAACTATACTGCAGTGCCAAGAGGAATTGATGTAGGATTTT 

TCTTCTTGGTCCTCGGGCCAGCAATGCCCCACCTGTGCCCGCTCGGCCATGGAGGTGTGT 

CCCAGCTCACTGTCCCGCGTACCTGCCCG 

Sequence 1028 

AGGTACAGTTGTTCCGGGTGAAGATTTTTACAGAATTCAGTAAGGATGGATTATAATATG 

CTGAACATTTGGTTGCTTTTGTAATAATAGAGGTGGACAGCTGAGACAAATCTAGACAAA 

ATCTTACACTGACTANTTTGTAACGACATACGGATGACCCTGAACT7TTTAAGTAACTTA 

AGAAATAGGCCGGGCACGGTGCCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGC 

GGGTAGATCACAAGGTCAGGAGTTCGAGACCAGCCTGGTCAACGTGGTGAAACCCCATCT 

NTACTGAAGTTACAAGGATTGGCCGGGCCGTG 

Sequence 1029 

CGGATACCTGTCCGCCTTTTCTCCCTTCGGGNMGCGTGGCNGCTTTTCTCATAAGCTCA 

CGCTGTAGGTATCTNAAGTTTCGGTNGTTAGGNTCGGTTCGCTCCAAGCCTGGGGGCTGG 

TGGTGCACGGAACCCCCCCGTTTCAGTCCCGNACNCGCTGCCGCCCTTATTCCGGGTTAA 
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ACTATTCGGTCTTGGAAGGTTCCAACCCCGGGTTAAAGGACCAACCGGAACTTTTAATTC 

NGCCCACCTTGGGCAAGCCTAGGCCCAACTTGGGTTANACCAAGNGGAATTTAAGGCCAA 

GNAAGCCCNGAAGGGIMTANTGGNTAANGGGGCCGGGGTGGCCTTNCCAANGAAGTTTCCT 

TTGGAAANGTNNG GTNG GGCCCCTTAAAACTAACCGGGGCNTTANCNACCTTNAGNAAAA 

GNAAACCANGNTNTTTTGGGGGTNATTTCTNGGNCNGC7TCCTTGGCCTTGAANAGCCCN 

AGCTTANCCCTTTNGTGGGAAAAAAMNGMGTTTCGNGGCANGCTTCNTNNGAATCCCC 

CNGGCCAAAAACAAAAACCCAACCCGTTCTTGGNTAANNNCNGNGNNGGGTTTTTTTT 
Sequence 1030 

CCGGGCAGGTACAGAAACATGGAAAAAGAGCGATGCTA7TAAGGAGGAAAATGGAGCCAG 

GAGCAAAACCTAAGGCTATTTCTGAACTAAAGAGGTCTGAATAAGGTAGTGAAAGGACCA 

GGGAAAGCCTTCCCACGGGGATGGAGAGCCACCTCCTCCCCGGCTTTCTGGTTCAGATGT 

CTTGTCTCAACAGACGGCAGATTCGCAGGGAAGCAGGCCGAGCCTGATCTTTTCTCCGCT 

AGAACTGCTCAACAGGTGMGAATCTTTTTCCAGCACTCCTTGAGCCCTT 

Sequence 1031 

GCGGCCGCCCGGGCAGGTACGCGNTGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAA 

CCCTCACAGCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAAC 

ACGCCTCTCAATGGTGCCGACAAGGTCTTCCCTTCTCTGGACGAGGAGGTCCCCC 
Sequence 1032 

GAGTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAGAATGTGCATTTATT 

TGTGGGGAGGAGAGGAGGTCTAAATAGACAATAAACATATAAATATTAATATTGTGAAAG 

TGGTATGAAGAAAAATAAAGCATGATAAGGAGAGAGAGGTGTTCCCTGGCTCTCCAGAGG 

GCTGAGGTGGAAGAAAAGGACTCCTGTTTTAGGTAAGATATCAGAGAAGACCTCTCTGAT 

TGACTGAAATGTATGCAAAGGCCTGCATGAAGCCAGGGACCATGCCGTGCTGACATCTGG 

GGGAGGATTATCCTCACAGAGGGGAAAGGAAGTGCAAATGCCGTGAGGTGGGAGGACCAT 

GTTTGGTATGCTCAGGGAGGAACAGGGAGGCCCGTGGGGCTGGAGCAGGTTCAGCCATGG 

GAAGGCCAAAGATGAGATCAAAGAGAATAGTGGGAGATCACGTAGAGATTTATGGGCCAT 

TTGTGAGGATTGTGGCAGCATTTGAGAAGACTGGGTTGATGCTTAAAACAANCAGCCATT 
TTTAT 

Sequence 1033 

CCGGGCAGGTACGCGGGCTGCTCTTGTGGCACAGAAGAGGTGCCTGCACCCCTCCCTAGA 

GTGTTGTTCCAGCTGCCTGAGAACTGCCCCTACAGCCCTACCAAGGTCACGTCTGCCTCA 

GAGAGCCTGATCACGGGCTCACCAGACCCAGTCCCACCCAGCTTTGCCCCCTCTAGCCAG 

CCTTGGGAGCAGAGCATGGGATGGGACCACTGAGAGCTCCACACCCCCCACCCATCACCA 

AGAACACTCTTGTATTCCTTATCAACAAAGGCCAAGTAAAAATCCCACTGTCATCACTGC 

AGCTGCCTCTCACCTGCCAGCATCACCTACTGGCCAGGAGGTCAAAACTGCATGTCCTGT 

CGCAGCAACTGTTGATATCATTGCACAGCACTCAGACAGCTCTTACCTGCAGGCATTACC 

TGCTGGCCTGTAGGGGGAACTGCCCAACCCAATATNATTCCTGCTGACAAGGTCCTTNGG 

CCCGTTTANAACTTAGTGGATTCCCCCNGGCTTGCANGGAATTCCGANNTTCAANGCTTT 

TTGGNNNNCCGCCCNCCNTTNAANGGGGGGGGCCC 

Sequence 1034 

AGGTAC [ i M I I I I I I I ICI I I M I I i I IAAGAGATGGAGTCTTGCTCTATCACCCAGGC 
TAGAGTGCAGTGGTGCAATCTCGGCTCACTGCAGTCTCCATCTCCTGGGTTCAAGGGATT 

c iyT ACC TCAGC CTCCTGAGTAGCTAGGA ^ ACA ^ 

CTTmGTATTTTTAGTAGAGACGGGGTTCCACCATGCTAGCCAGGCTGGTCTTGAACTC 

CTGACCTCAGGTGATCCGCCTGCCTTGGCCTCCAAAAATGCTGGAATTACAGGCATGTAA 

TTCCAACCTTAGACACTCAGTTTAAACCCTCTTATCCTATCTTCACTGGAAGAGCAAAGT 

AATCTCCTACCTTGTTTTGTCAATMGAAGGGMTATTATTAAGCCCCAGTAATCCTGTT 

GATTACCATAGTGGGCATGTTTCTCTAGAAGTAAGTAGAATCTTTGGGGTTCC 
Sequence 1035 
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AG GTATTTTGGAGG AC AAAAACACTCAGTCTATAGCAAATGGGCAGTAGTTTTTGGG AG G 

AATGGAGAAGGTCAGGGCATTCTAGGTGGAAGGAACCTTCTAGGTGTGAAGGCACAGAGG 

GTGAGTGGTGTAACATCACTTCCTTAGAGCTTTGCCCACAGTGGGTCCAGGTTGCCATTT 

ATCACAGTGCTGATGGTGAAGGTGATTTCTTTAAAAATTGGAGCCTTGGAGGGTTTTTCT 

CAGTTCCTTGGGGCCTCATCCATGATGAGCTCTGTCTGTTCATCCATGATGATCTTTACT 

GTGGGTTCTTTCATTAACTGCGGGACGGGAGGMGGATTTTCCTTCGCATGGTTGTCATG 

CAGAGTGGTCCTGGATGACAGAGCATGTGTGCCCACCTCTGCGTGGCTTGATCCACCATG 

GAGGGCTGGTGCTATCTTTTGGCCTCTGAGCTTCGGTCGGCCACTCCCTGCATGAGGCTG 

GGGCTCTTTAACCCTCGTGTCTTTTCTNCATGCTTGACAAT 

Sequence 1036 

AGGTACCCATTTTCAGAGTCTTACTTTTCCTAGGGGCAGCTCGMCCTTTCACTGGACCC 

CCGCTTCCCAAGAGAGCATTCATGTGGCCCTTGTGTTCCTGAACTCTGTCAAAGATGACC 

ACTCCCTGAGGTTGCACACGAAGGCCAGAAAGGCTACTGACCTCAAGCTCTGCTGTGTCA 

TGTTTCTGTCACACACAGCTATTCCTTATCACCCCACGGTCACATCCGCAAAAGGACGCC 

CCAGTCCCTCCTCACCTCAAGCATCATCCAGCGAGGCTGTGTATGGTGCTTCTGATGGAA 

GCTGAGCCTGAGTCCCTGATCCCTCACTTCCTTGGCATGAGACTGAAGCCTGAGATACCT 

TTACTTTCTCTGTGCCAGGCCTGTGCTAGGCATGGAGGATATGGAGGAGAACATTAGATG 

TGAGAAAGTTTTTAAAAATAAAGTCCTTCTGCTTTGGGTATAAATCTCANGAGTAAGCTA 

GTTTTGCAAGGCCTCCCCCTTACCTTAGCATGTATATTAACATATTCATGATATAAAAA 

Sequence 1037 

CCGGGCAGGTACGCGGGCATCTGATCGAGCTCCTGCCCTGCTCCCTACCCCCTCCCACAG 

CCTGGTTTAAGGGTTTCAGGCCMGGCGGATCAAGAGGCATTTGCCAGCAGCTGTTGAAT 

GTCAAGAGGCCTCGCCGCTCTGTGGTTCTTGTCACCTAAGAGAGAGGTGTCTGCCTGAGC 

AGCTCACAACAGGCAGGGGTCGAGGAGAGGGGAGGAAAGNAAGGCAGGAGAGAGTCAGGG 

CCCTCCCATTTAGGNAAMCCTGGCTTGTTTTGCTTGAAGTGACCAGACCCAGAGGCCAG 

GC N AAGTTTAG AAAAAAN GG AAAAG N ATTAAN AAAATAAN AAAGTACCTTCGGGGCCGCT 

TNTAGAAACTTAGGTGGGATTCCCCCCGGGNCTGCAGGGAAATTCNATATTCAAAGCTTT 

ATCCGGNTTACCCCGTCCGGACCTTNNGAGGGGGGGGGCCCCNGGTGACCCCAGCTTTTT 

GTTTCCCTTTTAGTTGGAGGGGGTT 

Sequence 1038 

AGGTACATCTTCCTCCATGAGTGGATCAAGAAATATTGAGATGGCTAAGCTGGAAATTAC 

AAAAATGTTGGCATACTATTATAGCATATGGTTATGAAAATCAATGAATATTTATATTTA 

GTATTATGCTTTMGTATATATCCAAGCTGATCAATTAATTACACTTTCACTGATGAGAT 

GTCAATTCTACATTTAGCTGAACTCTCATCTGAACTGTGTGGCTTCTCAATGACAGGACA 

TATTAAAGAACATGATGMTGTTTGTAATGTMTGATATAAATTATTATAATGTGTTGCA 

TTAAAGACACATGGTGTAGCATTCTACGTTCAGCTTTTGCATTTATTTTCTCAGTGTCAT 

GATTTGCTCCTGTGATTCCAGATCAGTTTTCTCCTTATCAGTCAGCATATACATATTGGT 

ATTAACACi I I I IGCAA 

Sequence 1039 

GATAACCGCNANGGAAAGAACATGTTGAGCAAAAGGCCAGCAAAAAGGCCAGGTAACCCG 

TTAAAAAGGCCCGCGTTTGCNTGGCCGTTTTTTCCATAGGGCTCCCGCCCCCCCTTGACC 

GAGCCATCACCAAAAAATTCGNACCGCTCAAAGTTCAGGAGGTNGGCGAAAACCCCGACA 

GGGGACCTTTTAAAAGATACCCAAGGCG I MM I CCCCCCTTGGAAAGCCTTCCCCTTCG 

TTGCCGCCTCTNCCTGGTTTCCCGACCCCCTTGCCCCGCTTTACCCGGGGATTACCCCTT 

GGTGCCCGCNCCTTTTTCTTTCCCCCTTTCCGG 

Sequence 1040 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGATGCTAAGAA 

TCTAGCCATGTAAAAACTCCCAAAACATCCAAATTCAAAAAAACTGGAATCTAACGTGGT 

AATGCAGTTGTCMGAACACGCAATAGTCTTTGGGCTTTTACAAAGCACTGAAAACTAAA 
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TGTTGCATTTGTGCCCTTATCTTCCACMCTTCTTTCTTGGCAGCTTTTCTTGGCTCACT 

TCAGACACACAG AGTTTC AAGTAATCAGGGAAAGTTGGGACAGAAACTTCCTCTGTCTTT 

CCATGGCCTTTTCTTTTTCCTTTACGGATCTGTGATCCTATGCTTTCTCTCACGTTCACA 

TtATATGTGTCAGTTGGAGAAATCTTGATGCCCAGTTTCATTTGNTTCTTCTGTCCCACC 

TGTAACTAATTTAGGTTGAGTTCTCCGGCTCATGTATCAGAGTGTGTATGACAAATTATC 

TCAAAGCCCAGAGGGCCTCGGTGGTCCAAGTCCCAGCAACAA 

Sequence 1041 

AGGTACTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGACCAGCCT 

GGCTAACATGGTGAAACCCTGTCrCTACTAAAAATACAAAAATTAGCCGGGCATGGTGGC 

TCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCNCTTGAACCCAGGA 

GGTGGAGGTTGCAGTGAGCCGAGATCCGCGCCACTGCACTCCAGCATGGGCCGACAGTAT 

CAAAGACTCCATCTTAAAATTAATANAAANGAAAGGAAAACAAAANGAAAAGTAAAAAGC 

TTATTATTTGAACTTTCNTCTTAAANAAAAAAAGGAAAAAAAGAAATGGAAAAGGCCCTG 

NATTGCCN NCACCNNAAAT TCTTAAAATTTTGNGCCAAGNTNCGAATTCAAATTTAAA^ 

GGGGATTATTTTTAl Mil I GCCATTTCANCAAA/VWVATANAATTTCCTTTTTACCTTC 

CCCCCCCCCAAAAAAAATTCNNAATTANAAAA^ 

TTTTTCCCTTAAAATTGGTTGTTTTTTAAAAAANTN 

CCATTTGGTTGTTCCCCCCAAAACC 

Sequence 1042 

TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAATAAGATAGTTGTTTAAGACTATAGCT 

CTCTAAA GAGTAGT CTTTTAAATTTAGAGMTTTCTTTATCTCATAAGTGACTCAAGCCA 

AAAAGGC i him ICATGGAA AGTCCCTGAGGTAACTTTCTGGGTTTAGAATACGACAGG 

CAAAATTGGCATC I I II I I I I AAATGGGTGCGAAAGATGCAGCCTTCATGATCCTCCCCA 

AAATTTACTCCCAGAAATAAGCTAAGATAGCATATCTCTGTTGCCACAGGCGGTTAAGGA 

TGGTGCAGCTGGTCTCCCTATGAGCACATGGACACGTGGCCACATGGGGGTGCCTNTAGT 

CACAGACCTGCCAACCTGTGACCGCCTGTGACACAGGGCANGCACTTTNAGGGATTAAGA 

CTTTCCCAGCACTAACCAGGCAACAAAAGATGGAAGCAACCAGAAGCCCTTTC 
Sequence 1043 

CGNCCGGGCAGGTACATGGCAACTGAGGCAAAGAGCTCTTAAGTTACTTATCGAAGAATA 

TCCAATAAGTAGGTGGTAGAGCTAGGATTTACACCCAGCANTCAGCTTTATTTTCAACCA 

TGCAATTGCCTCCTTAACAAAGGAGATAGGTTAGGAAAGGCAGATCTGGGCAAGTGTTGT 

ATTATACMCAATAAAAAGGCTGTGGGGTCAAAAAGCTATGGCTGTTAAAAGGTTAGGGT 

TAACAGCNCANGAAGTAATCACCCTTTAGGGAAGGTTTAAAATTTTTAGGNACAAACATC 

TATGGATTCCTTTCCCAAACCCTTAAAATCNTTTCATGCCTTTTTAAAAAM 

CTTGCCCCAAGGCCGTTGGGTGGCNTNACGCCCTGTTATATCCCCAGGCACCTTTTGGGG 
CAGGGGGGGG ' 

Sequence 1044 

GGGGGATAAGACCAGCGAGGGTCTCTAGTTTGCTAATATAGACTGTAGAGGAAGAAGGGA 

GAAGAAGGTGACTGAGCTGAGAAAGGCTTGGGCAGAGGGTGCAGGGAGACAGCTGAGCGG 

ACCTCAGTGGTGGGTCTCATGACTGGGATGCAATGAGACTGATTTTCCTCCTTCTCTAAG 

GGAGACTTGACTTGTTTCTAAGGGGAAACAGAAATTCCAGAATTTGAGCTGAAGTAGCAA 

ATTCTGATGGCCCGCAATGTCAAAATTTCCCCCAAAGAAGCCTGCTTGTATTCTATGACT 

CCTTCGAAGGTGTCCAAGTCCCTTCTAGAAATTGTGAAATAGGNTTGGCGTGGTGGTGTG 

AGCCACTTGTNCAGCCAGGATCTTCCTTTTTAMTCTNTCANGGCACCTTTTNTGAAAGC 

TNCTCTCAGTGTGGGGTTAGGNGAATTNAGGTGGGGCCACTN7TTTCNCANTTTCAAAGA 

GAGGAGCCAAAGGGGCTCCCTTCTTTTATTTCTTT 

Sequence 1045 

CCGGGCAGGTACTATCCTAGACAGCTCTGGGAGTGTTACATAACGTATGGTAAGTGTGCT 
ACATGGCCTTTCTGAAGCCCAAACCATTTTGCATTATGAAAATACCTGGCCCCAGGAATT 
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TCAGATAAGGGATTGTGGACTTATATTTTAAATCTGTAACTCCTATAGACAGTAATTTCA 

TAGAGCCAGCTTCATGGAGTTGTAMTATTACTGGGCTAGGGTCAGTTTCCTCATTTTAG 

GCTTCCTTTAATCCAGTCCAGACACCAAATTAAATGTCTTTGTGTAGTCCCCTTCAAAGC 

CTCTTGCAATCCCAGTCATGCTTCAGGGTAGGGATGATCCCATTAATCCAAGGACCTGTC 

CCTAGTGATAGAAGGAAATGGGTGGGAGCCACGGGATCTGGGATTGCAGTGAGAAGTNGA 

CAAGAAAACAAC 

Sequence 1046 

TGCTGGCCGTTTTTCCATAAGGCTCCGCCCCCCTTGACNAGGCATNACAAAAATCGAACG 

CTCAANGTCAAGAAGGTGGCCGAAACCCGNACAGGGACTATTAAAAGGATACCAGNCCGT 

TTTTCCCCCTGGGAAGCCTCCCTTCGTGCCGCCTNNTTCTNGTTCCGACCCTNCCCGCTT 

Sequence 1047 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTCACCAGG 

GAGTCCGGCGCATTCATAACCAGCAATGTrTGTAAGCATCTAGATTACTGGGGAGAGAAA 

TATTATCCCAGATGGTGAATGGAATTAATGGCCACCCCCAAGTGTGTCTCCCCGTCCAAG 

CAGGTATTGCCACCTGTAGGATTCGCATACCTAGAGGGCCCCCCAGGGAACAGGAAGAGT 

TCTCCTATAATTGTATCGCTGTAATGTGTMTTGMGTAACAMGGTGCTGTGTTTTGTT 

TGGGCAAGGGCAAAMTAGACTGTTTGGTTAAMGMGTCCAGGTTGGATTTCMGTGTT 

ATCCCGGGCCAGGCGCGGTGGCTCACACCCGCAATCCCAGCACTTTGGGAGGCCGAGGAG 

GGGCAGATCACCTGAACTCAGGAGTTTGACATCAGCCTGGGCAATACGGTGAAACCCCGT 

CTCTACTAAAAATACAAAAATTAGCCAGGCG 

Sequence 1048 

CCGGGCAGGTACTTGGAGGGTTTGCTGAGTGCACCTTTGTAGGAGGGCCTGGAGTGGGAC 
GAGAGGCAGCCAGTGGGCCTGGCTCCCTCCAGATGTGTTCAGGCGCAGCA.AAAATCATTA 
GGCATTGAAAATAGGGAAGTAGGCCGAGTACCT 
Sequence 1049 

TTCCAAACATCANCTTTGATGTCTATTGTCGGGACTTCAAGAGGACTCTTAGCACATNGG 

CAGGTGTTGGTTTTAGTGAGCAGGCGAGAGTCCCACATTCTTCCAAGGAGAGAGATGGGT 

CCCACCACATTCCTACCCCTAGAGCANAAGTTCCCGCGCTGNGCCTCTTCTCTGGCTGCT 

TGCTGAGGACGTCTCCCAGTGGGGTACCTGCCCGGGC 

Sequence 1050 

TTCAACTTTANTTAAGNGGGNCCGNAGAGTTTGGGGTAAGTCCTTCNCAACNCCGNCGGG 
GGGTGGGGGCCGGGGNTCCGCCCCCGGGGGACCAAGGGNTTACCGGCCGGGGGGNATTCA 
ACCAAGNNCCTTNACCGTNGNTTGGCCCCANGGGGCCAAGTTCCAATTCCAATTT CCCCC 
CCCCAANGAAGGGTTGGAACAAGGGGCCTTACCAAAAGGGGCCCGGGGGGGGGl ITTTTC 
CCTTTCACNAAAGCCCCTTGGTTCAAAGCCCCTTT^ 

TTTGGGGGGGGGTTGGAACCAAACCAAGGMTTANTTTGAAACTTTCCTTGGGGNANTTN 

GGGAAAAATTGCGNCCNTTTTCCCCCCCAAGGCTTTGGAAAACC I MINI ICCCCAACC 

CCTTGGGGAANGGTCCCTTTTTGGGGGGAAAAATTAACCCCCGGNGACCCGGGGGGAANA 

CCTTTCCI MUM I AAATTNAAGGGGTTCCCCAAAATTTGNAATTGGTTGGGAAACCCC 

CCAANNNTTCCCCTNGGCCAAANG I Mill II GNGGGGGGANAAAAATTCCAAANNCCCC 

CCCCCCMTTTCCCCGCNGGTTTAAAAGGGNNGTGNGCCAAGGTTCCCCNAACAANNAAA 

AAAAAACCTTGGGTTNCCCCTTTGGGGAAAAAN I I Mill I GGGCTTTNTGNN TTGGC CC 

CNAANNGAAAAAACNCGTTTNGNCNTTNAAAAAAAANNGGGGGAAAACCCCN I I l l iGNA 

NAATNGGGCGTTTTGGGAAAAAAANTNTTTrCC 

AAAAAAA 

Sequence 1051 

GGTGGGGNCNGGGGCCTCCGGGTCCGGGGGTAAGGGGCTTTGTACCNNAAGNTAAGTCCC 
CGGGGGNTAAGNGGCCGGTTTTANGNCCGTAAANGNGTAAAAGTAAGACAAAAANATACC 
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CCGTANAGTAACCGNAAAAGNCNCACCTTTACCAAGGCCGGTTCCGCCAANACCGNCCGG 

GGCGCCGGGGCCGTTTCTTGCCTTTTTTATTAAGGGAANNAAGGGGGGCCGCCMGGGCC 

CNCCCCCGGCCGGTNACNCNTTCCGGGCCCCGGCTTTCTTAGGNAAACCTTAAGNTNGGG 

GAATCNCCCCCCCGGGGGCCTTGGCCAANGGGAAAATTTTTCCGGAATTATTCAAAAGCC 

TTTTATrTCCGGAATTAACCCCGGTTCCGGACCCCTTCGGNAAGGGGGGGGGGGGGGGCC 

CCCCGGGGGTTAACCC CCCAA AGACC ) 1 1 I 1 1 1 1 1 NGGTTTTTCCCCCCTTTTTTTAAGN 

NTTGGGAAGGGGGGGGTTTTTAAAATNNTTGGGCCGNCCCGNCCTTTTNGGGGGCCCGGN 

TCAAAAANTTNCAAATTGGGGGGTTCCAAATTAAAGCCCTTGGNTTTTTTCCCCTTNGGT 

TGGGTTGGNAAAAAAAAATTTTGGGTNTTAATTCCCCCNGCNTTTCCAAACCAAAAATTT 

TTCCCCCAACCCAACCCAAAAACCAA7TTAACCNNNANGGCCCCCGGGGGGGNAMGGCC 

CAATTAAAAAAANGGNTTGGTTTTAAAAAANGCCCCTTGGGGGGGGGGTTGGCCCCCTTA 

AAAANTT GGGAAG GTTGGNAANACCCTTAAAAACCTTNCCAACAATTTTTAAAATTTTTG 

GCCCGGGTTTTTGGGGCNGGCTTTCAAAACNTTGGNCNCCCGGGCNTTTTTTTCCCCAAG 
ATTTCCCGGGGGGA 

Sequence 1052 

CGGTCGTTCGGCTGCGGCCGAGCGNNTATCAAGCTCACTCAAAGGCGGNTAAATTACGNG 

GNTATTCCACCAAGAATTCAGGGGGGATAAACGCCAGGGAAAGAAACATTGTGGAAGCAA 

NAAAGGCCAGCTAAAAAGGCNANGGGAAACCCGTTAAAAAAAGGGCCCCGCGTTTGCCTG 

GGCGG i i i M ICCATAAGGCTCCCGCCCCCCTGGACCGAGCCATCANCAAAAAAATTCGG 

ACGCCTTCAAAGGTTCANGAANGGTNGGNCGGAAAACCCCCCGACCAGGGAACTTANTTA 

AAAAGNATAACCCCNAGGGCGGTTTTCCCCCCCC 

Sequence 1053 

GGGAAAGNCCNTTCCCCCAACCCCCGTCCGGGGGTTGGGGGGCCCGGGGGCCCCGGGNCT 

TTCCTTTAAAGGAAAAACCTTT AAGGN TNGGGGGNAATTCCCCCCCCCNCCGGGGGGNCC 

TTTGGCCAAAGGGGGGAAGAATTTTTCCCGGGAATTTAATTCCCAAAAAGGCCCTTTTAA 

ATTTCCCGGAAATTTGACCCCCCGGGTCCCGGAAACCCCTTGCCNGAAAGGGGGGGGGGG 

GGGG GGGGCCC NCCCCCGGGGGTTANCCCCCCCAAAAGCC I I I I 1 1 1 1 I rG GGTTTTCCC 

CCCCNIii i i i I AAAGGTTNGGAAAGGGGGGGGGTTNTNAAAAATTTTNGCCCCNCCCCG 

CCCTTTTTGGGGGCCCNGGNTTAAAATTTNCAAATTNGGGGGGTCCCCAATTAAAGGCCC 

NTGGGGTTNTTNTCCCCCCTNGGGGNGGGTNNGAAAAAAAAATTTTTGGGTTTTAATTCC 

CCCCGGGCTTTNCNAACCNAAAATTTTTCCCCCAACCCAACCNAAAACCNNAATTNAACC 

CGGGAAGNCCCCCCGGGGGGGGGAAAGGCCCCAATTNNAAAAAAAGNTTNGGTTTAAAAA 

AAAAAGCCCCCCTTNGGGGGGGGGGNGTGGGCCCCTTTAAAAATTTGGGAAGGGGTTNGG 

GAAAGGCCCNTTAAANCCTTTCNAACNNAATTTTAAAAAATTTGGGNNCGGGTTTGNGGG 

NGGGNCT TTTAAA AATTTGGGGCCCCCNGI I I I I I I I I t CCCAAGTTNCGGNGGNNAAAA 

AAAAAACCTTTTTNTTTATGGGCCCCCCNCACANNGGNNTGGTTATTTTAAAAAAAAAAA 

AAAAGGGGGNNNNNNANAANCCCCCCCCCCCNAAANAAAAAAAA 

Sequence 1054 

AGGTACAA AAATGTT CTGCAATAATGTCTCAATGTAGCAATAAAAGAACTTGAAAAAAAA 

CTGCCATTATTTTTTCTTTTCTGCCAAAATAGCAATTGTCAGGCCTGCCATTTCATTTGA 

CCATTCTCTAAATAGCTCCTCCCCTGGACTGGTCTCAGATATTGGCTGAAGGTCATAATT 

GGGAGGTCACATAGCATTTGAGGGAAGTTNGATGGGCAAAATCAGTTTCACATTGGAAGN 

ATGTTTAAAGGAATGGCACAGGGACGAAAAGNACAAGTAATAGCTACAGTCCATTGATGG 

TTCATTTCACTGGATTAGNCAACTTCATTCATTTTAACTCCANTTTGNCCGNAGCCTTCA 

AGTTAAACCCTACCGTTTCAAAAGTCCATTTTAATGNCCATGTTTACAATGAACTTTCCT 

TAATTTAACTTNCAGTAAGATNATTTTTTTCCAAGTTTTTTCTTATGGTAGGAAGTTAGG 

TAAmm i i i AANTTTTTACCTTTTCCCTTAAATNGGCCTTTTNCCTTTAANTTTTTTTT 
1 GAAAATGGGGCCTTTTTGCCTTTNGGAATTTAATTGGTGGGGGTAANGCCTTT 
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TTTTTGGAAACCTTTC N AAAAAAN 
Sequence 1055 

GNGTCCGAAAATTGGGGNAGNCTTCCACACCGTCGNGNTGGGCCGGCNCCGNAGGGGTTA 

CGNCCGGGGGNAGNAAGTACCATTTGGGGTAAGGGGGTAAAAGGGGGGAAAAGGGGGNCT 

TTTTTTCNTTCCCMCCACAAAAAGGGGGNCMGNNAAGNAGGTTCCCGTTCCTNAGGAA 

I I I I i CNTTTGGNCTNGTCCCTGGCCCAAGNCTTCTTGGCCCCAATGGCCCTTGGGAAGG 

GGTGGGGTTNCCCTAACCCCCCTTCAACNGGGGGCCACCCCCCTTAAAGCCNTTGGCNTT 

GGACCAAGCCCCCTTTTTTNGTTGGGGCCCCGCCCCGNTNCCCCCAATTNCCCCCTTNGG 

CCCCCTTCNAAGGCCAACNACCAACCMNTTTTTGGCAAACCAACCAACCGCCAAAGGCT 

I I I [ ! I IGGf 7 I I I CTTTNNAAANCCCN I III! I AAACCCTTTGGGTTCNNAATTTTNCC 

CAAAAGCCANTTCCCCCNTTNGCCCCTTNNTTTTNGGCCCCAACNAAAAAAACCTTTGGG 

CCCCCCCNAAAGCCAAAAANGGAAAAATTTTTTTGGAANGGGG7TTTNTTTGGMCCA 

AACCAAAGGTTMNCCCTTTGGCCCCCGGGGGGGCCCGGGGGCCCGGTNTTTTNTTAANN 

AAAACCTTAAAGGNTNGGGGGAATTCCCCCCCCCCGGGGGGGCCNTTGGGCANAANGGGA 

AAATTTTTCGGAAATTANTTCAANAGNGCCTTTTMTT^ 

TCTGNAAAACCNCTTTTNAAANGGGGGGGGGGGGGGGG 

Sequence 1056 

TCTTTNCCCGCTTTGCTCTTCGGCTACAACNTGGNACCTTCGGCNTGGACGGGCCTTCGG 

GGGTNCNGCTCTNCCGNGTCNTAGACNGGTCNGGAAGGCCGNGNTAATTCAAGGCTTTCA 

ACGTTCCAAAAAGGGGCCGGGTNNATATTACCGGGGTGTAATTCCCCACCAGTNAMTTC 

ANGNGGGGGTATTANACCGTCNAAGGGGANAAAGGAAACCCATTGGGTGGAAGNCCAAAA 

AAAAGGGGCCCAGGCNANAAAAAGGGGCNCAAGNGNAAACCCCGGTNAAAAAAAAAGGGG 

GCCCCGCCGGTTTGGCCTTTGGGCCCGNTTTTmNCCCAATTTAAGGGGGCNTTCNCC^ 

GCCCCCNCCCCCNTTGGGACCGGGAAGGTCNAATTTCAACCAAAAAAAANAATTCGGGGA 

ACCCGGNCCTTCCAANAAGGGTTTCAAAGGGAAGGGGTTNGGGGGCCNGNAAAAAAANCT 

CCCCGGNNANCCCANGGGGMNCCTTTAATTTAAAAAAGGGGAAATTACCCCCCAAGGGG 

GCCCGG I I MM r CCCCCCCCCCTTGGGGGAAAAAGGCCTTTCCCCCCTTTCCGGNTTGG 

GCCCGGCCCTTNCTTTCCCCTTGGGGTTTTNCCCCGGGAACCCCCCCTTTGGCCCCCGNC 

NTTTI I A ANCCCCCGGGGGNATTTACCCCNTTGGGTANCCCCGGCCCCC I I I I I I I ICCN 

TTCCCCCCTTTTTCNGGGGGGGAAAMGGCCCGGTTGGGGGCCCGGNCCNTTTTTTCT^ 

CCCMNTTNAANGCCCTTTCNAACCGGCCCTTGGNTAAAGGGGGNTNMTTTCNTTTCAA 

AG ! H II i CGNGGTTGGTTAAAANGGGTTCCGGNTTTTNCGGCCNTTTCCCCNAAAAGGC 

CTTGGGGGGGCCCTTNGNTNGGTTTGGCCANCCCAAAAAACCCCCCCCCC 

Sequence 1057 

AAAAAAATCAAGCTATTAGCAAGGACGGGGAAAACTGTAGTGAGTTTGTGTATCAGCCAC 

AAGCCGTGTTGCACTGAAGTAACAACTTTTTAAAATCTCAGAGGCTGAAAATGCCAACAG 

CTTATTTCTTATTCACAATACATGCCTATTGAGGGCTGGTGGGTCTCTGCTTGTCATTGT 

CACGTGGGGATGCAGGCTGACGGAGCAGCTACCATTGTGAATGCTGCTGGCCGGCATCCA 

GGAGGCCTAGCAGGTCTCTTACAGGTAACTGAATGCTCCAGCCCAGAAGTGACAGGTCAC 

TTCCACTCGCAGCTCATTGGTCAGGACCAGTCACAGGAATCTGCCTAACCTCATGGGCAT 

TTTTGAAATCTCTTCAACTTTTTGAMCTGCTTCAAAGATGCCTCGTATTGTGAAGGCAC 

TGACACCACGTCCTGGGTGCTTTGTGCC7TAGGGGAATGGCTGATGTCTCTANGCCCTTT 

CTGNGNCTNCTTCATCAAANCCCCCGNGAACCATCAAAAGGACTAACTGAAGTCCCTTTC 

CCGNGGN I I I I M I I I IGCCAGGGCACGGGGACACCCACAATTTTCCCCANCCCTTGNTT 

GGGGGGCCTTT 

Sequence 1058 

GGTGTGGTGGCTTATGGAGTCCCTAACGTTTGCACAAAATGGTTGCTACATTCCACACTC 

CCATGTGCCCTGAGCAACCTCCAGGGAAACTTCAGATACCGTGGGGCTTGGCCTGCCAGC 

TGAGCTGACCCAGCGATCCTTTCTATCACAGCTTTGATCTTCACTGGGTTGGGAGCCTGT 
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CTGAATCTAAACATCTGCCTCTCTACTGCCCACCCCTTCGGCATTTCCTAGCTCTCCCTG 

AAAGGTGCAAACATGCATGCAGTTACTCAAACAGAATATTTATCCTTAACTCCCTGTCCC 

TCACAATTTCTTTTAAAACMTCCACTATTGTCTACCCCCGCCCCCGCATCACCCTTTCA 

TCCAGGTGCCCATTCCCTTCTCTCTAGTCCTGCAAT 

Sequence 1059 

CGCCCGNGTCGGGGCCCGGGNGCCGCCGCCCCNCGGGCCCAAGGGG7TACCCAAAAGNAA 
TGGTCCTTGGCTTTTGGCACCCATTAACNNAAGGGAGCGNGCCGGGNCNTTGGGGGAAGG 
CNAAGCGGGGAAGTANTNCCGGCNTNACCNTTTCACACCGNCTAAGTCACTTNGGCATAC 
-AGAMGCGGACNCAAGAGCMTTGGCTCCAAGGTMTTGNGCNTTTGGGGCAATTTGGGN 
CCATTCCMGNGGGGAAAGGCCCTTCTT7TTCAAGGGGMGNGNTTGNGTTTGGGGTTAA 
GTCMGACTAGGCCTAAAATNTTTTCCCACNAATTCAATATTNGNTTAANTAAAAGNAAG 
TTCCMGNTTGGGGGTMTNGGGGCACCANGGGGTNGGGGTTNTTTTCCTTmCCCNCG 
ANCCAANCGGNAAGGNAGGGAAAAAATTCCTAANAGNTNTGGGTTCCTTAAATTTNGGCC 
TANNGGGGAAGGNAAAAAAACCGGTTCCNCNCCTTTTTGGCCCCCAANCTTTCNCCCNCC 
NAANTTTCTTTCCAAATTTCAANGGGGGCCCCCANAAGNNTTGGGGGGAAGTGGGAACCT 
TGGGGGCCCCCAAGCNAAGAGGGGGGCCCCCTTGGCCAACNNAATTGGCCCAAAAAACCC 
TTNCCCNMGGTNCCCCCCCTTGGCCNCNTTTTCNAANGAAAAAAAGGCTTTTGGAAAAA 
AANA GNGGGG GGTGCCCCCCTTCGGGGGNCCCTTTNTTTTAAATTTTTTNCAAAANGGGG 
GCCCi Mill I GGCCCAANTTGGCCCGGCCCTTTCCTTAATTTTTTCCCCCCCCCNTTCC 
TTGGCCCCCTTTCTTTCCCCCNCCACCCCCTTTTTT 
Sequence 1060 

CCGGGCAGGTACCTCCGATGGTCCNGGGGTGACTGCTAAAGAGAACCTATGAAAAAGAGG 

NTGTNTAAMTTCCATCCTTTTNAGGGGTGGTGAGAGGTGCCAGGTGCTATNNTGCAGAG 

AAGTGTNAACCAAAAACCTGTAAAAAANAGCCAAGTAGGCCACAAGTGGTGGCTCATGCC 

TGTMCCCCAACCACTTTGGGATGCCAAGGCNAGGCAATACCACTtAAGACCAGGNAGTT 

TGAAGACCAGGGCNGGGGTNNACACAGGNGCAGTATCCCATNCTCTAAAANAACAATNAA 

ACNTAGAANAAATTTAGCAAGGGGTGTTGGTNGNGCACACGGTCCTTGCTGGCACCTCCA 

AAAGGACCCGAGGGTGGNGANGGGATTTGGNTTTGACCCCTGGGGAAGGTTCAAAAGGCT 

TGCANTTTGGAGCCAATTGACCAGTGGTCCAACTTGTTTAATTCNNAACCCCCTTGGGGN 

CANACCAGGAAAGCCAATGGAACNCGTTGGTTCNTTTGNANAAAAAAAAAAAAAGGGGCC 

AAANAAATTTGGNNCCTTAAGAGGGNAATTGAAAACCCCGTTGGGAAAGNCTTGGATANG 

CCACNTTCTTNNTA GCAAA AAAACCNATTGGCCATTGGGTTCCCCCTTGGCCCCGGGGGT 

TGNCAAANTTGGGCTTTTMCCGCCCTTTGGTT^ 

Sequence 1061 

TGCGCTCGGTCCGTATCGGCTGCGGGCAGAAGCCGGTTATCAGCTCACTTCAAAGGCCGG 

TAATACCGGTTATTCCACAAGAATCAGGGGGATAACCGCCANGNAAANAAACATGTTGAN 

CCAA AAAGGGCC AGCAAAAAAGGGCCAGGGAACCCGTAAAAAAGGGCCGCGGTTGCCTGG 

gccgi rrmrn iccattagggccttcccgccccccccctggacgaagccaattnaccaa 

AAAAAAAATTCGNACCGCCTTCAAAAGNTTCAAGGAAGGGTNGGCCGGAAAAAACCCCCC 

GGACCAAGGGGGACCTTNATTAAANAGGNATTACCCCAAGGGGCCGTTTTTTCCCCCCCC 

TTGGGGAAMG GC1T NNCCCCTTCGATGGCCGCCTTNTTTCCTTGGTTTTCCCGNACCCC 

TTGGCCCGGCCTTTTACCCCGGGGAAANACCCTGGTTCCCGGCCTTTTCTTCCCCTTTCC 
GGGGGAAAAG 

Sequence 1062 

AGGTACGTGTTGGCTGCTTTCTTAMTTGTGGAAATTAAAGGAAGGGCTGAACTGTTACT 

CAGATTTGAATAGAAATGAGGAAGAAGAAGAACAGTGAGATAAGGAGATAGTGACAGTGA 

TACATGTTGACAATTCCATTGAAAAAGAACTTGGCCGGGTGTGGTCNTTCACGGNTGTAA 

TCCCAGCACTTAGGACACGAAGGCCGAAGCATCGCTTGAGCCCAGGAATTTGAGACCAGC 

CTGGGCAACATAGTGAGACCCCCATCTATATAAAAATTTTAAAAAACTTGCTGGGTATGG 
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TGGTGTTTGCCTGTAGTCCCAGCTACTGGAGAAGCCGAGGTGGGAGGATCACCTNAGCTC 
GGGAGGTTGATTNTGCGGTGAGCCGTGAGCACNCCACTGGACTCCAGCCTGAGCAACAGA 
GTCAGACCCTGTCTCTAAAAAGAAAAAG 
Sequence 1063 

AGGTACACGAGATTAAATTACATAGTTATCCTAGCAAAGATGTTCTGCAACATGTTTCAT 
TATAATTGTTTCAGAAGTGCTGAAAATCATTTATATTTACCATATACCCATGTTTGCCTA 
AGAAAACTGGAAGTGGTATCTAAATATAGATAGAAGNNTATATCATCCAGCACATTACAG 
CI I I I I I CTTATTTATTATGTGAAAAGAAAATAAATATTACCACATGACTA I Mil IAGT 
. T1TAAAAAGGAAAGATAAAATATTAATAACTCAAAAGTAAATAC ACTTCC TGTGATATAA 
GTNATNCMTCCAAGCCACCTGACCTTTTGTTTGAGATTAAAACTCTTAAATATGAAGGA 
AATATAAAAAGTATCCCTT 
Sequence 1064 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCATTCATTACCTGTCTTCAGCAACTACCAGCA 
AATGGCCAATTTGATGTCA I I I I I GTGTCAACCTACTACCTTCTAAGCCAATTGGATTAT 
CTTCCAAAACATCCCAGACCTTATATCCTGTCACACATAAGTATTTCAGAATCTAAGGTA 
GTCTTATCTAAGGTAGTCTTTTATGCCATCTTATGCCTTTATGTGTCCAATTATTTTCTG 
TTAAAGTTCCTCATTTCTCCTCCATATTTTC 

GCU I I I I I ICTTTGTATACACATGGTTNCAGCCACCCTGAGGTCAAGCTGCTGCAATGC 

TTTATTTACCTTGNAGATATTGGGTATATTTAGTATTCCTCTTATCTGCAAGCTAANGAG 

ACAAAAAATNACAAAACTTCTTAATCTGCTTNCTTTTCCI I I I I I I CTCCTTG7TTGGAC 

TTCCTACTCTTTGCCTCTTGGGAAGCTGGGCAAACTGACAAGCTCATAAAATCGTCATTT 

CTCTGAAACCAGGAAGAAGCCCT 

Sequence 1065 

GGAGCTCCCCGCGGTGGCGGCCCGAGGTACGGTATATTAAAAACAATCACTGCAGCCCTT 

AAATGATCCAATAGTTTAGCTTTGAATGCATTCAAGCTMTGGGGGCTTTAATGGCTTCT 

GACAGATCATTCCATTGATCAAGGGGCATGTATGAAATACGCTGCCTACTAAACTCTGTT 

TTGATTTTGAGAAAAGAAAACTAAACGTCCATGGTAAACATGGCTTGATTGTTCACATGT 

TCACCGAGATCCCTCTCAGCGTGACCCTGGTTGGGAGCAGAGGAGGGAAGGATAATGGAA 

TCCTATATACCTGGTTTGCTTCTCTCCTGACTCTAATCAAGGCATAATCAACGGGAAACC 

TCTGTCTGCAGAGTTGGGGGCAGGGGGAAGGAAGGTGACCAGCGGTCCAGTTTTCTGGGT 

TGGATGAAGTTATAAGCCACGGCTTTGGCTGTCTGAGAAACTGATTCATCTCTTGCAAGC 

CTTGGGGGTTTGAAGGTTMCTGTTCTGGTTCCAACGTGGAAAGCCCGMTTCCCTATTT 

GAAAGGATAATCAGGTTTTTCAGTTGGGGACAGCAGTTTTTGCAAAAGAAACAAATGAAA 

AT 

Sequence 1066 

GGCTGGCCTTTCCTTTGGGAATGGTGGGTAAACCGG I I I I I 1 I NANGGCTCCCCTTNTCC 

CGGAAATCCGAAACCCCTTGATTCCCCCGGTCAACCCCGGTGGGTNAACCATGGGTAGGG 

CAACGGGNGAACTTACCATCGAAMGGTTTGMTAGGGGCAANACGTTTCCGAAATGGGG 

GTTCGGTTCGCCCGCCCCCGCGGTTACCTTGCCCGGGGCGGGCCCGNTTNTAAGAAACTA 

AGTGGGGMTCCCCCGGGGCCTTGCAGGGAAATTCCGGATATCAAAGCTTTAATCCGATT 

ACCCCGTCCNAACCTTCGAAGGGGGGGGGGGCCCCGGGNTACCCAMGCTTTTTTGGTTC 

CCCTTTAAGNGGAAGGGGTTAAATTTGCNCCGCNTTTGGGCCGTNAAATCAATTGGGTCA 

ATAAGCTTG1 I I I I ' C CCTGTGTTGG AAAATTTGTTTATTC C CGTTTC AAC AAAATTTCCC 

ACACAAACAATTACCGAAGCCCCGGGGGAGCCATTAAAAAAGTGGTAAAAAGCCTTGGGG 

GGGTNGCCCTAAATTGGAAGGTGGAAGCCTAACCTCACCATTTAAATTTGGCCGTTTTGC 

CGCNTCAACTTGCCCCGC Mill CCAAAACCGGGGGNAAAACCTTGTCGGTGGCCCAGCT 

TGCNATTAAATAGAAAATTCGGGCCCAAACCCCCCCCGG 

Sequence 1067 

CCGGGCAGGTACCAGCTTTTGCACTTTCCTCACTGTAAAGATGTTGTATGGGTTACTTAA 



Page 179 of 869 



WO 01/51628 PCTWS01/00798 

Table 2 

CCAGTTGGGGTTTAATTTCCTTGGCTGTATAGTGGACTTAGTGTCACATGCCTACCTCAG 

GATTrCTGNGACAATTAAATATGATATTTTAAGGCTTCTAACTTACTGGATCTAAATAAA 

TTATTCCTGGAAAAAAAAAGTTTGCTGTGTAATAAAGCACCTAAACAGGAATGAAAAAGC 

AGGTATAGAACTTCAGGTGTTTTGTCACGCTCCCTCCCCCAGTGGTGTGTGTAGGAAAAC 

TTTGCTMTTTCTGNAAANCCNGATGNCTAATCTTTTCTGACTTTACGTTCAATCAGGCT 

GATTATTTTCATTCANCAGATGTNNATCGGTTTTG 

Sequence 1068 

I^^ CNCCCCGCGGNGGCGGCCGAGGANCNCGGGACGGG NCANTNAANNTNTAAGCAT 

ACAGGCCAGCCGGNGGGGACNACAANCNAAAAAACCNTGGACTGGGGACTTNTACACCAA 

CAGAACTGGGGCAAAANNGNAGCCGCCGNTGATGGNTAATTCGGGGAAATNAGGCATCCN 

GCCCTGATGATGCCTGCTGGTAAATCCCCCNATCCAAAAAACAAACCCGGGTNGTANACT 

ATCGAAAANCCTAAAAGGGNAGAAATAATGGGAAAGCCCTGNCTGGTGGAGCCACACCCA 

GGNGANNACCTCTAAANAAAACNTGGCNGGAATGNCTGCNGGCGGTCTAAAAAAATAAAC 

ACNCGTAACANGGCTTAANAACAANAAGANCCCGTANGAAGNACCCCCNTCCCCGGNNAG 

NANNACNCAAAGAAAAAANACAAANGGGCCCCTNGCCCCGGGGCCGGCCCGCCTTCNAAA 
AAACCAAGGGGGGAACC 

Sequence 1069 

CGAGGTACCTGCCAGGAAAAATAGCAGCCCANCCACCAGGTGGCAACCTGCACTATTGAC 

CAGACACTTGGCCAGTTTGATGTTGGGCACCGAGGACCTGAANGCAGTGTTGCACATrCC 

AATCAGGCAGAGCAGCAGGGCACCCATGGCGATCAGCATGCTGAGGGGTAGGGCAAACTG 

GAGGACACGCAGGTCCAGCTGGTCAACTGATGAGTACCTGCCCGGGCGGCCGCTNTAAGA 

ANTANATGGATNCCCCNGGCTGNNNNGAATTCCGAATATCAAGCTTATNNANTACCGTCG 

GACCTTCCTAGGGTGGGGGCCNCGGGAACCCCAACTTTTTGTTNCCCTTTAAGGTNGAGG 

GGTrTAATNTGGCCCCNCTTTGGCCGGCTAAATTCAATTGGTTCMTAGACCTTGTTTTT 

CCCTGGTTGTGG AAAAAATT7TGTTTAATTCCCGNTTCANCAAANTTTN CNN CCANC AAA 

vnA I I CCNAAAGCCC 

Sequence 1070 

CCGGGCAGGTACGCGGGGTTGATGGGGACAGAATAATCACTCAACACCAGATGCAATGCA 
ACAAAGTAATTAATGCTCAGAACTCAGAGCTGGGTATGAATCAAGGATCCTTTACTTAGT 
AGTTAACTAGAAGACTGGGACTAGTTAATTTCTCCAGTGCTCAGCTTCTTATCTACAGGT 
ATCTGCTGAATCCTCCAAGGGGAACAGAATAAGGGGAAGATCCCTGGAGATCAGATTCTC 

AGATCCAGTATCCCGTGTGCTGACCTGCCATGACAAACACTGGCAGGGAGCCTGCCATAG 

AGGAAAAGAGTAAGAATTGGCTGAGTTCACAT 

Sequence 1071 

AGGTACTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGACCAGCCT 

GGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCATGGTGGC 

GCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCGCTTGAACCCAGGA 

? G Z G ^ GG ^ GCAGTGAGCCGAGATCGCGCCACTTGCACTC CAGCATGGGCGACAGGAGC 

AAGACTCCATCTCAAAATAAAGAAAGAAAGGAAACAAAGTAAAAGAAAAGCTTAATATTG 

GAACCTTCTCTAAAAAAAAAGAAAAAAAAANGGAAAGCCTTGATGCCACCACCAAATCTT 

AAATTTGGCCAAAGTTCGATTCMTTTAAAAGGGAATATTTTATTTTGCCATCACCAAAA 

ATAAATTTC TTTTTAC CTCCCCCCCAAAAAATTCAAATTAAAAAAAAGGTTTTCAAAATT 

ANGCCAAACTTTTTTCCTTAAATTGNTGGTTTTAAAAAAATG 
Sequence 1072 

AGGTACGCGGGCAGACTTTAATCATCAACATAAGCTTTATTAAAATACTGGTTCATCCTG 

AAGTTCGACTCAATGGCCATTTTCTCCTTTCTCTTAATTCTGCACACTCGTGTCCACTGT 

G ^ GATGCAAGACCC ^ CAGAACAGCCCACG CCTCTTACTGTCTAATGTTTCCAGTGGCAC 

AGGGACAGA ^ CA11TGGGAATCCCT GGGCCCAGTGTCCCAGGAAGAAACCCACAGTTGA 

GCCTGGAGTGACTGTCCTCTGATTCTCACACTTACCGTTTCCTGGGGCTCCTCATCTTCA 
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ACACGGTGACCAGCGTGAAGAAACAGAGCCTGCCCCCTACAGACTATCATTTAGAGGGAG 

TTTATTTCTAGTGTCCTTCAGACCCGCCCAGTACCTGCCCG 

Sequence 1073 

CCACCGCGGTGGCGGCCGNGGTACGCGGGAGGATAATGATGTTGGGAAGNGATTTCNTGT 

ANCTATGCTGAGCAGACAGNCTTCAAGAGTTAGCAAAGTTTATATGTGCGGGTGACAAAA 

AAGAGGCTTATGAGTTTTGATGACTGAAAGGAGACAAATAATAGCTAATCTTTGATAAAG 

ACTAACTATGGGCAAGGCTTTACTCTTTGmGTAGTMCmCTTAATCCTCATTAGNA 

ACCCCCTTTATATANGATGCTGTATTTTCTCCAT 

Sequence 1074 

CCGGGCAGGTACCTGTGAACCCAACCTACAGGCTGGCCCCTGTAGACACAGAGAACAGGT 

CCATCCCAGCACCTGGATGACCCTTGCAGATCAAAAGCCTTCCTTAGCACCATGTCAGCC 

CCTATGGAAATAGGCTTCAGGCCAGCCCCTGTGGATACAGGCTGCAGGCTCATACT1 GCA 

AACCCAGGCTTCAGGCCCACCCCTGCAAACCTAAAAAACAGGTCTACCCCAGTGGGCACC 

TGCAAAAGGCCAGCTTGCATGAGGACTCTAGCTGCAAGCACGCTCATGGACCACATTAGC 

CAGCCTACTCAAAATCTCTGTGAAGGGCTTTCCCAAACAAGCCA 

Sequence 1075 

AGGTACCTGCTCCTGGGCCATCCTGTGCTAAGGTTGCATTGCTGAACGGATGACAGGGTG 

GGTGGTGTATGTCCCTAAGGCCACCAGAGATGGTTTACTCAATTCACATGCTTCCTCCAC 

CTGGGATGAAAGCGGTTMTTAATGTTTGATTCTAAGAAGTCAGGTGATGTGATTCAGAT 

TAGTTATGGGGTTGTGCTGAAGCGCGGGTGGACCCTAGGAGGTGATTTTAACGGAAGTCT 

CAAAATAGCAGCCTTGGMCTAGTTGGACTAGTTATTTTTAAAAATAATGAGAAATGTTA 

TAAAGTCGGCTGAAGGTATTGATTTACCT7TCAGGTTGTTCAGAAACAGAATCTTGAGAA 

ATAGCCTGCTACACAGTTGTAAGGGGGATAATAAAGAGATCCCCCTTCTCTAAAAAACGT 

TTACAACCCAATC 

Sequence 1076 

TCCACCGCGGTGGCGGCCGCCCGGGCAGGTACATTTAATCCTCACAGCAACCCTACGGGA 

TAATATCGATGTGCACCATTTCTATAGAAATGGGCTCTATCTTTAAAAGAATTCAGACAG 

CTTGTTAATCTACCTCCCCTTCCCCTTGCTAAAAAGGGTGTGAAAAGTTGCGGTCATTGA 

GATCAGAGAAAGMGTGCACTGGGGGGGTCMCCMTGACGTTTTTTGTCAAACTGAAGC 

TTTGCTTAGTTCCTAGTCAAGAGCTCTGCTTAGTGATCTATCTGCCCAGGGCTTAGGGAA 

GTCCCTGAGCTTATTTGTTTCTCAGCCCGAACTGCCTCAACTCCAGTGGGGAACTGTGGC 

AAGCTCCAGAGCAGTTGACTTAAGTGTTTGTAAGTGGCTCAGCCCAAAAAACAGTCCCAA 

GCCATTTCTTTCCAAGGAGGTTTCAAGGGAAAGGGAGCACTGCTGGGTCTCTCTTTTGTG 

AAAAGATCTTTATTTTNTGAAAGGGCATTTCACTTGTAATGCCCACTGGTCCCTTTGTTC 

CCACTNGNCAAAGCTTGGGTGCCAGTGGGTTTCNCCCCTGTTNATTCCCAAGGACTTTTT 

GGGGGAAGGCCTGGAGMTTCGGGAAANAATCACNTTTGAACCNCCAGAAATTTGGGAAA 

ATCAAGCCCTTGGGCNNCNCATAAATGGAGACCCTNT l Mill > AC TTAAAAAATTTAAA 

AAAC CCCAAAANTTCCTTGGTTTTTNGNGGTGATGCCTTGNTTTNGGNNCCCCAAAANTT 

TTTTGGGNGGCTTNTGGGGGGGNGNNAAACCCCCCTCCCCNAAGNNGATTACGGGTNNNA 

ATGGGGCTTTTGN NTTGGTTTTTTTN N AC AAAAAAAAAN N NG N ACCCTTTTTTN AAAN AA 

ANAAAAAAAAAA 

Sequence 1077 

GGCGGCCGAGGTACCAGGGCACTCCAMTATGTMTTTAACACATTCATTAATAACTATG 

ATAAATAGATATTTTATTCCAAGAACAATATCATTAGAGAGTTAGATTGGAGGGCAAGGG 

GATTAAGTATCTTAAGTTTATTGNTTCCCGAGTGACCATTTTTAAAGCCTNCTTCCAGGA 

ACTGTGGAATGCCTGCCAGGCTCCCTGACTGCTGTCTGGTGAGGGTCCCTACTGGTGCTG 

TGGACAGCACAATACACAATGGGGCACAAGAAAGCCACGCAGCCTCCCCCTCCTAGACAC 

ATGCTGTGCACCCCCTTGGCTCTGCTTGGAAGGTTCAGCTCTGCAAATGGAAGGTTGCAA 

ACATCAGACACCCTTCTTTTCTTGTTTCTGTCTCTCTCCCCTTCTTCTCCCCACAGTGTC 
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CANCCTTGGCCCCTGACTTTTTTCTGGGGTTGGGGGGAGGCCCTGTNATGGTTCANNAAA 

NNTNAAGCATNTTGNGTGAANACCTTCCCTTTAGCTTCAAATGGTNGCTTNTTGGCCCAT 

AAGGGGANCNAGACAAAGCAAANNCTTGAAGTTTTGGNGTNAAGCCCCTANAACNCAATG 
GG AAAAAAAN G AAA 

Sequence 1078 

GCCGNCCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCC 

TCACAGCCAGTATACNAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACG 

CCTNTCAATGGTGCCTGACAAGGNCTTCCCTTATNTGGACGAGGAGGTCCCCCCGGCCGC 
TCTAGAACTAG 

Sequence 1079 

gtggcggccgaggtacgcatctttgaagaaagtagctgtggatatgggtggaagcataaa 

gg atcctagtaac tgaa taacatcagtaacac attcatg attttatcttc attagcaata 

ggactcaagcaattgaaaaaccattgactttgtaaaaacacatgtgcgtgggccaggcac 

agtggctcacgcctgtaatcccagcactttgggaggctgaggcgggcagattacttgagg 

tcaagagtttgagaccagcctgctcaacatggxgaaacccagtctctactaaaaatacaa 

aaattagctgagtgtgatggcacatgcctgtaaYcccagctactggggaggctgaggcag 

gagaatctcttaaaccgggaggcaggaggtttgcaagtgagacaacatcactgcattgct 

ctccagcctgggcaacgaagtggatctccgtctcaaaacaaacaaacaaacaacaaacca 

aaaa caaacaaaagcacatctgcttgctcttgtgtagagagattcccttttttgaatttc 

ttttgtggcn ctagntttaaaaaaataag 

Sequence 1080 

GCGGTGGCGGCTCGATGTACTTTATGAGACCCATGCGNNCNGTATTGCCTGAGGTCAGGA 
GTTTGAGACCAGCCTGGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 
GCCGGGCATGGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAAT 
CGCTTGAACCCANGAGGTGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAACA 
TGGGCGACAAGAGCAAGGACTCCATCTNAAAATAAAGAAAGAAAGGAAACAAAGNAAAAG 
. GAAAANGCTTTATATTTGGMCTTCTCNTAAAAAAAAGAAAAAAAAAGGAAAAGCCC 
TGCACACCAAATTNTTAAATTTTGGCCAAAGTCGATTCAAATNTAAAAGGGATTATTTAT 
TTTN CCATT CACAAAAATAATTTCTTTTACCT^ 

AAGGTTTTCAAAAATACGCAMCCTTTTTTCNCTAAAATGGGGGNTTTAAAAAAAATGG^ 
TAAATTCACCCAAAAATANCATTGGTGGTTCCCCCCMCCT^ 

TAAAATTNCCTTAATTGGTGGTTACAAAGTTGGAAGGGTNATTACCCCTTNAAAAGGTGN 
AAGGGCNMTATTTAAGGAAGAAGTTTGNAANCCCTTTCAAAAACCCTTGNCCTTGGGAG 
GAAN AAGCCNTTGTTTC I I I I i CTTTAANAAAGNTTCTNNAANGGAAAAATNGGGTT 
Sequence 1081 

GCTCCCCGCGGTGGCGGCCGAGGTACT1 I I 1 I I I I II I I 11 H I I I || I AAGAGATGGAG 

TCTTGCTCTATCACCCAGGCTAGAGTGCAGTGGTGCAATCTCGGCTCACTGCAGTCTCCA 

TCTCCTGGGTTCAAGGGATTCTTCTACCTCAGCCTCCTGAGTAGCTAGGATTACAGGCAC 

ATGCCACCGTGCCTGGCTAACTTTTTGTATTnTAGTAGAGACGGGGTTCCACCATGCTA 

GCCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCGCCTGCCTTGGCCTCCAAAAATG 

CTGGAATTACAGGCATGTAATTCCAACCTTAGACACTCAGTTTAAAACCCTNTTATCCTA 

TCTTCACTGGAAGAGCAAAGCAATCTCCTACCTTGGTTTTGTCAATAAGAAGGGAATATT 

ATTAAGCCCCAGTAATCCTGTTGATTACCATAGGTGGGGCATGTTTNTNTAGAAGTAAAG 

TTNGAAATCTTTTGGGGTTTCCCGAATGG 

Sequence 1082 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTAGCATAATTTTGA 
ATTATTTCAGCTAAGTAGATGTTAAAGACTTGACTCAAGAGTGGCTGAAAAAGAAAGGAT 
TCTCTATTCCCCTGATTCTCTATCACTATATTGACTATATAGTTTCTTTTTGTAAGATGC 
GTTCATTCTGACTATTCTTACCAGCATTTATTCTTACTAGCATTTGTGACCCAGAATTTT 
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CAGAGCAAACAAGCAAGCAAAAAGCAAAATTAATTTAAATTTCATTTATAATTACTTAAT 
TTTCATTCATTGAAAAGTCATCATTCTAAAATGAGAT7TCTAT TTTMTTATAATTAC AG 

ATG CTCCTTGACTTACG ATGGG GTTAC ATTTCTTGTTTTTTrG I I I I I IGl u l l uGTT 

TTTTTGTTTTTTGAGACAGGGTCTTGCTCTGTTGTCCAGGCTGGAGTGTAGTGGCACAAT 

CATGGCTCACTGCAGCCTTGACCTCTCAGGCTCAAGTGATCCGTGCCCCTCAGCCTCCTG 

AGTAGATGGGACCACAGGTGCATGTCACCATGCCTGGCTAATTTT 

Sequence 1083 

CCGGGCAGGTACTTTTCACTCTCCTACATGTCTCAGTTTGAACAATAAATTATACAGTCA 

CCAGTCCCAGGAAAAGATGAAATATGAAAMGATGCCAGCAGATTTAGCAATTGCTATTT 

ACACATCCCTGGCATTGTCTAACTCTGCTCTGTATAGCCAGGTCTCTAACCATCATCTGG 

ACTGGAACCATATCTTCTACCAAAAGCATGAGGAAGGCTGGGCATGGTGGCTCACACCAG 

TAATCCCAGCACTTTGGGAGGCCAAGGCAGCAGGACTGC TTGAAG TCAGGAGTTTGAGAC 

CAGCCTGGGCAACGTAGAAAGACCACATCTCTACAGAAAATTTTTTAAAAATTAGCCACA 

CATGGTAGTGTGTGCCTGTAGTCCCAGCTAGTCAGGAGGCTGAGGTGGGAGGATCGCTTG 

AGCAGAGGAGGTTGAGGCTACGGTGAACCATGATTGTGCCACTGCACTCCAGCCTGGATG 

ACAGAGTAAAACCTG 
Sequence 1084 

TGGGCTTGTGTGCACNGAAACCCCCCCGTNTCAAGCCCGACCGCTGCNGCCCTTTAATCC 

CGGTAACTATTCGTNTTGGAGTTCCAAACCCNGTAAGGACACCGACTTTATTCGCCCACT 

TGGCAAGCCAAGNTCACNTGGGTTAAACAAGGGAATNTAGNCAGGAGCCGAGGTATTGGT 

AAGGGCGGGTGCCTACAGGAANTTCTTTGGAAGTGGGTGGGCCCTAAACTTACCGGGCTA 

CCACTTAGAAGNAACAAGTTNTTTTGGTATCTTGCGCCTCTGGCTTGAAAGCCAAGTTAN 

CCTTTNGNGAAAAAAAAG 
Sequence 1085 

AGTTCGGTGTAGGGTTCGTTCGCTCCCAAAGCTTGGGCCTGTGTTGCACNAAACCCCCCG 

TTCAGCCCCGACCCGCTGCGCCTTTATCCCGGTTAANCTATTCGTCTTGGAGTCCCAACC 

CCGGTAAGACACGACTTATTCGCCACCTGGCCAGCAAGCCCACTTGGGTAAACAGGGATT 

TAGCAAGTAGCCGAGGTATGTTANGGCGGTGCTTACANAAGTTTCTTTGAAAGGTGGGTT 

GGGCCTTAACTAACGGGCTTANCACTTANGAAAGAAACAAGTTATTTTNGGTAATTCTNG 

ccgccttttgcttggamnncccaatttaccctttcnnggnaaaaamgaaagtttgggg 
aaagnctttcttttgaattcccnggcaaaaaacaaaaaacccacccggnttgn™ 

GGTGGGG I I II I I I I I I IGTTTTTTGCNAAAGGCCNNGCNAANANTTTACCNNCCCCANG 

AAAAAA 

Sequence 1086 

AGGTACA l rTTTTTTTTTTTTTTTTTI I ATTTTTATTTTTAGTAGAGACGGGGTTTCACC 

GTGTTAGCCAGGATGGTCTCGATCTCCTGGCCTCATGATCCGCCCGCCTCGGCCTCCCAA 

AGTCCTGGGATTACAGGCATGAGTCACCAAGCCCGGCCAGCAGCAATAATGTTTTACCAA 

TATGAGCATCAAGCAACCGTGGTTCCCTAGAGCACCTTCCAATTGTGAGCTCTCAGTTTA 

TGTGTGTTTTTACTCACCATCTGTAACATCACACCTTCAAAACAGACCCTTGGCCCATAG 

ATAGMGATAAAAGAAGCTATAATCTCTCCTTTTCCCAAAAACAAATTTTCTTAGGAAAT 

TTGATTTATrrnGAGAMCCTAATATTCTTGTTTAAAATACAACTCTTCC 

GGTCMCAGTGAGATCCTATGGACACCTTTGATGATAAAAATAAAAATATTCTTTAACTN 

AAGGGACACAGAACCTAGGCACGGNTTCCCCAGGCACTCAGTCCAGGTGCTTCTCAGATG 

AACCTGTGTATGCTCAACTTCATTTCCAGTAAGGGACACAGCCCCANTNTGAGANGAGAC 

TGGGAAAAAAGAAAGCCCCC 
Sequence 1087 

AGGTACCTCCGATGGTCCCTTGGTAACTGCTAGAGAGAACCTAGGAAAAAGAGGTGTATA 
AAATTCCATCCTTTTCAGTGGTGGTGAGAGGTGCCAGGTGCTATTCTGCAGAGAAGTGTC 
AACCAAAAACCTGTAAAAAAGAGCCAAGTAGGCCACAAGTGGTGGCTCATGCCTGTAACC 
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CCAACACTTTGGGATGCCAAGGCAGGAATACCACTTAAGACCAGGAGTTTGAGACCAGCC 

CGGGCAACACAGGGAGATCCCATCTCTAAAAAAATAAAATAAAAAATTAGCAGGGTGTGG 

TGGCACACGTCCTGCTGGCACTCCAGAGGCCGAGGTGGGAGGATTGCTTGACCCTGGGAA 

GTCAAGGCTGCAGTGAGCCATGACAGTGCCACTGTCATCCAGCCTGGGCAACAGAGCAAG 

ACCGTGTCTTGAAAAAAAAGGCAAAATTGTCTAGGGATGAACCGTGGAAGCTGAAGCCAC 

TCTGTAGAAAACCATACATGTCCCTGGCCGGGTGCAGTGGCTCACGCCTGTAATCCCAGC 

ACTTTGGGAGGCCGAGGGCGGGCGGATCACGAGGTCAGGGAGATTTGAGACCATTCCTTG 

GCTTACACAGTGGAAACTCCCGTCTTTTCCTTAAAAATTAAAATGAAAAAAA 

Sequence 1088 

TCCACCGCGGTGGCGGCCGAGGTACTGGGCCTCTCTCTCTCAGTTCCCCACNTGGGCGCA 

GTCTGANCTCTCATATCGTCTTTCTGACATTTMTGTTTTTAAAAAGGATGACAAGGACA 

GTTCCCTTGGGAAGTCATTTCAGTCACGACCTGCTGTGTATCTGTGGAGTTGACATGTTC 

TGCATAGTTGAGTGCTCCCCTGCTCTGGGAAGCCTGCACGGGGTCCCTGCACACATCCCA 

GCATGGAGGGTCTCAGGGCTGGGTTTGAGACGAAGGTGGGCGAAAGGTTCACATGAACTG 

AGGTCATGACAGAGTGATCAGGTGGGACCAGGCAGAACTCTCGGCCCCCAGGGCTTGCTG 

GTACCTGCCCGGGC 

Sequence 1089 

NCCGGGCAGGTTCCCCACTGGGAGACGTCCTCAGCANGCAGCCNGAGAAGAGGCACAGNN 

CNGAAACTTCTGCTCTAGGGGTAGGAATGTGGTGGGACCCATCTCTCTCCTTGGAAGAAT 

GTGGGACTCTCGCCTGCTCACTAAAACCAACACCTGCCGATGTGCTAAGAGTCCTCTTGA 

AGTCCCAACAATAGACATCAAAGCTGATGTTTGGAAGTTCAGATAACAAGGGAAAGGGTA 

GTTTAAAAAAAAAAATAAAAAAAAACGGTTACCCTCGGCCCGCTCTAAGAACTAGTGGGA 

TCCCCCGGGCCTGCCAGGGAATTCCGAANATTCANAGCTNATTCNGATACCCGGACGACC 

CTCNMGGGGGGGGGCNCCGGNAACCCCANCCTTTTGGTTCCCTTTAANTGGAGGGTTTA 

AATTTGGCGCCGCCTTNGGCCGATAAANCANNGGGCCCATAAGCCNTGGNTTTCCTTGNN 

NTNGAAAATTGGTNAATCCCGNCTTCAACCAATNTTCCNCCANCAAACAATACCGAATCC 

CGGGGTAGGCCAATAAAAAGGNTGGTCNAAAGGCCCTGGGGGGGNNGCCCCTTAAAATGG 

AANTNGAAGCCTAAAACTTCCAACAATTTNAATTTTGGCCNGGTGTGGCGGCCTTCAANT 

TGGCNCCCGCI I II I i CCCAGNTCCGGGGGGAA 

Sequence 1090 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTCCCAAAACACCTTTTCCTGTTATTTCATGTA 

TTTATTCTTATTCTCTTTTCCGATGGC I I I I I AAAGACAGTTTAACCAGCCAGGCACAGT 

GGCTCACGCCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGTGGATCACCTGATGTCA 

GGAGTTTGAGACCAGCCTGACCAACATGGTGAAACCCTGCCTCTACTAAAAATACAAAAA 

TTAGCTGGGTGTGGTGGCGTGCACCTGTAATCCCAGCTACCTGGGAGTCTGAGGCAGGAG 

AATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCCATTGTACTCCAGCCTGGGCAGC 

AAGAGTGCAACTCTGTCTCAAATAAATAAATAAATAAATAAAGACAATTTAACCAACATG 

ACTTACTAAAACATTGTGTTTTCTTCTTCCTTTTCTTAGGTTGATGCTATGAAA 

GTAAAGGAAA 

Sequence 1091 

GCTCCCCGCGGTGGCTGCCGAGGTACATTTTATCCCANTNCTCACTCCAAGGAAGGAAGT 

TAATTAAAGTNTCATCTAGCAGACATTCCTCCCCCAGCTGTGTCTAGTCACGAGGAGAAT 

GACGATTGTGTGCATCTGGGCAAAACATTAAAGAGGAAAGTTCTGCAAAAATTAGGTATG 

GTTTGGAAATAAGAAATAAACATAGTCTACATATTTTGCAGAAACAACATTAAAGAAGCC 

CCAGGCCAGCCCTGTCGCTTGGGGGCCTCCCAGGTGGAGGCAGCAAATAGTTCAAAGAGG 

AGCTGTTGGAGTATCAAGTCTAGCTCCTAAAATGAGACTGTAAGATCCTTGAGCACAAGG 

GCCCACATTTTCTCCATCTCTGGGATGTGTTCCCCACCCAGACCTCCAGGTGAACGTGCT 

TACCCAGACCTCCAGATGAACGTGCTTACAGGTACCTGCCCGGGCGGC 

Sequence 1092 
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CCGGGCAGGTGCTTTTAGTAGAGACGGGGTTTCTCTATGTTGGTCAGGCTGGTCTTGAGG 

TCGGGAGCTCAGGTGATCTACCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGA 

GCACTGCGCCCGGCTTTMCAGATATTTTTATAAGCAAATCATTTATAAGTATGGGTGAC 

TATTCTTTACAAATCCATTTGTGATGCAGTGGTAATTTGGTGGCACTGCAATTATGGCCA 

GCAATGAAGATTATAAAATAATTGCCTGAGGAACCAAGCTCATGTGCACTAAAAAGAAGC 

AGGGATTCTTAGMGATTCTTAGAAGTATAGAGTAGTGATTTTTTTGGGGATGGAGTCTT 

GCTCTGTCGCCCAGGTTAGAGTGCAGTGGTGCGATATCCGCTCACTACAACCTCCGTCTC 

CCCCTCTCAGGTTCAAGCGATTCTCCTGCCTCAGTCTCCCAAGTATCTGGGACTACAGCT 

GCACACCACCACACCCAGCTATTTTTTGTA 

Sequence 1093 

CGTTTCAAGCCCGAACCCGCTGCCGCCCTTATCCCGGTNAACTATCGGNCTTTGAAGTCC 

AACCCCGGGTAAAGACACCGACTTTATCGCCACTGGGCAAGCAGCNCACTGGTAAACAGG 

GATTAGCCANGAGCGGAGGTATGTTAAGGCCGGGTGCTTACAAGAAGTTTCTTGAAAGGT 

GGGTNGGCCTTAACTTACGGGCTTACACCTMGNMGAAMCMGTANTTTTNGGTTATT 

CTTGCGCTCTTGCTTGAAAAGCCCANGNTTACCCTTTTCGNGNAAAAAAAGAAGATNTGG 

GTTAGGCCNTCTTTTTGAATNCCCGGGCGAAAACNAAAAACCNAACCCGGNTTNGGGTTA 

GCCGGGGGGGGGGTTTT 

Sequence 1094 

GAATTGGAGCTCCACCCGCGGTGGCGGCCGAGGTACTTACTAATGGGCAAAAAATGAGGT 

TTAATTACATTTGCTGATTATATTAAAATGGGAAAGTGGTGAGAAATGACAGAAGGTATT 

GTGCTGGGTTTTAGCAAAAATGAAATTGTAGGTGTGGGAGACAGGTAAT7TAAACTAATA 

TTGATTGGGATTCTGTCTCAGTGAGTTGATATTCAGGAGCTTGTTTTATGTAGTTTGTTC 

TAMGGAGTCATTTTTCTTGCCTGAGAAAAGCMCAATGATAAAAMGTAGGCAGATTTG 

GAGGCTGTTTATAATTCTATATTTTGTGAATTCATAAAACTTATTTTAATGTAGAAGTT 

CTTGACAGACTTTAGGTGGTCAAAAAGGGTCAAAATGGAAGTAAAAATATAATTTGGTCA 

TTTATTTATTCTACGCATATTTATCTTGCCTAATCTGTGCCAAGCACTGTTCTAACAGTT 

TAGAATATTGGTGAACAAAACAGACTAATTCCTGACACTCCATGAAACTTAAATTCTAGC 

AAAATTATTTCCTCTTGGG I Mill 

Sequence 1 095 

TTACTACTGGGC AN AAAATG AGGN N NCATTACATNTGCTG ATTATATTAAAATGGGAAAG 

TGGTGANAAANGACNNANGGTATNGNNCTGGGNTTTAGCAAAAATGAAATTCTANGTGTG 

GGAGACAGGTAATTNAAACTAATATTGATTGGGATNCTGTCTNAGCGAGTNGATNTTCAN 

GAGCTTGTTTTATGNANTTNGNTCTAAAGGAGTCATTTTTCTTGCCTGAGAAAAGCAACA 

ATGATAAAAAAGTNGGCCCATTTGGAGGCTGATTATAATTCTATATTTTGTGAATNCATA 

AAACTTATTNNAATGTNNAAGTNNCTTGACAGACTTTAGGTGGTCAAAAAGGGTCAAAAT 

GGAAGTAAAAATATAATN NTGGTC ATTTATTTA7TCTAC NC ATATTTATC TTGCCTAATC 

TGTGCCAAGCACTGTTCTAACACTTTTAGAATATTGGTGAACAAAACANACTAATTCCTG 

ACACTCCATGAAACTTA 

Sequence 1096 

CCGGGCAGGTACGCGGGC7TCTCCTCATATCTACATGACTCCATCCTCCATTTCTCTTGG 

TCTCTCTCAAACATACCTTACACATGAGACCCTCCCTGACCACTCTATATACTCTATATA 

AAATAACCTCACCCCCACCCTGAGACGTTCTTTCCTCCATACTACTCATTCTACTTTACT 

I II I I I CTCAATAATATTACCACACTCTGACATACTACATATTTACTCGCTTATTATTTG 

TCTCTCACACCTGTAATCCTAGCACTTTGGGAGGCCGAGGCAGGCGGATCACGAGGTCAG 

GAGATTGAGACCATCCTGGCCAACGTGGTGAAACCCTGTGTCTACTAAGGGTTCAAAAAT 

TAGCTGGGCGTGGTGGCGCATGCCTGTGGTCCCAGCTCCTCAGGAGGCTGAGGCAGAAGG 

GTCCCTTGAACCCAGGAGGCAGAGGTTGCAATGAGCCGAAAT 

Sequence 1097 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGTGATTTTATTACCCAAGTGGAAGAAG 
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AGATTTCCCGACACTACAGGAGAGCACACAGCTGCTACAAATGCCGTCAGTGCAGTTTTA 

CAGCTGCCGATACTCAGTCACTACTGGAGCACTTCAACACTGTTCACTGCCAGGAACAGG 

ACATCACTACAGCCAACGGCGAAGAGGACGGTCATGCCATATCCACCATCAAAGAGGAGC 

CCAAAATTGACTTCAGGGTCTACAATCTGCTAACTCCAGACTCTAAAATGGGAGAGCCAG 

TTTCTGAGAGTGTGGTGAAGAGAGAGAAGCTGGAAGAGAAGGACGGGCTCAAAGAGAAAG 

TTTGGACCGAGAGTTCCAGTGATGACCTTCGCAATGTGACTTGGAGAGGGGCAGACATCC 

TGCGGGGGAGTCCGTCATACACCCAAGCAAG 

Sequence 1098 

AGGTACAATAGGGAGTACCCAGGTTCCCTACGATGCCCCCCAAATCAGACCCAGGGGCTG 

CCTTCCAAACAGTGACAGGCACCTAAAGAAGGGACAGGGCANACGCTGCTTCAAGCCAGT 

CAAAGCCAAGACAGGCAGGCTGTGTGGCACGGTTGTGCTCCCCTGAATCTCTTCCTGCCT 

CGGGCAACTCTTGTGGATCACTTNCTGCCTCTGCCTTTCCATTCCCTTCCTTTGCCTGGT 

TTTCTTTCCTCCTACCAAATTTTTGACCTGCAATTACCAGGAGTTACCTGGGTCCTCCCT 

GGACCTTTTCTCAATGCAATCTCCTCCTTNCAGCACCCCCAAGAAGCGTCGCTCCTGTGT 

TCAAGACCAACCAGGGATGTGTCCAAGACCAAGAGGATGGTTGTTCACCTATGANGCTCA 

TGTAGG 

Sequence 1099 

ANGTACAGGCCCAGTTGGGTCCTACTTAAAGTTTTCCTGTTTGACAATTTATCAGGAATT 

ATrATTGCTTAAGATAGCTTTGAAGTAAATTGTCTGCGTCAAAGACCTTCCTCCACTTCT 

CTCACTTACCTAGCTCTTGGCAACTGTTTCCAAAATGTTTGCACTAACTCCATAGATTCA 

TGTCTGMGGATTAGATACTCCAGGAACATCTATTGGGAGAAAGTTTAACTACAACTGTT 

GTCCATTTTTCAATTAATGTGCCAGTCACTMTCAAAAAAACAGATGCTGTCGTTTTCCT 

CATCCACTACTTGGGACACCTAAGGATAGATGGACAGATACTTAANAGGCTGCCTCCTTC 

TTCTGAGANNGGGCTTTTCATCAGGGCTGTGCCACAGGGTGTGATGCTGTGTGATAGCAN 

ATGGCTCTGCTTCATGC CCATTACTTTTTGTTTTATC 

Sequence 1100 

CCGGGCAGGTACTAAAGCATGTTMGCTTTCTTCCATGATGGAATTTATAACTCAAGAGA 

AATAAAATCAGATTATGAATTCTAAATTCTAAGGAATGAGAAAAGAATGTGGGATAGAAG 

AGACATTGCGGTCAAGAGGTGGTGTGTGAAGTTTGAAAGTTCTGTTGCCAAACATATGCA 

ACTCTCATTCAAGAGAAAAACAATCTGCAAAGACTTTGAAGAATGAGAGAAACATGATTC 

TTAGGAGCAGGCTTTCAGATTCTCGATGATGGATCATGTAGACGGCAGATATTATCCAGC 

AAAATCAGGATAATTCCTGAGTCGGAGGCTGCATTCCAGCCTCTATGTCCCAGGCTGGAG 

CAGTGGCAGGATCTCAGCTCATTGCAACCTCTGCCTCCCGGGCTCAAGCAA 

Sequence 1101 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGGGCGGGTCTGAAGGACACTAGAAATAAACT 
•CCCTCTAAATGATAGTCTGTAGGGGGCAGGCTCTGTTTCTTCACGCTGGTCACCGTGTTG 
AAGATGAGGAGCCCCAGGAAACGGTAAGTGTGAGAATCAGAGGACAGTCACTCCAGGCTC 
AACTGTGGGTTTCTTCCTGGGACACTGGGCCCAGGGATTCCCAAATGAATCTGTCCCTGT 
GCCACTGGAAACATTAGACAGTAAGAGGGGTGGGCTGTTCTGAAGGGTCTTGCATCGCAC 
AGTGGACACGAGTGTGCAGAATTAAGAGAAAGGAGAAAATGGCCATTGAGTCGAACTTCA 
GGATGAACCAGTATTTTAATAAAGCTTATGTTGATGATTAAAGTCTGCCCGCGTACCT 
Sequence 1102 

AGGTACTGCCATTTCTTTC I Mill I AAAGAAGATATGTTTACAAAATAGGTGACCAACT 

AAGGTAGGTGTGTGTGAAGAGAGGCAGTGAGAGAGAGGGGATCCTGAAATTCTTTTCAGT 

AAAGGAAAAAGATAATTTTAGGGATGTTTTTAAAAATATATAACTTGATGAAI I I II I AT 

AGCATGAGTATTATTTAGGTGCTGTCCTAGACCAGTGTGGCGTATAGGCTATAGAAGATT 

GGTCCTGAATGCTAACGGCAGGGATGAGTGTAAGATCATCAAAACTTATTTGTGGGAAGG 

AGAGCTACTGACAGAGCCATGTATTACATGACTCCAGTCTGGGCTGATAGGCCTGCCTTA 

AAGCATAGCTAGAGGGCAGGCACGGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAGGC 
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TGAGGCGGGCAGGTCACTTGAAGCCAGGAGTTTGAGAC 
Sequence 1103 

AGGTACCAAGCAGATTCTTTATGTTCCTTAGTGATAAGTAATAGGGATAAAAAATTATTC 

TTTCTTCTTCCACTGAGCCAGTGTAAGAGTTCC I I 1 II I H I CCAAAACAAATAAGATAG 

TAAAGCCATAAGAACAGCACTTGTATGCTTTTTGTATTCTAAGAATATGAGAAAAATGAT 

TTCAG TG AACCTTTATCTG AAAGG ATATAACCATGTCCTG ATGGAATACTGC G ATGGGC A 

GGTAGAAGGGAATGAGATGGGCCAGGCGCAGTGGCTTACGCCTGTAATCCCAGCACTTTA 

GGAGGCCGAGGCTGGCAGATCAGAAGGCCAGGAGTTCAAGACCAGCCTGGCCCGCATGGT 

GAAACCCCGTCTCTACTAAAAAAAATACAAAAAATTAGCTGGGCACGGGGGCATGCGCCT 

GTAATCCCAGCTACT 

Sequence 1104 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1105 

CCGCGGTGGCGGCCGCCCGGGCAGGTCCGGACAGGTACAAATGTGTAAACAGAAAGCCAT 

GGAACACCGACTGTAATGAAGCTAGTAAAAAGCCTGAAAGGGAAGATTACAAACAACGGT 

ATGCATACTGTATATAATGCAGTTCAGATATTTTAGGCTTGAAAATGGATTTATTCCTTC 

CTATTTATGTATTTATTTTATMTAGGTAATAGTATAGAATGCAAAAGGCTTTTATTTM 

TGAAAATCAGCTTTCCTACCTTTGCCTCCAACTGCTCAGCTCCTCTATAGAACTGTTTCC 

AGTTTTTTCTCTGCCTTTTCAAAGATGATATATTTCCAAACATACACTTACTATACAGAC 

AATAACI I I I I AACCAAAAAGCACCATATTATACACATTGGTTAGCATCTTCCTTCTCCC 

CTTAATATTGTATTGAAACATTATTCTCCAACA I I I I 1 GTTAATTTGC A I I I I 1A 

Sequence 1106 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 
ACCATTCAAGGCTTCTGGCTCTTGGCAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 
ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATA 
Sequence 1107 

AGGTACAAGGAGACATTTCCCATCGCTATTAGGAAATGGGCATAGATMGAATTTTCAAG 

TGTTTCCAAGATCCCCAAACTCAAACCATCCTACCACTGCTTATCCATCGACATCAGTGG 

CTTTTACATCAGACAAAGGTGGAGTAAATCAAACATACCACCCATGGCTTCTTGAGTAGC 

ATGAGGAATGAAATAAGAA I ITTTTTTTTTl I GAGACGGGGTCTCGCTCTGTTGCCCAGG 

CCAGAGTGCAGTGGCACAATCTCTGGTCACTGCAACCTCTGCCTCCCCGGCTCAAGCAAT 

TCTCATGCCTCAGCCTCCCAGGNGGTGGGAATTACTGGTGCCTGCCACCACGCCAGGTTA 

ACTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTTTTAGTAANAGATGGGGTTTCAG 

GATGTTGGCCAGGCTGGTCTCAAACTC 

Sequence 1108 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGTACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1109 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAATCCAGGAACTTGGTCTTCCACC 

TGCACCATCTCTAATCTGTGAAGCTGGAAGGTGATGTTTTGTAAGCCCAGCATCCCTCCC 

CTCCTTCCTTGGGTAAAAATCTCCATCCCTCTGGCTACAGAGGTGGGCACAGAGCCCAGA 

ATGAGCCAATCACAGTAACTCATCTCCCACCAGGATTTTCTATATTGAAGCCAGAGAGGA 

AACTTCCTGCCTCTCTGGAGTATGTGAGCCAGGAACTGCCATGTCCTCACCAAGGAGAGA 

AAAGCTGATGTGCATAGAGAAGCACAGATAGACAGTGCTGAGCAATGGTTCAGGAAGAGT 

ATCTTCCTCCTGTACCTGCCCG 

Sequence 1110 
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TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAGAATGTGCA 

TTTATTTGTGGGGAGGAGAGGAGGTCTAAATAGACAATAAACATATAAATATTAATATTG 

TGAAAG TGGTATG AAGAAAAATAAAGCATGATAAGGAGAGAGAGGTGTTCCCTGGCTCTC 

CAGAGGGCTGAGGTGGAAGAAAAGGACTCCTGTTTTAGGTAAGATATCAGAGAAGACCTC 

TCTGATTGACTGAAATGTATGCAAAGGCCTGCATGAAGCCAGGGACCATGCCGTGCTGAC 

ATCTGGGGGAGGATTATCCTCACAGAGGGGAAAGGAAGTGCAAATGCCGTGAGGTGGGAG 

GACCATGTTTGGTATGTTCAGGGAGGAACAGGGAGGCCCGTGGGGCTGGAGCAGGTTCAG 

CCATGGGAAGGCCAAAGATGAGATCAAAGAGAA 

Sequence 1111 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTGAACCCAAC 

CTACAGGCTGGCCCCTGTAGACACAGAGAACAGGTCCATCCCAGCACCTGGATGACCCTT 

GCAGATCAAAAGCCTTCCTTGGCACCATGTCAGCCCCTATGGAAATAGGCTTCAGGCCAG 

CCCCTGTGGATACAGGCTGCAGGCTCATACTTGCAAACCCAGGCTTCAGGCCCACCCCTG 

CAAACCTAAAAAACAGGTCTACCCCAGTGGGCACCTGCAAAAGGCCAGCTTGCATGAGGA 

CTCTAGCTGCAAGCACGCTCATGGACCACATTAGCCAGCCTACTCAAAATCTCTGTGAAG 

GGCTTTCCCAAACAAGCCAGTCTGCAAAGACTAGAATAAATCTCTACTTCTTCAGATGCA 

TGGACACCAACAGCTAGTCACAAAGATTAAAAACAATCAGAGTAACATGAAATCATTGAA 

GAAATAAAATAAAAC 

Sequence 1112 

CCGGGCAGGTACCTGTGAACCCAACCTACAGGCTGGCCCCTGTAGACACAGAGAACAGGT 

CCATCCCAGCACCTGGATGACCCTTGCAGATCAAAAGCCTTCCTTGGCACCATGTCAGCC 

CCTATGGAAATAGGCTTCAGGCCAGCCCCTGTGGATACAGGCTGCAGGCTCATACTTGCA 

AACCCAGGCTTCAGGCCCACCCCTGCAAACCTAAAAAACAGGTCTACCCCAGTGGGCACC 

TGCAAAAGGCCAGCTTGCATGAGGACTCTAGCTGCAAGCACGCTCATGGACCACATTAGC 

CAGCCTACTCAAAATCTCTGTGAAGGGCTTTCCCAAACAAGCCAGTCTGCAAAGACTAGA 

ATAAATCTCTACTTCTTCAGATGCATGGACACCAACAGCTAGTCACAAAGATTAAAAACA 

ATCAGAGTAACATGAAATCATTGAAGAAATAAAATAAAAC 

Sequence 1113 

TTGGAGCTCCACCGCNGTGGCGGNCGAGGTACTTTAGGAGACCCATGCGGGCANATTGCC 

TGAGGTCAGGAGTTTGAGACCAGCCTGGCTAACATGGNGAAACCCTGNCTCTACTAAAAA 

TACAAAAATTAGCCGGGCATGGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGA 

GGCAAGAGAATCGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCGNAGATCGCGCCACT 

GCACTCCAGCATGGGCCGACAGAGCAAGGACTCCATCTNAAAATAAAGAAAGAAAGGAAA 

CAATNGAAAAGAAAAGCTTATATTGAACCTTCTCTAAAAAAANGNAAAAAAAAAGGAAAG 

GCCTGGATGCACACAAATCTAAAATTTTGGCAAGGTCGATCAATTAAAGGGATATTTTAT 

TTTGGCATTCCACAAAAAATAAANTTNCTTTACTTCCCCCCAAAAAATCAAATTAAAA^ 

GTTTCAAAAATAGCCAACCTTTTTCCTTMTGGTGGTTTTAAAAAA 

Sequence 1114 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1115 

CCGGGCAGGTACTTGGACCATCCACAGCCCAGCAAGGCAGAGCAGGATGCTTCTATTCCT 

CCTGACACCCATGAGGCCCTGCTTCAGACAGCCCTTTTCTCCTCCTCCTCCTCCCACCAG 

GCCTGTGTCTCCTCCCCAAAAGGCAAAAGAGGCACCAAACACCCAAGCCCAGGTCAGGAG 

CCTCCCATCCCTCAGAACTCCCTTCCCCTAGGAAGCAGATCACAGAAATGGGGTGGGCCT 

GGTGGCTGTTCAGAAAGTCCCAGGCCCCTGAGAGAGGAGGGCTCTATTGTGAAGCCATTT 

AGCTCTTCCCAACACTCATCACTGCTGCATTTATCTAGGCCATATCTAACACATACCACT 

GGGGATCCTAGGGCTAGGCACTGCGTGGAGCCAAAGGGAACGTGGCCTCACACCCAAGGA 
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GATCCAAGTCTGATGCAGAAGGCCCCGCCTCATTTCTTAGGTCTG 
Sequence 1116 

AGGTACGCGGGGCGTTCTTGCCGTGATGGCCGCCGCCGAGGCAGGGAGTTGAGGGTTTCT 

GCC7TTCGGAGGGTTTGGGGGATTGTCGGTCCCGTTAGAAGGCCACCAACAGCCCCCCAG 

GTTGCCAAGGGCCTGTTAGTTGTGTTCCTGGAAGGTGGGCAAGCAGCCTGATGGGTTAAC 

ATTGGGATTTGCCGTCGTTATAGCGGTCAAGGGGCCTGTAGCGGCTGCAGGGTCGGTGAG 

GGTTCCCAGC 

Sequence 1117 . 

AGGTACTTTCTTTATGCTCTTGACTGTGTCTTCTGCAGGCAGGACAGAC1TACAAATCAT 

GCAGGGCTTGCCCCTTCAGTCCTTCTGTCATTTCTAGTTAGCTCTCACGTTGCTAATATT 

CCATGACGCCGTCCAGTAACCTGGGCTGCCCTCGCTCCTGACCGCCTCTCCCCCAGCCTG 

GTCAGGGATACCCCTAGAGCAGAGGAGAAAACTGAGCTCCAAGCCCAGAGCTTCCAGCAG 

CTGAGTGGCTCCTGTTGAGAGCCTGGCTGTCCCAACTGTGGCAGCATTCGCAGATTCTGG 

GC ATCTA AGGCTTACCAGATACAAAGGATGGAAGCAGAAATATGCACAGTTCAAAAATGT 

TGCTTTTAGCTTCTAGTGTGAMCTTACAGTAGGAGATCACAAGACTGTTTTGCCCCTGG 

AATATGTAGGTCTTTTGTGAGACAAGGGGGAGGGAGGAAGCGAGGAAAGAGTGAGAGAAG 

AAGGGGAGAGAGAAATGTTGTGAGAGTGCAAAAAGTAGATTTCAAAAGAAATTCATTAAA 

A 

Sequence 1118 

CCGNGGTGGCGGCCGCCCGGCCAGGTACCTCAGAACAGTGGACTCAGAATTGAGGGTTGG 

GGCAGAGATGGCAATCTGTTTAACAGGTTTCCCAGGTGATACTTACACACACTAAAATTT 

GAGAACCACTGGTTTGGATAAACCANAAAGAATTAAATTATAAAACTGACATTGAGGAGG 

TAATCTGTGAGAGAAAAGTGAGCTTGCTATCCACACCAAACTCTATCCCTGGTAGCAACC 

TTCACNTCTGGTTAGTGGGTATGGTATTACCACCTTCTGGCCAAAAAATCTGGAAANAAA 

TAATAAACTGGAGGAAACCACCGAACCAACACTTGGCCGAACNAGAAGAGACAAAACATT 

TCCTTTCAGGGGATCC 

Sequence 1119 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATATTTGTTGAATGACAAAATGAG 

GGGATAAATAAACTTTCCCTAAGATGTGTATGTAGACGTTTGAAAATAGAAGTCTGTCGC 

TGATGTTTCTGTCCACCAGGAGTCAGGCTTCAGCCAGGAAAGAGTGAGCCAGTGTCTGAG 

CTGTCTCTCGCCAGGCCTTCCACCAGCTCCAGGTCACCAGATTCAGGCCAAGACTGAGTT 

TCAGTATTTGTCGTTAACCTGTAGTCCCAGCTACTCAGGATGCTGAGGCAAGATGATTGC 

T 

Sequence 1120 

CCGGGCAGGTACTACTTTCTC TI Mill I GTTTGTTCGTTNTGTTTTTTGTTTTTGAG AC 

AGAGCCTCACTCTGTCACCTAGGCTGGAGTGCAGTGGCTNAATCTCGGCTCGCCGCAACC 

TCCGCCTCCCAAGTTCAAGCGATTCTCCTACCTCAGTCTCCCAAGTAGCTGGGATTACAG 

GCATGCGCCACCACGCCCAGCTAATTTTTTTGTATTTTTAGTAAGAAGACCGGGTT^ 

CATGGTTGGTCAGACTGGGTTCTCAAAACTTCCCAACCTCANGGTGATNCTGTTCTGCCT 

CCGGCCTCCCAAAAGTGCCCGGGGCATTTACAAGACATTGAAGCCAACCCGCCGCCCTGG 

GCCCTAATTGTAGCNTACCCTNTCTTGMTCTTAATTTTGGGATATTGAATGGNGGGTTA 

NNANAAAANGGTTANGTTTTANGAAAAGGAAATGTTTTNCCTGGNNATTMTCTTTCT^ 

TTAACCCCCATCTTTAAACCACTTTTTTTAN^ 

ANTGI MM I CCCTTCNTTTTTGGNAAAAAATCTTTT^ 

/yKTTTCAATNTGGGNATTTTTTGCTTANTGGTCTCCTAACTTTTNTT^ 

AACNAAAAAAACAAAAAAAAAACCCCCTGGCMN^^ 

CCCNAATGGGGCCCCCTACTTTTANTTANGGTTCTTTTTGGCCCAAAAAATTANTGCCNT 
GGGCTTGGGTCCTNNATT I MM I CCTTTCAAAATT 
Sequence 1121 
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AGGTACTTAAAAGCTCGTGGGCCTCA GATAA TTGGCCTCCTAACCAAATTCAGCTGCTGG 

GGCCTGATCAGAA ACTGC TGGGTTTCTTrTTCCCTAAGCCGATTTAATTCCCACCGCCAA 

GCCCTCTGACCTTATTTTCTTTCTTTCTCTGGCCTAATTCCTGGAAAATGGAGTCTGGAA 

ATACCTAGGGACTACTTGGGTGGGCTCCAAAGCTTCAGGGAGAAGAGATAGGACTGAAGG 

ATGCAAAAGGGATGAAGAGAGAAGCCAGGCACCAAGGCCACATCTTGTGACTCCAT7TAC 

AGGAAATATACCGTTTAGGTAAATGCATATCGGTGGCTGCCAGGGGCTGGGGGAGACGGA 

ACTGGGAGTGACTACCTCATGGGTACCTGCCCGGGCGGC 

Sequence 1122 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCACTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1123 

CCGCGGTGGCGGCCGGGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCAT 

TGAGAGGCGTGTTCTCTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTG 

GCTGTGAGGGTTGGGCCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCC 
G 

Sequence 1124 

TCCCCGATGTTCTCTGCATACTCTAGGGGGACAAAGATAGAGTCAGGAGTGGATCCCAGG 

CATGGCATCCACCGTGGCTGCGTCGCTTCATACCTTTCCCAGAACCCCAGCTCCCACANA 

GGTGCAAGTCGCCCTAATGCCCCCACTGTAGCCAAATTCCTTGGGGTCTTGCCTTTGAGG 

CCTCAGTGGAAAGGTCCCAACTTACCTTTCCATGCTTGGATTCCCCTCCTAGCCACACCC 

TTC ACTTNCGC CCTCATGTATTTATTCACTCAACAC AC ATGGC NTGGATGCTN NTAAGGT 

CTGAATGTTTNTGTCCCC 

Sequence 1125 

ACTCTCCACCGCGGTGGCGGCCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGT 

CACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGAC 

CACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTAC 

AAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTC 

TATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGA 

GAAGGCCTTAGGATCTACTGGGGGGAGTCCCGAGGAGCAGTCTCTTCTGTCCCGCTGGAA 

CCCATGGTCCACTTGAAGTTCCTTATGCTACTTTCACTTGAGCCATNCTATGAAATACAC 

CAGTGAGAAATTCC 

Sequence 1126 

AGGTACTAAAGAATGTGCATTTATTTGTGGGGAGGAGAGGAGGTCTAAATAGACAATAAA 

CATATAAATATTAATATTGTGAAAGTGGTATGAAGAAAAATAAAGCATGATAAGGAGAGA 

GAGGTGTTCCCTGGCTCTCCAGAGGGCTGAGGTGGAAGAAAAGGACTCCTGTTTTAGGTA 

AGATATCAGAGAAGACCTCTCTGATTGACTGAAATGTATGCAAAGGCCTGCATGAAGCCA 

GGGACCATGCCGTGCTGACATCTGGGGGAGGATTATCCTCACAGAGGGGAAAGGAAGTGC 

AAATGCCGTGAGGTGGGAGGACCATGTTTGGTATGTTCAGGGAGGAACAGGGAGGCCAGT 

GGGGCCTGGACANGTTCAGCCATGGGAAGGCCAAAGATNAGATCAAAGAGAATAGTGGGA 

GATCACGTAGAGATTTATTGGGCCATTTGTGAGGATTGTGGCAGCATTGAAAAAAACTGG 

GTTTG ATG CTTTAAAAAC AAC N AGCC C ATTT 

Sequence 1127 

AGGTACGCGGGATTATTTCTGGAATTTTCCGTTTACTCTTTGCAGACTTTGGTTGACCTT 

GGGGAGCAAACCGAGGACCGCGAAACCACAGATCAAGGGGGAGCTCGTGTCATAGCAGCT 

CTAACTTAGTGCTTACAACATGCAGGCACTGTTCTAGTCATTTTGCGTATATTCATGAAT 

CCTCCCACAACCACCGTGTTCACGAGGAAACCGAGGCAGAGGGAGGTTTAATGAGTTGCT 

GGAGGCAGCCCANCTGGTGAGCTGAGGCTCANCCCGCACCGACCCGTGCTCTGGCCCCAA 

AACACTCCTCGGTAACCCCCATCCTGCCCACACAAAATGGCCCCGGAACACANCCGGCTG 
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CTT TGGAG TCANTTTCTCTCTCGAATGACTTTTTCCACCGTTGGCTTCNATCTGGGAGTT 

CCCTTTTTCTGTGTATmGCCTGTCAATTTTCTATTCCTTTGGATGTT 

TCCCAGGGNGCTAAAMGGGGGCTGGGGGGGNGTGGTTTTTTTCAATTCCTCTACAGGGN 

GGGCCCTNGGGATGGGTAACAAAAAANCCGGANMGAATNCCATANTTTTCACCCAANNT 

GAAAAAC TTGCGG GNCCCCNNTCAAATNTNGAAAAACTTGAACCCCTTGGGAAANAATTC 

TTGGCNA l l l J J I ACTTNAAAAGCCANAAAACCTTGTCTTTAANTCCAAANNNNNNANAA 

AAANAANCNNNNNNNAAMTTCCNNTCCCNCGGNGNNCNNNCNNTTTTTTNNMTT^TTTT 

NGNNNNCCCCCCNNCNNNNGGGGANNTTrNTTTTTTNNNAN I I I I 1 1 II I 1 CCCCCCNCC 

CCCCCCGGGGG 

Sequence 1128 

AGGTACGCGGGAAGMTTTTAAGTTCGTGGATTAATATCACTACTTGAATACTGACAGTT 
GTTGATTAGACACCGAAAGGTTACTGATTGTTGAATGTATCTGTGTTAGAGCTGTGCACT 
GGCACGCTTGCATCAGGGGCTGGGGCCACA 
Sequence 1129 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1130 

CCGGGCAGGTACTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGAC 

CAGCCTGGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCAT 

GGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCNCTTGAAC 

CCANGAGGTGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCATCATGGGCGACA 

GAGCAAGACTCCATCTCAAAATAAAGAAAGAAAGAAACAAAGAAAAGAAAAGCTTATATT 

GAACTTCTCTAAAAAAAAGAAAAAAAAGNAAAGCCTGATGCACACAAAATCTAAATTTGG 

CAAGGTCGATTCAATTAAAGGGATNTTTAT^ 

CCCCCCAAAAAATCAAATAAAAAAAAGGTTCAAAATNGCCAAACCTTT 
Sequence 1131 

AGGTACCAGGTTGGGCTAAGAGATGAAAAACCAGGCTCAAGTTGGGCCAGAGATGGACAA 

TGAGCCACTTCCACCAGAATTCCCAGCCACATATGCCCACCTCTGGGTCAGAAGAGCCAA 

ACGT ATTTATCGCTTTTGCTCAGGGCCAACATAGACACTTGATTCTCTCTCAGGGCTGTG 

TTTTCCCCCAAATGTATCMTACATCCCTATTTTCTTCCCCCTrTGTT^ 

GTGACCTGCTACCCCACACCCCAGCACACTGATGGTGTTCCCTCCACCACACCGAGACCC 

CGCCACAAGCTCTCAGCCCTCGTAGAGCACAGGCCCCACCCTGAAAGTTGGCAGAGCACC 

GGGCTATCAAGCTGTGTCATTGTGGAGCCAGCAAGCCCGGCACAACCGTGGATCCCGGGT 
ACCTGCCCG 

Sequence 1132 

AGGTACAGCTGCATGTGTCTTTGGGGGGCTAAGTAAGGACTGGCCTATCTGAGCACACCT 

TCCAGGGACATTTATATGCCCTGTCCACCGTCACCAGTGATAACATCAAGGGCCTGAACA 

GAGGACCATCCCAGCTGCTGGCACCTGTATATGTTATTCATGAACCTAGAGACTGACCCA 

CTTAGAACTGTGAGGGCCCTCACGTGACATCCAAGGGCCTGACGATAGATGTGCCTCGCC 

CTCAGCTACCACCTCCAGACACTGTTGGAGTCTAGGCATTGGCTGGCCATACCTGCAGCT 

GCTGACACACACACACACACACACACCTTTCAGGGGCTCAATAATGGACCTACCTTGCCT 

GCCACTGGCACCTGTGGGCACTTCCCAGGGGCCCAGGGACTGGCTTCGTCCACCCTGCCT 

CTACCACTATTGTCCCCCACATGCATTANCTTAAGACCCCAAGGGATA 

Sequence 1133 

AGGTACTTGGGCTTAGGCTGGGGTTGAGGGCTGGGCCCTGAGCCGAGAACACCTGTTAAG 
TGACTTACTCCACAGGAACTACTTAATGGCTGGTCCCTGATAGACTGGGAACTCCATGAG 
AGCAGGGGTGGGGGGCTCATTTGTCCTTGTGACCCCGGCACTCTATGCTTAGTTGAACGA 
ATGCTGGATGGGCTGATGCGGGCGGTGTCCTCTTTCCCTTGCTGCAATCAACTGGAACAG 
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GGTGGGAGGGAGCCCTGGGATCCTCCTGGAATCCTTACACAGGAACTCCAGACCCCGAGC 

CACCAGTGGCCAAGCAGATCTCCTTCCCGCGTACCTGCCCG 

Sequence 1134 

CCGGGCAGGTACTAGAAGGATTAATTATATTGCTTGCATTTTTTTTAAAA 
TTGTGACTGAGTCTCACTCTG7TGCCCAGGCTAGAGTGCAGTGTTGTAATCTCGGCTCAC 
TGCAAC CTCCAC CT CCCGG GTTCAAGCGATTCTCCTTCCTCAGCCTCCTGCCACCACACC 
TGGCTAATTTTTGTA I I I I I AGTAAAGACAGGGTTTCACCATGTTGACCAGGCTAGTCTC 
GAACTCCTGCCCTCAGGTGATCTGCCCGTCTCGGCCTCCCAAAATGCTTGGATTACGGGC 

GTGAGCCTCCACGCCCAGCCGC7TGTATTTTTTAAMTAAAGATTGCAGGTTTAGTTTTA 
ATTTGGATAATTrCTTCMTCTGTTT^ 

TGCTTACTAC ATGTGG N AGT7TTTACC ATGTAAAATATTGCCGTTTC TAG TTTTAAATTT 

TTATGACTGTACCT 

Sequence 1135 

AGGTACGCGGGAGGGAAGAGTGGCCTTATGGCTACTCAAGCCACGTGTTTACCTAAAATG 

TGCTCCTAGTC GCATAG CTTCTCAGAATAGTCAAAGAAGAAAGGAAAAGAAAGCTGGATG 

TCCCGGGTTTA i i i i i i CTGCATCTTTGCAAATATGAGACACAACTCTGTTCCAGGAGCT 

TTCGATGCGCTTCTACAGGTATTTCTTTTTTCTTTGTGTCAAGGATAAATGTGAAGATAG 

CTGTGGMTTGTCCGAACATTTTTGCCTTGAAATTAAGCMTGTTTTATG 

CAAAGGACAAGTTATTTATGCCAGTATTAAAAGCAGCAACTTATATATTCTACGCAAAAC 

AGAAAACCAGAAAGATCTAAATTTTGACAAGTCCCTAGTTTGATGAATTGAAAGACAGGA 

AGATGACTACATCACAGCAGATAAAAGAGATATTTTCATAAACTAAGCTCTGGGAAACAG 

AATACCAAAATTACCCATGTACAACCAACTACTGCTCATGGGTTCCCCANAAGACTTGTG 

TGCGGATTATACGAGTCATTGCCCAGTTGTGATAGTCTCAATTTTGAA 

Sequence 1136 

AGGTACGCGGGGAGTATGAGTAGAAAATGTGACACAGAGATAGGAATTAGTCACATAATG 

TTGGAAAMGAAGAAACAGTMCCAGGAACGGGAAGGAGTAMCAGAGCTTTTAGTTAAA 

ACAGGATCACTGCCAGACACGATGGCTCATACCTGTAATCCCAGCACCTTGGGAGACCGA 

GGTAGGTGGATCACGAGGTCAGGAGTTCAAGACCAGTCTGGCCAACACAGTGAAACCCCA 

TCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGTGCGTGCCTGTAGTCCCAGCTA 

CTCCGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAACCG 

AGATCGCGCCATTGCACTTCAGCCTGGGTGACAAAGCGAGACTCCATCTCANANAAAAAA 
AAAAAAN 

Sequence 1137 

GTGATTGCTTTCCAAGGCCACCGAGGGTATCTGAGACTCCCAGGCCCTGGTTGGTCCTGC 

CTCTTCTCCTTCATAGTGTCCCAGTGTTGTCAGGAGGGCGCTGGTGGTAGCTTGGACCTT 

GTGTGCCAACGCTTCCTCAGGTCTGGGCCTAACAGCCTCCACTGCCTGGGCTCACTCAGG 

GAGCGCCTCATTATTTGGGCAGCCATGGATTCTATCCCAGCCCCATCATCAGTTCAGGGA 

CACAACCTGACTGAAGATGCCAGACATCCTGAGAGTTGGCAGAACACAGG 
Sequence 1138 

CGGCCGCCCGGGCAGGTACATGGGAAAAGAGGACAAAGTGGTGAGTTAAATCAGCTAGTC 

TGTAGCAAGGCATGTTTCCCCAGGGACTCAGGTCCCCGCTTCAAGAGAACCTGTTCTGGA 

GACCCCTTCCTTGACTGTGTCTGCTGTGCTGGGCTGGGACTTAGTGGACAGAGACAGCAG 

CCTGTAAGCCCCTTTTCCCAAGTAAGGAGCAAGGGCACATTGACATANAAGCAAGCTGAC 

ATTTCCCAAGGGACGTAGGAAGTGCCTGCACTTCATGTTTTCAGATGAGTCTGCAAAGGA 
AAACAGCAGTGGTACCT 

Sequence 1139 

AGGTACCATGCAATTACAGAGCAACTTCAGCTAATGGGGACTCCAGCATCTTGGCATCCT 

CGCTGGGTGGAGCAGAAGCGAGCATGGAGCTTCCACTCATAAACCCTTTGGCTTGCTAGC 

CACAACTGATCACCTAG7TTTACGTAAGTGCAAGGGGTGGCTGGAAGTAGAGGTTCCAAG 
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AAGGTGCAGAAAGCT GTCT ATTAGTGAGCATTMTAATGTATGTTAGTGTTTAACAAGAA 

GTTGCTGGTAAAGCGTTTTGTGCAGTGGCTATCACATGGAAAATTTTCCAATAAAAGAAA 

TATACTATTACAGTTATAAATAATGTCCATTAGAATAAGCATTACATAACTCCTTCAAAT 

CAAAAGACATATATTTGAATATGTGATAGCCAATCAAATATTATCTAAAATTAGGAAGGN 

GGGTCCATAGGAACTGAATATATATTTTTAAAAATGGAAATTGAAGGAG 

Sequence 1140 

AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAGTTTGTGGGTTGAGGGTAGGA 

GGCTAAGACTGCCAGGGAGGAGGCCCCGGCTTCCTGCTCCTACCGGGGAGCAGCAGGTGA 

AACTCTGACCACGTTACTGCAATCCTGACATGCTCCAGTGGAGTGGCGACATTTTTCTTC 

TCGAGGCAGCTTTTGGGGATCTGTATTACmCATGTGGACCACTTGCTAGTmGATTT 

CAATGATAATTTCTTCCTTCCI I I I II I I I I NAANCCTGTNTTNNTAGNGATAACAACNC 

ATTTANCATTTGTTTNGATTTNACCCTCTCCTCTCTCCCCACTNTNAGTNTGNAGCCAGG 

AAAGCAGGGACNTCCTGTGCNAACTGTCAAACCACCACAACCACACTNTGGAGNAGGAAT 

GCCAATGGGGACCCTGTCTGCAATGCCTGTGGGCTCTACTACAAGCTTNACAATGTAAGT 

GGACTGGGATCANCAANAACAGGGCTCGCTTCCTGATGGGGACCACAAAACAGNGTNCCA 

CCACCCTTTCAAGTGAATCCNCACCATNGGGGCAAATNACAGGTCCNAATAATTGATGCC 

ATAG GACCTAN CTTGGAAACTACTTTTTTTAACATAACCCTCCTTAAGCCC AGGGGCTTA 

AAACTG GNANC CAAAAAACCCCTTGGTTTCCTTTGGAACTTCCCAATACCATGCCCNAAA 

ATTCCAATTTTCTTAATNGGGNCCTTTTGGGTTTNTTTGGGGAAA 

Sequence 1141 

AGGTACGCGGGATTCAGTAACAGAAATCTCCCTTCCCTTCTTTCTCTGATGTGCAGTAGT 

TCTGCCTGTTANAATCATCTGGAGAGCTTGTTAAAAATACCAATGTCAGGAGCTCGACCC 

TAGAGATTCAAATTTAACTGATCTGGTGTGGGACCTGGAAATCCTCTATTTTTTAAAAGT 

TCCCCAGTTGAGGCCAGGTGCGGTGGCTCACGCCTGAGTCCCGGCACTTTGGGAGGCCGA 

GGCGTGTGGATCACTTGAGGTCGGGAGTTCGAGACCAGCCTGGCCGGCGTGGTGAAACTC 

CATCTCTACTAAAAATACAAAAAAAATTAGCCGGGTGTGGTGGTGCACACCTGTAGTCCT 

AGCTACTCAAAAGCTGAGGCATGAAAATCCTTTGAACCTGGGANGCGGACATCCANTGAG 

CTGAAATCATGCCACTTGCACTTCCACCTGGACMCANANTGAGAATTCATTTTCCAAAA 
AAAAAAAAAA 

Sequence 1142 

AGGTACTCGGCCTACTTCCCTATTTTCMTGCCTAATGATTTTTGCTGCGCCTGAACACA 
TCTGGAGGGAGCCAGGCCCACTGGCTGCCTCTCGTCCCACTCCAGGCCCTCCTACAAAGG 
TGCACTCAGCAAACCCTCCAAGTACCTGCCCG 
Sequence 1143 

CATAAATAAGGGTATCACAGGGTAGAAGTAGGATAGAACATAGGCCTAAAACCCCTATAA 

GCCCATTGTCTGAGCTGACCTCCACAGACTGGTGAGTTTTATGGTTCTCATCANACCCGC 

ATNTATTTAAGACAAACTGGGCTGNGGGTCCCTGAAATGAAAACCGGATGAGGGTCTCCT 

TTCGTCTTGCTTTATGCCCTTCAGAGTTTGACTTGTAACCAAAGAGGGAGCATTCTCCCT 

tGGTCTNTGCCTCTGGGGGTGTGATTTTCAGGTCACGTTCCAGTAGCTAGTCTAAAAAGC 

ANTGGGAATCCTGAACCATNGTCAAAATTTTAAAGCNGCACTNNTN I I I I i M I CCCCAA 

AATGTGCCCAAAGCNTGTCAAGGGGGAGTTCTTGTCCCTAAAAMGGGAGTTTNTTATTC 

CNTTGAAAGGGGCTTCTTGGNTNGCCTTCCATTTTTTTGTTNGGCCTGGGNTTANNGCCT 

GGGNNAAAGNTCTAAAANNCCCNTTGAAAANGGCCCTGCCCCTTGNNGGTNNNCAAAGGC 

GNTTCANCAAGGGTTNGGCCAGGGTNTTTTGANCTNAAAAAGGCCCCCCCCCANTNNTGG 

NNCNNCNTGNTNCCNTGNTGCCTCCANTNNGGGGANNACCAAGTTNACNCCCNTTCTTTT 

AAACCCNGCTTGN TGGG N N AAC ANGGN N NATCTTG NTATTTTTTAMCTTTATTTNTTTN 

GGGG NGAGAAAAATTTTGGGGGGGGGNNTCNTTAAAAGGGTTNNNNTTTTTTCCCCNCC 

TTTTATAAAAAAAAGNATNCCNAANTTTTTGGTN 

NCCCCCCCCCCCCC 
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Sequence "1144 

TNCNGGCAGGTTNTCGGGGAGACTTTCCCTGCCGGCACATGGGCCTGGTCCAACCCTCAC 
AGCCAGTAGACGAGCTGGAGCTCTCGGTGCTNGAGCGGCAGCCAGAAGAGAACACGCCTC 
TCAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGANGAGGTCCCCC 
Sequence 1145 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACAGTGGGCGCTGGGG 

ACCAGGGCCAGAGAGTGGCCACCTGTTAGAGCTGCATGTGTGTCTTGGGCCTGGTCTTCT 

TTCTCTGCT CACTAATGACAG CTGTCCCAGCCTCTTGCAGGCTCTGGCCCACCACCTTGC 

TTGCTGTGA I rTTTTTTTTTI I CCTCACACATGATAAGCTGAGGCAGGAGAATCCGCTTG 

AACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTACGCCACTGCATTCCAGCCTGGGCG 

ACAGAGCAAGACTCCGTCTNAAAAAAAAAAAANANNAAAAAAAAAGGTGACAAGCAGAGC 

AAGCGCATCTAGGTGACACCCGGGANATGGAATCGG GACGG CAGGATG AGC ATGGGTTTG 

TGGCGAAGAAAACCCGGGGCAATAATGGACATACAATGCAAANTTTGCCANAGTCAGCAA 

AAAGCCTATNNGGTAGAAAATTGCCAAAGAACANAAAAAAGAGGGGAAGGGTNAACNCCT 

AGTGNAGGCTACTTTGCTNNTGACAAGNCANGOCCGGGCNATTAAGGGNGTGGTTTNCAC 

TTGGGANGTGAAAGGAAAGGGNGGGAAACCCTGGAGGGGGGAANAATGCTTGGGGANCCC 

TTTTGAAAATCCCCCNCTTTTTCCCTTGGAAACNCAAAAANTAAAAA 

Sequence 1 146 

AGGTACCAGTAGGAAATTGGGGAAGAGCCAGGAAGGGACAGCAGCCAGTAAAGTGGGCAT 

TATCAAATCAGTTAAGCATTGTGGGCACCTGGAGCTTANTCACACTGGGGGACTCTTATA 

CAGCATGGAATGCTTGTCTTGTGGTTGACCCATCCTAGAGGCAAGGGAGTAGGTATTTAT 

TCACCAAAATCTGTCAGGCATTTGTTGGGGACTGCCTTCAGAGGCAAATGATTCCCTGTC 

ATTTCAGGACTATCACTTGGGCAGCCAAGAGTAATCCTCCAGGCAAAGGAATGCAGGTGT 

TGGCAGTCAAAAGACAAGGTGGTGGGCGCTGAAGTGGTGCCATGGGAGTATAGTCGAGGC 

ACCAAGCAG CATC TACCA GCAGG GCAGCCTGTAACCCCGACTTTNCACCTGAGGATTTGT 

TGGCCCTNATTTTNGGANTTTTTGCATTNGTTTCTCATGGAGGGGGCAAACTNNGTTTAA 

GCTTGGGAACCAC7TNANCTTGACCCCACACCAAGCATTTT 

Sequence 1147 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTTTTGTAGAGACAGGGTTTTGTCATGTTGG 

CCAGGCTGGTCTTAAACTCCTGACCTCAGGTGATCCACCCGCCTCGGCCTCCCAAGGTGC 

TGGGATTACAGGCATGAGCCACTGTGCCCGGCTTATTTCCTCTTTTAAAACTTACTATTA 

TTATTGTCATT CTCAAAC TATTATCTCATAAACGTTATATTGTCTATATCAGATATTCAG 

TGTGGGCTATGTTTTTTAAA TAACAG ATGGTATTTGAATATATAAAAGATATAAATCCTT 

TTCAAMT GTTGCTT TGTCTATTTTTCTGAAAAGTTTGACAACCATCTGTAGA 

CAGAAGACI Mill I CTTCCCTCACTTTTAGTCTCTTTAAAATC AATGACTATTTTTATT 

CTGCTTGGTCANGCAAGTANGCCCTTTGATTTTACAACAGCCTNTATTCCCATGGAAACA 

CACACAAACACACATACATATAATTTCATTTTTTGCCTTTAAAAATTTCTT^^ 

Sequence 1148 

ACGGTAATACATACACCAAGAACAGAGACAGTTACAAAAACAAGTAAGTGTTGATATAGG 

CGGACCCACAAACAAGAATGTAAACATAATCAGATAGAAATTATCTGCTAAAATACATTT 

GTCTATGCC AATAG GCATTTTACTTAACAATAGGTTATCAAAGAGTCMTCTTTGAGAAC 

ATATTTCA TATTTTCTCTT CATTATCTAAGAAATGTGAGCTATTGAAACACAATATATCT 

TAGGCAACI I I I II I I I I CTTTCCAGAAAATCTGGCTCTGAACTGAATACATGTAGTGCT 

TTG GTTTAT AAAG ATTATAG AACGTA I I I I I I I I I I CTTACAAGGGTTGCCCAGACTTAT 

TTGGGATGGTGAGGTTCTACCATATAGGAACGAGGCAGTAGTAGCATACATACTGAAGCA 

ACTGCTGTGCTTACAGACTAGTAAGCACAATGGGAAA 

Sequence 1149 

CCGGGCAGGTACGCGGAGGGAAGGCAGCTCTTCAAAGGAAAGGATGCTTGGAGACAGAAC 
ACTGGACTTCTTTCTTGCGAGCCTTCCCCAACCTCCCCAGGAGTCAAGCGCTTCCTCCTG 
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G GTGCT TCCGGAAGTCGTCATTAATACCTCTCTTCCAGCATGAGTTATGGCACCACTGTG 

GTTTTTCTGATGATGTGCCTTCTCATCCATCCAATTCAAGTTCCCAGCAGGCAGTGATTT 

TGTCTGTCTGCCCTCCATATCCACCCTCCACCCTTTCCCCACTGCTCTCCCATGTGGATG 

GCACCCTGGTGTCCTGTGCCCTCTGGCTTCTGGGGGCATTTGGCCAATGCGGCTCACCAT 

CAGCAGGAGATGAGAGCTGAGCTCTTGGTCTCTCTGGCAGCTTCCCTACAGGGTTCCTGT 

GAGCTGGTGGCCTCCTGACTAAAGGCGACAAGCCTCTTTGTCAAGAAGTCCCCCACATAA 

CTTCAAGTTCTGGAAGCAGCTTCCTTTCCT 

Sequence 1150 

CCGCGGTGGCGGCCGAGGTACCAGCAAGTTCCTTTCAAAGGTCTGTGGACCTTCACAGAG 
TTTAAGCTATAAATCAATAGTTGCTGGGCCTCAGGGGAAAGGAGAATTGGTGAATCCTGG 
CAGCCTC ACAGAC TTTAGAAATCAGAAGGCCCAGACAGATTCTAATTGTAAATGAGGCTG 
TAGGTTTGi I I I IGGA7TTTTGTTTTCTACTTTTAGTAACATCCAGAATGAGTTACTGCC 
CTTCCAAGGAAGGCCATCCATTCCCCAAGCAATTCAGTGTTTGAATTGTGATGCTGCATG 
CCCCAGTCTCATTACAAACAAACAGAGCAGTGCTGCGGCAACCTGGCAACGTCCTTAAAT 
GACCCCCTCCACACTCTGCCAGGCCCTGGGTGTCTCCACCAGCATACAGGCTG 
Sequence 1151 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1152 

CCGGGCAGGGTACTTTTTCTTCTTCTTC I iTTTTTTTTTTI I A AAGAGAAC ATCTGAATG 

ATGTATTCGGAAAAAAACATTTCCCAAGAATATGGAGAAAGGGAATTGAACATAATTCCC 

AGTGTAAGAGTGGAATGAGCATGTGGCCCTGNGGGCCTAGTCCCAGTTTGTTAATCTTGT 

GAGCAGAGCTGCTGGCGGCTGCTCTCAGTCCGCTAATCGGGCTCTTTCTCTGTCACGTGG 

ACAGGAGACTGCTCACAGGGTTGCAGTTGGCTGCTGCCCTGCGTTGACAGCTGGTTCTAA 

TTGTGCAGAGCTACCGTAGTACCT 

Sequence 1153 

AGGTACCATTCATTACCTGTCTTCAGCAACTACCAGCAAATGGCCAATTTGATGTCATTT 

TTGTGTCMCCTACTACCTTCTAAGCCAATTGGATTATCTTCCAAAACATCCCAGACCTT 

ATATCCTGTCACACATAAGTATTTCAGAATCTAAGGTAGTCTTATCTAAGGTAGTCTTTT 

ATG CCATC TT ATGCC TTTATGTGTCCMTTATTTTCTGTTAMGTTCCTCATTTCTCCTC 

CATATTTTCCATTTTAACAAAAGCCAACTCTTCTGTGGGC < I I < I U I CTTTGTATACAC 

ATGGTTCCAGTCACCCTGAGGTCAAGCTGCTGCAATGCTTTATTTACCTTGNAGATATTG 

GGTATATTTAGTATTCC TCTT ATCTGCAAGCTAAGGGAGACAAAAAATAACAAAACTTCT 

TAATCTGCTTNCTTTCCTTTTTCTCCTTGGGTTGGACTTCCTACTCTCTGCCTCTGGAAG 

Sequence 1154 

CCGGGCAGGTACTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGAC 

CAGCCTGGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCAT 

GGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCGCTTGAAC 

CCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCATGGGCGACA 

GAGTAAGACTCCATCTCAAAATAAAGAAGGAAAGAAACAAAGAAAAGAAAAGCTTATTAT 

TGAACCTTCTCTAANAAAAMGAAAAAAAAMGAAAGCCTTCCGCGTTACCTTCGGGCCC 

GCTNTAGAACCTAGTGGGGATCCCCNCGGGGCTTGCAGGGAAATTCCGATATCAAGGCCT 

TATCCNATACCCGTCGGACCTTNGGAGGGGGGGGGCCCCGGNACCC 

Sequence 1155 

CACCGCGGNGGCGGCTCGAGGTACAGCATNCTGTTAAGTNACCTCTAAAGCCCTGNCAGT 
NAACT CTGTCAG AAAATAAAACAGAATGTCATTATTAGGAAAACAAGAAAGCCCAAACTC 
TCTAG rTTTTTI GTCTTAATAAACTGAAATTTTATGTGTGGTTTCTGCTGAAATGTTAAT 
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TTCCAAGCATAGTTTTAATGAATATGCTTTACAGAAAAAAAAATCATGTTTGTATTTCAC 
AATAAAAAATTCTGAAAATGGATTNTCTTCACATTANTCTCGAATGTGCAAAAAATACCT 
TGGAGAATTCTTTTAAAAAAAAATCTTGGTTATTTTTAAATTTTN 

AAAGNAAGGAAATTNAAATTTTCTATT7TCACCNAAAAACAANAAAAAACAGGNTCCNTT 

TGNTTTATTATCTGGGAAAAAAAAACCATTTTTTAAAAANTGNTCCCCCTCCTGGATCCA 

AATNNAAATATTTAGNACCATTANTGGGTGGTTTGNNTNATAATTAAAAATGGTANTTTT 

TCTTTC 

Sequence 1156 

.CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCT7TGGACGAGGAGGTCCCCC 
Sequence 1157 

ACTTAGCACTGGTTTTCAAGCTGAAAAGTAAGTATAAGTCAGCMTAAATAGTTGCCATT 

TATGAATACACAGGACAAAGAAAATTGAGCACCTAGTGGCTGATGTTTGCAAAATATTAA 

ATAACTTAAGTTCATTGAATACMTTAGAGTCAAATTAATGAAGCTTAGAGATCTAGTGT 

^ATTTAAACTACTAGTAGTGGGCCAAGTGTTGCCATTAACATTTTTTTAATGATGAM 

GMTAGTAAAAA TATG TTTATTTTACCTAATAGGTTCATTCTAGCmAAAAAAAGTGTT 

GmCATGAAGCTTTTGTGTATTTTGCATATATAGATACCCCAGTGTAAACTGTATTTGT 

TACTTTGGCTCATAGTCCAAAAACTCTTGAAAGCCACTAATCTAGCCCAGTGCCTTCATT 

TCATAGGTGGCAAAATTAAACCTATGGGTCATCTGATTTGCCAAAATTCACTCAGGGAGT 

TAGTGATAAAAATCCTTGGG GCTTCAGGACTCCAAGGCCAGAGA I M I NI IA AAAAAAC 

CAAACCCAGG 

Sequence 1158 

CCCGTTGTGGCGGCCGNTNTACTTTANNTAAAGACGGATTTCGCCATGTTGGCCAGGCTG 

GTCTCGAACTCTGACTTCAGGTGATCCGCCCGCCTCAGTCTCCCAAAGTGCTGGGATTAC 

AGGCATGAGCCACC ATGT CCCGCCTGGTTTGTTTTTTAAAAAATCTCTGGCCTTGGAGTC 

TGAAGCCCC AAGG ATTTTATCACTAACTCCCTGAGTGAATTTTGGCAAATCAGATGACCC 

AT AGGTT TAATTTTGCCACCTATGAAATGAAGGCACTGGGCTAGATTAGTGGCTTTCAAG 

AGTTTTTGGACTATGAGCCAAAGTAACAAATACAGGNTTACACTGGGGTATCTATATATG 

CAAAATACACAAA AGCTTC ATGAMCAACACTTTrTTTAAAGCTAGAATGAACCTATTAG 

GTAAAATAAACATATTTTTACTATTCTATTTCATCATTAAAAAAATGTTAATGGCAACAC 

TTTGGC 

Sequence 1159 

AGCCACGCTGAAAACTCACGAGCAAGAAGGAAGTGCTTATTTCTATCTGCCTTATTTTTG 

TAGTTGTTACGCAGCAATATTATGGCAAGAGTTAACAGATATACAACTTAAATTTCAAAA 

AATAGGGAATTTAATAAATTGCAACACATCACATAGTATTAAACCGTCATTATTATTAAT 

GACAATGATCTATATTTATATATACATGGGATATGTCTAAAACAtTAACAAAAAAAGTGG 

CGGGGCATGGTGGCACATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGAGTAAGAT 

CGCTTGAGCTCANGAGTTCCGAGACCAACCGTGGGCAACATAAGTAAGGACTTTGTCTCT 

ACAANAAATAACAANAATTAAGCCAGGCATCCTTGGCATGTGCCTGTGGTCCCAGCTACT 

TGGGAAGCCTTG 

Sequence 1160 

CCGGGCAGGTACATTATCCAAGAGAGAACACAAGAGTTTAACAGAAGTGACAGGAGACTG 

CAAAAACTAGAAAGGAAAAGAAAAACAGCCAGCCTGTGTGACCAANACCAGCTGGAAATC 

AGGAGTGACTCTATAATACAGGAGAAAGTAAGCMTGGCTTTTCTGTGGTCCATTTTCTC 

ACTGGGGAATCATACAATCTAGATCTTGGGAGAGCATTTTGACCCTCTCAATCCCTGAAT 

CTAGCATAGGGAGCAGTCAGGGGACTGAGACAAGGAACTGCTCAAGGGAGGGAAAATGCT 

TTTAGTTCCACACCCTTCCTGAAGCCTAAGCAGCTATAGCAAGATGCCATTGTTAATCCT 

A 
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CCGGGCAGGTACCTCTGCTGCATCAACAGGGTGCAGTGGAAGCTAAGCCTCCACTCCCAC 
TTAATATCAATGAGGTGGAATGAGGTGCTCTGAAGACATACTGGTGACACTCAGCTTCTT 
CTTTTCCACTGTCATGGTGTCACTGGGGCCCAGAAGGGAGCTGAACTTCTATCACCATCG 
TGCAGTAACAAAGTTGTGGGACTGAGCCCTCCTCCTCCTTCTCCCAGGTGGCAGTGAAGA 
CCATCGGGA AAGT GACCTTCTACCTCAACTCAAGACAATCAAGACAATGAAAGCAAGTTG 
GTGCCCTGCTTTTGCTAGGAAGGGTCAGCAGCACCAAGTGGGGAGCTGAATGTTCACCCC 
ACTGATTTGCAATGAGGCACTATGAGTAAAGTGACCCAT7TTTTATTGGTG77TTATTGG 
■TGGTGCCCAACAGAAAGTTAAACATACACACCGATCTTAGCCTTCATAAAAACTGACACT 
TCAAAAGATACCACACTCMGAGAAAATCATTATGTTCNGGGATTTANTTTGACAATTGG 
ATGTTTAAAGGCCAGTCATTGAAAMTGCTTTNA 
TTTGGNTATAGGATNAAAAAAAAGGGAAA 
Sequence 1162 

ATCAGGGGATAACGCANGGGAAAGNAACATGTGAGCNAAAGGCCAGTCAAAAGGCCAGGA 

ACCGTTAAAAAGGCCGCGTTTGCTTGGCCGTTXTTTTCCATTAGGCTTCCCGCCCCCCCT 

GACGAGGCATCCACAANANAAATCGGACGCTCAAAGTCATGAGGTTGGCTGAAAACCCGG 

ACAGGGAACTTATTANAAGATTACCANGGGCCGTTTCCCCCTGGGAAAGCCTTCCCTTNG 

TTGCCGCC TCTTC CCTGTTTCNCGMCCCTTGCCCGCTTTACCCGGNAATAACCCTGGTT 

CCCGGCCCTTTTTCTTCCCCTTTTCNGGGAAAAGCNGGTGGGGCCGCCT 

Sequence 1163 

AGGTACACATCTAATGACATCTTACTTAATACAGGGTTTTTTTCTTT^ 
GCTCTGTTGCCCTGGCTGCAGTGCAATCATGGCTCACTGCAGCCTCAACCTCCTGGGCTT 
AAGCGATC CTCCCA CC TNAGC CTCCTCAGTAGCATGGGACCACAGGTGTGCAGCACCACA 
CCCAGCTAA7TTTTCTATTTTTTGTAGAGATC 

ATTTTGGTTCTTTTATGGCAATTATTCCTTCCTTTACCTCCCTGAGGATCTTAAACATAC 
ACTTTAAAATTGTTCTATTAmACAC^^ 

TTCCCACTCCTGCAAAAAAGAAMCATACTATAAATACTGTTTCATGATTCCCATTTTTT 
CCCACTAAGCAACAAAACTTANGAGATATTTCTGTATCANAACCTGCCCG 
Sequence 1164 

CCGGGCAGGTACAGAGGACAAGCCCCACTTTATTCGGAGTTAACCTATGGTGCAGTGGCC 

TTAGAAAAACTTAAGAGTAAAAAGCTACCATAAAAMTGCTGGTTATATGTATGACCAGA 

CGGTCAACTCCTCCTTCTTCTCTCTCTTTGCTATAACAGCACCTGTTGGCTTCTAAGTGC 

ACAGAGTTCTCACAACAATGTGCCAAAACeGCCACCCTTCCCATCCTGGCACCCACCTNC 

CCCCACCCCAGGAGACCTGCATATCAGATATTACTTCCTGCTGTCTATGACCACACCACC 

TTGGCAGTTAACATCAAAAGCAAACACATGGGGAGAACAGAAGGCACTGGAGCAGCGGGG 

GAAGAACAAGCATAGGTAATGTTTCATAAACATTTTTCTGGTGCCCAGGTGCAAAAACCT 

TGGAACTTCACATGTATTTCAGCTGATCATCTAAGGACCCACCTTCTTTCTTCCAAGATA 

AGTGAGCTGAANGCCCTTCCCTGCCTCCACTCCCAGTGCTTCTGTACCTCGGGCCG 
Sequence 1165 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGT 
Sequence 1166 

ACTTTTGGNAAATTGCNGCTCCCCGCGGTGGTTGCCGCCCGGGCTNGTNCAAATTTTTTC 

TGGACAATCTGTGCTGTGATTTTGGAAATTGTGTGTGAAAGAGGAGAGTAGCTGGGCCGT 

GGTGGCTCGCGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGAGGGTGGATCATGAGGTC 

AGTTCAAGACCAGCCTGGCCAAGATGGTGAAACCCTGTCTCTACTAAAAATAAAAAAAAT 

AGCCAGGCATGGTGGTGGGTGCCTGTAATCCCAGCTACTCNGGAGGCTAAGGCAGAGAAT 

TGCTTGAACCCGGGAGGCAGAGGTTGGAGTGAGTTGAGATCACGCCACTGCACTTCAGCC 
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TGGGCAACANAGTGAGACTCTGTCTCAAAAATAAAAACAAACAAACAAAAAAGAAGAGAG 
TGAAAGAATTTGAGTCCTTGCACAAGACAATMCTC 

TTCTT ATGT TT AAAACTTTTTAATGTTTAAMCTTTCAAAAAGGTTTGAAATTAA^ 

TCATCTTTTATTGCCTTGCCANTTTATTAMTTAAGCCAAAMNTTTT 

GGTCNAAAAGGGCCCACCAAAAAANTTAMTTTTTTTAGGCTTTAAAGGTACCCNTCNG 

GCCCGN TTCTAA NAACTAGTGGGATCCCCCCCGGGCCTTNNNANNGNAANTTTCGATTAT 

CAAAGCCTTTTTCTGANTACCCGTCTNACCCTTCNAAGGGGGGGGGGCC 

Sequence 1167 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAATTTGGGGGCCC 

CACCTCACATCAGGCMGGGCAGCTCATTTAAACTCTATTTTTATCTCTCCTACACAATA 

TACCAAGCTGTCTACAAGGATTGAGTGAATCCAGATTTCAAGGTAAAAAGAAAGACTAAC 

TGCCACATTAAGCCAAATCTAGAAGCATGAAAAAGATCTACCCCTACAGTAATTAATCTG 

AATCTTGGGCCTTCAAACCTCCAAMTrTAGGCCCAATATACTAGMCTTTAGGCGTTCC 

TCTTTGGAAAAAGAAAACCAGAACTAGATACTTGTAGCCCACACAACCAACCTGAAGACT 

CTTGAATCAGGCATGGCTCTCTCGGTAAATAATJAAGAACAGGCTCAGCCAGGCGCAGTG 

GCTCATGCCTGTAATCCCAGCACTTTGGGAGACCGAGGCGGGCGGATCATGAAGTCAGAA 

GATCGAGACCATCCTGGCCAACATGGTGAAACCCCATTTCTACTAAAAATACCAAAATTA 

GCTGGGTGTGGTGGCGTGTGCCTGTAATCCCAGCTACTTCGGGACGCTGANGCACCGAGA 

ATCACTTGAACCCAGGAGGTGGAGTTTTGCAGTGAGCCCGAGATTGCGCCACTGGCACTC 

CAACCTGGGTGACAGAAGAANGACTTCCATTTNAAAAAAGNAAAAGAACAGGGCTCAACT 

TANTTCAATTCCAGGAAT7TTTCCCAAAAAAAAAAAAAAAAAAA 

Sequence 1168 

AGGTACCAGCAAGCCCTGGGGGCCGAGAGTTCTGCCTGGTCCCACCTGATCACTCTGTCA 

TGACCTCAGTTCATGTGAACCTTTCGCCCACCTTCGTCTCAAACCCAGCCCTGAGACCCT 

CCATGCTGGGATGCGTGCAGGGACCCCGTGCAGGCTTCCCAGAGCAGGGGAGCACTCAAC 

TATGCAGAACATGTCAACTC CACAG ATGCACAGCAGGTCATGACTGAAATGACTTCCCAA 

GGGAACTGTCCTTGTCATCCTTTTTAAAAACATTAAATGTCAGAAAGACGATATGAGAGC 

TCAGACTGCGCCCACGTGGGGAACTGAGAGAGAGAGGCCCAGTACCTGCCCG 
Sequence 1169 

CCGGGCAGGTACTGGCATTCTAAATGCMGGATAAGCTTAAAGAGATTATTTGAATCTAA 

GAGTGTGGGAGGGACATTGGCTCTCACGGTCCAGCAGGTGTGAGTCGGATCTAGCTAACT 

CCCTGAGGATCAAGGATTCAATTGCCATGCTGAACGGGCACCCCTGATTCCTACAAGTTC 

AACTCTTTATGAGACTTCTCCCATTTACCTTCTCTGGACCTCGCTTAGCACTACCCTATG 

TGAGGGAATCATGGGGACAGAAGGAAGCCTCCTCGCCCTGCCACTTCCCAGCAGCCCTGC 

CCTGAAGGCACTCTCCCCACACTTTGGTTCTTTAAGGGAGCCCCACATNATGCAAAGCGC 

TGAACCACAAGCTCGCTCGTTACCTTGGCATCACCAGCACTGACCAAGAAAAGC 
Sequence 1170 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1171 

GGTGTGGTGGCTTATGGAGTCCCTAACGTTTGCACAAAATGGTTGCTACATTCCACACTC 

CCATGTGCCCTGAGCAACCTCCAGGGAAACTTCAGATACCGTGGGGCTTGGCCTGCCAGC 

TGAGCTGACCCAGCGATCCTTTCTACCACAGCTTTGATCTTCACTGGGTTGGGAGCCTGT 

CTGAATCTAAACATCTGCCTCTCTACTGCCCACCCCTTCGGCATTTCCTAGCTCTCCCTG 

AAAGGTGCAAACATGCATGCAGTTACTCAAACAGAATATTTATCCTTAACTCCCTGTCCC 

TCACAATTTCTTTTAAAACAATCCACTA7TGTCTACCCCCGCCCCCGCATCACCC7TTCA 

TCCAGGTGCCCATTCCCTTCTCTCTAGTCCTGCAATACCTAAGCATCGATCTGCCTTTAG 

CACATCCATCCCCGCCTTATTCTCCACGCAGGAGCTGAGTGACCTGCACAAAATCCCCTG 
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CTAAATACGG N ACCT 
Sequence 1172 

AG G TACTTTC AC ATG C TATAC C AC ATTTAATTAATTCTAAC AAC AGC ACTGG AAAAT AAG 

TATAATTATGCTCTTTAGTTCCTTCTAGACAGAMGCTTTCATGGGGATGAAAGACTGTT 

CCTTAGCTCAGAGTAGGGAATCAAAGTAGGCGGGGCTTAGTTCAGAGTGGGCGGGGCTTA 

GCTCAGAATAGGCCATCAA7TTGTAGAATTGAAGTGTGAACTCTGAGGATGAAGAAACAG 

GGCTGCAGGGGTGTGTAACTTGCACCAAGTCACCCTACTTTACATTGCGCCACACCGCCC 

AGCTAAGTCTTCGAMTGTTCATAAGGTGCTTGAGCACTGATTCCTTTGCCTGTTCAATG 

TTTGAATTCCACTGTCACTGGCTTTTCTTTTTTCCTCGTMTGMGCAAMCC 

TTAATTAATCCTTCTTGATGCTGGGCAGCAGTAATAGGATCAAGAAGAAAGCCTTTCTCC 

TCCtGTGCTCCCAAGGATGCCCAGCGGGGAGACTGTGANGAGATNGAACACAGCCTTCAC 

TCTGTTAATTTGTTCAAGAGCAAAATCANCATGGAGATCTGTCCTCTCCANGGCTCTGGC 

TTCCTCAAGGACAGCTGGCCACCCAGGTNCTATCCATCTTGTAAGGTCCACCGCCGTACC 
CTTGCCCCGGG 

Sequence 1173 

CGAGGTACATTTGGAGCTTTATTTCCACTGGCMTGCAGAAACACCCAGCAAAAATGTGA 

GAGGAGTTGGTGGGGCTTCCTAAGAGGACAAGAAGTGGAACCTGGCAGGAGATCCATGGG 

AGCTCCTGGGACTAGAGTTCTGGGAAGTTGAAAGAAATTCTCAGAAAGAGATATGTAATT 

TCCAGACTGGGAGAGGAGCCTTCCCTGCAA I Mill r FGTGGGG AGG AGAAGGGCATGGC 

TCACGTGTTGTCCTGAGCTTTGTTGGCCCCCAACTGCAGGCAGAGAACCAGGAGCAGGGT 

GGCAGGGCTGGCCCTGAACAGGAGCTGGAGCAAAGCGCATGCTGGAGAAAACAGAAGGCA 

ATGTGTCCCAANAAAAAGTGGTCCCCGCGTACCTGCCCN 

Sequence 1174 

CCGGGCAGGTACCCAAATCTTAGCCCAACCCCTGGTATAGAGTAGTTCCTGGGCAGGTCA 

TTGAAGAGGGACATTTGTGCCTGGAGATGGGAATGTGTAGGACTTGTAGGTGGGAATGTC 

AGGTTGTCCAGGGTCACTTTAATGGAAGGGGGAGAAAAGGMGAGTTTGAGGCCAGACTT 

GGAGGCCCTT GGAGAA TGTGGTAAGGAGAAAGATTAGACTTGATTATGAACACACTGGAG 

AATGACTTGTG I I I I I I GAGCAGGGGAGTGATATGCTTAGATACTTGTT7TGGACAACAG 

AAAAAAATGGAGCCAGAGGAACTAATATCTTTGTGTATATGMCTTATTCATTTATTCAA 

AATTTATTGCAGACCAACCAGGC I I I II I I I I I GAGACTGAATCTTGCTCTGTCACCCAG 

GCTAGAGTGCAATGGCATGATCTCAGCTCACTGCAACCTCCGCCTCCTGGGCTCAAGCAA 

TT CTCCT GC CTCAT CTTCTTGAACAGCTTGGATTACAGGCATTCGCCACCACATTCCGCT 

AAl I I 1 IGTATTTTTAAGCAAAAATGGGGG 

Sequence 1175 

AGGTACCCAGCTGGGCTCTGGAGGGAAATCCGGAGGTAGATTCAGCTTCTGTCCTTGCCG 

GGTGGCAAACCGGGCCCTTCCATCTGGAGCTCCTAMTAATTCGTTTAAATAATTACTGC 

TAGCCCAAGGGCTCACGCCTTTAGTCCCAGCACTTTGGGAGGCCGAGGATAGTGGATTGC 

TTGAGCCCAGGAGTTTGAGACCAGCCTGGGCAATATGGCAAAATCCTGTCCCTACAAAGA 

TGTTAATAAAAAATTAGCTAGGAGTGGTGGTGCAAACCTGTAATCCCAGCTATCTTGGAG 

GCTGACTTGGGAGGATCGCTTGAGGCTGAGAGGCAGAGGTTGCAGTCAGCAAGGATCATG 

CCACTGCACTCCAGCCTGCGCAACAGAGCAAGGCCCAGTCTCAAAAATAAATTAATTTTA 

AAAAGGAGGGTGGGGCTGCCTTCCCACAGGCCTCAGCTGCTTCTGTAGAGAGCACACTGT 

TCAAATGCAGTGGCCTNTAAGCTACCCCNAAGAAAGCCCATCTTCANAGAGCTCACATGA 

TGGGAGGTTCCCGAGTTNAAAGCGNACCACCTTGCAAAGCCTTNTGNGGCACCTTCCTTN 

CCCTTATA CCNNA GAAAAMTTAMGGAGTAGGGCCTTCTNGGGCCCCTTTNTTACTTNA 

AGNGNCNNTTTTNNAAANAMTTCCCCCCCTTTTT 

Sequence 1176 

AGGTACGCGGGATTTCTGGAATTTTCCGTTTACTC 

GAGCAAACCGAGGACCGCGAAACCACAGATCAAGGGGGAGCTCGTGTCATAGCAGCTCTA 
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ACTTAGTGCTTACAACATGCAGGCACTGTTCTAGTCATTTTGCGTATATTCATGAATCCT 

CCCACAACCACCGTGTTCACGAGGAAGCGGAGGCAGAGGGAGGTTTAGTGAGTTGCTGGA 

GGCAGCCCAGCTGGTGAGCTGAGGCTCAGCCGCACGACCGTGCTCTGGCCCCAAAACACT 

CCTCGGTAACCCCCATCCTGCCCACACAAATGGCCCCGGAACACAGCGGCTGCTTTGAGT 

CAGTTTCTCTCTCGATGACT7TTCCACGTTGGCTTCGATCTGGGAGTTCCGTT 

Sequence1177 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCGGAAGAGAACACGCCTCT 
.CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1178 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACGATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1179 

AGGTACTGACTTTTGGAGTTACTTAAGAACCACAGTTCCTAAAACATTTTATTTGGGAGC 

TCAACTTCTGGCTTCCTACTTACCAAGAGGCAGTAAGTTATAACCTCTGGCCTTTGTCTT 

TACATTTTCAACTTTCTGGATCCCTAACCTCTGGTCCCAAGTTACTACGCTTGCCTATAT 

CCACTAACCACT CACTAGGTCA CAAAGTTGATGGCTCTGGCTTGACCCAGTGAATGAGAA 

G TGTCCTA GATGI '"""""Nil ICGCAAATCTTCATAGGCACAGTTGCCTGTTT 

AAl I I I I ICTTCAGCCCTTCCCTTAGAGGAGGGAGCCTTAAGTTACCTTTCAAGGTTGGG 

GGTTAGGAATCTATCATAACTAGTCTAGAGATTTCTCACCAAGGGAAATTCTCCTTATCT 

AAAAGAGGAACTTCAGGTCTCAACCCTGCCAGTCACACCCAAT 

Sequence 1 180 

AGGTACGCGGGGAGCTCATTCCACTGAAGGTGATTGTGACAGATGAGTTTCAATAAATAA 

GCCCCTTCACATGACATACATGTGAAAAATAACTTTTGGTTTATAATAAAATGCTTGTTT 

TACACATTCTTATCCTATTTGGTTATAATATAAATTCTTAAATGTAACAATTATACTTCC 

AGAATGTGTTATTTGAATTACTTGGTTATTTTATCTTTTTTTATGAGAATTTGTTCCCTT 

CTCATTTTCTAACCACATTTAAATTGATACTGTTTGAGTATAAACTTAGTATTTCCCTTC 

ATGCTATAAATAAGTTACTATC I I I I I I 1 1 I I I I AACTTTTCCATTTGTTCTTTTTGTTG 

TAGACAGGGTCTTGCTCTGTCATTCAGGCTGGAGTGCAGTG 
Sequence 1181 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I 1 1 1 I I I I 1 1 1 1 I I I I I I I I I M I 1 1 I GAG 

ACAGTGTCTCGCTCTTGGGGCCCAGGCTGGAGTGCAGTGGTGCTACTGCGGCTCACTGCA 

ACCTNTGCCTCCTGGGTTCAAGCGATTCTCCTGCTTCAGCCTCCCAAGTAGCTGGGATTA 

CAGGTGCTCATCACCACNCCCATCTAATTTTTCTATTTTTAATAGANACCAGGTTTCATC 

ATGTTGGCCAGGCTCTTCTCTCTACTGNGATAATTTGATTAGCTCCTTTATGTTCCAGCT 

TGAACGTATGTATGGTAGGTTAATCCTCCCAAATCACTGGGGAAGCCATTTATCATTGAA 

ATTTTACAAATGGGGAAGCTGAGGCGAAGTGATnTCTCAATGTCNCACAGAGAAAAAGT 

AGTTTCAGTAGGACTCAAAACCAGGGCTTTGAGTCAAGTTAACGACTTTTCCACCATTTT 

ACAGCTGCCTTCATGGGTAAAGCAACTTAGTTCTCTTAAAGACACAGGACCAACATGCAG 

GAGACTCANACAGTTGTAGGTGCCTGAGAGTAACAGCAGCAAGCCAGCAACTTNTTCAAA 

AGCAGTCGCTAATTAATAAGTGCAGGGCCCTTANAAGCCTCGCTTTCCCCTNTTTTGAAN 

ACAGATGGCAACCGGCACCCTGGNCCTTTCTACCTAACTNCAGGNTGNTTTTTTCATT 
Sequence 1182 

GTGGCTAAGACTGGACTGTAGAACAGGGAGAGTATCTGGGGTTGTCCAGTGGGCTCAATA 

TAGTCACAGTCAGGGAATGTTCTCTGGCCGGAGGCAGGAGAGACATGGCAGAAGACTAGA 

GGCTGAGACTGCCCAACCCACCATTGCTGGAGGGGGCCACTTGGAAAGCATGAGAAGGAA 

GGGGGACCCAGTCCTACAGCTGCAAGGAATAATTGAGCCAACTGTGCATAAGCTTGAAGG 

CCTCCAGGGTGAACCCAGCACCTCAGACACTGGCCCCATGAGACTGAGCAGAGGACCCAG 
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CTGGGCCTGGCTGAACTCAAGCATCTGACTTAGAGAATTNGTGAGATAATCAGTGGTATT 
TAAACTGCCAAG CC TGTGGTGATTTGTTGTCATGGAAGCAATAG AAAACTAATACAAC AG 
GGCTGGGCGCGCCTGTNATCCTAGCACTTTGGGAGGTGNGCGNGATCACCTTATGTCAGG 
AGTTCAAGATCAGCCTGGCCAACATGGTGAAACCTAGTCTCTACTAAAAATACAA 
Sequence 1 183 

AGGTAC I J I I I I i I I I 1 I I I I I I I I M I I I I GAGAGATGGAGTCTTGGTATGTTGCCCAG 
GCTGGCCTCANACTCCTGGGTTTAAATGACCCTCCCTCCACGGCCCTCTGAATAGCTGGG 
ACTACAAGCACACACCACCATGCCCTGCTCCAAAATTCTTTTGATCTGAATTTCCAAGAT 
AGGTAAAGTTTTTGCTCTGGMTTTTCTCTATATTATCACCATGTGAAGAGATAATTCCC 
TGACTCCTCGCCTCCCAAAAAGGACATTTGAACCAAGTGAAGCATCCTCGTTTATTAACC 
CTAI I I I I CCATGGCTNTTTCACANAGCTA I I I I i I I I I i I I i I I GANACGGAGTCTCGC 
TNTGTTGCCCAGGCTGGAGTGCAGTGGCNCCATCTTAGCTCACTGCAAGCTCCGCCTCC 
Sequence 1184 

ACATACAAACCCCCTTGCCTTCTGTTCATTCTCCCTCACAGCAGCCAAAGTGGGCAAACC 

AGAAAATGGACTGGCCAGCCTTACCAGTGTGCCTGTCTGACCAGCGGGCGACCTGGTCAC 

TGCCCCAGAGGGTCAAGGGAAACCTATTCTCCTTGGCCACATTCCTTGAGATGGTGATCA 

CAGAGGACACTCAGTTGCCACATCAGGACTTTGATATTCAGATGGAAAAGGCTTATATGT 

TGATTTGAGTTAGTAATTAGGGCTTTTAAACTGCTCTTGTCGACATAAACACATACACTA 

TTATTCAGT 

Sequence 1185 

CCGGGCAGGTACAAAAGCTTGTTTGGGTGGCTTCAGTGTTTGGGAGTGTATATGAAGTCC 

CTACTGCCAGAGGCCCCATANACCATTTTGCAAGAGGCTTGAGCAAATCTTAAAGCTTAA 

ATATATGTTCCTGCAGACTGAGGCAAAAGAATC GCTTGAACCCGGGATGCGGAGGTTG CA 

GTGAGACGAGTATCGCGGCATTGCACTTCAGCCTGGGCAAGAGGGCAAGACTATGTCTTC 

AAAAAAAAAAGAAAATNACCAAMCTTTTTGAAANCTGGGGCCGCAATGGCATGTGCCCT 

ATCGNTCCCAACTTACTCAGGGAGGGNTGAAAGTGGGGMGGATTATTTTGGCNCCCCAA 

NAAAATTTTGNAGGCCCACCAGGTNGNGGGTATTAATCAATCTTGGNCAAAATAANCCCA 

CTTGC AACTTAC C AAAAACTTGG 

Sequence 1186 

GCTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACTGAGGCTGGTGCACTCCACATTCCCA 
CCCACATCCTGCCACATGTTGGTGAGAGTCTGGGGAGCCCCAGCCTACTGGCACGCCTTG 
TTCCCTCTGAATTTCTGGAGCTTATTTATTTTTACCACCTCCCTCTCTGCTTTCTGTGTC 
CATTCCTTCTGCTCCTTTTAGGTAACTGCAGCCAGAGAAAGGATGCTGATAGGATTTTTC 
TGAAGAAAGGTG I I I I IGATCAAGAGAGTTAAAATCACAGAATGAAGTTTGGCTTGTTAT 
ACCTTTAAATTCTGTGGGCTCATGAAAGCCATGATAACCAAGGTTAGTTGTGGCATGGTT 
NGTTCAAAGCCAGTGGGCCCCTTTTCCTAAAAGCCTAAAGTNCATTCATTTTCTTCAAGG 
AGAATCATTGTNGACTGNCCTTAAAGCCTTCAATNATTTAAAAAAGGCCTTAAG I I I I I T 
CTTAGGGCTTTTCACCCTGGTGGA I I I I I CAAAAATTTTCCAAAAACCAAAGCCCCTTAA 
TGGNTAANCTGGGCCA I i I I I GGCTTCCTTAAAAANGAACCCCAA I Mill ICCTTAAAA 
ATACCCC I I I I I GGGGNAATTTTTAATAAATTNAATCCTCCCCAACCCAAAGGGCCATTG 
GGCCNANTTCNTTTGGGGAAAAAAAN I I Mil I ACCNTTATTTACNCCNAGGGGA I Mil 
GGGGGGGGTAAA 
Sequence 1187 

CCGGGCAGGTACTCTACTAGAGATTACACATTATAGAATAGTGCAGAAAGCAAGCCACAT 

GAAAGGGTCAATTTAATGTCGCATCCTGCAGAGGTGAGGAGTTGATCTCCAACTATTCTC 

ATTGGTTACACAATCACATTGAGGTATTTAAATCCTGGCAGGTGGGAGCCTTTGCTTCCT 

TTGCTCAACGGTGAATTCACATGGTGGTGTGGTAGGCAGAGGTTGCACGGCATTCTCTAA 

TAGCTACAACTTACCTAGTCTTATCCAGTATAGGTATCAGGTAAGTGATTACACATGTTC 

AGATCATCMTTTAACAGTAATCAAAGAGTTCTTTTAACTTANAACTCAGGGAAGTCAGG 
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TCTGTCTTCTCAAGCTGAGAAGAACCAAAGCCATGGTATAGACATAACCATACACAGGTA 

ATGTAAAACTATTAAGCT 

Sequence 1188 

GACTATAAAGTATACCAAGGCGTTTTTCCCCTGGAAGCTTCNCTCCGTGCCGCCTCTCCT 

GGTTCCCGACCCTTGCCCGNTTrACNCGGGATTACCTTGTCCCGCCCTTTCTTCCCTTTC 

GGGGAAAGCGTTGGCCGCCTTTTTCTCA1TAGGCTTCACCGCTTGGTNTGGTTATTCTTC 

AANATTTCCGGNTGTTNANGGTTCGGT7TCCGCNTTCCCCAAANC 

Sequence 1189 

CACCCGCGGTGGCGGCCCGCCNGGGCAGGTTCTCGGGAGGAAAAAAAACCCTCCAATTAT 

A TCTGG GGGTGCNGATACANAAMTGTTATCTGGTGTGACTCCTGTGAGGCTTTTGCCTG 

GTTTTTGGAAATGCTCTNATCCANTCATTGGACAGGTTCCTGGAANAGCAGGNTTTGGTN 

CTTGATCACAGCTGAAAAGGTNGTGGAACTGATTCATAGGGGCTTGNTTCAGAGATACAC 

AAGCTAAANACT7TCANCTTCTGTTTTCTGGGNTTCAACGGGCATTTATAT 

Sequence 1190 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACAACTAGAAGATCACTGTAT 

TCAATACTCTAATCCCTGATATATCATGAATGCTGAAAGAGGATTTAGACTCTACTACAC 

AGCCGCCCTGAAAGAGCTGCAGGAGGCACAGGGAATTATATTAACATAGTGGATTGCACG 

TTAGGGACAGAAATGGACCAAGGAGCACATGAGAAGTCCACATTGATAGAGGTGCCTGGC 

ATGGGCTCTCCTTCTCTGCCCAGATGGTGTTGGTGTGACCTGATGGGTGGCACCTTCTTG 

ACMGGATCCTTGCTTTAGATATTTGGAATATTTTATGTTATCAAGCTATTTAGA^ 

TMGTCACTGCAAAAGTGCTCATTCCAAATTCTGTTTCTTGCCTTTGTCTTGAATCCATA 

TCTGAATATTTAGGAAGATTTGGTATTGGTATAGAGAGGTATCAATTTGGCAAAGATCCT 

TCTTTGCCAAATTCCTGGAAGTGGAATTGCTGGTGTCAAGGACAGGTGCACCTGAAAATG 

TNGGACAGGTAATGCCAGATTTCCCCACCCCCCTTTTCTAATAACAGTGAATGACCCTGA 

GAAAGTAATTGN NTCCCC TTTAGAATTTCATTT 

Sequence 1191 

AGGTACTGACACTACCCTTGCTGTGTCGACCAAAAATGTTTCCAGACGTGTCACCCCAGC 

TGAG AACCACTGATTTAGAAGAAGGGAAATCAGGGCGGTGATATGACTGGAAATCATGTT 

TTTTGAAAAATAGTTGAGGGAACTAGGGTGGTTTAATTTGAAGAAGACTAAATATAGTGA 

TATCATTTCAGTATTTGAAAGGTGGTGTTTGGCCTGTTCTCCATCACTCTGGAGGAAGAA 

ACAGAGGCACTGAGTGAAAGTTCTAGGAGATTAGATTTAGACTCAAAATACTTTGGAGAA 

ACTTTCCAACAGAGCTCGCGGAGACTGGCCTGCAAATGCATGTGCAAGCAGGGCCTGGGT 

AAACACACAGGGGGTGGGAAAAGTTTCTAGTACCTCGGC 

Sequence 1192 

CCGCGGTGGCGGCCGAGGTACAACTTTGGCTCTGGGTAGGGGAAAGGAAAACAGCTTCCT 

AGAGAAATGTTAACCACAAGCTAGCCCTCATGAGAGATTTCAGCTAGGATGGATGCTATC 

TGTGTAGTGCAAAAAAAAAAAAAAAATCCATAGCCAGATTTTGTGCTTTAATGTGGACCT 

AGGTCAATAGTGACTGCAAAGCCAGAAGCAGTGGCTCACCCCTGTAATCCCAGCACTTTG 

GGAGGCCGAAGGAGGTGGATCGCTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATG 

GTGAAACCCTATATCTACTAAAAATACAAAAATGAGCCTANAGTGGTGGTGGGTGCCTGT 

ATTCCCAGCTACTTGGGAGGCTGAGGCACAAGAATTGCTTGAACCCAGGAGGCGGAGGTT 

GCAGTGAGCCAAAATCGCACCACTGCATTCTAGCCTGGGCAACAGAGTGAGACACCATCT 

CAAAAATAATAATAATTAAAAAAAAGTGACTGCAGGGGACTGGGAAGAGGAAACACAAAA 
TCTTTACTGGGGAAT 

Sequence 1 1 93 

AGGTACAAATCAAAGATTTGAAATGTTGTGCCGCTCATCAGCTTCACATTCATCTAGATA 
CCTCTCTCGTTAAGACATCTGTCCAAGTAGTAAGGAGGAAATGCTTTAAAAACTTGCTCA 
TTATTAGCCACCTTTAGTCTCACCCAGCCCAGAATTTCTCCTGAACTGAACGAAGTGCTT 
TTTATTCTGTATCGTTTCGAAATTGCTTATTTAAAAATGAAAAACCACAACAAAGCTGCA 
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aataataaatatgacataaaattagttaagaaaaacaggtgatgaaaaccaaaataaaac 

aagatttaaaattctataaccctgcaccatItcttttgtcactaaaacttaatttgm 

ctaagcaatatcgtgtgccactctctgtatctagtcttctctcactctctacaagccaaa 

tttctttaactagctcttactcaaaaacaatcaattatttcttgctcatcttttctcc^ 

actattattaatgagttgggnataaaaaaaaaagcctggaagctggatattctctangng 

ngccacaa / i hill icttccattcataaaccacanggctcantaccagactnactcttt 

aaacctggtccccaaacttcccaccttttcctttggnttttcaantctcttaggtaacaa 

ttntttccccctcnangctnttactnggaattnccattcaccnantttttggaacnaccg 

ggaattta 

Sequence 1194 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAAAAACGTGAATTATTTG 

AAGCAGCAGCTAGAMTAATCTTTTCCCTTCGAGCCCCCATGACCCACTGATTCTCTCTC 

TTGTATCTTGTGTTGCAGTGACTCAACAAATACTCACTGGGCACTGACCACATGCCTCTC 

CCAGAGCATAGGCTGCATCCTACCCTATGTCACAGTTATAACCCATCCTGGCTGTGAAGC 

CCCCTGAGGCCACTCACCTCTGCCTGCCCTCTACAGTGCTTGCCACGGCTATACCGATGT 

TGGTTGCAGGAAGAACACAGAAGCATGGCCACCCTGATTTACCATGGCACATGTGGCAGG 

GTCTGGGCCCTGAGATGTTCTTCTAGTTGGGGCTGGGAGMCGAGCGAGGGGCCAGGCCC 

TCCTCTACACTGCTGCATCCATCAGCTCAACCGTTCATTCATTCTTCAGTCTCATATTTA 

TATATACTTCATTCCATACATCCTTACAATAATCTGTCTTGGAGGGATTATTTTTCCTCA 

TTTTAGAAGTAAAGGGAAACTGACAGAGGGTAAAAGTANTTCAAAGCCAAAAAACTAAGT 

TTATTATTTAATTTAAACTAACATC 

Sequence 1195 

AC II I II I I I t M I I I I I I I I I I I I I I I I NGNAAAGACAGGGTCTNATTCTGTTGCCCAG 

GCTGACCTTAAACTCCNGGGCTCAAGCGGTCCTCCCACTTTGGCCTCCCACAGNGCTAGA 

ATTGCAGGCATGAGCCACTGTGCCCAGCAGTTCTAACTGCTGACAATCTGATAAGNGCAA 

AAAGNAACTTTCTGCTCTGCTGCTTTCATTNGCACTTCTNTACTTTCTATTAAAATTGGG 

CAACACCTGTNTGTTGATTAGCTGTTTGCCAATTGTTCTTCCTTCAAGGACTCCACCCCC 

TTAGAAATGGAGCCTCTCCAGGTGTGTTTGATGAGTTGGGGTGGAGCCNAAACTNTCCCC 

TCCCTTGTTTTGCNTACTGCNCTTTTTGTTCATACAGCCCTGNTNNATTGGACACTGGGC 

NCAAAGNGCTCACAGAACAATGCCCCAAGCTGCCATTCCCC 

Sequence 1196 

GCAGACTACGCGGGAAGCAGTGGTAACAACGCAGACTACGCGGGAAGCAGTGGTAACAAC 
GCATACTACGCGGGAAGCAGTGGTAACAACGCAGACTACGCGGGAAGCANTGGTAACAAC 
GCATACTACGCGGGAAGCAGTGGTAACAACGCAGAGTACCT 
Sequence 1197 

CTGAACAGATGAGAAATTACAATGTTTGCATTCATGACACCTGGAGGAAAACAAACCTAC 

TTACTATTTTTAAACAAAMGAAAGGCAGTTTGCTCATGTAAGAGGMGAAACGAAGAGA 

GCCAACGAGAGTGAGCTTGAGCTCAAAGCAGATTAAATATGCTTTAATAGCTGCATAGAA 

ACCTGAAGCTCATAAATTTTCCTTCATGTGCCTAGGAGGACAGGGMTTGCATATTTATG 

AGTTTCGTGTTGGGAAATCTGTAACTGTATCTTCAACCATTATACAACACCTTTTTGAAG 

TAGCCATTCCAAAGGACTGCCTGTTTTGACATTTAAAGGAAAATAACTTCAGGACAATCA 

TTCCTACCCACATTTTCCACCCCATGNCTTACNNGGCCCTTTTCTTTNACTTCAAAAGGT 

GCCCTAI I I I I GGACCATTCACNACCCCC I III! \ CTTGAAAGTNAGTGGCNTTTTTAGG 

ANCAANTTCAAAAAGTNAGCCCCTTNGCCAAATTTGGNTTCAANTNGAAAAGGGCTTACC 

CAGNCCCCC 

Sequence 1198 

TGCGTATTGGGCGCTTCTTCCGCTTTNCTTNGGTCAACTGACTCGCTNGCGCNTCGGTTC 

GTTTNGGGCTGGCNGGCGANCCGGGTATCAGCTCAACTCAAAAGGNGGGTAAATACCGGT 

TATNCCACAAGAATTCAGGGGAATAACNCNNCGGAAANAAACATNTTNNNACCAAAANGC 
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t™i U . „ ^AANCCCCCCTGGGGTTTTTTTAATTNNNCCCGGGNGTn 
TGNNNANNATTTTTTTNNNCCCCCCTNCNCCCCCCCCCCCCCCCCCCCCCCCCC 



Sequence 1199 
GGG il CGGCCGCCCGGNC ^ 

caaccctnacagccagtagacgagctggagctctcggtgctcgagcggcagcca^mgS 
ISSSSISf tcgtgccactgcactccag catgggcgacagagcaagactccatctcata 

A^A^ST A ^ GCATCACAA MtAATTClTTTACTTCCCCCCCCAAAAAAATTCA 
Sequence 1201 

AGGTACTCCA^TTCTGACTTATGATATAAAMTTTTACTrTGNGCTTGTTGTGATTGA 
CAGT^TTTTTTAAAGGAATCTCMTTTGTGTTTTCTATTGTGTCATT^ 

ISSSSi^ CCA ^ AmGGAATCATCTG TGATACTTATTTCTTCCATGAGAAGAGG 

TMCAACATTCCTGAATGATCATCTTAAGAGCTGGGTTTCCCATTGTGCTTACTAGTCTG 
SI^^P^^^^^^^^ G ^ GG ^ AA ^^^GGGAGAAAAAAAAAATACCTTNTATAAT 

^ C ^ C . TGGTCTGGGGCAAA ^ ACA CTAATAACTACAATCCCCCACCCTCCCCAACAAAT 
GAGTATGATAATATAATCATGCGTCTTGGAGGATTTCTTGGATAAGTAATT^AMCACT 

ag a^™^^^^ 
i ga ^ a ^ ga 7 gg ^ 

TGGAAGCCCACTCTTACNAGTCTTTCTTGAMTMGGTAAAACTTGMTTMCCCCCGCA 
TAAATGGATGGGGGAAGGGCANCCCNGATNGTTGTTTANGTCCTAACGGGTGGC^^AATT 



Sequence 1203 

ESS? CAAGGGGCTGATGGGCCAAAGGAG GGMGGAATTAATTAGAGGAGCTAACAGA 
GGG^G C G^r^r CGAGCA ^ 

GGGCGGAGGTGGTATCTCCGCAGTTACTGAGAGAGGGGCTGTGGTCAGTTTCTTCCCCTT 

t C a™ G ^^^^^^^ 

Sequence 1204 

ccgcggtggcggccgaggtactgacttttggagttacttaagaaccacagttcctaaaac 
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ATTTTATTTGGGAGCTCAACTTCTGGCTTCCTACTTACCAAGAGGCAGTAAGTTATAACC 
TCTGGCCTTTGTCTTTACATTTTCAACTTTCTGGATCCCTAACCTCTGGTCCCAAGTTAC 
TACCCTTGCCTATATCCACTAACCACTCACTAGGTCACAAAGTTGATGGCTCTGGCTTGA 
CCCAGTGAATGAGAAGTGTCCTAGATGTTTTG ! I I fl I I I I I I I I CGCAAATCTTCATAG 
GCACAGTTGCCTGTTTAA / INI I CTTCAGCCCTTCCCTTAGAGGAGGGAGCCTTA 
Sequence 1205 

AGGTACGGCTTAGCTGGACATCACTCCGAATTAAACAAAAAGAAACCTCTTGTGAGCTTA 

GACTTMCCCAATCTAGAGGCAGAGGCTTTGTTGAGATGACCTCCTGGGGTCTTTTGCAG 

TTTACAGTTCAGATCCTTTGCTTGGTTGTTAAGATATGTGTGTGGGGCTCCCTGTAAGAG 

CCAAGAAGCTAGCGCTAGTGAAGAATGTGGACTGTTGTCTCTGCGTGTTTGTATCAGTGT 

GCTGTAGGAAAGATCAAGCTAGCCCACGGCCTCTAGGGCCTGTGCCACAGGGCTCCTTCC 

CGTGCTCAGTATGCGTGCCTTTAATCACAACCCAGCCCTCTTCACTGAGAGCCCTC I GGT 

GC 

Sequence 1206 

CCGGGCAGGTACCCAGCTGAGACTGGCTAATTCTAATTCCTCCCCCACACTAACCTGCCT 

CTGCCAGGCCAGCTGTGCTCACTGGACAGAAACCACCCCATGCTTTCCACCCTCCTGCAT 

GATGGGGACATGGATTCCAGGGCCCAGCTGTCCTGGGACATCACCACCTCTGACTGACTT 

TCCCTCATCAGCAGAGAGGCCTGCCCAAGAGGAGTCTCTCCAGTCAGTGATCAAACGTGC 

CTAGGGTAGGAGCGAGTGCCGCTGCCTCATTCATCACTGCTGCCAGGGCCCGGTGCTCCT 

AGGGTCAAATn"CATTTTCTCAGCATCC7TGTTTCTTCTTGGTGCTGAGGGGTTTGGCC 

Sequence 1207 

GCTCTGGGAGTGTTACATAACGTATGGTAAGTGTGCTACATGGCCTTTCTGAAGCCCAAA 

CCATTTTGCATTATGAAMTACCTGGCCCCAGGMTTTCAGATAAGGGATTGTGGACTTA 

TATTTTAAATCTGTAACTCCTATAGACAGTMTTTCATAGAGCCAGCTTCATGGAGTTGT 

AAATATTACTGGGCTAGGGTCAGTTTCCTCATTTTAGGCTTCCTTTAATCCAGTCCAGAC 

ACCAAATTAAATGTCTTTGTGTAGTCCCCTTCAAAGCCTCTTGCAATCCCAGTCATGCTT 

CAGGGTAGGGATGATCCCATTAATCCAAGGACCTGCCCCTAGTG 

Sequence 1208 

CCGGGCAGGTACGTTCCAGGAATGGTAGATAGAGCATACATATAGGGCAGGCTGAGAGGC 

TGGAAGGGCTGCCTTGAATGCCAGGCTAAGGAATTTGGACTTCCTAAGGAAGCATGGAAT 

GAATTTTAGGCTGGGAAGGGATAGGATCAGATCTATGTTAATATTGCTCACCCCAATTGC 

AGCGTAGAGGATGGGATAGAAGGGAATGTATGGATTCAGGGAGATGGCTTANAACCCTAA 

GATTCATGGTAGCAGGGTCTCGAGCANGGGCTTGCTTGAGCAAGCAGGTCCTGTGGAGCA 

GGTTTGTCTTAACTCTCGTTGTCTTTCTGAATGGGTATCAAAGAGG 

Sequence 1209 

CCGGGCAGGTACGCATGCTCCTTTCCCACTTATGGCAGCCACAGAGTGCTCTGGACCAAC 

AATCTAACCAGTCCGTTGAAGCGTGTGGTGAAGCCGGCCAGGAATCAAGGGGCTCAGAGG 

CAAGATCAGGGGTGTGGGATACAGCAAAGCCCCTGGGTCCTCGGACAGCATCAAATGAAG 

TATACTGTTTGTTTCCACTCTCCATTTAACTGGAATGTCCTGAGTGCTGTTTGCCGGGAC 

CTGGGCCAGGTCCTCTCGCATTCCTGATCTTCAGGATGCTCTCTTGACATTCTGATTGTT 

CTCCTCATTCAGCTTGTTCATTCATTGAGTCCTTCATTCTACCATTTGTTCTTTAGTTTC 

TT 

Sequence 1210 

ACACAGTGGGCGCTGGGGACCAGGGCCAGAGAGTGGCCACCTGTTAGAGCTGCATGTGTG 

TCTTGGGCCTGGTCTTCTTTCTCTGCTCACTAATGACAGCTGTCCCAGCCTCTTGCAGGC 

TCTGGCCCACCACCTTGCTTGCTGTGA I rTTTTTTTTTTI I CCTCACACATGATAAGCTG 

AGGCAGGAGAATCCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTACGCCA 

CTGCATTCCAGCCTGGGCGACAGAGCAAGACTCCGTNTNAAAAAAAAAAAAAAAAAAAAA 

AAAAAAANGTGACAAGCAGAGCAAGCNCATTTAGGTGACACCCGGANATGGAATCGGGAC 
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GGCAGGATNAGCATGGGTTTGTGGCGANAAAACCCGGGGCAATAATGACATACAATGCAA 

ATTTGCCAGAGTCAGCAAAAAGCCTATNTGGTANAAATTGCCAAAGAACAGAAAAAGAGG 

GAAGGGTCAANACCTTANTGCAGGCTACTTTTGNTACTTGACAAGCAACCCGGCACAAAA 

GGAGTGTTTCACTGGAAGTNAAGGAAGGAGGA 

Sequence 1211 

CGCGGTGGCGGCCCGAGGTACAGGAAAGGAGACTTAGTCGGAAAAAAGTCCCCTATAAAG 

GGAGGGGACCAGGAGGGTGCTGTCCTGGAAATCCCAGGAGGAAAATGGTCAACACCCCCA 

GCACTCCACAGAGAGTTGAAGTGAAGGTGTGTGTTGCATCTGGCAACTGGAAACTCATGG 

ATGACTTGAGCATGAACAGGTGCCCCAGGGTGGGGGTGTCATAACCCGGGAGAGAGTGAA 

GCAAGGACTTGGAGAACAGGGGTGGTGGCCACTCTCTGTGGGCAGCTAAATACAGCAGCA 

GCGCAGCTATGATGGCCTTGCCTCTGCACATTCGTGGCTGGCCCGTCAGCCCTCCTCCCT 

GAATCATATCCCC7 TACT CACTGTTGCTCACTTTGTTGTCTGCTTCATTGCCCAAATTGG 

CAATGACAGTTTCCTTTTCACCTCCCTCTGGAAGCTCAAATTTCTTCCGTAGCCTTATGA 

CTGTTCCTTTGTG ACATGTTTAG GTTTGG 

Sequence 1212 

CCGG GCAGG TACATTATAACAAAGTTTTTAACATGGA 
AACTATTTTTGG TAATA AATATT^ 

TGTCACTGATACT TTTCT GAATATGATTTTCCATCTGGTTTTACTAGTTTTAAT^ 

ATAAAATTGCAGTATTTTGCAGAGTTACATTTTATACTTGTGGTTGTTAACTTCT^ 

TGATGCTTCACTATAACCAATAATGTTTTTAGTTGCTAACATACTTTATATGTGCTGCTG 

TCCCCAGTTAGCTCAGGAAAAATTTTGCTAGATGGAGAATATTCTTAATTCTAATTTn^ 

TCTTATTGAAAATTATAGGCTGGGCGCAGTGGCTTATGCCTGTAATCCCAGCATTTTCGG 

AGGCCGAGGCAGATGGATCACCTNAGGTCAGGAGTTCAAGACTAGCCTGGCCAACGTGAT 

GAAGCCCCCGTCTCTACTAAAAATCACACACACACACACCAA 

Sequence 1213 

CCGGGCAGGTAC I I I I I I 1 I ) I I I i I I I I I I I I I | I I f I CTTTGGATTTTNAAGTC AGTG 

AACCAGGCAATCACCGACACTGCCCGCGCATTATGGGCCTAATTCTGCANAGGTCTGACA 

AACCTGCCCAGAGACAGAAGATTAAGCTATTCCAAATTACCCAGACGGAGGCAAGCCCAN 

ATTGGGCTNTNTGGGACAGAGGTGGGGGTGGGGGGACCTTTCATTTCTGCTCTTTGGAGA 

CCTCCAANACTAAACAGTTTAAACCACATNTTGAANAAAATTGCCTAATCCTTTCACCTT 
TAAGT 

Sequence 1214 

CCGCG GTGGCGGCCGAGGTACTGACTTTTGGAGTTACTTAAGAACCACAGTTCCTAAAAC 
ATmATTTGGGAGCTCAACTTCTGGCTTCCTACTTACCAAGAGGCAGTAAGTTATAATC 
TCTGGCCTTTGTCTTTACATTTTCAACTTTCTGGATCCCTAACCTCTGGTCCCAAGTTAC 
TACNCTTGCCTATATCCACTAACCACTCACTAGGTCACAAAGTTGATGGCTCTGGCTTGA 
CCCAGTGAATGAGA AGTGTC CTAGATG ITTTTTTTTTTTTI I CGCAAATCTTCATAGGCA 

CAGTTGCCTG7TTAATTTTTTCTTCAGCCCTTCCCTTAAAGGAGGGAGCCTTAAGTTACC 

TTTCAAGGTTGGGGGTT 

Sequence 1215 

CCGGGCAGGTACGCGGGCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCTGG 

CCTCACCTGGGACTCTTAAGTGATCTATTCAGGGACAAAATCAGGATGGTAGTGCACAGA 

CAGG ATCTTG ACATTGATCTGGAATMCGMTCATGCTATTTTGTTATCCAGAATTCTTT 

GACAATTTrrGTCTTGCCCTTTCCCTTTCAACCCTTCCTTTAAAAAAAGAA^ 

A GAAAC TATTCTGATCTTTATTAAATGGTTAATAGACTATAATGACAGTCAGCAAGAAAA 

ATTTTTCTATTCCTTCTAGGAGGGCTTAGAAAGTTCATTGTTGCAACTTAGCTCTTT^ 

AATTGTTGTAGTATGGGAAATATGCTGGTTTTCAGCTGTGTGGAATTAGAAAGGGCTTTA 

GGATGTAAATAATGCATTTTCAGTGAATCAANACAGCTCTTTTTTTACCATGATAAAAGT 

TTGCCATTCTTGTATTTTCAGAAAA 
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Sequence 1216 

CCGGGCAGGTACGGGGGCCTCTGTAGGCTGCCTAGGGAGCCCTGGCTTTGCTCTTCGTGT 

TGGGCTCACTCCATGATCAGGAGCCGGTGGGACTGGTCCTTCCTGATTCTTACTGTCTGT 

GGTTCCCCATCCCCTACGGGGAGCCTGCTTTGGGCCTTGAGCTGGATAGAGAGAAGAGCT 

TTGGGGCCCAGCTGGTTATAGGAGCTGAGCTTTTCCACACCTCVCTTTGTTAACCCTTGG 

AAACAGACCTGCCTTTCACCTGACCCATCTTCCTACCTGTCTGGTCTGACCTGCCCTCTT 

TGAAAGCACTCATCACCTAGTTTTACTAGGCTGATTGGCAGATGTGGACATGACA 

Sequence 1217 

CCGGGCAGGTACAGCAGTGCAATCTCAGTCCCCTGCAACCTCTGCCTCCCGGACTCAAGC 

CATCCTCCCGCCTCAACCTCCAAGCAGCTGGGACCACAGGCACGCACCATCACACTCAGC 

TAAI I I 111 I I I ICTTTGTAGI Mill I I I I GTAGAGATGAGGTCTCACCGTATTGCCTA 

GGCTAGTCTCAAATTGCTGGGCTCAAGTGATCCTCCCACCTCAGACTGCCAAAACACTGA 

GATTACAGGCCAGGATTCTCAATTTTTAAGAGTGTAAAAGTGTCCTAAGACCAAAATATT 

TGAGATCCCTAATACAGGCAATACCATTTCTCAATGCAGGTTTATTATTTTTAATTCTTA 

TTTTGAGAGA 

Sequence 1218 

CCGGGCAGGTACGGGGGCCTCTGTAGGCTGCCTAGGGAGCCCTGGCTTTGCTCTTCGTGT 

TGGGCTCACTCCATGATCAGGAGCCGGTGGGACTGGTCCTTCCTGATTCTTACTGTCTGT 

GGTTCCCCATCCCCTACGGGGAGCCTGCTTTGGGCCTTGAGCTGGATAGAGAGAAGAGCT 

TTGGGGCCCAGCTGGTTATAGGAGCTGAGCTTTTCCACACCTNTCTTTGTTAACCCTTGG 

AAACAGACCTGCC7TTCACCTGACCCATCTTCCTACCTGTCTGGTCTGACCTGCCCTCTT 

TGAAAGCACTCATCACCTAGTTTTACTAGGCTGATTGGCAGATGTGG 

Sequence 1219 

AGGTACCGTGAATCTTTAAGTTTTAGGGTGTGGGGTTTTCCCCTTACCAANTGCAATATT 
AC C N AN AAAAG N ATAATNNCCTG AGTT AAATAGCCTCTTTG AAAAAC ATTC ATAN NAN GC 
CAAGTGGGTTATATGAATGGCTACTCTGTCTTGATTCTCAGTACCTGCCATNTCNNGTCC 
ATGCCATTGATGGCCAGCACTGCATGACCCACTCGGATGCCGTTC 
Sequence 1220 

GAATCTCACGGACTTGAAAGGACAGAGACGTTTCCTGAGATGTCTGCGGCAGAATGTGGG 

AGGCTTCAGCCTGAGGAAAGTTCAAGTTGGAGAGAAAGGTTGCCAGACCTTTGAGTGACC 

CACACTGCTTATAACTTCTAGAAGGTTCTTTTAGGGAAACCTGCTTAGAANGAAACCCTG 

AAACGTCCTTTGCTGCCTCCTTAACMCTTGCATTAGAGAGTTTCTTCAGCCAAAAGGTG 

AATGTCTGTTGAGTGGAAAGGAGAGGACGGAGTCGATGTTGTCCCCTGTGAGATGTCATG 

TCTCAGGCAGTATGCTGGACATTTCCCGAGACGCCAACTCTGCTCAGGNTGACTTGTGTG 

CCCTGCCCTTTGT 

Sequence 1221 

AGGTACCAACGTAGGGCTTTGATTCAGAAGAGTGTGCTGTGAGGAGGCAGGTGCCAGGCG 

GATTTTATCATGACCTGGGTGAAAGCCATCTGCCCTGCAGAGGGAGCTGTTTCAGAACTC 

CTAATAGGAAAGTCAAACGTCCAGCACAGCCAGTGCAAGAGTGGTCCCACCAGCACAGTC 

CTGGCAAAACTCTTGACTGTGTAGCCTTCAAATCTGATTCTCTGGATCTCTGGGTGGTCC 

CATGAACTATATAATAACATTTGATGAAACTTTTTTTCTGTTAGCTGGTGTAGATTCTGT 

GATGTGTAATTGAGAATCCTAACTGAACCCACAATTTTGCTCTTGATC I I M I I I I 

Sequence 1222 

ACGCGGGATTCAAGTAACAGAAATCTCCCTTCCCTTCTTTCTCTGATGTGCAGTAGTTCT 
GCCTGTTANAATCATCTGGAGAGCTTGTTAAAAATACCAATGTCAGGAGCTCGACCCTAG 
AGATTCAAATTTAACTGATCTGGTGTGGGACCTGGAAATCCTCTAi (111 I AAAAGTTCC 
CCAGTTGAGGCCAGGTGCGGTGGCTCACGCCTGAGTCCCGGCACTTTGGGAGGCCGAGGC 
GTGTGGATCACTTGAGGTCGGGAGTTCGAGACCAGCCTGGCCGGCGTGGTGAAACTCCAT 
CTCTACTAAAAATAC AAAAAAAATTA 
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Sequence 1223 

CCGGGCAGGTACATTTCAATTCCCAGTGGTGCAGGCTGGGAGGGCCATGAGGAGGGAGGG 

AAACAGACCTGTGCCTGCAGCTGCTTTCTTGGCCACCTCTGTCCTCCTCCTTGGTGTTCT 

TTCCTCTTAGGGCTGGAGGCCTGCTCTCACCACCTGGGAAGGAGTCTGGAGAAAGACGGG 

TCCTCCCTGGCTCCGTGGCGGCAGAGCCGGGATTACCCACTGACATTATAGCTGGCAGCC 

GTGGGCTGTTGGCACCCTTTAAAAGAACACGATGGCAATTTTCCAGCTAAATTCTCCCAA 

TTAGGCAGTTAGAGAAACATTATCCACCTCAGCATAAAGCTGATGAAATGGA 

Sequence 1224 

AGGTACTTTTTCT7TTTCI I i I I I { i I If i I ! I I GTTGTTTGTTTTAAATGAAACCCTCT 

AGGCCTCATTTTCAGTGATTTTGATTAGGTGGACTGGAAGAGACACTCCTCTAATAGTTC 

CACCAAACAGTCACAGGAACAAGAAAGGCACATCAGGAAAAGCCACACAGGAAATGGTTT 

TGCCAAAAGCCAAA AACCATGA GCATGCAAAAACCAAAGTAAATTCTCTGTAAGTCACAA 

AATGAAGCATACATC I I I I I I 1 1CTTTGAGATGGAGTTTCGCTCTTGTTGCCCAGGCTGG 

AGTGCAGTGGTGCAATCTCGGCTCACTGCAACCTCCACCTCCCGG 

Sequence 1225 

CCGCGGTGGCGGCCGAGGTACCTTGCTCCACCAATCCCAGCTGTCACATCAGCCCTATAT 

GCCAGTCTCTGGGATTTTGCTTTTGTGTTTTCTGTGTAAAAACACTGCT^^ 

CTCTGCTTTCCTGTGTTACTGTTTAGACAGTGTCCCCAGACAGAAAAATGGAGAATGGCA 

AGATTACCTTATGTGTTTCCCCTTCTTTGAAGGATCACAGCCTTTTTATGTCTTCTGTCT 

AATGCATGAATAGGGTTGCTTCATGTAGTTTATCCAATTTACAATTGTTTATGGCAAA^ 

GGTMCTTTGATACCTTTTACTCTGTGATGGCTGTAACTAGAAGTCCTATTAATGCCTTT 

7TAAATTAAAAATAGCACTGCCAATAAAACCTTAATTTCCATACATCAAAAGTCAACCAA 

AGGAAACTAATAAGTAGTAAAATAAAAATGTGGGGCAAGGGTGTGAGCAGACTCGTGTAA 

mCCCATTATTTTAATTTTCTTAGMTAAAAAAAATTCCAAGAGC 

TCCCAGCAGCACAGTTCACAAGCTCTGTGGCTGACATGCAGAACATGCCTGCTGCTGTGC 

ACGCACTCTTGACACAACCCTNTCTTCAGCGCTGCTNCTTTTGCTCAGCGGGTANTTTGG 
GAAAC 

Sequence 1226 

AGG TACATAAAGTG TGTGTTGTTTTTTAATTATTTTGMCTATTCAGTAT^ 
TCAATTTTTGTTTTTCTCTGCATTTGGGCTTGTTCCTT^ 

G C ATAG ACC ATTG ACTGG ATGTTGTA I I I I I I i I I I CTCAAAGCTAAACTGTATGAAAAA 

TCAAGACTTAAAAAAGGTAAATGGAGGCAGTTGCACAATTACATTTGTTTAATGAGCTTT 
TACATTTTAAAACTTTTAATTAATGTTATACTATTTCAGAAATT 

ATTATAAAGAATGCATTTGTTATTTAAAGTGAGATATGCGTGCAGACACACAGACCATGG 

TCTTCCCTTCTTCCTCTGGTCTACTGCTTCTGCCTTATATCCCCTAGATTCCTTCTCACT 

CCCCAAAAGGCAGAGGTTAAGAAGAAAGTTAAAATATCAAGACACACTAAACTGTTTTTG 

TGGTGAGTCAAGCTGCAGTGCCTCGCTGACTGGGAAACAATGGGGCTGGGCCTCCTAGAA 
TATAGTAGG CTG G AG AAAC 

Sequence 1227 

ACGCGGGGACACAATTTTAGTAATAAGCTTCAAAACCCATCACTCCACTGAATCCAGGGA 

TGCCAGAATTGAGAGTCCATAGGACCCGTTTAACGGATCCCACATTCAGTAGACGGTTTT 

TTGTATGCTTCGAGGGAAGGAAGGTATTTTCAAATGCAACTTC I \ I I M U I I CTTGCTC 

TAGTTAGGGCTGATTTTACCACTGGGCAAATTGCATACATTTGGATCCATGCCACGCTAA 

ATACTGTTTATTTTGGAGATCTGTCAGAGGCAAGCACATGGAAGACAAGGGGATTCTTGG 

TTTGAAAAGGCAATTCTCATCCAGACAGATTCAGATGGAGAGGAGCGGTGTGGATTA 
Sequence 1228 

TCTCTGTCTTTTTAACTTTGCAGAAAGCCCTTCCAGTTATACACTCANACCCTGTTCAGG 

ACGTTGGTGGCCAAACAGTCCCCTTTTGTTMGCGAGCGTCCCATTTTTGTTTCTCNTGA 

AAGGTGTCCTCCAGGACAGCANAGCCTTTCTTTGGGCTGTTTATAATTGAGCCCTGAAGA 
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AAAACAGAAATCATTCTAAACCCAGC3CCAAGTGCCTCCTTGGCCTAGCTAGGCAAAGCTA 
AACTCACACAAGATGAAATNTAACCAGCTGGTTACCTGAGAACCACCTTTCGGGAGTTC 
Sequence 1229 

AGGTACCACACATTGCAGGCTGGAAGACACCAAGGGAGACACATGTTAGTTGCAACTATT 
GGGACAGGTGTGTCCTGTTAACACACAGGCAGTTGCTGATGTGCAGACTGGGTTTATGAA 
CAACTTGCGCATGTGGCCTATGCTGGGTGCATGTAAGGCAGATGGCCTTTGCCCCTAATT 
CTGAAAACATGAACAGTGAGTGCTCCAAGGATGCACTTACTCCCATGTCAAGGCAGAGAA 
TAAGCGAGTTACTAGAGCAAAGCAGACCCGCCTGGTGTGGCTTTGCCTTCCCACCAGCCA 
-GAGGGAGCTTTCTATTCGCCCTGCAGCAGTATTAAACCCTCGTGTCCCAGGTGGAGCCCA 
AGGCACCTTCTNTGGTCTCAGGGTGGGTATGGACAACAGTCACATTAATATGAAACCCAA 
CTACTGTTATAGGTAAGGAGCCCAGAGCCCTTGAGAAACAACCNTGAGGGAAGCATGTGT 
CCTCAAATNCTGGTTGCCAAGAGAAGGAATGGCTTTGCANTGAGCCTCAGANAACCAGGC 
TGGAAAAAAGCCCAAAAGTGGCCCGGCCTCACCTGTGTCCTGGCTTCCCCCNTGCTGCCA 
ANGGGAAGANCCTNTGGGNTNTTTGTCCCTTGACCATNTTCTTTTTTGGGAAGAAANAAA 
AAANNACTNGGGCTTCCTTAACAAACCCCTT 
Sequence 1230 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 

GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 

CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCCCGGCCGCTCTAGA 

ACTAGT 

Sequence 1231 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1232 

CCGGGCAGGTACAGGAAATATGCTTACATAACTACTCTTTATGGTATGGGATGGGAAAAG 

CTAGGAAGCCAGAGAGCACTGGGTGCTAGATGCATTAGGAAAAGCCCCCCATCTACTCCC 

CACCCCCACACTCTGCATGGGCAGCCACCAGTCAGGGGCTGTGCTCACTGATATCCACCA 

CCCCTGAATATGTCCCCACCACCTGCAAGGGGCCGACNGATGGAGCCTACTAGCAATCAC 

TGCATCTTTGGTTTACTGGTCAGAGCATGTTMCATTTAGCTTCCTTTTCCCCCTCAGGA 

GTTTTAAAAATGATGGCAGAGAAAGTATGCCATGTAAGAGAAAAAAAAAAAATNCCACTA 

GGGAGATGAGCAGACCAGGTTAGCAGCCTGCTTCAGCTCTTGGGAAGGTAAAGAAACCTT 

TTCCGGACCCTTGGTTTCCTTTTCTTGATAAGCCACCCAACATTNTTGGTTNACCNTTCA 

AAANTTTTTCAGAAAAAACTTTTTAAAAAATNATTAAAMTTG 

CCGNAGGTATTAAAACAATTTTTAATGGTTTTTAATTTGGAATTNA 

AGTTNTTAAAACATTTTNANTGGTTTTCCCGGGNTNAATTTNAANGGGGAGNTTTTGGT^ 

CNTTTTGGGAAATANANGTGNTTTGGGTTTTAAAGGGGAAAA 

Sequence 1233 

AGGTACTGCAATATGTCATGAAGAACCAAGAGAAGCTAAGCAGTGGCGATGTAAGATTGG 

TGGGCAGACAGACTGAAAGCAAGAAGAACCGTGAAGAGGTTCCTGGAGAAACCCAAGCCT 

GAAGTGTCTAATCTAGGATGATGGGATAGTAATGGAGAGAGGAACAAATCTGAGGTAATC 

TGAGGGAAGAAACAACAGAATTTGGTGGCCGACTGGATATACAAGACGAAAAACAGTCAA 

AGGCAGGGGTTGCAAAGGCAAAGGTGGTCATGGGCCAGGCAGGAAATATGAATGGGAGAA 

GCAGCTAGGTGGAGACTGGCAAGCCGTGAACCCCGGCGCCATCTAAAGTGGATGCCTGAC 

CCTGGGATGGAGCCCAGAGTAGCCAGTTCTTGCTCTGCCTTCTCTCTGGGTCTCTCCAGT 

CTCACTTCCCTCCAGTCCAACCTNCACGATGTGTTTCAGGGTGTCAAAGCGACCTTTTGA 

AACAAAAACCTTAATTNTTGTCACATCACTGCTTAAAAATCTTTAAATGGTC 

Sequence 1234 

ACGTTCAGGMTTTTCCCAGTGAGTTATAGAAAATACTCAAGGAATACATAATCAGAACA 
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TANCAAACTCTTCCCAAGAAGAGAGATCAAAACAAAGAAACAAACAAAAACTTTAAGACA 

GCAGTAAGCTTCAGTGTCAAATCGCTTGAGGAAAGGATAAGAAAGGAAAATTAGAGGCCA 

GTTTCCCTCACGATTATACATACAGTAAGTCCCCCACCCTCCCCCAAAATCTTCAAGGAG 

TCAAAA TTCC AGCGACATGTTAAAAAAAAAGGTGTTTGTTAAAAAACCATCCATGNAAAA 

TGTT TGTTTTATCCMGTTAANACAGmCCATGTTAGAAAMTCTATTTAMTTAAA^ 

TTTTGCCCGCAATANNGAAAATTCCAACCAAA 

TAAANAGGAATGGTANGAAAAANAGGGGGACCGCCAAGGGTGGGGATTCAACCCTTGAAG 
GGGTTTGGGGGAAGGTTTCCTAAAGAACCCCAGGCCCTTGNACCCAAGGCCANTGGGGAA 
GAAAANC CACCTTGTTGGTTC NTAACCTTAAAAAAATTANCAAAAAATTTAA 
Sequence 1235 

GGCCGAGGTACTTGAGGGCATCATTAAAATCAGCCAGTGTTCAAGTGAGAGACTTGATGA 

TGGATTTAATAGTATTAAATGGGCAAAMTGCCCTTGGCCTTTACATGCTCATTTAAAAA 

CATATAAAAGAGCAGTAATTCAGGCAGTTATACTTCTCTCAAAGATTTTGCTCTAAATCA 

CAGATGGCAGCGGGAGGCCACGGCTGTGTGCTGGCTCACACCCCACTCACTCCAACCGAA 

GCCAGMGTGCTCGTTGGGTCATAATTAGCAAA.CC AC CCGCAAACTCGACTCTTTAAATC 

AGAGGCCAGTGACTGCCAGGCTGGGGCATAGACGGGAGAAAACAAGCAGAAATGGTCAAG 

GTGCTCTCTGGAGCCTGCCTTCGGATAGCTGTGACCCCCACGTTAGGCATCAAGACAGGG 

TTTCCACACCTGCGCCGCCACCCCTGCAGTGTGAGCGCTGTATTTTGCTTTCTGANAAAG 

CCTCACGGTTTGGCTCTTTNGGNTGCATGANGGCTTTTAAGGGCGATGTGNCCCCTTGCC 

CAGCTTGTTGCCCAAGCTTCCCNAGCCCTTTGGCCACAGNCCAGCATACATCTTTTGTTT 

TTAGAAAC AAATTATTTAAN NCACTTTAAATC AATGAATT 

Sequence 1236 

AGGTACATCTGGCCTTAGGGCTGTCTATAGATTCTAACTCTTAAGTAAGATGTTATGTTT 

TATGTAmACTACTGCACAGTTATCATTTTAAAAAAAAAACTCTTACmCCCTACTA^ 

CAGTANTTGAGTGATGATAAGTTTTAGAATATAAAATTCACGGTTCCAAGATGGCTGAAT 

AGGAACAGCTCTGGTCTGCACCTCCCAGCGTGATTGATGCCAGAAGATGGGTGATTTCTG 

CATTTCCAACTGAGCTCTGAAGAGAAGCCAGTTGGCTCTCCCAGCCACCGGTGTTTTGAG 

CTCTGAGAAATGGGACAGAC 

Sequence 1237 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCTTAATAGAAACTCTAAGAGC 

CGGGGCTTCTGGGTAATAAGCACGGAGTATGAAGCCGCTGTGTTTTACTGACTTATAAAT 

AGGACACCCTTTAGTCAGCTCTGTGCCTTAGAAAGCATGACACTCACCCCCACCCCATGC 

CCCACCGAAAAAGAAACCACAGACCAGATTATCACCTCTCCGAATGAGGCATTGGAATAA 

TCTTCGCACTGTTCCGCGACCCGAAGGGCTCAGGAGTGCTGGAAAAAGATTCTTCACCTG 

TTGAGCAGTTCTAGCGGAGAAAAGATCAGGCTCGGCCTGCTTCCCTGCGAATCTGCCGTC 

TGTTGAGACAAGACATCTGAACCAGAAAGCCGGGGAGGAGGTGGCTCTCCATCCCCGTGG 

GAAG GCTTT CCCTGGTCCTTTCACTTACCTTATTCAGGCCTCTTTAGTTCAGAAATAGCC 

TTAGGTTTTGCTCCTGGCTCCATTTTCCTCCTTMTAGCATCGCTCTTTTTCCATGTT^ 

TGTGTGGATGGGGGA 

Sequence 1238 

ACCACCACACCTGGCTAATTTGGTATTTrrAGCAGAGACGGTGTTTCACCGTGTTGGCTA 

GGCTGGTGTCAAATTCCTGACCTCAGGTGATCCACCGGCCTCAGCCTCCCAAAGTGCTGG 

GATTACAGGCGTGAGCCACTGTGCCTGACCCATAACATCTTTAACTTTCAAAAAGATGCA 

CTGTTTCTGCTGGGGGTTTCTTTTAGTMTTTGATTAGAGMTTCTTTGTTT^ 

ATATTTGGAGTTTTrCTCGTGGGGGAGMTGCAGGTTTrCTCTTGATGGTAATAATTCTA 

CAGTAGAATTAGAATTCTTCTTCACTATCAAAGAGTCTTTATCTTCTGTAAAGAATCTCA 

TGATATGGTATGTGAAACCCAACCAGCCAACACATTAGGMGTTCCAGTTGAAGTTTCTG 

AAATCAAAGAAGCCACCTGCGTCCTTCCTCTCTTGAAAATATGCCACATTTGTGGGATCT 

TAAAAGGAGACCTGTTTTTCCATTTGTCTCTTCTCAAGCTTAGAATCTATAGTAAAGTTA 
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AAAAACTCAGATTGTATCTATCATGGGTTAATGCCANCCAAACCCTNAGC TTTTGGGA AA 
ANATTGTGAAAAATGTTTTTTTTNTTNNAGGGGGATTAANAAAAAAAGGNG I mTTTTC 

Sequence 1 239 

AGGTACTTGTGAATTCAAAACAGCACTCAGGCATATAGGTTACATTGGCACCAGGAAACC 

TAAGCAGTATTTCCATGACCAAATTCTMGTGAATTCAACACCAAAGGAGAAAAGGAAAT 

GTTCTTTGCTCTCTGTATAGATTCCTCTTCTGCTTGCTTGGGTTTTACAMGTTCTTCTA 

ATTTTTATGTGCGATTGGACAATGCAGTTTTGGTAMGTTGCTAAMCCATMCTGTTAC 

TCTTGGTTTTAAATATCAAACCAGGTAAAAGGGAGGGAGGAACTTAATGTTACAGAAGGG 

AAAAATGACTTAAATATCCGAACAGACACCATATGGGAGGCTTTTTACTCCCCTTTCCTC 

CTTTAAAAACAGTGTGTTATTTCTCCCTCTAGATTCCAGTTACTATGGAGG GATTTA GGA 

TTTAGTGGGACTAGTGTTGCTGAATTAGCTTTGTAAGACAAGTCTGTTTACl I M 1 1 ATA 

CTAAATTTTCATGCTTACCACTGACTCTGATTTAGTTTGAAGAAAGCTATTTCTAAGAAT 

TCTGG TTCCTTAACATTTATTAGAAAGGGGAA 
Sequence 1 240 

AGGTACTTGAGAACAGTAGGCATAGTTCAGGTTAAGGGCATCATA CTGGAA AACAGGGAG 

AATTAMTAAAAATCTGCATGTTGATCTGAGAGTTATTCCCAGTCTCTTTTTCCCTCCCA 

GTTCTGAGAATACTGGCTGACAAGTTCTTGCCCCATCTAGCTGTGAGATTAAAAGATTCA . 

TCCTGTGACAGCAAACTGCTGAGATCAGGAAAAATATCTAACTGATAACAGATGTGGCCC 

AGATTGATMTCCTACAGTATGTCTACTCATTMGACAGMTCCCTAGGCAACTTTTAAG 

TGCCTCAATCTTAAATATGAGATTAAGGATCACCAAACATTTGA GGATA ACCTTTAATAT 

GAAAGAGAAGAGGGAAATGGAAATACTTCAAGMGANGACAAGTTTTTGTAAAAAGTTAA 

CTGATTTNTTCAGAAGTAAAGGTATTTGCATCTTTGAAACAAGAAAGGATTGCTATGAAA 

AAAGCA 
Sequence 1241 

ACACGCATGTATGTGAGCCACCTTGCTCTCCTGTAGTGAAGCAGGATGTTCTATTGGACC 

ACGTTCACATTTGGTATTTGTTCACCATGGTGGAAATGAAT GGGAAC CGGGGCCTGTCAC 

ACAGGTGAGACTGCTAGCCAATTCAAGGCATATCCACACCATTTTTGAGGAAATATTAAA 

TAAAATACATATGTGACCATTTAAGTGCCTAATTTACAGCA CTTGAT TTGTCTGACACAC 

AATG AGGGTAAG AATAGTGTTCTATATAAATGGCTC CTTC GTTTTTC C AC TTTG A TAAAC 

ATTCTTAAAAAGAGC7TCATTTACAAACATTACACTTTC 

ATGCTTAGAGTTGTAGAGT 

Sequence 1242 

CCGGGCAGGTNCTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGAC 

CAGCCTGGCTAACATGGTGAAACCCTGTCTNTACTAAAAATACAAAAATTAGCCGGGCAT 

GGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTTGAGGCAAGAGAATCGNTTTGA 

ACCCANGAGGNGGAGGTTGCAGTNAGTCNAGATCGCGCCACNTGCACTCCAGCCATGGGC 

GACAGNAGCCAAGACTTTCATTCTNAAAATTTACAGAAAAGGAAANGAAAACNANAGTAA 

NAGGNAAAAGGCTTATATTGGAAACNTTCTTCTAAAAAAAMGGAAANAAAAAAAAGNAA 

NAGCCCTGNATNGCCACAACCAAAATTCTTAAAMTTTTNGGGCAMGTTCNNAATNCAA 

ATTTAAAAAGGGGATTAATTTTTTANTmGGCNMTTCMCCA 

CTTTTNACCTTCCCCCCCCCAAAAAAM 

ATGCCANACCTTTTTTCCTTAAATGGTNTNTTTAAAAAAA 

Sequence 1243 

CCGGGCAGGTATCATAAAGAAGCCAGCAGATGGTGTTATACTAGCTTTCGAAGTTCCTTC 
GTCTGCCTCAGCTGAACTCTCCCTGAAGCCAAATAAGGACATCTCAACATCCACATGGGC 
ACACACACGCCTGCGCGTGCACACACACACAC TGCTG TTTTTATGTAACGTGTAGCTATG 
CTGAGTTTTACCTCCACTCTGTCAAAGGTGGTCTTTTTGTTGATTGATGAATTTGl I I iC 
TGTCTCTACTAAAAATACAAAAAATTAGCCGGGCGTGGTGGTGGACNCCTGTAGTCCCAG 
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CTACTCGGGAGGCTAAGGCAGGAGAATGGCGTGAAACCTAATAGGTGGAGCTTGCAGTGA 
GCANAGGTAGTGCCACTGCACTNCAGCCTGGGTGAGAG 

Sequence 1244 

CCGGGCAGGTACTGGGGATTGATTAGCCACAGTCTTTGCCACA GACTCTT TTGGACTGAG 

CCGATAGGTCAAAGGTTAGCTTATGCAGTAGCATGGGCTGTAGG MUM GCACTGATCA 

ACACCGGAACTTTTAGAAGACTTTTAGMGATTTGTTGCTGCTGTGAGTGACATCAGTTC 

ACACAGCCTGAGAGAGAACTCTCAGTCCATGCTGTCCCTCCTTGGGCTCACTCAAGCGAA 

ATGGGTCGAGCTGCAACACTGTGCGGTGCCAACCGGGAGAAATCTGCCTATCGGGGCTGA 

GAGACATCCTGTAGTGGCTGCCCGTCTCCTTTGCCCCAAATGTCAAGTGGAAGAGGTGCT 

TCACATATTTATTTCTTCCTACATGGGCTCATTTATTGTMGCCCATTGTGACCCTTGGA 

AAGCCAAGCTTrTATCTTGATTTTGGAGTGTGATCAAGGTGACAGTTGTTCTCCCGGATG 

ACAGGAAATCTGACTTTGGTTTTGCTTTTTTC 

TGGGCC 

Sequence 1245 

CCGGGCAGGTACGCGGGGCCAGCAGCCAGAGTGAGCATATAAAATGCAGTTCTGATCCTG 

TCACCTCTCTGCTTAAAACTCTTCAGAGGTTCACCATCGCTTCCAGAAAAAGATCAATAC 

TCCTGACGTGGCAACCTGTCGAGACTCAGGACCGCTAGGAAATCAGCTCTTGTTTACCCC 

TCTATCCATTAC ACTACG ATCCAGCTGGGTTTAGGTATCACTTC CTCTG AGC AATC CTCC 

CTGATGTCATGAGGCAGTGCTGAGTGCCTTCTCTGGACCCTCfGTTTACTTGCCTGTTAA 

ATCCTGAACATCTGGAAGGGAGTCGTGTTCAG 

Sequence 1246 

ACNCGGGGGAGTAGGAATGTTCCATAAGTAAGAGAGATGACTAGGGTTTCAAGAACAAGG 

ACCAGCACCTATNAAGGCTCTGAGGCACAGACCACTGTGACCTGTGGAGAAGTGGCTCCC 

ACAGTTTCCCAGAACTGCCCCCTTGGTCAGAGTCACGGTTCTCCTCTTGTAAGAGAAGCT 

CAAGGGCAGACCCTCNGCAGGAAGCANACCANCATTTT 

Sequence 1247 

AGGTACATTTTATCAAGTTACCTATGAACCGATGTATTTCGTTCACTTAATACCATTCAC 

TAAATTCAGTTATTGACCTAGTCAATAACTGCCCAACTTCACATTTCACATAC AGTGAA G 

AAACTCTTGTTTCAACATTCTTTTITGGGTTGGTTGGTTTGTTT^ I I M I I 

ACACAAGGCAATCCAAACTGCCCAATCTACTCCATCCGAGCCTGTGTGCAAACCCCTTTT 

CCCACACCCTGAATTTCCTTTCTTCCTTMCGTCCACCAAATATGAGTCAMGGATCTTT 

TTACAAAGCTATGGTTTACAAGCAAGAGCCACATTCCTCCAGAACTCGTGTAGAGTTGCT 

TTGTCiTCAMGCCCCACCTGGGTTTTCCACCATATTGCTTTCNGTCAACGACACTCACC 

ACAGCATNCTCTCTGGCGGTTTGTGAATCTCCCNGCCCTGGGTCATGACACTANGCGACC 

CCCGGCTGGGCTTTTCTGCGGGCTGCTATTCGCTTGGCTGCTGCTTCTTCGCTGGANGGT 

NGCTTTGTTTNACCAGGGGGTTGGCCAAGGGAACCTGGGANTAtTAGGCACCCTTNTTAA 

AMCAACAACMNGNCNTTGGGGCCAAAAAAGAACTACCAAACCGANCGAACTTTTNGGG 

CAN AAAGAACCAAAAANGAAAAAAGGGGGGCCCCAAGGGNA I rTTTCTTTTTTi iCCTTT 

NTTTC 

Sequence 1248 

CCGGGCAGGTACTTTTCAGATGAGCAAAMTGTAAACAATTGTTAATGTTCATGGTTGGC 

AAGAG.TGTGMGTCATGGGGATCCTTGGATCACAAGTTCTTAAAAACTGCACATGCCTTG 

CCCTAGTAATTCACTTTATAGAACTTAATTTAAGAAAATAGATGGGCATGCCCAGCACTT 

TGGGAAGTCAAGGTGGGCGGATCACAAGGTCAGGAGTTTGAGACCAGTCTGACCAACATG 

GTGAAACCCTGTCTTTACTAAAAATACAAAAATTAGTCGGGCATGGTGGTGCATGCCTGT 

AATACCAGCTACTTGGGAGGCTGAGGCGGGAGAATTGCTTGAACCCGGAAGGCAGNAAGT 

TGCAGTAAGCCGAG 

Sequence 1249 

CCGGGCAGGTACGCGGGGCCGGGCGCGGTGGCACGTGCCTGTAGTCCCAAGTAGCTGAGA 
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TTACAAGCGTGCGCCATCACACCTGGCTAATTTTTTGTATTTTTAGTAGAGATGGGGTTT 

CACCATGTTGGTGAACAGGCTGGCCTTGAACTCCTGACTTCAAGTGATCCGCCCATCTCG 

GCCTCCCAAAGTGCTGGGATTACCGGCNTGAGCCACCACGCCTGGCCTCGATGACTTCTT 

AAGAATrTATGTCAGAGTATTCTATTTGGCTACCTGTTTTATTATATTCCTGTCTGAGAA 

ATGGTGGTCGGTCTTTCAGACTGAGAGAGCAGCACGTCTCTTTTGAACTGTCACCCCTTA 

GCACTAATA 

Sequence 1250 

CCGGGCAGGTACGCGGGATACTCTACAGCGTGTGCATGGCTGGATTCAGAAT I I I IGGTT 

TTAAAAATTAAGCTTTTATTTATATTTACCCATCAGGTCACCGAATACT^ 

AAATGCTTGAAAGCTTCTTCATTTGGGGGAAGGTGGGTTTGGAGGCAGAAATCCCTGCAG 

ACTCCACACCCAGGGTTTTACTAGGTCCTCCGCGGCAGGTGGAAGGAAAGCTTG7TGC CC 

TGCAGTTCCGTTTGCTTCAAAAACAAGGGTGGGMGGGTTCACCAATTGATCTTTGrGCA 

TATCCAGTGTAGGAAAAGTGTTATTTAGAGTATCATTATAAAATCGGCTGCATATGGTGG 

CTCGTGCCTGTAATCCCAGCACTTTGGGAGTCCAAGATGGGCAGATCGCTTGAAGTCGGA 

AGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATCAAAAAATT 

ACCCGGGCATGGTAGCCGGGCACCTGTAATCCCCGCTACTCGGGGAGGCTTGAGGCCCAA 

GGANAATCACTTTGAACCCCAAAAAGGAGGAGGNTTGCCATNGAGCCCAAAATTGCACCN 

NNTCATTTCCAACCCTGGGGTGACAGANCCAAGGACTCCATNTTAAAAAAAAAAANANNN 

NNNNNNNNNNAMGNTCCCTCGGGCCGCTTTTNAAACTAAGGGGAANCCCCCCGGGCTNN 

CAGGAAATTTNAAATTNAAACCTTTTTNCGNTNCCGCCCAACCCCNGNGGGGGGGG 

Sequence 1251 

CCGGGCAGGTACGCGGGAGGTAACATGTCATAGCTGACCTCAGACCTCGGGTTTTGATGT 

TAGTTTAAATTAAATAATAACTTAG I I I I i GGCTTTGAATACTTTACCTCTGTCAGTTTC 

CTTACTCTAAAATGAGGAAAATAATCCCTCCAAGACAGATTATTGTAAGGATGTATGGAA 

TGAAGTATATATAAATATGAGACTGAAGAATGAATGAACGGTTGAGCTGATGGATGCAGC 

AGTGTAAGAGGAGGGCCTGGCCCCTCGCTCGTTCTCCCAGCCCCAACTAGAAAGAACATC 

TCANGGGCCCAGACCCTGCCACATGTGCCATGGTAAATCAGGGGTGGGCCATGCTTCTGT 

GTTCTTCCTGCAACCAAACATCGGTTATAGCCGGTGGCAAGCACTGTAANAGGGGCCAGG 

CCAGNAGGTTGAAGTTGGCCTTNAAGGGGG 

Sequence 1252 

CCGGGCAGGTACTATACTTCCATTAGAGAC I II I I I I I I I GAGATGGAGTCTCTCTCTGT 

CGCTCAGGCTGGAGATTGCAGTGAGCCAAAATTGTGCCACTGCACTCACTCCAAACTGGG 

CGACAGAGTGAGACTCCGTCTTAAAAAATAATAAAAATAAAATAAAATAAAATGGTAAGG 

CACATGTTGTAGTCATTATTGAGAACAAAAGAAAAGCAAAATTAACTGAGAATCTTTGAA 

GTTAGGTATTGTTAACTACTGATTGTCTAAATCCACCACTTCAGTTTGTTTTAAAGTCTG 

CTTTTCTATCAAAATTTTGAACATTCACCGTAAAGTATGCCTCCAGAAATGACCACTAAT 

ATATTATTAAACTATTCTTAACTATTCCATGCATAGTGGGAGATATTGAGGTCTATTGCT 

TG ATATAAG AAAATT AAAACTG N ATTTCTTCTTTCTTTT CCCTG CTG G G AG GG G T AAATT 

TACCAAAMTAATCNGATAGGTTTCCGTGGGCNCCTGGTTNAAAGTNGGAAAATGCCNCT 

CAAAAATAAATTACCAATTTNAANGGGNGCCCTACCAGMTTTTTTATAAATNGCCCCAA 

GGTTAAATTTATTCTTTCANAAAAAATTNTTGCCCCCAATACCAGGTATTGTTTACCC 

GATACCCAATTAAAGGAATTCCAATTTN I I I I I I I I I CTTTGAANANCTTTTGGCTTGGG 

TTTTTTTGGTTTGGTTTGGTNTGGGTTGGGTTNGGTTGG 

Sequence 1253 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1254 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTTAATCCTCCACTGA 
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CTTTGGAAA MCA ACAAMTGCCAGCTTCCACCAGATCAmCTTCTTAATrAAAAGTTA 

GCCAGATTCTTTTGGAGCACTCAAT7TTCTGGTCAGGTCAACAAAAAAGTAATGAACACC 

ACACACACCTCCCTCCCCTGTCTTCCTAGCCCCATTCCTGGAGTTATGCAACCCTTGGGG 

GCGGCATACACTrCAGCTATTTGCTCTGTGTrCATAATTAGATTGTCTAGACAGGAGATG 

TTGCTGGAGCAATAGGAAAATATTTTACAAGATGTTCCGCAGAACCACACTTATAACAGT 

GATTAGGACACACTGCCAGACGCCGAAGCCAGCAGAATTGGCCCGGGCCACGAGCCGGTT 

TCACCCAGAGCAGAAGCCCTTTCAAGCTCTCATTTCAGGAAGAGGTGCCAAATTAGAGCT 

CTGAGGGCATGGAGCAGCAACTTCTTGGCCTTTGGATCCGAGAAAGCAAGGNTGAGATAA 

TAGAATGATGTCTGCGATGCCCTTCT 

Sequence 1255 

AGGTACGCGGGACAAAAAAGGGAGTTCTACCCCTACATCCTCATAGCTGAGTCACCAGGC 

TGACTGCATGGATGGCTCCTCCCTGGAGAGAGGAGGACCATCTGACTGCCAAGATGGGGC 

CAGCGCCACCTCTTTGAGCCCCTCCTAGGGTGCTGAAACAGCAGCTGAATGGCTGCTGGG 

CTTAGCATGGCTGAGGTGGCATCATGAGAGAGAGGCTGGGGATGATGTGACATGGACCTT 

CCCCGCCGGAAACCAAGTCCCGCATCAATCCCGCTGAGCCCGGCAGCTTCACTTGCTGTG 

AACTGAGGACAGCTTGCAGGTCAGGTTGTCATGTGAGCTAAGGACAAAACTCCAGCCTGC 

TAAGGTATNAGAATATGTGCCCTGTGGGGGATGAAGATGACCGGAAGGAGGCAGGGAAGG 

GTTGTGGG CACCGT TCAGGTCCTTTGGTGTGGNTGCTGCAGTCNAGTTTTTGAGCCATTT 

CATTCCAGCl Nil I GCTTATTGGTCTTACACCACTTTTTCGGGATTATATTTTCATATT 

T CAAT AAAAMGANGTTTTCNAAMNTTTCCMGCTTNGCTTTGGGGMGANTTCCATTC 

CTTTTACTTCTTGAATAGCCi I II I I I I IAGTCCCTTTCCCCCA 

Sequence 1256 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 

CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGT. 

Sequence 1257 

AGGTACGCGGGAGCGGGCTTATTTTTGAAAGGCATCTGTTACTTCAGTGGCATAAAGTGC 

CCTCACACTGCTGTGCAGCCATCACCACCATTCATCTCCAGAATTTGTTCTCAGTCCCAA 

ACTGAAACTATACCA TTCAAACA ACAGCGCTCCCCATTTCCCCCTCCCCCGAGCCCACGA 

CATCTCCTTATTATTA I I I I I I I I GAGACAGGGTGTTTCTCTGTTGCCCAGGCTGGAGTG 

CGGTGGTGTGATCTCGGCTCACTGCAGCTTCCACCTCCCGGGTTCAAGCGATTCTTGTGC 

C T PAGC CTCCCAAGCAGCTGGGACTATAGGCACGCACCACCACACCGAGCTAAGTTCTGT 

ATTTTTAGCAGAGACGGGATTTCACTAGGTTGCCCAAGCTGGTCTCAAACTACTGACCTC 

AAGTGATCCACCCGCCTTNGCCTTCCAAAAGTGCTGGGATTACANGTGGTGAGCCACCGC 

GCCTGGACCACATCTNCTTATTCTTAAATGGATGNGGTTTCTCAATGGGTTGGATTTAAT 

TCTTGCTGGGCCATTTTANCCCCTGTCATGGCTTTAATTCGCCCATTCTTAAAGCCAGAN 

GAATACTGGCTTTGACCGGNTNTTAAACCCANC7TNTTCCCTTGAACTTGNAAGGCCTGG 

^ ANTA TTCCMGTANGGTNTAACTGGGAAAATTTTGTAACCTTGGCCCCGGGCCGGGCCG 

^^TTAAAAANTTAAGTGGGAATTCCCCCCGGGCCTGCANGGGAATTTCCAAANTTAAAN 

GCCTTTNTTCCGATACCCGGCCAANCCTCTAAGGGGGGGGGNCCCCG 

Sequence 1258 

AGGTACAGGTTTTTGTGTGGATGGATGTTTTCTTGGAAGCTAACTTTTCGAACCAGACAT 
CTAGTAGCGTTTTTGTGGCTTTCTGGACGTTCAACTCTTTGAACAGTTGMCTGTGGGM 
TCTTGCCCATGTCCTGCCTCACCTTCCTCCAGGGAACTC 
Sequence 1259 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGTCCTGTTTTATTAGGTCTTTAAGAATGG 
GAGTTACTCGCCCAGAGCAAATAAAGATTAACTGGCACATAGTAGGAAGCCCAAAAGGAA 
CCAGTGAAGCAA CAGA AGATAGAGTGAATGAATAAAGACACATGAATGCACTAAATCTTT 
TGTGCACATGGCTTTTACTGATTTTCACCTGTAATGTCCTATAAGGCTATAGACAGTCAA 
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ACTAAAAAATTGCTGTCATTGGGAGAGAGGCGGGTGGATCACCTGGGGTCAGGAGTTCAA 

GACCAGCCTGACCAACATGGAGAAACCCCGTCTCTACTAAAAATACAAAAATTGGCCGGG 

CATGATGGTGGGCGCCTGTAATCTCAATTACTTGGGAGACTGAGGCAGGCGAATCGCTTG 

AACCTGGGAGGTGGCGGTTGCAGTGAGCCGAGATCACACCATTGCACTTCCAGCCTGGGC 

AACAGAGTGAGACTCCCATCCCCNNNNAAAAAANNNNNNNNNNNNNNGTTCC7TCGGGCC 

CGCT 

Sequence 1260 

CCGGGCAGGTACCAGGGCACTCCAMTATGTAATTTAACACATTCATTAATAACTATGAT 

AAATAGATATTTTATTCCMGMCAATATCATTAGAGAGTTAGATTGGAGGGCAAGGGGA 

TTMGTATCTTAAGTTTATTGTTTCCCGAGTGACCATTTTTAAMGCCTCCTTCCAGGM 

CTGTGGAATGCCTGCCAGGCTCCCTGACTGCTGTCTGGTGAGGGTCCCTACTGGTGCTGT 

GGACAGCACAATACACAATGGGGCACAAGAAAGCCACGCAGCCTCCCCCTCCTAGACACA 

TGCTGTGCACCCCCTTGGCTCTGCTTGGAAGGTTCAGCTCTGCAAATGGAAGGTTGCAAA 

CATCAGACACCTTCTTTCTGTTTCTGTCTCTCTCCCCTTCTTCTCCCCACAGTGTCCAGC 

CTTGGCCCCTGACTTTTCCTGGGGTTGGGGAGGCCTGTAGGGTTCAGAAAGCTGAGCAGC 

TTNGGTGAAGACCTCCCTTAGCTCAATGTGCTCTGGCCATAGGGGAGCAGGACCAAGCAA 

GCTGAGTTTTGGGGG 

Sequence 1261 

GCGGCCGGGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGC 
GTGTTCTCTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAG 
GGTTGGGCCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1262 

GGACTCNCGGGTGNGGCGNGTNAGGGGGAGNTCGCTCTGCNCCAGCTGGAGTGAGTGCNC 
ATCNGGCTACTGAAGCTCTGCTCGGTGTNAAGCGATNCTCCTGCTCAGCTTCTAAGTANT 
AGGATACAGGCGCNCCGTCACCACGCTTGCCTMTTTTTGATTTTTAGTAGANCCGAGGT 
TNCACCAAGTTGGCAAGATGGTCTCAANCTTCTGNCTCGTGACCGCCGCTCGGCTCCAAA 
. GTGCCGGGATTACCAGGCCGTGAGCCACCGCCC 
Sequence 1263 

TAATACGACTCACTATAGGGGNCGAAATTGGGAGNCTCCACCGCGGGTTGGGCGNGCCGG 

GTTGGGCGNGGGNACGCAAGGGGGGGAGGGCAAGGGGGTGGACGGGCCCCGAAATCTTCC 

TCAACAAGCTCCCAAGGGAACTGGCTTGCGCCTTGGTTGGGTTGGGCGGGCGGGCTTCCC 

TCGCGGCTTTTCCTTTTCCCCCGCGGTTACCTTCGGGCCCGCTTCTAAGAAACTAAGTTG 

GGAATCCCCCCCNGGGGCTTGCAAGGAMTTTCCNAATAATCAAAAGCCTTAATCGGAAT 

ACCCGGTCGAACCTTNGTAAGGGGGGGGGGGGCCCCCGNGTACCCCAAGCl I I I I i iGTT 

TCCCCTTTTAAGTGGMGGGGGTTTAAAATTTGCCGCCGCCTTTGGGCCGTTAAATCNAA 

TTGGGGTCAATAAGCCTGGTTTTTCCCTTGGTGGTTGGAAAAATTTTGGTTTAATCCCG^ 

CTTCAAACAAANTTCCCAACAACAAACCAATTACCGGAANCCCCGGGGGAAGGCCAATTA 

AAAAAGGTTGGTTAAAAAGNCCCTTGGGGGGGTTGGCCCCTAAAAATTGGAAGGTTGGAA 

GGCCTTAAACCTTCAACCAATTTTAAAATTTTGGCGGTTTTGGCCGG 

GCCCCCGGCTTTTTTCCCAAGTTCCGGGGGGAAAAAACCCTTGGTTCCGGTTGGCCCAAA 

GNCCTTGGCCAATTTTAMTTGGAAAATTCGGGGCCCCAAACGGCCCCCCGGGNNGGGAA 

GNAAGGGCCGGG 

Sequence 1264 

CCGGGCAGGTACGCGGGGATATTTCACATGTGCCTGTCCCAGAGGAATTCCTTAACTCTC 

CCTTCCTGGAACCGAAATGCTTCCAMGACAGAGAGCTCCTATATATATTTCTCTGTTTT 

CCCCAGAGGACCGTGCATGAAAGGCGCCTTTCCTGGCGCGTGTGTCTCTGACCCTGTGGA 

TCTTCTGGACAATTCCCGGAATTTCTTGCAGGGCAAGCACTTCCCCTTAAGTCGCTCTTT 

CTCGTTTGGGATAGTCAGTATTTATATAGNCACTATTCTCTTTTTGGGTAGCCATCTAAT 

TTTATACTGNTAGTCAAGCTCTACTTCCmCCCCTAAGAGTTTGMTTTATTTTTCm 
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AAACCGCTACACAAACCCCAAATTNAAACTCAACCAAAAGCGGGCTTTACCTAACTTTCT 
AACTACCTACCAAGGAAAAATATTCCACCAAAAGGGAGAAGAAACCGGAGAAAGCTTTGG 
CAATTTCCNGGGANGAAAAAAAAAAAAGGAATTTCCCCNA 
Sequence 1265 

CCGGGCAGGTACTAATTAGCCAGTCATGATCTCCATTCTTACCAGGGTCGTTACCCTGGA 

CTTGGCACTACAGGGTCTTGTTGTCTTTGCTTTTGCCTTCTTTTCCAGCTCCTTAAAGCT 

TTTGTAAGCACAAAGTAACAAGAGGAAAGAGAAACATGATGTGAATGTCAGAAAGAACAA 

ATACTTGAGACCA CCTTG AAGGGAMCTTCCAGNCCTTTATTCATTACTCGNTCATTCAT 

TCACTTATCAMTATTTTTAGGGCATGTTCAATGTGCCTGGAACCATGCTAATTGCTAAC 

ATTACAAATACCACTTAA7TACATAAAAAGGNTGNCAGCCTTCCAGGATAACTCTTNTAT 

TTTGGTAGCCCAATAAAGGGGCCCCATGCATCTGGTCTGNGGGACTGGAAGAAAATTCAG 

T NACTCTTTNCTAAAAGAATACNGGGAGGGATGNCCCCTGGCCTGGCTGGAGGTNACCCT 

NTTTTNTNTGGCCNCTGGCTTCTGGGCNmGGGCCTCCCTGGTCATTCACAGGCTTTTT 
TTTT 

Sequence 1266 

CCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGGGGATACGCTTCCATAGCAAAGGGTTGC 

CCTTGGCTACAGGAGGACGCGGGCTGGCAGGCTTGGTTCCTGGCGAGTTTCTAAGCCCCC 

GCCTGCGGTCTGAGGCACCGGCTGAACCATGTCGGAGATCCTGTGCCAGTGGCTCAACAA 

GGAGTTGAAGGTGTCCCGGACCGTGAGTCCCAAGTCATTTGCAAAGGCATTTTCCAGTGG 

CTATCTACTTGGAGAAGTTCTACGCAAGTTTGAACTTCAGGATGATTTTTCAGAATTT^ 

GGACAGCAGGGTTTCAAGTGCCAMCTTMTAATTTTTCTCGCTTGGAGCCAACACTTCA 

CCTTCTGGGTGTGCAGTTTGATCAGAATGTGGCCCATGGCATCATCACAGAAAAGCCTGG 

GGTGGCAACAAAGCTGTTATATTCAATTGTACCTCGGCCCGCTCTAGAACTAAGTGGG 
Sequence 1267 

AGGTACAAGCCAATGCCNGCGTCAGAGCTGCTTTTGTGCATGCCTTTGGAGATCTATTTC 

AGAGTATCAGTGTGCTAATTAGTGCACTTATTATCTACTTTAAGCCAGAGTATAAAATAG 

CCGACCCAATCTGCACATTCATCTTTTCCATCCTGGTCTTGGCCAGCACCATCACTATCT 

TAAAGGACTTCT CCAT CTTACTCATGGAAGGTGTGCCAAAGAGCCTGAATTACAGTGGTG 

TGAAAGAGCTTATTTTAGCAGTCGACGGGGTGCTGTCTGTGCACAGCCTGCACATCTGGT 

CTCTAACAATGAATCAAGTAATTCTCTCAGCTCATGTTGCTACAAGCAGCCAGCCGTAAC 

AGCCAAGTGGTTCGGAGAGAAATTGCTAAAGCCCTTAGCAAAAGCTTT 

Sequence 1268 

CAGAGTTTCACTCTTGTCGCCTAGGCTAGAGAGCAATGACATGACATCAGCTCACTGCAA 

CCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCAAAGTAACTGNGACTAC 

AG GCGTG CGCCACTATGCCCTGCTAACTTTTTTGTATTTTTAGTAGNAGACGGGGTTTCA 

CCATTTTGGCCAGGCTAGTCTNAAATTCCTGACCTCAGGNGATCTGCCCACCTCAACCTC 

CCAAAGNGCTGGGATTACAGNCANGAGCCACGGACGCCCGGCCTCTACTTTTCTACCTTT 

GACTAACCTGCTGTAG TATGN GACCACAGAACAGATTCCCTACTTTNANAAAGCTTCAGA 

CCAAAATAAACCAAACATTTTACCAGTTrTTTAAAAAATTTAACTCA 

Sequence 1269 

CCGGGCAGGTACGTATTCCAAGGAAGCTGAATGAACTTCTATAAGTATCCTTTATACAGC 

TAAAACTCATCTTGATAGTCATGTTTTAATAAATTTAGAACAATTATCAGTAAAAGAAAT 

ATAGTTGGGCTGGGCACAGTGGCTCACACCTGTAATCCCAGCACTCTGGGAGGCCGAGGC 
AGGTGGTACCT 

Sequence 1270 

CCGGGCAGGTACCCCTGTTGTAAGTCCAGTGAAGGGGTCTAGGGAACTCTCCCGTCTCAT 
AGGAGTTCCTGATGCCTCCAGTTGACACACCTTTTCTCTGCCATATGGCACTCTGTCTGG 
ACATCTGGTTTACTATCCTTTTCTTTGTCCATCAGATCTTACCACTTTCCTGAAAGTATC 
CTCAGCGACCTCAGACTGTGTGTCCCTTTTCTCAATGGGAATAATCTTAATTCTTGTGAC 
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TCCCGCGTACCT 
Sequence 1271 

CCGGGCAGGTACCTTAGAATCAGCTCACTGTTTGAGAGTAAAGCTCTTTACAGTTGTTTA 

CCCAGAMTAGACTGTTTAAA7TCCTTTGCATCTCAGCTCTCTGGTGTCTTCTGAACCCA 

CTCACCAGCTGTGGTGAGGATTCTGGCTGGTGGCCATCTTGTGCTCTATAGCTCTAGCCT 

ACAGGAATATTCTGAAGAGATGTGAGGAAAAGGAAAAAAAAAAGTACCT 

Sequence 1272 

AGGTACCCTAGACTCTCTTTGGAGGACTCCCAAGGGCTCCTGAAGCTGCTTTGCCCTCCT 

TTGC AGAGAG CAAGAGGTGCCAGGGATTTATGTCCTGGCAGGGCAGTGCTGG ATCAATG A 

CTTAGTAGGAGGTCTGGAAAGCCTAGTGCCCTTGCCTTGAGTCTGGACACACTTTGGAGT 

TGCAGTTCACATCCAGGCTGCAGCTGGGGCTCTCCCAGCACTTCCTTCAGTCACACACAC 

ATGGATTCTTGTCTTGGTGTCTGGTTTGGGGGMCCTGACTTMGTCACrnTGTGGACA 

TTTTGGAGAGCAGGTTATTTCAGCCGGCAACCAGGAACTAGACTCCCGCGTACCTGCCCG 

Sequence 1273 

AGGTACGCGGGGGACTGTCATGTAAGAAGTCACAAGAATTAAGATTATTCCCATTGAGAA 

AAGGGACACACAGTCTGAGGTCGCTGAGGATACTTTCAGGAAAGTGGTAAGATCTGATGG 

ACAAAGAAAAGGATAGTAAACCAGATGTCCAGACAGAGTGCCATATGGCAGAGAAAAGGT 

GTGTCAACTGGAGGCATCAGGAACTCCTATGAGACGCGAGAGTTCCCTAGACCCCTTCAC 

TGGACTTGCAACAGGGGTACCTGCCCG 

Sequence 1274 

CCCGGGCAGGTACCTATGCGGAAAGTCCCTTCCTTCCAGANGCCTCCACTGANTTNTGAT 

GCCAAGGAGCTTCTGCGGNGCCCTGCACGCACCTTTACAGATGCAGGTGGCTGTTTCCTG 

TGTCAGACTGCAAGCTCCCGTGAGCTNGGTTCATTGCTAATNGTNGACTTGGCCTGGCAA 

ATAATGACTTGGCCTGGTAAATAATGATTACACGAACAAATGTAACTAAGACATCCAGAT 

ACAATCTAATGCATTTCATGCMCTCGTGTAACCMGTGAGGTTTTCCAGCTGTTTCAGC 

TTCTATTATAACTTACATTACAGCCCCGCCGTACCTG 

Sequence 1275 

AGGTACGCGGGAGGCTTGGACATTTTCTCCTGCCCTACCGTCTTTGTGGTCCCAAGTGTC 

ATGCCCCAGTAAATGCTTCCATGCCTGTTCCTGAACCCAAAGAACTGTTATTGAATGGGA 

CAGAGACTGAATGACTAAGGAAACCCTAGCTGCCTCCAGCCCCACACTTGGCTAAGCTGT 

AATCAGCTCTGTGTGAGGCTGGGCATGGAGTAGGTGTAGTCAATGTCCTCCTCTCTGTCT 

CGTCTCGGTTGTATCTTCATTTGGTGCCGCTCTCACGGACGCCTATCTTCCTCTCTGTCT 

GTTCCTATTGTCTTTCCAGGCTCCTGGCTGNCCTTTCTGTCATANCANCCCCAGTGATTT 

TGATAATGTCAGCGNGGAGCCCCAAAGCTGGTCGCCCATNTGTGATAAGTTGGATAGGGG 

ANCCC NCATG AAAATGCTTG AGGGGAAA 

Sequence 1276 

CCGGGCAGGTACGCGGGGGAGCCTTACAGGCTTACAATACAAACTTGAACAGTCAGGATG 

TTTCACTGCAAATCTTTACAGAAGTGCTCTTCGACTGATAGTCATTGTGGAGGAAAAGAA 

ACAAATGCTGAACTAGGAGGCMCTTGGGGTTTTMTATGTTCCTTAAGTTTGCATCCTC 

CCACCTTCCCTGGATGAAAACCACTGGTGATGTGATTTAATTCTTTTGGCTGAGTGAGAT 

AGACAAGAGGAATGCAGAAATGGTACCT 

Sequence 1277 

AGGTACACAGGGCGGGGAGCCTCGGGAGCGGCTCCCTTAGCAGGACCTGAGGGTTTCGGG 

AGAGTCCTGGCTTTGCACTTAGTTTTCCCGCTGCCTGGCTGCCTCCGTGGCGGTTGTACT 

GCTTTTTATGCAGGATGTAGGATCGAAGGCAAATCATCCTTTCCCTACGTCCCCTGCCAA 

CCCCTGCCGCTCTCCTCTATTTTGTACCTGCCCG 

Sequence 1278 

AGGTACTCATCACGTTGAAGGGTCATCATATGTGTTCCCATTTTTTTCTCTGCAGGAAAG 
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TGAGCTTCTGGAAGGCTGTATCTGAGTCCTGTTCCTCCAATATTCAATGAATGCTGTCTG 

AATGGATCAATCAAATGGAGCTCT 

Sequence 1279 

ACTTTA CAGTT TACAAGGTGTTAmCATTACATTTTGCACTACGGTTCTAGCATAGCCA 

TTGTTATTTTTATTCACAGACAGAAAAAAATGAGGTTGTCTCACTCATATAACCTGGCCT 

CTCTCAGCTCTGGTTGCTGTGGAGAGTAGCATAAGTGATATCTCTTCAATGGTGGTTCTA 

GTGC ACAAAGAGAAAACAAAACGTTGTCTTCTGGCCTTCCAATGGCAGAATTTGACCTGC 

TTTTGGCTTTGAAATGCCTCATATGAM^ 

CTTGGGCTMG ATTAAAATTMTTGGTCTTTTC ATG ATCACACATC AGG AGAGTAATAG G 

ACCAGGCACCTGAGACCAAGCCAGGGATTAAAGATTGGTTATGGGA 

Sequence 1280 

GAATTGGACTCCACCGCGGTGGCGGCCGAGGTACNCNGGGCTTGCATAATTTGGCATTAG 

CAACCTCAAAAACTCTGGAAAAGGCTTCATTTTCTCCAGTCTCCTGGGAGAGGAGAGGCA 

CCATGAAGGCAGACCCATCCAGAGAACACCTGCGACAGGCTGAGAAGCCATTGGGAGACA 

CACTTCTGAACGCCACCACTGGAAAATCACACATGCTGAAATGGGAGAGTTCCCTGACCC 

CCTTGCAGGATATGTGACAGGAGTGTGGCTCATCTTGTTCAGCTGGGAGTGCATACTCAA 

ACCCCTTATGAGACAAAGGNAGTATGCATACCAGAAGGNTGCCAGGGAACTGGGGGAAGC 

CNAAAATAATAAACTNAGTTTAAATTTGATCCCTCCAAAGAAGTTTAAMGCCGGGTTTT 

TAAAATANTTTACTTGGACCAGGTTAAATTATTNAANCCNANNTTGGGTNGTTTNGNAAC 

CCCCCCMTTGGAATTCCTTCAATTNGAAATTCMTTAGGATTANGGCAAACCTGGCCTT 

TTNANTTNGGGAACCATTTTGGAANGGCTrTTTTATTNNNCCCATGCCTTTGCCCTANAA 
AAAAAANTAAAAATTGCCACCCCTTTNAAA I I I I I 1 1 1 AAA 
Sequence 1281 

CCGGGCAGGTACCTTCAGTCCCCAGCCTCTCACAGTAGGGCTTTGCTCAGTATACCCCAG 

TGTCCCTCAGCGTTCTTAGCAC7TGTCCTGACAGTGTCCCAGATCCGATATCAACTGCAT 

GAAATTTGCCAATGAGGGTGCTATTAGTTTCTGTAGAACGTCATCAAACCAGAGTTCCCC 

AAACCTGAATTCTCTTCCATTATCCA7TGGTTGCATTTTTAATCACGTATTCCAGGACCT 
GCTCTGTACCT 

Sequence 1282 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCAACAACAGCCAAGCTCAGAGCCAA 

ATCAGGAAGGCAATACCATTCACAACTGCCACAAAAAGAATAAAATACTTAGGAACACAG 

CCAGCCAGAGAGGTGAAAGACCCCTGCAATGAGAATGCAAAGCACTCTTCAAAGAAATCA 

GAGATGACACAAACAAATGGAAAAAAAGTCCATCCTCATGGAGAAGAAGATTTAGAGGCT 

ACTGATTAGGAGTTTCAGTGGAAAGTGGGGTCTGAAGCCAAATTGCAGAAGTCTGAGGTG 

TA TGCA TG CAGTAAGTAAA GAGACAGCAGATAATGAATCTTCTTTAGAACACAGAGCAAT 

TGTTTTCCAi rTTTTTTTl ICATGGAAGGGAAAGCAAGTGTCTAATATATCATGACAGCC 

CCATGTTGTTCTCTAAACACTACTATTGCATCCTGAATTTTCT 

Sequence 1283 

GCACAGACGTAGTCTCATCGAGAATTAACACAGGGAGCTTTCCAGAGACCTTTAGCAAAA 
GTGTTCANATACTGAACCAAAACAGAAAGGATGGCCCTTTCTGGAGCTCCTTCCCCAGCA 
CAATCCCGCGTACCTNGGCCGCTCTAGAACTA 
Sequence 1284 

CCGGGCAGGTGGAGGGACACCAGTGAGGCTGCAGGAGGTTCTCCAGCAATATGGCACAAC 
ATAGAGACCACTCACTAGGACACTGTGCTTTTATCATAACCCATTATTGTTGGCTGAGCT 
TGCTTAAAAAACTGAGCAGACTTTAACAACTTGCAGACTTCACTTAAAGTCCAGATTCCC 
AGCTTCTCTTCTCTTTAAAAAAAAATTAATAGAACTGCAAATCAGCAGCCCTAGGACTAT 

ATTCCAATATGGTAACAACTGGCTGGATTTCAAAAGCCACTACCTCTCCTGGGTTTGCCA 

CAATCCCCAGTACCT 

Sequence 1285 
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AGGTACCTNATTCAGGTCTGATCTGGGACCCCGTGTTGTTNNGCNTTGGNNGGNACAATA 

GAAAANAAAAANANTTNTTCTNCTTCTTGCCATCCAAGCCAGTCTGGAGAAGATGGACCC 

CTTACTCTGAGCTGGTCAGAAATATGTAGATCCCCTTCCTCATGATTCATCCATCTCTCG 

TGATCCCTCAAACTTTTCCCTGGAGGATCTTAGCTNAGAAAAATGTCTATCCAGCCAAAA 

GAATTAAATCACATTCACCCAGTGGGTTTTCATCCCAGGGGAANGGTGGGGANGGATTGN 

CAAAANCTTAAGGGAACCATTATTAAAAAAACCCCCAAG 

Sequence 1286 

AGGTACMTTTATTTAGAGACACTTTTAACTAAGAAAAAATAAGAGAAAGAAAATAGGCT 

CATACTTCTTGGCATATGTCAAAAmGTmGACATTCCCTGAAATGmGCATTCTT^ 

AATTTTCCAGTAMCMCTCAGTTTTGGMCACTGGTGAA GTTAAAAAA AAATCCCCTCT 

TAAAGGACTTCACAAATTTACTTCTTAAAACATAGAAAGACI I I I I II I ICCTTTTCCTC 

ACATCTCTTTCAGAATATTCCTGTANGCTAGAGCCTATTAGGAGCCACCAAGGATGGGCC 

ACCCANCCCAAGAAATCCTCCACCCACAAGGNTGNG TGGA GTGGGGTTTCAAGAANGAAC 

ACCAGGAAGGAGGCTGGNNATTGCCAAAAGGNGAAATTTTAA 

Sequence 1287 

GGGAAGCTCCACCGTCGCGTGGNGCGGGCCGNAGNGTACCGCGGGGGAACTTGCNAATAA 

GGGAAAATCAATTGTTCTGGGTTCGCCGGGCAAAAGGGCGGGAAAAAAGGGCTTTGGGGG 

GAAAAGGGGTGGGTNGCNTAAAGCCGCCANTCGTAAGGGTGGCNTCCCCGGGNGAATAAA 

CAATCCCMNGGGCMTTACAAAAMCCCGGGCTTATTTCGCCCGGTTTTGGGCTTCGNG 

CNCCGGGTGGGGCCGNTCAAANCCGCATTTTCCCGGGTCTTTAATCTTATG GAAGG GAAG 

AACTTCGAAAGGGTGGTTGCCTTTAAAAGGGTTTTTCTTTMGAAGAAAACC 

TTCGGAAGNAACGCCCCGTCCACCCTATTACCGGGAAGCCACCGCCCC/W^\GCCGGCCC 

AAAAAACTTGTTCAACAAAGCCCCATTGGGGATTGGTTAAGTTATTAATTGGCCCCCTTA 

AAAAAACCGT 
Sequence 1288 

NNCCTCCACCGCGGGTGGCGGNTCGTCNNAAGCCACANTNTAAGGATNNTTAANCCTCTT 

TGCNGTGGGNAAGAGNGGGNGTGCTGNAAANAAANAAAACCNTTTTNCCCATTGTCCCNG 

GCGCTCCCACCTCATAGCACAAGTTAGGTCTGGCGCCCGTCCTAANGGACCAAAGGGGTC 

ATTATTAAMGGTTTGAAAGGTGAAATGTTGCTCTTGGGATCCCATTCAAGGTACCCTCN 

GNNNCCCGCNTCTTNCGAACCTTAAGTTGGGGATCCCCCCCNNGNGGNCNTGGCCATNGG 

AAATTTTNCGNNTTANTTCNAGANGCCTTTAATTCCGAATTNCCCGNTCCGNACCCTTCT 

TTAGGGGGGGGGGGGCCCCCCGGG 

Sequence 1289 

CCGGGCAGGTACTAGAAACCTAGCAGGCATTAATAATT GTTGAGG CAATGACTCTGAGGC 

TATATCTGGGCCTTGTCATTATTTATCATTTATATTTGTAT Mill I CTGAAATTTGAGG 

GCCAAGAAAACATTGACTTTGACTGAGGAGGTCACATCTGTGCCATCTCTGCAAATCAAT 

CAGCACCACTGAAATAACTACTTAGCATTCTGCTGAGCTTTCCCTGCTCAGTAGAGACAA 

ATATACTCATCCCCCACCTCAGTGAGCTTGTTTAGGCAACCAGGATTAGAGCTGCTCAGG 

TTCCCAACCGTCTCCTGCCACATCGGGTTCTCAAAATGGAAAGAATGGTTTATGCCAAAT 

CACTTTTC 
Sequence 1290 

CCGGGCAGGTACAGTGACACTGTCACGGCCCACTGCAGCCTTGACCTCCCAGGCCCAAGC 

TATCCTCAGCCTCCTGAGTAGCTGGGAGCACAGACGCGCGTCACCAAACCAGGCTAATGA 

ATTTTTTTTT1 1 GTAGAGGCAGCATTTCACTACGTTGTCCAAGCTGGTCTCAAACTCCTG 

GGCTCAAGTGATCTTCCCGCCTCGGCCTCCCAAAGTGCTGGGACCACAGGCAAGAGCCAC 

CACACCCGGCCTTTTATTTTATCCCAACTGTAATAAACAGTTTCCTTCACATGCTCTGTG 

CTTCCCACTTCCATATGTTTGCTCAAGCTGCTGTTGCCTGCATCACCACTAAGGTTCCCA 

CATGTGTCGGATGTTTCACCTGGCGCCTTCAGCTGCAAGCTCCTCCGTCTCCCCGCGTAC 

CTT 
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Sequence 1291 

AGGTACTTGCTCTTTTCTAAGATTTGCCTCCAAATTGCTACTGAAGCACATTACCTGTCT 

TACTATGAATAATTGTAATTGTTTCTCAGTGAGCATGCTAGCTACTTTAGATATATTATG 

TAATTTAATCTTCAGAATCTGTGACAGAGAAAAAAAAATCTTTGTTTCATAAACGAGGAA 

ACTGAAGTCCTCAGAGGGTAAAGTCAGTTCTTTGCCCAGCACACAAATCACGTTGTGGTG 

AAACTGACATTAAGCTTGGGTTTCTCTGACTCCAAAGGCCATGCAGTCTGCAAGACTGCG 

GCATTTGTTGGCATTTGCACCACGTAGTAGCACTGCAGGCAGCCAGGGAAAGGGGAGCTG 

AGATAGGTGAGGGTGGTCCTCATGGAGGCGATTCTGTGTGCTTGCATCCTGGAACCCCTT 

CATTCCCTTTGTGCTGTACCTGCCCG 

Sequence 1292 

CCGGGCAGGTACCTGGATGTTAGAGGGATTTGTCTAGACAGGCAGATAGATATGATCCAG 

TGCCAACACTGTAATGCTGGCTGAMTCCAGATGGTTGTTTTATTGTTAATAAAATCTAT 

CTTGTGG GTGGGTGGGTGTCAGGCCTTCATCATTATATGGTG CCTGTCAGATTTATCTAC 

AGACAGAACCCTGAAAACAGCTTGTGCCTGCCCCGCATTTTTGACCCCTTGCCAAGATAC 

TGTACCT 

Sequence 1293 

CCGGGCAGGTACTGCTCCAAGAACTCTAACTTCATCATCACTCCGACCCCCACGGTCCGC 

CACTCCTGTCTCCACATACCACTGTCCTGCTCGGTTGATCCGAAGAATGGGCTCCCGGCT 

CTCTACATCAGAACTTGGTTCAATGATCTGAGGGCTGGTGGACTCCTCAGGAGGACTTAG 

CTCTCTGATATCCAGCACAGCACTAACCTGACCTCGCCGGTTTCCCTTTGGGGTATTATG 

TCGTGGGCTAAGGGAGCGGTTGAGATTTGGTGTAACCCTGTGAGACTCTGGAGGTCTCTC 

TTGATGCTTTGTCCTTGTTTGACTT^ 

AATGAATGCCCTCCAGGATCTGTGTACCT 

Sequence 1294 

CCGGGCAGGTAC rTTTTTTTTTTTTTTI I CAGGTGTGAAATATTGCTATTAATAGTGTCT 

ATCTCATAGGGNGGCTAAGATGATTAAATGAGATGATATAGTTTAAGTTCTCAAAAAAGC 

CAACAATGATTGTTACCTTTCTTTTAGGNGTTGGGTATCAGCCTCAATTTCTAAAGAANA 

ATCTCTTTTGCCTCCCAATTGTGCTGAATTGGAAACCTTACCTATGTATTTATTTATTCA 

TTCAAAAACCATCTATTATAAACCATATTTGTGCCAAACTCTAAGCCAGTTACTGAAAAC 

ACATAAGTCAATAACAGCACAATCCTTATCCTCCAGGAGTTCATAATCCAGTCAGGGAAA 

AGGAGACACACAGATAAAGAGAGGAGGGATAGAGCGAANACATTGGAGAAATCAGGAGCA 

TACCAACACCCTAATTTTTCACAGCAGGGACCCGCGTACCTCGGCCGCTCTTAGAACTAG 

TGGGATCCCCCGGGCTGGCAGGAATTCGATATCAAAGCCTTATCGGATACCCGTCGACCT 

CGAGGGGGGGGGCCCCGGTTCCCAAGCTTTTT 

Sequence' 1295 

CGTACCCGGTGCNGGNNNGCATGATGGTGACATGCGAATGGTGATAGTAACAGAAACCAT 

TCAGCCTCI III! ICCACTTCTGGATCAGAGTTTATTCTAAATTCAGAAGCACAGTTTCA 

GACATCTCAATTTTGGATTATGGAAAAGACAAAAGCAATAATGAAACATCTACATAGTCA 

GAATGTAAAATTGCTCATATCTAGTGTGAAACAACCAGATTTAGTTAGTTATTATGCAGG 

GGTGAATGGCATATCAGTGGTTGAGTGTTTATCATCAGAAGAAGTTTCTCTTATCCGGAG 

GATCATTGGTCTTTCTCCATTTGT 

Sequence 1296 

GGAGCTCCACCCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAGTTGTGGGGAAGAT 

TAGCTCCCTGCTCAGCCTCTCTGATACMCTGGGCAGCTAGTTTTTGCTTCTGCAACAAG 

ACATAGGTTTATGATGAGCTTCCTGTTCATCTCTGCTGAAACTAGGGAAAGTGGTTCTAG 

GTTGGAGGCGTTAGCAGCATTTTGGCTGGGGTATATGGGAGGCAATATGGAAACCATCAA 

ACTTGTTCTCACGTCATTCTGCAAGTCCTGGTGTCCTGAGGTGTGCTGAAGCCCTATCCC 

CAAGATGACTATATTTGGAGACAGGA I I I I I GGAGGAATATTTCATCTTTTGGTNGTAGA 

AAATTCAAAGCATTCAAGTTTA 
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Sequence 1297 

TCTCCACCCGCTGGNGGCGGTCGCAGGTACCTGCCCGCAGGAGGGGAAGGATGGGCTTCC 

TGCNGNGCTCCNTNGCCCCACGGANNACAANAATACCCNCN 

Sequence 1298 

GAGCTCCACCGCGGTGGCGGNCGCCCGGGCAGGTACGCGGGGATAGTTGAAATGGAGGGC 
GGGGAAGTCAGGCAGTGGTTTCTGAAAGCCAAGAACTTAGTAGCACTGTGCCATTCTCTT 
GCCTGATCCAGTGCCATTCCCTTCACTTGATATCTGTTTACTTTAGAGGAGGCAGTTTTT 
GAGAAAGGATCATAAATATCCTGGCCCAGTGCCCCAGGAGCTATGACAAGCAAAGGAACA 
. TACTTGCCTTGGAGATAGCCTTTTGCGATATTTAAATGTCCG TGGATAC AGTAAATNTCT 
TGCAGGCMGTTTGCTCCAGGAGCATATTTGCAGGACCAAAGCCCTGTAACCGAATAGTT 
TAAMTTTCACCGGCCATTNTGGGGATTTCCCTAAATCCTTTTCCGAAAAATGGCCAAGG 
GTTGTG AAGTTGCCCTGTAATAAAAAATAATTTCCTATGGT1TTACCCTTTCAAACC I TN 
CTTTGTTTCTGGGGCTTATGGTTGGGTATTC 

AAAATTMTTGGGGGTAACCCTTNCGGGCTCGGCNTTCTTAGGAAANCTAAGNTGGGGAT 

CCCCCCCCCGGGGGCCNTGGCAAAGG 

Sequence 1299 

AGGTACAGTGTATCTTGGGTGATTTTTTCACCAAGTATAGGATATGTCTGAATAGCTACA 

GGCTTATMCCAGCCATMTTCCATATAATTGCACTATTGTCCMGAGTCACTATTTTGG 

AATTGGTATTANAAATAATTACAGTGACTCTGCCTTAATTCTCCACTGAGCCCCAAAGGG 

TCCGACGAGACACCTCNACCCAAGGACAATTAATCGGACTGTGCCANCTCCCTACCCTGC 

GCAGTTTCAGCAGCACTGGCGTAAAAATCACATGGACCCTCAAGGGAGCCCGGCGTACAC 

ACACAGCCCCGCGTACCTTGCCCCGGGCGGCCGCTCTANGAACTAGTGGGATTCCCCCGG 

GCTGCAGGGAATTTCCGATATCAAGCTTATCGGATACCCGTTCGACCTTNGAGG 

Sequence 1300 

GAGCTNCACCGCGGTGGCGGCCGCCCGGGCAGGTACTNGGGTTCGGGGCCGAACTGGAAA 

TGTGCTTAAGCTGAAAGTTAACTTTCTTCCTGAGATTATCACACTATCCAAAGAAGTCCG 

GAACCTCAAATGGCTTGGTTTCCGCGTCCCACTGGCGATTGTGAACAAAGCCCATCAAGC 

AAACCAGCTTTACCCGTTCGCCATCTCACTGATCGAGAGCGTTCGTTACCTCGGNGCGCT 

CTAGGAACTAGTG 

Sequence 1301 

NGGTACTGCCATTTCTTTC I I I I I I I AAAGAAGATATGTTTACAAAATAGGTGACCAACT 

AAGGTAGGTGTGTGTGAAGAGAGGCAGTGAGAGAGAGGGGATCCTGAAATTCTTTTCAGT 

AMGGAAAAAGATAATTTTAGGGATGTTTTTAAAM^ 

AGCATGAGTATTATTTAGGTGCTGTCCTANACCAGTGTGGCCGTATAGGCTATAGAAGAT 

TGGTCCTGAATGCTAACGGCAGGGATGAGTGTAAAGATCATCAAAACTTATTTGTGGGAA 

GGAGAGCTACTGNACAGGAGCCCGTGTATTACATGACTCCAGTCTGGGCTTGATAGGCCT 

GCCTTAAAAGCATAGCTAGAGGG 

Sequence 1302 

CGGGCAGGTACAGGGCAACATCATGGGAATGTGGAGCCTGGGGGNNCNCAGAGTAACCAN 

AGNAACAAGANNTTCAACCAGGTTCCCCTCCGGACTGAAGGACCCTGCCTTCCCCAGGAC 

CCTCCGGAAGCANNGGACACAGCAGATCCAGCCCATGGACCAAATCTGACCACCGCCTGT 

TTTTTGTGTGGCCTGAGAGCTAAANMCAGTTTTTACATTTGTTTACAGCT^ 

NAACATAACAANANGCAAAACCAGAGANGAATACGCAACAAGAGACCATTATGCGGGCTG 

GGCAAAAGNCTTAAAAATTACTATTGGTGGGNCCCTTT^ 

ANAATTNAAAACCCCCAGTAAGGGANAAAAACATTATGGGAMCCAMTATTCTTTTCAA 
TGGACCCCTTTGGGANTTTAAAGNCACAAGGGG I I I I I CCATTGGGAATATTGGACCNNC 
NCCAAAANAGGCCATTMGGCCAAAAAATTGCNNATTCAGANATNATTTACAAGNAAAAN 
GGGAAAAAAAGGGGAAAAAAAAAAAGTTACNCTTTNGGNCCNGGCTTTNTTANGAAACCT 
TAAGGTGGGAATTCCCCCCCCCGGGGGCTTGGCAAGNGAAAATTCNGATTATTCAAGGNC 
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TTTATTCCGNATTACCCGGNGNGGAACCCTTTGAAGGGGGG 
Sequence 1303 

AGGTACGCGGGGGAAAAACTGTAACCATGATGAAGGCAAAGGAAAGAGCAGAGAGCTCGG 

TAGACAGAGGAAGGAGCCTGTCCTCAGACAGGCCCAAAAGTGAGTGTGTGTGGGAGTTTG 

TGCTAATGTTCAGTAGGTGGAAGCTCAGGGATTTCCACTGTATTCTTAAGGTGAAGTCAG 

AGATTTGAGAGAGGATACCCACTTGAACCATACAGCTGCCATTTTGGGCTGCAACAACGA 

GGCCAGTGTANAAGGAGAATAAAGAAACTGGTGCATTTGGAGTCAGACAAAAAGTTCATT 

CATGGCTCTTTCAGTCTGGAAGATTCTGAAACTTTGGTAACATAGTAATTCCTAAATAAN 

AATAAAGGCMGTGGATTTTAATAAAGGGCAATACCAGTGATTTCAAAGAAAGAGAAATA 

CTGCAAGGTGCTTGCATCATNAAAAAGCAGTCCCATGAAAGAAAAAGATTCTGTCAGAGC 

ATTCCTTATNAAAAAACCAGTNCGTAAATTGCCCATTTTTCCTTGTAAAAACAAAANGGC 

CCCTGACTTAAAAGCCCCTTTAAAAACAGGATNGGATTTTCAACCANAAGCTNCGCCTTA 

AACCTCTTCCCTGTCC ATGAAAGT CNATGGGGGGGGGAAAGGGNTAATAN I 1 1 1 1 1 1 I I A 

AAAANCANTCCCTTGG I 1 1 1 1 1 1 I 1 1 I CCTTTTATTATTTNATTCCTCCCATTTGGGGAA 

TT 

Sequence 1304 

GCTCCCGNGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGGCCTACATGGGGGTGCCCGCT 

CATCTCATCTTAAGTGAGCAGTAATTATTCAACAAACTATTGTTTTGCCTGGTCATTTTC 

CCTTCGGTTTTGTCATTACAAAAATTCAAACATTCTAAACAGAGATGACTATACCTCAAC 

TTTGTAACCCCTCTGGAGTTAGAATACTAACTCCCTTTGACAATCAGAATGAAACTGTTC 

TCAGAACAGAACCTACTCAGAGGAGTTCTCAGATATGATAAACGAAGGGTGGAAGAAGGT 
ACCT 

Sequence 1305 

GACTAAAAAAAGAAAGAGCTTCCTCATCTAGTGGGAGCATCGCTCCATCATGCTCCAGAT 

CCCGTTCCTGCTCCCAGACACCTAGGGAGTTGCACTTGTTTTATGTAAGAGCTGCTTTCT 

AAGCAGCCTGGACGCTGAATGGCTTTCTCAAAAGTGAAAAATGGGCACTATTCTGATAGT 

CCATCTTTCCATCAGTTCAGCTAACTTTCACTGT 

Sequence 1306 

AGGTACACGTCATGAGGAGAAGACATAGCAGMGCTCAGGAAAACTATGnTGTTATGCT 

CCCCCATCCTAGTTACAGCAATACAGCAGGCACCCATGTTTCACAGGGTCACACAGGGAA 

GCACAAACCTTGGTCAGGAGGCCGAGGGGCAGAAACTCAGGGATAGCCTAGGCCAGGCCA 

TTATTGGAATTTCTGCAGGGAAGTTAAGGCAGAGCAGGGGAGCAGTTTAGGACTGGCTAG 

TTTGCATAATTTCAGTGGGCTCTAAGCTGAAAGAATGATCTCCAGTTACCTGGTACCTGC 
CCG 

Sequence 1307 

CCGGGCAGGTACGCGGGGGGCCGAGTCCTAGGCCAGGTCTGGGGTAACCTGGAACTTCCA 

CCTGGGCTCTGCGCTAGGTCTCTGTTTCACTCCCTCCCCGCGGGGCGCACAGCTCGCGGG 

TCTTTGGACACCACCGGTCCTGAGTCCGCGGACTGCCATTTTCATTAAGAACTGCCACTT 
AGAGGTACCT 

Sequence 1308 

CCGGGCAGGTACGCGGGGGGGTGGATATCCAAATTCAGAATAATATGTCCTATGAGAATG 

GCAGCTGTMTACATATGAATTACAGCTCTTTGTCATTAAAATAACCTGACAGCTTTCTC 

TTACAGCATGGTAACAGCAATACACTTAATATGAAGTTTGAGGATTTCAGGGCACCTAGA 

GmTCAAGATCWGAGAGAAMCCATAMGTAMTGGGACTTCTTGTGTGTTTTGTAA 

ACTGGCTCTGAAGGCAACTTCAAGAAACTTGAAAATACAGAAGCATGAAAATATTGTTGA 

AG AATAA TGCTGCGCTTGTATATCCAAGAAACTTTTA I I I I I I II I I GAGAAAGACTCTT 

GCTTTTTCCGCCCAGGCCAGAGTGCACTGGGGCAATCTTGGCTCACTGCAAGCTCCGCCT 
CCCGGGTTCACGCCATTTCTNCTGCCTCAGCCTCCCCAGTAGCTGGGAATACAGGCGCCC 
GCCACCACGCCCGGNTAATTTTTTTNGTATTTTTTAGTAGAGACCGGGGTTTCACCATGT 
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TAGCCAGGGATGGTCTCGGATCTCCTGATCTTGTGATCTGCCCGCCTTTGCCTTCCAAAA 
GTGCTGGGATTACANGGCATGAGCCCCCGTCCCCGGGNCCCANGGAAACAI I I I I J AAAA 
AAAA 

Sequence 1309 

GAGCTCTAAAATACTGCAATTGCCTCAACATTACCCTTTTATGGACAGTTCCAAATTAAC 
TTATTTTTCTATGGGAATACCACTTTGCTTAGATAT^ 

TCTACCTTCTGCATCCTGTTGAACACCATATACATCAGTTTGAGTATTACCGACTGCTGT 
AACAMCAAGACCCTCAATTTCAGGGAGTmC^ 

AAATGTGGATGTTGCCGGACAGCTGGTAGATATGCTCCACATAGTGATTCAGTGGCAATA 
TCACCTTCCTTCTTGTAGTTTTGACATTGCTTTGAGTCTCAGAACTTTCTGTGACAGTCA 
TTATTCACCCGCGTACCTGCCCG 
Sequence 1310 

AGGTACTTCTGTCCTACAGATCCAGGGTTTTGTCTTTTCTGCTCTAAAATATTTCTCCGC 

TN ANTTN NAN C AATTCGTTTTCTTCTC ATAAC AGTAGACCCTGTAGCCCTCTTATTTTG A 

TGCAGGACATTAGGACCTCTTTTCCATAAGGGAACTGATTCTTCTTCAGCAGTTCTTAGA 

GTCACAGACGGATAACTTTCTTCACAAACAGGACTTCTTTGGGTGCAATATAATGTCGCA 

TCTTGC CGGGC AACTGCTTG AGTGCC ATTTGAATGATTTA I i I I I GCGAAAGTTTGGC AC 

TCTGAAGAACAATATGATGGCAGTGATGGTATCACAGTGACTAGGTCAGAACAAGCATTT 

GAGAGTGGAGAACTACTTTCCTGATTCCACTGATTCAAACTGTGTTCTGTTGGAACCCCG 

CGTACCT 

Sequence 1311 

CGAGGTCATAAGTGGAGTGCTTGGANGAGAGAAAGAGGAGAAGAGTGAAAGCCTATTTGA 
GGTGCATTTTTGATCTGCTTACAACTGTGGGCAGCTGTGACTCANTCCCACTGATCAAAC 
TTTGAGAATTGGTGTATAGTACCTGCCCG 
Sequence 1312 

GCTCCACCGCGGTGGCGGCCGATGTNCAATTTATTTAGAGACACTTTTAACTAANANAAA 

ATAATGAGAAAGAAAATAGGCTCATACTTCTTGGCATATGTCAAMTTTGTTTTGACATT 

CCCTGAAATGATTGCATTCTTAAATTTTNCAGTAAACAACTCAGTTTTTGGAACACTGGT 

GAAGTTAAAAAAAAATCCCCTCTTAAAGGACTTCACAAATTTACTTNTTAAAACATAGAA 

agaci mrm f CCTTTTCCTCACATCTCTTNAGAATATTCCTGGAGNGCTAGAGCTAT 

AGAGCACAAGATGGGCCACCAGCCAGGAATCCTTACCACAGCTNGTGAGTGGGTTCANAA 

NACACCANAAGAGCTTGANATGCAAAGGAATTTAAA 

Sequence 1313 

CCGGGCAGGTACGCGGGAAGCGTGGTTTCTGCCCGTGAAGGCCTACAGCCTCATAAGGCT 

TCCACAGTTCAAAGAAGTCCCTTATGTATTGTGGATTAGGAGAAATACATACATGACAGA 

AAATGCTAACTCCAATAACATTTTCTCATTCTTTCATCCAAAAAACAATCCTCTATGATG 

TTGAAGGAGTTGTAATAGAAACAAAAGTATATTTAAGAATAACTAGGACCAATTCCAGGT 

AATATATGTATCTATTTCACTCTGAAAGTCACATGACTCCCGCACTCAAGTGTGGGACCA 

TGCACAGTAACTTNCTTTTAAAAGCTTACAGGCATTATGTTTAMGNAAAAAAAAAT^ 

TGGACCACTTTGTTAAAGAAACAGGGAAAGCCAGACTTTATATTAGGGACCCATTTGCAG 

TAAGGGTATAGGAAAACCACCAGCCAAATGGGGGGTTTTTGCAAATGGAAGGGTTGGAGC 

TCAACTCCTAATTACCAAGGAAGGAAAAAAAAAGTGGGGGGATTTATAGCTAAGGGATCC 

TATCCCCTTGGACTCCTTGGAGGGGTGGAGGGGA 

Sequence 1314 

ATNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCATAGCCATG 

AAGCAAAGATAAAATTCATCTATACACAGACTGAACTTTGTCTTCATTAACACTCTAGGC 

TAAGGGTCATAGCTAATCAGCTACAACTGTAATGTCCTGATAATTGTGAATTAACTGCAG 

GGCACCCAGCAAAAGGTTTAGTTATAATCTAATAGCTGTCTGTAGAGATTAGCCTAATAA 

AGGGATTTTTTAAAAAAGAATCTGGCCGGGCATGGTGGCTCAATCCTGTAATCCCAGCAC 
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TTTGGGAGGCCGAGGTGGGTGGATCACCTGAGATCGGGAGTCCAAGACCAGCCTGGCCAA 

CATGGTGAAACCCCATGTCTACTAAAAATACAAAAATTATCCAGGCGTTTTGGTGAGCAC 

CCACAATCCCAGCTACTTGTGAGGCTGAGGCAGGAGGATCACTTAAGCCTAAGAGGCAAG 
AGGTTGCAGTGAG 

Sequence 1315 

TATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCCGGNCAGGTNCNCGGGACNAGG 

TCAGGAAAGATNTGAAAATCAAATATTTTAATGATTGCTTTAAAAACATCTCTTT^ 

CACGGCTACAGGACTACATAGATAAAAATCAAGACAAAAAGTCTGATTCTTCAGGTTGAT 

AAATAATTACAAAGNTGAATTCACAACCACTGCATGT7TTTTTAATGAAANTG 

ATTGATCCAGAAAGAGTNGTATCCNTAGAATGGGAANGGGGACAGCTAGGTGAATCCANA 

TGTGATTGTA TCTTGA ATCCCCAAACCMTTGAGCCCTTTTTTATCCCAGCAGGAAGCAA 

ACTCATTTTCCTTrrTAATMGACTGNCCTCmGCATGCATAGNCTGTAATGACCTCAG 

CTGAGACAGTTACATTGCAAAGGGTTGCTAATTTTCTCCATGCCCT 

Sequence 1316 

TAGGGCGAA7TGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAAAATTGCTGCACTGAG 

AGTGGAAGAAAGACAGAAGCAACAGGGCTGTGGATAGGGGAAAGGGAATGTATTGGGAAG 

AGAGACGCACCATGGAGTCACGTCCCCACAAATACACAGCACATTTCCAGAAAGAAAGGC 

AGGCAGGCACTGGCCAACCAAAACGAGTTTGAGGAAAAATCTATAAAAGGTTCTGTGCCA 

CACAGGAAAGACTGCCTGACAAATTAGCTAATGAGGACAAGGCATCTGTCCTTGGAGGTC 

TCCTCCAGATGCAAATTGGCCCAAATTTGAGGCCAACTGACCTCTATGGAAAAAGCAAAA 

CCAAAGTCAGATTCCTGTCATCCGGAGAACAACTGTCACCTTGATCACACTCCAAAATCA 

AGATAAAAGCTTGGCTTTCCAAGGGTCACAATGGGCTTACAATAAATGAGCCCATGTAGG 

AAGAAATAAATATGTG AAGCC CTCTTCC ACTTG AC ATTT 

Sequence 1317 

CNNACTCACTATTAGGGCGAAATTGGGAGCTCCANCCCGNGGGTGGGCGGGCCCGCCGGG 

GGGTTGGCTTTCGGGNGCTTTGCCTAATGCGGCCCGTGGGCCGTGGGCTTCACGGTGGGC 

CCCTAAGTTTrCCGGGGTTCTTCCTCMGTTCTTGGGAATGGGGCAATGGTTTGGGCAAG 

CCCAAGAACCGAAAAAAAAAGCCCCCGCCGTTACCTTGCCCGGGGGCGGGCNCGGCTTCT 

TAAGAAAACCTTAGGTGGGGGATTCCCCCGGGGGCCTTGCCAAGGGGAAATTTCGGATTA 

TTCNAAAGCCTTTAATCCGGATTACCCCGGTTCCGNAACCCTTCCGAAGGGGGGGGGGGG 

GGCCCCCGGGGTTAACCCCCMGCCTTTTTTTTGGGTTTCCCCCTTTTTTAAGN 

AGGGGGGTTTAAAAATTTGG CCGGC CGCCTTTTGGGCCC^ 

TNCAAATTAAGGCCCTTGGGTTTTTTCCCCTTGGGTGGGTTGGGAAAAAMTTT^ 

TTAATTTCCCCGGCNTTCCAAACCAAAMNTTTTCCCCAACCAAACCNAAAA 

CCCGGAAAGGCCCCCGGGGGGGGAAAGGNCCAANTTAAAAAAAAGGGNTNGGTTAAAAAA 

AAAGGCCCCTTGGGGGGGGGGGGTTGGGCCCCCTTTAAAAATTGGGAAGGTTTGGGAANG 

GCCCTTAAAAACCCTTTCAAACCAAATTTTTAAAMTTTTTGGGC 

Sequence 1318 

CCCACTACCCACAGAAATCCAGCCTCACTTAGAATCCTGGAATTACATAAGTTTCTGGTT 

GGAAAGGGATTTACATAAATCCCATAGCTCTTCCCATTCTGGGTCTGTGATCTTGAATGA 

AGCTATTAAAAAAAAAAAAAAAAGCGGGCAGGTCCAACCAGGAGGTCCAAGCTCAGTCCC 

ACCCAGGCCTGGTGTTCCCCGCATGCACAATTGGCTTCCCAGGTTCTAGCAGTGCTCTGC 

CCAGCACAGCAAATGCACTTAGGGAAGGCTCCAGGCAGCCTTTAATCTATTTCACAATTC 

ACTTTCTGTGTAGGCTGTTCTGGATGGGACCAGGAAACAGCCCCAAGTCAGGAATAAGGA 

CACAACCCCCCTNAGATCCCGAGGGGCTGGCACAACTCTATGGCTTTATGGCTTTATGTC 

AATGAAAAATTTATTTCATTAGAAAACTGTCCAGNTCCTTCGGCCCGNTCTAGAACTAGT 
GGATCCCCCCGGG 

Sequence 1319 

TANGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGACTTT 
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CCCTGCCGGCACATGGACCTGGCCCAACCCTCACAGCCAGTANACGAGCTGGAGCTCTCG 
GTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCTCAATGGTGCCGACAAGGTCTTCCCT 
TCTTTGGACGAGGAGGTCCCCC 
Sequence 1320 

CCGGGCAGGTACCCATTCTCTCACACCCACTACCCACAGAAATCCAGCCTCACTTAAAAT 

CCTGGAATTACATAAGTTTCTGGTTGGAAAGGGATTTACATAAATCCCATAGCTCTTCCC 

ATTCTGGGTCTGTGATCTTGAATGAAGCTATTAAAAAAAAAAAAAAAAGCGGGCAGGTCC 

AACCAGGAGGTCCAAGCTCAGTCCCACCCAGGCCTGGTGTTCCCCGCATGCACAATTGGC 

TTCCCAGGTTCTAGCAAGTGCTCTGCCCANCACAGCAAATGCACTTAGGGAAGGCTCCAG 

GCAGCCTTTAATTCTATTTCACAATTCACTTTCTGTGTAGGCCTGTTCTGGGATGGGGAC 

CAGGGAAACAGCCCCAAGGTTCAGGGAAATAAGGGACACAACCCCCCTCANATCCCGAGG 

GGGCTGGCACAACTCTATGGGCTTCATGGCTTTTATGTCAATGAAAMTTTTATTTCATA 

GGAAAAACTNGTCCAGTTACCCTCNGGCCGCTTCTAAGAAACTANGTGGGAATCCCCCCG 

GGGCTTGCAGGGAAATTTTCGGATNTTCAAAGCCTTTATTCGGNTACCCGTCGGACCCCT 

TTNNANGGGGGGGGGGCCC 

Sequence 1321 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGTGCATTTTAAGAATACAT 

TATCCGCAACAGGCTGGTTCAGTTAACATTTTAAAAGTGCAATTCCAGGCTTACAGAATG 

TTAAAGTGGATGGAGTCTTATATCAGCTGGTCTCATGGTTTTCAGATTGCTCGGAGACTC 

CAMGCTGTTCCAAGGAGGTGACTCAATATTAATATTTTTACATGGCTGGGCATGGTGGC 

TCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGAGGGTGGATCACCTGAAGTCAGGA 

GTTCGAGACTAGCCTGGCCAACATGGTGAAACCCCGTCTCTCCTAAAAATACAAAAATTA 

GCTGGGTATGGTGGGGGGCCGCCTGTAATTTCAGCTACTTGGGAGGCTGAGGCAGGAGAA 

TCACTTAAACCCAGGAGGAGGAGGTTGCANTGAGCCAAGATTGCCCATTGCCTCCAGCCT 

GGGCAACAAGGAGGGAAACTCCGCCTAAAAAAAAAAAANAAAAAANAAAGGTCCTGCCCG 

GGCGGCCGCTCTAAAAACTTAGTGGATCCCCCCGGGCTGCAGGGAATTNCNANTTCAAAG 

CTTAATCGATACCGCCCACCTCNAAGGGGGGGGGC 

Sequence 1322 

AGGTACGTCTGTCCATTCC7TCCTTCAGTGTTGATGACATGTTTGTAGCTGCCAGGCCCT 

CCGGTAGTCATTGGAGGTAGGACTGCATTCTACCTGCAAGGAGTTTACAGTCTGGAAAAA 

GCGGTAGGTAACACACCGTGGACTCTTGACTGTGCCCTCAGCTCATCTAAGGCTCTGCAT 

GTTCACAGCCCCTGACCCCACCAAGGGAGGAGTTATCCCGCTTACAGATGAGGAGATTCG 

GGCATGGAGCGCTTAAGACACTTGCCTGAGGCCACACAGCTCCTAAGAGGTGGGACCACA 

GTTCACAACCAAGGCAGGCTGTCTCCAGGGCCCATGGCCCTTTACCTTACTTAATGTTGC 

TTTTGTGCCGGAGGGATCAGGCAGAGCCCAGAGAGGGGTGATGTGTTGAAGTAAGCAGGG 

TGCAGGACAGAAGAGGCGGCATTTCAGTGGGGTCTGCCTTTGCGGTTGAGCAGAGGGGA 

Sequence 1323 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGT 
Sequence 1 324 

TTTTTTNl rTTTTTTTI I NNGGNNGAANGGNTTTGNTNAGCTTTAAATTTTANTGCANGG 

CTTCANAGCTTTCACAGCCCAGGTCCTTTCCTATTTGTTAACTGTCTGGAGTGGNGTCTC 

TAACATTTTTATAAACTGCAGCANGGACGGACCTTNTGCTGAACCTCAACGACTTCCTTA 

ATGGCTTCCATCCTCCCACCATTNCCATGGAAGTCACTNTTTTCTACACCAACTCATCAG 

CTCTTCCCTGGAAATTCTGGAACACAAANAAAATAGTCTNTCTCTGGACACCTGGGTCAT 

AATATGAAAAACATGGGGNTGGCNGCCATGTTGNGCCACCTGGACCA 

Sequence 1325 

TCAATAAATGTTAGATG I I I I I GTTATTATTATTGTGACACAGGAGGGAATGATGAAGGG 
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AGAAGGGATACTCTTGAGTTGACCAAAGAATGTTACTTTAGGGAATAAGAGTTACAGTTT 
GGCTTTGCCATCTCCACCCAATTTTCAGGCAGCCCCAGAAATCTCTTTTTCTCCANATGC 
ATCTTGTGCATGTCTGTTTTATGTGTTCTCTCTGTACCTGCCCGGN 
Sequence 1326 

ATGCCTGTAATTCCAACACTTGGGAGGCTGAGGCGGGTGGATCACCTGAGGTCCNGAGTT 

CGAGACCNGCCTGACCAACATGGAGAAACCCTGTTCTCTACTAAAAATACAAAATTAGCT 

GGGCGTGGTGGTGCACACCTGTNATCCCAGCTACTNGGGAGGCTNGAGGCATGAGAATCG 

CCTGAACCCGGGAGACAGAGGTTTGTGGTGAGCCNAGATCGTGCCACTGCACTCCAGCAC 

TCCANCCTGGGCAATGACAAGCGAAACTCTGTCTCAAANAATTATATACATATANATTTT 

TTTTGGCCAGGCAGCAAGTGGCTCACCGCCTGGNAATCCCAGNACTTTTGGGGAGGCCCN 

AGGGCANGGCAAGATTCATTGGAAGGTCCANGGGANGATNGGGANGAACCCCTTCCTTGG 

GCTTAAACAACAGGGNGGAAAACCCCCCGGNTTTTCTTACCTTAAAAAAATTCCAAAAAA 
AAA 

Sequence 1327 

CACNCGCGGTGGCGGCCCGAGGTACCTGCTTGTCGATTCTCTGTAACAGGGCTGTCTGTN 
CAGCATGGAAATTATTGATNAAATAAAMTGGCTATTAAATTGGGACGTGATCCATAATC 
ACAAACAATGGTGGACATTCCTCTCTTAGATCCTCGGCCATGCATCAAGTCGTTTCTTTT 
TTTCCCCTGGCCGTAATTTATATTGCGCTGACACTTGGGCTCATTAGGTTTATCCTTTCT 

GCTCAANAGGCACCGAGCGNGTTTAGNGACCAGCANGGGAAGTTCGGGGGNAGGGAAAGA 

AGAGGGAAGAAGATGGATGGGCCCCC 

Sequence 1328 

CACCGCGGTGGCNGCCGCCCGGGCANGTACACAGTNGGCGCTGGGGATCAGGGCCAGAGA 

GTGGCCACCTGTTAGAGCTGCATGTGTGTCTTGGGCCTGGTCTTCTTTCTCTGCTCACTA 

ATGACAGCTGTCCCAGCCTCTTGCAGGCTCTGGCCCACCACCTTGCTTGCTGTGAT^ 

TTTTTTTCCTCACACATGATAAGCTGAGGCAGGAGAATCCGCTTGAACCCGGGAGGCGGA 

GGTTGCAGTGAGCCAAGATTACGCCACTGCATTCCAGCCTGGGCGACAGAGCAAGACTCC 

GTCTCANAAAAAAAAAAAAAAAAAAAANGNGACAAGCAGAGCAAGCGCATCTAGGTGACA 

CCCGGAGATGGAATCGGGACGGCAGGATGAGCATGGGTTTGTGGCGANAAAACCCGGGGC 

ANTAATGACATACAATGCANATTTGCCANAGTCAGCAAAAAGCCTATGTGGTAGAAATTG 
CCAAAGAACAGAAAA 

Sequence 1329 

NAATTTT1 l AMGGAATCTCMTTTGTGTTTTCTATTGTGTCATTTTAATATTATGTAGT 

CATCTAAGATATACTAAAGGTGACCACAATTTTAGAATATGCATAGCAGAGATTAATATG 

GAGTTGATTATCAACAGMGCTAMTGAATAAATTATTGACTAATTTACACTAGCCTGTC 

CCCGAATCCATTATTTTGGAATCATCTGTGATACTTATTTCTTCCATGAGAAGAGGAATT 

TAGAAACTCTTATTCCATTTATTTTGTAGTGGGTGAAAGTAACGCAAGCAATTTTCCATA 

AGCACAATAGCTTTTCTTTTAGAACACCGTAAATCTGAAATACTTCCTGTCTGTCTTAAC 

AACATTCCTGAATGATCATCTTAAGAGCTGGGTTTCCCATTGTGCTTACTAGTCTGTAAG 

CACAGCAGTTGCTTCAGTATGTATTGCTACTACTGCCTCGTTCCTATATGGNAGAACCTC 

ACCACCCCAAATAAGTCTGGGCAACCCTTNGTAANGAAAAAAAAAAATACCGTTNCTATA 

ATNCTTATAAACCAAAGCACTACATGTTTTCAGTTCAGAACCCAGATTTTTCTNGGAAAG 
AAAAA 

Sequence 1330 

TTGTTTTGCTTTAMTTTTAATGCMGGCTTCANAGCTTTCACAGCCCAGG 

ATTTGTTAACTGTTGGAGTGGTGTCTCTAACATTTTTATAAACTGCAGCAAGGACGGACC 

TTCTGCTGAACCTCAACGACTTCCTTAATGGCTTCCATCCTCCCACCATTACCATGGAAG 

TCACTNTTTTCTACACCAACTCATCAGCTCTTCCCTGGAAATTCTGGAACACAAAGAAAA 

TAGTCTCNCTCTGGACACCTGGGTCATAATATGAAAAACATGGGGCTGGCAGCCATGTTG 

TGCCACCTGGACCAAACAGCAGAGACTGCTGGTCCATGACAGAAAAGGAATAACTCAGAA 



Page 226 of 869 



WO 01/51628 



Table 2 



PCT/US01/00798 



ACACTGAGGGAGTGGATAGCTGGAAAAGAGGCCGAAGACAAAAGAGAGAAAGAAAGAGTG 
TCCTCCTGGCCCCATCCATGCAGCNTTCTAGCCCAATCCCTGGCTCCTAACCTGGATTCC 
GTANACAGCCTAAAATCCTTCCAACNGAATTCCCTTTTTT 
Sequence 1331 

AGGTACGGTAATACATACACCAAGAACAGAGACAATTACAAAAACAAGTAAGTGTTGATA 

TAGGCGGACCCACAAACAAGAATGTAAACATAATCAGATAGAAATTATCTGCTAAAATAC 

ATTTGTCTATGCCAATAGGCATTTTACTTAACAATAGGTTATCAAAGAGTCMTCTTTGA 

GAACATATTTCATATTTTCTCTTCATTATCTAAGAAATGTGAGCTATTGAAACACAATAT 

ATCTTAGGCAAC \ MUIIH I CTTTCCAGAAAATCTGGCTCTGAACTGAATACATGTAG 

TGCTTTGGTTTATAAAGATTATAGAACGTA I NIMH I CTTACAAGGGTTGCCCAGACT 

TATTTGGGATGGTGAGGTTCTACCATATAGGAACGAGGCAGTAGTAGCATACATACTGAA 

GCAACTGCTGTGCTTACAGACTAGTAAGCACAATGGGAAACCCAGCTCTTAAGATGATCA 

TTCAGGAATGTTGTTAAGACAGACAGGAAGTATTTCAGATTTACGGTGTTCTAAAAGAAA 

AGCTATTGGGCTTATGGAAAATTGCTTGCGTTACTTTCCCCACTCCAAATAAATGGGATA 

AGANGTTCTAAATTCCTCNTNTCATGGGGAGAAATAAGTTTCCCAGATGANTCCCNAATT 

AATN GG NTTTCN GGGGGC AAGGCCTN NTGGTN AAANTN N NC N N AN AATTTNTTN C NTTTN 

NNCNTNTTGGNGGGGNAAANNAACCCCCCNTATNAAANNCCNGNNNGNGGNNCANNTTTT 

AAAAAAGG 

Sequence 1332 

AATGTTTATACTCGGCATTTGTAATACMTAAAACATAAAATGTTTATACTCGGCATTTG 
TAGTACAATTTATATTTTAAAGTTTTCTGATATTTGATGTATCAGATGT^ 
AGATAAGGAAACAAGGGTCCGAAAGGTTTCTTAACTTCCAAGAGCTGAGCAGGCTGCTAA 
CCTGGTCTGCTCATCTCCTAGTGGGA I rTTTTTTTTI I CTCTTACATGGCATAC7TCTCT 
GCCATCATTTTTAAAACTCTGAGGGGGAAAAGGAAGCTNAATGTTAACATGCTGTGA 

Sequence 1333 

CCGGGCAGGTACCAATAAGATAGTTGTTTAAGACTATAGCTCTCTAAAGAGTAGTCTTTT 

AAATTTAGAGAATTTCTTTATCTCATAAGTGACTCAAGCCAAAAAGGCI I M I ICATGGA 

AAGTCCCTGAGGTAACTTTCCGGGTTTAGAATACGACAGGCAAAATTGGCATC l l N it I 

AAATGGGTGCGAAAGATGCAGCCTTCATGATCCTCCCCAAAATTTACTCCCAGAAATAAG 

CTAAGATAGCAGATCTCTGTTACCACAGGCGGTTAAGGATGGTGTAGCTGGTCTCCCTAT 

GAGCACATGGACACGTGGCCACATGGGGGTGCCTCTAGTCACAGACCTGCCAACCTGTGA 

CGCCTGTGACACAGGGCAGGCACTTCAGGGATTAGACTTTCCCAGCACTAACCAGGCAAC 

AAAGATGGAAAGCAACCAGAAGCCCTTCATGGATAGAACTCCTTTTAGGACAGACTCCAC 

TGACAGCTTGGCACATTTGGGG 

Sequence 1334 

CCGGGCAGGTACGCGGGGAGACTTTCCCTGCCGGCACATGGACCTGGCCCAACCCTCACA 
GCCAGTAGACGAGCTGGAGCTCTCGGTGCTCGAGCGGCAGCCAGAAGAGAACACGCCTCT 
CAATGGTGCCGACAAGGTCTTCCCTTCTTTGGACGAGGAGGTCCCCC 
Sequence 1335 

TTTGGCTCACTGCAACGTCCGCCTCCCATGTTCAAGCGATTCTCCTGCCTCAGCCTCTCG 

GGTAGCTGGGATTACAGGCATGAGCCACCATGCCCGGCTAACCTTGTATTTTCAGTANAN 

ATGGGGTTTCTCCATGTTAAGAATTGAGAGAGCCACTGAAAGGTGAGTCAGGAAGCATCA 

TGATCACAGCCGTGCCTTAAAAAGCCTTNTCCAGCAGCAGTATGGGAGATGGACTGAAAG 

AAAGAGTNGCAGGACCAANAAAGTGAGTCAGTGGTGCAAAAGCAAGGGAATNTGCAGCAG 

ANACAACCAGGACCTNTGCCATGCCAGAAGGAATGGGGATNAGAGGGGAGGTCTGGGATT 

TCAAAAAAGCAAGTTCATTAGATGANCTTTGATANGTTAACNGGGNTTAAAl i I I IGGNG 

GAGGGGANAAAGGNNNGNAATNCNATTGGGNTAACCTTAAAAGCTTTNTNGN I l I I lAGG 

GGCCTAAGGGGGNAGNGGTTTGNTGGTCNCCCNAGACCCGGAAAAAAGAAAAAA 

Sequence 1336 
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CCGGGCAGGTACAGATTTCTTTCATGTTAACATCTTATGTATCCACAATATATTTACCAA 

AACTAA GAAATT AACATTGATATAACATTACTAACTGAACTGAAGTCTTTATTTGGATTT 

CTCCAG TTT7TTCACG AATG CCCAATTCTGTTCCAGGAC C CC AGCCAGGATAC TGTG ATG 

TTCTCAGTCACCATGACTCCTAAGAGCCTTCTCTTCAGTGGTGCATTTCACAGGGTTCTA 

AAACACACTTTCAAACCAGCAGAAGGAAAGGGAAAAAAG7TAACAAAACATATGGTAAAT 

AGGATATATAAAATAAGATACAAATAAGTCCTAAAGTAACAGTATGAATAATAACTATAA 

ATAAAATGTCAGTAGCTAAATAAAAAGGCAGAAAATATC TAG GCAGAAAC 7CTTAGTCTT 

GATACTGAGAACTGTGAGGAATATACTTAAAAGGATAGAGAAAGACTGAAAAGAAACAGA 

CAGATAAATGATATAGCAGGTAAATATGAACCAAGAAAGAGTAGTGATATTAACAGCTAA 

AAAATATAATTGAAGGTAAAACCATCATTATTTATCAAAGAGAGATATAGCCATATGATA 

AAATGCACAAACACCA 

Sequence 1337 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCATTGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGTACCTGCCCG 
Sequence 1338 

CCGGGCAGGTACTCATCATTAAGAAATTAAAGATAATCCGTAAATAAAACAATGACAATA 

TAATAGTTTAGAGAGTTAAGATTTAGGAAGTAGTAGATAAAATAATTATCTGTGTTATGC 

AAATATAATCCTTAGAATGCAATATTTGATTAATAGGAGTTGTCCTATATGTCATTCATA 

A ACTT GTAG CAGAG AGAAACCTCTGCATAAGCTCAGGTGTTCAAGTATGCAAGAAACACA 

GTTTTCTGTATTTTGCTCTTGCT 

TGGCAGAAACTCAATCTCAGTCAGCCCTAACTGACTATCCTCCAGGACTCTGTCTACAAA 

ATGCCAGGTGGGGGGCCATCTGGCCCTCAGTGCCATCAGCCACCCCTGCTGGCACCCACT 

GAGACTTCCTGCTCTGTGGTCACTGGTCCTTGTANGTTGCCACTGCAGCCTCCTTGTCCT 

CATTCCAAGGACACATTAGGCCTTCTGGCTCTATGAAGTATTTTTCACTGGAGAAAATTT 

TGCTTCTGCTGTACCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCNGGGCTGCAGGAAA 

TTCGATATCAAAGCTTATCGATACCCGTCGACCTCGANGGGGGGGCC 

Sequence 1339 

AGGTACTATCCTAGACAAGCTCTGGGAGTGTTACATAACGTATGGTAAGTGTGCTACATG 

GCCTTTCTGMGCCCAAACCATTTTGCATTATGAAAATACCTGGCCCCAGGAATTTCAGA 

TAAGGGATTGTGGACTTATATTTTAAATCTGTAACTCCTATAGACAGTAATTTCATAGAG 

CCAGCTTCATGGAGTTGTAAATATTACTGGGCTAGGGTCAGTTTCCTCATTTTAGGCTTC 

CTTTAATCCAGTCCAGACACCAAATTAAATGTCTTTGTGTAGTCCCCTTCAAAGCCTCTT 

GCAATCCCAGTCATGCTTCAGGGTAGGGATGATCCCATTAATCCAAGGACCTGTCCCTAG 

TGATAGAAGGAAATG 

Sequence 1340 

CCGGGCAGGTACTTTAGGAGACCCAGGCGGGCAGATTGCCTGAGGTCAGGAGTTTGAGAC 

CAGCCTGGCTAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGGCAT 

GGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTGAGGCAAGAGAATCGCTTGAAC 

CCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCATGGGCGACA 

GGAGCAAGGACTCCATCTCAAAATAAAGAAAGAAAAGGAAAACAAAGAAAAGGAAAAAGC 

TTATATTG GAACC TTCTCTAAAAAAAGGAAAAAAAAGGAAAGCCTGGATGCCACACCAAA 

ATTCTTAAATTTTGGCAAGGTCCGATCAAATTTAAAGGGATATTTTATTTTGGCATC 

AAAATAATTTCTTTACTTCCCCCCAAAAAATTCMTAAAAAAAGGTTT 

Cm i I iC CTAAA TTGTGTTTTAAAAATGGTAATCACCAAAATTACCACCGTTGTTCCCCC 

AACTTTCTTTTTCCAAGTTTATTAAATTTCTTATTGTGTAAAAGTNGMGGG 

TGAAGGTGGAGGCCAATTAAGAAGMGTTTGAAGCCTTCAAGACCCTTGCCCTTGGGANA 

AAGAGCCGTNGTTTCCTTTTCTTTMNAGGTTCTTGGAGGGAAATTGGGGTTGGGGGGGG 

CCANCAAAAANTTNCNTCNTCCAAATTCCNTTTTTTTCCTTTTTTCCCCCTCATT^ 
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GGANCCNTCTTTAATTTTTC 
Sequence 1341 

CCGGGCAGGTAC rTTTTTTTTTTTTTTTTTTTl I AAAGACAGAGTCTTGCTCTGTCACCC 

AGGCTGGAGTGCAGTGGCACGATCTCGGCTCACTGCAAGCTCTGCCTCCCGGGTTCACGC 

CATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGCCCGCCACCATGCCCGG 

CTGATTTCTTTTTGTATTTTTAGTAGAGACGGAGTTTCACCCGTGTTAGCCAGGATGGTC 

TCGATCTCCTGGACCTCGTTGATCTGCCCGCCTTGGGCCCTCCAAAGTGCTGGGGATTAA 

CAGGGTGTTGAAGCTTACNCGCGCCCGGCCTATTNTCCTTTGTACCTTCGGCGCGCTCTA 

NAAACTAAGTGGGGATCCCCCGGGGCCTGCAAGGGAATTCCGATATCANAGCTTNTCGAT 

ACCGNTCGAACCTTCGAGGGGGGGGGCCCGGTTACCCCAGCTTTTTGGTCCCTTTAGTGG 

AGGGGTTTAATTGCCGCGCTTTGGGCGGTNAAATCNTTGGGTCMTAGGCTTGTTTTCCC 

TGTTGTNGAAAAATTTGGTTNTTCCCGGCTTCAAAAAATTTTCCCACAACCAACNATTNC 

CGAAGCCCGGGGGAAGCCATTAAAAGGTGGTTAAAAAGCCCTNGGGGGGGTGCCCCTAAA 

TTGGAAGGTNGMGCCTTAAACTTCACCANTTTAAATTTTGGCNGTTTGGGCGGCCTTNA 

ACTTGGGCCCCGCCTTTTTTC CAAN NTTCCGGGG AAAAAACCCTNTTNG N N GGCCCCG N C 

CTTGCNATTTTNAANTGGAAATTNCGGGCCCAAACCCCCCCCCC 

Sequence 1342 

CCGGGCAGGTNCCTTCTGTTTTACAGAMGCCTTTGGGAACTTGAGTCAAAGCATTACTA 

CTTGGGATTTTATCTAAGGAAGGTAAATGAAGATTTGAGGGGAGACAGTTAGTTCAATAT 

GAAAGAGGAATTGGCTTAAMCACAATTTTACTATGGAAAATTTCAGACATAAACAAGAG 

TAGAGAAACAGTATAGTGAATCCCCTTCTACACATNTCCCAGCTTTCACAGTTACCAACT 

CATGACCAATCTTATTTTCTCTTCCCTAGCAATGCTGGTCCACCTTCTCCCTCAGATTAT 

TTTGAAAGCAAATCTCAGGAGGTTGTATTAATTTCATGGTTTCAAATATTCTGNGTATAT 

TCTCTAATAGG A I I I I I I AAAAAAGNAAAAACCTAACCTCAGTGGCATTTTTTTACACCT 

TAAAAAATATGTTAACCAATTTCCTTAGTTATCCTTAAAAAAACCTTAGACCAGTNGTTC 

CAAAATTTTCNCCTGGATCCGACTCCATGGAA l MINIMI I CGGTNTTGGTTTGGTTT 

CAAANTTAAGGAATCCCAAAATTAAAGGGTCCCAAAAAAGNC 

Sequence 1343 

AGGTACTTNNTTC I I 1 I I M M M I M I M I It I GAGACAGAGTCTCCCTNTGTCAGCCA 

GGCTGGAGTGCAATGGCACGATCTCGGCTCACTGCAACCTCCGCCTCAACCTCCTGAGTA 

GCTGGGATTATAGGCGTGTGCCACCACACGTGGCTGATTTTTGTATTTTTAGTAGANACG 

GGAGTTTCACCACCGTTNGACAGGCTGGTCTCGAACTCCTGACCTCAGGTAATCCGCCTG 

CCTCGGTCCTCCCCAAGTGCTGGGGATTGCAAGGCCGTGAAGCCCTNTGCGCCCCGGCAT 

TGAAAGCCATNTTTAMGMNAGTTTTCCATTGNTTTAMGGTTTTCANM 

AAAATTTCCATMCATTCTCCTTTCTTC 

TTCCTTTTATGCCAAAACTGGGCAGAAAA 1 I I M I I CATTATTCCTGGTATTATTCAAGT 

CCAAGTGGCCCA I INI I AANCCTTGTTGCCCCTTGGGNNGCCGGGGTCCA7TTATTTGG 

NACCCTGGCCCCCGGGGCCGGGGCCCGCTTCTTAANAAAACTAAGGNTGGGGAATNCCCC 

CCCGGGGGCCTGGCAAGGGAAATTrNCGMTTATTCAAMGCCTTTATTTCGGATTACCC 

CGGTCCGGAACCCTNNGAANGGGGGGGGGGC 

Sequence 1344 

CCGGGCAGGTACeCCAGGTGGGGGAATGAAGACAAGAAGAAGGCAGATTTCGGCATGGGG 

TCCTGGGAGAGCCACCGCCTCCCAGGAATGACACACACTGAGCGGGCTTAGAGGAGGGAG 

GACTCAGAAACCAAAGCAGTTGGCCAGGCGCAGTGACTCACATCTGTAATCCCATCACCT 

TGGGATTACAGGCATGAGCCACTGCATCCAGATCAAAGTTGGGTTTTGAAATATAGGCAT 

CAGTTCTCTTTTTTAACAACTAAATGGACATCTAAGATATTCCCTGACACCCTCATTCAT 

ACAAATCTGCTCTTACCTTGCTCCTGCTTCTGCATAGCTAGTAGTCTGGAATGCATGACA 

ACTCTACTGCCCCTGCCTCTATCCAGGCAGAGACAAAGATAACCTGCCCAGGAACAAATC 

TTAGCCCGAGGCTGATAAAGAATAGCTCTACCTAAGTATGCACCTGTTGGAAAAGTTGGC 
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GGG °™ GGAG ^ 

CCTCGTAGGCAT7TTCTATCTCCCATTGGCAANGGAGACAAACAGTTGGCTCAAATTACC 

TGNCAATGNTTGTWGATAATGAGANMTGATCCTTTACCAATTACCACCAAMGM 

CAGGCCCTGNTTTTTTTTNGGNTTTT 

Sequence 1345 

CCGGGCAGGTAC I 1 1 I 1 1 1 1 I II I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 GAGTAGAGACGGGGTTTTT 
^ C ^ GGTCAGGCTGGTCTC ^ CTCCC<WCCTCAGGT <5ATCTTCCCGCCTCAGCCTC 

JI2£^ TGTGCTATGATCATTGACAAA TAAATGGTTAAGTGAACNCGGAGGATGGAGA 
™ A J^ CMTG ^ GGGGGAGATCTGTAAGG 

GGGCG J7^ 

Sequence 1346 

SSSS^IS ftCATGACATACATGTGAAAAA TMCTTnGGTTTATMTAAMTGCTTGm 

ATCCTAmGG ^ ATAATATAAA ^ C ^ AAATG TMCMTTATACTTCC 
AG ^^ G ™ mGM ^ AC ^ GG ^ A ^ATCTrrTTTTATGAGAATTTGTTCCCTT 
CTCATTTTCTAACCACATTTAAATTGATACTGTTTGAGTATAAACTTAGTATTTCCCTTC 
ATGCTATAAATAAGTTACTATC I I I I I I 1 1 1 1 I A ACTTTTCCATTTGTTCTTTTTGTTGT 

?^x GGG ^ GCTCTGTCA ^ CAGGCTGGAGTG ^ 

ACTTCCTAGGCTCCAGCCGATC CTCCCTTACTTCCTGAGTAGCTGGGACCATA 

^ CACCACCACGCCTGGCTGA "nTTTGGATTTTTTGTAGGCATGGGGTTTCACCAT 

ATGGCCCAGGCTGGGCTTGAACTCTTGAGCTCAAGCAACCCACCTACTTTTAG 
Sequence 1347 

AGGTACTTTTTGATCMCACCTTCCCTTTCTCCATNCNCATTNNNCCCCAGCCTCTGACA 

CATTTTCACTCATATTAGGTGACAmAMTTATAGAAACTAATGGTrTTAGTATACAAC 

^?^ T ^ CATCAnAACG ^ AAATACAm ^ AGAAAGT ^ G ACTAATTAGGGAT 

AGAAATAAATGGGCAGCATTTTTGGCCAGAAAACAAAAGATAATGATTATTTTCAATTAC 

CAGACCCAAGNGATTTGAAATGGAAAGGTTTAGGCCTAAAATACGTTATTGAATTTATGA 

£ G ? T CA ^ 

CTAATATGTMCCCAANCAMTAATATTCTAGCTACCATTCAMTTTTATTAAAGGTNTT 

TAAAAAA7TAACTTCCTGGGTTAAATGGGAAGATTCCAAACTGAGGATTGCATTTAATGG 

GATTGAGGGGGAAATGTTAATGAGGAAGGTATATTGATGAAAAGCCCAAAACCTTAAATT 

^ AAA ^ AAGTAAmGCCCAATA ™ G ^ A CCTTCTATTAGGAAAAAACCAGTNGAAA 

GTTNTTATCACTTTCACNAAATTTTGGTGCACCAATGNTAAGNATGGAGTTNTTCTTAAT 
TATTTTAAAAA 

Sequence 1348 
CCGGGGAGG J ACT ^ 

AATAAACATATAAATATTAATATTGTGAAAGTGGTATGMGAAAAATAAAGCATGATAAG 

GAGAGAGAGGTGTTCCCTGGCTCTCCAGAGGGCTGAGGTGGAAGAAMGGACTCCTGTTT 

I* GG J AAGATATCAGAGAAGACCTCT CTGATTGACTGAAATGTATGCAAAGGCCTGCATG 

™ C ^ GGGGACCATGCCGTGCTGACATCTGGGGGGAGGA ^ AT CCTCACCAGMGGGG 

GAAAAAGGGAAAGTGCCAAAATGCCCGTTGAGGTTGGGGAGGGACCCATGTTTTGGTTAT 

G I£^ G ^ 

TMGAGAATTTTATGGGGCCCANTTGGTGAGGGATTGGTGGCNAAGCATTTGAAGAAGAC 
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TGGTTTG AATGCCTTTAAAAACCAAACCAGCCCA J J I I I AATTTTAAAATATCTTCCATG 
GAAI I I I I GTGGGT 
Sequence 1349 

"GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGTGTGGTGGCTTATGGAGTCCCTAACGT 

TTGCACAAAATGGTTGCTACATTCCACACTCCCATGTGCCCTGAGCAACCTCCAGGGAAA 

CTTCAGATACCGTGGGGCTTGGCCTGCCAGCTGAGCTGACCCAGCGACCCTTTCTATCAC 

AGCTTTGATCTTCACTGGGTTGGGAGCCTGTCTGAATCTAAACATCTGCCTCTCTACTGC 

CCACCCCTTCGGCATTTCCTAGCTCTCCCTGAAGGGTGCAAACATGCATGCAGTTACTCA 

AACAAGAATATTTATCCTTAACTCCCTGTCCCTCACAATTTCTTTTAAAACAATCCA 

TTGTCTACCCCCGCCCCCGCATCACCCCTTCATCCAGGTGCCCATTCCCTTCTCTCTAGT 

CCTGCAATACCTAAGCATCGATCTGCCTTTAGCACATCCATCCCCGCCTTATTCTCCACG 

CAGGAGCTGAGTGACCTGCACAAAAAATCCCCCTGCTAAATACGGGNACCTCGGCCGCTC 

TTAGAACTAGTGGGATCCCCCGGGGCTGCAGGAATTCGGATATTCAAAGCTTATCGATAC 

CCGGTCGGACCTCGAGGGGGGGGGC 

Sequence 1350 

CCGGGCAGGTACAATGGCTTGATCTTGGCTCACTGCNNNNTGNGCCTCGCANGAANNAAN 

CNATTCTNCTGCCTCAGCCTCTGGAGTGGCTGGGATTACAGGCGCCCACCATTCACGCCC 

GGCCAAI MM IGTAI I I I IAGTANAGACAGGGTTTCACCATGTTGGCCAGGCTGGCCTC 

GAACTCTTGACCTCAGGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTACCGTC 

GTAAGCCACCGTGACCGGCCACAAAAGAGTCTTTTATGGGAGNAAAATAGAGGAATGGGA 

ATTAAAAGCCCNGATAAMTTCAAATTTACTGGCTCATTAAAAATGGGCCCATNGCTTCC 

TTTTCTTTMCAATGGTTCCCCCCTTTACCTGGCAAGAAGGGCCTATTATTATTCTT^ 

AGAAATTGGAGGNAGGAAAAAAGGCCAAATTCACCACCTNGTn"GGCTTTTATTGAAGCC 

AAATTCTTACCAATTGAMNNTACCAAANAAMTTCCCCCCTTTTCTTGAAAAAM 

TTN AACAAATTTTN NGTTTATTTTN AAGGANTGTTGTTAATTGTTAAAAA I ! M1CCCCC 

ACCCTTGGTGGTTTACC I M M M I I ATTACCAAAAGNCCC 

Sequence 1351 

AATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTNCTTTAGGAGACCCANGCGGGC 

AGATTGCCTGAGGTCAGGAGTNAGANACCAGCCTGGCTAACATGGTGAAACCCTGTCTCT 

ACTAAAAATACAAAAATTAGCCGGGCATGGTGGCTCACGCCTGTAGTCCCAACTGCTTGG 

GAGGTTGAGGCAAGAGAATCGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATC 

GCGCCACTGCACTCCAGCATGGGCGACAGGAGCAAGACTCCNTCTCAAAATAAAGAAAGA 

AAGNAAACAAAGAAAAAGGAAAAGCTTATATTTGAACCTTCTC 

AAGGAAAGCCTGGATGCACACAAAATCTANMTTTTGGCAAGTCGATCAATTANAAGGGA 
TATTTATTTGGCATCACAAAAMTAATTTCTTT^ 

GTTCAAATAGCCAACTTTTCCTTAATGTGTTTTAAAATGTAACCCACCCAAATACCAT^ 

TGTCCCCCCAACTTCTTTCCAAGTTATAATTCTTATTGGTGTAAAGGTGAGGTATTACCT 

GAAGTTGAGGCCAATTACGAGAAGTTTGMGCCTTTCAGACCTTGCCTGGAANAGGAGCG 

TGTTCTTTTCTTTANGAGTNTGGAGGGAAAATGGGTTTNGGGGGGCACCAAAATTCTNNT 

CNAATCI M I'M ICTT 

Sequence 1352 

TTGNACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCGGNGAGCCNAGATTGC 

GCCATTGCACTNCAGCTGGGCAACAAGAGCAAAACTCTTCTCANAAAAAAAAAAAAAAAA 

AAAANCNGAANGAGAAAAAGTGTTNTAGTTTGAGCAATAAATTGTATGACCACTCTACTA 

AAATGAGTAGCTTTTGATCGGTGCCTGAGTTTATTTCTTCCCAACCATGTAGATTCCATT 

CTCAGACCCGGTGAGCCGAGATTGCGCCATTGCACTCCAGCTGGGGCAACAAGAAGCAAA 

ACTCTTAANGTGANANGAAAAAGAAAAAAAAAAAANAGGGTTCCCTNGGGCCNNTNTAAA 

AAACTAGGGGGGATCCCCCGGGGCNTGGCANGGGAATTCGATATCAAAGGCTTTATTCGN 

ATACCGNCCGACCCTCNAAGGGGGGGGGG 
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Sequence 1353 

TTGGATCTCCACCGCGGTGGCGGCCCGAGGTACTTTAGGAGACCCAGGCGGGCAGATTGC 

CTGAGGTCAGGAGTTTGAGACCAGCCTGGCTAACATGG TG AAACCCTGTCTCTACTAAAA 

ATACAAAAATTAGCCGGGCATGGTGGCTCACGCCTGTAGTCCCAACTGCTTGGGAGGTTG 

AGGCAAGAGAATCGCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCGCGCCAC 

TGCACTCCAGCATGGGCCGACAGAGCAAGGACTTCCATCTCAAAATAAAGAAAGAAAGGA 

AACAAAGGAAAAGAAAAGCCTTATATTTGGAACCTTTCTTCTAAAAAAAGGAAAAAAAAA 

AGGAAAGCCTGGATGCACACCAAMTCTAAAATTTTGGCCAAGTCCGATCAATTAAAAGG 

GATATTTTATTTTGGCATCMCAAAAATAAATTTCTTTTT^ 

TCAAATTAAAAAAGGTTTCAAAAATTAGGCCAMC 

TAAAA AAATGNTAAANTCCACCCCAAAATTANCAATTGTTGGTCCCCCCAAAACNTTTTC 
TTTTTCCCAGGTTNATTAAT7TCCTMTTTGGGGGGGAAAANGGGGGANGGGGTA7TTNC 
CCTGGAAAGGTlslGGNGGGGCAAATTAMGGMGAAGGTTTGGGNGCTTTTAANAACCCTT 
GCCTTNGGAAAAANGAAACCNGTGGGTTTN I III I Ml I AANANGGTCNTGNAGGAAAAN 

ANGGGTTTNGGGGGGCCACCAAAAAANTTNTTNCNAAAANN I nTTTTTTTTT 
Sequence 1354 

TGGCGGCCGAGGTACGCGGGGGCACATAGCATAGACCCATATAAGAGGAAGGATCTGAGA 

AGGAGTNTGAAAAAAGTGTTGGAGAGGCTGCGAGAAAACAAGGCAGGTGTCCTGGCTGAG 

GGCAGGMGGAGACATTGCGATGGAAATTGTGCCCCATTTTCTTGGAGGATTGGGAAGAG 

GACCTGTGGACTGGGCGGCCCACACAACTGCTGTAACCTCAGGCAGTTTTCAAGCAGATT 

GACCTTCCTGCTGGAACACGGCCACATGAGGTGAAAGTTCTCCTCTACCCAGGGTTTTGC 

TTGGCCTTGCCTCCACCATGCCCTTGACACTGGGCGGCTCTGCTTGGGGAGGGAAAAAAG 

CCTTGCTCTTGTNTGCCTGGCTGGTGTTGGAAACAGAAGAGGGGAAGTGGCAGNACAGGT 

CTGGGCCCANGCACNTTGGGGTATTCGNNGCTTCTAGCCCATTGCCGACTCATTNTTTCC 

AGTTCAAN ACCC TGCCCCCNTGGGAAAAAAAAAAGAAGCCCCAAGAACCTTCATCAATGG 

AAATTGCCTTTTCCCCTAAGGGAGNAAAAAATTGGAAACCAGCCTTGGGACCAAATTGCC 

TNCCCTNGGGTCCAANGTTANGGCCCCTGG 

Sequence 1355 

TCCACCGCGGTGGCGGCCGAGGTACCTGTGGTGGGGAAGGCACTGAGCTGGATCCTGGGG 

CAATGAAGATGAACAGGACATTGTGTGCCACAGTCCAGGCAGGAACCAGGGTGGTGGAGG 

GCACACCCCGATTGGGTCGTAGGAGAAGAGTTTAATAAAGGACATATTGATTCAGTGTGG 

GCAGAGTATAGGGAACCCTAAGGAGATAGTGCAGACTCCAGGGCCACGAACAACAGAACA 

AGGTGGGCAAGGCTGAGGGAAGGGAGCTGCTTCCAGAACCTGAAGCTATGTGGGGGACTT 

CCTGACAGAGAGGCTGTCGCCTTTAGTCAGGAGGCCACCAGCTCACAGGAACCCTGTAGG 

GAAGCTGCCAGAGAGACCAAGAGCTCAGCTCTCATCTCCTGCTGATGGTGAGCCGCATTG 

GCCAAATGCCCCCAGAAGCCAGAGGGCACAGGACACCAGGGTGCCATCCACATGGGAGAG 

CAGTGGGGAAAGGGTGGANGGTGGATATGGAGGGCANGACAGACAAAATCACTGCCTGCT 
GGGGA 

Sequence 1356 

ACTAATATG TGTTCCTCTGAGGGCTGCTGAGTATAGAACTGAACTGGCAGAAGGGCTATT 

TTTTTTTI I CAGTCTGATATGCCGAAAGCAGGAATCCTCTAATCTGGTAACTTTTGTTGG 

ATTGGAACACTANAATCTGAAT7TATACAGTGGCCACATGAAGGGAAAGGACATGAAAAT 

GTGAAAAACTCTGGTAGMGAMTGACTCATTTCTTCATTGCTGATCCAGGCATTGAATT 

TGACTATCCACTGGAGGCCAAGTGCTGCCTGGGTTTGGTTTTAAAGATNCAGAGGGTTAG 

GATAGGTGCTTTTGGCAATGGTGATGGGAATCCAATATTGAAAAGAAAAGGAGGTCAGNN 

GAAACTGCGAATGGCTCATTAAAATCAGTTATTGGTCCTTTGGG 
Sequence 1357 

TCCACCGCGGTGGCGGCCGAGGTACTGGGATTGGTTGATACCATTTACACAGGCTTCTTT 
TCTCAAAGCTGCAAACTTCCATAACACAGGATGCTTCATCTGATTTGTCAGGGCAGTCAT 
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ATTTAAMTCACATTTCTGCATTTTTTCAATGCAGTGACCATCAGACCTGCAGATAAATT 

CATTGTCTGTGCAGTTGTGTGGAGGTAATGTTACAGGAACTGACGTTTCTGGTGGAGTTG 

GGAGGTCTGCTGGCAAATTACCAGGGTAGAGGGTGCAGTCCATAAATGACAGATCATCAA 

TCGCAATATCTNCAGTGAAGCCATCTCCCACTGAAGCCTCCACCAATATCTGAAAGGGCT 

GCCTGCTTGGAAATGTTGAGGTGTTTCCTAATCCATCTGTTGCCTTGATTCCCANNTNTN 

TGCCAAAGCAGNTGTCCCCTTTGNGTTCACTCCCCCCGGGTACCTGCCCG 

Sequence 1358 

TCCACCGCGGTGGCGGCCGAGGTACGGGTTTTCATGCCGAATTAGCCATCAAGACATGGT 
TATCCTGGCTGTTTTATGGGAATGCCCTACCTCTCTCCTTGCCCTGCTTATCAAATTAGC 
TCAGGGTGGCTAGGTTTTACAACCATCTGCAGTAGGCATGGTTGGTAGCTACAGGTGGAC 
TTTGCTTAGAAATCGGAAGCTACGACGATTTCACCCTCACAGTCI i I I I j GTTGGT7TGC 
TTGATGATTMGACATAGTTTGGAAATTCCCTGTCTTCCCCTTCTCTGCCTCCTAGATGA 
CTAATTTGAGCAGTGTTTATTGCAGCTGTATTGTGTTGGGTGGCTATGTTCTGCTATTGA 
AAAAAGTCCACCCAGCCTGAAATGCTGTCTCATGTAGTTGTGGGTACCTGCCCG 
Sequence 1359 

CCACCANCCTGGAGTGCAGTGGCCGGTCTTGGCTCACCATTTGCAGTGATTTTTTTTTAA 
AATAGCTTAATTTTTCTGGCAMTTAATTACTTTGGAAGTGTMTCCTTTT 
CCAGTTTTGTAAGCAGCTACAGAGTACCT 
Sequence 1360 

CCGGGCAGGTACTGGATTTCCTTCGATTGGCCCCCAGATCCATTCTTTCCCCTCCTCTGC 

TC AGCTCTGTGCTGACATGTGTGAATCACCG GTCC I I I I I AAATAGAAGACTTTGAGTTT 

GTTTTGTCAATGGGAGAATGGAATCCTTAGATCAGCACACAGAAGGCGAGAGAACTCTGG 

ATATTTCCTCCTCTCCTCCTTTATGTTTCAGTGACACTTTTCTGGCAGTGTCTGTATCCC 

TTCC CACTGGGTG ACCC CTCCTCTAGGACTCTGGCTATC ACTGGCAACCCTAAAATAATA 

TTTCCTCCCCATCTTCTTAAGCTCAAGGACTGGTAGCAAGTTCCTGTCTATTACTTGCCT 

ATCATCATTATCCCTTGTTTCmCACMGTGCCTGCACCTTTGTAACTGGTTACTTTTA 

TTAAAAGTCTCTTCACTTNAAATCAAACTNGGGAAGAATTCTGGTCCTACTCANGACACT 

TAACTTCAAACTGGTAGTATATACCATCACAGTNAGGNACCAGGTTCATAAAAANGGG 

Sequence 1361 

AGGTACTTTGAATGTGAGTTTTAAAGATATATATCCTGAAAATAAGGACAACAAGTGCCA 
GGGGAGTTAGGAATGCCCTGTCTTCCCAGGCCTNAGCACCGCTCTAGAACTAGC 
Sequence 1362 

AGGTACATGCCTGTAATCCTCAGCCTCTCGTGTAGCTGGAATTACAGNCGGCCGCCACCA 

TGCTTAGCTTATTTTTATATTT1TAATAGAGACGGGGTTTCACCATGTGGGCCAGGC 

TCTTGAACTCCTGACCTTAGATGATCTGTCTGCCTCGGCCTCCCAAAGTGCTGGGATTAC 

AGGCTTGMCCACCAAAGCCCGGCCTGACATTTTTCTTTTAAATCTTTTATCAACCAGCT 

TTCATTTCTTGTCAGATGTCACTTTTCAGCAACCTGTTGCTTATATTCCTCTGTTAGGGA 

AGAACC rTTTTTTTTn CTGGCCCANGACTGGAGGAGAAAAGGGAAATGAAAAGGGATGG 

GAAGGAAATTACAAGAATCAGGCCACTGCTTGTCTGGTCTGTTCCACGGGATGTAACCCA 

CCAGCACACCGCCGTGGCTTCACGGGTACCTGCCCCGGGGCCGGCCCGGAGGGTACCTTG 

GAGAGTGAAGGAGGCCACCTCCCCTGGCATTGGGTCTGGTTCTGGGCTGGAGNTAATTTC 

AGCCCAGAATCTTNGGTCCGTCCGGGG I I I I I I I I AACTTTCCGGG I I I I I I GGGAGGCC 

AAACACTTC I I I M I I IGG 

Sequence 1363 

CATCACCACAGTCAAGATTCTGTCGCCTTCAACAATCCTGTCCTTTGAAGTCAGCCCACT 
GCTCACATCACTGAGCTCTTCCCCATTCCCGTAGTTCAGCCATTTCTATGGTGTCATTAG 
TGGAATCATGCAGATTGTAGCCTTTGAGTCTGGTTTCTTTCACCTAATGCCGTTGGAGAT 
CCATCCAGGACCTTGTATGTGTCAATAAGTTCCTTCCTTTTTATTGCTGAGGTACCTNGG 
C 
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Sequence 1364 

GGGGGACCTCCTCGTCCAAAGAAGGGAAGACCTTGTCGGCACCA7TGAGAGGCGTGTTCT 
CTTCTGGCTGCCGCTCGAGCACCGAGAGCTCCAGCTCGTCTACTGGCTGTGAGGGTTGGG 
CCAGGTCCATGTGCCGGCAGGGAAAGTCTCCCCGCGT 
Sequence 1365 

TCCACCGCGGTGGCGGCCGAGGTACCCCATGGCCAAATCTCTCTGCTTATTGACAATGCC 

AGAGTGTGCTCCTCGCCACACGCCAACTGTAAAATCCTGACTGCTAACAAAGGGCTGGCC 

TCAGAATCAGCAATGCTGACAGGATTTGGTTCCGGCACATACTGCTGGTTGGCTACTGCA 

CACTGGCCAGCAGAATTCTCTCCCCACATGTACCTGCCCG 

Sequence 1366 

ATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTACAGATNTTTACCCATTCC 

TGTAAAATTCTTGCAGTCAGAGTTTCTACAGACCTGMTATTAGGAAMTGCAGGTTTAG 

TCCTAAAAAGGATACCGTGATGCAGCCATTGATGTAGCAGTGACTGGCTCCCCTAGGCTC 

AAACAATGCATTGACCTTTAGAGGGCAGCCAAGTGGAGCTCACGCAGGTTAATTCCAGTG 

CATGGCCTTGTTAGATCACATCATATCCCCTGCCTAGTGCCAGACACTTAATGCAGGAAT 

AGTATTGGTTTCAGGGAGCCCCTTCTCTTGTCCAAAATTTGGTACCTCGGCCCGCTCTAG 

AAACTAGTGGGATCCCCCGGGCCTGGCAAGGAATTTCGATATCAAAGCCTTATCCGAATA 

CCCGTCCGACCCTCGGAAGGGGGGGGCCCCCGGTTACCCAAGCTTTTTGGNTCCCTTTAA 

GTGGAGGGGTTTAAATTTGGCCGCCGCTTTGGGGCGGTAAATTCAATTGGTTCCATAAGC 

CTGGTTTCCCTTGTGGTTGGAAAATTTGGTTNATCCCCGCrrCACCAAAATTTNCCACAA 

CCAAACAATTACCGMGCCCCCGGGGAAGCCATTAAAAAAGTNGTTTAAAAGCCCTGGGG 

GGTGCCCCTNMTGGAAGTGGAAGCCTTMCTTCACCATTTTAAATTTGGCCGTTGGCCG 

CCTTCMCTT GGGC CCCGCNTTTTTCCCAAGTNCCGGGGAAAACCCTTGTNCCGGGGCCC 

MGCTTGCCATnTAAATGGAAATCGGGGCCCAACCCCCCCCGGGGGGAANAAAGG 
Sequence 1367 

GGNAGCTCCACCCGCGGTGGCNGGCTCGAGGTACGCGGGGCTCTTCAGTTGTCTGCTACT 

CAGAGGAAGGGGCCGGTTGGTGCCGGCCTCCATTGTTCGTGTTTTAAGGCCGCCATGAGG 

GGGTGGACAAGAGGCCCGTGGTCCGTGGTGGGGCCGCTTTTGGTTCCAAGAAGGGAGGGC 

CCA AGGNA AGGGAAGGGGGTTCAAGGCCCTTTGGGGACCACATNATCCCATTTTGGACTT 

TCTATTTTGGTGGTTGAAAAATGGGCCTTTTCCCCGGGGTNCAAANGCCAAGCCACCTGG 

ATGGAAAACTTTCCTTTCAAGTTGGAAGGGGCCCTTTGGCCTTGAAAAGNAAGGGAAANT 

CA AGGGA ACCCTTGGGCTTCCCCAAATTTNCTNGCNTGGAAACCAGGGGCATTCTAATTC 

CCm I I ICTCNTGGGGTNGGACCAAAAAAAAATTAAAACCAAAATGGGTGGAATTTGGAA 

TAAAATCTTGGAATTTGGTNGGGCTTCCCMGGGGGACCATTTTGGGAAAAGTNGCCCAA 

AAAATTGGAAAGNAAAGNTTTCCGGACCAGGGTGGGGGGAATTCCCTTATTAAAAAAAAA 

GGGGGGGAACCAATTTGGACCTTAACCAGGGGGACCACCAAAATGGTNGGGGCTTGGACC 

CC7TGGGTNGGGGTGGGAATAACCTTCAAAAGGMTTTCNTGGCCCAAACCCGTTTGGGA 

AAAAGCCTTGTTTGGGCNTGGCCCCCTGGGGGG 

Sequence 1368 

GAAGCTCCACCCGCGGGTGGGCCGGGCCCGANGGTTACACCGAAANGGTGCTTGCTTCAA 
NGGAGGGACNGGCCAAGGGGCCGAAAGAATCCANGGAAGTCATTTCTTGGGTTTGGTCAC 
CCTTCTTCTTACATTCTTCCC AAGA CCCTTCGGGTGGTCCCAANCCAANAAACTAAGGGA 
TTGTTGGGGCCT TGGCT TTCTTTTGGGGGGGGTTGGGGGGGCCACCAACAACCCACATTT 

CCMGAATTTCCCTTTTAAGTTGGTGGAAGAACTTGCCACCCCGGTGGGGAAGCCCCAAA 
AAAAANGAAAACCCCC II I I I I 
Sequence 1369 

GGGCCGGGCCCGAAGGGTACCGCCGGGGTTGGGCCCGAAGGGAANAAATCNCCGAAGGGC 

CTTACCNAAACCGTCTTCCTTGGAAAAGAATGCCAAAAAACCAANCCGTTGGGCCGGGCC 

CCGCTTCTTAANAACTAAGTGGGGATTCCCCCCGGGGCTTGCAAANGGAAATTTCGAAAT 
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TATTCAAAGCTTTAATCGGAATTACCCCGGTTCGGACCCTTCAAAANGGGGGGGGGGCCC 
Sequence 1370 

NATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGAAAGGATCAGGACC 

TGGAGTCTGGCAAGAGGAAGACAGAGGCCTGTGTGGGAAGCGAGTTGTTATCTTTGGTTA 

TCTAGCTGTATGAGTGTATTGGTCTTCATAAAGCTAGATAACCGAAAGTAAAAACTCCTT 

CAAGATCGCCGGGGAGCGTGTGAGAATGAAAGACTACAGCCGAGAGACAGTAAAAACCAN 

AMGGTCAGGAATACTTATTGAATCTAACTTTGTTTTTGNTTTGTTT^ 

TAAAGGNGGGATGAGAGAAAATTAAATGACACACACATGCTAAAATATCAAGGGTCCACA 

TATGTCTTGACAGGGGTTTGTTGCAACTGCCAANGANAAATGNNTANTGATAATGAGTAA 

AGATGCATGNGCAATTTNGTCTATTTTAAGGCAAAAGTTNTNTCAGGGAI I I I I I I MA 

Sequence 1371 

AGGTACGCGGGGGGGGGTGGTGGGGAAGAAAAAAAGAAAAAGAAAAATTAAGCATTAACA 

CGTAAGATACCCAGAAAAGAGGGCTTACTCAATCAATCGANAAAACGCACGGAGTCTCTG 

CTGTGATAGGCAGCAAGCTGAGGGGGCTGTTAANANAACAGCAAGACAAACAGAGGACAC 

ATCTGGTGTATCAGGGAGCAGGACACACAGTCTGGGTCAATGGGTGCACANATGGGACAT 

ATGCAATGAGANACAGCATGGAAAGACAGGCTGCCTGACGGGCCTGACAGGTTCTGAATG 

CTGGGCCTTAGGTGGTCGGCAGGANAAAGTCAGGGCTTCACAGCCAGGACACGACACAAA 

GAGACAAAACTGTANAAAGAGGAGGGACCAGCTGTACCTTCCCG 

Sequence 1372 

CCGGGCAGGTACCAGGTTATAGATGGGTGCTGAGGATAGAGCAGTGAACAAAACCAACAA 

AAATTCCTGGCCTTTGTCTTTCATTCTAGCAAGTAGACACAGAAAAGGCAATAAATGCTA 

AACATAATAAACAAATTATATAGAATATTAAAAGATAGCCGGGCACAGCGGCTCGTGCCC 

GTAATCCCAGCGCTTTGGGAGACTGAGGCGGGCAGATCTCCTGAGGTCAGGAACTCGAGA 

CCAGCCTGGCTAACATAGTGAAACCCTGTCTCTACTAAAATATAAATACAAAAATTAGCT 

GGGCATGATGGTGTGCGCCTGTAGTCCCAGCTACTCGAGAGGCTGGTGCAACAAANTTGC 

TTGAACCTGGGAAGTGGAAGGTTGCANTGAACCAAGATCACACCACTTGNACTTCAGCCT 

TGGGCNACAGAATGAGMCTCTGTCTTCAAAAAATTAATTTAATTMTTAAAAATTAAAA 

ATTNGAAGGCTTAAAAGAATAAAAAAAGTGCCCCCGGGGGGAAAMCCAAACCAMGTTT 

TGGNGGATTGGGAGTACCCTNGGGCCGCTNTAAAACTANTGGGATNCCCCGGCTTGCAGG 

AATTTNANATTAANNCTTATNNATACCGCGACCTTCAAGGGG 

Sequence 1373 

CCGGGCAGGTACCTTTTCTTCTTTGATTATAGTCAAMGCCAAGGCTATGACATTCTTGA 

AATAACGTGGATTCTCATATGGCCTTATTGGGGAATTTAMTTGGTTTCATCAGAAAGAT 

GTATACI I I I I AATATTGTTCTTCCTGAATACAGTAAATAGCCTTCATGCCTCAGGCAAG 

TMCTCCATCTTCTGCCAAATATCCATGGGCACAAGCACMGTTCTCCGGGAATTTCCTC 

GATAAAGACAGAGTTGCTTACAGCCACCATTGTCCCTGGCACAAACATTGGTCCCTTTCT 

CTCTTACTCGGTTAAATACCCGCGTACCT 

Sequence 1374 

CCGCGGGGTTGGGCCGGGCCCGGGGGGGGTTAAGGGAATGGGGGAAAGCCCGTTGGGNGG 

GCCAAAGGGGGAAGGTTTGCCAAGGGAAAGCCCAATCCCTTCCCCTTCCTTGCCGGGCAA 

GGGTTACCCTTCGGCCCGCTTCTTANAAAACTAAGTGGGGATCCCCCCGGGGCTTGCAAG 

GGAAAATTTCCNAATANTTCAAAGCCTTTMTTCGNATTACCCGTCCGMCCCTTCTAAA 

NGGGGGGGGGGGCCCCCCGGTTACCCCCAANCCTTTTTTTGTTTCCCCT 

Sequence 1375 

GGGANGCCTTCCCACCCCGCCGGGGTTGGGCCGGGGCCCCGGGGGGGGGTTAAGGGAAAT 
GGGGGGAAAAGCCCCGTTGGGGGGGCCAAAAGGGGGGAANGGTTTGCCAANGGGAAANGC 
CCCCCAATCCCTTTTCCCCCCTTCCCTTGCCGGGGCCAAGGGTTACCCTTCGGGGCCCCG 
CCTTCTTTAANAAAAACCTAAGTTGGGGAATCCCCCCCCGGGGGCCTTGGCAAGGGAAAA 
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TTCCNNAATTATTCCAAAAGCCTTTAATCCGAAATAACCCCGGTCCGACCTTCCGAAAGG 

GGGGGGGGGCCCCCNGGTACCCCAAACCTTTTTTTGTTTC 

Sequence 1376 

TGGGGAAGCTCCACCGCGGGTTGGGCCGGGCCCGGCCGGGGGGANGGCTTGACCNAAGAG 
CCGGGGGANGGCCGTTTAAGCCGAAAAGGGAAAAGAAGAAAAAAAACCCGAANGACNGAA 
AAGCCACCTTACAGGCCCCCGCCGTTACCTTCGGGCCCGCTCTANGAAACTTAAGTGGGA 
ATCCCCCCGGGGCCTTGCAAGGGGAATTCCGAATATCAAAAGCTTTATTCCGAATTTACC 

CC GTCCG ACCTTCNGAANGGGGGGGGGGGCCCCCGGGTACCCCAAACTTTTTTGTTTCCC 

CCTTTTTAAATTGGAAGGGGGTTTA 

Sequence 1377 

CGCGGTGGCGGCCGAGTCTTCTCCTTCTAAGAGTG7TCNCCTTCAGCAGGAGGACTCCAG 

GCC TCTTCA ATCAGTCGAAAGACAGAATCGAAATAAGTCAGATAGAACTGCCAGTCATCT 

GAG i mm I AGTAAGAGGCGCCGGGAAAGGGCATTGACTNTNGGCCACCTGCTCAGCTTC 
TTGT 

Sequence 1378 

GGACTCNCGCGGTGNGGCGCCGGCAGGTCCGGGTGNCTGACAGGNCTCTAGTNNCTNCTC 
AATGAGGCTCCTGTACTCTGGGNCTGTATNCTCTGGNCTCTGGGNCAGTGGGGNATTGGA 
TACTAGCTCACTTTCCTGCCGCNCCCTGAGAGC • 
Sequence 1379 

CGAATTGGAGCTCCACCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGTGAGAAAGACATG 
ACATGGCTGAGATGGAGAGAATATGAGGTGGCCACAGCAGAAAAGTTGAGAACAATTTGA 
AAGAGGAAAATCTGTGGCCAAATTCAAGGCACCCTAGGCTGTGATCCTGGAACTGAACAT 
CTGATGAGTCAATACAGGGCACGGAGTAGGACTTTGAAGTCCTCCATTGGATCTTCTCGG 

AAGATGAGGGAAATGAGAGAGTGTGGAGAGAGATTTTGTAAGTCCGGTCTGGAAGTGGTC 

ATTGTTCAGAATCCANCCACGTGGGT 

Sequence 1380 

AATTGGAGCTCGCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTGAAGATATGGC 

GGCGTCTGCGTCTGCAGCTGCAGGGGAGGAGGACTGGGTCCTTCCCTCTGAAGTTGAAGT 

ATTGGAGTCCATCTATCTAGATGAACTACAGGTGATTAAAGGAAATGGCAGAACTTCACC 

ATGGGAGATCTACATCACTTTGCATCCTGCCACTGCAGAGGACCAGGATTCACAGTATGT 

CTGCTTCACTCTGGTGCTTCAGGTCCCAGCAGAGTATCCCCATGAGGTGCCACAGATCTC 

TATCCGAAATCCCCGAGGACTTTCAGATGAACAGATCCACACNATCTTACAGGTGCTGGG 

CCACGTGGCCAAGGCTGGGCTGGGCACTGCCATGCTGTATGAACTCATTGAGAAAGGGAA 
G 

Sequence 1381 

ACATNAAATGGGGAGGTCACATCTATTAGTGATCAGCTGTCATAATCAGAAGAGGCCATA 

CAGCATTCATTGAATATTGGAGGAACAGGACTCAGATACAGACNTACANNANCTCACTTT 

NCTGCAGAGAAAAAAATGGGAACACATATGATGACCCTTCAACGTGATGAGTACCTGCCC 
G 

Sequence 1382 

GCCNATTGGAGCTCCCCGTTGTGGCGGCCGCNCGGGCAGGTTNNCGGGGGGGTCTGAAGC 

TGAAAACTGTAACCATGATGAAGGCAAAGGAAAGAGCAGAGAGCTCGGTAGACAGAGGAA 

GGAGCCTGTCCTCAGACAGGCCCAAAACCAGAGAACCTCTTGTCATCAATGAACTAATGA 

CATAGACAGATCCAAATGACTGCCAGATCTGCCAGGAATCAGTATTTTTCAGGCTTCAGT 

GATCTCCTCTGTCCTGGAGAACAATTCCTGTATATTATGAGGACTCTGACATCTTCTCTG 

ACCACCAGTTTATTTTCTTCTCTTAATACAGAATCAACTGCATTAAGGTCATGGTAGAAA 

CCAAAGCTAAAACAGTTATTTTAAAACATCATGATGTAAGTGCTGTAAACCTTCTTAGAA 

TTTAAACATGCTCAGGGCATGAAAGAGAAATGAT 

Sequence 1383 
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GTGGTAACTGATCAATCTTGGTGAGCTCTAAAATACTGCAATTGCCTCAACATTACCCTT 
TTATGGACAGTTCCAAATTMCTTATTTT^ 

GAAAAAGCTTCTGATTCAACACTCTACCTTCTGCATCCTGTTGAACACCATATACATCAG 

TTTGAGTATTACCGACTGCTGTAACAAACAAGACCCTCAATTTCAGGGAGTTTACAAA7T 

AGAAATTAAATTTGCGAAAGTAAAATGTGGATGTTGCCGGACAGCTGGTAGATATGCTCC 

ACATAGTGATTCAGTGGCAATATCACCTTCCTTCTTGTAGTTTTGACATTGCTTTGAGTC 

TCAGAACTTTCTGTGACAGCCATTATTCACCCGCGTACCTGCCCG 

Sequence 1 384 

GCGGTGGCGGCCNNGGCACCTTGCCGCANGAGGGGAAGGATGGGCTTCCTGCAANGAANG 

GGTTGCCCCACGGCTTNCCATNCTNNCCN 

Sequence 1 385 

CCGGGCAGGTACCGCTAGCCAGTGCCGCATTTATCCATCTGTAAGAAGGCCCTGGTGGAG 

AGGATGGGATGAGAACAAGAGGCTACCTCCAGTTAACCAGGACATAAAGTCCCCAGCGGT 

TCCTGTCACACCTGCTCCTCCCTCCCCAGGGTGCATCCATGATCGTGGATGTTTGCCCAG 

GGGTGACCATGTTTGGCTGGCTTGGAATGCTGTGCATTCTCAGAGCTCTGTTAAGTGTCC 

CCTCTTGGGGGTCAAGAAGATGAAGGTGTGGCCAGGGGTCTTAGNAGGNAATGAGTGTCC 

AAGGCAAGNAGTTCAANAAGGTTAAGGNAATGTCCCTCTTGGATAGNGGNCTGAAATTCA 

AAGGGGATTTCCTGGGCTTTAANAAAMGGGGTCTTGCTTATTCTmNCAAAA 

TGCCMNGTATrCTTGTAGCCCAGGTGTAMTTGAAAATTCACTTTCCCAAAATCCAANC 

CCTGTrTACCCTGGACCTTTTTGNTCAATTGGTTTTTTNCCCC^ 

NTATTCANTTNGAAMCNAGGGTCMTTTTTAAMCCAAMTCCTTATT^ 

TTCCTNCAAGGGGNTNTGGGGAAAATTAAATTTCGAATTGAATTTGGCTTTTAAATCCCC 
TTTCCCCCCC 

Sequence 1386 

ACCGNGGTGGCGGNCGCCCGGGCAGGTACGCGGGTGAGTGTGTGTGGGAGTTTGTGCTAA 

TGTTCAGTNGGTGGAAGCTCAGGGATTTCCACTGTATTCTTAAGGTGAAGTCAGAGATTT 

GAGAGAGGATACCCACTTGAACCATACAGCTTGCCATTTTGGGCTGCAACAACNAGGCCA 

GTTGTATAAGGAGAATAAAGAAACTGGTGCATTTTGGAGTCAGACAAAAAAGTTCATTCA 

TGGCTCTTTCAGTCTGGAA GATTC TGAAACCTTTGGTAACATATGTAATTCCTAAAATAA 

NGAATAAGNGCAAAGTGGATTTTTAAATAAAAGGGGCANATACCAGTTGATTTTCAAANG 

AAANGAGGAAAAATTACTTNCCANAGGGTGCCTGCCATTTCATTGAAAAAAAGNCCAGTT 

NCCCCATTGTAAAAGAAAAAAAANGAATTCNTGTTNCAAGGMGCCAATTCCCTTTNATT 

TCAAAAAACCCCAGGNTCCGTTAAATTTTGCCCCCATTTTTCNCCTTTGTTAAA 

AAAA AAAGNGGCCCCCTGGACCTTAAAAGGCCCCNCTTTAAAAAAACNAGGANTGNGNAT 

TTTTCNAANCAANGAGGCCTCCNNCTTTAAACCTTCCTTCTCCNTTGGTCANTGAAAATT 

NCATTGGGGGGGGGGGAAAANGGGANTNAATTNN iTTTTTTTTI 

Sequence 1 387 

TCCGGGCAGGTACGCGGGGGAAGGGGCGGGATCCGAGGCAAGCGTTGGTTCTGTGCGCCT 

CAGAGTTGGAGCACACAGCTGTATTAAAAAGGCAAATCGAAGGCCGGGCGCGGTGACTCA 

CGCCTGTCATCCTAGCACTTTGGGAGGCCGAGGCGGCTGAATCACTTGAGGTTAGGAGTT 

TGAGATCAGCCCGGGCAACATGGTGAAACCCCGTCTCTACAAAAATAGAAAAATTAGCCC 

GAGCCGTGNATGGTTGGATGCCTGTAATCCTAGCTCCTCGGGGAGGCTAAGGAAGTATTA 

ATGTNTCAAGGGAAGGGCGAATTATTGGGGAAGGTGGGGACCCATTATCTTAGTTAAGTG 

ATGAAAAGGTGAGGGAANTAAAAAAGCCCAAAAACCCAGNACAAGCCCATTCTANCTCTT 

TCTCTTTCTCTGGTGCCCAGGCCAAGNGAGGCCATGCTAANTTGAGGCCCCAAGGGNTAN 

CCTTCGGGCCCGGCTTCTTAAGGAAACCTAAGTGGGGATTCCCCCCCCGGGGGCCTGGCC 

AAGGGAAATTTCNGAATTATTTCAAAAGGCTTTTATTCCGATTACCCCGGTTCNGAACCC 

TTNCNAAGGGGGGGGGGGGCCCCCGGG 

Sequence 1 388 
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AGTGGGTTCTGAAAGTCCATTCCGGCGTAGGGTGGAAACCGGCGGGAGGCGTTGCTGATT 

CTCGATTAAACAATAGTGACGAGACGCGTGCGGGGCGCCTGCGAGTCCCTTCCTCCCCGC 
GTACCTGCCCG 

Sequence 1389 

ATACCj i i i rTGTCCATTTrATGTGATTTGCTATAACTTCTTTTTTACTTACCTCTGAAG 

TAGMTTTGATTGANAAGATAAATNTAGCTAAATTCCAAATACCATTCCCTCTTGTTTTA 

CAACAATTAGMCACATTAAAATGAANACTTGATTCATGACACAGCACCTAACAAACTTN 

TTGGGCATTTCCTGATGTCCTGTTGTCAC TG NATGCC TGCAGCACTGTTTCC C TGTTTC A 

TCATGTGGGGAGGTGGGAGTAGGTMTGGAGCTTCTGNTAAAAAAATTAGAAAACTTTAA 
AAAATA 

Sequence 1390 

TATCTGTATTATCTCATTTAATCCTAAACCCTGTGCGTTAGGTTTTACCCTTATTCTTAT 

CCACGGCAGAGTTAGGGTCCAAGATCATGCTGTTGAACTGGCAAATNTGGGATTTAANCN 

CAG GTCATC TGATTNCAGAGCACCTCCTCATTAACTGTCTCTCAAAAATGATCAGGTCCA 

ACAGTTTTrCAAATTATATACCACTGTCAGCMGATTTCTATATTGAATTT 

TATAAATAACTAGAAGGATCTAATAACAACATATATATTTATAGGANCAAATCTAAATAA 

CACTGAAATGATTAAACAAGGGTNAAATTTAGGCTCATATGATCCTGATTTAATTCTCAT 

TTTGGTACCTCGGGCCCGGCTCTAGAAACTAGTGGGATCCCCCGGGGGCCTGGGAGGGAA 

ATTTCGANTTNTTCANAGCNTTTATTCCGAATTCCCGTTCGGACCCCTTCGAAGGGGGGG 

GGGGCCCCCGGGGTACCCCAAGCCTTTTTTNGNTTTCCCCTTTTAGNTNGGAGGGGGGTT 

Sequence 1391 

ACGCCTATGCCAAGTGGAAACTCTGTTCTGCTTCAGCAATATGCTTCATTTTCATGATTG 

CAGAGGCCGTGGGTGGGCACATTGCTGGGAGTCTTGCTGTTGTCACAGATGCTGCCCACC 

TCTTAATTGACCTGACCAGTTTCCTGCTCAGTCTCTTCTCCCTGTGGTTGCATCGAAGCC 

TCCCTCTAAGCGGCTGACATTTGGATGGCACCGAGCAGAGATCCTTGGTGCCCTGCTCTC 

CATCCTGTGCATCTGGGTTGGTGACTGGCGTGCTANTGTACCTGCCCG 

Sequence 1392 

TGGTNACTATCCAAATTCATGGTCGCCAAGGACATCGGTAAAGCCTGGAGTANCTTTCAG 

ATGTGAGGAGCTTCCTTAAAGCTCTGGAATAAGCAGGATTCAGGAATAGTANGGATAAAT 

GCANAGCGCTTAGAAGATGACCAATAAGAACAGATCTTAGTCCTGCCTGAACATGCTGTG 

AAAGCTTGAGGAACACACCAGTGCCAGCCCTCAATCTCCTCAACTGACTTGGGGATTAGA 

CTACAGGCTATCGATAAAGTTTCCATCCATCCTTAAAATCCTATGACTCCAAAATTTCTA 

TCACATGCAAAAGTATAAAGGCAGGAAGGAACTGTACCTGCCCGGN 

Sequence 1393 

CCCTTAGCGTGGTCGCGGCCGAGGTACATGCAAGCTACTCTTGCTCTTTGGATCCATGGC 

TGAATCTGGACACCAGCAGTGACCAGCTCTAGAGTAAAAAAATCAGGAAGGGGGCTGGCC 

ACACTGGGTTGGAGAATCAGACTTGGGGATACCATGCCATGAGGTGTGTGGCCTCTGGGC 

AACAAAACMCACCAGTTTAGACTCATCTCCTTGAAACAGAAGGAAAAGCCAGGCTTGAA 

AACATGGGTTAGAACAAGGAATCTTGGGCCAGGCGCAGTGGCTCAGGCCTGTAATCTC 
Sequence 1394 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCTTGCCTGTGGACTTTTATTATCCTTCCAA 

CTTCATTCTTGTAGGGTCTCAAATCCTTTCCTTGGCTTCCCTTCAGACTGCCTCTGTGCT 

GTCATGGTCTTTCTTGGGAAGACCTTCTTCTGCTTTCTCTCTAGGAACAACTCGGCCAGG 

TATAGTTGCCTTTCCCAAGATTACAATTTAGCCCAGGATTCAGTCCCATGGTATCTCCGN 

CAGGATAGAGCCCACGGGCAAGATTAAATTATCTCATTCTCACAGTTAGATCTTTGCAGA 

AGCTTAGCAATGCCATCTGAGAAAGGAGGCATCTGAAGTTATCAAGCAGACAGGTAACTC 
CGA 

Sequence 1395 
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• CCCTTAGCGTGGTCGCGGCCCGAGGTACTTTCTGGCTTGTTGAGCGTGTNCTNACTGCTG 
GCCCTNTTGAGCCTGCTGAGTCGNGACTCAAAAGCCAAGGAAGGTGAAGACTTATAACTC 
TTCATGCCGGA 
Sequence 1396 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGTGGTTATGCCGGGAAAAAGACACAAGTCC 

TTTTCTTTGAAAGATAAACMTGAAATATTTATGGGTGAAATAGTGTCTGGGATTTGT^ 

TAAAATATATTAACACAAAAAGTGTGAGGAATAAATAAAATATGAAGCTGATGGTTTGCT 

GACAGATACATTGGGTTCCCCGTACTATTCATTCCTTTGCCTTAAAAATAAAAACACTCA 

AACAAAATTAGAAAAAAAAAAAAGAAAAAGAAAAAAGTACCTCGGCCCGCGACCACGCTA 

AGGG 

Sequence 1397 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTCCATGCATGGCTAAI I I I I GCTTTTCTGT 

TTTTGTAGAGACTGGGTTTCGTCATGTTGCTCAGGCTGGTCTTGAACCCCCGAGCTCAAG 

CAATCCTCCCACCTCAGCCTCCCAAAGTGCTGGGACTACAGGCATGAGCCACCACGCCTG 

GCCTTTATATGTATATTTTTGATCATAGTAACATCAGTATGCAA7TAAAGAGTAATAATA 

TGTATGTAAAATAAAGACACGMTATTTATTCAACTCTAGACAACGTTTGATGCCATAAT 

GGGTTTAAAATAACTCTGGGAGCCTGGGAATCTAGTCTCTAGATTGGAAACCACTGAACT 

GGGTATTTTATTATTAAAAACAATAATCATTATTA I Nil I CCATTGTAATAACATGTAA 

AAAAACATATTTCCCATATGCTCAAGTGGAGAAMTTTGGAAATAAGGAAAGTAGCAAGA 

AGAAGTGGGGGGAAAGMGCACCCATAATTCTATTAGCCAGAGTCNAAAACATCTTTTAA 

CACTTTTTCTGTGCATTAAAAAAAAAAAAAAAA 

Sequence 1398 

CCCTTAGCGTGGTCGCGGCCGAGGTACGGACAACAGTGTATGTGCAACCATGCTCGCTCT 

GCCCTGGGAGGGGCAGTGGAGGCAGAAATGGATGAGGAAATAGTGAAAGATATCAAAGAT 

ACCCCATACTCTCGGGGCATCAGCTACTGGCCTTATTTGGTTTTATAAAACATAAACGTT 

CCTCCCAAACCCTTCCAGTATTTTGAGGGGCTTTCACATTTCCAAATACAAAGTTTTACA 

AACAAGCCTATTATAAAAATTGCATTGTCATGTGGCTAAATTACAAAAATGACTTCAGCT 

TTGTATMGTCATTTAAAACAGTGTATACACTAAGGCTATTTTAATCTTCCAAAGAACTA 

TTGTGGATAGCACAGTTTTCCTCACAGCTAATGCTTTATCAATCATNTATCTCAAATTCC 

TGTTTTAACCATTTTGGATCAAAAGCCTTTA 

Sequence 1399 

CCCTTCGAGCGGCCGCCCGGGCAGGTACTCTTGCCTGTGGACTTTTATTATCCTTCCAAC 

TTCATTCTTGTAGGGTCTCAAATCCTTTCCTTGGCTTCCCTTCAGACTGCCTCTGTGCTG 

TCATGGTCTTTCTTGGGMGACCTTCTTCTGCTTTCTCTCTAGGAACAACTCGGCCAGGT 

ATAGTTGCCTTTCCCAAGATTGCAATTTAGCCCAGGATTCAGTCCCATGGTATCTCCGCC 

AGGATAGAGCCCACGGGCAAGATTAAATTATCTCATCCTCACAGTTAGATCTTTGCAGAG 

CTTAGCAATGCCATCTGAGAAAGGAGGCATCTGAAGTTATCAAGCAGACAGGTAACTCCG 

ACCAATTCAGCACTCTGCCCCAACTCCAGATTCTGTGGTAGGCATCTGATATGGGGCTTC 

CAGCTTCAAACAGCAAGCTACATGTTCATAAGCATCACTGGAGGTCTTTG 

Sequence 1400 

GATATCTGCAGAATTCGCCCTTAGCGTGGTCGCGGCCCGAGGTACI Ml I III I I Ml I I 

TTTTTAGACAGAGTCTCACTCTGTTGCCCAGGCTGCGTGATCTCAGCTCACTGCAACCTC 

CACTTCCCGCTTCAAGCGATTCTCCTGCCTTAGCCTCCTGAGTAGCTGGGATTACAGGTG 

CGCACCACCACGCCCGGCTAATTTTTGTAT7TTTAGTAAAGACGGGGTTTCACCATGTTG 

GTGAGGCTGGTCTCGAACTCCTGACCTCATGATCCACCCACGTTGGCCTCCCAAATGCAG 

TAGCGTCTTAAAGACCATCTTAAAGGCTGGTAAGAGTCTGGACTTTATTCTAAGGACTAG 

AAGAGAAGCCATTTGGTTTTATGCAGGGGGATGACAGGATGTGATTTTCACTTTAAGAAG 

ACAACTAGTGCTACAGTATGGAGACTAGATTGTAGAGGATGAGAGTGGTGATTGGGAAGG 

CCGTTCAAAAGGCTGTCACAGGGAGCCAAGGG 
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Sequence 1401 

CCCTTANCNGNGGCCGNWCCGACGGGCCCTTTGCCTTTTGAATTCTTGATTGTmCATT 

GAAGAAA TGTGGAA ATCTGAAACAAACTATATTCTCAATCATGTTAGAAAAAGTATATGT 

AATATGTA I M I I i I AAAGTCCAGTGGCTAAGAACTTAGACTGGGTTTTGAAATAATAAT 

TTCCTAACACAGTTTCTCAGTATGTTTCCTCTAGACAAGAGAAAACTCTGTAATTATGAT 

TCTTAAriTACCCTGGTAATTTCATAAGAAAAATGATTAATTACTTTGAGCMTCA 

TACAGATAATCATGGTTCTTTATTTAAACCACAAAAAATAGAAAGAAAGCAGAGAAGCTT 

AAG CATAA AATATAMGACAAATTCAGTCAmAGTAAAAATGAAATCAGTGTTAAAATA 

TTTCT7TTGACACAGTCAGCTAACATCTTTCTATTTTTGCTGAGTAACTATTMTTATAT 

TTTAAGTTCAACTAACTCTTGACACAGCAATGGGMTTTGAAGTGAATATACTACTTTGA 

AAMTA AATG GTGTATTTAGAGTGGTGACAmAGAAMTGGTAGNTTTGTGAA^ 

AATGTGTTTTAAAA7TATAMTGCTAAAGTNAGGTGAATGCATTGATTATNTTTAA 

AATAAAAAC C G AAG 

Sequence 1402 

CCCTTAGCGTGGTCGCGGCCGAGGTACATGTCATTCCTCCTGACAGCAACTTTAAAGCCC 

CAGGATCAGCTTTCTGCCCTGCAGTTGCTGGTCTACCTGATGCCACCCTGCCACAGTGAT 

ACCCTGGAGCGTCTGCTGAAGGCCCTGCATAAAATCACTGAGAACTGCGAGGACTCAATT 

GGCATTGATGGACAGTTGGTCCCAGGCAACCGTATGGCTTCCACTAACTTGGCCTTGGTG 

TTTGGATCTGCTCTCCTGAAAAAAGGAMGTTTGGCAAGAGAGAGTCCAGGAAAACAAAG 

CTGGGGATTGATCACTATGTTGCTTCTGTCAATGTGGTCCGTGCCATGATTGATAACTGG 

GATGTCCTCTTCCAGGTGCCTCCCCATATTCAGAGGCAGGTTGCTAAGCGCGGGTGGAAG 

TCCAGCCCGGAAGCACTTGATTTTATCAGACGCAGGAACTTGAGGAAGATCCAGAGTGCA 

CGCATAAAGATGGAAGAAGATGCACTACTTTCTGATCCAGTGGAAACCTCTGCTGAACCC 

GGGCTGCTGTCCTTGCTCAAAGCAAGCCTTCTGATGAAAGGTNCTNTGANGAGCCAGCTG 

TACCTTTCCGGACTTGCCGTTCCCTTGACNATGAGGAAGGAGCGGGTAACCTTCCATTTC 

CGGACAAGAACCGCCCATTGCTTCGTGTGCCCCGGGAGAAGGANGGCAAAAACTGGGGNN 
AN NTCCTTNTTTCCTTAA 

Sequence 1403 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTATGACCATCTTACATTATTTTTATGGGTGGG 

GGGCATTGACTGTGGMTGTGGGCAGTAACTTGCACAGTCAGTAACCGTTTGAGTAACTT 

CTTGTTGGCATCCCCATTCTGGCACTCCTCCTCTAGGTCTCCACCTCACACGCTGGTTTG 

TGGGCGGAGGGGCAGGTTGGTGCGTGGGGTGTCCGGGCACTGGCTGTGCATGCCTTCTTC 

CTCTTCTGTCTCTTGGCCACCTTTTCCAAAAAGTCACCAGTGACCAATTCTCCCAGTGTT 

TCTTTGGGACTCAATGCCTTGGGCTTGGCATTGGGTAAAGCCAACTGGCCAGT 

Sequence 1404 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTCCTGGCTTGTTGAGCGTGTCCTCACTGCTGG 
CCCTCTTGAGCCTGCTGAGTCGGGACTCAAAAGCCAAGGAAGTTGAAGACTTAGAACTCT 
TCATGCCGGAAGAGGCTGCAGGCATAGGCCGCACCCNGTCTGGGCCGNGG 
Sequence 1405 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I I I I I I I I I GTTTTTTGTTTTTTGTTTTTT 

CCTTAACTACGCGTGCTTTAAAATCTCAGTTAAATTCTTCCTTGAGCATTACTTTCTTAG 

AGTGGCCTTTCCCACTGTGTTCATCAGMCATTAGTTTTGTTGGCTAAATGGCACTATGT 

GACATAAAGCCAAATGAGATTGGGAAATTTCAGTCTAGATTGGACACACGTCACCCTCTN 

CAATCCNTAGGACTTCTTAGAGCTGTNAACGCTAATGGACATTGTGGATCTCTGGGAGGG 

GATATGGATTATACAGTTTTCATTAAACCTTMTCCTGGACCAAAGGAANTNTCCTTTCA 

TTTTCTTT 

Sequence 1406 

CCCTTAGCGTGGTCGCGGCCGAGGTACTCATTGGAGGATCAGCTCACCTGCTTNGCTCTC 
GATGTAGCCTAACTGGGTTTAGAGCCTTCCCTTGAATGAAGAACCCTCCCCAGCTGGAAA 
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GGGGATGCTCTTGAAAAGCTCAGCTGACAACACACATGGGCATCAAAGTCATTGGCCACA 

TACATGCCTCAAGTGTNCTAAAACCCNAATATGATCAAAANAAAACTGCTGTTCANCAAG 

TGGAGACTGGCATGCAANATTCCCCTGGGCCTGCAAGCCTAGTTGNAAAAAAGATACCCA 

AAATACTGNTNGGAANAATGAAAAGGGATTGAAAAGGATGGTCATCCAAAGTTNGTGNTT 

TTNCACTTGGATGGGAAAAGGACTTAAAACCGCAAANGCATGTTTCANNGATCCAAACCA 

CAANCACCCACCAGGGGATTCC 

Sequence 1407 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCCAATCTGGGGCCAGAGCAAGACCCTATCT 

CAGTCAATCAATCAAGATTTTGTGTAAATATMTCATTATTCCACAATTTTAAAAM 

GCTTCAAACACAGAAATATGAAAATCAGAATTAGGAAGGCTGGTGGTCAGTCTTCACTTA 

AAAATAAGTTTATAMGAGTCCATTATCTGTTAGAGATATGTATTGAAGCATTTTCCAAT 

AAAGTGAATACTGGGATTTGCTTTAAAATCATCTGAGGTAGGGGGAGCCTAGAATTAACT 

GTTAAAGCTAAGTGATGGGAGTTCACTAAATTATTCTATTTTTCTGTTTGAAATCCTCCA 

TCATGAAATGTTTCAmTG H I I" IAAGTAAGTTTCCAAAGATCTTTATCATTAATATAC 

TCTGTAGATAAAATGACAGGCTGCCCTTAAGCTGTCAAACTCTGGTGGCTCAACCAGGTA 

TGCTCCACACTGAGCCCAACAGTGTAGACTCACTCAATGCAGGCCCCTTCACAGAAGGCC 

ACAAAANGCCCCTCAGAAAACAATCAGAGCCCCAGTCAGACCACCAGCCACANTGCAGGG 

AGCTGGTGCGGNCTAATCTCTCTCACATTAGGTATCCCGAATGTGAAAATCCAAAATCCA 

A 

Sequence 1408 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTAGCACTTGTGACCAACAGCTAGTCTTGCTT 

TCAACAGCTAGTCTTGCTTTCCCTGATCGTAGATCCCTACT>VVAAGCTCTTGAAAGTGTG 

CATATGCAGGCTACCTAATGATTTAACGGAGGATCAAGATGGTTACACTGCAGAGAAACA 

GGAGGGATAG GGCATATGCC CTTTGTCC AGCAAGGCAG ACAAAAACTAGG AACTTGAATT 

CTCTTCTCTGGATTCCTGTATCTGGAGCCAAGGCCAAGGATCAGTTCTTCTAGAAAGTTC 

CCTCCTGTGTGTGTGTGTGTAAATGTGTGTGAATGCAGCCATAACTTCCCTGCCAAAATT 

CATTGCCAAAAATTTTGTGATTTGAAGCAGGCCTTACCTGTTAATGCAAATTTGATTCT 

AAATCCTAAGAGTTTCCAAAGGTTGAGTGAAGTGAGATGAGACTCTTAATTGAAGATCCA 

GATGCTTNCCTGGTGCTATCTAAATGCTACCAAAACTTCCAGCACAACAAAGTNTATGAA 

AGAGANGCTCTAGGGTCATGAAATGGTAAAAGTTGGAAAAATGGGCCC 

Sequence 1409 

CCCTTAGCGTGGTCGCGGCCCGAGGTACTAAGACCTCCAGGTGGTAGATGTAGATGGATA 

CCGGCTATAATGGTAGCAGTAGCCTAAGTGGAGCCATCCTCAAGCTCCCAGGAGAAGTGC 

ACAGATGCCAACAATGGTGGATGGCGGGAGCAATATCCAGATTCCTGGAAATTGTGTTTA 

GTCACTGAGGAACAGGGGGAGTGCCAGACCAGATAGACCCATCCTTGGGTCCCCTAGTGG 

TGCACACATGCACAGGCTGTGGTAGGCAAGATGGTCTTATTTCGAGGTTCCCCAATGGAA 

TGCTTGGGTGGCAAGTAGCAACAGCTGCAGTGAACAAGGAGACCCCACCTCAGGGCACAG 

GCAAGTGCACTATGGCCTTGCTGCTGGGGAAACAGGGTTGCTACCAGTGGCAGCAGCCAC 

AGGCAAGCANCANTCANGCTCTGGGGAGGGCATGCTTCACCCCCAATAATGGTAGCCTTT 

GATNGTGGCAATGGTAGCTTCAAGGAAAACCGGCCTTAATGTGCCTTNCAGTGTGGCACA 

TTCCTTCTTGTTGGGGGT 

Sequence 1410 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTAC ! I I M I M I I I I III II I I ITAAGAAACAC 

GGTCTCACTCTTTCACCCGGGTTAGAGTGCAGTAGTGTGATTGAGCTCACTGCAGCCTCG 

AACTCCTGGGCTCAGGTGATCCTCCCATATCAGCCTCCTGAGTAGTTGGGACTACAGGCA 

TGTGCCACCATGCTCAGCTAATTTAAAAAAAAAAATTGTAGACATGCAGTCTCGCTATAT 

TGCCCAGGCTGATACTATACTCCTGGCTTCAAACAATCTTCCTATCGCAGCCTCACAAAG 

TGTTGGCATTACAGGTGTGAGCCACTGCACCTGGCCAG I I I I I I I ACAATTATAAAAATA 

ATCTATGTTCATTTTCTACAATAAACATTGTAGAAAATGAAAAATAGAGAAAAAAGCTGT 
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CCCATAAAGCTTTTTTAAMTCACTGATTGAGTTTMTATTTTATTACTGCATATATTGC 
TTCATTTCTCA 

Sequence 1411 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAGTGGGTGTTATTTTGTTGAATGAATGTTTTA 

CCCAAACTTAT^GCATATTAGATTGTTTAAAAGCTCAATTAAAAAGTGAAAAATAGACA 

CTGAAAATTTTTTGAGTGATTTCACATGTGTTCATGAAACAATATATCTATTACAAATCA 

TTGAATATTTTAAAAGTAATGTCATGTATTCATTTTCTTATTTTATCTCCTCAGTAGACT 

GAGATCAGTGTGTAGGGGTGAAGCAAGGAGTCCTACCCTGTTTGACTAAGGTAAAAATTA 

■AGAATCAGTGAGAAATGGAATTTGCAAAAGTGCCTGCCAGATAATGTTAGAACTGGACCA 

GAAAATAGGAGTTGGTATAAAACTAGACCAGCGAGCTTTTTTTTCCTTCAAGATGCAGTT 

CAGTTrATTGCTTTTGTAAATTAGAGATTGTGTTTCTTGATCTTTATTAAAGTAGAATAC 

AATGTTAACCTACTTCAAATTTTAAAAAATATCACACATGTATATGTATGTGTGTGTGGT 

ATATACACACAGGAT7TTAAGGACAGNTTTTTGTGTGTGTGTGCATGCGCACCCATGCCA 
AAGGAAATT " 

Sequence 1412 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCACAATCACAMTGCAGCACTGTTTACTGAC 
AGGACCATTACTCTGTCAAAATCAGCACATCAAAAATATTATCCTGGAATCTAAAATAGT 
AGTCAACTGGGTTGTTAAAGCAAGGGATTGCTATAGATCTACAGGACAAAGTTCCATAGT 
GAAACACAAACTCCTGGG7TAGTCCTAGGCCAGGCAGGTGACCATAAATGTTCACATTCT 

GGTAGAATCCCATTTTCTAAAAATTATACAAACACATCGAAATCACTAGATTTTATATAT 

ATATATACNCACACACACACA 

Sequence 1413 

CCCTTAGCGTGGTCGCGGCCGAGGTACATAACTTATAAATCTCAACTAAATACATAGCTT 

ACTGTAGGAATATGTTGCCMTTGACAAGAAAAATAGTAAATATGAATAAGCTAAGAATT 

AGAAAAGAAAAAATTTTAATTATCAAACAATTACCTTACCAAATGGCTCAGATGAGTCCA 

MCAGTGAGCTCTTCCMCTTTTCCATTATTCCTAAGCCATATCAATTTCAGAATATTAA 

AAATGATGAAATGCTATCCAGTTAACCTTATAAAGAAGAGCTCTGATAGCAAAGTCCAAC 

AAAGGGAAAAAAAAATACACTACACAGATATTTCACCCATGAGTATATATTTAAAATTAA 

AAATAAAATATGGCCTAACAGAATTCAACATAATATTTAACAATAATACACTATGAACAA 

GTAATCCAAGAAATGCAAAGATATTTCAATATTA 

Sequence 1414 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTGAGGGTCACCAGATGCAGCAGCTCACGGC 

TGATGAAGATCCCTCTGAGGAAGTCTGTATTCTCTATGGCCTTCCACAAGTCCGAGCTCA 

GGAGGACTCCAGGGCTTGTCTGAACCTTCTGGAAAACATGCTGGTAGTGAACAAGGCGCT 

TCCACAGTCGCGCAAACTCCTCTTGGTCTAGCCGCCCATCCACTTTCAGTTCCATCAGAG 

CCACCAAGCTGCGGCACTCATCTAAGGAGAACATGTCCCCTGGAGGTCCTGTTAGAAGCT 

CCTGGTTGAGAAGGCCCTGAAGCTGGGTGGCATCAATGTCCAGCCTCTGCTGAGCATATC 

TGTTGAAAATGCTTTGTTGGGAGCCATGTTCTGAAGGGCTTCCT 

Sequence 1415 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGATGAGGAAAGGGATGCCTGGAGACTG 

GAAGAACCACTTTACTGTGGCTATGAATGAGAACTTTGATAAGCATTATGAAAAGAAGAT 

GGCAGGGTCCACACTGAACTTCTGCCTGGAGATCTGAGAGGAACAACAACAAACTAGGTG 

ACAGAGACTATGCCAACTATTTCGCCTTTTATTCTGTTGAGCAAGGAACTGTGACTGAAT 

GTGGAGCTTATGAGCTTCAGTCCATCTCCTATAGTGTGGCTAGTTTGCTATAATATTAAA 

ACATGATTTAAAATATCAACAAACCAGTTACTCCAGCAAATAAAATAAGAGAATTAGAGA 

GGAAAAAAAANAAAAAAAAAAAAAAAAAAAG 

Sequence 1416 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAAAAATTAGCTGGGCATAGTGGCACATGCCT 
GTAGTCCCAGCTACTCGGGAGGCTGATGAACAAGAATCACTTGAACCTGGGAGGAGGAGG 



Page 242 of 869 



WO 01/51628 



Table 2 



PCT/US01/00798 



TTGCAGTGAGCTGAGATCGTGCCACTGCACTCCAGCCTGAGCCGACAGAGCAAGACTGTC 

TCAAAAAAATAAAAAAGAAAAAAATAATAAAAAGGAAAACAAAGCCTTTAGAAAAACAAA 

GCCTTTGAAAAAGGATGTAAMTTGMTATATAGATACAATTATTTTTAAAMGTT^ 

AGAAAAATATCCTGGATTGTTGATGTTGNTTTAATATATTTTAATCATTTTTGTA 

NCCGTGTATAATTATGTTATAATGAAAGTAAAAAAAAAAAGGTCTTAGCANAAGCAGGCA 

GA 

Sequence 1417 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I I I I i I I I I I I I I II I I I I I I I M I I I ATT ANA 
NATGGGGTTTCACCATATTGGCCAGGCTGGTCTCAAACTCCTGACCTCAAGTGATCCACT 
CGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGACTCACTGTGCCCGGCCAGCAAA 
CAAGTTTTANATTTMCTTTATTTGM 

GGGGCTGCTGAACTTCAMTAMTGTTTCTTCATTGGTTCCTAAAAGGGATAAATAAGCC 
AGTTATTCTCAAACTCAAGTATGTGCCAAAATCCCCTGGAGGACTTGTTAAATCACAGAT 
TGCTTNGGCCCCACCCCCAGGGTTTNTGATTCAGGAGGTCTTGGGGTGGGGCCAANAATN 
TGN ATTTTC AAC AAGTTCTCAAGN G ATGCTTGATGTTGCTN GTCTGGGGACC ACACTTTG 
AGAAC C AC TTGTATTAAGGGC AAGG AAAAAAAAG AATTGN G GG AAAA 
Sequence 1418 

CCCTTTCGAGCGGCCGCCCGGGCCTCCCAAGTAGCTGGGACCACAGGCACATGCCAGGAT 
GCCTGGCTAATTTTTGTTA I III! IGTGGAGACAGAGTCTTACTATGTTGCCCACGCTGG 
TCTCGAACTCCTGAGCTCAAGCGATCTACTCACCTTGGCCTCCCAAAGTGCTGGGATTAC 
AGGCATGTGCCACCATGCCCCGCCAC I 1 1 1 1 1 1 GTTAATGTTTTAGACATATCCC ATGTA 
TATATAAATATAGATCATTGTCATTMTMTAATGACGGTTTAATACTATGTGATGTGTT 
GCAATTTATTAAATTCCCTA I I I I I I GAAATTTAAGTTGTATATCTGGTTAACTCTTGCC 
ATAATATTGCTTGCGTAACAACTACAAAAATAAGGNAAGATAGAAATAAGCACTTCCTTC 
TTGCTCGTGAG I I I ! I CAACCGTGGCTGANGGGGGATTGGGTACCTCGGCCCGCGACCAC 
CGCTAAGGGG 
Sequence 1419 

CCCTTAGCGTGGTCGCGGCCCGAGGCACTAACACTTTCCACACTCTGTCTCACTGCCCCC 

CTCCCAGGAAGTAAGACCTGTATCATCCTTATTTGTTGGGCGGGGAAATTGAGGCTCAGA 

GGTTGAGTGAC I I I I I 1 1GGGGTCTCAGCTGGTCACTCGGGAGCTGGGTTGGAGCCCAGT 

TGTGCCTGAGTCCACTGAATAAAAGCATTATTGGGAACTGGCTTCATTAACAATGAGCCC 

CAGTGGAAACGTAGTATGGAATGGAAATGGAAATGGAAATGCAGTGGAAATGTAGAGTGG 

ATATGGTGTAAAGACACCTGGAAAGCTTAAGGAGTGGGAGCTAGAGAGACAGAGAGAGAG 

ACCGGGAGACAGAGAGAGAGAGAGAGAGAAATCCCCAATTTCCTGGGTAACAGACACACA 

CCCTTCTNTACCCTGGCTCANGGCTGGCCATGGCTGTCCGTGGCTTGCTTTCTGGCCAGC 

AGAAAGGAATAACCTTG 

Sequence 1420 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCATAATTTATTCAACCATTCTCCTATGATA 

AACAAGCAC! I I I I II CCACTCTATTGCCACATAGACTATGCTGCAATAGACATCTTTGC 

ACAC ATATCCTTATATGCTG ATGCTTTTATTTGTATG GAAAG ATTCC AGAAATG TG AGTT 

CTGCTTTCAGCAGTATGTGTTTTTAATAGATGTTGCCAAATTGCTTTCCCAATAGGCTAT 

AGCAATTCATATTTCTATCAGCAACTGTTA I I I I I GTTCTTTCTGA I \ I i i GACAGTTCA 

ATGGGATGTAAAGCAATCTCTCATTATTACTTTTAAAAAGTCATCAAGTGTTATTATCCC 

MTTTTGCAGATAACTGAGGCTCAGAAAGTTTGAATAGCTGGATCAGTTTCCCAAAAGCT 

AGAACCAGAATTTGGAAGCCAAATCTGTCTGAACCCAAAGCCCATGCATTTTCTATCCAC 

CATGCTGCCTCCCGTATCATGAATTTGGAGCAGAGGACTTTNTTTGCCTGAGTGGGTAGG 

ATGCAGGTGTGTCTAGAGAANGCANAGGGTGTCATGGGAA 

Sequence 1421 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCATAATTTATTCAACCATTCTCCTATGATAA 
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™™ ' ' " " lCCACTCTATT GCCACATAGACTATGCTGCAATAGACATCTTTGCA 

CACATATCCTTATATGCTGATGCTTTTArTTGTATGGAAAGATTCCAGAAATGTGAGTTC 

TGCmCAGCAGTATGTGTTTTTAATAGATGTTGCCAAATTGCTTTCCCAATAGGCTATA 

GCAATTCATATTTCTATCAGCAACTGGTATTTTTGTTCTTTCTGATTTTTGACAGTTCAA 

TGG^TAAMGCAATCTNTCATTATTACTTTTTAAAAAGTCATCAGTGTTATTTATCCC 

AAl M I IGCAGATAACTGAGGCTCAANAAAGTTTGAATAGCTGGATCAGTTTTCC 

Sequence 1422 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATGAATGGTCAAACACCAGTCTAGAGCATCT 

TATTGTCMCAGCAAAATMTATTTTGCCCACCCTGTTTGTGACATTGAGTTGTGACTTC 

TATATTCMTAGATTTTTGTAAATGTTAAMCATCTATATTTAMTGTTAAAACACTAAA 

TATAGAGAGGGGCTTTATTTCAATCATAGAGCAACAACAAAAATAATGCTTATAGCTAAA 

CTGCCTGTTCTAGAAAGCATCTGCTTTTTCATGTTATTCCTAAATCCTCTTGTCATACTT 

TTGTCATTGAACAATGCTCTCCCTCTCGTCTTCCATCCTCATTCAGMTTTTTAGAAGAC 

?^ TCGTGGAGATACACTACCCAGT ATTGmGATACATTmAm^ 

AGTGCAGGAAACTGTGATTTGCTATATGTTTATGTATATAATCTTA 
Sequence 1423 

^^ C ^ G J^ TGATGGGATATCTGCAGAA ^ CGCCCmcGAG CG 

CACATACGCAmCTTCCTCCCTTCATGTAAGTCTCCCATATGAAGTTTGCCATAAAGTC 

CTGTAAGTTCACCCCAMGCTTCATTCMTTGTTTTTMTCCATTATCMGGGATGAAAA 

TGTCACCACTCTAATTCAGGCCCTTGTAATCTCTGTGATATACCATAGAAGATTCTTTCT 

ATTTrTCAGGGTTTTAATCFCAAGTTAACTTCCATAGTGTCACCGTATGTATCTTAGCA 

ATCAGAAGCGTCCTCATAAACGGGAGATGAAACGTCAGAGTCACATTAGGAATCCTGAGG 

CTGGAAACAGAGGTTCGAGTGATGGGAGGAAGCCACTAGGAGGCAGGGAATGCAGGCTCC 

CTAGAAGGAGAAAAGGCMGGAAGCAAATTTTACCATCAAGAGCCCCAGAAGAAACCAGC 

CCTGTCCACACCCATATCCCCAAGATTTAACTTTATTTCATAAGATCAGTTTTAGGTTAC 

TGACCCTCANAACTATAAAAAATTAAATT 

Sequence 1424 

CCCTTAGCGGCCGCCCGGGCAGGTACAAMGTTTGTTT7TGGGCAAAAATGGTGAAAGTA 

AGTCTAATACTTTATATACAAAACTGATTAAACTGAATCTTAAAGGTTCCTGATTTAAGT 

GATTTGACTAAGATCAACTCTATTTTTTTAGGCAGTATAGTCTATTAGAACTTAAATAAA 

TGGATATAATAGTATATTTCAAMGAAATATTTTTACTATGCTACATATAACTAACTCTT 

TTTAAAAAGTGGAAATGGTTTATTTCACATTTTGAAGAATATAGTCTAAGAGATTTTCAC 

ATATTGGTTGACAGCACAMCTTTGGAGTGAGATAGTCGTAGGTTCAAATTGNGGCTCTA 

CTGCAAGCTGCTACGTGACCCTAGGCAAATCAtTTAACTCTCTAGtTTTCAANTTCTGNA 

TGGTGTGAAACAGTGGGTTCACTGTTGGAAGGNGAAACTGGTGTANGGTGGTAGGCAAAA 

acaaaaacttnatanccactggtttattannannaattatggggaattacct 

Sequence 1425 

ggagtcgccccgcgtccgccccatccagaagctgcttcccttcagacctacctacggcat 
gacccctcaaagtcagatatgggacaagagcctccttgaacaaactctggtatccctgca 
gcaagcaaggatacattgcagaggtgcccggagtggagtcagatgggctagctcagccac 
ccctgcatctcccaaaccctgggagacatgtggcccatgggagtaaatccaggacattga 



ctcaactctcagaagtgttattcagtcaaggaggctctcccttcactgaaggaaggaaag 
tcagctctctcctgaaaggccagatcactggctgagtagatgagacaagggtatgaaagg 
cctttgccatcttctttgcccagtcctgaaagcactgacgtaagagaccagtcaagttct 
aatgtaaggtgtatattttaagtgtcagggtattgcaattgtcacccctgtggtcaatat 
cattaaacaggtatgagaaattcgcttggcatagacttnctggtctgcttaataagaatn 
caactaaggatgtcacatgacaagtttcccagaaaatgtgaacaagtgtccatcttgaca 

^ AC f G ^ AC , CAATGACAAACCAAAGA ^ G TTATTCTGGCTGAGTCTTMGTTGCTGAACTA 

ataaattanctgcggttctttgcaaaaaaaaaaaaaaaaaaaaaaa 
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Sequence 1426 

CGCTTAGCGTGGTCGCGGCCGAGGTACGCGGGTAATCAAAATATAAAGGCTGGAGAGAAA 

CATGAAAAAGAAAATGAAG AATTCACAGGGCAACTAAAAGTGGC TAAAGATG TTG AAAAA 

CTCATCGGACAAGTGGAAATCTGGGAGGCAGAAGCCAAATCTGTTTTGGATCAAGATGAT 

GTGGACACCTCAATGGAAGAATCTTTGAAGCATCTTATTGCCAAAGGCTCTATGTTTGAT 

GAGCTTATGGCAAGAAGTGAAGATATGTTACAAATGGATATACAAAATATTTCAAGCCAG 

GAGTCCTTTCAACATGTTCTCACAACTGGGCTTCAGGCAAAGATTCAAGAAGCTAAAGAG 

AMGTCCAGGTCMTGTGGTAAAACTCATTGCAGCGTTGAAGAACTTAACTGACCGTTTC 

ACCAGATTTGGACATCAGGCTGAAGATGGAAGAATCCCCAAAGGGACCTTGAATCATATA 

TGATGAGGGCTCAGCAGTTACTGGGGGCAAAGAGAGAGCCCCGGTGGACTCANTTTCAAA 

C ACCAGGGAAGCCCCTAATAATTTCTAATACAAAAGGC TGGCC NAGTATTTNAAAAGCTG 

TTGAAGMCTTAAAMTAATGGTACNTGAGGACATAAAAATNTCTTTAGAANAAAAGANG 

TTGAGATNTCTGTNCCAMTNGGGAGTNTTTTCATTATNGMCTGGCTTTATNTTTCCAN 

CACCTTAAAAATNGNTTTGGAAAMGGAAACCTTAGTGGCANATTn*AAAANTTGAAAGG 

CNAAAAAAAAAA 

Sequence 1427 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTAGCTTTTATTCATTTTATTATTCATCTTTA 

TGCT7TCATTGCACATGCTGTTMCTTCTTATGAACTCTTTGTTCTAATAAATAAGCATA 

TCCAGGTTTTTCATAAATAAATGAGGAGTCTATAMTTCTTACTAGTTTTAATAGAAACA 

TTGCCACTAAAATGGTGTATTCTAGCTGTGTGAGCTTrrGGAAAATGTTTTCATTTCCTT 

TATGTTTGMTGTTCCTTCCAGGTCTTTGAAAATATACATTCTGAGTTGAAAAGCTTTAT 

AATAGGTAAAACTCATTAACCCATATACTTTCCATGCTGGGCACAAGCANGTGTGGTAGG 

TAGAAAATATAAAGATGACTCCCCAAGATTTTTGCACCCTGGCATACACCATATGCTTTT 

CCAGTTTGTTGAMCACTMTTTTAANGGTGCTGCTTGTGAANGGGMTTTNTGTAGATG 

TGAATAAATNGTAN NAAN ATCAGTTTGGGCCTTAAGA 

Sequence 1428 

CCCTTAGCGTGGTCGCGGCCGAGGACAGTATATTATTTACACTGAAGGGGCTTGTGTGTG 

GACAAGAAAGCGCTGACAGCTCAAATGGATCCCATGGAACTGAGAAATGTCAACATCGAA 

CCAGATGATGAGAGCAGCAGTGGAGAAAGTGCTCCAGATAGCTACATCGGGATAGGAAAT 

TCAGAAAAGGCAGCAATGAGCAGTCAATTTGCTAATGAAGACACTGAAAGTCAGAAATTC 

CTGACAAATGGATTTTTGGGGAAAAAGAAGCTGGCAGATTATGCTGATGAACACCATCCC 

GGAACCACTTCCTTTGGAATGTCTTCATTTAACCTGAGTAATGCCATCATGGGCAGTGGG 

ATCCTGGGCTTGTCCTATGCCATGGCCAACACAGGGATCATAC I I I I I GTAATCATGCTG 

CTTGCTGTGGCAATATTATCACTGTATTCMGTTCACCTTTTATTAAAAACAGCCAAGGA 

AGGAGGGGCTTTTGATTTMTGAAAAATTAGGAAAAAAGGGCATTTTGGATGGGCCGGGA 

AAAAATTG 

Sequence 1429 

CCCTTAGCGTGGTCGCGGCCGAGGTACATATACTTAATATAAGTGGTGTATGCTCAAAAA 

ATTTTCACTGTTAGAGCACAGGATCATTACAGTTTAGTCAGCACTGTCCTAACGTAAACC 

TAAATAATACCATGCTCTGATGAGTGGTATTGTGGCAGGTCATGAGGTGTTGTGGGAACA 

CAAATCTGGCCAAGTTTGCCAGAAAATGAGAGGCATTTCATGAAAAGCAGATGATATAGT 

CAAGGTGATCTTTTTTATCATTGMCATTTGTCAGCTAATCATAAACTT^ 

GATTATTAAATAAGTTTGGTAATGATTATTGAGAGTAAAGTTCAATTTCTGGTGTGCANG 

ATTGCTTTCCCTGACATGAATACATTGTATCATTTGTTATGAATGTANCCATCCCGTTAG 

GGGGAGGATTTTATNACAGACTTTGACATAGACTTCCACTTAACATGATTACTCAGTAGC 

CTCCTTAAACATCANGACTAAAAGGAATAACCTTACAATCTTAGANTACCTTTTTGAAAT 

CCAATAACCCATTTGGTTGGTTNAANAACCACCCTTTAA 

Sequence 1430 

CCCTTAGCGTGGTCGCGGCCCGAGGTACTTGTTACTCTTACAGATGTCTCAATTMTTTT 
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AACCTCTATCCCTAATGGTGTTTAAATTTTGGTAMGAAAAATTCATTTTTTC 

TCCAAGAAAT GGTAA GCTATCTTTGTCCCTATGGGAGATGTGGACAGAGAAAACTTTCTG 

TCTCTGATGTCTTTTGTGTGAGACAAGATTTTAGGAAATAAATAAGTGGTGCAGGGAATT 

GTGAGATTACATTTCACCTCCTTTTCTGTCTCCTCTCTCTTTTATATAATCTTTATCTCT 

GCTTCTAACAATCTCCTTATCAGAANTTTCTAACATATGTAATGGATGAGAGAGAGAAAA 

ACT GCCA CCACAGGATCAAATTTCATTTCTAATCTGGAGAAACAGGAGTCTAGTTTTCAA 

ACATTTTATTCTAAAAAAAGAAAAGGTTAAGCTA 

Sequence 1431 

GATATCTGCAGAATTCGCCCTTTCGAGCGGCCCGCCCGGGCAGGTACGCATAGCTATGCT 

ATGTGACCTAAATGCCMGTTTTCATTTTGTTTTATATTAATATTAGGGCT^ 

TCATCTCTGTTTATGTTTACATAAATTACATTATATCAAATCCAGATTTGAAGCATCTCG 

GAATACTTGAGTAGTTCTTGCTAATATTTTCATAGTTCAACTT7TATTTTT 

CT GGGAA ATMTGTTTGTGTGTATAAA1TTMTTCAATCACATTACTATGCTCCTACATT 

ATGTTTTCATCATTTGCACMCAAAGTTTGCTGTTTATCTAATATATATACATATGTT^ 

ATGTCCATCTAGAAATATGTATTTTCCTGTTTTATAACAGCCAAATAATGCTAGGTATAT 

ATATTAAACACAAAAGCCTGCAAGAAAACTTTTTCCTTTATAATATTCATATTTACGTC 

TAAGGGGGCATAAATTATTTTTAACTTATATGGTTAAATGAT 

Sequence 1432 

CCCTTAGCGTGGTCGCGGCCGAGGTACCGATACTCAAGACACCTTTTTTAGTTTTGATTT 

AGAAGGACTTTAGGAATTAGTTCATTCCCTTCATTCACTAAGCATGAATACCTACCATTT 

TCAGCCCTGTATGCATGAGCAGTGCAAAATAAATATATTTCCTGCCTTCAAATCACTCAC 

TGTTTGGAGATATTAGAATCTCCTAGCTTAATCTGTTGGATTTTTCATAGTTTTATGATT 

TTTAG ATTATTTT CCTGCTTGTTTCTTGATCTGTATTGACCCCTTAAAAGATCACCATAC 

TGTTGTTTTTTTGAGTTTTAAGTAGAACATTTCTTTCATAGTGGGAA^ 

CAAATATAAATCTTTTGTGGGTTAAATATTACTTAGTGGGAACCCTTAAGCAAAGGGTCA 

AAAAGTCACTATATTGGTTGCCTGAGGGGGCCTGTTCCCAAGAGACAGGAATACCCATAT 

TGAGGCCCTACCGTGTCAAAGTTTACAGCTCTTGATTCANTCCTTCTTACAAGCATGGAA 

ATAAATGGCTAAGAAGGTGAATATGCCCTAAAAAAATTGGGATTTTAGAGGTNNGGAATG 

GCCAATNGAAGGTGGCTAAAAAAACCCATTTTTCAAAAGGAATGGNCCTCAATANTTAAG 

AAATTTTTTC ANTTTTTG ACC NC AG AG NTTN N N TTTTAATCCTTTTTT 

Sequence 1433 

CCCTTAGCGTGGTCGCGGCCGAGGTACACAATATTCCAACACCATTGTCAATACTTAGCG 

TTATAAAAAGGTTTCAGTTACTAACATGAACTGAGAGATCAGCAAATATTAAAGATGATT 

GAACATATGAAACTGAAAAACTATAAAAAGGTAGAAGCTATTGTAAAAATGAAGAAAACT 

GAAAAACTGAAAAGATAAGAAGTTACTGTAAAAAGAAAACGGTTATACCATTTGAAGCAG 

AACAAAATGTTCCAATAAAC I mil I ACTTAATAATAGAAATATTGAAAATGCAAAACT 

CTGAAGGGTATTACAAGGGGCTAGACAAATTTTGAATTGTTTTGGCATAAGCAGGGAGAG 

GATTTATTTGAATATTATTAATATGAAGACAAAATTCTGCTTATTGAGATATCTCACCTA 

TTATCATTTTAATAGAAGAAGTCTTGGAAAAGTTTGGTTGGTAATTCTCTTTT 

ATTGGCTCAAAC 

Sequence 1434 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACGCGGGTATCTTAAACAAATTGATTGATAAA 
GATAGTTGTTCATCTTATTATCATAATAGATAAMTATTTCTATTGATATGTATATAAAT 
GTATGCATGACMTTAAATAGATMTAGGGCAGCTTGTTTCATTATAAAGGTAAAGACAT 
TTTGGGAAGGAAAGCCTTTGTTATTCAATTTTTTTGTGA 

T AAAA C TAC ATTGTG ATAATTTTAAACATAGAAAG CATAGTTATAGTTGACTTAAACCC G 
ATTTTCCATGGCTCCCTCTTTCCTTTCAGTAGTAGCAGAAAGATCTGAATGGCAAGGGAT 
GGAATGACTCCTTGAAGTAGTAGCCAGCTGGGGGCTGTGTTTGAGTGCCATANGGCTGNC 
TTGTTGGGTAGGTTCCAAATTACTTTCTAGGTCTGGCTCTTGCTGGTAGAATACCTGAAG 
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CATTGCTCTTCTTTAGAAAAACCTCTGCTCAANAATGTCTNCTTGGGATTTTTN 
Sequence 1435 

GTCGCCCCGCGTCCGCGGACGCGTGGGAAGCAATGCTTGAACTCTGTAACACACAGCAGA 

ACTTGATTTGGAGCCATGCTCATTCTTGAATCAGAAATGTGTGAGAACTCTTCTGACCAC 

CTTAATTGGCCATGTCAGGTGCTTTGGGGATGAGGTGTGAAATCAGAAGGAACAGCCAAA 

GTAGGAGAGTGAGGGCAGGGATACGTGGGAGAGGAGCTGAGTGAGAAGAAGCCAGAATGC 

CATTTGCACATGAAGATGGACAGTCCAAGACAATGAGCCTAAGAAGAAAACAACTACCAA 

XAAAAGAAATATTCTCAAAAGTGTCATTAAGAGGAGAAATMGTTTMTTATTAATTTAT 

TTGCATCTTAAAMTCAAMCAGTCTCTTTGGTTATTGNAGTATGTATAAGATGGATCGT 

TCTCCCTTTACCTGTTGTGCGCTCTGTTAACCATTTAGCACCTCTCCAGGAGTTTGGACA 

GAGGAAAAATGCATMCATTMCAAATTTGCTGCCATTCAATATTAGATCATATATGTTA 

GCATACATCTACCATAGCCCCTGACCACCTTCATCCCAAGGAAAATGAAGATCAAGTCTC 

AATTATAMTTCTGTGGAAATTTCAACCCAAAATGACTGGAGAAGTCATAGCCTGTGGNC 

ATAATGCTTTCCCTTGGGGAA 
Sequence 1436 

CCCGCGTCCGGGTGAAGAACACCTGTGTCTTGTAGAGGCCCCAGTGCTCAGAACCGGGAG 

TGACAACTTGTGCTGTTTAAAAAGAGAGCAACTGCTTGCGTCTGAGATCATTAGGGGTTT 

GGTCTTTGTTTCCCAGTGCTGTTGGGCTGCTTAGGGCCTCCTCTTGACCCTGTCCTCCCA 

CAGCTCCCCTGCCCTTCCCCATCGCCTGTCTCCTCT AACATCACTAGG AAGGAGCCCCAT 

TCTTGTAGCGGCACTGTTTGTTTGATTTATGTGGG I I I I I l I II I ill I I AAGTAN AT AA 

AGGTTTTTTAAAAAATCANGATTCTC 

TNCCNGANTNNAACANTNNNNTGNNANCGGGCGGCCCGNTA 
Sequence 1437 

ACGCGTCCGGTAAGTGGGGCAGGCCTTTATTGGGCCCATCATTAGTAAGAGGACACAGCC 

TTCCAAGAGGAGGTTCTTGTGAGCCGGGCAGGACAAGGCGGGAGGGCAGGGGGAGCGGGG 

GTTGTGGGGAACGGGTCCCAGATCCCTTTGGGAGAAGTTGACCCGAGGTTTTCCTCTGCT 

CCAGCCCTCCCCCCTCCTGCACAGAGCCCTGCCTGCCCGGCCTGTGGGTTAACCCCAGGG 

CTAATCCAAC AG AGGTGGGTCTC AAATCACTTGTG AG ACTGGGCACCTGG GTGGGG GTGG 

GGGAAAGGTGGGCAAAAGAAGGAGTGAGTGTTGGAGGG 

Sequence 1438 

TGCCCACTTAAATAAGCTCTGTATGACATGAAATTCTGTTNAAACATTGTAATTCATGGN 

GACTTNTAACTNATAAAAATACTACTTGCATGGGTTACTTGATTTATGGATATATGAAAA 

CTTNCANGACGANTGATCTTCTTTCTCTAGAACTATTCTTCTGTCGGTCATGCAGAATGC 

TGTTATTCTGAAAAGTGTCCCTGTTGCATATGATGGTCACTTTATTTGGGGGGATTCTTC 

ATAAGATGTGAGATGTTGATGCCAGTCTTTCCCAAGTAAGTGCTCGTAAAAAAGGACTAC 

TAACTAGCCTGCATCTGTCTCTAACTGGGACCAANGGGTCTGCTGAANGAAACTGAAGAG 

CTCTAACATTTTCACAGCTTGGAGAAGATAGAATCTTTAAAAGTNCAACTGAAGCTTGAT 

CTATTTTACA 
Sequence 1439 

GCAAAGTGGAAGGMGMGCCCATTTTAATGTTGGGTGGGAAAAATAGGACTTAATAGAA 

ATGTTTAAAACAGGTTGTATGGATAGATCTGGAAACAGCCCTAATCTTAATTTGGGCAAA 

GCATGAACCAAGGGTCTGCAAGGGCTGGAGGTTGGTTCAAGTCTCATCGTGGGGCTGAAT 

GTTTTGGAGAATTTTTTAGCTAAGATTTGCANGGGMCAGAAGCAGTGTGGCATMTTAG 

ACAAATAGAAAGGTCTATGAGTAGACTCTGATTACAGAGACCAAGCNTTCCTCACCATTT 

GCCTTTGGCACACTGGAAACCCAANAAAGCCTGTGGTATANTACATACCCCGAGAACTGG 

AGAATATTGACATTTCCACCTTGCCTCTGTCTTANATTTCTATTTCTGTGATGAAACACC 

ATGACCCAAAAGCAAGTTTGGGAAAGGAATGATTTTATTTGCTTGTGTAGAAACTCAAAC . 

TGAACAGGAATCTAGAGTCAGGAGCTGATGAAGANGCCATGGAGGGGGTGCTGGTTGCTG 

GCTTGCTCTNCATGGCGTATTTAATCTGCTTTCTTATAGAATNCAGGATCCCCACCCAGT 
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GGTGGCCCATCACAATGGACTGGCCCCCCCATCAATCAGTAAGACAATCTGCTGGAGTGA 

TGATGGGTCANGCTCACANGGNCATTGNGGCATCTTATTTACTGNCCATTGGGGTGCTGT 
GTGGAAGGGA ■ 

Sequence 1440 

CGCCCACGCGTCCGATCCCAACAGTTCCCTCGCCTCTGTTACCCACAAATCACCAACACT 

TCCCTCACCTCTGTTACCCGACCAGGCATCAAAACTTCCCCCGCCTCTGTTACCCCACAG 

GCTAGCATGGGCACCAAAGCTTCCTTCACCTCCTCTATCTTCGGCAGCCTTCTTTTCTGA 

AGCTGCTGTTCTGAGGTTCCTGGGCTCTGCTCCATCCAGTATCCCATGGGTGTCAACACC 

TTTCTCTTCATATACCTTCCAGTTAGACTGTCCAGTGGGTTGGAAGTCACAGGACTCCAG 

GCAATGCTAGCAGCCACAGTATCTCCTTTATTAAAGTGTT 

Sequence 1441 

TTAGGGAGTCGCCCACGCGTCCGGTTGCAGGAGCCCCGTGGTATGTGTACCCAGTGTGAG 

GATAAGCCGAGGTTAACACAAGCAGTGGGAATTCCCCAGGCCAGTGTAAGCTGCTTATGC 

TGTTATC CTGG GCAGCCTTGAGGCAGGAGAGTMCGTACAMGGATTATTTATGAATTTG 

CCAGTTATTTTGAAAGAATTAGCATCACAGTGAAAGTGACATAAAATAATTTTAGTAGAA 

ACAGTTTGGTGAAATAMGGCCATTTGGGAGTAlnTGAACAGCACCATAGTGCATTCTGG 

GTrCTGGCATGTTCACAGCAGGAGGAMGGCATTTTTTAGCATCTGGTTm 

GACTGGGGCTCATATATCCCAGACCAGCTTCAAACATGTAACTGGGGACCTTGATCAGAT 
CTTGATCCTCCTGTC TCTACCTCCTAACTA 
Sequence 1442 

TCGACCACGCGTCCGTCCAGGTCCTGCGCTGTGCAGCCTGGTGGGAGCTGGCCACGTGTG 

GATGGGTCCATGTCCCCAGCAAGTCCATCAGCTCAGACATGCAAGTCTTCCAGGACACTG 

CCCTCTGCCCCAGCAGTGGCTTCTGGACTGGCATCTGTCCTCTACCTCACCAACCAGGCT 

CTGGACTCCATGACAGATCTGCCTGCAGGGAGCCCAGCACACATGCTGCATGTCACCGAG 

GCAGAGGCCACAGTCCCCACACTCTGGGATAGAACTCGGGAGCCACGGCCACCCACACTT 

GCATGGGAAAGGGTGTCTGCTGCAGGATGACAAGCAGGGACAGCTGGGTGGTCCTCCTAG 

AGCTCTGGGGATGGGGGCAGGGGACATCCCCTAGGCACCCTGTCTAACCAGAATGGATAG 

TTAGAGGAGCGCTGGGGCCTANCCACCTGCTTCTGCCATTGGATTTGACAAGCCATTTTA 

TATTGGGTGTCCAAGCAGTCCTTTCACCTCTTGGGTAGTCGGAGGTGCTCCGCGAGGTCC 

CGCTCGAGGGCCCTGGCTTTANGTTGTCAGGGGATTAAA 

Sequence 1443 

TTGGCGGAACGCTGGGAAGCACACATCTATGCCAGCGAGGATGAAAACTGAGACTGGAAG 

CAGGGGGCCATTCTCCTGCCTCACTGCAGTCCTGTGGAGGCCGCTAGACCTGCCCCCAGT 

CTCTGCTGTCCCTATTTAAACATACAGCCCAGGCTGATGTTTGTGGCCTGCAAGTTGATT 

TGGTTTTCCCTGGGGTGGGCGAGTGATTCGTGCTTAGGGGTGTTCTCAGCTAAGGCTGAA 

ATGATAATCCACCGATCCCCCTCACATCCCTCAGCCCTCCGCACAGCCCAGAGCTTGGAT 

CTGTCACTGCAGAAAGAAGGCTCTGTTTGGGGACCTGCATTCCACTCTTGCCAACCCCAA 

ACGTATGCCTGTTGCTATGAAATAAACCTGTCCTGAACTGCCAAAAAAAAAAAAAAAGG 
Sequence 1444 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTTTC U I I I t I M I TTTTTG AAATGC AG 

TCCAGCCTGGGCGACAGAGTGAGACTCTGTCTCAAAATTTAAAAAAAAAAAAAAAAAAAA 

AATTTNTATGGGACTGGAAGAGGAGGTCCAGTCCATTGTAACCTGTGGATAATACCAGCT 

CCATGTCAGGGTGGTTCTTTCAATTCATTCACCAAGCAAGCATGCTTTTGAGGCACGCTC 

TGTCCTACACATAAGGAAATCAGACATAGGGGGTCCCACTCCCCTGAACTCATAATCCAA 

GAGGTGGGGAGACAGACAGAAAATGAGTAGGTGATAAGGATGGGAGGGAAATGAAGGGAG 
GATGATGGGAT 

Sequence 1445 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCAAGCATCATGTGAACACTGTGCAAATGCTAT 
ACAGAGAGTCACGGCAGCTTAACCAGATGGAGAACAAAAACACAGCAGCGCACAACGAAT 
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GAGTAAGTAAAATCCCTATGAATCAATTCACTCAGGCATAATACTCTAAATCCTTAAAAA 

GGGGGGAAAAAAAGAATGTAACAGAAATCTATTTCCACAATACACCCAAGGGTAACTAAT 

GTAACTAACTGACTAATGACAATACAAGATTAAAATAAACACCAAAACCCACTATACCTG 

GATTCATATTACTTAGATGAGAAGCTMGTTAAGATATTGTAGGGAAGCTCACTTAATTA 

CACTGGTGATGAAATTGNCTTTTCTCTTTTCTTGAGTTTAAGCAGTTATGCCTTGGGCCC 

AACTTTCCTGACCTTGTGAGTAAAGTTGAGCCACATGTTCAGCAGAAGATATCACAGTAG 

CTGTATTTTTATAAAGAAAATTCCTTTTTAATTATAAAACTAAAGCCCGAGGAAAACC 

N N AAAAAAN N N AAAAANGGG AAG AAA 

Sequence 1446 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAGGCTATGTGCTAAGTATGTTATTCCACTTAA 

CCAGCCCTCAGTCTMCAGAGCAGATGCTATTATTATCCCCTTTAAACAGTGAAGAAGCG 

AGGCTTAGGGGGAACTGTGTCATGCCCCAGGGCCACATGGCCGGGTAAGGGGTCCAGCAG 

CTTTAAGCCAGGGAGGTGGGCCCAACGTCCATACCCTCAGCCATCTTCATCCTACTTCAT 

CACAGGGTCGGGGAAGATACCAAGTGATGAAGGCATTGTGGTACCAGACTNTTGTGATTG 

GCCGAGGGATGAGTGAGGCCTTTG 

Sequence 1447 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGTCTTCTCTCCTCC7TATGCCTTTTCTTC 

TTCCTCCTCACCCTCATGGCTCCAGGTCCATGCCCAGGGAGCATGTTAGCATGTTGTCAG 

GTCTCAAAGTATCTGAAAAGATTGTCTTCTCTGTGGCCAGGCTGCTTAGAGGCAGCCTGA 

TATAAACTGTAAAAAGGGGGAGAGTGTTTCTCTGTGTCCTCTGCATCCACTCTTCATGCA 

TTTGCTCCAAACCAAATCTGCTCTTAGGAAGGGATCAGACGAACCTGTTTAGAGTGAGGT 

AGCMTGATAGGTTAGCAGTGGGGTAAACCACATAAATGAAACTTTAAAATGAGGAATTC 

CACCTTGTTAAAGAAGTAAGGTGGGCCAAGGCACAGTGGCTCACGCCTGTAATTCCAGCA 

CTTTGGGGGGCCA 

Sequence 1448 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAGATCCAGGATCCAGGGAATCCCTGTCTTTTA 
AAAAGCCAAAAGAGGATGGGGAAGACAATGGGAACCATGGGTGAMTTGCATTTAAGGAA 
AGAGCAAAGTAAATTATGGGACTTAGATATTCACAAGTAAGTCTGTTGAACATTCTGTCT 
TATGTAAGGTATAGCCTTAGCAATTAGAGTTTTTGTTAAGTGTGTTACTACTATT AATAA . 
AAATTGCATGGAGATGGGAAAACTATTTTTTACTTG 

TCTAATTATCAACATTTCTTCTGCCTTCTAAGACAAAGTAAAATATAGTAATTGGTAGAG 

TAAGGGGCTTAATTCAATTTAGTTTTATATGTTAATGGCCAGAAACTAGGTGCTTGCTTC 

CAGTCAGCTTTATTTCGTTCCTACTATCTCTCTAAATAATTTAGGATATAAAATTGACTG 

AATAAAAACCAGTMGCCMGGTTTTCACCCTTCGATTTTTATACCANGGCAAGATGGGA 

GAAATATCTGGAAAGGGACT 

Sequence 1449 

CCCTTAGCGTGGTCGCGGCCGAGGTACCGGCGCGCGCCACCACACTCGGCTAATTTTTGT 

ATTTGTAGTAAAGACGGGGTTTCATCATGTTGGCCAGGCTGGTCTTGAACTCCTGGCCTC 

AAGTGATTCGTCCACGTCGGCCTCTCAAAGTGCTGGAATTACAGGCATCCACCGCGCCCG 

CCCAGTTTTGTTGTATTTGATTAGTAGTTTGGTCACTGCACATACAAAAGCTAAAAAAGG 

CTGTGTGTAGAAATCATTCAAATATTGAGATATTTAMTGAAAGGTTAACGCTTTGGGAA 

CTCCTTTGTGGTTCCCTTTTCCTTCCCAGCTTTGGAAAGTCTGGGGACTAAGTCTGTTAT 

TTTCATCGGATTCAGTACCATGATCCTAATTTGTGCCACTTCATTC 

Sequence 1450 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTAGCAAATAAAACATAAAGAACAAGAAAAAA 

CAGAAAGTGAATATCAAATCTTCATACATTTAGTATTATGACTGAAACTCAAATGATGGG 

GAAAATTAAAATCTCTGGCAAAAAAATATGGGTTTCTTCTGCAAATTTCATGCTGTCATG 

AGGAACCGATGGACTTGAAAGATCAGGGTGCAGACAAGAGAGTAAGGAGAAGTCATGTCG 

CCAGCCAAGAAAATCATGTCTTCTTTGTTCTCTGATCTGGCAACAAAGAATTGAAGGTCT 
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CATGTTGCCCTGGTAACACCCGGGAGGACACTCCCAGACTGCTGAAAGGGCTCTACCCAG 
CCCCACACAAAGTGTCACAGGTCCAAACTCTGGCCAAGGACTCTTCTGGAGCCAGAAACA 
TAACTGGGTTTGGACACTCCTTTAGCTTNTTTAGGGCG 
Sequence 1451 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACAGACAATGTGAAGTTAGGACTTAAGTAAAG 
G TAGTATTTATTTACTAAG AATAAACTTTC TTTTAAGAAG TATATAG AAAG TAAAATAAA 
TATTTGTTGCTTTATATTCAGCCA™^ 

CTATTTTATTATTATTATTATTACTTTATAGAGACGGGGTCTCCTTATGTTGTCCAGGCT 

GTTCTCGAACTCCTGGGCTCAG 

Sequence 1452 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTCACGGTGTTGTGCAGCTATCACTGCTAA 

CTAGTTCCAAGATATTCCATCACCCCGGAAGGAAACCCTGTACCCAGATAAGTAGTTATT 

TTGATCCTTATTTACA1TTACTCTCTAATTTCTCTTTGAAAAAGAATAAAAGM 

TTTAAATTACAGCATAAGAACTTGATGTGTTTTCATTTCTTCTGCAACTTGTT 

AAAAATTGGTAAAGATTTGAAGCTGAAATGAGGCGAGTCTTAGCAAAATGAAACCAGGAT 

TGAAAAGTCCTGGAAGGAAGAMCCCAAAAAGAGATTTGAGGAAAAMTCAAAATMTTC 

ACAATTCTGTCCAGGCCCCATCCTTGTCAAGAAAGCCAGATTCTACTTCAATGACTGATA 

GTGCTGAATC I" I I I I I GGAAAGAAAAAAAAAAANGAATTGGAAATTGAATGCTTTGGATC 

TCCAAGTANGAAATTTTGGA 

Sequence 1453 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGTCACCTGCTGTGCTCTTGCTTGCAC 

AGTGTCCTGGAGCTGGACCTGGCTCTGGGTTTCCAGGAAGCAGTTTGCTAAAGGCAGCAA 

GCTGCTTCCTCTGCTGCCTGAGATACCAGATTCCCAATGGCGAAGATTGAGAAAAACGCT 

CCCACGATGGAAAAAAAGCCAGAACTGTTTAACATCATGGAAGTAGATGGAGTCCCTACG 

TTGATATTATCAAAAGAATGGTGGGAAAAAGTATGTAATTTCAAGCCAAGCCTGATGATC 

TTA7TCTG 

Sequence 1454 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACACCCAGATTATCATGGCATATTATTAATTC 

TTTCCATGCACAAAGGTCTTGCAGAAGGACCTCAATGGTTTATTACATCAGAGCACGCAT 

AGTAGTGCAGATTCATATGTTCTGTCTCCAAATAGTAGATTACCATTGCCTTCCAGTGAT 

GGCACTGTGTCCTTGAGTATTGGAAGAAAAAGGGAAGTAGCCCAAGTTCCTCCTTTGTGG 

ATGACCTGGAGATCTCTCCTGAGTTTGACTGTAAGGTGAATCAAGAATATAGACAGGCAT 

CCTTCATTGTTGGTCTTAATGCATGAGATGGATGACTTTATAACATGTAATGATGACCTA 

NGGTGAAGCTCANGTAGATTCAGAGGAAGACACAGACCACAGAAGTTTGCAAACTGGTGG 

TCCACAMCATGCTTGGGTTGGCTTGNATAGTGTTTGAGGCAACCAGTTAAAACTGA 

Sequence 1455 

CCCTTAGCGTGGTCGCGGCCCGAGGTACGCGGGGACCTGCTGTGCTCTTGCTTGCACAGT 

GTCCTGGAGCTGGACCCGGCTCTGGGTTTCAGGAAGCAGTTTGACTAAAGGCAGCAAGCT 

GCTTCCTCTGCTGCCTGAGATACCAGATTCCCAATGGCGAAGATTGAGAAAAACGCTCCA 

CGATGGAAAAAAAGCCAGAACTGTTTAACATCATGGAAGTAGATGGAGTCCCTACGTTGA 

TATTATCAAAAGAATGGTGGGAAAAAGTATGTAATTTCCAAGCCAAGCCTGATGATCTTA 

TTCTGGCAACTTA 

Sequence 1456 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAATACAAAATTTGGTTTAGCTACCACCCTAT 

TTAGTGCCAACTCCTTATTTTAACCTMTTAGCATCATTTTTCCCTCTGACAAAAACAA^ 

TTACACACACACACACACAGGAGTCATTTTCTCTCTTTTGTTAAGTATGGAGAAATGTTC 

CAAAGAAAACTGATCTTAAAATATATATATATACTTGAAAGCAACTTCTCATAATAAAAT 

TAAATAACl \ I I I CCTGACTTCTAGGGATATGAAAAGCTTTATATATGGGCAAAAAAGTA 

AACATATAATCCGTTCATAATTTGGAGTTGGATAAGTAACAGGCTTAGAAAGGCTTCAAC 
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TA 

Sequence 1457 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTAGCACCAAAACGTGTTTTGCCACATTATG 

AGGAGAAATTAAGACACATGCAAATGGCCCAGAGTTTGGTGGTTTGCAGTCACAAGGAGG 

AAAGAAATGTGTGACTCCAATAAGAGTCCCTCTATACTCATTTGGTTCCCTCGGCCCAGA 

AAGACAAAGGGGAGTTTATTCAGAGTGACATAAAAGAAGGGTCGCACAAGCAAATCAGTG 

ACGGGGCAAAGCATTGATGGTTTGMGGAAMCATGCA7TATCTTTTCTTAACGGTGTTC 

AATGAATTTCCTTTATTATTCATTACATTGGCCAACCAGCCTCTAAGGACTTTGGGCAGC 

CCAATGGAAGGCTCTCAAAGGCCTTTGAGTTTAGCTTTTGGGCTTACGATGGTTTTMTT 

AGCCACAGGGAGAAAAAAAGCCATCC 

Sequence 1458 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCTTGCCNGTGGACTGAGATGATCCTTCCAA 

CTAAANANAAAAANGGTNTNNAATCCTTTCCTTGGCTTCCCTTCAGACTGCCTCTGTGCT 

GTCATGGTCTTTCTTGGGAAGACCTTCTTCTGCTTTCTCTCTAGGAACAACTCGGCCAGG 

TATAGTTGCCTTTCCCAAGATTACAATTTAGCCCAGGATTCAGTCCCATGGTATCTCCGC 

CAGGATAGAACCCACGGGCAAGATTAAATTATCT 

Sequence 1459 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGGTGTGAGAAAAAGTGGGCCTGCNATGGNAG 

ACTGCACCTNAATGGGCAAAATATGCNNTCCCCTTTCTGTTCCTTCTTCCTGATGGCTGG 

AAGGGAATATGGACCATCTTGGATCATGTGGATAAGACTAACACTCCTATAGGTGGCAGA 

GCAACAGAATAAAGGGAGTCCAATGTCACAGAGCAGAACTAGCACACTGCCTCAGATTGA 

CAATGATAGAAAAATAAGCTCTTGTCTTGTATTTMGCCACTGTTATTTGGAATCTATCC 

CATGAAACAGCCCCATGCTCTAATTAATATGTMCGGAGATGCTTCTTTTTCTAAGGATT 

TTTGGTTCTGAAAAATGCCCGCTTAAAAGAAAAGAACCAAAATTGAATAGTNTAAGCAAA 

GAAAAGGTTTAAT77TCTAACTTTTTCATGGAAAM 

ATCTTATAAATAACCNAATAGAGATTTTMTTAAAMTCTTGTAAATTCTCAGAAAAGGA 

ATAATTTTAAGGTAATNCATTNGGGGGGGCCTTAAGTTTAAA 

Sequence 1460 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCAAGCATCATGTGAACACTGTGCAAATGCTAT 

ACAGAGAGTCACGGCNNNTTAACCANATGGAGAACAAAAACACAGCAGCGCACAACGAAT 

GAGTAAGTAAAATCCCTATGAATCAATTCACTCAGGCATAATACTCTAAATCCTTAAAAA 

GGGGGGAAAAAAAGAATGTAACAGAAATCTATTTCCACAATACACCCAAGGGTAACTAAT 

GTAACTAACTGGCTAATGACAATACAAGATTAAAATAAACACCAAAACCCACTATCCTGG 

ATTCATATTACTTAAATGAGAAGCTAAGTTAAAGATATTGTAGGGAAAGCTCACTTAATT 

ACACTGGTGATGAMTTGGCTTTTCTCTTTTCTTGGAGTTTAAGCAGTTATGCCTTGGGC 

CCAACTTTTCTGACCTTGTGAGTMGTTGAGCCCCATGTTCAGCAGAAGATTTACAGTAG 

CTGTTTTTTATAAGAAAAATTCCNTTT^ 

AAAAAAAAAAAGGGAGAAAAAT 

Sequence 1461 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTCCAACCCCATGTCATCTGGAAGAGGACTC 

AGCTGGTTGATCGTCTATCACAGTTTGTTGGAGAGACCTAI II ! 1 1 ! 1 I I ACATTCAGAT 

GCTTCTCAGAGTTCTGTTTATAAACATTGTCAGGAGCAGAGAGTGTTAGGGATCTTCCAC 

TTGAAAGAAAAATGACAAAAAGMTAAAATCACTACTTGACTTGCCTGCGGTTTTGTGTT 

CAGCCTTTCGTCTTAATGCTATTGAGGATTACTCTTGCTCACTAGAGAGCACAGAATGAT 

GTTGTGTCAAGCAAATCGAAAACAGTATAAAATGCTATGTGCGCCTCTCATTTC 

Sequence 1462 

GATATCTGCAGAATTCGCCCTTTCGAGCGGCCGCCCGGGCAGGTACGGTGGCGCAATCTT 
GGCTCACTACAACCTCCACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCTGGCTGAGT 
AGCTGAGATTACAGGCGCCCGCCACTACGCCCAGCAAA1 MM I GTATTTTTAGTAGAAA 



Page 251 of 869 



WO 01/51628 



PCT/US01/00798 



Table 2 



CGGGGTTTCACCATGTTGGCTAGGCTGGTCTTGAACCCCTGAACTCGTGGTTTGCCCACA 
TTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGACATTGCACCTGGCCGGTATTTCTG 
CTTTTTAAAATTATTTATGAATTTGGTTGTCTCATTCCCATGCAGGCm 

ACTTGGGAAAATTATTTAACTTCTGTGTAACTGAACTTATTCATATATAAAATGGGGCAC 

TAATAATACCCACTTCAGAAGTGTTGAGTATTATATGAGGTAATAACATGTAACGGACTT 

AGTTAATGG TAAGCGCTATATATGTGGTAGCTATGGTGATTATGATTATC TACAAATAAT 

ATTGNTCTATAAGTGTTTCTTANrrCTTTTGAMGMGGCTTTTCCTATTATTGGMTTA 
AAGT 

.Sequence 1463 

CCCTTrCGAGCGGCCCGCCCGGGCAGGTACGGTGGCGCAATCTTGGCTCACTACAACCTC 

CACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCTGCCTGAGTAGCTGAGATTACAGGC 

GCCCGCCACTACGCCCAGCAMTTTTTTGTATTTTTAGTAGAAACGGGGTTTCACCATGT 

TGGCTAGGCTGGTCTTGAACCCCTGAACTCGTGGTTTGCCCACATTGGCCTCCCAAAGTG 

CTGGGATTACAGGCGTGAGACATTGCACCTGGCCGGTATTTCTGCT7TTTAAAATTATTT 

ATGAATTTGGTTGTCTCA1TCCCATGCAGGCTTTO 

TAACTTCTGTGTAACTGAACTTATTCATATATAAAATGGGGCACTAATAATACCCACTTC 

AAAAGTGTTGAGTATTATATGAGGTAATAACATGTAACGGACTTAAGTTAATGGTAAGCG 

CTATATATGTGGTAGCTATGGTGATTATGATTATCTACAAATAATATTGGTCTATAAGTG 

GTTCTTAAGTTCTTTT 

Sequence 1464 

CCCTTTCTTAGCGGCCGCCCGGGCAGGTACTGCCCTAGTCCACAGCTGAGTGGGGCTGGA 

ACTAAGCTACAGGGTTGCTTCAGGGCCTACAGTCAAGACTTAGGTCTTAGGCCTAAGGTG 

TCACAAACCTGCATGTCTCCCTTCAGGTCCTTAAGTTAGCGGGCCTGCTGTATTGAGAGG 

GGCTGGAGCCACTTTTCAGGGGTGTTTCAGGGACTGATGAGGGACCTAGGTCAGGTGGCC 

TGTTTGCAGGGCTATGGACAGCCCCATCTCCCTCCAGGATGCTGGGTAGGCAGAACTGCT 

CTCTGACCACAGCTTGAGAGGGGCTTATGCCAAGATTCATGGTTGTTTCAGGACCTGCTG 

TGGGACTGAAATNTGCCTGCCTGTNCTGAGGACACANAAGGATGTATCTTNCTGACTACA 
GGGAGG ATTTC TG 

Sequence 1465 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTAGCACCAAAACGTGTTTTGCCACATTATG 

AGGAGAAATTAAGACACATGCAAATGGCCCAGAGTTTGGTGGTTGGCAGTCACAAGGAGG 

AAAGAAATGTGTGACTCCAATAAGAGTCCCTCTATAGTCATTTGGTTCCCTCGGCCCAGA 

AAGACAAAGGGGAGTTTATTCAGAGTGACATAAAAGAAGGGTCGCACAAGCAAATCAGTG 

ACGGGGCAAAGCATTGATGGTTTGAAGGAAAACATGCATTATCTTTTCTTMCGGTGTTC 

AATGAATTTCCTTTATTATTCATTACATTGGCCAACCAGCCTCTAAGGACTTTGGGCAGC 

CCAATGGAAGGCTCTCAAAGGCCTTTGAGTTTAGCTTTTGGGCTTACGATGGTTTTAATT 

AGCCACAGGGAGAAAAAAAGCCATCCTGATGTGGGACTCCTGAAGTGTGTCCTACTGAAA 

Sequence 1466 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGCCCTAGTCCACAGCTGAGTGGGGCTGGAA 

CTAAGCTACAGGGTTGCTTCAGGGCCTACAGTCAAGACTTAGGTCTTAGGCCTAAGGTGT 

CACAAACCTGC ATGTC TCCCTTCAGGTCCTTAAGTTAGCGGGCCTGCTGTATTGAGAGGG 

GCTGGAGCCACTTTTCAGGGGTGTTTCAGGGACTGATGAGGGACCTAGGTCAGGTGGCCT 

GTTTGCAGGGCTATGGACAGCCCCATCTCCCTCCAGGATGCTGGGTAGGCAGAACTGCTC 

TCTGACCACAGCTTGAAAAGGGGCTTATGCCMGATTCAAGGGTTGTTTCAGGACCTGCT 

GTGGGACTGAAATCTGCCTGCCTGTCCTGAGGACACANAAGGATGTATCTCCTGACTACA 

GGAGGATTTCTGTGCAGGATTTCTGTAAGACTTGTGGCTGANTGGAGCTGAATCCCCGAT 
TC " 

Sequence 1467 
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CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGATGTCCCTGAAGTCCTCCAGGCCCAC 

ACCTCCACCCGCCTTCTGTCCTGTATCTGCGGAAATATTTATTTTCTGTAATGAACTTTC 

TTGGGGCTCCAGACACCCTCTCAGCCTCTTCCCACACAGAACTTTGCCTACACATTCCTA 

CTACCCCTGGAATTCTAACTCAGATGTGGGTAGCAGCTTCCTCAAAGAGAAACTTTTTCC 

CAGCTGGGTGCTGTGGCTCACACCTGTAATCCCAGCCCTTTGGGAGGCTGGAGTGGGCAG 

ATCGCTTGAGCCCAGGAGTTTGAGATCAGCCTGGGCAACATGGTGAAACTCCATCTCTGT 

GAAAAATACAAAAATTAG 

Sequence 1468 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGTCTTCTCTNCTACTTATGCCTTTTCTTC 

TTCCTCCTCACCCTCATGGCTCCAGGTCCATGCCCAGGGAGCATGTTAGCATGTTGTCAG 

GTCTCAAAGTATCTGAAAAGATTGTCTTCTCTGTGGCCAGGCTGCTTAGAGGCAGCCTGA 

TATAAACTGTAAAAAGGGGGAGAGTGTTTCTCTGTGTNCTNTGCATCCACTCTTCATGCA 

TTTGCTCCA 

Sequence 1469 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCAAGGCC 

TTTGGMTMTGAACTCCAGTTGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGG 

ATTATGGAGATTGGTAAGGCGAGATCATCCAAGGGCCTTTTGCTTGGTMGCCATTTTAC 

TTTMTCTTGA GTGCCATAGGGATTCATTGACGGATTGATACAGGGAAATGAAATGATTT 

TTTTTTTTTTTGGTTGGGGGAGGCAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGT 

GGCACAACGTTCGGTTCACTGCAGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTC 

AGCCTACCTTGTAGCTGGGATTACAGGTGCACACCACCACACCCAGCTATTT7TTA 

Sequence 1470 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTGACTGTGGCTCAGATCTGCGTCGCAGCAG 
CGAGAGAAGAAATCACTCCATATCCGATGAGAGGAAGAGTGGCACAGAGATGGTGTCTAC 
AATTAGAGACATTTCTGACTCCACCTTAG 
Sequence 1471 

CCCTTAGCGTGGTCGCGGCCGAGGAAC I I I I I II I I I I I I I I I I I I I I I I! I I GCAACAG 

GGTCCTACTATATTGCCCTGGCTACACTTCAACTCCTGGACTCAAGCAATTCTNTCATCT 

CAGTCTCCTGAGTAGTTGGGACACCACTGTGCCCMTTTNTATTTTTTAGAANAAATTAT 

GTTCATTCTCCACAGTATACACTCTTATCTGTAGCTTGCATTATTCACTTAATTAAGGAT 

C7TGGAAATTACACAGTTTATCAGATCTTTCACTCCTTTANAGTATACACGCTGTATACT 

CTAGTCAACCAGNCCCCAATMTTTCTAATCTTTTGCTATTTATTATAAACAAGGACTGC 

NTGCCATACACATACACATAAGAGTAAAAAGACAGGATATAGAGTTAGGAACAAATGTGG 

TAAGGATATTGTATGTCAAATATTTTATNGGGCTATGCATATTTTATGNNATTTNAGGCA 

CCCCCCCCCCCCCAAAAAAA 

Sequence 1472 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGAGCCTGTGTGTCAGGTAATGTGGTTAACTTA 

ACATTCATTACAGCATGTCATCACCACAAAGGCCTCTTGTGGCTGGTGCTA I rfTTTTTC 

CATTTTGCCGACAAGGTTTAGAGAGATTGAATCACCTGCTTAGACCCACACACCCAGTAA 

GTGTTGGAATCTGGATTGATGCTGTAATGACCATTATCCAGATCCTTATCACCGTTTGGG 

GAAATTGTCTGAAAATCCCTCTGGGTTATTTAGAGTCMGCTTTCATCTGGGTCTTCAGA 

AAGTTGTAGTGGAGCCATCTGCACCACCACACACCTGAAGACATTTCAGGACTAACCTGG 

ATGAAAAGGGAATTGACTTTTAGCTAGGATAGGACAAAAGGGCATAATCATAAAATTAAT 

TCACTTTCCTTCCTTTCTTTTGAAAA 

Sequence 1473 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTGCTCACATATTCAATCACTCCATCTCAAA 
AT7TATGAGGAAAGGGATGCCTGGAGACTGGAAGAACCACTTTACTGTGGCTATGAATGA 
GAACTTTGATAAGCATTATGAAAAGAAGATGGCAGGGTCCACACTGAACTTCTGCCTGGA 
GATCTGAGAGGAACAACAACAAACTAGGTGACAGAGACTATGCCAACTATTTCGCCTTTT 
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ATTCTGTTGAGCAAGGAACTGTGACTGAATGTGGAGCTTATGAGCTTCAGTCCATCTCCT 

ATAGTGTGGCTAGTTTGCTATAATATTAAAACATGATTTAAAATATCAACAA 

Sequence 1474 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGACAGGCTTCATGGTGAGGGGCTAGAAAG 

CAACGTCAGGGCCGTTGCCCAGGGAGCCTGAGGGCAGAAGTCAGAGGAGGAAGTTACCCA 

CTTCAGGACTCAACTTGGAAGGCAAGTAGACAGGTCTATTCTGCTGTCAGAAAAAATCTG 

CACACTTGAGCTTGGGCACTCACCACCGGGTGCTTAGTTTTGTTCTTAGAGTAGCTTCCT 

CACACTGATCTCAAAGAGTAAGTGGACTTTTGGGGAAGGACTTCCTGGTGGGTTTGGAAC 

AGAACTGGGGAGAAGTGGCTCAAAGGCCCCCAGGCAGTTGTGGGGGCAGAGAAAGCTTTG 

GGCTGCCAGAGAGGAGCAGTCAtCTGTGTCTCCGAGGGGAAGAGCAGGGAGCAGGCCTGG 

TTACAGTGGAAAGGGATCACACACAGGGTGCCTGCCCCCAAAATGCTGCGGAAACGTGGT 

TTGGGGCTT 

Sequence 1475 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAGTGGGTGTTATTTTGTTGMTGAATGTTTTA 

CCCAAACTTATATGCATATTAGATTGTTTAAAAGCTCAATTAAAAAGTGAAAAATAGACA 

CTGAAA Al MM fGAGTGATTTCACATGTGTTCATGAAACAATATATCTATTACAAATCA 

TTGAATATTTTAAAAGTMTGTCATGTATTCATTTTCTTATTTTATCTCCTCAGTAGACT 

GAGATCAGTGTGTAGGGGTGAAGCAAGGAGTCCTACCCTGTTTGACTAAGGTAAAAATTA 

ANAATCAGCTGAGAAAATGGAATTTGCAAAAGTGCCTGCCAGATAATGTTAGAACTGGAC 

CAGAAAATAGGAGTTGGTATAAAACTAGACCAGCGAGC I I I I I I 1 1 CCTTCAAG ATGC AG 

GTCAG7TTTATTGCTTTTGGTAAATTAAGAGATTGTGTTTCTTGATCTTTA 

AATACAATGTTAACCTACTTCAAATTTTAAAAAATATACCACACATGTATATGTATGNGT 

GTGNGTATATACACACAGGG ATTTTAAG GGC AG 

Sequence 1476 

CCCTTAGCGTGGTCGCGGCCGAGGTACCMTTTATTATATGCTGCCTTCAGTTATTTATT 

TATATATTTGTCTGTTAAACTAATTGTAAAGTGTTCATTAATACTGGCTTATAATCCACA 

TGTAAGCAAATTAAAAGTATATCTTAAACATTTACTTTAATATTTTAA^ 

CAAATG AGTTCC AATTTTCCCAACCTGCTCTTAACCAGTA I rTTTTTI I AACCAGTTCTC 

TAC7TCTTTAATGAAATTAAAGAAACTAAAATAATTTCACCACCTTTAATAACAM 

CATCAGGGAAATTATATGTGAACATGTAATAACAAAATTAAAGCTACTGATCTGTAAAAG 

GACAGTAAGAAAAATTAGTATTCAAAATGACAGGGAGATATTTAAATACCGACACTGCAG 

CAGCATAAACATGATAAATCTCATTTTGATATAAATGGGGAATTTCCAAGATCCAGCTTT 

TATGCTTTGCTATTCTAAAAA 

Sequence 1477 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTATAGGCAATAAGGCGAGTCTAAGACCTAAAC 

TAGATAATTTGAGAACAGGGAAAAAAGATTCCATTTCGATTCCTGAAGGTTACCCCCATA 

CCTATTATAACAGAATAAAATAAAATAATTCGAAACTGCACAACCTCTAACTTATCAAAT 

CCTATATATGCCTCATTTTCTCAAATGACTCCTAATTTGTGTAAAGAAAAGGCAAAAAGA 

GAAAGGACAGAAAGTATGTCAAGGGGGGCTAAAGCTATGAATACCCTTTTATGTAAACCN 

AGAAAAAAATAGATCNCNCCCNTTTTTTAAAAGGGGA 

Sequence 1478 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGAACCAGCATTCTGACTTCGGCCAGGT 

TAATTTAATCTTTCTGATCTGTAAATGGGTAGATTTCCTGCTGATAGTGG I ' nTTTTT GT 

GTGGGGGGGGGGTGCCTAAAATAACATAAAATATGCATAGCCCAATAAAATATTTGACAT 

ACMTATCCTTACCACATTTGTTCCTAACTCTATATCCTGTCTTTTTACTCTTATGTGTA 

TGTGTATGGCATGCAGTCCTTGTTTATAATAAATAGCAAAAGATTAGAAATTATTGGGGA 

CTGGTTGACTAGAGTATACAGCGTGTATACTCTAAAGGAGTGAAAGATCTGATAAACTGT 

GTAATTTCCAAGATCCTTAATTAAGTGAATAATGCAAGCTACAGATAAGAGTGTATACTG 

TGGAGAATGAACATAATTTCTTCTAAAAAATAGAAATTGGGCACAGTGGTGTCCCAACTA 
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CTCAGGAGACTGAGATGAGAGAATTGCTTGAGTCCAGGAGTTGAAGTGTANCCCAGGG 
Sequence 1479 

GGAGAAGGCTTCCATTTACTTCAGAAAGCTACACTGACTCTACTAAAAGGCCCCTTTCCT 

GCTGGCT7TTAGGGAAAAGAGAGAATTCCCTCATCAGTCTCAANAATCACAAGCAACAGG 

AAATGGGGGAAAATCGTTTTACTACAGAATGANAATGNTGTCTGATTCAAGTAATACCTC 

CTTGCATCTTCCCCTTTAGCNCAAAGAAACCCCACTTAGGCCTC7TCAAGNAATATTCTT 

GAGGTAGCCTTGTCAGAAGGGTGGTCCCTTTACANGACCAGGCANAANTTTGCCTNAAAA 

TMNCAGTTTTGGTTrGGAAATMTmACCTTAMGNCCGCCCAAGAGCMCTTGGGNG 

CCTAAAAGGGGGCCTCCCTTTGTNGGAAAATTTGTGNNCMC7TTGGATCCCTCNACCCA 

CAAGN CCCCTCCGTTCACCCTTCCCATTTTTGG ACCAGNATGGAGGAAAAAACC TTG AAG 

AACCTCACCTTAAAGGGCCNNGAAGGCCTTCCAACACCAAGCCTTAGGTGGACCAGGGGC 

CAAAAAGCCCCAGGACCTTTGGGGGGACCCCTTGGGCTTTCCCAGNTCCTTGCTTGGAAG 

CCAGTTCTTGGCCCTTCCCAGGAAACCCCTATTANCTCTTNTAAAAATCCCCCGNAAAAA 

ACCAGG AGGCTAATAAG NATGGNCT 

Sequence 1480 

CTTGATGCATGCTCCTCCGAGCCGNCAGTCTGAGTGGATGANATATCTGNAGAATTCGCC 

CTTNCGNGCGGGNGCGCCCGGGCAGGNACCCGTTTTCATCTCTGCCTTGATAACATCCGT 

GGATACAATACTATCCATACTCCAAGCCGTTATTGATATTTGAACATGTCGNGGC7TGCC 

TNACCTGCCTGCGGGTTTGGATTTCCCATAGATGGGCTTTGGCTCCCTCTGGGGNCTACA 

CTGTGCCCCACATANACAAGGGGAAGCTANACAATTNTTGCNCNCAGTAGGGCCAAATCA 

CCAGGGCCCCAAAGGNGTTGCNAGGCCAAGTGGGCTTGGAATAATTGGAAANTCTCTAGC 

CNTNCTTCANCCNACTGGTGGGGNANTTTCAANACCGNACCTGGTTGGCCCCCCTTGGAA 

GNGNTTTNNTTANMNCNCCAGNGTGGGGGTAATTTTTTTNGGGGAAAA^ 

AN NGG G A GGCCCAACTTTTTTTCCTTMTTTAATTC AACCAATTG GTGGGAAAAAATAAT 

CC mm NATTTTNAANANNCTTAATATAAAAAAAAGNTCTCGANGNGNTAATTAACCA 

AATTNGTTTTAATTAAAAAAAAAAAAANNAAAAAAANGG 

Sequence 1481 

AGCGTGGTCGCNGCCGANGTACGCGGGGGCTCACAGTGGGATCCTGTTAGGTGTGGGTGG 

ATGAGAGTCAGGGTCCATCAGTGTATTCATTTMCTGTTCATTTGTATMCCCCGTTTAA 

GAATACTGTCCTCCAAGTGCCAAGAATGGTGCTCAGGGGATTACCACCTAATTGCTGACT 

CAAGTTGCTGGTTTGCAATGGGCACAGAACTTCTNTTAGTAGGTGGCATGAGTTGAGAAG 

GTTCTGGATCAGAGATAGGGGCCCCTCTGATCACCTCCACTCCTATAGGTATGAAGACAG 

AGCTGGCCATGCGCCAGTCTGTGGAGAACGACATNCATGGGCTCCGCA 

Sequence 1482 

AATTCGCCCTTAGCGTGGTCGCGGCCGAGGTACACAAAAGGGGTTCATCCACTGTTAGAA 

TAATGGATTAGGCCATTTCCTTGGGGAGCCACAATGCCAGCTTCTTTGGTCTTTTATTCC 

TCTTAGACTAGTCAGATACCACAGAGAAAATTCCAGTTGCTGCTTAGAGATAAACATCTG 

GCTGCTATTGTGTGGGAGACAAGTCCGAAAGAGAAAGCAGGGAGTCTTNAATTAAATACA 

TGAACCTATGTC1TAACCCCACTGTTTGCACACTGTCCTGAGTCCAGAGACACTCANTTT 

CACACTACCCAAAG 

Sequence 1483 

CCAGCGGCCGCNCNGGCNGGCACCTGACTTTGGGTAAGTTGCCAGAATAAGATCATCAGG 

CTTGGCTTGGAAATTACATACTTTTTCCCACCATTCTTTTGATAATATCAACGTAGGGAC 

TCCATCTACTTCCATGATGTTAAACAGTTCTGGCTTTTTTTCCATCGNGGGAGCGTT^ 

CTCAATCTTCGCCATTGGGAATCTGGTATCTCAGGCAGCAGAGGAAGCAGCTTGCTGCCT 

TTAGTCAAACTGCTTCCTGGAAACCCAGAGCCAGGTCCAGCTCCAGGACACTGTGCAAGC 

AAGAGCACAGCAGGTCCCCGCGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 1484 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACACTCGGCCACTGTTCTCTGACCTACTGCCAG 
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CCCTAGTGCTGCCTGCACCCCAGGAGCTAGCTGTAAGACCTGGGGCAGGTATGCAGCCGC 

TGCAAGCCTTGGTTCCACACCACGCCTGTGAAAGGAAGGAAGGATACAAGGCAATGAGCC 

ACACAGCTCCTTCCAGCGCCAACAGCCACTGCGAMTTAGGCTC7TTTGCTTTGGCCATA 

CTTGTATACCGGCATTCGGGCTCAGCTGGGCACTGGAAAGCAGCTGTTCCGTGCTTGTTA 
AGGTAATTAAGCA 

Sequence 1485 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGGAAAAACAGTAGTTAAGGGTTGCATTTCCA 

GGCCTCTGCCTCCATCAAGTATTATITACCAAGCTAAAGAAGCTGAGGGGTGTGGGGGAA 
ATATATTTG 

Sequence 1486 

CCCTTTCGAGCGGCCGCCCGGGCAGGTA C I iTnTTTTTTTTTTTTTTTI I ATTTTTTTT 
1 1 Minn ii ii i i i i i M | | | | | | | | | j | | | | | | | | | ANGACAANCAAANAN AAAANA 
N ACAAAAN NTTTAAAAAAAAAAN NN AAAANTTAAN NAAAATTNTTGAAAANTTTTTTGN N 
N NANAATNNGGCAN N NCTTAAAAAAN NTGGNNTGANTAAANANANAAAAAAAAAACANTA 
AAANTGGGANGGGAAAANAACCAAAA 
Sequence 1487 - 

CCCTTAGCGTGGTCGCGGCCGAGGTACATGCAAGCTACTCTTGCTCTTTGGATCCATGGC 

TGAATCTGGACACCAGCAGTGACCAGCTCTAGAGTAAAAAAATCAGGAAGGGGGCTGGCC 

ACACTGGGTTGGAGAATCAGACTTGGGGATACCATGCCATGAGGTGTGTGGCCTCTGGGC 

AACAAAACAACACCAGTTTAGACTCATCTCCTTGAAACAGAAGGAAAAGCCAGGCTTGAA 

AACATGGGTTAGAACAAGGAATCTTGGGCCAGGCGCAGTGGCTCAGGCCTGTAATC 
Sequence 1488 

CCCTT AGCGTGGTCGCGGCCGAGGTACCAGGATGCAAAAGCAAGAATGTTCACAGCAGTA 

TTTTTGATAACAACCACATTTGGAAATTATTCAAATGTCCACCCACAGTAAAACAAACTT 

ATACAATGGAATCAGTGGACTACAGCATGTAGCTGTTAAAATGCAATTAAAAATGACAAA 

CATATATMGCAAGTCACAAAAGTGCATACTTTTAAACTTrTAACATTGGCAAACAATAT 

TGTTTGCAGATATATATACTGGTAGTAAAAACTGTAAATAAAAGTAAGGAAAGGATTACT 

ATAAAAAGGCAATGGTGGTCA 

Sequence 1489 

ATTTCCAGGCGGTAGGCGTCTACCGCGATATCCTGGGTGGCGGAAGCGAAGGCGACGATA 

ACCGCGATAGCGATCAGCCAGGACAGGTGTTTTTGCGGATCGCAGAACCCCATGCCGATC 

AATCCGAGGATAACCAGCGTCTGGGACAAGACTAACCAGGATCGACGACGTCCGAGCTTG 

CCGAGCAGTGGCAGGCGCCATTGGTCGAGCAAGGGCGACCAGACCCACTTGAAGGCATAG 

GCCAGGCCGATCAGGCTCGCGTACCCGATGGTTTCGCGGGCCACACCGGCCTCACGCAAC 

CACACCGAGAGCGTCGAGAACACCAACATGTAAGGCAAGCCGGCAGCAAAGCCGAGCAAC 

AGCAGCACTAACGTCGAGGGGCTGGCATTAGGCCGGCGAGCGCGGCGCGCCAGGTTTTAC 
GGGGCATGGG 

Sequence 1490 

TTTGNAGCTCCCCGCGGTGGCGGCCGTGGAAGCGGCGCGGGCCGGCGAGTTTNGGACGCG 

GCTTTGCCGTNGTCGCCACCGAGGTGCGCACCCTGGCGCANCGCTCGACGGCCGCCGCGA 

AAGACATCAAGGATTTGATCCAGGCGTCCGCCGGCACGGTGGACCTGGGCCACGCCCAAG 

TGAGCCAGGCCAGCGCCACCATGGATACCGTGGTGGCCAGCGTGCAGCAACTGAGCACCA 

TCATGGCCGAGATCGCGCAGGCCAGCGACGAGCAGCGCAGCGGCATCGAGCAAGTNAACC 

AAGGCCATCGCCCAGATGGACCAGGTGACCCAGCAGAACGCAGCCTTGGTTGGAAGAAGC 

GGCCGCAGCCGGCAGACGCCTTTGCAGGAGCAGGCGCANGAGCTTGAACCAGGTGGTGGG 

GCGTGTTCAAGCTTGTMCAAGCAATTCCCGGGTGCTTAAGATTGGGGGAACGAAATTCC 

ACCCTTTGCCAAGGGAGGCACCCATTGCCATTGAACGACTTAC^ 

CCCCCGGGCTTGATTGAATNCCCCAACNGGGATTGGCCGCCACCGTTGATTTCCGCCGGG 
GTTGCCCTGGGCCCCGCCGGCCCGGCATTTCNAACTTTNCTTTGNTNGACCGGGACCNAA 
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CCANGCNACATTTGCNAGGGCCCGAAMTTGCNTTGNTTGGGCGGGGCCGGCCANTTTGG 

GCGGGGGGNCCCCCGNGTTNANTGGNTNCCGCCGNAACCANCNAANATNGGGCCCCNCCN 

CNAANTTGAAATTTNTTTGGCCNGCTTTTTANGCGGG 

CCAAAATTTTCCCANTTTTNGGGNCCCCCCCNNCCCCCC 1 I 1 1 I 1 1 I 1 CNGNNTTNTTTT 

TTCTANGTTTNAANNCCCCCNCCCCCC 

Sequence 1491 

AGGTACGTGGTGGAGCCAGGATCTGTCGGACTCTGGAGTCCAGGTTTGCAGAGAGCCCCC 
GGCTCATCCTGGCCTCTCCGGCATCTCACTTCCTCCCTGCAGGCACGTAGGACCGGCCTC 
• AGCTCTGTCAAGATCTGCACTGTAGGACCCCAGAGATTTTTTTCTTTCTTTGGGTAGTCA 
GACCCATCTGCAAAAGTTTTGCAATCATGAAATCAGATAGCGTATGAAATGCTAGGAATA 
AGCGCAGGTCTAGCATACCACGCGGCCAGCGGTAGCGTGACCTGACAGGAGGAGTGCGTC 
TTCAGAGACATTGGGAAACTGGCGTGCCTGTCGCCCGTCCCAGGGTCAGAGTCTTCGAAA 
CCAGGCTTTCTGGGAACCCAACAAGGAGTTTCTGATCGAAGAACCGCACTGAGGACAGTT 
AGTCTTTTTTTCCACACTGGGCATTTTCAGCATTGGAAGTCAATGGGAGC 
Sequence 1492 

CCGGGCAGGTACGCGGGGAGAATAGAGTGGCTGACTTGCTGATTGAGGTCGGCAGCTCCC 

GAGAAGCAGGAGAAGCAGAGTTTTTGCAGACAGACCCACGAAGTCAAGCATGCGGAGTGC 

AGCCAAGCCCTGGAACCCAACCATCAGAGCAGGGGGCCACGGCCCAGACCGGGTGCGGCC 

TCTGCCTGCAGCCTCTTCCGGCATGAAGAGTTCTAAGTCTTCAACTTCCTTGGCTTTTGA 

GTCCCGACTCAGCAGGCTCAAGAGGGCCAGCAGTGAGGACACGCTCAACAAGCCAGGAAG 

TACCT 

Sequence 1493 

AGGCTTGGCTTGGAAATTACAGACTTTTTCCCACCATTCTTTTGATAATATCAACGTAGG 

GACTCCATCTACTTCCATGATGTTAAACAGTTCTGGCI Mill I CCATCGTGGGAGCGTT 

TTTCTCAATCTTCGCCATTGGGAATCTGGTATTTCAGGCAGCAGAGGAAGCAGCTTGCTG 

CCTTTAGTCAAACTGCTTCCTGGAAACCCAGAGCCAGGTCCAGCTCCAGGACACTGTGCA 

AGCAAGAGCACAGCAGGTGAGGTCCCCGCGTACCT 

Sequence 1494 

TTGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGCACTTGAGGTAGCTTATC 

TCATTTATTCTTCACGATGAACCTTTCTCCACCATCTAATGGCTTCTTCCTCTGCAGCAT 

CTGGTTCTGTATATTCTGGCCACCAAAAGCGAAGTTCCTAAATGTTGGACTGAAATCCTA 

GCTCACTGTAGGAAGCCAGCTAAGAGCTGAAMGATTTTGGAAATAGGATTTCTGGAGCA 

GCCCAGAGTCAGGCACCAACTATCCTGTCCATAGTCATGTCCTCCCCGCGTACCTGCCCG 

Sequence 1495 

AGGTGGCCGAGGTACGCGGGGGGCCAAATAAGCACTATGTAAGI I I I I I I AATTTTCCCA 

MCACATGCACAGAATTMTATTGCCTTAMTTTTGTTAGCACAAGTTTGAGAGAGCTCT 

TTAGTTCATAAAAAAATGTTTTATGGACCAGGTGCCGTGGCTCACACCTGCAATCTCAAC 

ACCCTGGGAGGCCCAGGTGGGAGGATCGCCTGAGCCTAGGAATTTGAGACCATCCTGGCC 

AACATGGTGAAGCCCCGTCTCTACTAAAAATACAAAAAATTATCTGGACATGCTGGCGTG 

CGCCTGTAGTCTCAGCTACTAGGGAGACTGAGGTGGGAGAATCACCTGAGCCTGGGAGGT 

GGAGGTTGCAGTGGGTTGAGATCATGTCACTGTACCTGCCCG 

Sequence 1496 

AGGCCAGGGTGGCTCACACCTGTAATCTCAGCACTTTGCGAGGCCGAGGTGGGAGGACAG 

CTTGAGGCCAGGAGTTTGAGACCAACCTGGGCAGCATAGCAAGACCCTGTCTCTACAAAA 

AGTTAAAAAATTAGTCGAGCGGCCGCCCGGGCAGGTACAAATAATAGCTGCGCCCACACA 

CACTAATGCCCCCCCACAGCCCACAGCCTTGGGGCGCTCCAAGGAGAGTGAGTTCAATAC 

CAGCAGTAAGGGCGACTCGCTCCTGGACGTGAATCTAGCATGCAAGATCATTTTCATTAA 

ACACGTMCTGCACAGCGACGATTTACATCTCCTCTTATTTCCTTTGAATGAGAAGCAGA 
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CGCATAACAGCAATCTGGAATAGAAAGTGGAGCTTTTACAGGCTGTTGTATGTGCACCAA 
GGCCGGGAGAGGCTGACGTCCGTCAGCATCATGTGGCAGGAAAACCGCAAATAGTTCGAA 
AGGCCACACTC TCCA GCACCGACTGAGTAGCTGCTGCAGGAAGGGGGACAAATGGTCCTC 
AGCAAGGACCCTTTTCAGAAGGAACANGGAGACCGCGTGGAGGGCCCATGANAATGAATT 

Sequence 1497 

AGGT ACCTGA CTTTGGGTA AGTT GCCAGAAATAAGATCATCAGGCTTGGCTTGGAAATTA 
CATACTTTTTCCCACCAT TCTTTTGA TAATATCAACGTAGGGACTCCATCTACTTCCATG 
•ATGTTAAACAGTTCTGGCTTTTTTTCCATCGTGGGAGC^ 

TGGGAATCTGGTATCTCAGGCAGCAGAGGAAAGCAGCTTGCTGCCTTTAAGTCAAACTGC 

TTCCTGGGAAACCCANGAGCCAGGGTCCAAGCTCCAGGAACACTGGTGCAAGGCCAAAGA 

AGTACAGCAAGGTCCCCGCGTNCCCTGCCCCGGGCCGGCCGCTCCGAGAATATCNAAGCT 

^^C GATACCCGTCGACCTCCGAGGGGGGGGGCCCCGGGTACCCAGCTTTTTTGTTCC 

CCTTTTAGT GAGGG GTTTAAATTTGCGCCGCTTTGGGNCGTTAAATTCCANTGGGTCCAA 

TAAGCCTGGTTTTTCCCTGGTGGTNGAAAAAATTNGTTTNATTCCCGCCTTCCAA 

TTTCCCACCACCAAACCATTAACNGAAGCCCCGGGGGGNAGCCANTAAAAAAGTTGGTAA 

AMGGCCCCTNGGGGG^GGCCCTrAMTNGMGTTGGAGGCTTAAACCTTCMCCATTT 

AAANTTTGGCGGTTTGGCCNGCCTTCCAACNTGGGCCCGGCTTTTTTCCAAAGTTCCNGG 

GGAAAMCCCCTGGTCCCNTTGGCCCCANCCTTGGCAATTTAAAATTGGAAAATTCGGGG 
CCNAAAC 

Sequence 1498 

GAGCTCCCCGCGGTGGCGNTCGAGGTACTTTGGGAGGCTGNTGCAGGAAGATCCCCTGAG 
CCCAGGAGTTCGAGAACAGCCTGGACAACATAAGGAGACCCCGTCTCTATAAAGTAATAA 
TMTAATAAMTAGTTGTTTGCCTTGCAGCCCAGGGTGAGTAGAAAAGTATGGCTGAATA 
TAAAGCMCAAATATTTATTTTACTTTCTATATACTTCTTAAAAGAAAGTTT 

TAAATAAATACTACCTTTACTTAAGTCCTAACTTCACATTGTCTGTACCTGCCCG 
Sequence 1499 

TCCACCGCGGTGGCGGCCGAGGTACCAAAGACCTCCAGTGATGCTTATGAACATGTAGCT 

GCTGTTTGGAAGCTGGAAGCCCCATATCAGATGCCTACCACAGAATCTGGAGTTGGGGCA 

GAGTGCTGAATTGGTCGGAGTTACCTGTCTGCTTGATAACTTCAGATGCCTCCTTTCTCA 

GATGGCATTGCTAAGCTCTGCAAAGATCTAACTGTGAGGATGAGATAATTTAATC7TGCC 

CGTGGGCTCTATCCTGGCGGAGATACCATGGGACTGAATCCTGGGCTAAATTGTAATCTT 

GGGAAAGGCAACTATACCTGGCCGAGTTGTTCCTAGAGAGAAAGCAGAAGAAGGTCTTCC 

CAAGAAAGACCATGACAGCACAGAGGCAGTCTGAAGGGAAGCCAAGGAAAGGATTTGAGA 

CCCTACAAGAATGAAG7TGGAAGGATAATAAAAGTCCACAGGCAAGAGTACCTGCCCG 
Sequence 1500 

GAGACAAAGTCTCACTCTTGCCCAGGCTGGAATGCAGTGAGCCGAGATTGCACCACTGCA 
CTCTAGCCTG GGCAAC AGAGCAAGACTCCATCTCAAAAAAAAAA t I I I I I I i A GTAAAAA 
AATCCCGTAG I I I I I I AATGCTACGAAGATGGCAAAAAAGAAGACATAGACAAAGCAGAA 
ACTGATGG CCTCTGCTA TTTCATGCCCTGTGNGACTACTTCATCATTTGTTGGTTTGGGG 
GTTTTATAAi iiiilll I CTGTAANATGCGTGTTTTTCTTTCTCTCTATTTTTATTTGAN 
AGGGTGGCTGCCTATGGGACAGCAMGCTAATATTTAGAGTTTGAACTGCAGACATTGTT 
GCAGGCTTNTTATAAAGGTNGNCCGATTCTTTAAAGTCTATAATAAAGGTNCATAGACCA 
CAGCTTTACAACTTTTAAATCAAAAAGTATTAAGNAGGCTGGGGCANCANNTGGGCTCAC 
CCCCTGNTAATTNCCAAAAACGATGGGGAGGCCCGAGGCAANGGGANGATTTGGTTGGNN 

GGNCCANGNAANTTTCAAANACCCAGCCCCTGGGGTGAANAANAAGGGAAGATCCCCTGC 
CTNAAAAAAAAAA 

Sequence 1501 

GCCATGCTCTCCTCCTCTGCCAGTCTCCTCCACCACTCTCTAACCTGAGAGCCTGTGGAA 
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CCTGCCCGTCTCCCCTCCTCCATCAGACACACCTGCCTAGGAAACAGGAAAGGACCTCGG 

AAGTCTTCTAAGGAGAGTCATGGCGTATTACCAGGAGCCTTCAGTGGAGACCTCCATCAT 

CAAGTTCAAAGACCAGGACTTTACCACCTTGCGGGATCACTGCCTGAGCATGGGCCCGGA 

CGTTTAAGGATGGAGGACATTCCCTGCAGCAAGATTCTTCCATAGGCCAGNAAGCTGCTC 

CAGGGAAAAAACGCCTCTCCAATGTGATATTGGGAAAGCCGGCCACAGGGATCTACNCAG 

GGGGGGTCCCTCCTCACTTCATCCTGGGATTGATATAAAGCCAGATTTGGACATCCAACA 

AGGGAGGGCGCCAGCTTGACTGCTGGGTTTCCTGGGCAAGCACCTGGGGGAATCCTTGGA 

CTTNAANAAACCCACAAGTACCCTCGGGCCCGCTTCTAAGAAACCTAGGTGGGATCCCCC 

CCCGGGGCTGCAAGGGAAATTTCGATTATTCCAAGNCTTTATTNGATTANCCGGTTCGAA 

CCCTTNGGAGGGGGGGGGGGGGC 

Sequence 1502 

CCGGGCAGGTAC rTTTTTTTTTTTTTTTTTTTTTn I 'GAG ACAAAGTCTC AC TCTTGCCC 
AGGCTGGAATGCAGTGAGCCGAGATTGCACCACTGCACTCTAGCCTGGGCAACAGAGCAA 
GACTCCATNTCAAAAAAAAAA 1 1 I I I I I I AGTAAAAAAATCCCGTAG I Mil I AATGCTA 
CGAAGATGGCAAAAAAGAAGACATAGACAAAGCAGAAACTGATGG CCTCTGCT ATTTCAT 
GCCCTGNG NG ACTACTTCATCATTTGTTGGTTTGGGGGTTTTATAA I Him iCTGNAG 
ATGCGGGTTTTTCTTTCTCTCTATTmATTTGATNGTGTG 

GCTMTATTTANAGTTTGAACTGCAACACATTGNTGCAGGCTTNTTATAAAGGTAGTCGA 
TTCTTTAAAGTCTATAATAMGTACNTAGACCACAGCTTTACAACTTTAAATCAAAAAGT 
ATTAAGAGGCTGGGCACCAGNGGCTCACGCCTGTAATCCCNAAACGATNGGAGGCCGAGG 
CNGGAGGGATTGGTTGNGGCCNGGAANTCAAGACCAGCCTGGGGGACCAGANTGAGATCC 

TGCCNTCAAAAAAAAA 
Sequence 1503 

CTCCATTGGATCTGGTAGGTCTGGTAGTAGGCAATGTCACTCAGCAGGGCGCCGAGAAAC 

AGCGGGACGGTACCGGCCAGCAAGGTCGCATGAAGCGGGCCAGGCGTGCAGCGGTAGGTG 

AGGGTGGTGGTGTCGGTCACGGGCGGCTCCTTATCGTTGAGCGGCCCTGATGAGGCTCAT 

CAGGGCCGCGCTGACTGCGCTTAAGGAAGGAGCGTGCTGCTTGCGCGTTAGT TCACC CGC 

GATTCGTTTCCAAATATGTTGCGTCAGTTGATGCGTGGATGTTGCTGCACCAGGTTTTCA 

CGCTTGGCTTCCAGCTCGGCGATTTGCGCGTCGATATCTTCGA 

Sequence 1504 

AGGTAC l HUH I CTC l IHIIIHHIUI I IH i 'GTATTTTCGGTAGGGACAGGGTT 

TCTCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTCATCTGCCCACCTCGGC 

CTCCTAAAGTGCTGGGATTACAGGCATGAGCCACCGCACCAGGCCTGCCTTCCTTTTTTT 

TCTTACAAGGGAGAAAGCATTTTCCAGCTGTAGATAGTGGTTTGAAAGAGTTAGTGTTTA 

CTATTTTGAGCTCTTTGAAATATGTCTCTTATACAGGGGAGTTTCTTTC AAAC ACATTCT 

TGTMTTAGGATCAGGCATTAGCATCTGAGAGGMTGAGTAATCACCATTTTCTA CTCTG 

CCTTTTCACCTGAGGACNGTCACGTGTGGGAATGTAGCGTTACCTTTGCTCTTCCTTTTT 

GCTGCTTATTATCGGGATATCATCCTAGGAACTGAGCCGCCCAGCCTTCACCACCTTTAC 

CTCCTGCATCCTGCGTTCCCCTC 

Sequence 1505 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCG AGGTACl I I I I I I I H W I I I M M I I u 

TTTTTTGGATTCTTAGTAGAGATGGGATTTCACCATGTTAGCCAGGCTGGTCTCGAACTC 

CTGGCCTCAAGTGATCTGCATACCTCGGCCTCCCAAAGNGCTGGAATTACAGGCATGAGC 

CACCGNGCTAGGCCTAGGTTTCACTACTTAACTGTATGTATTAATCATTGCTGAGCCATA 

CTGGGTAGTCAAGTTCTTTCCTTATACCGGGTTTTGCTTTTCTGMTTTAAAATAGCT^ 

ATTTTCAGTTACCAATCATTATTTCAGCTTGAAATTCTAGAGAACTA ACAATATC CTNTG 

GATAAAGATAGATCAAAGATTTTGTCTGTTTTGAGCANCTTCTTTTC I HUM (GGA 

Sequence 1 506 

CNCGTGGGCGGGCCCGCCCGGNCAGGTACTACNGNTTNTNTTTTTC MINI GGAGCANT 
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AAAMCGTTTTAATACAGGAMTTATTTTGCTGTACMTATANGCAGACAGTTTGCCTTC 

AGAAATTCAGAAATGCAGNTTTTGNGGGAGGTCANCATCATTGGTCTCAGCTACTAAAGT 

NGAGGATGATTCAGCCACTTTTATTCCAGCCACTCCATTTNTAGCATACACTACAGGAGA 

AGTGTNTNTTTAAAGAGGCCATTTTCCTGCAGGATGTTTATTAAATAGNTCCTGGCAAAG 

GAGGTNGCNCGTATAGGGNCCTACAAGGTAACGCTAGGTGGNANNATTNTCCTTCCAGNG 

TNNGGGTCTTAAGCCCACCAAGGTTCTTCCTTCCCATCTACCCCCGGAGTACCTCNGGCC 

CGCTCTAGAACNAGGTGGATCCCCCCGGNGCTTGCAAGGAATTCNCATATTCAAGCTTTA 

TCGATACNCGTCGACCTTTGANNGGGGGGGCCCGGTACCCCAACC7TTTGGGTC 

Sequence 1507 

GGTGGCGGNCGCCCGGGCAGGTACAANGTCATGGTGAGCAGGTGTTCTGAGGGAAGACAA 

AGGAAAAGCAGAGGGAGTGTTTGACAATTCTGAGCTTCCATATGGCAGACATTCGGGGCC 

CGTTGGCATGGTCCTCAGAGCAGCAACAACAGCATCAATTGAGGTTCATTAAAATGCANA 

ATCGCAGGTTCATGTGGACCTACTGAATCAGAACCTGCATTCTAACAACAGTTTTCAGTT 

GGTTCTTCCCGCACATTAAAGATTGGAAAAGCACTGGTCTGGAGGAGGCAGGCTCTACAA 

AAAGGGGTTTTGGGTATTGAGTGAGCCCGAAAAAGACAACCTGGGAACTTGAGGATTNCA 

GGGAT TGTACCCTGANAAMCAAAAGCATTTTCANAAANGCCTCAGTAAAAGGCANNGAA 

TTTTTAAACTTACTCTCA TTTTAT ATTTATCGCAACTACATGGTTC 

AAAAATTAGTCTTTCAGGTTTTTCAAAACCACNT^ 

TGGTATTTANTGGGAATTGGGTCTTGGGGGCCAAACCAAGGGGGATTAGNACCCCCAT 
Sequence 1 508 

CTCCCCGCGGTGGCGGCCGAGGTACCTGACTTTGGGTAAGTTGCCAGAATAAGATCATCA 

GGCTTGGCTTGGAAATTACATACTTTTTCCCACCATTCTTTTGATAATATCAACGTAGGG 

ACTCCATCTACTTCCATGATG7TAAACAGTTCTGGCTTTTTTTCCATCGTGGGAGCGTTT 

TTCTCAATCTTCGCCATTGGGAATCTGGTATCTCAGGCAGCAGAGGAAGCAGCTTGCTGC 

CTTTAGTCAAACTGCTTCCTGGAAACCCAGAGCCAGGTCCAGCTCCAGGACACTGTGCAA 

GCAAGAGCACAGCA 

Sequence 1509 

ATTGTTTAAAAGCTCMTTAAAMGTGAAAAATAGACACTGAAAATTTT^ 

CACATGTGT TCATG AAACAATATATCTATTACAAATCATTGAATATTTTAAAAGTAATGT 

CATGTATTCATTTTCTTATTTTATCTCCTCAGTAGACTGAGATCAGTGTGTAGGGGTGAA 

GCAAGGAGTCCTACCCTGTTTGACTAAGGTAAAAATTAAGAATCAGTGAGAAATGGAATT 

TGCAAAAGTGCCTGC CAGATAAT GTTAGAACTGGACCAGAAAATAGGAGTTGGTATAAAA 

CTAAGACCAGCGAGC I MINI I CCTTCAAGATGCAGTTCAGTTTATTGCT1TTGTAAAT 

TAMGANTGGGNNTCTTGATCTTTATTAAAGGAGAATACAATGTTAACCTACTTCAAATT 

TTAAAAAATATTCCCACATGTATATGGTATGGGGGGNGGATATACNCNNCAGGGATTTTA 
ANGGACAGGTTTT 

Sequence 1510 

CCGCGGTGGCGGCCGAGGTACTGAGCCTGTGTGTCAGGTAATGTGGTTAACTTAACATTC 

ATTACAGCATGTCATCACCACAAAGGCCTCTTGTGGCTGGTGCTA l II I M I I CCATTTT 

GCCGACAAGGTTTAGAGAGATTGAATCACCTGCTTAGACCCACACACCCAGTAAGTGTTG 

GAATCTGGATTGATGCTGTAATGACCATTATCCAGATCCTTATCACCGTTTGGGGAAATT 

GTCTGAAAATCCCTCTGGGTTATTTAGAGTCAAGCTTTCATCTGGGTCTTCAGAAAGTTG 

TAGTGGAGCCATCTGCACCACCACACACCTGAAGACATTTCAGGACTAACCTGGATGAAA 

AGGGAATTGA CTTTT AGCTAGGATAGGACAAMGGGCATAATCATAAAATTAATTCACTT 

TCCTTCCTTTCTTTTGAAAATTTTTCTAMCTGTGTATCCTCCCAGATATTCCCTC 

AGAAGACCCCAGTGAAGCCTTGGTTTGGAGAAACTGGGACTGTGTGGGGCAAGTGAANGT 

GGTCTGGGGGATAAAAGAATATGGTGGTTGCTGAGTGTGGNGTAATTNCAGCTTCTCANG 

AGTTGGGAGGATGGTTGNACCCGGGAGTGGGGANGGCCGNCCTGGGCACATGGCAAGGNC 

TCATCTCTTAAAAAAAA 
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Sequence 1511 

TCCACCGCGGTGGCGGCCGGCCATGCTCTCCTCCTCTGCCAGTCTCCTCCACCACTCTCT 

AACCTGAGAGCCTGTGGAACCTGCCCGTCTCCCCTCCTCCATCAGACACACCTGCCTAGG 

AAACAGGAAAGGACCTCGGAAGTCTTCTAAGGAGAGTCATGGCGTATTACCAGGAGCCTT 

CAGTGGAGACCTCCATCATCAAGTTCAAAGACCAGGACTTTACCACCTTGCGGGATCACT 

GGCCTGAAGCATGGGGCCGGGACGTTTAAGGGATGAGGACATTCCCTGCAAGCANGATTC 

TTTCCATAAGGCCCANAAGCTGCTCCAAGGAAAAACCGCCCTCTTCCCAAATGGTGGATN 

TTGGGAAAGCCGGGCCCACCAGGGAATCTTACCCAGGGGGGGTCCTTCCTTCAC7TTTCA 

TCCCTTGGGAATGGATTANTAAANGCCCGGATrTTTGACCAATTCCCAAACCAAAGGGAA 

NGGCGGCCAAGCCTTGGACCTTGGCTTGGGGTTTNCCTTGGGCCAAGNCAACCTNGGGGG 

AATTCCCTTTGGACCTNCCAAAAAAANCCCCCCCAGGGTTNCCCCTTCCGGGGCCGGGTT 

CNTAANNAAACCTMGGGGGGGATTCCCCCCCGGGGGCCTTGGGAAAGGGAAATTTNCGA 

NTATTTANAAGGCCCTTNTTTGGGATTCCCCGGGCGGAANCCTTCCGAAGGGGGGGGGGC 

C 

Sequence 1512 

GGAGCTCCACCGCGGTGGCCGAGCGGCCGCCCGGGCAGGTACCGTGAGTGCAGTGGAGTG 

GGAGGGCTGCATGATGTCTGTGTTACCATACTCTGCTGGTTGTCCTTTTACCTGACTGCT 

CCTCAGTCTCCTTTTCAGACATATTTGTTCTGCCTGTGCCTTACATGTTTGTGCTTCTCA 

TGGTTACCCTAGGCTCTGCTTTAATTTCTCCCTACTTTCTCCTGAAATTGTTATCTGTAT 

CCTCATTATTCTTAACMTATCTTCAGCCCAGTTCTTGTGCCAGAGCTTCACATTCATAT 

AGCTCAATTCCTACTTAACATTGCCACCTACATGTCTCAAATTCAGCAAGCCCAAAATTG 

AGTATACCACGTTCTGCCCCAAATATATTTCCCCCACACCCCTCAGCTTCTTTAGCTTGG 

TAAATAATACTTGATGGAGGCAGAGGCCTGGAAATG 

Sequence 1513 

GCCGCTGACCAGCCCCGCGCCGAACTGCTGCTGGGCAAGCTCTACTACGAAGGCAAGTGG 

GTCCCGGCAGATGCCAAGGCCGCCGAAGAGCACTTCCAGAAAGCCGTCGGCCGCGAAGTG 

GCTGCCGACTACTACCTCGGCCAGATCTACCGCCGTGGCTACCTGGGCAAGGTCTATTCG 

CAAAAGGCCCTGGACCACTTGCTGACGGCTGCGCGCAACGGCCAGAACAGCGCCGACTTC 

G CGAT CGCCCAACTGTTTTCCCAAGGCAAGGGCACCAAGCCCGACCCGCTTAACGCCTAT 

GTTTTCAGCCAGTTGGCCAAGGCCCAGGACACGCCGGAAGCCAACGACCTGGCGACCCAG 

CTCGAAGCTGCGCTGCCCCCGGAACAACGCGCCGAAGGCCAACGCCTGGTGCAACAGGAG 

CTGGCCGTGCGCGGCACCCTGGCC 

Sequence 1514 

GTGCGCTCGGTCGTTCGGCTTGCGGCGAGNCGGTATCAGCNTCACTCAAAGGGCGGGTAA 

NTACGGGTTATCNCACANNAATTCAGGGGGATTAAACGCCANGGAAAAGNAACCATTNGT 

GNAGCAAAAAAAGGGCCAGCCNTAAAANGGCNCANGGTAACCCGTGAAAAAAGGGGCCCG 

CCGNTTTGCCTTGGGCCGTTTTTTCTCATTAAGGGCTTCCCGCCCCCCCCTTGNACCGAG 

GCATTCACAAAAAAMTTCGACCGCTTCAAAAGTCCAGGAGGGTTGGGCCGAAAAACCCC 

GMCCAAGGGGACTTATTAAAAAGNATTNCCCCAGGGCCGTTTTTCCCCCCCTTGGGGAA 

AGGCCTTCCCTTCGGTGGCCGCTTCTTCCCCTGNTTGCCCGNACCCCTTGCCCGGNTTTT 

ACCCGGNAATNACCCCTGGTCCCCGNCCTTTTTCTTTCCCCTT 

Sequence 1515 

GCTCCTTGCATTGCAGGCACAGTGGCTTCTCCTGAGTCCCACGGCACATTCTTCTACTGG 

GCTCAAGCTGCCTTGCCTGAAGGGCTTTGTGCTACTGTCCCTGTCCTTAAGGTCTTCATG 

ACTCGTGGGGGTGATAAGACTCAAGCCAGCAACTGAGAAAGGATGAATATTCAAAGGAAG 

GGAGGAACACATCTGGCTGGAGTGATCAGGAACATTTTTAAAGGAGAAGGTGACAACTGA 

GCAGAGGAGCTTGGTCTTGAAGAAATGGAGAGATGGGATAATATTCCAGAAGAAGCAATG 

GAGTAAAGTTGTGMGGTGGTTCCAGAGCAGTGAAAGGTCTAGTTTCCCTGGAGCCTTGT 

CTTTGGGTAAAACTGCAACTGCATGCCAGCCTACTGATACCTGAGTGCTGATAAACAAAG 
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GCCGTTTTTGTGGGTTGAAACAGCCATAGGCCAGGGTGCAAGGACTTTGTCAGGANAAAA 

^^ GGGTmG ^ CCCCAm GGTCTCTCTGCCCANATGCCTTAACATTTTAAAAG 

TCTTTTCTTTCCANTTTCTTTTTCTTTTTTTCTA 

Sequence 1516 

CGTTTI ICCATAGGCTCNCGCCCCCCTGGACGAGCATCAACAAAAAATCGACCGCTTCAA 
AGTTCAAGAGGTGGGCGAAAACCCGACAGGACTATAAAAGATACCAAGGCCGTTTTCCCC 
C ^ G ^ GCTCCCTCGNGCG CTCTTCCTTGTTCCGACCCTGCCGCTTTACCCGGNATAC 
SS™2 CCmCTCCCmCGGGG ^ GCCGTGGG CCGCCTmCTCATAGCCTCACC 

GCCTGTAGGGTAATCTCAAGTTCGGGTGTAAGGTTCGTTTCGCTCCCA 
Sequence 1517 

AGGTAC II 1 1 1 II I 1 1 1 1| (I | n | , || , , | GGGACGGAGTCTTGCTCTGTCGCCCAG 
GCTGGAGTGCAGNGGCACGATCTCGGCTCACTGCAAGCTCCGCCTCCCGAGTTCACACCA 
rTCTCCTGCCTCAGTCJCCCAAGTAGCTGGGACTACAGGCGCCTGCCACCACGCCCAGCT 
AATTTCTTGNAI M III I I I I III I ANGANANACAGGGTTTCACCATGTTAGCCAGGGAT 

GGTCTTGATCTTCTGACCTAGNGATCCGCCTGCCTNGGCCTCCCAAAATGCTGGGGA1TA 

CNGGGCNTGGATTACNCACCCCAGGCCCTTATTAAGCTTGCATTTNTAAGGAAAGTTTTT 

TGAAAACCTGGAAATGGTTAANAAACAACCNGGNTACCTTTV^ANATTNAGNATTAAACCT 

AGT^TGGTANTGACNNCAAMGTCTGCCATTTTACCNCAAAGGAATTTTCAAGTCTGG 

GATTTTCAGTCTGGAATTCCCATTGATTAGGAAAAAGGAAAAAAAAAAATGCAAAGTTCT 
TACTTGCTTTGG 

Sequence 1518 

CCGGGCAAGGTACTCCACCTGCCTAGGCCTCCCAAAGTGCTGGAATTACAAGCGTTAGTC 

ACTGTGCCCAGCCTCTCTCTACCAATTTCCACCTGTCCTTCAAGGCCCAGCCGCAGTGCT 

^JJC^GT^GCCTTCCCTAGCTCCCCAGCTAGAGTTATATATTGTAACACCTAGC 

AGTGG ^ CTATGG ^^ GCC " 1 ^ 

I^IT^ GAGCTGGGGTCTCACTGTCACCCACGCTGGA GTGCAGTGGTGTGATCn 

TCTCTGCAACCTCCGCCTCTCAGGCTCAAGCGATCCTCCCCCCGCGTACCTCGGCCGCTC 
TAGAACTAGTG 

Sequence 1519 

AGGTACCAAAGACCTCCAGTGATGCTTATGAACATGTAGCTGCTGTTTGGAAGCTGGAAG 

CCCCATATCAGATGCCTACCACAGAATCTGGAGTTGGGGCAGAGTGCTGAATTGGTCGGA 

GTTACCTGTCTGCTTGATAACTTCAGATGCCCCCnTCTCAGATGGCATTGCTAAGCTCT 

GCAAAGATCTAACTGTGAGGATGAGATAATtTAATCTTGCCCGTGGGCTCTATCCTGGCG 

GAGATACCATGGGACTGAATCCTGGGCTAAATTGTAATCTTGGGAAAGGCAACTATACCT 

GGCCGAGTTGTTCCTAGAGAGAAAGCAGAAGAAGGTCTTCCCAAGAAAGACCATGACAGC 

ACAGAGGCAGTCTGAAGGGAAGCCAAGGAAAGGATTTGAGACCCTACAAGAATGAAGTTG 

GAAGGATAATAAAAGTCCACAGGCAAGAGTACCTGCCCG 

Sequence 1520 

ACCGTGGTGGCGGCTCGGACCGTNGTGGCTCACGCCTGTAATCCCAGCACTCTGGGNGGC 

CAAGGCGGGTGGATCACANANGTCAGGAGATCAAGACCATCCTGGCTAACATGNCTCTAC 

TAAAAATACAAAAAATTANCCGGGCATGGTGACAGGCTTCTGTAGTCCCAGCTACTCGAA 

CAGGCTGAGGCAGGCAGAATGGCGTGAACCCGGGAGGCAGGAGCTTGCAGTGAAGCCCGA 

GATAGCGCCACTGCACTCCAGCCTGGGTGGACAGGAGCGAGGACTCTGTCTTCAAAACNA 

AANCGNTNAGAACCTNTAAAGGGGGGCCTCCGGCNCGCTCTTANNAACNTAGNGGGAATC 

CCCCCCGGNGCNTGCAAGGGAAATTCCGATTATCAAAGCTTTATCGGATACCCGTNCCGA 

CCCTTCGGAGGGGGGGGGCCCCGGTTACCCCAGCCTTTTTNGGTTTCCCTTTAA 
Sequence 1521 

AGGTACGTGGTGGAGCCAGGATCTGTCGGACTCTGGAGTCCAGGTTTGCAGAGAGCCCCC 
GGCTCATCCTGGCCTCTCCGGCATCTCACTTCCTCCCTGCAGGCACGTAGGACCGGCCTC 
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AGCTCTGTCMGATCTGCACTGTAGGACCCCAGAGATTTTTTTCTTTCTTTGGGTAGTCA 
GACCCATCTGCAAAAGTTTTGCAATCATGAAATCAGATAGCGTATGAAATGCTAGGAATA 
.AGCGCAGGTCTAGCATACCACGCGGCCAGCGGTAGCGTGACCTGACAGGAGGAGTGCGTC 
TTCAGAGACATTGGGAAACTGGCGTGCCTGTCGCCCGTCCCAGGGTCAGAGTCTTCGAAA 
CCAGGCTTTCTGGGAACCCAACAAGGAGTTTCTGATCGAAGAACCGCACTGAGGACAGTT 
AGTCI UN MCCACACTGGGCATTTTCAGCATTGGAAGTCAATGGGAGCCCTGCGTCT 
Sequence 1522 

CCGGGCAGGTACAGTTTTGACAGAAGGCACACATTTTATTTCTAMTTTCTGT^ 

-GGATCCTCCAGTTTGAGTCCGTTTACTTTTTGCAAATGGTGGAGTTGGAGACAGTAGAGA 

AAGTTTTGCAGCGTTAAGTATGATGTCACMTTAAAGGTTATAAAAAGCAAATGTGAAGG 

TCCTTTGTAAMGMGTGGTTCACTGGGCGTGAGTTTGAATTTACAAATTAANCnTTGA 

AATTAATGCTAAATATCMCTTATAAACATTTTTCCCTGGAAAGGAAAAAAAAAAAAGT^ 

TTATTCTTACTTCTGGCAAGCCATATTTCTTTTTTATCCACCAGTAATGCCTCT^ 

TTGTCCCATTCAAGGTGAAACGCCCAGTTGGAAAGGTGGAAACTGGAATCAATCTGGTAA 

AGGTGCCATCCCAGGGGGTCCTCCCCTTGAGNCACCTGCTTTGTCCTGAGGAGTCATCAA 

ACTTCCACGTTTGAAAAGCCATGACCCATTGTTGAGGGATCCTCAGGGGAAAGGTTGCCT 

TTGGATCATAGGGGAAACCAGACCAGGGGGCCTTNAAGGGAAAGGGGTNCANTACTTTGG 

GCTGTTATNGANGTTGTCCNATTGTCCAACAAGGGGGGGACTTGGGCCGCCTCTTCCCTT 
TGGGGCCAAA 

' Sequence 1523 

CGACGCTCAAGTCAGAGGTTGGCNGAAAACCCCGACAGNGACTTNTTAAAGATACCAAGG 

CCGTTTCCCCCTGGGAAAGC7TCCCTTNGTGCCGCTCTCCTGTTCCCGACCCTTGCCCGC 
NTTTACCC GGGAA TTACCCTTGTCCCGCCT^^ 

GGGCCGCCTTTTCTTCMTANGCCTTCACCGCTTGTTANGGGTTATTCTTCAAGTTTTCG 
GGGTGGTAAGAGGTTCGTTTTCGCTTCCAMGGCCTTGGGGCCTTGTTNGTTGCCACCGG 

AAACCCCCCCCCGTTTCANGCCCCCGACCGGCTTGGCCGGCNCTTTATTCCGGGTTAAAC 

CTATTCCGTTCCTTTGGAGGTTC 

Sequence 1524 

CGGCCGCCCGGGCAGGTACTGCCCTAGTCCACAGCTGAGTGGGGCTGGAACTAAGCTACA 

GGGTTGCTTCAAGGCCTACAGTCAAGACTTAGGTCTTCGGGCCTAAGGTGTCACAAACCT 

GCATGTCTCCCTTCAGGTCCTTAAGTTAGCGGGCCTGCTGTATTGAGAGGGGCTGGAGCC 

ACTTTTCAGGGGTGTTTCAGGGACTGATGAGGGACCTAGGTCAGGTGGCCTGTTTGCAGG 

GCTATGGACAGCCCCATCTCCCTCCAGGATGCTGGGTAGGCAGAACTGCTCTCTGACCAC 

AGCTGAGAGGGGCTTATGCCAAGATTCAGGGTTGTTTCAGGACCTGCTGTGGGACTGAAA 

TCTGCCTGCCTGTCCTGAGGACACAGAAGGATGTATCTCCTGACTACAGGAGGATTTCTG 

TGCAGGATTTCTGTMGACTGTGGCTTGAGTGGAGCTGMTCCCCGATTCAGAGCTTTTT 

CAGCATATGCTATGAGACAGAGATTGGCAAGTCTGTACCT 

Sequence 1525 

NCCACCGGNGCCCGTACTGGCCGCCGCCAGCGACAGCGGCGCCTCGAACAGCGACGGCGT 

CACGAATGTCACCCGTCCGACCATCACGGGCCGCGCCGAAAGCGGCAGCACCGTCACCCT 

GTACGATACGGATGGCGTCACCGTGCTGGGCACGACAGTGGCCACGGGCGGCGTGTGGAG 

CATCACCAGCAGCGCGCTGGGTGATGGCATCCACAGCCTTAGCGTGAAAGCCACTGATA 
Sequence 1526 

CCGGGCAGGTAC I M I II I M I M I I I I i I I I CGGGC CTGCCTATTTCCGACTGTCTCTC 

TCTGTTACTCGATGTCTCCTTTTCCTGTCTACATCCTAAACCTCCTGAACCCGACCAG7A 

ACATTCATTGCTAAGCAGTTGCTAACACCTGTTCANAATCTCCTTCAAATATCTGCAGGT 

TCCAAGATTTCCAAAAACTTACCCTTATTCACAATACACCAGCTACCACAACCACTGGCC 

ACTAGGGAGGCAAAATAGCATAGCAGTCATGGATGTGAGAGTTGTACTGTTTGGAGTCAG 

GCTGCCTGGGCTCAAATCCCACCTTTCATCAGCTATGTGACCTTTGGGCAAGCTACCTAA 
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CTTTGCTATGGCTTAGTTCATCTmGTACTCCAGGMCAACAGTAGCAACTACTTCCTA 
AGTGCTATGMCTGAATGTTTGTGTCCCTGAAAAAATGTATATGTTGTACCCCTAAATTC 
CACTGTGATGGTATTTGGAGATAGGGCCTCTTGGAGGAA 
Sequence 1527 

AGGTACAATG TGG G ACTTTGG TGGAACTG CCTGGGAG AAC TTC ATAATTACTACCC TGTA 

TGTCATGCCCCTTGCAGGTAAAACAGAAGTGGCAGAGCAGAGGTCAAAGGCACAGATCAG 

CAAAGGGAATCCTACTGGATCCTGAGACTAGCCTGGAAGGGGTGTCATTTGTCACTGGGA 

ATAGAGGTGCACGGCCTGGTGGACCCTCCGAGAGAGCTTAAGATTCATTTTTAAAACANA 

GGATTTAAAAGACACAATAGGCATTGGAATCGGGTAGTAAGAAGAGAAAACCAGAGCCCC 

AAGTGAGGAAGTGGGTGATCTGTCCTNACACAGTTGGTGGGGGAGCTGGGC 
Sequence 1528 

TGGAGCTCCGNGGCciGCCGCCCGGGCAGGTACAGGTCATGGTGAGCAGGTGTTCTGAGGG 

AAGACAAAGGAAAAGCAGAGGGAGTGTTGACAATTCTGAGCTTCCATATGGCAGACATTC 

GGGGCCCGTTGGCATGGTCCTCAGAGCAGCAACAACAGCATCAATTGAGGTTCATTAAAA 

TGCAGAATCGCAGGTTCATGTGGACCTACTGAATCAGAACCTGCATTCTAACAACAGTTT 

TCAGTGGTTCTTCCGCACATTAAAGTTTGAAAAGCATTGGTCTGGAGGAGGAGGCTCTAC 

AAAAGGGTTGGGTATTGAGGAGCCGAAAAGACAACCTGGAACTGAGATTCCAGGGATGAC 
CTGAAAACA 

Sequence 1529 

A GGTACATG ™ ™ GC mA ^^ 

Allium rAGAATTTTCTGTGTTTATTCAAATTTATTGAACTTAATTATTTAATATAA 

GAGAACCAAGGAACCACTGAACTTTGAGMCACMCCGAAATGCTAACTTTAGTGTTTTA 

TTTGAACTAGTGACTAAAATTTCTATGGCMTATATGAATATTTTGAGAGGTTAGCCCAA 

ATTGTMGAAATTMTGGTAAGCTGGCATGTTTTTGTAGAAAAAAAATTT 

AATTATATTTAG CAATTTTAAAACTTTTACCTAGATTCTGTTTACTTGCTG AAGTGG CAG 

TAl I I lACCATGTATATCAATCTmAAMmCTTAAACAGTTCTTCCATTTAAGAAGT 

AACCAAGCCAGGTGCAGGTGGCTCACATNTGGTAATCCCAGCTACTCAGGAGGCTGAGGC 

AGAAGAATTGCTTGGAGCCCCAGGAGGCGGAGGTTACAAGTGAGCAGAGATCGCGCCATT 

GCACTCCAGCCTGGGGCAACAAGAGCGAAAACTCCTTNTCAAAAAAAAAAAACNG 
Sequence 1530 

AC I u ll I CTTTACATTTATAATGTGGCTGNNAAAAAATCTTTAATTATGTGGCTGTGTN 

ACACTGGTGACTCATATTATGCTGCTTTCAGACAGCACTGACTAGAGTGTGAGCCCCGTC 

CTCAAGTTGCTTAGAGCTTAACACAGGAGACACAAGTGGAAACCGCTGTATATAAGCTGT 

TTGACAGGGAGCTGCATGGTGCAGAAAAGAGGAAACTACCTCTCTGGCAGTGGAATTGCT 

TGTTTGCTGAGAAATAAAGGGANANCTNNNANNAGNNTCNANTNATTNAAAAAATGTGTA 

CCCTTCGGCCGCTTCTTAGAAACNTAAGTGGGATNCCCCCGGGGCCTTGCAGGAAATTTC 

NNATTTTCNCNGGCTTTTATTCTGATAACCCGTTCCGACCCTTTNAAGGNGGGGGGGGC 
Sequence 1531 

CCGGGCAGGTACGCGGGGA GAATAG AGTGGCTGACTTGCTGATTGAGGTCGGCAGCTCCC 

GAGAAGCAGGAGAAGCAGAGTTTTTGCAGACAGACCCACGAAGTCAAGCATGCGGAGTGC 

AGCCAAGCCCTGGAACCCAACCATCAGAGCAGGGGGCCACGGCCCAGACCGGGTGCGGCC 

TCTGCCTGCAGCCTCTTCCGGCATGMGAGTTCTAAGTCTTCAACTTCCTTGGCTTTTGA 

GTCCCGACTCAGCAGGCTCAAGAGGGCCAGCAGTGAGGACACGCTCAACAAGCCAGGAAG 
TACCT 

Sequence 1532 

CGCGGCGGCTTACAGGCATCAGCCATCACACCCATCCATAAACATTATATTAATGTGCNC 
ATG AC AN AAC TG AG AC C TAAN T TG C TTATG ATAAAATG AAATTGG AAG ACTAGCT AAC AT 
GAAAA TTTATATTTTGGCTT TTrCAT GTTTTT^ 

TTTTTGATGTTTAGTAATGGTTTTTTATGTTTGTTTTTT I I lllllU IGNAGA 
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CGGGAGTCTCACTCTGGTCGNCCAGTCTTGGAGTGCTTAAGTGGCGCCCGAATCTTTGNG 
NCTTTAACTTGGCCMGCCTTNTCAACTTTTC 

TTCACTTGNCNNTTCAANNCCNTTCNCCCGNAGGTTAAGCCTTGGGNGAACCTTACCAAG 

GNGGN CGGCCAATTGGCCCAAACTCAACCGCCCCCCGGGGNTTAAAl 11)1)1 \ GGTNAA 

TTTl INNTMGTTAAGNNAGTMCCNGGANGTTTTTCNACCNTGGTGGTTTTGGGCCCA 

TNGGAATGGGNTTCTTNCGNAAACTTCNGAACCCCTACGGNTGGAATCAANNCCNANCCN 

TTTTGNGGCCCTTCTTCCAAAMGTNGGCCTTGGGGGAANTTTACCAAGAGGCCNNTTGN 

AGCCNNNCCCCNCCTCCCCNTGGGNCCCTNATTGNTTTTAAAATAAAATGGTTTTGCAAA 

ATTA GNGGG ATTGGGGAMTTMNTT7TNTNGNTTNM 

AMNTTTTTGAAACNTACNNCCTGT6TTTTTAAMTTTTTTTM 

Sequence 1533 

GNGGTGGCGGCCGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCAAGG 

CGGGTGGATCACGAGGTCAGGAGATCAAGACCATCCTGGCTAACATGTCTCTACTAAAAA 

TACAAAAAATTAGCCGGGCATGGTGACAGGCTTCTGTAGTCCCAGCTACTCGGGAGGCTG 

AGGCANGANAATGGCGTGAACCCGGGAGGCAGAAGCTTGCANTGAGCCGAGATAGCGCCA 

CTGNACTCCAAGCCTGGGTGACANAAGCCGAGACTCTTGTCTCAAAAAAGGAAANTTTNA 

TMTTAMGGNGTTACCCTTTGGGCCCGCTCTTAGAACNTAGGNGGGAATCCCCCCGGGG 

CTTGCCAGGGAATTCNATATCMNGCTTTATCGGMNACCCGGNCCGACCTTTCAAAGGG 

GGGGGGGC 

Sequence 1534 

CCGGGCAGGTACTCCACCTGCCTAGGCCTCCCAAAGTGCTGGAATTACAAGCGTTAGTCA 

CTGTGCCCAGCCTCTCTCTACCAATTTCCACCTGTCCTTCAAGGCCCAGCTGCAGTGCTG 

CTTCAACTGNNAAGCCTTCCCTAGCTCCCCAGCTAGAGTTATATATTGTAACACCTAGCA 

^^I5ZT^ ^ GTGTTTAGTG G AAACTATG G AATTGCCTTTTAAAAATTTATTTATTTATTT 

Au 1 I I iGAGCTGGGGTCTCACCGTCACCCACGCTGGAGTGCAGTGGTGTGATCTTGGCT 

CTCTGCAACCTCCGCCTCTCAGGCTCAAGCGATCCTCCCCCCGCGTACCTCGGCCGCGAC 

CATCGAGCGGCCGCCCGGGCAGGTACTGGATCAAGTTTCTCCTGCGTGAGGTATGGGTGG 

CACTCAACCTGGTTAAGTCACTGGGTTATATTTCAGTCCANGTTTGTTCAAGAACCTCTC 

NATCTGGMGTGGGTGAAATTTGANACCCCAAGGGCTTTTNAACCAGCCCCCTCGTNCAC 

CMCTTNCTTCATGGGCCTCCCCAGGCATTCCAAANAACGTTTCCTTTTTCCACTGAATC 

ATATTACCCTTTATCAATCTTTTGGGG 

Sequence 1535 

CCGGGCAGGTACCACAACTGGTGAACCCAGCAGGAAGGTGGAGAAAAATTTCATGAGCAC 
CGAGACCGTAAGATCAGGAAAGGGGAAATCCAAGGGGGGGTCCCGGGCTGACCATCACAT 
TTATGTGGTGCCCAAAAGCCACTATCCTTGATGGATGGGACCCAATGTATGAGTACCT 
Sequence 1536 

GAAGCCGCCATGGTCGACCACGGCTTGCAGGGTGCGCCATTGATCAAGGGTAACGCGGGG 

CGCTTTCAAGATGAGCTCCTCTTGTCCTAAGCTGGCGGTCCTTATTGGAGACTGCCGAAT 

GAGAAMTTCTGTTGTGTGCTGCTGACGCTGCTGCCGTTGAGTGCGTTTGCATTCAGTAC 

CGATOTGACGAAAGACATTCAGGGCGTGAAGATCGATTACAGCGCGTCGGACGTCGACAG 

CAACATCAGTTCGATCCAACTCACCAACTACGGCCCCAATGACGCGGAGTGCAAGGTACG 

CTTTACCAATGGTCCGGAAGCGCCGCGTACCCGCAAGGTCACGGTCGCAGCAGGGAAAAC 

TACCAACACCACGGTCAATTTCAGCCGTGCCATTATCAAGATGCGTATCCAACTGACCTG 

CGCACCGAAATAAGGCAAAAGCGGAGGACGCCTACAGCAATACTCCGCTTATAAACAAAT 

TTCTAGATGGGTTATTAGCAA I I I I T I ACGCTTTTTTAA 

Sequence 1537 

CGGTGGCGGCCGANGTACGCGGGGNCAGGCCATCTCGCTATAGGAAAGGAAAGNGGAACA 

GCATTCATCCTCMCATTTTTACGAAGACAAAATGAAGGACTGGAGTAGANCACTGATCA 

GTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAGCACAAGT 
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GAAGCAGAGATACAGGANATAGGGAAGGGAAGCTGGAAGCAGANGTCACTGGAGGGAGAG 

GGAGATGGACACATTCAGGGCNTACAAAGCAAGTTCTATGTGATTTTGCTCACTNTCTCA 

ATTGTGGGGGACCCCTNTNAAAATGTTGTACNCTGCCCGGGGCCGGCCCGCTCTANGAAC 

TAGTTGGAATCCCCCCGGGGCTTGCCATGGGAATTCNGATAATCCAAGCCTTATTCGATA 

CCCGTCGACCCTCCGAGGGGGGGGGCCCCCGGGNTCCCCAACCT7TNTGTTTCCCTTTTA 

AGTGGAGGGGTTAAATTGGCGCCGCCTTGGGCGTTAAATCNATGGGTCCATAAGNCTGGT 

TTCCCTGGTGGTGGAAAAATTTGGTTTNATTCCCGNTTCACCAMTTTCCCCCGNCAANC 

AATTAACCGAAGCCCCGGNGGANCCATTAAANANGNGGTCAAAANGCCCTGG 

Sequence 1 538 

AGGTAC AG ACTTGC CAATC TC TGTCCC ATAGC ATATGCTGAAAAAG CTCTGAATCGGGGA 

TTCAGCTCCACTCAGCCACAGTCTTACAGAAATCCTGCACAGAAATCCTCCTGTAGTCAG 

GAGATACATCCTTCTGTGTCCTCAGGACAGGCAGGCAGATTTCAGTCCCACAGCAGGTCC 

TGAAACAACCCTGAATCTTGGCATAAGCCCCTCTCAGCTGTGGTCAGAGAGCAGTTCTGC 

CTACCCAGCATCCTGGAGGGAGATGGGGCTGTCCATAGCCCTGCAAACAGGCCACCTGAC 

CTAGGTCCCTCATCAGTCCCTGAAACACCCCTGAAAAGTGGCTCCAGCCCCTCTCAATAC 

AGCAGGCCCGCTAACTTAAGGACCTGAAGGGAGACATGCAGGTTTGTGACACCTTAGGCC 

TGAAGACCTAAGTCTTGACTGTAGGCCCTGAAGCAACCCTGTAGCTTAGTTCCAGCCCCA 

CTCAGCTGTGGACTAGGGCAGTACCTGCCCGGGCCGGCCGCTCGAGTCGCGCCCACAGCT 
GGAAGG 

Sequence 1539 

AGG TACCTG ACTTTGGGTAAGTTGCCAGAATAAGATCATCAGGCTTGGCTTGGAAATTAC 

ATACTTTTTCCCACCAT TCTTTTGA TAATATCAACGTAGGGACTCCATCTACTTCCATGA 

TGTTAMCAGTTCTGGCTTTTTTTCCATC^ 

GGAATCTGGTATCTCAGGCAGCAGAGGAAGCAGC7TGCTGCCTTTAGTCAAACTGCTTCC 

TGGAAACCCAGAGCCAGGTCCAGCTCCAGGACACTGTGCAAGCAAGAGCACAGCAGGTCC 
CCCGCGCACC 

Sequence 1540 

TATCCACAGAATCAGNGGGGATAACGCAAGGAANGAACATTGTGAAGCAGAAAGGCCCAN 

CTANANGGCNCAGGGAACCCGTNAATAAAGGCCCGCGTTGCTTGGCCGTTTTTTCCCATT 

AGGGCTCCCGCCCCCCCTGNCGAGNCATTCAACAAAANAATCGACGGCTCAAAGTTCAAG 

NMGGNTGGGCCGANAACCCCGGNCNNGGGACTTATTNAAATTATACCCCAGGGCGGTTT 

TCCCCCCTTGGGGAANGCTTCCCTNCGGTGGCCGCNTCTTCCCTGTTTCCCGAACCCCTT 

GCCCGNCTTTACCCCGGGAATACCCCTTGTCCCCGNCCTTTTTNTTCCCNTTTCCGGGGA 

AAGCCGTTGGGCGNCT7TTTCCTTNAATAAGCCTTCAACGGCCTTGTTNANGGTAATTCN 

TCCANGNTTCCNGGTTGGTTNAGGNNNACGGAATCNGCCTTCCAAAANACCTGGGGGGCT 

TCGNTGNTGCCACCGAAAACCC 

Sequence 1541 

TCCACCGCGGTGGCGGCCGAGGTACTCAAGAAACCCAGGCCAGAGTTACTGGAGCCCTTC 

CTCACAATTGGCCTTGGCTCCCTCTCCACTTGTGTG7TTTCACATTGACTTGATCTTCAG 

TGGAGTAACTCTGAGCAGTGGCTGTAGCTCAGCAGGTAGAAAAACTCTCTAACCCTTGAA 

GAGGGGAATACAGTAGCACCATTGAAAAAAGAGGTTGTGTATAGAGTTTCATTCTTAAAT 

TTGCTTTGCCAAGGAGAAAGCCTGTGAGAGTGAAAGCAAGGTGGAACTGGGGTCCAAAGA 

GAGGGAGTAAACTAGATGCCATATTMTTTGTATTTCTGGAGAATATGACTGCAGTTTAT 

ATATAACTTGGTTTAAACATTTTTGTAACCTGC 

ATTTTCATTTGGAAGAAAAGATTAGGAAGACAGTATAGTTTTGGGGACACAAGCATGGCT 
ACCTACAGGAGACAGGTCNGAAAGGAAAATAAGTGATGCAGCCAGCTGAGTACCTGCCCG 

Sequence 1542 

GACTAATANAANGATACCAGTGCCGTTTTCCCCCNTGGAAAGCTTTCCTCGTGGCGCTCT 
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CCTNTNTCCAGACCCTTGCACGCTTACCNNGATTACNCTGTCCCGCCTTTTCTCCCTTTC 

GGAGAAGCGTTGGGCGNCTTTTCTCATTAGCCTCACGCCTGNTAAGGGTAATCTCAAGTN 

TCNGGATGNTAA GGTT CGGTTCCGNTTCCAAAGNNTGGGGGCNTNGTGTTGNCAACAGTA 

AACCCCCCCCCGTTTTCCAAGCNCCCGNACCCGGATTGCCNGCCCC7TTAAT 
Sequence 1543 

AGGTAC i u n ii i i u IGU I U I I I I AAGAAAAATACCTGGAAAAGCCTTGCTTCTAA 
TGTAGGAATTTTTCCTTACAGTGAGATCTATGTAACT^ 

CAGCTGCCAGGAAA TTTGG TATTACATGAGCATTTCAAATAAAGCAACAGTGTATAATTT 

TAAGGCCCA CATTA G TTTTC ATTTTTCACTACTTTTGACACCCT^ 

TCACTTTATCTTTTGCTTTTAATTTGTC 

MTGAGATGGGGCAGAATGGAGGAGTATATCCATGTMCTTGATTTTATTCCACTTTCCA 

ACTATATCTTCTTTCTGTCTTAAA 

Sequence 1544. 

TGGCTTCATAGAATGAGGGAMTGTTTATCCTTCTAAAMTATTTANTANAGNGGNCTAA 

GATGAGAGTGTAAGTGAGTAMCTTTCTCMTCAMTATTTTGNGATATGTTTCAGGAAT 

NTCATTACATGTTNGGTTCAATTTTTAAAGCATTTNTTTTTATTAT^ 

ATTGGTGACCATCCATGTGCTGCTTTTATTGCCAGCTTAACTTGTTTTTAT^ 

AGGMGAAAACCTCTGNTGTCTATTTMTGGAGATGMTTGNTAAACTTTTACCATTTGC 

AACTATACCTCTCTATGAGTCAGACTTTTATTGATCGTGTACAACTAANTGAAACCCGGA 

AATGAAGG AAATGTAACANGCG ACCNTATGTATTCTTN GTN NGCN CAAATTGC CTTTATA 

AAANAGATTGATNCTCTATAATNAGCAAATGTTGTGGATTTGGAAATTT 

Sequence 1545 

CCGCNGTGGCGGNCGAGGTACCAAAGACCTCCAGTGATGCTTATGAACATGTAGCTGCTG 

TTTGGAAG CTGGAAGCCCC ATATCAN ATGCCTACCAC AGAATCTGG AGTTG GGGCAGAGT 

GCTGAATTGGTCGGAGTTACCTGTCTGCTTGATAACTTCAGATGCCTCCTTTCTCAGATG 

GCATTGCTAAGCTCTGCAAAGATCTAACTGTGAGGATGAGATAATTTAATCTTGCCCGTG 

GGCTCTATCCTGGCGGAGATACCATGGGACTTGAATCCTGGGCTAAATTGTAATCTTGGG 

GAAAGGCAACTATACCTNGCCCGAGTTTGTTCCTAGGAGAGCAAAGCATAANGAAGGTCT 

TTCCCAANT AAAG GACCATGNCAGGCACCAAGAGGGCAGTCTTGAAANGGGAAGCCAAGG 

GAAAAGGGATmGAGAACCCTACCAANGMTTGAAANGTTGGTMGGMTAATTAAAAA 

GTCCACAAGGCAAAGNAGTTNeCTGGCCCCCGGGGCCGGCCCGCTTCTTAGAAACTTAGN 

TGGGMTCCCCCCCCGGGGCTTGNCAAGGGGAAATTTCCNATNATTCAAAGGCCTTTTAT 

TCGGATTAACCCGTTCNGAACCCTTNCAAAGGGGGGGG 

Sequence 1546 

ACGCGGGAGAGTGGCTGACTTGCTGATTGAGGTCGGCAGCTCCCGAGAAGCAGGAGAAGC 

AGAGTTTTTGCAGACAGACCCACGAAGTCAAGCATGCGGAGTGCAGCCAAGCCCTGGAAC 

CCAGCCATCAGAGCAGGGGGCCACGGCCCAGACCGGGTGCGGCCTCTGCCTGCAGCCTCT 

TCCGGCATGAAGAGTTCTAAGTCTTCAACTTCCTTGGCTTTTGAGTCCCGACTCAGCAGG 

CTCAAGAGGGCCAGCAGTGAGGACACGCTCAACAAGCCAGGAAGT 

Sequence 1547 

ACTTTTTTTTTTTTTn I AMCCATTTCAAGAGAGTCTCTGGATTTCTTGATCCTTGATT 

TGTGGTTTTCATGTTGTCTGTGAGCTCCAGCAGGGCCTCATCAGCCCTGTCCGTGGTGCT 

GCTCCAGCTCTGTGCTCCCCGATAGGTGGCCCTGCGCTCTATGCACATCCTTCTTAACAT 

AGGAAAGATGTATGCCCTTCAAAGAAAGGCATTCAAGACAGATCAGCACCAAGTGGAGGC 

TCTACCCACAGATGTTAATGTGCATTAAGAAACAGCGTTACTGAGAAATCTGAGCCATCA 

TTCCAGAGTCTCTCTCCTGGGGAGAAGTTTGTT^ 

TTAATTCANAATTTGGAACTAGAATTATTTGTAATAAACCTATCAMGCCTAGCCTAAG 

GCTGCCATCTTCTTGTCAAAATTCTAGNTNGTTTATACCAGTANAAAAAAAGACANTTT^ 

AAAGCTCTTCAAAAGCTGAATCCTCCTCAATAGTAAGGGGACTGTGTGAAAAGTTTTm 
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TATGC TGGGC TTANCCAACCCTTTC 

AAAACTTTTTNGCCCCNGANCTTrTTANNAACTATlTAAAAATAGANGTTTNTNATNGTN 

TTNANNTNAANTGGAGGGGTTTAAAAAATTGTTTCTTTTTTTA 

Sequence 1 548 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGCACTTGTGAC 

CMCAGCTAGTCTTGCTTTCAACAGCTAGTCTTGCTTTCCCTGATGGTAGATCCCTACTA 

AAAGC1 CTTG AAAGTGTGCATATGCAGGCTACCTAATGATTTAACGGAGGATCAAGATGG 

TTACACTGCAGAGAAACAGGAGGGATAGGGCATATGCCCTTTGTCCAGCAAGACAGACAA 

/W\CTAGGAACTTGAATTCTCTTCTCTGGATTCCTGTATCTGGAGCCAAGGCCAAGGATC 

AGTTCTTCTAGAAAGTTCCCTCCTGTGTGTGTGTGTGTAAATGTGTGTGAATGCAGTCAT 

MCTTCCCTGCCAAAATTCATTGCCAAAMTTTTGTGA7TTGAAGCAGGCCTTACCTGTT 

AATGCAAATTTGATTCTCAAATCCTAAGAGTTTCCAAAGGTTGAGTGAAGTGAGATGAGA 

CTCTTAGTTGAAGATCCAGATGCTTCACTGTTGCTATCTAGATGCTACCAAAACTCCCAG 

CACACAAAGTATATGAAGAGAGGCTCTANGTTCATGAAATGTTAANAGTTTGGAAAAATG 
GGCCTTTGGGGT 

Sequence 1549 

AGGTACACATTTTGAGGGGTCCCACAATTGAGAGGTGAGCAAATCACATAGAACTTGCTT 

TGTAGCCCTGAATGTGTCCATCTCCCTCTCCCTCCAGTGACCTCTGCTTCCAGCTTCCCT 

TCCCTATCTCCTGTATCTCTGCTTCACTTGTGCTACCTTCTCACACCACATCTTCCAGGA 

TTACACT7TTATG CTACACC TGCACTGATCAGTCTTCTACTCCAGTCTTCATTTTGTCTT 

CGTAAAAATGTTGAGGATGAATGCTGTTCCACTTTCCTTTCCTATAGCGAGATGGCCTCC 
CGCGTACCTGCCCG 

Sequence 1550 

AGGTACTTTTAGTGGAGATGGGGTTTCACCGTGTTAGCCAGGACAGTCTCGATCTCCTGA 

CCTTGTGATCTGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGT 

GCCCTCTTTTATTCTCAGTAATACACTCTACTCCGAGAAAAATATTGGATTTAAATTGCA 

A ^ GAT C TGGCTAAAAATTTTATCATCACTGAMTTCCTAGMTGTGAAAAMGM 

TATTTTTCTGTAATGTGAGTTTCACTGTAGCCTATCAAAGCACATGGATGAAATCGGAAA 

TGCCTAAAATTCTGAGGTTGGCAGAAATACACACACTCAAAGGACATGCAATCACATAAA 

CAGAACTACTTTATCTTGACAGCATCCAGGCTGCCTCCATACAAATGTAGCTTCATGTTG 

GTGGC TAGTCCGCA TTGCATTTTCAGAGATAGAAAGAAGAAATTCCAGAAAGGGAAAACA 

TTGGCATTTTGTmACAGTAACTCTTTACCTCGCCTTTAAAAAGACATTGTG 

TACATGATACAGCCAATTTCAAAGGACTTATTCTGGA 

Sequence 1551 

ACGCGGGGTGAAGAAGAGTATTCCTTTCCTCTGAGAAGTACACTTGAAAAAATCCTGTAA 

CTCTAGCGTANACTACCAGTTATTGACTTCCAACATTTACTCTGCTTTCCTTTTTCTAA^ 

ATTGATAAAGTCCAAACCTCTCTTGTTTGGATTTGAAATTACTATGTTTTTTGGCCNGGC 

GCGGTGGCTCACGCCTGTAATCCCAGCACTTTTTGGAGGCTGAGGCGGGCAGATCATGAG 

GTTTAGGGAAATCNAGTACCATCCTGGNTAACATTGGTGAAACCCCATCTNCTACTAAAA 

ATACAAAAAANATTTAGCCAGGCCGTGGTTGNGTTGGNGCNCNNTGTTAATTCCCCAGGC 

TNCNTANCAGAANGGCTGGAGTGCCAGGGANNAAANTGGGCCGTGAACCCAANNANGGCG 

GGAGGCTTGGCAGATGAAGCCCGAANATTCACACTTGNTTGCACCTCCCAAGCCTGGGGG 

TGNACCANGANTCCAAAGGAACTTCCCATTCTTTCAAAAGNNAANNNTTGANTANNNACA 

AGANANNNNATTTGGGTTNNCCCTTGCNCCCGGGGGCCGGGCACNGGNTTCTTAAAAAAA 

CNAATGATGGGAAACCCCCCCCCGGGGGCTTGNAAAGGGGAANTTTTCGAATTTTCCAAN 

GCCTTTATTCCGAANNACCCGGTCNGNAANCNTTCGAANGGGGGG 
Sequence 1552 

AGGTACACATTGAAATCTGCAACATGCTGGGACTGCAGAGAGCCTGGGCTGGGAGTCGTG 
AGCTCCACCCGGCTGTTTTTATGACAGCTGGCAAAACACGCTAAGGCACATTGAGGACTC 
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AGTGAGGCAGGTTCCTGGGCCTCTGTTAGGCTAAGTTTGAAGTTGTTCCATCACTGCCTC 
CCTCTGCCTTCTTTGCCATTTCCATCCATGATAGCACTACCCTAACTAGTGATGAGAAGC 
AGTCTTTGGAGTTCCTCTTCTCTCCTGTCCCTCATGGCTMTTTCCCTGCTCAGACTTCT 
GCCTCTGGCTCTCTTTCCTTCTGTGCACACTGAC I Mil I CCCTCTCCTGTGCCGAGCTG . 
GAGTCGAGCGGCCGCCCGGGCAGGTACACTGCCCAGGCAAAGGCGTCCNGGGCAGCGTAG 
GCNGGCGACTCAGATCCCAGCCAGTNGACTTAAGCCCGTGGTTTGCTCCTCCGAATAACT 
NGGGGG TGACCTTGGTTAAATA7TCACCANGCAGCCTTCCCCCGTT 
Sequence 1553 

CCAGGGMGGATTAAAGAAGAATTTAACATAGGGAATAAATGCTAAAGAAAACCCTCAAC 

MTATACATGTATAGTTAGAMGAMTACAAMCTATTGGTTrTTGTTTGANACAATTCT 

TCCTCCATGGNGGTTTCATACCCATGATCTACAGTTGCCTATTGGATAGTCCAAGGTTTC 

TCATCATTTATGCTAGCAAATTATGATTGCCTGGAAGTGCATTTTAAATTCJ I ] I IATTT 

GAGATGTATAGAATCTTGCTCAMTGGTATmCCAAACTGTTGAAAAATAGGGCATTTT 

TTAANAAANAACACTTTTAC I MINI I GAAAAANANAAGTTTTCTCTGCAGATTGGTCT 

ATAGGNGCGGGTCTTCTGGAAAATTG I I I I I I CTGTAGAGAGTGAACATATGTTTGGTAT 

GTGTGTGTCTGAGTGTGTTTGGGGGTGTCTCTGGGTTTCTGGGGGTGNGTGGGGTCTGCT 

GNGGNGTTTGGGGNTTGGNGNCNGGGANNANTGGGGAGCCGGGGGNGAANTGGGGCCTNG 

GGGNCCTGGGGNCTGGGG 

Sequence 1554 

CCGCGGTGGCGGCCGAGGTACGCGGGGACCTGCTGTGCTCTTGCTTGCACAGTGTCCTGG 

AGCTGGACCTGGCCCTGGGTTTCCAGGAAGCAGTTTGACTAAAGGCAGCAAGCTGCTTCC 

TCTGCTGCCTGAGATACCAGATTCCCAATGGCGAAGATTGAGAAAAACGCTCCCACGATG 

GAAAAAAAGCCAGAACTGTTTAACATCATGGAAGTAGATGGAGTCCCTACGTTGATATTA 

TCAAMGAATGGTGGGAAAAAGTATGTMTTTCCAAGCCAAGCCTGATGATCTTATTCTG 

GCAACTTACCCAAAGT 

Sequence 1555 

AGGTACTTTN I I I I II II I I II I I M I I I I I I ! GNGATGGAGTCTCACTCTGTTACCCAG 

ACTGGAGTGCAGTGGCATGATCTTGGCTCACTGCAACCTCCACCTCCCGGGTCCAAGTGA 

TTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCATGCACCACCATGCCCAGCT 

AATTTTATATATATTTTTAGTAGAGATMGGTTTCACCATGTCGGCCAGACCGGTCTTGA 

ACTCCTGACCTCAGGTGATCCACCCGCCTTGGCTTCCCAAAGTGCTGGAATTACTGTGCC 

TGGCCTAGTCATTAATAT1TTGATTAACGCCTACCCCTGTGATCAACGACAACTTATTCA 

GGMGAAGGGTTCTTTCTACTCTAGTATGCTTCCAGTTATTTACTGNGTATCTAGCTAGG 

TTGNGAAAAGAAAAGAATNTGAAGCNCGNANTTTATGAAACCTAACTGGGCTATCATCTA 

CTTTACCAAATTTCTTTTAAAAAAGCAACCNTC 

TTTMCTTTAAAGGCCCTTCACATTAAGTTGGGATTTTTTATTCATATGGAG/^ 
TNTTTTATGACCTGTGCNCTTTAMGAAATGATGAGG 
GGMTTAACNAGTTTTTNTAATAMTCCCCAAAAAAAAAAAAACCCAAAAA 
Sequence 1556 

TGCTCI I II I I I I INTNTGTCNTCNCTNCCACTCTGNAACNNGAGAGCCTGIMNTAACCNG 

CCNGTTTCCTCTCCTGNNTTGATACACACCTGCNTNGNNGANNNNGNATGGNNNNTNGAA 

NNCTNATNNNGNGAGNCNTNGGGGTNNCANNGGNAGNCTCGGNGGCTAACNTCCATAAGT 

TGATAGGGCAGAACCATTCACAATGGGCACGCTTTGCCGGGCANCAACTGTACTGTGCCA 

TGGGGCCCGGGACCGTTTTTAMGGGATTGAGGACAATTTCCCCTTGCANGCNCANGAAT 

TTCTTTTCNAATTAGGGCCCAANAAANGCNTTNCNTTTCCCMGGGAAAAAAAMC 

CCTTTCTTTNCAAAATNGTTGGAATAATGGGGAAAGGNCCGGGCCCAACCAAGGGAATTC 

CTAACCCCAAGGGGGGGGGTTCCCTTCCCTCAACNTTTCAATNC 

Sequence 1557 

AGGTACATGTTATTAGCTTTAGCAGGTAGTTCTAATAAAAGAGTTCAGTATGCTNTTTAA 
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ATTTTTTTI I AGMTTTTCTGTGTTTATTCAMTTTATTGMCTTAATTATTTAATATAA 

GAGAACCAAGGAACCACTGAACTTTGAGMCACMCCGAAATGCTAAC7TTAGTGTTTTA 

TTTGAACTAGTGACTAAAATTTCTATGGCAATATATGAATATTTTGAGAGGTTAGCCCAA 

ATTGTAAGAAATTAAT GGTAA GCCTGGCCATGGTTTTTGTACGAAAAAAAAAATTT^ 

CTTACACATAATTTATATTTTAGCAATTTTTAAAACNTTTTACNCT^ 

TTTGC TGG AAGTGGCC AN GTATTTTTAC CCATGGTAN TATTCAATCN TTTTAAAAATC TT 

CTTAAAACAGGTTCTTNCCCATTTTAAAGAAGGTAAACCCAAGCCCAGGTNCAGTGGCTT 

CACATCTGTAATCCCAGCTTACCTTCAAGGGAGGCCTGAGGGCAGAAAGAATTTGCTTGG 

AGCCCANGANGCCGGAGGGTTACAGTTGAGCCANAGGATNCGNGCCCATTGGCAACTTCA 

ANCCCTTGGGGCCMCCMGGAGCCGAMCCTTCCTT^^ 

GN G AAG AACC AACC AAANCC AAAATN N CCGGG AATTC CTTTTAN N ATTNTTAAATATN GG 

AAAAAAAACCTTCCANTTTANTTACCGANGGAGGACCTNN 

Sequence 1558 

AGGTACACCTACACTCCAGTCTGGATGACAGAGCCGGGCCCTGTCTCAACAGCAACAATA 
ACAAAATGTCAATGTAATATTTTGACCCACTCTCACGTGTC 

TGTAA GGATA TAATAATGT TATTC GGGCTGCAGAAAGGTAAACCAAATTTCTACGGATAC 
ALA^TTTTGGGTArTGATTTTTATTAAT^ 

CTT^G AG AATCCTAATTATTATC G TTATTATTTGCCCTTG AACTT1TGTATTTATTTGA 

TCTTAGTTATCTAAGACCATGCTAGGTCTATGGTATATGGATGCTGCACTCATATTGGTA 

GAATATAATATGAGAGACAGTCTATAGACAAACATATTAATATATGATGACAATCTATGA 

TATATACCATGAAGGAAGCTAACAAGCATAATGTTATGGATTAATTATGAAGGAA 
Sequence 1559 

AGCTTGGTGTACTTGATGATGTAGGTCAGCAGGCCCATGCCGACAAACGCGGCCACCAGG 

ATGAACACCTTGGTGTAGGTCAACTGCACGAAGCCGGTGCCGATGTCGACTCTCCAGGCG 

CCGGCCAGCAGCGTCGGGATGCCTTGTTGCTTGGCGCCCTGGGCGATCTGCGCGTAGTTC 

TGCAGGATCAGTGAGATACCGATGGCGCTGATCAGCGGTGCCAGTCGGGTGGAGTTGCGC 

AGCGGTTTGTAGGCGACACGCTCGATGACCCAGCCATAAACGCCAGTGACGACGACGGTG 

AAGATCAGGGTGCCAAGGATGAGCAGCGGGAAGGATTCGATGCCGAAGTAAGCCAGCAGT 

GCCAGACTGATCGCCGCGAGGTAAGCGGAAATCATATAAACCTCGCCGTGGGCGAAGTTG 

ATCATGCCAATGATGCCATAGACCATTGTGTAGCCGATGGCGATCAAGCCATAGACCGAC 

CCGAGGGTCAGCCCATTGACCAGTTTGCTTGCAGGAAAATACCATCCATAACGCAATCTT 

CACGCAGTGAGGAGCCTGCACACCTTGTGGGTGTGCGGATCTTTNTA 

Sequence 1560 

GAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTCACCTGTGGGCATT 

TGTCAAAATCAGTATTACTGCTGGGGGGGATGAGGGCTGAAAACTCCTATTTCACCATCT 

TTCTGACATCATTTGCCACAGTGTGCTTTTTTTGTTCTGGTATTTCCTTATACCCT^ 

TCTCTTCCCTATTTTAGTCAACCCAGCAGTCTGTATTCATTCCTCTTAATCAGGATTTGA 

GCTTTGCACCTGCTCCTTTTGGCTGGGGTTGAGTCTACTTCAGAGGCTAGGGATATAGAG 

GGTCCAAATACAATTTAAATAAGAAAATGCTACGMTAAAAGTGTTGAGAAGAATAAGAA 

TTCTAGAAAAAAAGAAAAAGAAAATGGAMGAAAAGACTTTTAAAATGTTAAGGCATCTG 

GCAGAGAGACCAAATGGGAACAAAAACCCAATTCCTCTCTCCTGACAAGTCCTTGCACCC 

TGGCCTATGGCTGCTTCAACCCACAAAAACGCCTTTGTTTATCAGCACTCAGGTATCAGT 

AGGCTGGCATGCAGTTGCAGTTTTACC 

Sequence 1561 

GGTGGCGGCCGAGGTACTGAAACATGTTGGAGGGTTAGTTTTGTTTAATTTGAAAATTCT 

GGGCTGGACATGGTGGCTCACGCTTATAGTCAGCTGTAGCACTTTGGGAAGCTGAGGTGG 

GAGAACTGCTTGAGTCCAGGAGTTCAAGACCAGCCTGGGCAACAGAGTGAGACCCTATCT 

CTACAAAAAGTTTAAAAATTAGCCAGGCTTGGTGGTGCATGCCTGTAGTCCCAGCTACTC 

AGGAGGGCGAGGCAGAAGAATTGCTTGAACCGGGGAGGTCAAGGCTGCAGTGAGCCACGT 
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TCATGCCACTGTACTCCAGTCTGGGTGACAGGAATAAGACCCCCTGTCCCNAAANNATAA 
ANAATANTAAAAAAAAAAAGTACCTGCCCGGGGCCGGNTTCTAAGAACTAGGGGATCCCC 
CCGGGGCCTGCAAGGAATTTCNATA7TNAANCTTAATCGAATACCCGNTNCGGACCCTTC 
CGAAGGGGGGGGGGCCCCNGGGTTACCCCCAANCC [TTTTTTGT 
Sequence 1562 

GCTCCACCGCGGTGGCGGCCGAGGTACTTAA I nTTTTTTTTTTTTm I AGTAGAGACA 

GGGTTTCACCATGTTGGTCAGGCTGGTCTCGAACTCCTGACCTCGTGATCTGCCCCCGCT 

TGGGCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCCAGCCACAGTCAGCCC 

AGATTCAAGGGGTGCGTCAATAGACTCTATCTCTTCGTGGAAGGAGCTGCAAACTATTGC 

G ATC AC GTTTGCAGTCTACCACAAATGTATTGCTATATG TTAGCGCTATTCTAATAGCTT 

TATATGTGTGAACTCACTTAATTGTTACAGCAACCCAGAGAGGTGAGTTACTATTTCTAA 

TTATTTCTCAGATGAGGAAGCAGCTACTGAGACGTTGAGTAACTTGCCCAAGGTTATATA 

GGTAGTGGCAGAGCTGAATTTGAGTCCACCGCAGTCTGGTTCCAGAGGCTATACTCCCAA 

ATMTAAGCTATACCGCCTCAAAAGCAAGGCGGAAGAACTTTATCACTTGTTTGGAGCAA 

ACTGAATAAGGCTTCAGACTTGTAMCATTAMGACAATTTGGCTCATTTGGCAAACTTG 

GATATATTGGATGCATTGTTTTATATTAAAGTAAAAAATTTTGT 

Sequence 1563 

GACGAGCATCACCAAAAAATTCCGACGCTCAAAGTNACGANGGTTGGGCGNAAAACCCCC 
GAACANGGGNACTATTAANAGGATTACCCAANGCCGGTTTTTCCCCCCCCTGGGAAAGCT 
CCCCNTACGGNGCCGCTTCTCCC7TGTTTCCNGACCCCTTGCC 
Sequence 1564 

CGGNAATTACCNGTTATCCACACCAATCAAGNGGGATAACCGCAAGGGAAAANGAACATG 

GTGAGCAAAAANGGCCCANNCAAAAAAGGCCCANGGGAAACCCGTTAAGAAAAGGCCCCG 

CCNGTTTGCCNTGGGCAGTTNTCTTCCCATTAAGGGCCTTCCCGGCNCCCCCNCTTGAAC 

NGAAGTCCATTNCACAAANANAAATTCGGACCGNCTTCCAAAGNTACAAANANNGTAGGG 

CCGAAAAACCCCCGAACCAGGGG 

Sequence 1565 

GGCGGCCGCCCGGGCAGGTACACATTTTGAGGGGTCCCACAATTGAGAGGTGAGCAAATC 

ACATAGAACTTGCTTTGTAGCCCTGAATGTGTCCATCTCCCTCTCCCTCCAGTGACCTCT 

GCTTCCAGCTTCCCTTCCCTATCTCCTGTATCTCTACTTCACTTGTGCTACCTTCTCACA 

CCACATCTTCCAGGATTACACTTTTATGCTACACCTGCACTGATCAATCTTCTACTCCAG 

TCTTCATTTTGTCTTCGTAAAAATGTTGAGGATGAATGCTGTTCCACTTTCCTTTCCTAT 

AGCGAGATGGCCTGTCCCCGCGTACCT 

Sequence 1566 

NNTGTGGGCTTGTGACCTTTTTCAAMTCNTmAGTGTCATTTTGGT 

TGTA TTCAGTCTGCCATTACTGGGAATCTCAGAGTTCTC^ 

TTTTAMGCCTTTGAAATCAGGCATATTTCACATC 

CAAGAAATGAGCCACTCTGNATMTCTCTTTNGTTCAGGAAGGGGACATTTTTTCCCTGA 
CAAATCAATCTTCCAGAAAACTCTTCATTTCAGNTTTGGAAAGGCACCGGAAATGGCCAA 
CCTAAAGCACCAACTGTTCGCGTTCCCGGAGTTTGACTCATTGGCCCCACCTTGTGGCCC 
TGTGGGGMGGGGCGGGCAGGCTGAAGAAACCGCCTGACAATTTAAATTNGCTTTTT 
Sequence 1567 

GGCATGATGGCTCATGCCTGTAATCCCAGCACTTCGAGAGGCCCAGGCAGGTGGATTGCT 

TGGGCCCAGGAGTTCAAGACCAGCCTGGGCATCATGAAGAAACCCTGTCTCTACTAAAAA 

TACAAAAAATTAGCCAGGCATGGTAGCTCACACCTGTAGTCCCAGCTACTGGGGAGGCAG 

AGGCACGAGAATCACTTGAACCAGGAAGGCAGAGGGTGTGGTTGAGCCAAGATCATGCCA 

CTTGCACTCCAGCATTCCAGCCTGGGCCGAACAGGAGCCAAGACTCAGTCTCAAAAAAAA 

AAAAAANTAAAGTTCCCTTCGGCCCGCTTCTAGAAACTAGTGGGATCCCCCGGGCTTGCA 

GGGAATTTCGATATCAAGCCTTATCGGATACCCGTCGACCCTTNGNAGGGGGGGGGGCCC 
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CGGGTACCCCAGCTTTTNGTTTCCCTTTTAGTGGAGGGGTTAAATTTGGCGCCGCCTTTG 

GCNGTAAATCATTGGGTCAATAGCNTGTTTCCTGTGTNGAAAATTGGTTTATTCCCGCTC 

ACAATTTNCACACAAACCATTACGGANGCCCGGGAGCATAAAAGNNGTAAAAAGCCTGGG 

GTGC CTAM TGAAGNNGANGCTAACCTCACATTTAATTTGCGGTTTGCGCTCACTGGCCC 

CGGCTTTTTCCAMTCNGGGAAAACCTTGTCNGNNGCCCAAGCTTGCATTTAAATGAMT 

TCGGGCCCAACCC 

Sequence 1568 

NNAATCAGGGGGATAACCGCANGGAAAAGAACATGTGAGCATAAAGGCCAAGCCAAAAAG 

GCCCAGGNAACCCGTTAAAAAGGCCCGCGTTTGCNTGGCCN"nTTTTCCATTAGGGCTCC 

CGCCCCCCCTGAACGAAGCATTCAACAAAAAAATTCGAACGCTCAAAGGTNANGTAAGGG 

TGGGCCGAAAAACCCCCGGAACAGGGGAACTNATTANAAAGGAA7TACCCCAAGGGCCGG 

I I I I I I CNCCCCCCTTGGNAAAAGNCTTCNCCCTTCCGTTNGCCGCCTTTTTCCCTGGTT 

TCCCGG 

Sequence 1569 

CGGTTTATCCACAGAATTCAGTGGGGATAACGCANGAAAAGAACATGTGAGCCAAAAAGG 

CCANNCAAAAATNGCNCAGNGAACCGNTAAAAAMGGCCNGCGTTTGCTGGGCCGTTTTT 

TTTTCCCATAAGGGCTCCCGGCCCCCCCCTTGNACCGNAGCCATTCAACCAAAAAAAATT 

CGGACCGCCTTCAAAAGTTCANGAAGGGTGGGCCGGAAAAACCCCCTNACCAGGGGAACC 

TANTAAAAAGGATTACCCCAGGGGNCGTTTTTCCCCCCCTTGGGAAAAGGCCNTCCCCTT 

CCGTTNGNCGCCTTCTTCCNTTGNTTTTCCGGMCCCCTTTGCCCCGNTTTTACCCCGGG 

NATTANCCCTNGTTCCCCGGCCNTTTTTCNTTCCCT 

Sequence 1570 

GCCCGGGCAGGT ACAGA CAATGTGMGTTAGGACTTMGTAAAGGTAGTATTTATTTACT 
AAGAATAAACTTTCTTTTAAGAAGTATATAGAAAGTAAM 

TTCAGCCATACTTTTCTACTCACCCTGGGCTGCAAGGCAAACAACTATTTTATTATTATT 
ATTATTACTTTATAGAGACGGGGTCTCCTTATGTTGTCCAGGCTGTTCTCGAACTCCTGG 
GCTCAGGGGATCTTCCTGTCTCAGCCTCCCAAAGTACCT 
Sequence 1571 

NGCCGACTTCAGGGCCAGTACCTTTTCGGCGAGGTCCAGCAAGTGGTCCGCGCTGAACAG 

CTCGCGAACGGGTAACTCCACCCCCAACTGCTCGCGCACCCGCACCACCACCTGAGTAGC 

CAGCAGCGAGTGACCGCCCAGCTCGAAGAAGTTGTCATGCAGGCCGACGCGCTCGACTTT 

CAGCACTTCCTGCCAGATGCCGGCAAGTTGGCTTTGCAGTGGCCGTCGCCGGGGCCACGT 

ANTCCGCTGTGTTGCCTGGGCCGGGGGTCTTGGCCGCAGGCCAGGGCCTTGCGGGTCCGA 

GTTTTGCCCGTTTGGGGGGGTCAAGGGGGCCAACCCGGGGGGTAGTAAGTACCCAANAAG 

GGTCCGGGTTTGCAATATTAAGTCCCGGGCMGGCCTGGGCCCTTTGAANTTTTGGGGCC 

GCCGAAACCCTCTGCCCCAANGCCGCCGCCTTTTTCTTGGGGGTTGGTTCCGAAGGGTTC 

NGGGCGCCCCCNGGCNNAAANCCGTTTNGGGANTCCGGGTh4GGGG 

Sequence 1572 

AGCTCCACCGCGGTGGCGGCCGACTGCACCACGGATGGGTGGGCGTTGAGCGCCGTTTCC 

ACTTCCAGCGGATACACATTGAAGCCGGAGCGGATGATCAGTTCCTTGGTGCGACCCACG 

ATGAACAACGCGCCGTCCGGCTCCTGGCGCGCCATGTCGCCCGTATTGAGCCAGCCGCCG 

TTGCGCATGGTGGCCGCCGTCATGGCCGGTTCGCGGTAGTAGCCGGCCATGATGTTTGGT 

CCGCGTATCCATAATTCTCCCGGCTCGCCAGGCGCGACATCCAGGCCGGCCACGTCCACC 

ACGCGCACTTCCACGCCCGGGATCGGCATGCCCACCGAATCGTCGCGGCGCGGCGCATCC 

AGGCGCGTCTGGCTGATGGTGGGAGCACTTTCCGNCATGCCGTAGCCATTGTGCAGCGCG 

GTGCCCAGCAGCTTTTCCACGTNGCGCTTGAGCGACGGCGCCAGCGGCGAGCCGCCCGCA 

TACGCAAAGCGCAGCCCGTGTGGGTA 

Sequence 1573 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGGTGGCGCAATCTTGG 
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CTCACTACAACCTCCACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAGCTGCCTGAGTAG 

CTGAGATTACAGGCGCCCGCCACTACGCCCAGCAAAT7T7TTGTATTTTTAGTAGAAACG 

GGGTTTCACCATGTTGGCTAGGCTGGTCTTGAACCCCTGAACTCGTGGTTTGCCCACATT 

GGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGACATTGCACCTGGCCGGTATTTCTGCT 

TTTTAAAATTATTTATGAATTTGGTO 11 IAC 

TTGGGAAMTTATTTAACTTCTGTGTAACTGAACTTATTCATATATAAAATGGGGCACTA 

ATAATACCCACTTCAGAGTGTTGAGTATTATATGAGGTAATAACATGTAACGGACTTAGT 

TAATGGTAAGCGCTATATATGTGGTAGCTATGGTGATTATGATTATCTACAAATAATATT 

GTTCTATMGTGTTTCTTAGTTCTTTTTGGAAAGMGGGCTTm 

TAAAAGTGGAACCG 

Sequence 1574 

AGGTACGCGGGGACCTCACCTGCTGTGCTCTTGCTTGCACAGTGTCCTGGAGCTGGACCT 

GGCTCTGGGTTTCCAGGAAGCAGTTTGACTAAAGGCAGCAAGCTGCTTCCTCTGCTGCCT 

GAAATACCAGATTCCCAATGGCGAAGATTGAGAAAAACGCTCCCACGATGGAAAAAAAGC 

CAGAACTGTTTAACATCATGGAAGTAGATGGAGTCCCTACGTTGATATTATCAAAAGAAT 

GGTGGGAAAAAGTCTGTAATTTCCAAGCCAAGCCTGATGATCTTATTCTGGCAACTTACC 

CAAAGTCAGGTACCTGCCCG 

Sequence 1575 

CTGGAGCTGGACCTGGCTCTGGGTTTCCAGGAAGCAGTTTGACTAAAGGCAGCAAGCTGC 

TTCCTCTGCTGCCTGAAATACCAGATTCCCAATGGCGAAGATTGAGAAAAACGCTCCCAC 

GATGG AAAAAAAGCCACTAACTGTNTAAC ATC ATGGAAGTN GATGGAGTN CCTACGTTGA 

TATTATCAAAAGAATGGTGGGAAAAAGTCTGTNNTTTCCANGCCAAGCCTGATGATCTTA 

TTCTGGCAACTTACCCAAAGTNAGGTACCTGCCCG 

Sequence 1576 

ATGATATACTTCTGGTGAAATTAATATTAAGATTAAATAAGATTTAGAGTGGTTTCTACT 

TGTGGGTCATGTTTCTGTTTATGATGCAAAACTGTAAAGGATGAAGATAAACTCTAGTTT 

GAGATGGGAGATTATGTCACTA I I I I I GCCTGAATCCCTTCGAAGAACCTCTTCTTCATC 

TTTTGGTGATAAAGAGAGTATCCMCCTGAAGGGAACTTCTAGTGTCTTTTGATACTTTA 

TTTTACTCTTGTGGATTTGAATAGGCAAACGTGGAGACTTTTTGTGTTAT^ 

TAAAACTTATGATATGAAAACAAAACATCMTATTCATATTATTAAAATTCCAGTAGAAC 

AAMTTTGATTTAGGTCAAMTTGATTCACTTTATMTTTAMTCAT^ 

ACCTTTAAAA 

Sequence 1577 

GTGGAACAGCATTCATCCTCMCATTTTTACGAAGACAAAATGAAGACTGGAGTAGAAGA 

CTGATCAGTGCAGGTGTAGCATAAAAGTGTAATCCTGGAAGATGTGGTGTGAGAAGGTAG 

CACAAGTGAAGCAGAGATACAGGAGATAGGGAAGGGAAGCTGGAAGCAGAGGTCACTGGA 

GGGAGAGGGAGATGGACACATTCAGGGCTACAAAGCAAGTTCTATGTGATTTGCTCACCT 

CTCAATTGTGGGACCCCTCAAAATGTGTACCT 

Sequence 1578 

ACGCCGCTGCGCAACCTGTTCCTGAACACGGGCCACGGCACCCTGGGCTGGACCATGTCC 

TGCGGTTCGGCGCAATTGCTGGCCGACCTGATGTCGTCGAAACAGCCGGCGATTCTGGCC 

GACGACCTGTCCGTCAGCCGCTACAGCGGCGCGCAGCGCGGCGGCAAACTGCAACACGCG 

GTGGCCTGAGATGGCGGCCAGCATGCCTGCAAAGGAACGTAGCGGCGCCATCCTGACGGT 

AGACCTGGACGCCGTGCGCGCCAACTACCGCTTGCTGCGCGACAAGGCGCACCCGGCCGC 

CTGCT 

Sequence 1579 

GCGCCAAGCTGCAGTCCATCATCGACGTGGTCAACAAGCCTGGGCGCGTCGCCCGACGAC 

ATCATGGCAATTTTGCAAGCCCTCGACCAGGCTGGAGCCATCGAAGGCGAACTGGTGGTA 

ATTTGATGCTTAATATCAACGACAGCAGCAGCGCCCTGCCCTCGGCGCTCGATGACAAGA 
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CCCCCGCCGCGGCCACGCCGGCCGATCCCCGCTACGCGGCCAAGGCCACCGA 
Sequence 1580 

CCTAAACTAGATAATTTGAGAACAGGGAAAAAAGATTCCATTTCGATTNCTGAAGGTTAC 

CCCCATACCTATTATAACAGAATAAAATAAAATAATTCGAMCTGCACAACCTCTAACTT 

ATC AAATCC TATATATG CCTCATTTTCTCAAATGACTCCTAATTTGTGN AAAG AAAAAG G 

CAAAAAGAGAAAGGACAGAAGTATGTCAAGGTGGGCTAAAGCTATGAATACCC7TTTATG . 

TAMCTAAGAAAAAAATAGATACACACGCATTTTTTAAAAGGGAACTTTT^ 

AGCCGCAAAGAGGAAAAATTCCTGGCTAAATTGCACCACTCAAAGACAACTAGACTTACG 

GTCATAAATTTCTTCTCCAACCCATTTCTTTCA 

GCAAGCTGACATAGGACCCTTTCAAACA 

Sequence 1581 

CCGGGCAG GTACCT GACTTTGGGTAAGTTGCCAGAATAAGATCATCAGGCTTGGCTTGGA 

AATTACATACi I I I ICCCACCATTCTTTTGATAATATCAACGTAGGGACTCCATCTACTT 

CCATGATGTTAAACAGTTCTGGCTTTTTTTCCATCGTGGGAGCGTCTTTCTCAATCTTCG 

CCATTGGGAATCTGGTATCTCAGGCAGCAGAGGAAGCAGCTTGCTGCCTTTAGTCAAACT 

GCTTCCTGGAAACCCAGAGCCGGGTCCAGCTCCAGGACACTGTGCAAGCAAGAGCACAGC 

AGGTCCCCGCGTACCTCGGC 

Sequence 1582 

CC CGCG TCCGGATGGATAAAGAAAAGGAAGAGTTGGGGTTAGTTGAGTCAGAGTACAGTC 

ATCTTTTGTCATG TCCTTA GTCTGTTGCTTAAGATTTTCATTTGTGGTTTGCA/^ 

TTTT AAAAAG TC AATTTTTG ATT AATTT AGTCTTTT AG AAG CTTCTG C AT AC C AAT AAAA 

GMTATGTGCCACTGTTTTTrGAGGGTrTTGATCCCCCTATTTAATNTGTNTTGAAGATA 

AAAAATNTTATCCAAATTAAAACTTNTCCATTGC^ 

AATG AN TTTAACCC ATTTTTTTAAAAATGCAC AC ACCTGG GCCTATAATTTGTACACATA 
AAATTAGCTGAAGTCCCAGANGTGTGGATTCCCAGACTCTTTGAGTTGAAATGATCCCAT 
TATTAAAAATAAGGTACCTGATCAGGGCAATCAGGCAGGAAAAAAAAAA 
Sequence 1583 

GGAGTCGACCCCGCGTCCGGGCAAATTACTGAAGCAGTTATGGTTCATAGTAGCAATTGT 

ATATGTTACCTAGGAGATGGTGTCAATGGCCGGGGGCCTCATCGCCACGTAAGGGTGTCT 

TGCCAGGTGACTTGCAAGGGTGACTGCCAGGTTGAAGCCAAGGCATTCACTGAACGGAGG 

CAGAAGCAGAAACAGTCCCAGGCAATTCTTGGAAAACTGGCCCATTCTTACTGCGGGACA 

TAGCACCTTACTGTTGGGTCCAAAACCCCATGTGTCCTGCCAAAAAAGGCAGCCCAGCAC 

AGATGTGTTCAGGGCCAGAGACACCATTTGCCTACTGAATGACTTGAGCAATTTGGTGCA 

TCTCACCGAGCCTTCACCTTCTCTGTTAMTGGAACATCTCCTCCCTCCTTTGCAAGATT 

GTCATTAGGATTTGTGAGGTATTGGCAATAACGGGCCCTCCCATCCTTCCCATTGCTGTT 

GTTATTACTGTATTCATAGCTTTGTGTGCTTCATAGGTACCCTCTAGGGTCCTCTACGTA 

CAACATACAACAGCTTCGCAATAGCCAGTGTTGGAAGCTCCAANAACGAAGGTCCTGTGG 

GCTATGTGGAGAGTAAGCAGCANAGCAGGACTGCCACTCAAACTAGTTTCACCC 

Sequence 1584 

MTAAGATTCATATGGCACCTAGTTMTTTCATTTACTC 

AAGCTCTTTTrCACCANCTACTAAACAAAAGCAATTTATTCMCTAGGTTTTC 

GAGTTGNTCTGTATGTAACAAGAAAGCMGAAAATAAATATTTTTAAATGTCACACTTCC 

CTGCAAAAATAATCACAATAATAATAATAATAATAATGAATAAGAGTTTATAGGCTAGGT 

ATAGTGGCTCACACCTGTGGATCCTAGCATTTTGAGAGGCTAAGGCGGGTGGATCACTTG 

AGGTCAGGAGCTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTGTACAAATACAA 

AATATTACTCTAAAAAAAAAAAAGTGCNNCCGTTANACTANTCTANANAAAAANCNTCCC 

NNACNNNNCCCNGAACCTGAAANATAAAANGAATGCCNTGNAGGGGNAAACTGGNNAATN 

GCCCC TTATANNGGNTCCAAATAANNGCAATNGCNTCACNANTTTCCCAAATAAAACANN 

TTTTTNCCTGCNTNCTANNGGGGGGTTTGTCCAAACCANATAAANTGTTCNTACCAAGGG 
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GGGGATCCCNGGGNTNCCNNACTCCAAAAAATTTCTTTTCC 
Sequence 1585 

CCC G C GTCCGGAAATTTTGAAG AATATTAGAAAATATCTG AAACTGAATG AAAATAAAAC 

ACAACATATCAAAATTAGTGAMTAGAMTAAAGTAGTACTTAGTGGTAAGTTTATMTT 

MTTTACTTACGTTAGMTTAATTACTTTT^ 

ACTTTAAGCTTTTGCTTTAAGAAATCTAAAAAAAAA 

Sequence 1586 

CCCGCGTCCGCTTTCTTATTTGGAAAGAAAACAACACTCACCACGGCATTCACAATGCTG 

GGGACCTTATGAATGGGAATGGAGAGGCTGCACTAATGTAAGTGGGGTCAACATCCCTGC 

TGGAGGGCAGATGCAAGTGGGGTCAACATCCCTGCTGGAGGGCANATGCAATTTTGGCAA 

ACTMTGACTTTTGGAAGCCATTTTCTAGACTGCATTTTTCTCTTCTTTN1TAG 

TTTCTCAAGGGGAAAAAGAAAGCAGGAGGTGTAAATGGTATCTTGGCCGTTCTAAGAGCC 

CGTCTGGGAGACTCTCTGCTGACCACAGCACTGTCTCTCCCCAGGGCCCAGTCCCACAGT 

GTCACTGAGTCCAGANTCGGGACTCCAACAAGCCCTCCGCATTCCAGCATGACTGTGGTA 

AAAACAAAAAAAATCTATTTTATTCTAGTCCATCACATTTTCTATATTTGCTAAAGAM 

CTCGATCACTCCATAMTGATCGGTTAAAATTAAAAATATAGGCTGGGTACAGCAGCTTA 

TGCCCTGTAATCCCAATGCTTTGGGACACTGAGGCTGGAAGATCACTTGAGGCCAGGAGT 

TCCGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTNTTCCAAAAATAAAAAATAATT 

AGCTGGGGGGAGCCTAAGTNCCAGCTACTTTGGGAAAGCTGAGCCTGGANGAATGC7TTG 

ACCCTGGGAGTGANAGGCTACANTGAGCTTTGATTGGCCCCNCTTGCACTTNCACCTTTG 

GGCCAACAANANTGAAACCCTTGTCCGTAAAMTTCAANTCNNAATCAMTANTTTTAAA 

AAAAAAAAAA 

Sequence 1 587 

GTCCGTGCACCCCAAATGGGATCTGTGGGACATGAGATGCCGGCTGCATGTGAGCATCTT 

GGTAACCAAATAATAGGAAAGAATGTAGGAGCAGGCAATCCTGGGGCAAAAGCTGACGCC 

AACGTACCTATTCATCTTAGAATTGAGACTCTGATCTGTCTTTCCCAAGGCCAGATCTAC 

AGAGAAACTTTCTGTCTTGGGCAGAATAGCTGGCTGAGACATCATAGAGAGAGACTGAGG 

AAATATCCCAGGAGCCTGTTGACACAGAGGCCCTGATAACTAAAGAGAAGTCCCAGCTGG 

GCGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCGGATCACTT 

GAGGTCAGGAGTTNGAGACCAGCCNGGCCAACATGGCGAAACCCTGTCTNTACTAAAAAA 

Sequence 1588 

TAGGGAGTCGACCCCGCGCNCCGATCCMTGGTCCTTTTCCTTCCAAATGAATGTAGAAA 

ATTCGTTATTATCCTGATATTTGCACCACTCATCCCCATGCCACTTTTNCCACATATNAA 

TTTATTTCACTGCCTGATCTTTGGATATGAAATTATC 

AGTGCCAATCAAACAGCATGGTGATATTCTTTTTC I I I I I ! I I I CCCCCCACAAGGCTGT 

TGNTCAGTGTGANATGACATCATTTCTTATTTCTGCCATGCTGTGGC I MM l GTGTTCT 

GCTTTTAAACTACNCCTGTAGTACAGTTAATATG I Mil I ATTGGTATTGATTTNTNNAA 

N N N ATTANTGNTCTCTATTTGGGACCCAAGTCATAAGAACCAATTNG AGG AC ATAATAAA 

AACACTTMTGCTTGTAGTCTGTANAGATCTTCACCATTTCACAGCNCACACATTATTTT 

CATMTAMTTGCCACTGGTGGGACATGGGACTGACTTATTTGAATAGAGTTTGAAATAC 

CACCTTTGTATGMGCCANACAATAmGATTTATTTTATCTAANAAATCTATGCNTGGG 

GTATTMGTTAAAAATAGTGGGGGTTAAAATACMTTTMTAAAGCCTTANNAAATGGTA 

CCCTAAACTCTTTCTNGGGGTTAAAATGGGAATAAGGGAAATGTNTTGGGGAAATGGTGG 

AAAAAAAACTTTTTNGNAA 

Sequence 1589 

NAGAGAGCCTTGTCATCTCTCTGGGGCTTGAGGCAGGGCTGGGGCAAAGTGGCTATTGGA 

CAGGCTGGCATGAGACAATGTCCTTCTTCTCAGCTCTACTTTACTCACTTCTGTGGAGAC 

CTGCCTGCGTTCATTCTCATCCCACATAGAACCGGGAATCTTTCTGGCTCAAGTCACTAA 
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ACAGACACATCAGGGAGGCAACAGCTTGAAATTGAAAGCAGAAAAAAGTGTGCGTGTACA 

CAAACATGTACAAGCATGAATGTGTGTGCGCACATACACGCATGCACACCTGCACATGCA 

CACACCTGCACGTGCATGCATGCACACTTGGCCACCAGTTCAGGACTGCAAGGACAGGCA 

TCTC TGCC AGGGGGCTAGCAAACAGCAAACCCCACAGCGTAGAATCCGCTTTTGCTTTCT 

TGACTTTTCCTGCCCTGGTCTGAAAGAACCAATGACAGAACTGGCTCAGGGACCGAGCAT 

GGGGTGAAGCAGAAGATCTGAGTAACCCATATCCTGGCCCACCAGCCCCAGAATTATTGA 

GCATTi GATTGTTTAAAAAAAAAA 

Sequence 1590 

CCCGCGTCCGGAAAAACTGATAGAAAAATAGATCTTGCAACCCTGAAACCAAAGTAAATG 

AAAAAGTTTCGAGATGAAGTCCATCACAAGGGAGAGAGTTGCCTCCCCAGGACGGCGTCT 

CTCTAGAGGGACTGAGTTCCAAACCCCAGTGCANAAAGGTTGTTAAGGAAAATCAAAGAA 

GGTTCTGGAATTTGGGTTGCATATTAAAATCACCTANGGAGCT7TAAAAGGGAAAGACAA 

AAC TGAA CCCAGACACTACCACAGGCAATTTGATM 

ACATTTTAGATAGAAAATTTTNAGAACACACAATTTACAGAGATCACCATGATGAACCCT 
CATGTACCTCATCNAACATCAGTAACTCACAACTACGGATCAANCTTGTGTCAACCTCTG 
CTTCC ACCCAC TTCC CTCTCC ACCCC AAAATAATMTTTTTTTTT 
Sequence 1591 

GGAGTCGACCCCGCGTCCGATTTTAGGCAGCTTTCTCTTTTTCTGATTGGTTTCAGAAAG 

ATAAGATGCAGTTACAAGCAAATGGAGAAATTAAGTACAAATGTCTGGTCCGAGAAGCTT 

AGCAAACGTAGCAGTATTAGTTCTAACATCCATTTTCTGTCMTTTATTGAAAATATCTT 

C AA AACT G ATAC ATC T7TTAG AAG TAATAC AAATTAAAG AGTTC ATTG AAAT AAAGG AG A 

TAGTTTTATGACACCAAGATTTATCTCCACACTTGTTCCTTCATCTTAGATGGAACAAAT 

GCAGAGAGTGTAAAGCTTAAAATAACTATATGCTCTACAATGTAGATAATCAAATGAAAA 

TGCTGTGGTCTCAAAGCCAGAGGAATGTGGACCACTTTGGTGGCCAAATTTAACCAGCTG 

AGGTCTTTCTTGAGTACCATCTGAGTCTTCCTTTTTAAAAAAAAAAAAAAAAAAAANGGG 

GNNGNCCNTTAANNTNTTTNAAAAAAAAAACTTCCAANNNNTCCCCTGGACCTGNAANNN 

NAAANNNAAGNCATTNGNNGNNGTAANTTNTTTNNTGGCNCTTTTTAGGGNTCNAAAAAA 

GCNAANCCCCCCCCAAATTCCCCAAAAAAACCTT 

Sequence 1592 

GGAGTCNACCCACGCGTCCGCTAGGTGGAGACCCCCGTCCCGCTAAACACTGAGCCCGTG 
CAGCGTCCAGGCCTGACACCCCTCCGGCGACTCGCCGCCTCTACACCTCCGCGCCACTGT 
CCCCGGCCCGACCGACCCAGCCCTTAG 
Sequence 1593 

TTTTCTAAAGATTTTATTTGAATATGTTTCCTC 

CTCCCTCTGCTATATCCATTTATTCATTTAATGCCCATAGAGGACCATGCCAACCCACAT 

CCATCCANACTTCCAGGCCTAAGGTTCTGGTTGAAATGACACAGAAACCATGGCAATGGC 

TATGG ACAG TCCTGTTTCTANAAGGTGACCTTCACACACATTCCCACAAAGGCTGCCTGT 

GCTTGTTTrAAATTCAGGGTTACCAATATCCTTCCATTTCAGGGACTTCAACCACAAGAG 

TTCTTCTATGTTATCAGGAATTGGAGGTACACCAAAATAATATCAAATATGCAAGGAACT 

AAAGAGATGCTAATCTCATTCCTGCATCTCCTGGTTCTCTGGCTGGGTAAAACACCCAGC 

ATATTGTTACCTTTGNGATTAAAA^ATCCACAGCATTTAAMGGATTATAC 

TCGTAAACTAGGCTGACTGAGGTTTAAAAAAAAMCATAAAGGGCAAGTGTTTATCTTTA 

AAmCAMCATTGGGCAAGGGAAATNAAAANGTTCmCCTACCCATGGAATNAACATT 

CCCAGACTGGGNATTAATTTATCCAATGTTTCCTTTTCNCCCCANCCNTTTNTTT^ 

GCAAAA AAGGGGGGGNGGGANMGNGAAACAAMNGGGGNTTC 

I I n t I GGNAAAAANCNTTNTTNAATGNAATTNGAGCTNTTTTTCATTAANCATTTTNGA 

AGGGTNNGGGGGNNTGTCGNCNAACNTANAAANGGCNNNTTGTNTGNGGCCCANNAGTGG 

GGGGTTNAACANACAAAAANAAAAA 

Sequence 1594 
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GTCGACTTGTGGGGATTTCCTCCTCCTGGGTTCACCACCTAGTTTTCCTTGTGATAAAGT 

TATTTTGTATGTAGCTATGAGCACAGGTTGATGCTGGAAACACACATGTAGTGTGGATTG 

TCATTAAGAATGTGGAGAACTGGCTAMGAMCCACTTGTACAGTTTTGAGATGAGCACT 

GCCTGCCTCAGCATCTGTGGTAACGGCTTCACTTGGGTGCAGAGCTTGGCAGAGGTTTGA 

GTGACTCTATCTCGATGATCGTCTGTCAGAGGACAAGGTTAAGTGTCACCACCGAGGACC 

CCGTTGAGCCTCTGGGCTGGAAGTAACTTCCCCTGGGCCGGTGAGGCAGCCTCTGGGGGT 

CTTTCCTGACAAGACCCTTCACTGANCTGCAGTTTGGCCACCTCTGCCCTCGCCTCCCTG 

ACTGCAGGGCCTNGAGGTCTCACCACCTCCAGCCTGAGCGCCCAGTGCCCAGGGGCAATT 

TCCCGCCTTCCCCNGGGGCTCAGGATGCTGGCTCCTGGGGCAGTGCCCGCAACTGGGCCC 

CTCCTTCAGGGCTGGCCACNGCCTCTGCTGTGTCATACCANCTGCTTTCNNGNACTGCAA 

CCCACACTGGGCCGCCGCTAACTAGTTANAAAAAAAAACCTTCCACACCTNCCCCTG 

Sequence 1595 

GGAGTCGACCCCGCGTCCGAGCAAGCCGAGATCTCACCACTGCACTCCAGCCTGGGCAAC 

ACAGCAAGACTCCATCTCAAAAAATAAATAAATGAATAAGATTCATATGGCACCTAGTTA 

ATTTCATTTACTCAATACTTTTCAGAA 

AAMGCMTTTATTCAACTAGGTTTTCTTTGCATGAGTTGTTCTGTATGTMCAAGAAAG 

CAAGAAMTAMTATTTTTAAATGTCACACTTCCCTGCAAAAATAATCACAATAATAATA 

ATAATAATAATGAATAAGAGTTTATAGGCTAGGTATAGTGGCTCACACCTGTAATCCTAG 

CATTTTGAGAGGCTAAGGCGGGTGGATCACTTGAGGTCAGGAGCTCAAGACCAGCCTGGC 

CAACATGGTGAAACCCCATCTGTACAAATACAAAATATTACTCTAAA 

Sequence 1596 

GGAGTCGCCCCGCGTGACGNNGNNTCGCNGGNCATGCGACTCATGGCACGGAGCAAAACA 

TGGTNCAGCCGCTAAAAACACTGATGGCCGATGATNCGGATTACAGTTNCNATATTCTGG 

AANCCCGCTACGCCATTGAAGCCAGCACCGCATGGCATGCGGCAATGCGCGCCACACCTG 

GCGACAAAGAAMGATTCAGCTTTGCTTTGAAGCAACGCTAAGTGAANACCCGGATATCG 

CCTCACAAGCGGACGTTCGTTTTCATCTGGCGATTGCCGAAGCCTCACATAACATCGTGC 

TGCTGCAAACCATGCNCGGTTTCTTCGATGTCCTGCAATCCTCAGTGAAGCATANCCGTC 

AGCGGATGTATCTGGTGCCACCGGTTTTTTCACAACTGACCGAACAACATCAGGCTGTCA 

TTGACGCCATTTTTGCCGGTGATGCTGACGGGGCGCGTAAAGCAATGATGGCGCACCTTA 

GTTTTGTTCACACCA 

Sequence 1597 

CCGGGCAGGTACCTGACTTTGGGTAAGTTGCCAGAATAAGATCATCAGGCTTGGCTTGGA 

AATTACAGACTTTTTCCCACCATTCTTTTGATMTATCAACGTAGGGACTCCATCTACTT 

CCATGATGTTAAACAGTTCTGGC I Mill I CCATCGTGGGAGCGTTTTTCTCAATCTTCG 

CCATTGGGAATCTGGTATTTCAGGCAGCAGAGGAAGCAGCTTGCTGCCTTTAGTCAAACT 

GCTTCCTGGAAACCCAGAGCCAGGTGCAGCTCCAGGACACTGTGCAAGCAAGAGCACAGC 

AGGTGAGGTCCCCGCGTACCT 

Sequence 1598 

TTGGAGCTCCCCGTNGTGGCGGCCGATGTACTGCCATNCAATGCAAGGCTGCACACCATG 

GGGGTTAAAAAAGAAACCTTCCCATGTTGCTCGGAAAGACCCGAGGCATGCTAAGATGTG 

CATGGGTCTCCTGAGCAGTTGTGGAACTTGCTGCCATGTGTCTATAAAGGGCAGGCAACA 

TATCTTCAAAAACATTTATTTMGTCTCTGGAAGAACACAATAATGATGTTTGCCGATAG 

GGAAAGGTAAATGGGGACAGGTGTGAGGAGGCTTCCATCCTTTCTCATGCCACTTGGACC 

ACGTGAATGTGGAAGGTGGCCAGGGAGACCCTCTCTGCAAGTCCAGGAATAGGGCG 

Sequence 1 599 

CCNGCTCCCCGCGGNGGTTTGATTACTCATTNCTCTCANTTNCTAATGCCTGATCCTAAC 
TACAAGAr4TGTGTTTGAAAGAAACTCCCCTGTATAAGAGACATATTTCAAAGAGCTCAAA 
ATAGTAAACACTAACTCTTTCAAACCACTATCTACAGCTGGAAAATGCT7TCTCCCTTGT 
AAGAAAAAAAAGGAAGGCAGGCCTGGTGCGGTGGCTCATGCCTGTAATCCCAGCACTTTA 
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GGAGGCCGAGGTGGGCAGATGACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATG 
GAGAAACCCTGTCCCTACCGAAAATACCGANAAAAAAAAAAAAAGAGAAAAAAAAGT 
Sequence 1600 

ACNCCCCCCCCNATCNCGNGGGAAAAAGNTCCCCGCCCCACCGCGGGGNGGGNTTTGCCG 

GNGGGNACNANTTTCCCCCCGGGGNTTTTNAAAAACCCCCCGNNCACAAACCCTCCCCGG 

GGGGGTTCGGTNTAGAANGNGGNNAAAANGGGGCCCCGGCCCANGCNGGCCCNNCCNGGG 

GGGAANGGGGGGGGAAGGCCCGNNNNNCNANAGGGGGCCNCCCCNNCNNNCNGGGGGANA 

ACCGGAAACCCCCGGGGAGNCCCCCCNGGGGCCACCCCCGGGGGGGGCCACCCAAGGGGG 

GANNGGGGCCNCGGGGGNAGACCNAACCCAAANCGGNAAAGNGGNGGGAAAA 
Sequence 1601 

GCCATGCTCTCCTCCTCTGCCAGTCTCCTCCACCACTCTCTAACCTGAGAGCCTGTGGAA 

CCTGCCCGTCTCCCCTCCTCCATCAGACACACCTGCCTAGGAAACAGGAAAGGACCTCGG 

AAGTCTTCTAAGGAGAGTCATGGCGTATTACCAGGAGCCTTCAGTNGGAGACCCTNCATC 

ATCAAGTTCAAAAGACCAGGANCTTTACCACCTTTGCGGGGGTCACTTGCCCTGANCATG 

GGGCCGGACCGTTTAAAGGAATGANNACATTCCTTGCAGCAGNATTNTTCCATAGGGCCA 

AAAAGCTGCTCCAGNNAAAAACNCCTCTCCAATGTTGNTNTGGAAAGNCGCCCACANGGA 

TTTACCAGGGGGNCCCNCCCTNANTTNANTTCNGGNNNNAAAAAAANNNCCNNTTTTTNT 

CTNNCC NNAAMNGG GGGGGNGNCCNNTTTTTNTTTNTNTTTTT^ 

GNNNANi minim l NAAAAAAAAAAAAAAAAANTTATATNTGGGGGNTTNAAAAAAAA 

AAAAAAGANGNNCCCCCCCCCCGGGNGNNNAANNAAATAT^^ 

TTACC CCCCCCCCCCCCCNGGGGGGGGGGGGGNCNCCCCCCCNNC t rTTTTTTTTTTTTT 
Mil I AGNGNNNGAAAAANAANCCCCCCCCCNCCNAAAAAAAAA 
Sequence 1602 

CTCCACCGTTGJGGCGGCCGGTTTGCGAGGGTNNGCANGTACGGGCGTGCGCGCCGCTGG 

CGGGGCCGCAGTCTGGCAAGCCGACAGCAGGCTGGCGGCGGCAAGGACGATCATCGCGTG 

TCTCATGCCTGCCAGCCTACCATGGTTTAACTGAACAGGCGTGCCAGCTCCACGCCGGGG 

TCGGGGGCGCGCATGAACGCTTCGCCCACCAGGAAGCTGTGGATGTGCGCATCGCGCATG 
CGTTGCACAT 

Sequence 1603 

AGGTACTTGGTCTTTGMCTCTCGTGTCGAMGAGTTGAACACAACTAAACTTTAATGTG 

AAAAGGTCTCAAGTAGTTAATCAGAAATGAGAGGCGCACATAGCATTTTATACTGTTTTC 

GAT7TGCTGACACAACATCATTCTGTGCTCTCTAGTGAGCAAGAGTAATCCTCAATAGCA 

TTAA GACGA AAGGCTGAACACAAAACCGCAGGCAAGTCAAGTAGTGATTTTATTCTTTTT 

GTCATTTTTCTTTCAAGTGGAAGATCCCTMCACTCTCTGCTCCTGACAATGTTTATAAA 

CAGAACTNTGAGAAGCATCTGAATGTAAA 

Sequence 1604 

CACGGNTTTCCGCCACGCCGCCGNTGATGGCATNCTGCATGAAGTTCAGGTAACCGGTGA 

CGGACGGCTCAGACGTGATCTGCATTTCCGGCACCACCAGGCCCGCCGTCGCCAGCGCGG 

AATTGGGCGGCACGTAGGAGGGACGGAAGTAGTTGAAGACGGACGGCGCGTTCATGGGGT 

TCTGGCCGAGTCCCGACAGCGGATCGCTGAGGTAATANGATGCGGTACTGCAAAGCAGCA 

TCCTTGGCGTTAAACGCGCGCATCCAGTTGC 

Sequence 1605 

GGCATCCTGCATGAAGTTCAGGTAACCGGTGACGGACGGCTCANACGTGATCTGCATTTC 

CGGCGCCACCAGNCCCGCCGTCGCCAGCGCGGAATTGGGCGGCACGTAGGAGGGACGGAA 

GTAGTTGAAGACGGACGGCGCCGTTCATGGGGTTCTGGCCGAGTCGCGACAGCGGATCGC 

TGAGGTAATAAATGCGGTACTGCAAAAGCAGCATCCTTGGCGTTAAACGCGCGCATCCAG 

TTGCCCAAACGCACCAGCGGTTCGNTGCAGTTTGCCCGTGCNCAGGGTGCTGCCGGCTGG 

CNCAAGCGCCTCGGGGNTCCAACAGCACGGNACGGATCACNGN 

Sequence 1606 
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NGGTAC mill! NTTCTJ rTTTTTTTTl I NGCTGACCTGMAGTTNGATTTNCNNTGTNT 

ANAC TTAACANTCTGAATTTAAACACGAGGCATATACACAAAATCATCTGGAACTGATTT 

TTTTCACACTAAACACTTCmCTGATCCTACTTTCCAGGAGGCTCCTTCCTCCAAACAT 

AC CTC CAGATGAAVVTCACTTAATTAACCAAGAMG ACTC AACCGC CCTTAACTGCTTTT 

CTAMCAGTCACACATACTGGATTTACTTAT7TTGAGACATGACAAACCAGTTGCAAACC 

ACAMTTCATTCTCATGGGCCCTCCACACGGTCTCTTGTTCCTTCTGAAGAGGGTCTTGC 

TGAGGACCATTTGTCCCCCTT 

Sequence 1607 

ACCNGGCCNTGCTTNGATGCAAGACTCCTCTCCCTAACGATTTTGAAGAGATATCCATCC 

AGTTCTCCTCTAAAACCTMTTGCTTACCCAGATGAAMGATATTTGAGAATMTGAACC 

CATGAGTTGGGTGGGAAGTTTAGGATACCTTTTTATTCTGATGCTCATGANAGCACCATT 

TANAMCTACAGCATTGTGGATATTTGGTTCTATTTAATATTAAAAAATTANAGCACAAG 

GCATTTTTA 

Sequence 1608 

CCGGGCAGGTACGCGGGATTTATGTCMTGATMCAATTTACCTATT1TTCMTTTCTGT 

TTGTTTTATGAAAAAGCAAATAGAAGCAATCAGGGCACTGCAAGTTGTGACTACTCCAAG 

ATGTGAATCATGGATCATGCAAATTACMTCATGTTTTAACCTGACCTCCAAAGGGAGAA 

TAMGTAAAAATTATCCCATGTGAGGATTATTCACCAGTTTATATGTCATTAGTTACCAG 

TNTT TCTTTATGAATAATGTTTAGCAATATTATAAAAGTATGTCTAATAGNTTATCAGGT 

TTTTGG C TTGTTACTTTTTGGTAG N AACTTATAA . 

Sequence 1609 

TCCCACAATTGAGAGGTGAGCAAATCACATAGAACTTGCTTTGTAGCCCTGAATGTGCCC 

ATCTCCCTCTCCCTCCAGTGACCTCTGCTTCCAGCTTCCCTTCCCTATCTCCTGTATCTC 

TGCTTCACTTGTGCTACCTTCTCACACCACATCTTCCAGGATTACACTTTTATGCTACAC 

CTGCACTGATCAGTCTTCTACTCCAGTCTTCATTTTGTCTTCGTAAAAATGTTGAGGATG 

AATGCTGTTCCACTTTCCTTTCCTATAGCGAGATGGCCTCCCCGCGTACCT 

Sequence 1610 

AG GTACTGACTTATGCTGGGATGGTTCTCGAATTTTTATTAGMTCC CCC AGAGTGCTTT 

TAAMCTCTTCATGTTCAGGCCAAGCCTTTAAACCATTAACCTAAGAATCTCTGGGTGGA 

GGGTGTGACCTAGGCTCAGCTTCTTTTAATGTTTCTCAGTTGTGTCTGAAGTGCAGTCAG 

GATGAGAACCACTTGTATGCATCTCCTMCTTTCTAAACTTTACTGTGCATGAGAAGCAC 

CAGGAGATCTTGTTAGAATGCAGTTTCTGATTTAGGAAGCCTATGCTAACGTCAGAGATC 

TACATTTCCACCAGCTCTCAGGTGTAGTGACTCTCCTGCTCTGTGGACCACACTTTAAAT 

AGCCAGGCTTACCAATCCACTTAAAAATGGCCAAAACCACAATTACTTTTGCACCAAACT 

AATACACTGTGCTGTCCAGTATGGTAGCTCCTAGCCACATTTGCCTATTTAAAAGCTCAG 

TTCCTCCATCCTACCAGCTGTTTTTCAG 

Sequence 1611 

CCGGGCAGGTACCTGACTTTGGGTAAGTTGCCAGAATAAGATCATCAGGCTTGGCTTGGA 
AATTACATAC I I ! J ICCCACCATTCTTTTGATAATATCAACGTAGGGACTCCATCTACTT 
CCATGATGTTAAACAGTTCTGGCTTTTTTTCCATCGTGGGAGCGTTm 
CCATTGGGAATCTGGTATCTCAGGCAGCAGAGGAAGCAGCTTGCTGCCTTTAGTCAAACT 

GCTTCCTGGAAACCCAGAGCCAGGTCCAGCTCCAGGACACTGTGCAAGCAAGAGCACAGC 

AGGTCCCCGCGTACCT 

Sequence 1612 

CGAGCGGCCGCCCGGGCAGGTACTCTCACCTGTGGGCATTTGTCAAAATCAGTATTACTG 

CTGGGGG GGATG AGGGCTGAAAACTCCTATTTCACCATCTTTCTGACATCATTTGCCACA 

GTGTGCTTTTTTTGTTCTGGTATTTCCTTATACCCTTGAGTCTCTTCCCTATTTTA 

ACCCAGCAGTCTGTATTCATTCCTCTTAATCAGGATTTGAGCTTTGCACCTGCTCCTTTT 

GGCTGGGGTTGAGTCTACTTCAGAGGCTAGGGATATAGAGGGTCCAAATACAATTTAAAT 
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MGAAAATGCTACNAATAAAMGTGTTGAGAAGAATAAGAATTCTAGAAAAAAAGAAAAA 

GAAAATGGAMGAAAAGACTTTTAAAATGTTAAGGCATCTGGCAGAGAGACCAAATGGGA 

ACAAAAACCCAATTCCTCTCTCCTNACAAGTCCTTGCACCCTGGCCTAATGGCTGCTTNA 
ACCCACAA 

Sequence 1613 

GCCGGTGATCGACCCGTTCAACTGGGATGCGGGCAGCGTGGCGCGTCCCGTCATCAACTA 

CGCGCAGTGGGGCACGGCCAGCATTACCGAGCAAAGCGGCGTGTATGCGGGCACGCGCCT 

GCGCCTGGCCGATCCGCTGTCGCTGGTGCTGGGCGGGCGCGTGAGCTGGTACAAGGACGA 

TGCCGGCTACTCGGTGGCGCGCGAATTCACTCCTTATGCGGGTGTCGTGTATGACCTCGA 

CAAGCAGCATTCGGTGTACGCCAGCTGGACCGAAATCTTCCAGCCGCAGGCTTCCACGGA 

CGCCAATGGACAGCCCTTGAAGCCGATCAGTGGCACCAACTATGAAGCGGGCGTGAAAGG 

CGAATACTTTGGCGGCGCCCTCAATGCCAGCGCAGCCGTGTTCCAGATCCGCCAGCAAAA 

CCGGGGCGTGGACGANCTGGCCGGCCCGAATCCTTGCCCCGG 

Sequence 1614 

ACTTCCTGACTTGTTGAGCGTGTCCTCACTGCTGGCCCTCTTGAGCCTGCTGAGTCGGGA 

CTCAAAAGCCAAGGAAGTTGAAGACTTAGAACTCTTCATGCCGGAAGAGGCTGCAGGCA 
Sequence 1615 

CCGGACGGTACCGGGCTCGGTTTGGCCATTACCCAGAACATCATCAGCCAGCACCAGGGT 

TTGATCGAATGTGAGAGCCATCCCGGCCACACCACCTTCTNGATCTTTCTGCCACTGGAA 

CAAGGAGCCCCATCGACATGAGCCGTAGTGAAACTGTCTGGATCGTCGATGACGACCGTT 

CTATCCGCTGGGTCCTGGAGAAAGCCTTGCAACAGGAAGGCATGACCACCCAGAGCTTCG 

ACAGCGCCGACGGGGTGATGAGCCGCCTGGCTCGCCAGCAGCCCGACGTGATCATCTNCG 

ACATCCGCATGCCCGGCGCCAGTGGCCTGGA 

Sequence 1616 

AACTACTGTTTCTATAATGGGATATTTTATTACATGAGGTCTNTGNGGGCATAGTTATTT 
CATTAANAAACACNCTTCATAAAATTGCATCCATGTATGGTAATCTAAAAAAGCCAATCT 
AGATGACAGAAGCTTCTGACTATAAATTATAAGTAATCTTATCCATACATNCNCAGAANC 
TAGTGATTTATTAAGTNCTCTACTGTACTTCCAMCACATNTAAGTTTTCCAAAAGATTT 

CAAAAACAATTCACTGGCAGCAAGCAAAAAGTCAGGGANAAACTGCTGANACAGNGGTGC 
TGGTAAGGGCTCCTGGTAG 
. Sequence 1617 

CCGGGCAGGTACTTGACTCTCCCTGCACTGCCTTGCTTTTGAAGGCATCTAAAATCTCTC 

ACTTTTCCCCATACCACATTTCACAGCTGGGTCATTCTGCTGGACTCAGCACAGGAGGAC 

CCTCTCTCANACTGGTTGGAMCTTTGGGCACATTTGAAAACTGTCAACACAATCTGTAA 

ACACATCAGTCATAATGATTCTTTCCTAGGAACTNGACACACCCAGATGAACTCTNTTGT 

TATGACAGGATCAGTACCANGAGAATTTCCTCTCCCGCGTACCTTGGACGCTTTTAGAAC 
TAN 

Sequence 1618 

GCCATGCTCTCCTCCTCTGCCAGTCTCCTCCACCACTCTCTAACCTGAGAGCCTGTGGAA 

CCTGCCCGTCTCCCCTCCTCCATCAGACACACCTGCCTAGGAAACAGGAAAGGACCTCGG 

AAGTCTTCTAAGGAGAGTCATGGCGTATTACCAGGAGCCTTCAGTGGAGACCTCCATCAT 

CAAGTTCAAAGACCAGGACTTTACCACCTTGCGGGATCACTGCCTGAGCATGGGCCGG 
Sequence 1619 

ACTGCCATGCAATGCAAGGCTGCACACCATGGGGGTTAAAAAAGAAACCTTCCCATGTTG 

CTCGGAAAGACCCGAGGCATGCTAAGATGTGCATGGGTCTCCTGAGCAGTTGTGGAACTT 

GCTGCCATGTGTCTATAAAGGGCAGGCAACATATCTTCAAAAACATTTATTTAAGTCTCT 

GGAAGAACACAATAATGATGTTTGCCGATAGGGAAAGGTAAATGGG 

Sequence 1620 

ACCTGACTTTGGGTAAAGTTGCCAGAATAAGATCATCAGGCTTGGCTTGGAAATTACAGA 
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Ci i n I CCCAC C ATTCTTTTGATAATATCAACGTAGGG ACTCCATCTACTTCCATG ATGT 
TAAACAGTTCTGGCTTTTTTTCCATCGTGGGAGCGTTTTTCTCAATCTTCGCCATTGGGA 
ATCTGGTATTTCAGGCAGCANAGGAMGCAGCTTGCTGCCTTTAGTCAAACTGCTTCCTG 
GAAACCCCCCCGCGTACCTGCCCGGGCGGGCCGCTCTAAAACTAAGTGGATCCCC 
Sequence 1621 

CGAGGTACTCAAGAAACCCAGGCCAGAGTTACTGGAGCCCTTCCTCACAATTGGCCTTGG 
CTCCCTCTC CACTTGTGTG TTTTCACATTG ACTTGATCTTC AGTGGAGTAACTCTGAGC A 
GTGGCTGTAGCTCAGCAGGTAGAAAAACTCTCTAACCCTTGAAGAGGGGAATACAGTAGC 
ACCATTGAAAAAAGAGGTTGTGTATAGAGTTTCATTCTTAMTTTGCTTTGCCAAG 
AAGCCTGTGAGAAGTGAAAGCAAGGTGGAACTGGGGTCCAAAGAGAGGGAGTAAACTAGA 
TGC CATATT MTTTGTATTTCTTGGAGMTATGACTGCAGTTTATATATMCTTGGTTTA 
AAC ATTTTTGG N ACCTCTTAAAG AAAATTTGG 
Sequence 1622 

CCGCGGTGGCGGCCGAGGTACTTTGGGAGGCTGAGGCAGGAAGATCCCCTGAGCCCAGGA 
GTTCGAGAACAGCCTGGACAACATMGGAGACCCCGTCTCTATAAAGTAATAATAATAAT 
AATAAMTAGTTGT TTGC CTTGCAGCCCAGGGTGAGTAGAAAAGTATGGCTGAATATAAA 
GCMCAAATATTTATTTTACTT^ 

TAAATACTACCTTTACTTAAGTCCTAACTTCACATTGTCTGT 
Sequence 1623 

TACTACTATAGGGCNAATTGGAANTGCACCTCCACNGCGGNNGCNNNNGCAGNTCCGCGG 

GGANAATAGAGTGGCTGACTTGCTGATTGAGGTCGGCAGCTCCCGAGAAGCAGGAGAAGT 

AGAGTTTTTGCAGACAGGCCCACGAAGTCAAGCATGCGGAGTGCAGCCAAGCCCTGGAAC 

CCAACCATCANAGCAGGGGGCCACGGCCCANACCGGGTGCGGCCTCTGCCTGCAGCCTCT 

TCCGGCATGAAGAGTTCTAAGTCTTCMCTTCCTTGGCTTTTGAGTCCCGACTCANCAGG 

CTCAAGAGGGCCANCATTGAGGACACNCTCAACAAGCCANNAAGTACCT 

Sequence 1624 

ACTGACTGGGTCCCAGGTTTGTCTCACTGCTGTCGGCTTGATGGAGGTCTCCTCACCACA 

GAACCCAAG TCTCT GTGGGCCCTCAGAGCCTCAGGTCCCTTTCTCTCTGCCTTCCAGCAT 

GTTGCCTTCCTTTTGGMGGTGATCCTTTCAMGATGTGTCATCTCTTCTTCCCAGGAGC 

TCCCCATGCCTTCTTTTCCCACTGCTCCCTTTCAACTCTCAGCGCCGCTGGGGGCAGGCC 

CCAGGTGCTGGCTCTGCCTGCCAGGCCGCACTCCCGCGT 

Sequence 1625 

TGTCGGTGTAGTGCTCCTCGATCCGCAGGTCGGACTCGTGGTACAGCAGGCCGTCGAGCA 

CATAGGTGGAATCGCGCACGCCGACATTCACCACGCGGGTGCTGAACGGCGCGTACTGGT 

CGGAGATATGGGTGTAGAACAGCCGTCCCGGCTCACTGCCGTACTTCGGGTTGACGTGTC 

CGGTGCTCTCGCCC 

Sequence 1626 

CCG N GGTACC AATTTATTATATGCTGCCTTC AGGTATTC ATTTATATATTTGTCTCTT AA 

ACTAATTGTAAAGTGTTCATTAGTACTGGCTTATAATCCACATGTNAGCAANNNAAAAGT 

NTATCTN N AAC ATTTACTTTAATATTTTAMTATTATAATTTCAAATGAGTTC C AATTTT 

CCCAACTTGCTCTTAACCAGTA I ! ! I M I I I AACC AGTTCTCTACTTCTTTAATGAAATT 

AAAGAAACTAAAATAATTTCACCACCTTTMTMCCAMTAAACATCAGGGAAATTATAT 

GTGAACATGTAATAACAAAATTAAAGCTACTGATCTGTNAAAGGACAGTANGAAAAATTN 

GTATTCAAAATGACANGGGGATATTTAAATACGACACTGCAGCAGCATAAACATGATAAA 

TCTCATTTTTGATATAAATGGGGAAT 

Sequence 1627 

GGCTGTCGCCCNGTTCAGTGTCACNGCAGGNAGTGTGCCGTCCTACNGACTCACNTTCGG 

NGCTGTCGAGANNCATNNCCMGCTTTAAGTNAANGTGNANTTTATANTTTTCCANTCCA 

NGCGGGGGNGGANNCAAAAGAAGCCTTCTCGGATCTTCCCCAAGGGCCANTCCCAAAGGA 
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ACCGTTTCCTNTTTTCCMCCTTGGATTC 

GGGGGGGNAAAAAAAAGGTCAATCCCCCCAGGTTCTTTGGGAAATTCCCCCTGGTCGGGC 

CCCAAGGTGGAAATTAAAGGAATAAGACCGGTCCCCAAGNAATTAAGGCCTCNAAGCCTT 

TCCANGGG TCCCCTTTGGAGGGGGGTTCTTTTCCCTTCAAAAAGGGGGCN7TTTNTCCCC 

TCCAACCAAAAGGGGGGGGGNTCCTTC7TCCAAAAAAAGANAAAAAGGGTGGGGGGGGCC 

CMCCMCCCTTTNGCCTTNGAACCNAAATNGGAAAAACCAGGGGGGTTCNCCTTNCCCC 

GGGCNATTTCAACAAAA 

Sequence 1628 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTCNNCTGTGGGCATTTGTCAAAATC 

AGTATTACTGCTGGGGGGGATGAGGGCTGAAAACTCCTATTTCACCATCT7TCTGACATC 

ArTTGCCACAGTGTGCTTTTTTTGTTCTGGTATTTCCTTATACCCTTGAGTCTNT^ 

ATTTTAGTCAACCCAGCAGNCTGTATTCATTCCTCTTAATCAGGATTTGAGCTTTGCACC 

TGCTCCTTTTGGCTGGGGTTGAGTCTACTTCANAGGCTAGGGATATAGAGGGTCCAAAAT 

ACMTTTAMTAAGAAAATTGCTACGMTAAMGTGTTGAGAAGAATAAGAAATTCTAGA 

AAAAAAGAAAAAGAAAATGGAMGAAAAGACTTTTAAAATGTTAAGGCATCTGGCAGAGA 

GACCAAATGGGAAC 

Sequence 1629 

AGGTGGCCGAGGTACGCGGGGGGCCAMTMGCACTATGTAAGTTTTTTTAATTTTCCCA 

MCACATGCACAGAATTMTATTGCCTTAMTTTTGTTAGCACAAGTTTGAGAGAGCTCT 

TTAGTTCATAAAAAAATGTTTTATGGACCAGGTGCCGTGGCTCACACCTGCAATCTCAAC 

ACCCTGGGAGGCCCAGGNGGGAGGATCGCCTGAGCCTAGGGAACTTGAGACCATCCTGGC 

CAACATGGTGAAGCCCCGNCTNTACTAAAAAATACAAAAAAATTATTCTGGGACATGCTG 

GCGTGCGCCTGTAGGTCTCAGCCTACCTAAGGGTAGACTGGAGGTGGGGGANAAATCAAC 

CCTGAANCCCTNGGGNAGGTTGGAANGTNTNGCAATTNGGGGTTTGAGAATCAATGGNTA 

CTTGGNNACCCTTGGCCCCGGGNCCGGGNTCCGGCTNCTAANAAACCTTAGGNGGGGATN 

CCCCCCCCCGGGGGCCTTGNCANGGGGAAATTTCCNATNATNNAAANGCCTTTATTTCNG 

TTTAACCCCGGANTGNNACCCCTTGGANGGGGG 

Sequence 1630 

CGCGGTGGCGGCCGANGTNCGCGGGGGTATGCGATCGTGGCAGGATTGCAGGACGGGCAG 

CACTCTTTCTGCAGAAGGTAAAAATCACCTTGCTGAGGAAGCTTTTCGTCTGCTGATTCT 

TCTTTGTGACATCGGGGMTMGGATTTATTTCTAACAGTCTTGGGGGCTCGTCTGGGAT 

CACCCAGTCTCCTTCGGGTCGGGGTCTCTGATTCCCGCCCCCCCCACCAAGGGGGAGGCC 

GCCCCACTGCACTTACTTGCGGTGGCNCTCAGNGGACTGNGGANTTCCGGNACTTACCTG 

NTTGTGGATGGAATAAACCCGGGACCCCTTC 

Sequence 1631 

AGGTACTCAAGAAACCCAGGCCAGAGTTACTGGAGCCCTTCCTCACAATTGGCCTTGGCT 

CCCTCTCCACTTGTGTGTTTTCACATTGACTTGATCTTCAGTGGAGTAACTCTGAGCAGT 

GGCTGTGGCTCANCAGGTAGAAAAACTCTCTAACCCTTGAAGAGGGGAATACAGTAGCAC 

CATTGAAAAAAGAGGTTGTGTATAGAGTTTCATTCTTAAATTTGCTTTTGCCM 

AANCCTGTGGAGGAGTGAAAGCAAGGTNGGAAACTTGGGGGTCCAAANGAGAGGGNAATT 
AAAACTAl^ 

CAGGTTTT TTATT ATTAATAAACCTTTTGGGGTNTTTAAAAACCA I I I M > 1 1 NGTAAAA 

CCCCTGGCTTTTANAAGAAAAAATTTTGGGGGTTCTTGGGGGTTGGGGTC 

TTCMNTTTTTGGGTAMGGAAANATAGGNATTTNNTGGGMTGTACCCAGGTTTTTNTG 
TTTTTT 

Sequence 1632 

GCAAATGCAACGTATCGCCCGAGAGATGAACCTGTCGGAAACCACCTTCGTGCTCAAGCC 
ACGTAACGGTGGTGATGCGCTGATCCGGATTTTCACCCCGGTCAACGAACTGCCCTTTGC 
CGGCCATCCGTTACTGGGTACGGCCATTGCCCTGGGCGCGCACACCGATAATCACCGGCT 
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GTACCTGGAAACCCAGATGGGCACCATCGCTTTTGAACTGGAGCGTCAGAACGGCAGCGT 

GATCGCCGCCAGCATGGACCAGCCCATACCGACCTGGACGGCCCTGGGCCGCGACGCCGA 

ATTGCTCAAGGCCTTGGGCATCAAGTGAATCGACCTTTCCCATCGAGATCTATCACAACG 

GCCCGCGCCATGTGTTTGTTGGTCTGCCGAGTATCGAGGCCTTGTCGGCCCTGCACCCAG 

ACCATCGTGCGCTGTCCAGCTTCCACGACATGGCCATCAACTGTTTTTG 

Sequence 1633 

GACAAAGTNTGCTTGAAACAACAACAGCTCTCATCGAATATTCAGAGAGTCCACTAGGNG 

CCAGGCAANGTCTGAAGCTCCTGAGGACAGAGCAGNGTACAAAACTACGTNTCCCGCTCT 

TGCAGGANACTCCTTTCTAAGCATAANAN AGAC NATAAAATTGGAGGCAGGAGTGGGTG G 

TGGGCNATGGGGTCCACTTTANATACAGNGATAATGGAAAAAACTTTGCTGAGATGGCAT 

TTGAACTGAAACTTGAATGANATAAGACAGCAANGCCTAAGGTC 

Sequence 1634 

CCGGGCAGGTACTCCAATCTGGGGCCAGAGCAAGACCCTATCTCAGTCAATCAATCAAGA 
TTTTGTGTAMTATAATCATTATTCCACM 

ATGAAAATCAGAATTAGGMGGCTGGTGGTCAGTCTTCACTTAAAAATAAGTTTATAAAG 

AGTCCATTATCTGTTAGAGATATGTATTGAAGCATTTTCCAATAAAGTGAATACTGGGAT 

TTGCTTTAAAATCATCTGAGGTAGGGGGAGCCTAGAATTAACTGTTAAAGCTAAGTGATG 

GGAGTTCACTAAATTATTCTATTTTTCTGTTTGAAATCCTCCATCATGAAATGTTTCA 

TTGTTTTTMGTAAGTTTCCCAMGATCTTTATNATTAATNTACTCTGTAGATAAAAT 

C AGGCTG NCCTTAAGCTGT 

Sequence 1635 

CGCGGTGGCGGCCGAGGTACCGCGGGAGAAATACAAAAMTCAGCTGGGACTACAGGCGC 

CCGCCACCMGCACAGCTMGTTTTTTATTTATAGTAGAGACGGGGTTTCACTGTGTTAG 

TCAGGATGGTCTCGATCTCCTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCCG 

GGATTACAGGCGTGAGCCACCGCGCCGGGCCTGATTTCAGTTTCTTCCAGCCCTTCCTAT 

TGTTAACATGGGGGTTGTGTTGAAGAATATAAAGTTACAAAGTCAAGGAAGTAGGAAACA 

TTTTTACAAGTATTATGTAGCCATCTTGGTGGGGCTGTGGTGAGGTAGGCTGCAAATGAT 

TCTCCTATTTCTTTCCCTGAGTTCAGAACATAGGAATTAGATTGATAGACATCAACATAC 

CCGCTTTATTGCTGACTCATGACAACTAATGGGAAGACATGGCTCAGATGTGCAGCCACA 

GTGAGCTTCTGAACATTTCTTCTCAGACTAAGCTCTTACACACAGTTGCAATTGAAAGAA 

AGAATTGCTTGACAT 

Sequence 1636 

AGGTACTCTGCGTTGAACCACTGCTTCCCGCGTACACATTACTTGCACAAACTGTTTTTG 

ATTTGTCCCTTACTGAAACTGTCCTTATGCTTAATCA I I I f I GTCTTTCTTTCATATCTA 

TTAG TTTC TCTCTAATTG CTTTG ATCTGTATTTTTAGCTACATTCACTGNTATTTTCCTA 

AGCCTTTCTTGTATTTTGAAGTGTCCATTTTGTTTCCMCTCTCTCTCGm 

TTATTTTTATCTTTAGT 1 1 T I ATATATTACGTTTTATCTTTAG 1 I I I I AGTTTTCTAATT 

TTTAAAATTAGTTCC AGC ATGTGGN NTATAT AATTTTATTTTTAAACTTATGCCC CC CAA 

TGATTACTTAAAATTATTCTTTTCTGAGGAATTTACANGGATTTTAANTATA^ 

TTTNTGGNTATTTATAAGATTGATNGGTCATTAAATTAAAGGNGGNTNATT7TTCATNGA 

AAAAAGTTAGAAAATTAAACTTTTTCTTTTC 

CTTTTTAAGGC 

Sequence 1637 

GCCATGCTCTCCTCCTCTGCCAGTCTCCTCCACCACTCTCTAACCTGAGAGCCTGTGGAA 
CCTGCCCGTCTCCCCTCCTCCATCAGACACACCTGCCTAGGAAACAGGAAAGGACCTCGG 
AAGTCTTCTAAGGAGAGTCATGGCGTATTACCAGGAGCCTTCAGTGGAGACCTCCATCAT 
CAAGTTCAAAGACCAGGACTTTACCACCTTGCGGGATCACTGCCTGAGCATGGGCCGGA 
Sequence 1638 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTTGCCTTTTGAATTCTTTAT 
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TGTmCATTTAAGAAATGTGGAAATCTGAAACAMCTATATTCTCMTCATGTTAGAAA 

AAGTATATGTAATATGTATTTTTTTAAAGTCCAGTGGCTAAGMCTTAGACTGGGTTTTG 

AAATAATMTTTCCTAACACAGTTTCTCAGTATGTTTCCTCTAGACAAGAGAAAACTCTG 

NAAmTGATTCTTAATTTACCCTGGTMTTTCATAAGAAAAATGATTMTTACTTTGAG 

CCATCAGTTATAC C AGATAATCATGGGTCTTTATTTAAACC NCAAAAAATAGAAAGAAAG 

CCGAGMGCTTAAGCCTTAMTNTTMGGCCAATTCCGGCCTTTAGGNAAAAATGGAATC 

NGGGGTTAAAAANATTTTCTTTTAGACACAAGNCAGGCTTAACAATCTTTTCTAT^ 

GGTGGGGGTNACCTANTTTAATTATTNTTTTTTAAGNTTCMCCTAAC 

AACCAAATNGGGAATTTTGGAAGGGGGGAATNTACCTANCTTTGGNAAAAA 

Sequence 1639 

CGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTTAGCGTGGTCGCGGCCGAGACCACT 

TCCTGTGTTTTGTTAAGTCATGAAAATTAGATTTCTGATTCATCTTTATTTAGCATTTCA 

CTATG TATG TATTTGTTCCTAATATTCAAATCACTTGCTACTTGAAATTG AATTTCTGAA 

GATMTACTACAATTTAAAAATATTTATTTTTGCCTATGTAAAATTAATAGGGAG 

ATCTCTACAAAAAAAATAAAAATAAAAAGTTAGCTGGGTGTAGTGGCACATGCCTTATAG 

TTCTGGCTACTTTGGAGGCTGTGGTAGGAGGATCACTTGAGCCTGGGCGGGTGAGGCTGG 

AATGAGCCATGACCATGCCACTACACCCTTGAGACCCTGTCTTAAAACNAAAACNAGGAA 

CCCAAACCCCTATTATGGTGCTACTCAGAAATACTGNTAACATTTTACCGGTTTCTTCNG 

GGCCTTAGCCTTTCTTTAATGCACACCNCMCTGNACTTACTTAAATACAGGGTGNGTTT 

TCCCTTATTTGGAAAANACTTGGGAACCNAGAAGNGGAATTTC 

Sequence 1640 

TATCTGCAGAATTCGCCCTTAGCGTGGTCGCGGCCCGAGGTACTTTTTAAACCTCAATGC 

CACAC TCATAATGAATAACCAGTCTACTATCTCTTATGAATTCCTGATAATTTTCTCTCA 

TTTTTACTCAAAAACTGCCAAATGAATCCAGGACCGTCTTTCCAAGACTTAAACTT^ 

TTATTTTGMTACTATGAGCTCTAAAATATCTAGATTTTGTCAGATAAAATCCATGGTGT 

TGAGAATCTTTGTGTTAATATTTGGAATTAATAAATGGAATGAAATCTGATATTAATATA 

ATAGAAAGACTAGAGTCTGGCAGAAGTTGGGCTGGATTNGTGCTTTTGGACCTTGGGNGA 

CTTTATGTAACTACTCCAAACCTTNGGTTTTNTTTCCCAAGGAAACTGGAATGACCCT 

Sequence 1641 

CAGTGTGATGGGATATCTGCAGAATTCGCCCTTTCGAGCGGCCCGCCCGGGCAGGTACCA 

AGAAAATGTGTCCAGGTCAAGCCCTAAGGCTTTCTGATAAACGCATTACCAAAGTTTACC 

GAAAATATAGTGCACTACCAGAGACAGTGGCTCACGCCTGTAATCCCAACAGTCTGGGAT 

TACGAAGAGGCTGAGGCAGGAGGATCACTTGAAGCCAGGAGTTCAAAACCAGCCTGGGCA 

ACACAGGGAGAACCCATCTTAACAAAAAATTTAAAAATTAGCTAGGCCAGGTAGTGCATG 

CCTGTAGTCCCAGTTACTTGGGAGGCCAAAGATGGGTCACTTTGAGCCCAAGGAGTTTGA 

GGCTACAGTCAGCTATGATCATGCCACTTCACTCCCATCTGGGCAACAGAGCAAGACCCC 

GTCTTCTTTAAAAAAAAAAAGTGCCAGCCGGGCCGTGGTGGCTCACACCTGTAAATCCCA 

GCACTTTGGGAGGGCCAAGGCGG 

Sequence 1642 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCACTGCCCTATGCCAGGACCACACACTAGC 
TCTCAGTTTCTACTCCCTACTTTAACMTCTCAAGATGTTAAAATGTCTCAAGGCAAAAT 
GGCTTGCTATTTAGCAAGTGTTACTAGAAGAGAAAGAAAAGCATGGGTCTGTAAGCTCTC 
CATTTCCTTGGTATATTGGCA'I I I I I I I ICTC 
Sequence 1643 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAGGGTTATGCATTAGGGACTGAGGGCGTCTA 
TGTTTATATAAAATAGCATTTCTATAGAAATTAAAAAACATACCCATCACTGTGATATGA 
GATGCTC TGTCTCCAT GTGATGCCGAAAATATGTAAGGAGCTTTTGGGCTAAAGCGTATT 
CAAATGAAl I I t I i M I AAGCTAAGGAATCTAATCATTGAAAGTATCTTTGTAAGTAATT 
CACCAAATATTTGG C C AGGAGAAAAATAAACTTTTAT^ 
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TTG7TATTAAACTTCTTGTATTTGCTTGMGACGAGGTCTTTAAAATCCTGCATAACTTT 
AAAATGTATTGTATTTCAGMTTATTTGTCAGAGAAAGATAACAGGGGATACGAATNTAA 
ATTAATATAGGGATGATMTTATGATCTAAAACCAGTAGGNMTCATGTTATTTAAA 
Sequence 1644 

GATATCTGCAGAATTCGCCCTTTCGAGCGGCCCGCCCGGGCAGGTACACACTTGAGAGGC 

ATCGCTGTCCTCCC7TCAGCTAGCCATATTCCGTCAATATATGAAGGTTTACCTATTAGT 

ATGATAMTATCAGATATGACTTTTTACTACCAAAATCAGAGCTTAATCATCAGCATTGT 

TTCTTGCTATTTGTTTGGAGAAATTGTTTCCTGACCCCAACTATAAGACTATAGGACCCA 

GCTTATGGMTGCTGCCTTATCTATCTTGGCTGAACTTTCCACACAAAGTGACAATAGTG 

CATGAAAMCAAGTTTMGTGGACAGGTTTACAGACTGTTTATCCCTGCTTTCTAAGGAA 

TA7TTTTC 

Sequence 1645 

CGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTTTCGAGCGGCCGCCCGGGCAGGTAC 

TT7TCTTTATGAATGTTATACCAGAACTTAGGAGGAAAAAATTTTTGAGCATA 

TTAGGAATTGGATATCTCCCTAAATTATTAAAGTTCATCTTCCATAAATTCTGTAAAACT 

GMTGTAGTATTTCCCCCTCTTCCCATGCAAGTAACTGATATCACTTTAGAAAACCTGAT 

ATGAACATTATTTGTTATTGTGCTTTTATGMGAATTCTGTCTMTCTTCTCATAAGAAG 

AAAGAATTAGAACCCAAAAATCTAATTATCAGATTTAGTAAGATGTAGGCAAGATCCACC 

TAI I I I I I I CATTTATGTCTTTCAAAAATCAATCACATTCTATTA 

Sequence 1646 

CGCCAGTGTGATGGGATATCTGCAGAATTCGCCCTTTCGAGCGGCCCGCCCGGGCAGGTA 

CCTACTAGATGCCAAGGTCACGAATTAGGGTAGGAAAGGAGCTAAGAAGAGCAAAAATGT 

TAAMGGMTCCAAATGATTTTATAGGAGTGGTGAGATGGTGGTAGGAAGAGTTGGGATG 

ACATCAAGGTGGAAGAAGTGTTTTGTTTTGGAGACAGGGACTTGCTCTTTTGCCCAGGCT 

GGAGTGCATATCACGGCTCACTGCACCTCGATCTCCTAGGTTCAAGCAGNNGTNCTGCCC 

CAGCCTCCCAAAGCAGGCCGGGACCACCGGATNTGGTGTCACCATAGCTNGCCTAATTTT 

TAA 

Sequence 1647 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTAGAGGAATAGTGCTGGGAGGTGCAGGATG 

GGGTCAGGAGTAGGGGCAMTGAGGGATGTCTTTTGAAAGTGAGAAGAAAGCAGAGATTC 

TTTTTCCMGAAAAGATTTTTTAAAATATCMGTAA^CATATGATGATGATATC 

ATAAACATTGTAATACATTTGCTACTCTACTCATAAATCCTTTATCAGACAAACTGATGG 

GATTTAATAATGACCGTCTTTTTGTATTTGTCTATGGTAGGCATTGTAGTTTCTACATAT 

ACTGACAATTCATTTATCTTTCTTGGTGTTCTTTTATTATTMCAGTTCTCAGAGTAAM 

CAAAAACMTTTTGAAAAAATTTCTTTATCTCATTCAATATCATGGCAAATGA 

CAAAAAATGGTCATAATTTACATTGAGAAACMTTACAGGTTAAAATCTAAAAAGTGGAA 

AAACTTGCA 

Sequence 1648 

CCCTTAGGCGTGGTCGCGGCCGAGGTACGCGGGAAGAGAAAATCACTTGGTTTATAGAAG 
GCCTTTAACATTTTGTGAGTATGTGTTTTAAACCAAAACCCATAGATAGGAAGAGCCGAC 
AGGGAGCTGGAAACTGCAATAAGCTTTCTTCTAACTAGCAAACGCAGTTGACAGACATCT 
TTTTATAGACGTATCTGTGATTGCACGCTGTnTTAGGCTTGATAAi I I I i ICAACAAGG 
AAAAACTTTTCTTCAATTTGAAAAAGCTC 

AGAAAGCCATCCTGCTCATTCTAAAGCTAAATTGGGCCACATTTCCCTTAAAGTATAGAC 

AACAGCAGCAGTGGMCACAAGTTAMTGTGTGTTTAGTGAAAGAAAAGTTTTTGAATAC 

TGCCGNCACAATAATTTTTCNAACCTTGGMTTCTTTTATTGAGAAM 

NTGGAGAGANGGGAAAGGGAAAAGA 

Sequence 1649 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAGACGGTCAGAAGGAAAGAAGGAGAGGGATT 
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GCCreCTGCCTCCCCGCGTGCACACACGAGAGTGGGTGCTCCCACCAGCTTTCAGGGGGC 

TTTCTTCACGAATGTGAGCACTGATTTTGGGAGATCTGCAGTGGAAAGTCAAGTCATGAA 

TATTTTTTATAAAGAGAGAAATGATGTMTTTTATCAGAGAAGATATTTCAGATGTATTT 

TTCCATTTTAAAAATTCATTGGCAGTGCTCATACMGAGMTTACTTGAGCTGAAAATGA 

CTCTGTCCAGnTCTTCCTAmCGTTMTGATTTTGCAGTCACTGAATTCTTTCTAAAA 

GTTGTATAACCCAGATAAAGTCAGGCCTCCTGGAAGCCAGCTTCAGCCCAGAGACATACG 

AAAAGAAGCACCAAATNTCACTGGAAAACCAATTTAAATTTATACTGAATAATCTCATTC 

ATAAACCACAGCACAACATTCTGAAGCCTGNCCCTTCCAAAATCTATAAGCTCTCTGCCA 

CTCTGATGACTTTTCATTCTGNCTCTTTTTTCTACCCCAAAAAATCTCTCAACTCTCTTC 

MTCTTTCTGCAAAAACTTTTNCCTTNCTGGCTTAANNCTAACTNTTGGGGTTTTCAAGT 

GCCCCCNCTGTATTNTATTTCTNTTGNCCCCTCMTTTMNCNTTTGCTTNTTTTCCCGA 

TTGTCMTTCCANMCCGGCCNCAM7TTTTTCCCACTTAACCTTCTTTAACCCCT 
Sequence 1650 • 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTCTCTGCCTTAGAACTACCATCTTrTGCACT 

^!Z CCAGAT ^ GGAmG1TA CTACATTCTAGGTAAAGGATATTGTTACTATCCTC 

AAGTTACACAGAAAACACTCAAGGATGTAAAATGAATATTTATCTCAAATTTGTTGACTG 

CTACTGCTATCTTTTTTGAAGAATTAAAAGATAAATTAAAATTTCTAAAAATATGCCATA 

TATCAATAATTTACAATAGCTTGATCAGCCAAAAAATCCACCTTGAGCTTAAAGCTAGAG 

TTTGATAGGGTGATCCTTACTCTCCTAATTTAAATATCACTGTATATTAAGTTTTACAAT 

ATACAGTGTATATTGTGTATATTGNGTATACMTATACAGTGGATATTCTTTTrCCAAAC 

ACACmATTAATAATTACAAATCACAGGGAAGTAACTGCACAACTTTCTACTAGGGTGT 

TTATTAAGTTACTGGAAAMGTTTAATTACAAAAAACGTATAGGTCCCTAGATTTTGAAT 

GGTAAACCACACATACCTTGTTAGCTGGCTAAAAGMTATATTTGGAATATATTTTNGGA 

TTTTGCAATGGTTGCCCCAAAACTAATGTTAGGGAAGTTACCCTAAAAAAATTAAA 
Sequence 1651 

CCCTTTCGAGCGGCCGCCCGGGCAGGNACTCACAGCTGCATCACACTGAATATCATCTCC 

CTGCACTGCCATTJ^TTTGAGCAGGGCGTTGCCTGTCAAGAACATTCATATCTTTTGAT 

AAACAGTGCATTTTTTCCCTGATAGTCTGTATGGGACAGCCTCCTTGTTTGCATCTGAAG 

GGACAGGGACTTTCAGGACCTCCTGCTTGTGGTCCACCAAANGGAATCACCATTCAAAAC 

TTTCTTCAGCAAGGGTGTTATGGGNCCTTTCTCANGTTTTGCCCTANCGATCCCATTTGA 

GGGCAAAAANGGTTTNCCNTTCTTNTNCAAANTACMCCCCCTTCTGCTTTTTANNCCAA 
GGGGGGGAAC 

Sequence 1652 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACACCACTATCGCTGGCTAATTTTTGTATnTT 

TCCAGAGACGAGGTCTCACTGTCTCACTGAGCTCAGGCTGGTCTCAAATTCCTGAGCTCA 

AGAGATTTACCAGTCTCAGCCTTCCAAAGTGCTAGGATTACAGGCATGAGACACTGTGCC 

TGGCCAGTAATTTTGTTTTATTATATTAAGGTGAGGTTTATACCACCTTCTTCTGATTAC 

AGAAGTAATACATGCTCATTATGTAACACTGAAAATGTAAGCAGTATAAAAAAGAAAATA 

AAAAAGAATATGAAAATCACTAGTGGTCCCATTGTCTACTGATACACTATGTGCTGCTTC 
CTAATCTTTAC 

Sequence 1653 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGATGGCAAGG7TACTCAAATATTTATCAAGCG 

TCTCAAAGAAGAAGACTGATATAACCTTGCATGnTGAATACAAAAATTAATATTTAGTT 

ACTGTCCACTCTCTTATTTCCAGGTAGAGGGACTCTCTGACCCTCACTCACTCCTCTCCT 

A ^I^ AG A CTGG ^ GGmAGAGAA GGAAATGAACATGCCTAGATCAACCCCTTCCTCTC 

CCTTrnTCTTGGGATCTGCACACTCCGCAGAGAGCACGAATTCTCCTCTCCACCCAGAT 

CCATAAATGTTCAGGATCTGCTCCCTGCTGGAAATGCCACAGTTGGGTGGGGGTATATTT 

?mIZ^ GGGGCTACTGGCAAGTCCACAG ^ ccccacaa ^aaaactactttc 

ANACTGCCATTTTCCTAACTCCTGGCTTGTCAATCTTGTGGGGGAGGGAAAAAGTGAAAG 
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TGCAACCTAATGAAGTCTTTTTAAAA 
Sequence 1654 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAAAGAGATTAAGGAAAAAATGCAGGCTGTGAA 

TGATGGGGCCCCTGTG7TCTTTCCCCACTTGGCTCTCACAGTCATGGCAAMGGCCTTTT 

ATNCTTCAGTTTAAAACTCAAAACTTTCTTTGGAMGCTTGACTTATTCTAAAGATATCG 

CAGGACTTAACCCAACAAACAGCCAGGCCGTTGCAGGCCCAATCGTGATGCCAGCGTGCT 

CCACCCTATTAAAAATAAGCAGATCAGCNTATNTCNGAGAAAANCCTTTAACATG 

Sequence 1655 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTGGAGGTAGGGAGCTCANGGATGGCAGCTCA 
NATCCG GAACAATTACAATTCAATACTTGGGCATCAGCAC TCTAAATCCCG AGGAGCTAG 
C C AGGAGTG AAGTGAG G AAAGAGCAAATCAA7TTAAACATTGCTAAATACC AAAG AC AAG 
CTAGCTATTTCTTACTTTGCATGAGGCTTGCCCACGTCCTTTCTTGTAAATTGTCTGGAC 
CATCTCTGGTCATTT 
Sequence 1656 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTTTAAAACAACCTCTCCAACATAATATCAAA 

AANGTAMGTATTCTCTTACCTTGCATATCTACATCAAAGAAAACCATAAGGAAAAGAAA 

CTACAGATGTGCACGCAGTTAGTCCAAGATGCTTAANAATGTCACATCATGTTNAACCTG 

AGTCAACCTAACAGTANGGCTGTAAAAACAACTCTACTTCCTGATTTGTTACTGCATTCA 

GCTAATGTATTGAGGGCCTATAATGACTCAGATCCTAAAAATATNAGGAATAANGAAGCA 

AAG AACCCTTNCCTAAGCTAG N N CAGGAGACCTATC AGTAAAGACATGACTGTAACACAG 

NATGGATNTNGTTNACTGGTNTTAGGTANCAACAANGGGTTTTNAAAGAAAGGNTTATCG 

GTTTCCANATAAAAATTTGATTTTTAAAT^ 

Sequence 1657 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTGCTACACGGCCGGGGGCCATTGAGACTG 

TCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCT 

GGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAG 

ACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGC 

GGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGT 

CCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGT 

CTCTTCTGTCCCGCTGGAACCCATGGTCCACTGAAGTTCCTTATGCTGCTTTCACTGAGC 

ATCCTATGAMTACACCANGTGAGAAATTCCTTGAAATTTGCAAGTTGTCTGGGTTCATG 

TCTAAGCTTGTTCCAGCTTATCCAGAAATGCCCACAAAGAATTCAACTGGATCTTGGAAG 
AGGAAAA 

Sequence 1658 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTTAAAGTCTAGAATGGTGGTAGGTGCCT 

ACATATATTATTTMTTCTTATAACMTGCCCAAAAGGGTCATTCCAGTTTCTTGGAAAG 

AAMCCAAGGTTTGGAGTAGTTAGATAAGTTACCCAAGGTCCGAAGCACAAATCCAGCCC 

ACTTCTGCCAGACTCTAGTCTTTCTATTATATTMTATCAGTATTTCATTCCATTTATTA 

A rrCC AMTATTMCACAAAGATTCTCMCACCATGGATTTTATCTGACAAAATCTAGAT 

ATTTTAGAGCTCATAGTATTCAAMTAAAATAAAGTTTMGTCTTGGAAAGACGGTCCTG 

GATTCATTTGGCAGTTTTTGGGTAAAAATGAGAGAAAATTATCAGGAATTCATAAGAGAT 

AGTAGACTGGTTATTCATTATGAGTGTGGCATTGAGGTTTAAAA 

Sequence 1659 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGTCGTCTTCAGATGGAGCTTGTGTTGT 

GAGCTCTGGAGAGGGGGTTGTCTTTCTACACTGCATCTCCCATCCTTCCTAACGAGTCAC 

GGAGCTGTCGACTCCGCCTTCTTGGCTTTAGTTAACAGGTTCTTCTTGTGTAGTCACATC 

AACGTCGGGTCACATGGGAATGTGGTAAAGCCTCATTACTGTAGAGTTCAGACATGATCA 

CTTAAAMGAGCTTTATTGGGCCGGGCCGCGGTGGCTTACTCCTATAATCCCAGCACTTT 

GGGGGGCCGAGGCAGGCAAGATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGNCTANAC 
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ATGGCAAAACCCTATCTCTTCTAAAAATACAAAAAATAAGCCAAGGGCGT 
Sequence 1660 

C CCTTAGCGTGGTCG CGGCCG AG GTACCTGTAAATC TC TCAGCTAAGTTAAG CTATAAAT 
TTGATTAGGTTGACTTGGGCAAATGCACCATAAAGTCCTTTCCAGAGAAGGAGTTCACTT 
TTCATAGGTTCTACTTGAAGGAATTGTCCAAGAAGGTTTACTGTATAAAGGAAATTGAAC 
A CA GAA GGMTGAAAAGTAAAMTTAAACCTAAAATTTGCAGM 

ATTTTTGTGACCTGGGGTTTGGCAAAGATTTCTTAAATATGACACCAGAAGCACAGTCTA 

TAATGMCCGTATTAGTTTATTGGGTTAATCAAAATGTAAAACTTCTGCTCTTCAAAANA 

CAGNATTAAGAGAATAAAMGAGAAACCACAGATTGATMTCTTTTGTAAGCCATGTNTC 

TGATCAAGGACATGTANAAATATATAAAGAACTCTGGAAAGCTCAATAGTAAGAAAAA 
Sequence 1661 

CCCTTAGCGTGGTCGCGGCCGAGGTACTATTTTTTTCAGTTGTAAAAAGTAAAAGAGCTA 

GGTTTTATGAACAGGATGGG AAGCC AG7TATATTCCAGTTATATTCGATTTCTGGTTTTA 
ATAA^ 

■ i I u > i CATCAT CTTATTAAATMCCTATTGCACACTGATTACATTTATTCTTACTATT 

TTCTTTAi Mill CCTGGATCATTTTCACAATTTTATTTTTCACAGCATTCTCAATACTT 

TTCTTCATGTTTCATTAATGTTCTGTATATAGTCCGMTTCTGTAGCAACCTCTTTAGAA 

GCTCTTTATTAATACCTAGCTGAAATATAAAAAATATGTAAGTGTAAAACTACTCGA 
Sequence 1662 

CCCTTCGAGCGGCCGCCCGGGCAGGTACTGGGGAAAAAAATCGTGGMTTTTGGAGACAA 

ACAAATCCACGTNTAAATCCTCACTTTGCTACTTACTTACTAATATTCTACCCAGGGCAG 

GTTACTTMTGTCTCTGAGACTGAATTTTTCTTCAITTAAAATGGATAAATNACTCTT^ 

TCTTAAMGCTTGTTTTGAGATAAATAACTTTNGATGGTATGTAGGTGTGTTTCAACAAG 

AGTAACAGTAGTTTTCATTTTTTAAGT 

Sequence 1663 

CCCTTTCGAGCGGCCCCCGGGCAGGTACGAAGTGTGTTTCAGAGTGGCGAGGAAGGGCAA 

GTTGTTMGATTGGTTGTTGAATTAGTTTCTGTTTGATGTTAAAGAGAACATAGAGTAAA 

TGATAATCCCTCGAAAGTGGAGATCTTGGCAGGCTGGCGCCTGGTGGTATAGTAGAAATC 

TGAGAAAGGGGGAGGATATTAAGTCAGTTTTATCAGGTAAAGTTGAATGAAATAATCAAG 

nTAAGTGCGTCTTGGGTATTTGCAAAGATGTATAGATTAAGGCTAAAAGGGTTGGAGAA 

ATAGATTTGGGAGTTACCTATGATTTTTTTTGGTTATTCTGCTCTCAGGATTGAAAACTA 

MGAATCTCAGAACTGCATTTCTAATTAGTGCCCATAAAATTCTTTATTGATGCCAAGTT 
TTTGGT7TTT 

Sequence 1664 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTAATACACCATCCTGGTGGATCTGGCGC 

AAQ GQAA CTCCACACAGAAAAACACTGCCCTCACGTTATGAAAGGATTCMCATTATTCT 

CTCTTTTGAAAGATAATTGGCTTCAMGAATMTTGAMGTTAGAAACTTG 

CAGAACCAGGTAATATCTCTACTACTCTTTTGGGAATTAGTCTTGAACTGCAGCCACTGT 

GAAAGCTTCTGGTGTTATTTAAATTAATTCT^ 

TTGTTTATCTCCTCAACTAGGGTTTGTGTTCCTGAGAGGCAAGAACTATGCTTATATATT 
TTTAAAAMTATTTTCAACATTATTTAGCACTCTTATTAAATATTTA 

AAACATACTGTAATTTTGGTTAAAAAGCACCTAAATTGCTTTAAAATCCA 
Sequence 1665 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTGCAAAGGCACTGAGGTGGGAGGGAGCATGCC 

AATGTAGGGAAATGAAGAAACCCAGTGTGTATGAGCCAAGCTGAATAAAACATGAGAAGA 

AGCTGGAGAATGAGAGAGACCAGTCCCCAAGCTCTCAAGGAGCAAGAGGAAGCCTTTTCG 

GCATTTGMGTGGAGGGATGGCATGATCTCGTGCCGTAGTTTTTAAAAGAACCACTCAGG 

CTACTCTTTTGAAGAATATTGTGGGGAGGCCAGAGTAGATGCCTATAGGACTTATACAGA 

AGGTTAAAGGCAGTTGATAAAGAGTATTAAACAATGA 
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Sequence 1666 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTGAGCCTAGAATATCTTGTGGGGTCAAAAGG 

TAAGGCAGTGCTCAAAAAACAACAGTAAAATGGCAAAAATACATAGAACCCAACTTGAAG 

GGCATCCTAATGTAAAATCTAGAAAAATCTGAGCACAAAATGTATTATAGTCATGGGTTA 

TAACCAATATAATGAGAATCCAAGAGTCCAGACTGAT^ 

ATATAAAAGAAAATATCTTCCTTATAGTAACAT^ 

TGGAAAGTANGAAAATCACTGGTTNGGCACACACTGGNGTNATMCTGGTTTTCAGGCAA 

GAAATTATCCCACNMTGCCTAAAATTANGTTTGGGTGGAAAAGTNTNGATTGAANGAAC 

CAAAGGATTCCTTTANCATTNAGGGNCCAAAANGCATTCTTCCTTGACCAAAGNATACCT 

TTNNTTTCATTTTCCCCCANGGAAAAAAAAA 

Sequence 1667 

CCCTTAGC GTGG TCGCGGCCGAGGTACGCGGGAAGAGAAAATCACTTGGTTTATAGAAGG 

CCTTTMCATTTTGTGAGTATGTGTTTTAAACCAAAACCCATAGATAGGAAGAGCCGACA 

GGGAGCTGGAAACTGCAATAAGCTTTCTTCTAACTAGCAAACGCAGTTGACAGACATCTT 

TTTAT AGAC GTATCTGTGATTGCACGCTGTTTTTAGGCTTGATAATTTTTT 

AAAACTTTTCTTCAATTTGAAAAAGCTGTTTAC^ 

GAAAGCCATCCTGCTCATTCTAAAGCTAAATTGGGCCACATTTCCCTTAAAGTATAGACA 
ACCAGCAGCAGTTGGAACACAGTTMGTTGTGTGTTTAGTGGAMGAMGGNTTTTTGAA 
TACTGCCGCCCCAAANMTTTTTCAAMCCTTTGMTTCNTTTATTGGAGAAAGGGACCA 
ATCAATTG ANAAGAAGGGGAAAAGGGG AAAAAN N AAAAATTAAATTTG AATTTGGC AAAA 

rrn ictttnaaccaacctngggatttncnngmgttnaamgaaatgaattttt 

Sequence 1668 

cccttagcgtggtcgcggccgaggactatgaattttaaaattatttgcttagacagaata 

tgtggacatacaaatgatgaatgtgataattgtcatacatatatatttatcttagacact 

tantcaaaacatgtggttittggttatgagttanacactgtcactttacctggaagatac 

aaagtmttgggtcatagacctctcamtnattggaacacttanattttt^ 

tgacatggaaggggagcccagggngtcttatgattaaggtaaaa 

Sequence 1669 

ccctttcgagcggccgcccgggcaggtactagaccctatccctgcaaggactgagctgta 

tttcaacaatcagagattagaataagatggcaccaatgcccagcagaagatggtcacaga 

tgagatgagcagcamtmcatttgtggactgactttatcagaactcaacagaaactggc 

cctttggcctgcaaaggtcaagatatcaggtgaaaaatatatgggtaatgagtaaatatc 

atatctaggcatc 

Sequence 1670 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTCTGGACTCCCCAGCTCAAGGTGACTCCGAT 

AATATAACGCATGTGTGGAATGAGGATGATGGACAGACCTTATCTCCAAGCAGTCTGGCT 

GCACAGCTCCTAATTCTGGAAAACTTTGAAGATGCCCTCTTAAATATATCAGCAAATAGT 

CCTTATATTCCTTACTTGGCATGTGTGAGAAATGTGACTGACAGTTTGGCCAGAGGTTCA 

CCAGAAAATCTAAGACTCCTGCAGTCCACAATACGATTTAAAAAATCTTTTCTTCACAAT 

GGTTCCTATGAAGATTACTTTCCTCCAGTTCCTGAAGTCCTAAAATCAAAACTGTCTCAA 

CTTCGAAACTTGACCGAACTTCTTTGTGAATCTGAAACTT 

Sequence 1671 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCMCAGCTTTTCAGCAGATGTTAGCATTCAC 

ATATGAGAAAGTTGCAGCAGTGTGCACATTGTGCTTCTGCCCTAGATCCTTATGTAAATA 

GGACGTCCGTGACTTGATGTTGAAATGACCTTTCAGGCAGTATTCTGTTGCCACATGCCT 

GAGCTTGTATCCCAGAGTAATTAGACGTTTTGACTTACACATCTTGGTATACAGTGTTTA 

TGCTAATTATAAATGTCTGAAAGTTTCCTGAAACTTTTTCTTTGTAGCTTT^ 

ACAGA CAGGAG GTTGGGAGGAGGGTTGTGGATTGTGGTCCTMGCCTTTGATTTTCAAGT 

CTCTCAl MM I GG G AATG AAAAATTAAAAATCTG CTTTTCC AGTTG CC C G AAN AC ATG G 
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:ATA 



GNGGGGATA t i I I I I ATTATTTCTTTCNATCTTNGAACAAAANAAAACTGNTTTGGGAAA 
TCACCA GCAAT TTTTAAATAACCTTTTTTTAAAAN TAAAATATTTTTTAATTTCTTTAAA 

GGGGCATTnTAGNCTTAAAAAAATTTTAAAGGANTTGGNGGAAGTNCCCCTTAAAAT 
Sequence 1672 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTTAGTGGC7TTAGTTTAAAAGATATGCTTCA 

TTACTTCATGC CAAATA GTCAATAAAAGGTTTGTAA7TCAACATCAAACTGGGTAAGTCC 

TTCTTAAAATTGT7TTTAAAAGCAAAAGATCCCAAAATACCAGTTGTCACCATTTTAAAA 

GGATATTTTAAAAGATTTCTTTTATGGNTATTCAAGTAAGAAAAATGAAAAAGNCAGAAA 
" TTTTACTTCTC 

Sequence 1673 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTGCTACACGGCCGGGGGCCATTGAGACTG 

CCATGGAAGACTTGAAAGGTCACGTAGCTGAGGCTTCTGGAGAGACCATTCAAGGCTTCT 

GGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAG 

ACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAG7TGTGTGCAGCTGCTGC 

GGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAG7TCACCTTCCCTGGGA 
Sequence 1674 

CCCTTNNGNGCGGCCGCCCGGGCAGGTACCTGGGTGATGGCCATACTGCGTGCCGCCA" 
AGCTCAAGCCATGTGCCNGAGGCTGTGCATTTAGGGAAGAGAAAGAATGTTCACTCCCAA 
AAGAACTGATTCAGGCNTGAACAAGAACCATTGCACATCCTCAGGAGGTTCTAGCAAACC 
TTGCACATCCATGTTCTTGCACTTATACATACATAAACAGGAGTGAGGAATGCCTTTCCC 
ANNAGCANCAGCAAGAAAGTTTCAACTNGGCAAACCGACCCANGGANGAATTTTTTCAGG 
CTTCATCCNTTTTACCAGNAAAAAATNGTAAACCTTCCCATTG 
Sequence 1675 

CCCTTCGAGCGGCCGCCCGGGCAGGTACTCAAGGCCACAAGACAGGCCGCCATAGAACTT 

TGGCCTTGGGGAATGAACCAAAGCTAGTTTGGAGAGTAGGGTGATTGGGGTGATCACTTT 

TCCCTTTAAAGGTGGCATGCTGCAAGAAGAAAGGCTTTGAAGAGTGATGGGTAGTCACTA 

ACCATGGCACTATNCTATNTTNTAACCAACGTGATGGGGTCCTGGAATGATCCTAGGTCT 

GCACTAGAAGAGGTNGACCCAGAGGACCTTGTCTAACATACCCCTTGCTTCTTGACTTCT 

TCCCATACCTNGCGTTTCTCTCCGGCCTCAATTACTGGNCTTGAGATGTTACACCCTTTN 

CAAGCATTAAAATGATGAACCCGCTTCATGGTGCCTGTAAAAGGCCCTNTTTGCAAAAGC 

ACCCTGCAGNNGGTATTGGTTTNGNG7TTTCCTTCNACTTTCACTTNTAAAAAAAAAATG 

AAANAGGNNTTNAAGGGGNAAAAAANTGAATTTTNGGGCCNTGGGACTTTTT^ 

GNTTTTTGAAAANACTT GCAATT TGGCCTTTTTAACTTTNGGGGGNTTTGTTNAGNCAAN 

CCCAAAAANMNGAAGCTTTTTNATTNNAAAAGGGGCNCCANAAAANAACAATAAAANCT 

TGNCCTNGGTNT NTTTTM CTNATTAAACCCMTTGMGGGNGTTTTNNGNAAAAACNAA 

AAMTGGCCAAANTTTTTAAAMNAAAMTNGCCNTTTTCCTTTGGGGGGG 
Sequence 1676 

CCCTTAGCGTGGTCGCGGCCGAGGTACTCCCCTCCTTCGCTACCAACTTCCTGGNCACAG 
CCACCATCATCTNCCCGCCAAGCTTCCTAACAGGACTCCCTGCTCCTTCTCTTACCCCAC 
CCGTGGTCTATTCTCAACACGGTAGCAAAAGAGAACTTCTTAAAACTTATAGCAGATCAA 
CACCTCCACCTCTCGCACCAGAACTGTGTGAAAGTTGATTATACAAAATGGGTCAGGGTG 

TANCGCAAGGTGTGAAAGTTCATGATGTGGAATGGGTCATTCTTTGCATACCCCAAACTG 
AATCACTCTGAGGGCCTGGAGG 
Sequence 1677 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTTGGGGACAAAATGTTTTTGTGGAGATAATG 
TAGACTTAAAATTTTAAATCCGTAGCTGTATTTTCGTCACAATGAAAATTTGATGTCCTT 

AGTMGCCCCGAATAGAAAGCCTTGACTTGCTTTTTGAGCATAGTTAACAGTGTTGAGAT 
AAGAGAGGTTTGCTGGCAGTTGCACT 
Sequence 1678 
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CCCTTAGCGTGGTCGCGGCCGAGGTACACAGCAGCAAACAGTTCAGATTGTTGATGCTTT 

TGATGACCA7TAATGCAAAAGGATTATGGATTATGGCACTGTATTATGAATAGACAAAAA 

GATTAGTGAAMGTTGTTTAGCCTAGTAGACATTrrGACGCTTTCTGCGTGAMTTGA^ 

GATAAACCAGAGAACCATTGAAAGTTCATGTATCAATTAATAGGTCAAGTAAAAATGTAT 

CTAAAATTCMCAAGAAGAGATTGTTTTCTATTT^ 

CCGAAATCAAATCAACAAAATAAATGGTCCCAAGGACTCTTATTATAAGTATACAAATGT 

TGTAACTCCCAGGAAATGAAAATNTTAATGGGTNTAAATGAGAAGAACTNTTAAAACAAA 

GCCATCCC AAACTACC AAATAAAGCCTAATTCTACTGACAGGTANCTCCACATAATAACC 

TAGAACCAATTTTTTTAAMTAAAATAAAAACTCAAGGCCTTTTAGCTGAC 

TNTTGAMTGGAAAGACCCCTTAGGAMGTAAAAAAAMTTNTACCTTAAMTATNAm 

TNCCAAAGCCATTATTAAGTAA 1 rTTTTTl I AAACCTTTATTGAATGNNGGGGAGCCCTG 

CTCCATTTGGTTTAAAGAATTTAATTAAAAAAGG 

TTAAAAAAATCC NTAAAATTTTTTAAATCC N TTTTTTT 

Sequence 1679 

CCCTTAGCGTGGTCGCGGCCGAGGTACTCCAGCTATCAAAGGAGAATAGCCTTTAAAACA 
CCAGGATCCTGGTCGAGATGGTAGAGGTGGTCfGTTTGAATTTGGGTGAATAGAGGAAAT 
GCCAGTTAAGGGATAGCCATTCTACAGACAAAMTGCAGCCGTCTATACTTTTACTCCGT 
GGTMTACATTATTTGTATTTCTTCTTTCTTM 

TG TCTT ATGTATTTGNCACCTACATAAAATATGCTCACTAAAACGCCACTGACTTTAAGG 
AATT TTAAGTAT GATTATATGTGGTCCTTGTAGAAAMCCATCTTTAMGTGTAAAAAAA 

gaagi nrm iaaaaagctaaattaggaaacaaaaaaagatctggaaaactctggaatg 

TATACCCTATAGAAATGG 
Sequence 1680 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTGCTACACGGCCGGGGGCCATTGAGACTG 

CCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCT 

GGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAG 

ACAAGACTCTCTTGATCTGCAAATACCACTTCATCATGCTGAGTTGTGTGCAGCTGCAGC 

GGATTCCTCTGAGCGCTGTCTATCNGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATG 

TCCC 

Sequence 1681 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCAGGATTATAGTTAGAATTTACGGACTGGGT 

AGGAAGACTGGATAGAAATCTAAAGATTGCTGATTCAAACACAATGTGGTTTCTTTGCTT 

TATTGTCACAGCTCTGAATTCACAATTATTAGTTAAATTCATAGGCACTATMCTTTAGA 

AAGCACCTTCCCAAACCMGTATTAAGTGATTTATTATAATTTCTCTGACTTCTTATAGA 

ATTGACTTTCCAAGTGTTCATGAGAATTATTGAGAAATTGCTACATAGTATCATCTCAGC 

TCTGTCCACATGAGCTATCTGTCACCTTGTCTTAATGAATAATTGTTCAGTAGGAATATT 

GGTTTTG 

Sequence 1682 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCAGNGCTTTAACTGAGTTGGGA i MIN I M A 

ATGTTATANCATTTAATAGGAAAAATCATTTGMGATCATTTATCTTTTGNATMCTGG^ 

ATTGAAMTGTCAACTAATTATACAGTGTTTTTCCCCATAAAACACAATTTAGTGCCAGG 

AGAAAAAAGAACTATTTCTTCAGATAACTTTATATGTGTCC \ M M I I \ CCATTAAAAAT 

TAAATAGCCCAAAGTATGTTCCTTTTGGCCTTAA 

Sequence 1683 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTAACATCAATAAGTCGAGAAAATTATATTAA 

CTGAAAGAAAACAAAATMTAGAGAATTTTATTAAACGTATTTCTMTGTT^ 

GTTTGGAGAAAAGCTGCCACATAATTAAMCMTTCTTACCCTGTAAAACTGATTGTCTT 

CCAATCTCAGGAGGTTTACATTAACAGGAATATAGAATAAGAAACAGGCCTATGGCCGGG 

CTCCGTGGCTCACGCCTGTAATCCCAACACTTTGGGATGCCGAGGCGGACGGATCACGAG 
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GTCAGGAAAT 
Sequence 1684 

TATATAAGMTGAACAATTMTCTGMTMTAAAMTAAGGCTTAmATA^ 

TTGAATAGTATATACATGCACATATATGCAAATATATATAATTCAGATATTTTATCTTTC 

TTAATGCTOCATGACTCTAAACTCTCACMTTTTAAATTAAAACATAAATATGACATAA 

Sequence 1685 

^?™ GAGCGGCCGCCCGGGCAGGTACATGATM11 ^CTAGTCTAGAGCTGGATGA 
G ^J GCTCCG ^ CTGCTACC ^ GT CAMCCCACCTACACCCCTC 

GGG f GA ?^ 

TGCACTTGGGGAGAGAGCCTTCCCAGGACAGTTTCATGGACTTCATGACGGCAGCCATGr 
CTCCCAGAGGCCACATCCTGCCCAGCTGCTGA ^ATGACGGCAGCCATGC 
Sequence 1686 

Ii?° A ^ ATCAATACTAAAAAGAAAAGCATG CCGAGAGCTGGCAGAAAGAAGGCTTCAGGC 
AGAAAGAACMCAGCAAAAGTCCTMGACTGGAAMGGTCTGAGTTGTTCA^ 
AAGGAAACCATTGTGGCTGGACCCGAGGTMGAAGCMTGAGGGAMGG^ 
CMGGAGGGCCCGAGGTCATGGGATGGAGTTAGGCTGT^^ 

TCCTAACTCATTGGGAGGTTTTCAGCAAGGGTAGGAACATGGT "AGAGCCTAACA 
Sequence 1687 

SSI!I CG . AGCGGCCGCCCGGGCAGGTACAGGCT1TGA TGTAGGACCGTCTACGTCATGA 
Sequence 1688 

S C £S G ^ GCCC ^ GG ^ 

GTCAAAATGTCATGTTCMTTTTACATTCAGTGCCTGGMTCTmCTTCACAATTGAAA 

TGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATACTA 

r™ GA ^^^ 

GGAAAGAAAGGTACACACAGTTAACCACAAAACAGGCCTNTNTTGAAAAAACCATTGCCA 

GACATGTGAAGCTTTCCCGCTGATGCTTCCTTCTTATATCAAAAGATCAC7TTTCACAAG 

T A JZT3]7 TCCAGCmG ^ AAA NCTNTTNT^ 

^rrI GG M^f^ TA ^ NAA ^ 

GGGGGACTNAAGG 2^ 

A jJ A j^NMNNGCTTTAAA^ 
Sequence 1689 

GCTGAATTTTACTGGCACAATTCCTGTGATGTATCANGGTAATACATATAACATACCAAT 
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TCCGTTTCTGGATTTTGGATTCTCACCCTTTCGCTCCCCCTATTTGCTTCTTGAAGCCAA 
CTGCAAATATGGGAATCTTACTCGGAAAACATGTGGATGCTCAAGGCAGAATATATTTTG 
CCCTATCTCCA 
Sequence 1690 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCAGAAGGCAGGTTGGGTCTAGAAAGCCCTGA 
CAGTCTAAGAAGGTCCTGAGTTCAGGATACTGAATAGTGTTACTTCACACTCTAGTCAAG 
TGATTCTGAGACCTTAGCATGCATCAGAGGGCTGGTAAAACAGATTGTTGTCTCCACCTC 
AAAGTATGTGATTCAGTAGATTTGGGGTGGMTTGGAATTGTC 

GGTGATGCTGATGTTGTTTGTGGTCCAGGGAGCTCACGTTGAAGACTGCTGCCCTGGGCA 

ATTG GG ATCCATGGAATGTTTGGAAGTAAAAGAATAAAATTATCATGACTGTG TCTAAAA 

AAGATGACATTTAGCAGANTGATGGATATATTGGGAGAGGAAAGGGCCTAAGGATGGTNA 

GGCCAGTTAGAACTTTGAGTANTCCATGTGTGANGTAATCANAGAATGAACCNTTGATTG 

GAGGTGGGAA 

Sequence 1691 

CCCTTTGAGCGGCCGCCCGGGCAGGTACGCGGGACATTCAGAGGTGAGCCCAGAGGGGGT 

AAAGTGGACTGGGGAGGACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCTGT 

ATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGAGCAACCCAGCTCTCAGAAGAAGGAAT 

GTCAAMTGTCATGTTCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTGAAA 

TGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATCTAC 

ATAAGAAGCAAAGACCACAGACTCAAAGACGGACATAATTGGA l MINI IG CCATGGCC 

TGGAAAG 

Sequence 1692 

CCC TTAGC GTGGTCGCGGCCGAGGGACTTNNNNNTNTTTNTNTNTTNTNNNNTAGCGGGT 
NNNI I I I ICTTNTTNANAACTTTTTNANGATNTTNATNTCCATNANCATNCNCACGCCCT 
NTATGGTCGGTGTNTANAGCCTNACATTNNCTAANACCAACCCAGCTCTCAGAAGAAGGA 
ATGTCAAAATGTCATGTNCAATTTTACATTCAGTGCCTGGAATCTTTTCTTCACAATTTG 
AAATGAAATGTGCTGAANGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATA 
CTACATAAGAAGCAAGACCACAGACTCAAGACNGACATAATTGGA I I H I M I GCCATGG 

CCTGNAMGAAAGGNACACACAGTTMCCACAAAACAGGGCCTNTCTGAAAAAGCCATTT 
GCCAT 

Sequence 1693 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTCCATCAAGTCGAGGGAGTAACGCAAACTA 
GAATAGTAGCCATCTATTAAGGATAGTTCTGAGGTTTGGMTTTGTMCTGATT7TTAGG 
AGCTACGACATGAATATCATATCAGGTAGAAATACTCAGAGTGATCACCTTTTATGAAAG 
TTAGTTCTATCTTrCATGTCTA^ 

TGTCTCCAAGCAGAGGAAACTGACAGTTTTATGTTTTAAGTGCCGAMGTTGTTTTT 

TAG GCAGTT CCAGAGTTMTCACTGGGATGTTAACTTGMTTTTTGTCTTTAAMTG 

TA GAt i ) l I AAAACGAAATATACTTCCATTATTTCCAAACTGACACCATGTGTTCTGGAT 

TANTTTTANAGAAATGTTTTCCAAATTC 

TATTTAGTTTAGTTTTTT 

Sequence 1694 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGTTTTCAGTGACCTTCAACATAAATTATAC 

CATTTGAACTCACAACCACTTCAGACCTAGGAAGGGCTGGGCCAATTATCCTCCCTCTAC 

AGATGAGGAAACTAAGGGCTGGAAGATGATATGTTTTACCTTTTGTAAGACAGTCAGAAT 

TAAATCTAGGTATTGTGACTCCATGTCTACAGGACTCTTCCTCAGCTGTTGAGAATGAAT 

ATCAGGATTTCTTTTCGTGAMGTAAAGTCCACTCTGCTTCTTTACCAAACTGTCTATAT 

CTTGCTAAATAAATCMGTCTCTAGGTMGAATAGGTGCTATAATTAATTTGGTGCTATT 

CATTAGATGCTATTTCGATTTCCACAGCCAGAGTCTCAGGACTCGAAAGATTTGATTTTT 

CATTTGGACMCATGmATGmCTTTTGAGACAAACAnATAAACCACCTTCTGTCAA 
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AGAATTCANGTTGCTGGAGAACTCTGTTCATAGCCGTTGTTGGGATGCAAGGGGGTGTCT 

GGGAAAAGGATGGGGGNGTTNCAGATCCCNCCACCAAAAGCNCAANGGGTCATTGGTGTT 

TCNCGCCCTTGGANGANGAANGGGANGCATGGGGGAAAAACACANTGGGCNGTGACAGTC 

ACAGCAAGCTGAGGCCCCCAGGGCAGNTTTCCAACCTCAAGGGCTTGAANANTNTTGGGG 

NGGGNTGGAATCTTTCAAGNCCCTAAGNGGTTGAAAAAA 

Sequence 1695 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTAGTAGTATATTTGAAAAGTGATATTTAATA 

AGAAGCCTGTAAAACCCTAAGCTTCTCTCCCTTATTATTTGAAAATTATTTTAAGACCTG 

TAGCCCCCACAT7TTGATGTTGTTAGAT7TATCAACATTGTCTGGGTAATGGTCCTGTTC 

TATTTAGGGNGGCCTTTCTGCTTATGGNCTGGGCTACCTCTTAATATACAGTATTAAAGA 

GTTTTGGTTAA GTNTGAA CTGCCCGGGTTGATTAGAGTGATTTCCAGGACAGATCGTGTT 

TTTGAACTTACAi mi I ATTTTCCAATTTTAATGGGANTTGAAAAAACATTTATTTTAA 

TAATGTTGTTTCTAAGGCCCANGGCACAANTGGGCTTCACCGCCNTGGTNAATCNGNAAC 

CACCTTTTNGGGNAGGGGCCCGCAGGNGGANCNAAGATTCAACCCNTTGAGGGGTCCTGG 

GAGGTTTCCGANGAACCCAAGCCCTTAAGCCCAMCCATTGGGTTGAAAACNCCCNNTTC 

CTTTTTCCTTAANAAAANTNCCCA 

Sequence 1696 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATCCTTCTGGGATTCTAAACCTCCCTTCTCC 

AAATATGGCCTTGTTGTANAGTATAGTCACCAGGAATGATTAACCAACATTNAATCTGAG 

CTAAGAGCAAANACAAACAAGTGACTGCCTAANACTGTATCAAGTAAGNGCACAGCACAC 

ACAAACACACATGANTTACTTGCTTGCCACTTTGGCCTTAGGAGTATAAAATNCCCAGAT 

TTGAGACAAGATNCATTTAAGNTTTNCACTTTCACACACAACACAATCCTGTTANAGAAA 

AGGAATGTGTNGTTCCTCCTTTTTGNCCTTCAAAACAAAAGGAATTATTTTAAAAAATGA 

AAACCCATTTCTACCCCGTTAACAGGGNCTGGGGGATTCATCCTNTGGGGACTTTGGTTT 

TTTCTTCCCCCNATGCCAAGGGATTACCTNNCTNGGGGGGGGAG 
Sequence 1697 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCMGCTTTGGCTTTTCTGAACTTTCCTTAT 

TTTCAAAAATGTCCCCCAGCCCCACTTCCACCTGAGACATTCACACACCCCATTTCCTCT 

TCCAGGMGGCTCTTATGTCGCCTGGGTAAACTTACTCTTCAAGTCTAGTGACCTTTTTC 

CAGAAGCTTTCCTGATATCTTTCCATTTCACCCCACTGCTGACTTATTAAAATTTCTAGA 

A ~'3[ A J AC1 ^ ACACTACA ^ C ^^ 

MTTmMGTTCCTTGAAAATAGAGACAATTTCATTGGTTTTCATCAGTTTTGGTCCAA 

GTNTATATAACATAGATGAAAAATAGATATTTTGGGATTAATATATATTATATAAAATA 
Sequence 1698 

CCCTTAGCGTGGTCGCGGCCGAGGACTGACTGCTACTGGTAGACCTAGGGTCAGCTTTGA 

GGACTGAGGTMCCACCACAGGAMTMGTTTTGAGGTCTGATTTTGAAACAATATTGGA 

AGACCATrCCTTTGTGAGATAGAAACTTCTCCATTTTAATTTTAGTATTTTMGCTTTTC 

CTACAGGTCAGTTGGGAATAATTTTTATTTAGGGACTCACAATCTTGAATTTTTAGCTAA 

ATGCCTTAAGAATAAAATATTATTTAAAAAGTATTAAAATGCTGTGATTCCAAACAGTTT 

TCTTGTTCAAGGATGAAGNAATNTAAAAAATATACCACCCTGTCTCGGCAACTNGGAAAA 

AAGCCCCATTTTAATTTTCCATTCCCAAAAATGGANGAGACTGGAAAAATNAAAGGGCTT 

ANGGACCCACAAATCCCACAACCCCTNCCTNCCAAAANCTNCTGGTCTTGGTCTTTTTGG 
AT 

Sequence 1699 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCACTTCCTAGATGTGAGATTGTTGGCAAGCA 

TTTCGTCTTTGTGAGCTTCAGTTTCCTTATCAATACAGTGCAGCTAATGCTACCCGGCTT 

ACAGGATCATGGTGAAGATTCAGAAGAGGTGACATAGAGCAAGTGTCCAGGGGNTGTGTC 

TGATGCCAAGGAGGCACATGGTAGGTCAGTTCTCTTCATCCCTTGCCTTGCCCCTAGAAA 

AGGCATTTGTTTTAAGTCGCTTGCTGCAATCCATCATCCCCCTTTTCCTCCACCTNTACA 
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GGNTNTNTCCCACTATTTCTCCCTAACCTTTTNCCCCTTGAANGANTTGGGGAAATTTGG 

GGTTTC 

Sequence 1 700 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTATCTTTTCGGGGTTTTCTTATTCCAGAAAC 

^^3^ZT^"^^ ^ ^ TGTAAAATG C AAA TG AATTTTGCACTTATT7TG TTTTA TTTN CTC TTG 

TGTTTTCA GAAA TGTTTTGATGCATCAGAAATTGAGAGATCAGTGATTTCAGTTCTCCTG 

TCACACCCTT7TATGATAAAATATTTTGGCCATTTCTAAGGAGTAGTTAGAGGGAAATCC 

TTTCATAAATCANTTTCACT 

Sequence 1701 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACACAGACACCTCAGGACCCCAGCAAATCCAA 

ACCTGATCTATCCCTTCACTACCAGATAGCCTCCTCTTCCTCTATCCACTTAAAGGCACA 

AGTGTATTGCCTATTTTCCCTACCAAMGTTGGGACCCCTTCTTAACTMTTCCACrTTC 

TCATC C TC CTAATCTACTTGTTGG AAGATAATTCTGGCATTTCTTCAG AAATGGGTAAAA 

GAGATTGATAGAATTGTTAAGGATGTTTCCAAAACTATTTTGGAAAGAATGTATGTATGT 

CTACATTTAAGGCTAATAAGCCGTGAGCCTTTCTACTCTAGACATTNCAGGGTCCTAAAA 

AATAATGTCTAAG AGTGACATCAGCAAAAATGGGTGGATTAG GG AACTGTCCAGGGC CAT 

TCCCTTTCCCCAAAGCCAAAA 

Sequence 1702 

CCCTTTCGAGCGGCTCGCCCNGGCAGGTACAGAAGAANGAAGGGGGCTGAGTCTTGCATA 

CTTCTACTGCCCCCATTTTACCTGGTCCCTAGGAGTAGTGTGAGACTGGGACAAGTGATC 

TGGGATTACACTGATTAGCATGCTTCACAGGAGGGAAATATAAGGAGAAAGGAAGAAGTA 

TTTCTCAGTAACATCCCGAGTTAGCCAGAAAGGCTCAGGAAATCCCTGTTTGGTGATGAG 

A TCAT TGTATCATTG TTTAGA GATTTCTAATTCATGATGCAATGTATTTTTAAATG 

GTTTT AGAAAGCTTGTTTTW 

CTTTTGTTTACAGGACCTACTCAATCTAAATCAGAAAMGMCAGGCTGAAGCTACAAAT 

AAGATAAAAAGAA 

Sequence 1703 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGAAGTGTGTTTCAGAGTGGCGAGGAAGGGCA 

AGTTGTTAAGATTGGTTGTTGAATTAGTTTCTGTTTGATGTTAAAGAGAACATAGAGTAA 

ATGATAATCCCTCGAAAGTGGAGATCTTGGCAGGCTGGCGCCTGGTGGTATAGTAGAAAT 

CTGAGAAAGGGGGAGGATATTAAGTCAGTTTTATCAGGTAAAGTTGAATGAAATAATCAA 

GTTTAAGTGCGTCTTGGGTATTTGCAAAGATGTATAGATTMGGCTAAAAGGGTTGGAGA 

MTAGATTTGGGAGTTACCTATGATTTTTTTGGTTATTCTGCTCTCANG 

AAGA ATCTC AGAACTGCATTTCTMTTAGTGCCATA 

TTGGTTTTTCCTTGNAAATTGNGGTANGTAGAATTCTAAATGAC 

Sequence 1704 

CCC1TAGCGTGGTCGCGGCCGAGGTACTCTCTTTTCCAACTTTCCATGGCCACCTTTGCC 

TGAAGATTCAGCTATAMTCCTTTTTCAAACTTGACAAGCACAGCCTTTCAAAACATAAC 

CCTAACTTATTTTACCAATCTTGTTGCTCACATMCTCTGTATTATGTGCTTTAGCTAGT 

AGACAATTTGATGCTTCCTTTTCAAATCTTTTGTTTATCCTGTTC 

TCCTCTCTCTAGCTATCCTGCCTCTTCTCTCTT^ 

TGAATATATAGATATTATTTATTAATTTATCTGTTTTCTTCAGCTTCATAACATT^ 
GTCATTTATAAACTTTAGMCTTTGTTAGAAGTTTAATTTCTCTTCTG Mill ATAGTGA 
GTAGAGAGAATTATTTTCACATTTATATTACTCTAAATCATGCTAGCCAAGGAATMTTT 
ATCTCATTAATTTA 
Sequence 1705 

CCCTTAGCGTGGTCGCGGCCGAGGTACACTGAGCCTAGAATATCTTGTGGGGTCAAAAGG 

TAAGGCAGTGCTCAAAAAACAACAGTAAAATGGCAAAAATACATAGAACCCAACTTGAAG 

GGCATCCTAATGTAAAATCTAGAAAAATCTGAGCACAAAATGTATTATAGTCATGGGTTA 
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TAACCAATATAATGAGAATCCAAGAGTCCAGACTGATT7TTAAAAAATTGCATTTT7TCA 
ATATAAAAGAAAATATCTTCCTTATAGTMCATTTTAATTGACAAATGTAGAAGTAATGA 
TGGMGTAGAAAATCACTGTTTGGCACACACTGTAGTMTMCTGTTTCAGACAAGAATT 
ATCC ACG AATGCTAAAATTAGTTTG GTG AAAGTATGATG AG AAACAAG ATAC TTAC ATAG 
G TCC AAAG C ATNTCC TG AC AAGATAC TTATTC ATTAC AC AGAAAAAAAATAG 
Sequence 1706 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACACACTAGCTGATAAGACACTGTTGCCCATAA 
TGTCTATTTATTGGATCAGCAATTTATAAGTCCCACATTCTC ATG CCACATAGC TCTACA 
CAGCTGCAAAAATATACCATAGCTTGCAGGTGATCATTGGTTTGATAAAAGATATTGAGT 
CGCTCATC TTGTG AAAGTG ATCTTTGATATAAGAGG AG CATCAGCGGGGAAGCTCACATG 
TCCCGTGGC TCACA CACCAGAAGGTATTTGTGTCTTGTCATTGTCTGTCTGGCAGTCCAT 

GGCMTGGCTT7TTCAGAGAGGCCTGTTTTGTGGTTAACTGTGTGTACCTCGGCCGCCGA 
CCACGCTAAGG 

Sequence 1707 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCMCAGCTTTTCAGCAGATGTTAGCATTCAC 

ATATGAGAAAGTTGCAGCAGTGTGCACATTGTGCTTCTGCCCTAGATCCTTATGTAAATA 

GGACGTCCGTGACTTGATGTTGAAATGACCTTTCAGGCAGTATTCTGTTGCCACATGCCT 

GAGCTTGTATCCCAGAGTAATTAGACGTTTTGACTTACACATCTTGGTATACAGTGTTTA 

TGCTMTTATAAATGTCTGAAAGTn"CCTGAAACTTTTTC 

ACAG ACAGG AGGTTGGGAGGAGGGTTGTGGATTGTGGTCCTAAGCCTTTGATTTTCAGTC 
TCTCA TTmGGAATGAAAATTAAA^ 

TTTTTATATTCTTCATCTTGAACMNAMCTGTTTGGMTCACAGCATTT^ 

TAAATAATATTTTATTCTTAAG GCTTTTAN CTAAAAA7TC AAG ATTG NG AG N CCC TAAAT 
AAAAATTA 

Sequence 1708 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCCTGTTGTCTTCATGCGTAGAAAGGATTGAG 

CCANCTGAG CAAT CTCCMGGTTATCTCAAAGACAATATTTCATTTTATTTTTGTCGTGA 

GAACCAGTGTTTTANACTCTGTATGACCTAAGACTGCTGGAAAGAAGGAAGATCTGGTTA 

AACTAACACAAAGAGCTCCTCAGAGTGTAGTTTTCTTACCTTTATTGCTTCCTTGAGTTC 

TTATATTGNAGNGGAGACTGAGTCCTCCATACTGCCTGGGCTCTGATTAAAATAAGACCT 

CTGATATCACATGCAATGAAGAATAGANAGTTCTAGTATGTAAAGCA 

Sequence 1709 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCACCAAGTCCATTTCCTGAGTTTAAATACCC 

ACCCCACCGAACTGGGGTTGTGACCTTCAGCAAGATTGTGTTCTATTTTAGTTTGTTTTA 

AACTTCTTCAAGTCTTCATTGTCTCATCTGTAAAGTGAGGATCACAATGGGATCCCACTT 

CTAAGGTAGGTATGAAAATTAAATGCAGTCCTGAATGCCACGTGCTTAGCTCAAGTCCCA 

GCTAAATGGAGGCTACAGAGGACATTGGGTGAAACCACACCTTTCTTATCTTGTTGATGG 

TCCCAACAACCTTGGGTGGGGTAGCCTCTTTTGTCTCCCTCATTGGGCAGTTACAGAAAG 

CTACACCTGAGAGGTCACACATCTAGGAAGTAGAAGAACCTGAACT 

Sequence 1710 

CCCTTAGCGTGGTCGCGGCCGAGGTACTATACTTGGCCCTCTATAATTCTGGGTGTAATG 

TCAGTTGGGTTCAGGCACATTACCTGCAGCAGCAACAAACACCTTCCTATCAAAGTCTCA 

AmCCCACTCTAAMTACCAATATGTCAGTGTTGMGTTAMTTAGGTTTCTAACCCAA 

TGCANGG AAAAA GAGAATTGATGGCANGTCCTTGATATTMTTAGTTGAGTAAATGTTGA 

AAATGACATTTTGAACAAAATATTGAAAAAGATAACATGCNCGGTCANATGTNCAGCAAG 

CCAAAAGMGAAAGGCTTTCCTGAAAAATGTAAAATTTCAMGMGAA 

Sequence 1711 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGACATTCANAGGTGAGCCCAGAGCGG 
GTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCCCT 
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GTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCTCAGAAGAAGGA 
ATGTCAAMTGTCATGTTCMTmACATTCAGTGCCTGGMTCTTTTCTTCACAATTGA 
AATGAAATGTGCTGAAGGAGGTGAATCCATGCATTAATCTTCAGCTCACAAAGGAAATAC 
TACATAAGAAGCAAGGACCACCAGACTCAAGACGGACATAATTGGAI ! I 1 I I I J IGCCAT 
GGCCTGGAMGAAAGGTACCCCACAGTTMCCCACAAMCMGGCCTTTTNTTGAAAAAA 
GCCATTGGCCATGGACTGGCAGACNAGACAANTNNCCAGGACNCCAAATTNCCCTTTCTG 
GGGGTGTTGAGTCACGGGACATGTTGAGCTTTNCCCGCTTGATGCTCNTNTTAATTTCAA 
AGATCACTTTTCCAAAGAATNGAGCNGACCTCA I rTTTTTTTT! I NTTCAAAACCCATGN 
NTTCCCCCTTGCNAANGCTTTTGGNTN I I J II H I GCAAGCNTTGGNTAAAANC I I II M 
TGGGCTTGMNAATTNNNGGGANCCTTTNTAAMTTNGTTGGNTCCCMTAAATTAAAAN 
CATTTTTGGGGCAANAAGNGNTTTTTATTAACTTTTNGGGGGGACTCCCCCCCCCC 
Sequence 1712 

CCCTTAGCGTGGTCGCGGCCGAGGTACTATrTTTTTCAGTTGTAAAAAGTAAAAGAGCTA 

GGTTTTATGAACAGGATGGGAAGCCAGTTATATTCCAGTTATATTCGATTTCTGGTTTTA 

ATAMCT AAAGAGAAAAGTGClTrrTAGGCAAAAATCCTGAGATTTCTAATAACAGACTG 

TTTTTTTCATCATCTTATTAMTAACCTATTGCACACTGATTACATTTATT 

TTCTTTATTTTTTCCTGGATCATTTTCACAATTTTATTT^ 

TTCTTCATGTTTCATTAATGTTCTGTATATAGTCCGAATTCTGTAGCAACCTCTTTAGGA 

AGCTCTTTATTAATACCTAGCTGAAATATAAAAAAATATTGTAAGTGTAAAACTACTCGA 

TTTTATNGGGGAGCCCCGTTTGGCTTTANGTGGGACCTTTTTAAAMCCTTCCACCGGNT 

NGGGGATANTTTCTNGNATTTTNGANGGAAAMGNNAAMC^ 

CTTTNTTANGAAAAAAAAAAGGCATTTTCCAAAAANANTM 

AAAAAAAMGNAATTTTGGCTTGGGAAAAAANANTTGNGGAANTTTNNCCCAAAAAANNT 

TTTAAGGGTTTT 

Sequence 1713 

GAAATAGTTCAAATCATGGCACATTGAATGTCCTCACTGGATTTTTAGGAATGTGTTCAC 

TGAGACAGCCAAATCCTATTTCATTTTCTTTGGCTCATTGCATTGGCTGTAAATTGGAGA 

TATTCACTTTAATATGTGAGTCAAMTTTATTTNCAAACATAATACTGCAGGNGTTCTGT 

CACAGAATATAAATTTTCTTATTTATTTCC 

CACTTCTTCTATTTTTATT^^ 

NTTTNGCCATTATTGCCCATTAANTCCTAACNNGATGGAANTACCTTTTTTT 
Sequence 1714 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCAGAAGGCAGGTTGGGTCTAGAAAGCCCTGA 

CAGTCTAAGAAGGTCCTGAGTTCAGGATACTGAATAGTGTTACTTCACACTCTAGTCAAG 

TGATTCTGAGACCTTAGCATGCATCAGAGGGCTGGTAAAACAGATTGTTGTCTCCACCTC 

AAAGTATGTGATTCAGTAGATTTGGGGTGGAATTGGAATTGTGGAi I I I I AAC ATTCTTA 

GGTGATGCTGATGTTGTTTGTGGTCCAGGGAGCTCACGTTGAAGACTGCTGCCCTGGGCA 

ATTGGGATCCATGGAATGTTTGGAAGTNAAAGAATAAAATTATCATGACTGTGTCTAAAA 

AAGATGACATTTAACAGAAGTGATGGATATATTGGAGAGGAAAGGGCCTAAGGATGGTGA 

GGCCAGTTAAATACTTTGAGTAGTCCATGTGTGANGTAATCAGAGAATGAGCCCTGATTG 

GAGGTGGGNAAGCCTGGGGCCTGTTGMTTCTTTCATTTCNTTTTAAAGGGCTTATNNTG 

AATTCATTTTATCAANCNAATNGGANACCAATGGGCCCTNCCTCTTAATTNCCTTTTGAN 

AA! II I I II r I ANGGNTTGGCC AAANTTTNAGGGAAAATAAACCAATTTC ANGGATGGGC 

CNCCI I I II ! I I I IGAGTTTTNNMGGTAAAAMCNAAMNANACCTT1TTTNANNNATA 

GGGTTTNGGCCCTAAATNTTNGGGGCNGGCCCCCCCCCCCC 

Sequence 1715 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCTTGCTACACGGCCGGGGGCCATTGAGACTG 

CCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCT 

GGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAG 
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ACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAAGCTGCAG 

CGGATTCCTCTGAGCCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGAT 

GTCCCTG GACAAG AGACAANGGAGAAAGCCTTAAGGATCTACTGGGGGGAGTCCCGGANG 

AACCANTCTTTTTTNTCCCGCTGGAACCCATGGNNCACTGGAGTTTCCCTTTANTGCTTA 

C mCNA CNTGGNGCATTCCCTNTGGAAAATTACAACCCAGNTGAGAAAATTCCTNTGAA 

AAATTTTNGAMGTTTNTNTGGGGNTTCATTGN 

CAAAAAATTNCCCCANCAAAANAATTTNAAACTNNGNATNCTTGGGAANGNAGGAAAANN 

AMCTTNNNTGGGNNGGTAMCNTGGAACCCCNTTTTTGGNTTGGANAACCTCCCCAAGG 

GGCTGGANGTCANTTNTNTTGGGAAANCCGCNAANAAAACTTGGGGNTTTTTCCCTTTGC 

Sequence 1716 

CCC7TAGCGTGGTCGCGGCCGAGGTACAAGCGTGCCCTGCAGGTGGCACACAAAGCTGAG 

ATATTGTCACCTTTGGGAGCACTGTATTACAACACTGGCCGATACGAAGAGGCTTTGCAG 

ATTTACCAGGAAGCTGCAGCACTTCAGCCTTCTCAGAGGGAGCTCGGCTTGGCACTGGCT 

CAGG7TTTGGCCGTGATGGGTCAGACAAAAGAAGCTGAAAAGATGACCAATCACATTTGT 

GTCAGAAGAGACCGGATGCCTTTGAATGCTATCCGCCTCTTGTCAAGCCCTTCTATAGCA 

AGCAAGGAGAACCACGACAAGGGCACCTTGNTGCTNNTAGACAANGCTTCTCCAGCTGAA 

ACCCAAANGGACCCAAAAAGTCATTTTCTGAAAC I II < IN ( NCAAAAAGGG 

Sequence .1717 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAGGCTTTGATGTAGGACCGTCTACGTCATGA 

GTGAAGAGCGGTAAAGTGCCAGGGTGTCTGGTGGACAGAGTGAATAAGAACATGGTATAA 

GATGAGGTTATGACTATATTGTGTAGAACCTTATAAGCCACT7TAATGATTTTGGGTTTT 

G1TCTAGACGGGAAACCATT 

Sequence 1718 

CCCTTAGCG TGGTC GCGGCCGAGGTACTGTCAGTGTGTCACACAGGTTAGTGAGGCTTGA 
AGCC CTTGCTTTTTACACTTACAGTTGGTATTCCTGCAGCACTTGGAAGTGATCATGCCT 
TTTTCTAGMGAGTTACCTACAGCAGAAGTGTTAAAATGGAAGAGATTTACTTCCACTAT 
TNCTGGGGGGATTCTTGTTACCTTGGCTTACCTGAGGACCTACAGTTCTACATTTACT 
Sequence 1719 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCTTCCTTCCAGAGGTTTCCCCATGCCCTCT 

TTTGGACTTGATGGGGGTCATTTGGGACAATAAGGCCTGATAACTCCTTGGACTTAGGAA 

GCGAGAGAGCAGGAAJCAAGAAAAGCTTTTGTGTTTTTTGGTTTGTGTAAGAAMTA 

TGGATTGAGATAAMT1TTTCAAAATAGGCCCAATGAAGAAGAGCAGATTCAAGGAGTAA 

AGGATTATT 

Sequence 1720 

GATATCTGCAGAAATTCGCCCTTAGCCGTGGTCGCGGCCCGAGGTACTTCTATACAGTGG 

GAATGCTACTCAGCAATGAAAAAGAAAAAGATGCAACAACCTGGATAGACCTCAAANGCT 

TATGTATAGTAAAAAGGTCAACCCTTAAAAAGGTATATATTATATGATTGCATTTATATA 

ACATTCTCAAAATAAAAAAAAACTATAGAGGATGAAGAATAGACTAGTGATTTCCANGGG 

CACAGGGACAGGGTAAGGAAAAGAATTGGTAGACCAATGTGAATGCCAAAGAAGGTCTTC 

TGGTGTTGGATGGGAACCAGTCTGTATCTTGGATTGGGGGTAGTGGGCTTCCTCAAAATC 

TATGTATGGGAATAMTTAAATTAAAAATTATACCTTATACCCNCCAAATNACTGCAGGT 

TTAAAAATGGTTTAAAAAAAAATGGTTGNAAAACTAAGTTAAAM 

CTMCCAATAAATGAACCAATGTCAACTTAACTGGTTTNGGGGTATTATACCAGCTAAAA 
TA AGAA TGTCATTCATTGGGGNCCAGGCCGTGGGTGGCTTCACACCCTATAATCTCAACC 
ACTTTTGGGGANGCCCA 
Sequence 1721 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCCCCAGAGATTATTCAATCATTGCTTATGA 
CTGTGGCAAACAATTTTGTTACCGACAAGAACTCTGGAGAAGTCATGACAGTAGGAATCA 
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ATGCTATAAAGGAGATAACAGCTCGATGTCCTCTGGCCATGACTGAAGAACTTCTCCAAG 

ACCTGGCTCAGTATAAAACACACAAGGATAAGAATGTAATGATGTCTGCTAGAACTrTGA 

TTCACCTCTTCCGAACACTGAATCCTCAGATGCTGCAGAAGAAATTCCGGG 
Sequence 1722 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCATAGATCACTGGTAGGGGAAACAAAAGCAA 

AAGCAAMCAAAACAAAACAATAGATCCTGATGACACAGGTCTATTTATACAAACGATTG 

MGCAAAMTCAATTGTAAC.TGTATCAAGTTTATGCAGGGAGAAATGACAATTCTATGTC 

ATGTGACTAGACMTATGGTGACAGATGGGTTTTGGAMGCTTCAAAATAA-nrGGTGTAT 

G7TTAAACAGCTCATAGTGCCCATTACACATACCGTATGGGCCCGCCAATTATTTCTTTC ' 
CAGTTTCTGTTGCCAAAATGCTTGAATTCAACCAACC • 
Sequence 1723 , " <- 

CCCTTrCGAGCGGCCCGCCCGGGCAGGTACAAMGCTGAGGGAAAAAGTTTCAGCTTCM 

GCATTAACGTTTTAGTTCATAAATCTGAAGGAAAATAMGAGAAAATAAAGGCATTAAGA 

GATATGAMCAATGTAAAAATGAATATTTCTTTTATGAATCCTTGTGAATATATGACAGT 

atcaaagctcagaaaactagtttactggggaggatgaggaaggtcaggagatctaagAcc 

ATCTGGC|MCACGGCGAAACCCTCTTCTCTACTAAAAAATACCAAAAATTAGCCAGGAG 



ccccggagtcaaaaggttgcngtgaGcccAaatcgcnccccttggacttcaacctgggcg 
acagaamnagactttggcttcaaaaaaaaaaaaaaaaaaaaggaaaggaaaaaaggaaa 

CCGGl I lAAG 
Sequence 1724 '* 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGTAAGGGGGAAGTTCCAAAGCTGTTAGTCAC 

CTTGTTTTCATGCTGATCACCCAACCAGATCTANTGTTTGTGTTCTAAGAACTTTAATGT 

TTTGGAGGAAATATCTJGTGGCCTTCAAAAAATCATTCTGTGAAATAGTTGTTTCTACCT 

ACATTCGTCTCATTAATTTTTCTACATACAGCAGAATTCTGCATATATTAGAGGTAACTC 

AGTCAGGGTGTCATGGAGGAAGGTGACCCATGGTTCACCATCTTGCCAATAGAAAAACCA 

ATAGGAAGTCATCTAACCATCATTCGGAGGGATTGAGGTCTGTCATAGGGAGAACAAACT 

AAAGAACTGGACTTTTGCTTTCAGTCAAGATGGAGTAACAGGGACTAAATTTACCCTNCC 
ACCTGAA ;*■ ■ ■. 

Sequence 1725 

^A^JI^^^^ T ?^^^^^^ A ^^ TA ^^^^--^^^^^^GTGGGTGAAGGGGCCACC 
CANCAAGGATGGCAATGTGTATGGGAGGAGAGCCGCGTTGCTTCTAGAGGCTGAATGCCT 
GTC^TGGAGMGGCTTNCATTTACTTCANAAAGCTACACTGACTCTACTAAAAGGCCC 
CTTTCCTGCTGGCrnT AGGAA AAGAGAGAATTCCCTCATCAGTCTNAAAATCACAAGCA 

ACAGAAATGGGAAATCGTTTTTCTAAAGAAATGAAATGTGTTGAATTCANTAATACTCTT ' 
TCATCTTCCCCTTANC f • . ; ■*■>'> — - 

Sequence ,1726' ' : f •' " ' ,. ' „• .\. , • . ' 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTGCTACACGGCCGGGGGCCATTGAGACTG 

CCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAGACCATTCAAGGCTTCT 

GGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAG 

ACAAGACTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGC 

GGATTCCTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGAfGT 

CCCTGGACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGT 

CTCTTCTGTCCCGCTGGAAACCCATGGGTCCACTGAAAGTTCCTTATGCTACTTTTCACT 

GAGCATTCTATGAAATACACCCAGTGGAGAAAAATTCCTTGA 1 ' ', . 

Sequence 1727 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGACACATTCAGAGGTGAGCCCAGAGC 

GGGTAAAGTGGACTGGGGAGAACTTCGGAGGATGTTCATGTCCAGGAGCAGCCCCACGCC 

CTGTATGGTCGGTGTCTAGAGCCTCACAGCAACTAAGACCAACCCAGCTCtCAGAAGAAG 
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